PredictBias

Identification of genomic and pathogenicity islands in prokaryotic genomes
 
A) Input parameters
Genome: 2237.gbk
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic):
 
 
C) Potential GIs and PAIs in NC_008321
S.No  Start  End  Bias  Virulence  Insertion elements  Prediction
1  Shewmr4_0060  Shewmr4_0073  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
Shewmr4_0060  2  15  0.519356  TrkH family potassium uptake protein
Shewmr4_0061  0  10  0.234671  TrkA domain-containing protein
Shewmr4_0062  0  12  0.185973  two component transcriptional regulator
Shewmr4_0063  -1  12  -0.473179  periplasmic sensor signal transduction histidine
Shewmr4_0064  -1  10  -0.241588  hypothetical protein
Shewmr4_0065  -1  16  0.085800  pirin domain-containing protein
Shewmr4_0066  -1  15  0.637775  signal transduction histidine kinase, LytS
Shewmr4_0067  -1  23  1.996821  response regulator receiver protein
Shewmr4_0068  0  27  3.208733  major facilitator transporter
Shewmr4_0069  1  26  3.621199  hypothetical protein
Shewmr4_0070  1  24  3.692950  hypothetical protein
Shewmr4_0071  0  27  4.653638  hypothetical protein
Shewmr4_0072  -1  26  4.437874  hypothetical protein
Shewmr4_0073  -1  21  4.029729  NLP/P60 protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_0062  TCRTETB  118  4e-31  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 118 bits (298), Expect = 4e-31
Identities = 76/400 (19%), Positives = 159/400 (39%), Gaps = 20/400 (5%)

Query: 30 FLAAVDQTLLATATPAIVEDLGGLR-QASWITIGYMLAMAASVPIYGWLGDNYGRAKILM 88
F + +++ +L + P I D +W+ +ML + +YG L D G ++L+
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 89 IALVVFALGSIVSA-SAGTMDHMIAGRILQGLGGGGLMSLSQSLVGELVPIRQRARFQGY 147
+++ GS++ +I R +QG G +L +V +P R + G
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 148 FAAMFTLASVGGPVIGGFVVHAYSWHWLFWANIPLV-MLAVWRLNRLHKQSVKPVRQGRF 206
++ + GP IGG + H W +L IP++ ++ V L +L K+ V+ +G F
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPMITIITVPFLMKLLKKEVR--IKGHF 199

Query: 207 DLLGVLLFPTIITALLYWLSVAGQDFAWLSATSLGFMGFICVGALVLLWWERRRESPFLP 266
D+ G++L I + + + F +S S + + R+ PF+
Sbjct: 200 DIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFL----------IFVKHIRKVTDPFVD 249

Query: 267 LDLLANKAIYMPLFTAALFAACLFAMIFFLPIYLQVGLHTNPAKTGLLLM-PMTFGIVTG 325
L N + + + + + +P ++ + A+ G +++ P T ++
Sbjct: 250 PGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIF 309

Query: 326 STIAGRLLSRDVAPKWLPTFGMGLAFIGLLLIGLVPPNANLIGALGV-LVGIGLGTVMPS 384
I G L+ R P ++ G+ + L + + + + V GL
Sbjct: 310 GYIGGILVDR-RGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTV 368

Query: 385 VQLVVQSVSGKARLSQITAMVSLSRSMGAAIGTALFSLLL 424
+ +V S + ++++ + + G A+ LL
Sbjct: 369 ISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLL 408
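
The per-hit statistics lines in this report (Score, Expect, Identities) follow the legacy BLAST text layout. The following is a minimal Python sketch, not part of PredictBias itself, of how those lines might be parsed and compared against the E-value cutoff of 0.05 listed under the input parameters. The function names and the returned dictionary keys are hypothetical; the only assumption about the data is that Expect values such as e-164 mean 1e-164.

```python
import re

# Assumed layout of the legacy BLAST statistics lines shown in this report.
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
IDENT_RE = re.compile(r"Identities\s*=\s*(\d+)/(\d+)")


def parse_expect(text):
    """Convert an Expect field to float.

    Legacy BLAST output prints very small values without a mantissa
    (for example 'e-164'), which float() rejects, so a leading '1' is added.
    """
    text = text.rstrip(",")
    if text.startswith("e"):
        text = "1" + text
    return float(text)


def parse_hit_stats(score_line, identities_line):
    """Return bit score, raw score, E-value and fractional identity for one HSP."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(identities_line)
    if s is None or i is None:
        raise ValueError("unrecognised statistics line")
    matches, length = int(i.group(1)), int(i.group(2))
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "evalue": parse_expect(s.group(3)),
        "identity": matches / length,
    }


# Statistics lines copied from the Shewmr4_0062 / TCRTETB hit above.
stats = parse_hit_stats(
    "Score = 118 bits (298), Expect = 4e-31",
    "Identities = 76/400 (19%), Positives = 159/400 (39%), Gaps = 20/400 (5%)",
)
# Compare the parsed E-value against the report's RPSBlast cutoff.
passes_cutoff = stats["evalue"] <= 0.05
print(stats, passes_cutoff)
```

Keeping the Expect conversion in its own helper means the mantissa-less form only has to be handled in one place when the same parser is reused for the other hits.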


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_0065  TCRTETA  52  2e-09  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 52.1 bits (125), Expect = 2e-09
Identities = 40/235 (17%), Positives = 82/235 (34%), Gaps = 23/235 (9%)

Query: 22 LMFFMFAMTSDAVGV-----IIPQLISEFGLSLSQASAFHYMPMIFI---AISGLFLGFL 73
L+ + + DAVG+ ++P L+ + S + + + ++ LG L
Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66

Query: 74 ADKIGRKLTILLGLLLFAIACFLFALGESFYYFLLLLALVGLAIGVFKTGALALIGDISR 133
+D+ GR+ +L+ L A+ + A + + + G+ A A I DI+
Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATG-AVAGAYIADIT- 124

Query: 134 STKQHSSTMNTVEGFFGVGAMVGPAIVSYLLISGVSWKYLYFGAGVFCLLLCWLAF---- 189
+ + + FG G + GP + + G S +F A L
Sbjct: 125 DGDERARHFGFMSACFGFGMVAGPVLGGLM--GGFSPHAPFFAAAALNGLNFLTGCFLLP 182

Query: 190 ------RADYPQVKRSSTETINLTNTFSMMKNPYALGFSL-AIGLYVATEVAIYV 237
R + + + +++ A+ F + +G A I+
Sbjct: 183 ESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFG 237


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_0066  ECOLIPORIN  37  2e-04  E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 37.2 bits (86), Expect = 2e-04
Identities = 59/276 (21%), Positives = 100/276 (36%), Gaps = 67/276 (24%)

Query: 411 DDTSVTLGYYNATQ-NINMT----WMWNSYLMEVKGDNAALLDVVAADGTAYSDAGLYGY 465
D T + +G+ TQ N +T W +N +G+ A +A G + D G + Y
Sbjct: 53 DQTYMRVGFKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDY 112

Query: 466 GVPYWGNCCQRNYDTEYNIKAPYLAVASSFGDLSLDASVRYDSGDASGY----------- 514
G RNY Y+++ + + FG S + Y +G A+G
Sbjct: 113 G---------RNYGVLYDVEG-WTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGL 162

Query: 515 -----YAGNVQSQVDMNLDGVISIPEQSVSSIDNANPQPVNYDWSYTSYSLGANYQFDSD 569
+A Q + + ++I + ++ D+ N D + + Y
Sbjct: 163 VDGLNFALQYQGKNESQSADDVNIGTNNRNNGDDIRYD--NGD----GFGISTTYDIGMG 216

Query: 570 LAAFGRLSHGGRANADRLLFGKVRADGSVAKEDAVDNVDQYELGVKYRYDDLSVFATAFY 629
+A + R N +++ G A G D D + G+KY D +++ Y
Sbjct: 217 FSAGAAYTTSDRTN-EQVNAGGTIAGG--------DKADAWTAGLKY--DANNIYLATMY 265

Query: 630 SET-------------------EEQNFEATSQRFFD 646
SET + QNFE T+Q FD
Sbjct: 266 SETRNMTPYGKTDKGYDGGVANKTQNFEVTAQYQFD 301


2  Shewmr4_0091  Shewmr4_0107  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
Shewmr4_0091  2  19  0.797403  hypothetical protein
Shewmr4_0092  2  17  1.373271  GntR family transcriptional regulator
Shewmr4_0093  1  18  2.938837  ABC transporter-like protein
Shewmr4_0094  -1  15  3.889827  hypothetical protein
Shewmr4_0095  -1  17  4.418754  AMP-dependent synthetase and ligase
Shewmr4_0096  -1  16  4.238464  MORN repeat-containing protein
Shewmr4_0097  -1  17  3.628782  hypothetical protein
Shewmr4_0098  -1  19  3.732500  thioesterase superfamily protein
Shewmr4_0099  0  21  3.277291  HPP family protein
Shewmr4_0100  1  19  3.034820  hypothetical protein
Shewmr4_0101  1  18  3.496941  hypothetical protein
Shewmr4_0102  1  18  4.298307  hypothetical protein
Shewmr4_0103  1  21  5.164483  thioesterase superfamily protein
Shewmr4_0104  -1  20  5.116453  MerR family transcriptional regulator
Shewmr4_0105  -1  21  5.201487  NAD(P)H dehydrogenase (quinone)
Shewmr4_0106  -1  18  4.317327  glutathione-dependent formaldehyde-activating,
Shewmr4_0107  -1  17  3.356505  MerR family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_0096  TOXICSSTOXIN  28  0.019  Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 28.5 bits (63), Expect = 0.019
Identities = 12/30 (40%), Positives = 16/30 (53%)

Query: 205 LRKLLKQTYDLPKSHFYTSSYWKIGCNEGE 234
+R L Q + L +S T YWKI N+G
Sbjct: 173 IRHQLTQIHGLYRSSDKTGGYWKITMNDGS 202


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_0097  UREASE  44  1e-06  Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 43.6 bits (103), Expect = 1e-06
Identities = 23/56 (41%), Positives = 31/56 (55%), Gaps = 8/56 (14%)

Query: 348 LAGLTLNAAKALGIEENVGSLVVGKQADFCLWDIATPAQLAYSYGVNPCKDVVKNG 403
+A T+N A A G+ +GSL VGK+AD LW+ PA +GV P V+ G
Sbjct: 406 IAKYTINPAIAHGLSHEIGSLEVGKRADLVLWN---PAF----FGVKP-DMVLLGG 453



Score = 31.6 bits (72), Expect = 0.007
Identities = 18/61 (29%), Positives = 29/61 (47%), Gaps = 6/61 (9%)

Query: 23 YGAITNAAIAVKDGKIAWLGPRSE---LPAFDVL---SIPVYRGKGGWITPGLIDAHTHL 76
+ I A I +KDG+IA +G P ++ V G+G +T G +D+H H
Sbjct: 80 HWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGKIVTAGGMDSHIHF 139

Query: 77 V 77
+
Sbjct: 140 I 140


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_0107  TCRTETOQM  50  2e-08  Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 50.2 bits (120), Expect = 2e-08
Identities = 32/101 (31%), Positives = 48/101 (47%), Gaps = 16/101 (15%)

Query: 8 HVDHGKSTLIRALT---------------GMNTDRLPEEKRRGMTIDLGYAFMPLRDGTR 52
HVD GK+TL +L TD E++RG+TI G + T+
Sbjct: 11 HVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSF-QWENTK 69

Query: 53 LAFIDVPGHEKFINNMLVGVSHVRHALLVLACDDGVMPQTR 93
+ ID PGH F+ + +S + A+L+++ DGV QTR
Sbjct: 70 VNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTR 110


3  Shewmr4_0180  Shewmr4_0224  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
Shewmr4_0180  2  23  -2.260611  type 12 methyltransferase
Shewmr4_0181  1  23  -2.143874  peptidase M14, carboxypeptidase A
Shewmr4_0182  3  28  -2.529727  hypothetical protein
Shewmr4_0183  6  39  -3.093874  hypothetical protein
Shewmr4_0184  8  57  -3.235643  sodium:dicarboxylate symporter
Shewmr4_0185  8  59  -2.694129  hypothetical protein
Shewmr4_0186  9  56  -3.405071  hypothetical protein
Shewmr4_0187  10  52  -2.117920  hypothetical protein
Shewmr4_0188  12  53  -1.112239  OmpA/MotB domain-containing protein
Shewmr4_0189  12  54  -0.906832  hypothetical protein
Shewmr4_0190  13  52  -0.785654  transporter AbgT
Shewmr4_0191  12  49  -0.872052  phosphoenolpyruvate carboxykinase
Shewmr4_0192  12  51  -0.807114  Hsp33-like chaperonin
Shewmr4_0193  12  53  -0.605282  RNA-binding S4 domain-containing protein
Shewmr4_0194  9  52  -1.318005  general secretion pathway protein C
Shewmr4_0195  9  52  -1.319229  general secretion pathway protein D
Shewmr4_0196  9  51  -1.307260  hypothetical protein
Shewmr4_0197  10  61  -0.708015  type II secretion system protein E (GspE)
Shewmr4_0198  8  53  -0.633532  general secretion pathway protein F
Shewmr4_0199  8  53  -0.370802  general secretion pathway protein G
Shewmr4_0200  8  51  -0.411597  general secretion pathway protein H
Shewmr4_0201  6  50  -0.844351  hypothetical protein
Shewmr4_0202  7  51  -0.630017  general secretion pathway protein I
Shewmr4_0203  8  48  -2.048166  hypothetical protein
Shewmr4_0204  8  46  -1.791170  general secretion pathway protein J
Shewmr4_0205  9  46  -2.756118  hypothetical protein
Shewmr4_0206  8  43  -3.444949  general secretion pathway protein K
Shewmr4_0207  8  46  -2.998666  hypothetical protein
Shewmr4_0208  5  45  -2.832583  general secretion pathway protein L
Shewmr4_0209  6  44  -1.793833  general secretion pathway M protein
Shewmr4_0210  6  44  -1.486705  type II secretion system protein N
Shewmr4_0211  5  43  -0.943871  hypothetical protein
Shewmr4_0212  7  45  -0.572401  HAD family hydrolase
Shewmr4_0213  7  43  -0.673513  hypothetical protein
Shewmr4_0214  5  41  -2.190986  ADP-ribose diphosphatase NudE
Shewmr4_0215  5  39  -2.687858  3'(2'),5'-bisphosphate nucleotidase
Shewmr4_0216  5  39  -2.986993  GCN5-related N-acetyltransferase
Shewmr4_0217  5  42  -2.985726  TetR family transcriptional regulator
Shewmr4_0218  5  42  -2.936354  phospholipid/glycerol acyltransferase
Shewmr4_0219  7  41  -2.476710  tRNA 2-selenouridine synthase
Shewmr4_0220  11  48  -0.933135  selenophosphate synthetase
Shewmr4_0221  8  36  -1.862575  Delta-9 acyl-phospholipid desaturase
Shewmr4_0222  6  34  -1.213959  hypothetical protein
Shewmr4_0223  6  30  -1.384753  DNA-binding transcriptional repressor FabR
Shewmr4_0224  3  24  -1.148187  tRNA (uracil-5-)-methyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_0185  TCRTETOQM  83  2e-19  Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 83.0 bits (205), Expect = 2e-19
Identities = 55/199 (27%), Positives = 92/199 (46%), Gaps = 5/199 (2%)

Query: 13 VNVGTIGHVDHGKTTLTAAI--SHVLAKTYGGEAKDFSQIDNAPEERERGITINTSHIEY 70
+N+G + HVD GKTTLT ++ + G K ++ DN ER+RGITI T +
Sbjct: 4 INIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSF 63

Query: 71 DTPTRHYAHVDCPGHADYVKNMITGAAQMDGAILVVASTDGPMPQTREHILLSRQVGVPF 130
+D PGH D++ + + +DGAIL++++ DG QTR R++G+P
Sbjct: 64 QWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIP- 122

Query: 131 IIVFMNKCDMVDDAELLELVEMEVRELLSEYDFPGDDLPVIQGSALKALEGEPEWEAKII 190
I F+NK D L V +++E LS + + + +W+ I
Sbjct: 123 TIFFINKIDQN--GIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIE 180

Query: 191 ELAEALDSYIPEPERDIDK 209
+ L+ Y+ + +
Sbjct: 181 GNDDLLEKYMSGKSLEALE 199


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_0186  SECETRNLCASE  118  8e-38  Bacterial translocase SecE signature.
		>SECETRNLCASE#Bacterial translocase SecE signature.

Length = 127

Score = 118 bits (297), Expect = 8e-38
Identities = 63/125 (50%), Positives = 90/125 (72%), Gaps = 2/125 (1%)

Query: 1 MTTNTENQ--TNSLDIVKWGLAILLLAAAVIGNQMYSETSAVIRALAVIVAFAIAGFIAL 58
M+ NTE Q L+ +KW + + LL A++GN +Y + +RALAV++ A AG +AL
Sbjct: 1 MSANTEAQGSGRGLEAMKWVVVVALLLVAIVGNYLYRDIMLPLRALAVVILIAAAGGVAL 60

Query: 59 QTEKGKKALAFARESQIEVRKVVWPTRQEALNTTFIVLAATGILALVLWGLDAVLMHIVN 118
T KGK +AFARE++ EVRKV+WPTRQE L+TT IV A T +++L+LWGLD +L+ +V+
Sbjct: 61 LTTKGKATVAFAREARTEVRKVIWPTRQETLHTTLIVAAVTAVMSLILWGLDGILVRLVS 120

Query: 119 FITGV 123
FITG+
Sbjct: 121 FITGL 125


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_0196  TCRTETOQM  602  0.0  Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 602 bits (1554), Expect = 0.0
Identities = 172/680 (25%), Positives = 295/680 (43%), Gaps = 71/680 (10%)

Query: 9 RYRNIGICAHVDAGKTTTTERVLFYTGLSHKIGEVHDGAATTDWMVQEQERGITITSAAV 68
+ NIG+ AHVDAGKTT TE +L+ +G ++G V G TD + E++RGITI +
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 69 TTFWRGMDAQFTEHRINIIDTPGHVDFTIEVERSLRVLDGAVVVFCGSSGVEPQSETVWR 128
+ W ++NIIDTPGH+DF EV RSL VLDGA+++ GV+ Q+ ++
Sbjct: 62 SFQWEN-------TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFH 114

Query: 129 QADKYRVPRLVFVNKMDRAGADFERVVKQIRTRLGATCVPIQLNIGAEENFTGVIDLIKM 188
K +P + F+NK+D+ G D V + I+ +L A V Q V M
Sbjct: 115 ALRKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQ----------KVELYPNM 164

Query: 189 KAINWNEADQGMTFTYEEIPASLAAKAAEMHEYLVEAAAEASDELMDKYLEEGTLSEDEI 248
N+ E++Q + E +D+L++KY+ +L E+
Sbjct: 165 CVTNFTESEQ------------------------WDTVIEGNDDLLEKYMSGKSLEALEL 200

Query: 249 KKALRQRTINNEIVLATCGSAFKNKGVQAVLDAVVEFLPAPVDVPPIKGIDDDEQEVERP 308
++ R N + GSA N G+ +++ + +
Sbjct: 201 EQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSSTH----------------- 243

Query: 309 SDDNAPFAALAFKIATDPFVGTLTFIRVYSGVLESGSGVYNSVKQKRERIGRIVQMHAND 368
+ FKI L +IR+YSGVL V S K+K +I + +
Sbjct: 244 -RGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEK-IKITEMYTSINGE 301

Query: 369 RTELKEVRAGDIAAAIG--LK-EVTTGDTLCDPDHKVILERMEFPEPVITIAVEPKSKAD 425
++ + +G+I LK GDT P ER+E P P++ VEP
Sbjct: 302 LCKIDKAYSGEIVILQNEFLKLNSVLGDTKLLPQR----ERIENPLPLLQTTVEPSKPQQ 357

Query: 426 QDKMGIALQKLAAEDPSFRVETDEESSQTLISGMGELHLDIIVDRMRREFGVECNVGKPQ 485
++ + AL +++ DP R D + + ++S +G++ +++ ++ ++ VE + +P
Sbjct: 358 REMLLDALLEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIKEPT 417

Query: 486 VAYRETIRASVEAEGKFVRQSGGRGQFGHVWLKLEPNEEGAGYEFINAIVGGVVPREFIP 545
V Y E R +AE + + + L + P G+G ++ +++ G + + F
Sbjct: 418 VIYME--RPLKKAEYTIHIEVPPNPFWASIGLSVSPLPLGSGMQYESSVSLGYLNQSFQN 475

Query: 546 AVDKGIQEQMKNGVLAGFPVLDVKVTLFDGSYHDVDSNEMAFKIAGSMGFKKGALEANPV 605
AV +GI+ + G L G+ V D K+ G Y+ S F++ + ++ +A
Sbjct: 476 AVMEGIRYGCEQG-LYGWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQVLKKAGTE 534

Query: 606 LLEPCMKVEVTTPENYMGDVVGDLNRRRGLIEGMDDGFGGIKIVHAVVPLSEMFGYATDL 665
LLEP + ++ P+ Y+ D + I I+ +P + Y +DL
Sbjct: 535 LLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLK-NNEVILSGEIPARCIQEYRSDL 593

Query: 666 RSATQGRASYSMEFLKYSDA 685
T GR+ E Y
Sbjct: 594 TFFTNGRSVCLTELKGYHVT 613


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_0197  TCRTETOQM  83  3e-19  Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 83.0 bits (205), Expect = 3e-19
Identities = 55/199 (27%), Positives = 91/199 (45%), Gaps = 5/199 (2%)

Query: 13 VNVGTIGHVDHGKTTLTAAI--SHVLAKTYGGEAKDFSQIDNAPEERERGITINTSHIEY 70
+N+G + HVD GKTTLT ++ + G K ++ DN ER+RGITI T +
Sbjct: 4 INIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSF 63

Query: 71 DTPTRHYAHVDCPGHADYVKNMITGAAQMDGAILVVASTDGPMPQTREHILLSRQVGVPF 130
+D PGH D++ + + +DGAIL++++ DG QTR R++G+P
Sbjct: 64 QWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIP- 122

Query: 131 IIVFMNKCDMVDDAELLELVEMEVRELLSEYDFPGDDLPVIQGSALKALEGEPEWEAKII 190
I F+NK D L V +++E LS + + + +W+ I
Sbjct: 123 TIFFINKIDQN--GIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIE 180

Query: 191 ELAAALDSYIPEPERDIDK 209
L+ Y+ + +
Sbjct: 181 GNDDLLEKYMSGKSLEALE 199


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_0210  PF06872  27  0.018  EspG protein
		>PF06872#EspG protein

Length = 398

Score = 26.6 bits (58), Expect = 0.018
Identities = 11/30 (36%), Positives = 17/30 (56%)

Query: 75 PVTGKADRVGFRFEDGKKVRFFKSNSELVK 104
P + RV +F DG +R +NSEL++
Sbjct: 87 PAHNELGRVYAKFSDGSSLRISVTNSELIE 116


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_0219  SECYTRNLCASE  464  e-164  Preprotein translocase SecY subunit signature.
		>SECYTRNLCASE#Preprotein translocase SecY subunit signature.

Length = 437

Score = 464 bits (1196), Expect = e-164
Identities = 180/428 (42%), Positives = 258/428 (60%), Gaps = 14/428 (3%)

Query: 16 SELKARLLFVIGAIIVFRAGSFVPIPGIDAAVLAELFAQQKGT--ILGMFNMFSGGALSR 73
+L+ +LLF + I+V+R G+ +PIPG+D + + + G + G+ NMFSGGAL +
Sbjct: 12 PDLRKKLLFTLAIIVVYRVGTHIPIPGVDYKNVQQCVREASGNQGLFGLVNMFSGGALLQ 71

Query: 74 ASIFALGIMPYISASIIMQLLTVVHPALAELKKEGESGRKKISQYTRWGTLVLGTFQSIG 133
+IFALGIMPYI+ASII+QLLTVV P L LKKEG++G KI+QYTR+ T+ L Q G
Sbjct: 72 ITIFALGIMPYITASIILQLLTVVIPRLEALKKEGQAGTAKITQYTRYLTVALAILQGTG 131

Query: 134 IATGLPN--------LVPGLVVNIGFGFYFVAVVSLVTGTMFLMWLGEQITERGIGNGIS 185
+ + + +V + V+ + GT +MWLGE IT+RGIGNG+S
Sbjct: 132 LVATARSAPLFGRCSVGGQIVPDQSIFTTITMVICMTAGTCVVMWLGELITDRGIGNGMS 191

Query: 186 ILIFAGIVAGLPSAIGQTAEQARQGDLNVLVLLLLAVIIFAVTYFVVFVERGQRRIVVNY 245
IL+F I A PSA+ +Q + ++AV + + VVFVE+ QRRI V Y
Sbjct: 192 ILMFISIAATFPSALWAIKKQGTLAGGWIEFGTVIAVGLI-MVALVVFVEQAQRRIPVQY 250

Query: 246 AKRQQGRKVFAAQSTHLPLKINMAGVIPPIFASSIILFPGTLAQWFGQNESMSWLSDFSL 305
AKR GR+ + ST++PLK+N AGVIP IFASS++ P +AQ+ G N + +L
Sbjct: 251 AKRMIGRRSYGGTSTYIPLKVNQAGVIPVIFASSLLYIPALVAQFAGGNSGWKSWVEQNL 310

Query: 306 AVSPGQPLYSLLYAAAIIFFCFFYTALVFNPRETADNLKKSGAFIPGIRPGEQTSRYIDK 365
P+Y + Y I+FF FFY A+ FNP E ADN+KK G FIPGIR G T+ Y+
Sbjct: 311 T-KGDHPIYIVTYFLLIVFFAFFYVAISFNPEEVADNMKKYGGFIPGIRAGRPTAEYLSY 369

Query: 366 VMTRLTLAGALYITFICLIPEFMLIAWKV--QFYFGGTSLLIMVVVIMDFMAQVQTHMMS 423
V+ R+T G+LY+ I L+P L+ + F FGGTS+LI+V V ++ + Q+++ +
Sbjct: 370 VLNRITWPGSLYLGLIALVPTMALVGFGASQNFPFGGTSILIIVGVGLETVKQIESQLQQ 429

Query: 424 HQYESVMK 431
YE ++
Sbjct: 430 RNYEGFLR 437


4  Shewmr4_0319  Shewmr4_0344  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
Shewmr4_0319  2  20  1.817293  iron-containing alcohol dehydrogenase
Shewmr4_0320  2  17  2.269953  TetR family transcriptional regulator
Shewmr4_0321  0  12  1.566358  hypothetical protein
Shewmr4_0322  1  14  1.188375  hypothetical protein
Shewmr4_0323  1  14  0.857279  hypothetical protein
Shewmr4_0324  1  13  0.406298  hypothetical protein
Shewmr4_0325  0  15  -1.083166  ribokinase-like domain-containing protein
Shewmr4_0326  2  21  -5.638934  diguanylate cyclase
Shewmr4_0327  3  26  -8.277501  hypothetical protein
Shewmr4_0328  2  25  -8.742719  MOSC domain-containing protein
Shewmr4_0329  2  22  -8.932660  hypothetical protein
Shewmr4_0330  0  20  -4.674700  methyl-accepting chemotaxis sensory transducer
Shewmr4_0331  0  21  -2.890999  hypothetical protein
Shewmr4_0332  0  21  1.389430  electron-transferring-flavoprotein
Shewmr4_0333  1  21  3.412753  molybdenum cofactor biosynthesis protein A
Shewmr4_0334  0  21  3.587889  molybdenum cofactor biosynthesis protein MoaC
Shewmr4_0335  0  20  4.651254  molybdopterin synthase subunit MoaD
Shewmr4_0336  1  20  4.002662  molybdopterin synthase subunit MoaE
Shewmr4_0337  1  19  3.929115  molybdenum ABC transporter periplasmic
Shewmr4_0338  2  20  3.681336  hypothetical protein
Shewmr4_0339  1  17  3.437040  molybdate ABC transporter inner membrane
Shewmr4_0340  0  18  3.870571  hypothetical protein
Shewmr4_0341  1  19  3.764020  ABC transporter-like protein
Shewmr4_0342  0  22  3.811294  hypothetical protein
Shewmr4_0343  -2  21  3.705080  hypothetical protein
Shewmr4_0344  -2  21  3.738206  two component transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_0328  SACTRNSFRASE  35  6e-05  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 34.5 bits (79), Expect = 6e-05
Identities = 18/65 (27%), Positives = 28/65 (43%), Gaps = 2/65 (3%)

Query: 69 EHAEIKSMRTAATYKQQGIASKVLQHLINDAKAAGVQRLSLETGSMAFFQPARRLYCKFG 128
+A I+ + A Y+++G+ + +L I AK L LET + A Y K
Sbjct: 88 GYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINI--SACHFYAKHH 145

Query: 129 FEICG 133
F I
Sbjct: 146 FIIGA 150


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_0336  DHBDHDRGNASE  105  1e-29  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 105 bits (263), Expect = 1e-29
Identities = 70/248 (28%), Positives = 113/248 (45%), Gaps = 15/248 (6%)

Query: 5 VLVTGSSRGIGKAITLKLAAAGYDIALHYHSNQAAADASAAELSALGVNVSLLKFDVADR 64
+TG+++GIG+A+ LA+ G IA N + + L A + DV D
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIA-AVDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69

Query: 65 AAVKAAIEADIEANGAYYGVILNAGINRDNAFPAMSETEWDSVIHTNLDGFYNVIHPCVM 124
AA+ G ++ AG+ R ++S+ EW++ N G +N V
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS-VS 128

Query: 125 PMVQGRKGGRIITLASVSGIAGNRGQVNYSASKAGLIGATKALSLELAKRKITVNCIAPG 184
+ R+ G I+T+ S Y++SKA + TK L LELA+ I N ++PG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 185 LIETDM----------VADIPKDMVEQL---VPMRRMGKPNEIAALAAFLMSDDAAYITR 231
ETDM + K +E +P++++ KP++IA FL+S A +IT
Sbjct: 189 STETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITM 248

Query: 232 QVISVNGG 239
+ V+GG
Sbjct: 249 HNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_0339  BONTOXILYSIN  31  0.004  Bontoxilysin signature.
		>BONTOXILYSIN#Bontoxilysin signature.

Length = 1196

Score = 31.0 bits (70), Expect = 0.004
Identities = 16/61 (26%), Positives = 29/61 (47%), Gaps = 11/61 (18%)

Query: 137 VYDGNTLSSEQSLLLGDEFKAEYLMAMMQLIYWPEQSIKSHLEGGELVTGLCDAIPCRQF 196
+YD N LS D + +L A++ L+ + I + + G +L++ + AIP
Sbjct: 63 IYDSNFLSQ-------DSERENFLQAIIILL----KRINNTISGKQLLSLISTAIPFPYG 111

Query: 197 Y 197
Y
Sbjct: 112 Y 112


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_0341  ACRIFLAVINRP  40  5e-05  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 39.8 bits (93), Expect = 5e-05
Identities = 37/187 (19%), Positives = 69/187 (36%), Gaps = 26/187 (13%)

Query: 656 EDIAALKARFAEDPQVQLIDKVADISTVMGHYRLLTLKLLGLALVIALLLFSLSFGVKRA 715
+A L+ F + + + D + + +K L A+++ L+ L RA
Sbjct: 308 AKLAELQPFFPQGMK---VLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRA 364

Query: 716 ALVV--AVPALAALLTLAILGLVGSPLSLFHALALILVFGIGIDYS-------------- 759
L+ AVP + L T AIL G ++ ++L G+ +D +
Sbjct: 365 TLIPTIAVP-VVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMED 423

Query: 760 -LFFVSAEQHG-KAVMMAVFMSACSTLLAFGLLAFSQTQA---IHYFGLTLSLGIGFTFV 814
L A + + A+ A F +AF F +T+ + + +
Sbjct: 424 KLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVL 483

Query: 815 LSPLILT 821
++ LILT
Sbjct: 484 VA-LILT 489



Score = 34.8 bits (80), Expect = 0.002
Identities = 23/117 (19%), Positives = 44/117 (37%), Gaps = 17/117 (14%)

Query: 694 LLGLA-LVIALLLFSLSFGVKRAALVVAVPALAALLTLAILGLVGSPLSLFHALALILVF 752
L+ ++ +V+ L L +L V+ V L + L L ++ + L+
Sbjct: 875 LVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTI 934

Query: 753 GIGIDYSLFFVS-----AEQHGKAVMMAV-----------FMSACSTLLAFGLLAFS 793
G+ ++ V E+ GK V+ A M++ + +L LA S
Sbjct: 935 GLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAIS 991


5  Shewmr4_0360  Shewmr4_0368  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
Shewmr4_0360  -1  14  3.854153  TetR family transcriptional regulator
Shewmr4_0361  0  13  3.571321  anaerobic C4-dicarboxylate transporter
Shewmr4_0362  1  12  4.173866  hypothetical protein
Shewmr4_0363  1  13  3.818703  aminotransferase, class V
Shewmr4_0364  0  12  3.624461  hypothetical protein
Shewmr4_0365  0  13  1.682328  hypothetical protein
Shewmr4_0366  0  15  -0.045118  sodium:dicarboxylate symporter
Shewmr4_0367  1  18  -0.468275  TetR family transcriptional regulator
Shewmr4_0368  2  18  -0.292421  flavocytochrome c
6  Shewmr4_0379  Shewmr4_0384  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
Shewmr4_0379  8  25  1.682391  hypothetical protein
Shewmr4_0380  8  25  1.706339  glutamine synthetase
Shewmr4_0381  8  25  1.758357  GTP-binding protein TypA
Shewmr4_0382  8  25  1.957121  AraC family transcriptional regulator
Shewmr4_0383  8  25  2.312775  short-chain dehydrogenase/reductase SDR
Shewmr4_0384  8  24  2.464144  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_0380  OMPADOMAIN  88  5e-23  OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 88.1 bits (218), Expect = 5e-23
Identities = 33/118 (27%), Positives = 53/118 (44%), Gaps = 12/118 (10%)

Query: 77 NVLFPNDSAYIAPEYYPQIEEIAMFLRQY--PTTKVTIEGHTSRTGTDERNAVLSQERAD 134
+VLF + A + PE ++++ L V + G+T R G+D N LS+ RA
Sbjct: 220 DVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQ 279

Query: 135 AVTAVLADRFSIDRSRLTAIGYGSSRPLVLEHTPDAETR---------NRRVVAEVTG 183
+V L + I +++A G G S P+ + + R +RRV EV G
Sbjct: 280 SVVDYLISK-GIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEVKG 336


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_0382  RTXTOXIND  313  e-104  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 313 bits (804), Expect = e-104
Identities = 92/432 (21%), Positives = 198/432 (45%), Gaps = 11/432 (2%)

Query: 29 RLIIWALAAMVVCFLLWAGFAQLDKVTTGSGKVIPSSQVQVIQSLDGGIMQELYVREGDI 88
RL+ + + +V + + Q++ V T +GK+ S + + I+ ++ I++E+ V+EG+
Sbjct: 58 RLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGES 117

Query: 89 VTKGQPLVRIDDTRFRSDFAQQEQEVFGLKTNVIRMRAELDSILISDMTSDWREQVKITK 148
V KG L+++ +D + + + + R + SI E K+ +
Sbjct: 118 VRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSI----------ELNKLPE 167

Query: 149 KALVFPESLTEAEPALVKRQQEEYNGRLDNLSNQLEILVRQIQQRQQEIDDLASKTTTLT 208
L V R + NQ + +++ E + ++
Sbjct: 168 LKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYE 227

Query: 209 TSMQLISRELELTRPLAKKGIVPEVELLKLERAVNDAQGELNSLRLLRPKLKAALDEAIL 268
++ L+ L K + + +L+ E +A EL + ++++ + A
Sbjct: 228 NLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKE 287

Query: 269 KRREAVFVYAADLRAQLNETQTRLSRMNEAQVGAQDKVSKAIITSPVNGTIKSVHINTLG 328
+ + ++ ++ +L +T + + +++ ++I +PV+ ++ + ++T G
Sbjct: 288 EYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEG 347

Query: 329 GVVQPGVDIVEIVPSEDQLLIETKILPKDIAFLHTGLPAVVKVTAYDFTRYGGLKGTVEH 388
GVV ++ IVP +D L + + KDI F++ G A++KV A+ +TRYG L G V++
Sbjct: 348 GVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKN 407

Query: 389 ISADTSQDEEGNSFYLIKVRTEESSLVKDDGTQMPIIPGMLTTVDVITGQRSILEYILNP 448
I+ D +D+ + + + EE+ L +P+ GM T ++ TG RS++ Y+L+P
Sbjct: 408 INLDAIEDQRLGLVFNVIISIEENCLS-TGNKNIPLSSGMAVTAEIKTGMRSVISYLLSP 466

Query: 449 ILRAKDTALRER 460
+ + +LRER
Sbjct: 467 LEESVTESLRER 478


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_0384  CABNDNGRPT  72  4e-14  NodO calcium binding signature.
		>CABNDNGRPT#NodO calcium binding signature.

Length = 479

Score = 71.5 bits (175), Expect = 4e-14
Identities = 37/124 (29%), Positives = 52/124 (41%), Gaps = 9/124 (7%)

Query: 5248 DTVNLGEGDDTVYGGEGTQMVYGGAGNDLLIGGAGIDGLRGGDGNDTLIGGLGDDVLRGD 5307
++ G + GG G ++ G + +++L GGAG D L GG G DTL GG G D
Sbjct: 332 VSIAHGVTIENAIGGSGNDILVGNSADNILQGGAGNDVLYGGAGADTLYGGAGRDTFVYG 391

Query: 5308 GGSDTFVWRYADADQGTDHIMDFNVREDKLDLSDLLQGETANTLESYLKFSLDNGSTVID 5367
G D+ V D I DF DK+DLS +F+ ++
Sbjct: 392 SGQDSTV-------AAYDWIADFQKGIDKIDLSAF--RNEGQLSFVQDQFTGKGQEVMLQ 442

Query: 5368 IDAN 5371
DA
Sbjct: 443 WDAA 446


7  Shewmr4_0416  Shewmr4_0470  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
Shewmr4_0416  -1  20  -4.080541  hypothetical protein
Shewmr4_0417  -1  23  -4.499317  3-oxoacyl-(acyl carrier protein) synthase II
Shewmr4_0418  0  24  -5.427204  3-ketoacyl-(acyl-carrier-protein) reductase
Shewmr4_0419  -1  24  -5.165927  thioester dehydrase family protein
Shewmr4_0420  0  24  -5.120090  3-oxoacyl-(acyl carrier protein) synthase I
Shewmr4_0421  0  23  -4.992796  hypothetical protein
Shewmr4_0422  -1  17  -2.752162  hypothetical protein
Shewmr4_0423  -1  18  -1.646782  monooxygenase, FAD-binding
Shewmr4_0424  2  23  1.248745  hypothetical protein
Shewmr4_0425  3  32  1.228478  hypothetical protein
Shewmr4_0426  0  19  0.277058  hypothetical protein
Shewmr4_0427  0  17  0.018021  hypothetical protein
Shewmr4_0428  0  18  -0.102308  thioesterase superfamily protein
Shewmr4_0429  -1  12  -0.346055  histidine ammonia-lyase
Shewmr4_0430  -2  9  -1.290864  glycosyl transferase family protein
Shewmr4_0431  -1  9  -1.294929  thioester dehydrase family protein
Shewmr4_0432  1  20  -0.853690  hypothetical protein
Shewmr4_0433  2  25  -0.839031  hypothetical protein
Shewmr4_0434  4  29  -0.959272  acyl carrier protein
Shewmr4_0435  0  20  -0.479175  acyl carrier protein
Shewmr4_0436  0  18  0.176088  phospholipid/glycerol acyltransferase
Shewmr4_0437  0  20  -0.100964  hypothetical protein
Shewmr4_0438  -1  12  1.088487  hypothetical protein
Shewmr4_0439  -1  12  1.883938  ATP-dependent DNA helicase RecG
Shewmr4_0440  0  14  2.259100  periplasmic binding protein/LacI transcriptional
Shewmr4_0441  -1  17  3.192763  hypothetical protein
Shewmr4_0442  -1  15  3.513761  diguanylate cyclase
Shewmr4_0443  -1  16  3.637071  hypothetical protein
Shewmr4_0444  -2  17  3.798723  amino acid ABC transporter periplasmic protein
Shewmr4_0445  -1  15  2.975596  hypothetical protein
Shewmr4_0446  0  15  3.057135  hypothetical protein
Shewmr4_0447  1  14  3.011035  hypothetical protein
Shewmr4_0448  0  16  0.873377  DNA-binding transcriptional regulator IlvY
Shewmr4_0449  2  16  -0.311873  ketol-acid reductoisomerase
Shewmr4_0450  0  13  -0.087805  acetolactate synthase 2 catalytic subunit
Shewmr4_0451  0  13  0.511348  acetolactate synthase 2 regulatory subunit
Shewmr4_0452  0  13  -0.539134  dihydroxy-acid dehydratase
Shewmr4_0453  0  11  -1.054129  threonine dehydratase
Shewmr4_0454  0  13  0.759446  alanine-glyoxylate aminotransferase
Shewmr4_0455  0  16  2.960308  hypothetical protein
Shewmr4_0456  1  16  3.312857  hypothetical protein
Shewmr4_0457  0  17  2.891256  hypothetical protein
Shewmr4_0458  0  18  3.565623  phosphatidylglycerophosphatase
Shewmr4_0459  1  20  3.880813  hypothetical protein
Shewmr4_0460  1  20  3.269668  hypothetical protein
Shewmr4_0461  1  22  3.142378  hypothetical protein
Shewmr4_0462  1  24  3.395815  hypothetical protein
Shewmr4_0463  0  19  1.742844  hypothetical protein
Shewmr4_0464  1  17  -0.658847  hypothetical protein
Shewmr4_0465  1  14  -1.836736  hypothetical protein
Shewmr4_0466  2  18  -5.396185  ATP-dependent DNA helicase Rep
Shewmr4_0467  4  20  -7.060418  diguanylate cyclase/phosphodiesterase with GAF
Shewmr4_0468  2  14  -4.561555  ****diguanylate cyclase/phosphodiesterase
Shewmr4_0469  1  12  -3.939002  hypothetical protein
Shewmr4_0470  0  12  -4.539363  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_0418  PREPILNPTASE  332  e-117  Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 332 bits (854), Expect = e-117
Identities = 167/304 (54%), Positives = 205/304 (67%), Gaps = 15/304 (4%)

Query: 5 ISLLSHSLAQSPWLFIALSFVFAATIGSFLNVVIHRFPVMMKREWQQECNQYLQEYHVDV 64
++LL PWL+ +L F+F+ IGSFLNVVIHR P+M++REWQ E Y
Sbjct: 1 MALLLELAHGLPWLYFSLVFLFSLMIGSFLNVVIHRLPIMLEREWQAEYRSYFNPD---- 56

Query: 65 VKQIGIEKLNKPIDTYPEKYNLVVPGSACPKCKTAIKPWHNLPIVGWLMLRGKCAACDAP 124
YNL+VP S CP C I N+P++ WL LRG+C C AP
Sbjct: 57 -----------DEGVDEPPYNLMVPRSCCPHCNHPITALENIPLLSWLWLRGRCRGCQAP 105

Query: 125 ISSRYPIIELVTGLLVATLAWHFGPSWQFVFAAVLTFVLIALTGIDLDEMLLPDQMTLPL 184
IS+RYP++EL+T LL +A P W + A +LT+VL+ALT IDLD+MLLPDQ+TLPL
Sbjct: 106 ISARYPLVELLTALLSVAVAMTLAPGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPL 165

Query: 185 LWLGLLINLNHTFTTPTDAVIGAAAGYLSLWSVFWLFKLLTGKEGMGYGDFKLLAVFGAW 244
LW GLL NL F + DAVIGA AGYL LWS++W FKLLTGKEGMGYGDFKLLA GAW
Sbjct: 166 LWGGLLFNLLGGFVSLGDAVIGAMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAW 225

Query: 245 LGWQMLPLVILLSSLVGALVGITLIVLKRNQLANPIPFGPYIAAAGWIALIWGQPIVDWY 304
LGWQ LP+V+LLSSLVGA +GI LI+L+ + + PIPFGPY+A AGWIAL+WG I WY
Sbjct: 226 LGWQALPIVLLLSSLVGAFMGIGLILLRNHHQSKPIPFGPYLAIAGWIALLWGDSITRWY 285

Query: 305 LSTL 308
L+
Sbjct: 286 LTNF 289


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0419  BCTERIALGSPF  391    e-136    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 391 bits (1006), Expect = e-136
Identities = 117/405 (28%), Positives = 211/405 (52%), Gaps = 10/405 (2%)

Query: 25 TFEWKGLNRDGKKTGGELKGTSVAEVKSQLKLQGVNPKVVRKK---------ASPLFAHN 75
+ ++ L+ GKK G + S + + L+ +G+ P V +
Sbjct: 3 QYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLRRK 62

Query: 76 PDIKPMDIAMVTRQIATMLAAGVPLVTTIEMLGRGHEKQKMRELLGTILSEVQSGIPLSD 135
+ D+A++TRQ+AT++AA +PL ++ + + EK + +L+ + S+V G L+D
Sbjct: 63 IRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLAD 122

Query: 136 SLRPHRRYFDDLYVDLVAAGEHSGSLDAVFDRIATYREKAEALKSKIKKAMFYPAAVVVV 195
+++ F+ LY +VAAGE SG LDAV +R+A Y E+ + ++S+I++AM YP + VV
Sbjct: 123 AMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLTVV 182

Query: 196 AIAVTTLLLLFVVPQFESIFASFGAELPAFTQMIVGISRFLQSSWYIFFAAIAISIWLFV 255
AIAV ++LL VVP+ F LP T++++G+S +++ + + ++ ++
Sbjct: 183 AIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRT-FGPWMLLALLAGFMAF 241

Query: 256 RAHRNSQMFRDRIDELVLKIPIIGEILHKAAMARFSRTLATTFAAGVPLIDGMESAAGAS 315
R + R +L +P+IG I AR++RTL+ A+ VPL+ M +
Sbjct: 242 RVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDVM 301

Query: 316 GNAVYRSALLKVRQEVMAGMQMNVAMRTTGLFPDMLIQMVMIGEESGSLDSMLNKVANIY 375
N R L V G+ ++ A+ T LFP M+ M+ GE SG LDSML + A+
Sbjct: 302 SNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADNQ 361

Query: 376 EMQVDDAVDGLSSLIEPIMMVVIGILVGGLIVGMYLPIFQMGNVV 420
+ + + L EP+++V + +V +++ + PI Q+ ++
Sbjct: 362 DREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0422  BCTERIALGSPG  48     3e-10    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 48.3 bits (115), Expect = 3e-10
Identities = 21/60 (35%), Positives = 35/60 (58%), Gaps = 5/60 (8%)

Query: 1 MKATNQTLNKKAQGFTLIELMIVVAIIGILAAIALPAYKEYVNKSKINSCLSEASAWTKA 60
M+AT+ K +GFTL+E+M+V+ IIG+LA++ +P K+ +S+ A A
Sbjct: 1 MRATD-----KQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENA 55


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0441  DHBDHDRGNASE  76     2e-18    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 76.2 bits (187), Expect = 2e-18
Identities = 49/184 (26%), Positives = 80/184 (43%), Gaps = 2/184 (1%)

Query: 3 GLTGKVVIITGASEGIGRALAIAMARVGCQLVLSARNETRLASLALEIANYGPTPFVFAA 62
G+ GK+ ITGA++GIG A+A +A G + N +L + + F A
Sbjct: 5 GIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPA 64

Query: 63 DVSSASQCEALIQATIVHYGRLDILVNNAGMTMWSRFDELNQLSVLEDIMRVNYLGPAYL 122
DV ++ + + G +DILVN AG+ L+ E VN G
Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEW-EATFSVNSTGVFNA 123

Query: 123 THAALPYLKSSQ-GQVVVVASVAGLTGVPTRSGYAASKHAVIGFFDSLRIELTDDNVAVT 181
+ + Y+ + G +V V S + + YA+SK A + F L +EL + N+
Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183

Query: 182 VICP 185
++ P
Sbjct: 184 IVSP 187


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0452  TCRTETA       32     0.003    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 32.5 bits (74), Expect = 0.003
Identities = 53/286 (18%), Positives = 101/286 (35%), Gaps = 30/286 (10%)

Query: 15 LFVPVTGLSLFALASGYLMSLIPLSLSYFELSPDLAP---WLASIFYLGLLLGAPCIAPI 71
L V ++ ++L A+ G +M ++P L S D+ L +++ L AP + +
Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66

Query: 72 VARIGHSKAFILFLNILLCSVVVMVLLPQGGIWL--ASRLVAGVAVAGIFVVVESWLLMA 129
R G + +L +++ +V ++ +W+ R+VAG+ A V +++
Sbjct: 67 SDRFG--RRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGA-TGAVAGAYIADI 123

Query: 130 DTQKQRAKRLGLYMTALYG-GTAIGQLAVDYLGTTGNLPYLVVIGLLAAASLPALLVKRG 188
+RA+ G +M+A +G G G + +G L +
Sbjct: 124 TDGDERARHFG-FMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLP 182

Query: 189 QPQSSEQHSIALSDLKNLSKPAVVGCLVSGLLLGPIYGLLPVYVSQDMGFAQQTGQFMAL 248
+ E+ + L L+ + + L+ V+ Q GQ A
Sbjct: 183 ESHKGERRPLRREALNPLAS------FRWARGMTVVAALMAVFFI-----MQLVGQVPAA 231

Query: 249 IIMGGMIVQPLVSYLSPRFQKSALMAAFCLIGAAALFLLTQKSLVG 294
+ V + RF A L L L Q + G
Sbjct: 232 L---------WVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITG 268


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0457  INFPOTNTIATR  70     4e-18    Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 70.4 bits (172), Expect = 4e-18
Identities = 39/96 (40%), Positives = 51/96 (53%), Gaps = 2/96 (2%)

Query: 12 GEGKEAVKGALITTQYRGFLQDGTQFDSSYDRGQAFQCVIGTGRVIKGWDQGLMGMKVGG 71
G G + K +T +Y G L DGT FDS+ G+ +VI GW + L M G
Sbjct: 136 GTGAKPGKSDTVTVEYTGTLIDGTVFDSTEKAGKP--ATFQVSQVIPGWTEALQLMPAGS 193

Query: 72 KRKLFVPAHLAYGERQIGAHIKPNSDLTFEIELLEV 107
++FVPA LAYG R +G I PN L F+I L+ V
Sbjct: 194 TWEVFVPADLAYGPRSVGGPIGPNETLIFKIHLISV 229


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0468  SECA          49     1e-08    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 49.5 bits (118), Expect = 1e-08
Identities = 15/20 (75%), Positives = 18/20 (90%)

Query: 2 KLGRNDPCHCGSGKKFKRCC 21
K+GRNDPC CGSGKK+K+C
Sbjct: 878 KVGRNDPCPCGSGKKYKQCH 897


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0469  RTXTOXIND     31     0.012    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 30.6 bits (69), Expect = 0.012
Identities = 13/100 (13%), Positives = 30/100 (30%), Gaps = 5/100 (5%)

Query: 220 AEKDTLLRTLFEQGLLTRHQALALTRKESANTDETYQYWLTPDLITPYIQQAQSQLQQEQ 279
+ +L + + +H L K +E Y + I I A+ + Q
Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293

Query: 280 AEALAKKLAIEKAEKEKMLTDIYNQREHSWQQAQEQANRT 319
K ++K + + + +E+ +
Sbjct: 294 QL--FKNEILDKLRQTTDNIGLLTLEL---AKNEERQQAS 328


8  Shewmr4_0505  Shewmr4_0525  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_0505  1       15       4.048003   thioredoxin
Shewmr4_0506  1       16       4.066925   ATP-dependent RNA helicase RhlB
Shewmr4_0507  2       19       3.699476   Ppx/GppA phosphatase
Shewmr4_0508  2       18       3.539718   hypothetical protein
Shewmr4_0509  1       18       2.613767   hypothetical protein
Shewmr4_0510  2       18       2.040383   mutator MutT protein
Shewmr4_0511  1       16       2.485437   hypothetical protein
Shewmr4_0512  1       15       3.767861   hypothetical protein
Shewmr4_0513  -1      19       5.261295   dephospho-CoA kinase
Shewmr4_0514  0       20       5.431919   type 4 prepilin peptidase 1. Aspartic peptidase.
Shewmr4_0515  0       21       5.375801   hypothetical protein
Shewmr4_0516  0       21       5.137384   type II secretion system protein
Shewmr4_0517  -1      20       4.583830   type IV-A pilus assembly ATPase PilB
Shewmr4_0518  -2      20       4.264405   O-antigen polymerase
Shewmr4_0519  0       18       2.213040   methylation site containing protein
Shewmr4_0520  -1      17       1.896527   hypothetical protein
Shewmr4_0521  -1      18       1.883681   O-antigen polymerase
Shewmr4_0522  -3      17       1.155844   nicotinate-nucleotide pyrophosphorylase
Shewmr4_0523  -2      16       0.119069   N-acetyl-anhydromuranmyl-L-alanine amidase
Shewmr4_0524  1       13       -0.949626  regulatory protein AmpE
Shewmr4_0525  3       15       -1.859214  transcriptional regulator PdhR
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0507  IGASERPTASE   33     0.001    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 32.7 bits (74), Expect = 0.001
Identities = 20/80 (25%), Positives = 30/80 (37%), Gaps = 2/80 (2%)

Query: 145 PMAYDDTPVAVSPPVRVTTSMQYSPSEGRMVSNMPTNSATVISQTGASTARASTASAEQI 204
P + +T P + T+S P N NS + T ++E
Sbjct: 1158 PQSQTNTTADTEQPAKETSSNVEQPVTESTTVN-TGNSVVENPENTTPATTQPTVNSES- 1215

Query: 205 ANVPRARAARSVSSLPSNAR 224
+N P+ R RSV S+P N
Sbjct: 1216 SNKPKNRHRRSVRSVPHNVE 1235


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0508  TCRTETA       52     2e-09    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 52.1 bits (125), Expect = 2e-09
Identities = 61/322 (18%), Positives = 107/322 (33%), Gaps = 22/322 (6%)

Query: 50 VAHVSYAISAYALGVVVGSPIIMVLGVRIKRRTLLIALAAMMAVANGLSALAPSLNWLVF 109
AH ++ YAL +P++ L R RR +L+ A AV + A AP L L
Sbjct: 42 TAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYI 101

Query: 110 FRFLSGLPHGAYFGVAMLLAASLVPPDMKARAVSRVIIGLTLATIVGVPFATWMGQTVGW 169
R ++G+ GA VA A + D +AR + + G MG
Sbjct: 102 GRIVAGIT-GATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGG-FSP 159

Query: 170 RSGIGIVAIIAAVTAVMLYFLAPNVAVPQNASPKKELQTLKNREVWLTLGIAAIGFGGIF 229
+ A + + + FL P + + N +
Sbjct: 160 HAPFFAAAALNGLNFLTGCFLLPE---SHKGERRPLRREALNPLASFRWARGMTVVAALM 216

Query: 230 CVYTYLAETLIQVTQV------------EPFKIPVMIAVFGI-GATLGTLVCGWAADK-S 275
V+ ++ + + QV + I + +A FGI + ++ G A +
Sbjct: 217 AVF-FIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLG 275

Query: 276 ALAAAFWSLVLSTLVLALYPSLTGSYWALMPI-VFFVGSGIGLATIVQARLMDVAPDGQA 334
A ++ L + W PI V GIG+ + V + Q
Sbjct: 276 ERRALMLGMIADGTGYILL-AFATRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQG 334

Query: 335 MTGALVQCAFNLANAIGPWVGS 356
+ +L + +GP + +
Sbjct: 335 QLQGSLAALTSLTSIVGPLLFT 356


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0509  DHBDHDRGNASE  52     2e-10    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 52.4 bits (125), Expect = 2e-10
Identities = 41/183 (22%), Positives = 80/183 (43%), Gaps = 5/183 (2%)

Query: 2 ILITGASSGLGAALAALYAKDNQALTLTGRDANRLHTVANALSPFSTQSISAISADLADE 61
ITGA+ G+G A+A A + + +L V ++L + + A AD+ D
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHA-EAFPADVRDS 69

Query: 62 ASVEALFDGL---TDTPNTVIHCAGSGYFGALETQGTSEIQALLNNNVTSTILLVRELVK 118
A+++ + + + +++ AG G + + E +A + N T R + K
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSK 129

Query: 119 RYK-QQAVKVVVVMSTAALAAKAGESTYCAAKWAVRGFIESVRLELKQSQMKLIAVYPGG 177
+++ +V V S A + + Y ++K A F + + LEL + ++ V PG
Sbjct: 130 YMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGS 189

Query: 178 MDT 180
+T
Sbjct: 190 TET 192


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0510  RTXTOXIND     29     0.007    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.0 bits (65), Expect = 0.007
Identities = 9/23 (39%), Positives = 11/23 (47%)

Query: 126 GIVGAIWVKDGDDVAFDQPLFTL 148
IV I VK+G+ V L L
Sbjct: 105 SIVKEIIVKEGESVRKGDVLLKL 127


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0516  RTXTOXIND     31     0.015    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 30.6 bits (69), Expect = 0.015
Identities = 21/167 (12%), Positives = 52/167 (31%), Gaps = 17/167 (10%)

Query: 80 EVQAQIARQQQAELAIAAADRAVYNPEL-GLNYQNADTDTYSLGLSQTIDWGDKRGVATR 138
+ + QA L + EL L + Y +S+ + +
Sbjct: 135 DTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQ 194

Query: 139 LAQLEAQILLADISLERSQMLAERLLALAEQAQSNKALTFAEQQLRFTQAQLNIAEQRFA 198
+ + Q +++L++ + AE+ + E R +++L+
Sbjct: 195 FSTWQNQKYQKELNLDKKR---------AERLTVLARINRYENLSRVEKSRLDDFSSLLH 245

Query: 199 AGDLS-----DVELQLLKL--ELASNTADYAMAEQAALVADGKVIEL 238
++ + E + ++ EL + E L A + +
Sbjct: 246 KQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLV 292


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0517  RTXTOXIND     55     2e-10    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 55.2 bits (133), Expect = 2e-10
Identities = 31/138 (22%), Positives = 56/138 (40%), Gaps = 9/138 (6%)

Query: 157 EVAKAQAEYINAAAEWNRVRR---MSESAVSVSRRMQAQVDAELKRAILEAIKMTAEQIR 213
V + + +Y+ A E + ES + ++ V K IL+ ++ T + I
Sbjct: 253 AVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIG 312

Query: 214 TLE----STPEAIGSYQLLAPIDGRVQQ-DIAMLGQVFTAGTPLMQLT-DESHLWVEAQL 267
L E + + AP+ +VQQ + G V T LM + ++ L V A +
Sbjct: 313 LLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALV 372

Query: 268 TPAQAANVNAGGPALVQV 285
+N G A+++V
Sbjct: 373 QNKDIGFINVGQNAIIKV 390



Score = 42.9 bits (101), Expect = 2e-06
Identities = 28/149 (18%), Positives = 55/149 (36%), Gaps = 5/149 (3%)

Query: 101 SLSNLNLDTMATATLVVDRDRTATLAPQLDVRVQARHVVPGQEVKKGEPLLTLGG----A 156
L + + A L R+ + P + V+ V G+ V+KG+ LL L A
Sbjct: 76 VLGQVEIVATANGKLTHS-GRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEA 134

Query: 157 EVAKAQAEYINAAAEWNRVRRMSESAVSVSRRMQAQVDAELKRAILEAIKMTAEQIRTLE 216
+ K Q+ + A E R + +S S D + + E + + +
Sbjct: 135 DTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQ 194

Query: 217 STPEAIGSYQLLAPIDGRVQQDIAMLGQV 245
+ YQ +D + + + +L ++
Sbjct: 195 FSTWQNQKYQKELNLDKKRAERLTVLARI 223


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0518  ACRIFLAVINRP  653    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 653 bits (1686), Expect = 0.0
Identities = 224/1077 (20%), Positives = 440/1077 (40%), Gaps = 72/1077 (6%)

Query: 9 AIKNRLLVVLALLAVVAASVAMLPKLNLDAFPDVTNVQVTINTAAEGLAAEEVEKLISYP 68
I+ + + + ++ A + +L + +P + V+++ G A+ V+ ++
Sbjct: 5 FIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQV 64

Query: 69 VESAMYALPAVTEVRSLS-RTGLSIVTVVFAEGTDIYFARQQVFEQLQAAREMIPSGVGV 127
+E M + + + S S G +T+ F GTD A+ QV +LQ A ++P V
Sbjct: 65 IEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQ 124

Query: 128 PEIGPNTSGLGQIYQYILRADPSSGINAAELRSLNDYLVKLIMMPVGGVTDVLSFGGEVR 187
I S + +D + G ++ VK + + GV DV FG +
Sbjct: 125 QGISVEKSSSSYLMVAGFVSD-NPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQ-Y 182

Query: 188 QYQVQVDPNKLRAYGLSMAQVSEALESNNRNAGGWFMDQGQEQLVVRGYGMLPAGEAGLA 247
++ +D + L Y L+ V L+ N + G L + +
Sbjct: 183 AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLG-GTPALPGQQLNASIIAQTRFK 241

Query: 248 AIAQIPLTEVR----GTPVRVGDIAKVDFGSEIRVGAVTMTRRDEAGQAQALGEVVAGVV 303
+ +R G+ VR+ D+A+V+ G E + G+ +
Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARIN-----GK-----PAAGLGI 291

Query: 304 LKRMGANTKATIDDIDARINLIEQALPKGVSFEVFYDQADLVDKAVTTVRDALLMAFVFI 363
GAN T I A++ ++ P+G+ YD V ++ V L A + +
Sbjct: 292 KLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLV 351

Query: 364 VVILALFLVNIRATLLVLLSIPVSIGLALMVMSYYGLSANLMSLGGLAVAIGMLVDGSVV 423
+++ LFL N+RATL+ +++PV + +++ +G S N +++ G+ +AIG+LVD ++V
Sbjct: 352 FLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIV 411

Query: 424 MVENIFKHLTQPDRRHLAQARSRADGEVDPYHGDEDGSVRAVEADNSMAVRIMLAAKEVC 483
+VEN+ + + + + EA + ++
Sbjct: 412 VVENVERVM-------------------------MEDKLPPKEA-------TEKSMSQIQ 439

Query: 484 SPIFFATAIIIVVFAPLFALEGVEGKLFQPMAVSIILAMISALLVALIAVPALAVYLFK- 542
+ ++ VF P+ G G +++ +++I+ AM ++LVALI PAL L K
Sbjct: 440 GALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKP 499

Query: 543 ---------RGVVLKESVVLVPLDSAYRKLLSATLARPKLVMISALLMFAMSMVLLPRLG 593
G + + Y + L ++ L+ A +VL RL
Sbjct: 500 VSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLP 559

Query: 594 TEFVPELEEGTINLRVTLAPTASLGTSLDVAPKLEAMLLEFPEVEYALSRIGAPELGGDP 653
+ F+PE ++G + L A+ + V ++ L+ + S
Sbjct: 560 SSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNEKANVE-SVFTVNGFSFSG 618

Query: 654 EPVSNIEVYIGLKPIEEWQSASSRLE--LQRLMEEKLSVFPGLLLTFSQPIATRVDELLS 711
+ + ++ LKP EE + E + R E + G ++ F+ P + EL +
Sbjct: 619 QAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGFVIPFNMP---AIVELGT 675

Query: 712 GVKAQLA-IKLFGPDLDVLSEKGQVLTDLVAKIPGAV-DVSLEQVSGEAQLVVRPDRSQL 769
I G D L++ L + A+ P ++ V + AQ + D+ +
Sbjct: 676 ATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKA 735

Query: 770 ARYGISVDQVMSLVSQGIGGASAGQVIDGNARYDINLRLAAEYRSSPDVIKDLLLSGSNG 829
G+S+ + +S +GG ID + ++ A++R P+ + L + +NG
Sbjct: 736 QALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANG 795

Query: 830 ATVRLGEVASVEVEMAPPNIRRDDVQRRVVVQANVA-GRDMGSVVKDIYELVPQADLPAG 888
V + P + R + + +Q A G G + + L + LPAG
Sbjct: 796 EMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMALMENLASK--LPAG 853

Query: 889 YTVIVGGQYENQQRAQQKLMLVVPISIALIALLLYFSFGAVKQVLLIMANVPLALIGGIV 948
G ++ + + +V IS ++ L L + + + +M VPL ++G ++
Sbjct: 854 IGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLL 913

Query: 949 ALFVSGTYLSVPSSIGFITLFGVAVLNGVVLVDSINQ-RRQSGESLYDSVYEGTVGRLRP 1007
A + V +G +T G++ N +++V+ + G+ + ++ RLRP
Sbjct: 914 AATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRP 973

Query: 1008 VLMTALTSALGLIPILVSSGVGSEIQKPLAVVIIGGLFSSTALTLLVLPTLYRWLYR 1064
+LMT+L LG++P+ +S+G GS Q + + ++GG+ S+T L + +P + + R
Sbjct: 974 ILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRR 1030



Score = 109 bits (273), Expect = 4e-26
Identities = 85/550 (15%), Positives = 186/550 (33%), Gaps = 68/550 (12%)

Query: 10 IKNRLLVVLALLAVVAASVAMLPKLNLDAFPDVTNVQVTIN-TAAEGLAAEEVEKLI--- 65
+ + +L +VA V + +L P+ G E +K++
Sbjct: 534 LGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQV 593

Query: 66 -SYPVESAMYALPAVTEVRSLSRTGLS----IVTVVFAEGTDIYFARQQVFEQLQAAREM 120
Y +++ + +V V S +G + + V + + A+
Sbjct: 594 TDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKME 653

Query: 121 ---IPSGVGVPEIGPNTSGLGQIYQYILRADPSSGINAAELRSLNDYLVKLIMMPVGGVT 177
I G +P P LG + +G+ L + L+ + +
Sbjct: 654 LGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLV 713

Query: 178 DVLSFGGEVR-QYQVQVDPNKLRAYGLSMAQVSEALES--NNRNAGGWFMDQGQEQLVVR 234
V G E Q++++VD K +A G+S++ +++ + + + ++L V+
Sbjct: 714 SVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQ 773

Query: 235 GYGMLPAGEA-GLAAIAQIPLTEVRGTPVRVGDIAKVDFGSEIRVGAVTMTRRDEAGQAQ 293
A + ++ + G V + G+ + R + +
Sbjct: 774 ----ADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVY----GSPRLERYNGLPSME 825

Query: 294 ALGEVVAGVVLKRMGANTKATIDDIDARINLIEQALPKGVSFEVFYDQADLVDKAVTTVR 353
GE G D A + + LP G+ ++ + + +
Sbjct: 826 IQGEAAPGTSS-----------GDAMALMENLASKLPAGIGYD-WTGMSYQERLSGNQAP 873

Query: 354 DALLMAFVFIVVILALFLVNIRATLLVLLSIPVSIGLALMVMSYYGLSANLMSLGGLAVA 413
+ ++FV + + LA + + V+L +P+ I L+ + + ++ + GL
Sbjct: 874 ALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTT 933

Query: 414 IGMLVDGSVVMVENIFKHLTQPDRRHLAQARSRADGEVDPYHGDEDGSVRAVEADNSMAV 473
IG+ ++++VE A+ +G+ VEA
Sbjct: 934 IGLSAKNAILIVE-------------FAKDLMEKEGK------------GVVEA------ 962

Query: 474 RIMLAAKEVCSPIFFATAIIIVVFAPLFALEGVEGKLFQPMAVSIILAMISALLVALIAV 533
++A + PI + I+ PL G + + ++ M+SA L+A+ V
Sbjct: 963 -TLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFV 1021

Query: 534 PALAVYLFKR 543
P V + +
Sbjct: 1022 PVFFVVIRRC 1031



Score = 98.8 bits (246), Expect = 5e-23
Identities = 66/347 (19%), Positives = 140/347 (40%), Gaps = 16/347 (4%)

Query: 735 VLTDLVAKIPGAVDVSLEQVSGEAQLVVRPDRSQLARYGISVDQVMSLVSQGIGGASAGQ 794
+ D ++++ G DV L + + + D L +Y ++ V++ + +AGQ
Sbjct: 161 NVKDTLSRLNGVGDVQL--FGAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQ 218

Query: 795 VIDGNARYDINLRLA----AEYRSSPDVIKDLLLSGSNGATVRLGEVASVEVEMAPPNIR 850
+ A L + +++ + K L S+G+ VRL +VA VE+ N+
Sbjct: 219 LGGTPALPGQQLNASIIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVI 278

Query: 851 -RDDVQRRVVVQANVA-GRDMGSVVKDIYELVP--QADLPAGYTVIVGGQYENQQRAQQK 906
R + + + +A G + K I + Q P G V+ Y+ Q
Sbjct: 279 ARINGKPAAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLY--PYDTTPFVQLS 336

Query: 907 LMLVVP---ISIALIALLLYFSFGAVKQVLLIMANVPLALIGGIVALFVSGTYLSVPSSI 963
+ VV +I L+ L++Y ++ L+ VP+ L+G L G ++ +
Sbjct: 337 IHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMF 396

Query: 964 GFITLFGVAVLNGVVLVDSINQRRQS-GESLYDSVYEGTVGRLRPVLMTALTSALGLIPI 1022
G + G+ V + +V+V+++ + ++ + ++ A+ + IP+
Sbjct: 397 GMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPM 456

Query: 1023 LVSSGVGSEIQKPLAVVIIGGLFSSTALTLLVLPTLYRWLYRHDKSP 1069
G I + ++ I+ + S + L++ P L L + +
Sbjct: 457 AFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAE 503


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0521  MECHCHANNEL   174    1e-59    Bacterial mechano-sensitive ion channel signature.
		>MECHCHANNEL#Bacterial mechano-sensitive ion channel signature.

Length = 136

Score = 174 bits (443), Expect = 1e-59
Identities = 89/136 (65%), Positives = 111/136 (81%), Gaps = 1/136 (0%)

Query: 1 MSLIKEFKAFASRGNVIDMAVGIIIGAAFGKIVSSFVADIIMPPIGIILGGVNFSDLSIV 60
MS+IKEF+ FA RGNV+D+AVG+IIGAAFGKIVSS VADIIMPP+G+++GG++F ++
Sbjct: 1 MSIIKEFREFAMRGNVVDLAVGVIIGAAFGKIVSSLVADIIMPPLGLLIGGIDFKQFAVT 60

Query: 61 LQAAQGDAPAVVIAYGKFIQTVIDFTIIAFAIFMGLKAINSLKRKQEEAPKAPPAPTKDQ 120
L+ AQGD PAVV+ YG FIQ V DF I+AFAIFM +K IN L RK+EE P A PAPTK++
Sbjct: 61 LRDAQGDIPAVVMHYGVFIQNVFDFLIVAFAIFMAIKLINKLNRKKEE-PAAAPAPTKEE 119

Query: 121 ELLSEIRDLLKAQQEK 136
LL+EIRDLLK Q +
Sbjct: 120 VLLTEIRDLLKEQNNR 135


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0523  RTXTOXIND     95     2e-23    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 94.5 bits (235), Expect = 2e-23
Identities = 38/290 (13%), Positives = 91/290 (31%), Gaps = 28/290 (9%)

Query: 71 LAQLEDNQFSAKVSQAEASLGSAKADLQTLAAKVELQHALISQASAGVVAAQADKLRAEQ 130
+ + + S + + + ++ + A A + + +
Sbjct: 176 FQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKS 235

Query: 131 QLTRAKKLKVSNYSSQDDVDQLQAGFDSAAAGLDEAKA--------LLVAKERELAVFN- 181
+L L ++ V + + + A L K+ +L AKE V
Sbjct: 236 RLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQL 295

Query: 182 ------AQLNQAGSVVEQSNAALELAKIQLNDTRVTAPFSGVIGKRGAM-VGQYVQPGQA 234
+L Q + L + + + + AP S + + G V +
Sbjct: 296 FKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET 355

Query: 235 LYSLVPDGAV-WITANFKETQIQHMQPGQSVQVSLDAFPDKTF---TGVIDSLSPASGAK 290
L +VP+ +TA + I + GQ+ + ++AFP + G + +++ +
Sbjct: 356 LMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDA--- 412

Query: 291 FSLLPAENATGNFTKIVQRIPVRIRLDLSEEEARVVPGLSAVVKVDTASH 340
+ G ++ I + + G++ ++ T
Sbjct: 413 ----IEDQRLGLVFNVIISIEENCLST-GNKNIPLSSGMAVTAEIKTGMR 457



Score = 57.5 bits (139), Expect = 2e-11
Identities = 24/128 (18%), Positives = 48/128 (37%), Gaps = 2/128 (1%)

Query: 59 VTDNQHVRKGELLAQLEDNQFSAKVSQAEASLGSAKADLQTLAAKVELQHALISQASAGV 118
V + + VRKG++L +L A + ++SL A+ + L +
Sbjct: 112 VKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSR-SIELNKLPELKL 170

Query: 119 VAAQADKLRAEQQLTRAKKLKVSNYSS-QDDVDQLQAGFDSAAAGLDEAKALLVAKEREL 177
+ +E+++ R L +S+ Q+ Q + D A A + E
Sbjct: 171 PDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLS 230

Query: 178 AVFNAQLN 185
V ++L+
Sbjct: 231 RVEKSRLD 238


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0524  TCRTETB       123    1e-32    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 123 bits (310), Expect = 1e-32
Identities = 89/421 (21%), Positives = 177/421 (42%), Gaps = 19/421 (4%)

Query: 18 SEYERGSRRSWIAVFGGLIGAFMAILDIQITNASMKEIQGSLGATLEEGSWISTAYLVAE 77
+ Y + + R + I +F ++L+ + N S+ +I +W++TA+++
Sbjct: 3 TSYSQSNLRHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTF 62

Query: 78 MIAIPLSGWLSTGLSVRRYLLWTTAAFIFASILCSISWN-LEAMIAFRALQGFFGGALIP 136
I + G LS L ++R LL+ F S++ + + +I R +QG A
Sbjct: 63 SIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPA 122

Query: 137 LAFRLILEFLPENKRAVGMALFGVTATFAPSIGPTLGGWLTEHFSWHYLFYINVPPGLLV 196
L ++ ++P+ R L G +GP +GG + + W YL +P ++
Sbjct: 123 LVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPMITII 180

Query: 197 MAMLAYGLEKRPVVWDKLKNADLAGIVTMALGMGCLEVVLEEGNRKDWFGSDLIRNLAII 256
L K+ V + D+ GI+ M++G+ + F + + I+
Sbjct: 181 TVPFLMKLLKKEVRIKG--HFDIKGIILMSVGIVFFML----------FTTSYSISFLIV 228

Query: 257 AAVNLVLFVWIQLKRKDPLVNLRLLGKRDFVLSTIAYFLLGMALFGAIYLIPLYLSQVHD 316
+ ++ ++FV K DP V+ L F++ + ++ + G + ++P + VH
Sbjct: 229 SVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQ 288

Query: 317 YTPLEIGEVIMWMGFPQLLVL-PLVPRLMQRFDGRYLAAFGFFMFALSYYMNSQMTADYA 375
+ EIG VI++ G +++ + L+ R Y+ G ++S+ S +
Sbjct: 289 LSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFL--LET 346

Query: 376 GPQMIASQVVRALG-QPFILVPIGMLATAHLKPHENPSASTVLNVMRNLGGAFGIALVAT 434
+ +V LG F I + ++ LK E + ++LN L GIA+V
Sbjct: 347 TSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGG 406

Query: 435 L 435
L
Sbjct: 407 L 407


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0525  SACTRNSFRASE  43     8e-08    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 43.0 bits (101), Expect = 8e-08
Identities = 21/72 (29%), Positives = 30/72 (41%), Gaps = 5/72 (6%)

Query: 81 ASIGRVVVSPAGRGKGLAMPLMQRAIDAALSTWPAAGIQIGAQDYLKS---FYQKLGFSA 137
A I + V+ R KG+ L+ +AI+ A G+ + QD S FY K F
Sbjct: 90 ALIEDIAVAKDYRKKGVGTALLHKAIEWAKEN-HFCGLMLETQDINISACHFYAKHHFII 148

Query: 138 CS-DMYLEDGIP 148
+ D L P
Sbjct: 149 GAVDTMLYSNFP 160


9  Shewmr4_0596  Shewmr4_0602  Y  N  N  Genomic Island
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_0596  2       17       -0.462000  redoxin domain-containing protein
Shewmr4_0597  3       23       -1.679270  hypothetical protein
Shewmr4_0598  5       29       -2.291143  cytochrome C biogenesis protein
Shewmr4_0599  5       28       -2.407859  hypothetical protein
Shewmr4_0600  5       32       -2.839576  cytochrome c-type biogenesis protein CcmF
Shewmr4_0601  5       35       -3.140242  cytochrome c
Shewmr4_0602  3       31       -2.394093  hypothetical protein
10  Shewmr4_0630  Shewmr4_0646  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_0630  -3      17       3.429833   hypothetical protein
Shewmr4_0631  -2      18       4.148373   major facilitator transporter
Shewmr4_0632  -1      19       4.762816   hypothetical protein
Shewmr4_0633  0       18       4.532872   short-chain dehydrogenase/reductase SDR
Shewmr4_0634  1       17       3.882278   hypothetical protein
Shewmr4_0635  1       19       3.730898   biotin carboxyl carrier protein
Shewmr4_0636  2       18       3.055852   3-dehydroquinate dehydratase
Shewmr4_0637  2       19       2.661275   peptidyl-tRNA hydrolase domain protein
Shewmr4_0638  1       16       1.838528   hypothetical protein
Shewmr4_0639  -1      14       0.358243   hypothetical protein
Shewmr4_0640  -1      15       0.124290   hypothetical protein
Shewmr4_0641  -1      17       -1.442032  hypothetical protein
Shewmr4_0642  -1      17       -4.353472  hypothetical protein
Shewmr4_0643  1       20       -7.145608  outer membrane efflux protein
Shewmr4_0644  0       18       -5.760706  hypothetical protein
Shewmr4_0645  -2      12       -3.084002  RND family efflux transporter MFP subunit
Shewmr4_0646  -2      11       -3.145835  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0636  PF07201       30     0.027    Hypersensitivity response secretion protein HrpJ
		>PF07201#Hypersensitivity response secretion protein HrpJ

Length = 293

Score = 30.2 bits (68), Expect = 0.027
Identities = 41/161 (25%), Positives = 64/161 (39%), Gaps = 17/161 (10%)

Query: 420 FSRRGRELINTQRQLSENRALLEHAQRIAIAGELGASLSHELNQPLAAIGHYCHGAEVRL 479
FS R +EL +R+LS+++A + + + S EL Q + L
Sbjct: 60 FSER-KELSLDKRKLSDSQARV--SDVEEQVNQYL-SKVPELEQK-QNVSELLS----LL 110

Query: 480 QRGTRPE--ELQAVLNLIQQEVSRADSIISRLRNLLKKRPVSKQPLYLHQLVNDTAPLLA 537
+L+A L +E S ++ LR+ LK RP L LV L++
Sbjct: 111 SNSPNISLSQLKAYLEGKSEEPSEQFKMLCGLRDALKGRPELAH---LSHLVEQA--LVS 165

Query: 538 YELEQ-HQIQLSTNISGEAYQLPLDEVGMQQLLLNLLKNAA 577
EQ I L I+ EAY+ V Q L + ++A
Sbjct: 166 MAEEQGETIVLGARITPEAYRESQSGVNPLQPLRDTYRDAV 206


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0637  HTHFIS        97     1e-25    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 96.8 bits (241), Expect = 1e-25
Identities = 28/123 (22%), Positives = 52/123 (42%)

Query: 7 VYLIDDDESVRRSLRFMLESYGLNIHDFDSAEAFFAAIDLSEPGCALVDVRMPGLSGPQL 66
+ + DDD ++R L L G ++ +A + I + + DV MP + L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 67 HAQLVQHNSPLAVIYLTGHGDVPMAVEALKLGAVDFFQKPADGAKLADAVVKALEHAKAY 126
++ + L V+ ++ A++A + GA D+ KP D +L + +AL K
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125

Query: 127 HQN 129

Sbjct: 126 PSK 128


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0638  GPOSANCHOR    53     5e-09    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 53.1 bits (127), Expect = 5e-09
Identities = 53/316 (16%), Positives = 101/316 (31%), Gaps = 10/316 (3%)

Query: 598 EYAASEQELRIRLSKAEEALQSAQELQAEAESQLVAINGELDKLSRELTFARTAYKNSRD 657
EL LS A+E L+ + +E S++ + L + L A
Sbjct: 82 ALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSA 141

Query: 658 DLRRLFDEKRSEQDKINKALAERKAFAQQRLTQLDGELKQLKHQHQLWLEEQKEQALEAR 717
++ L EK + KA K + + E ++ LE
Sbjct: 142 KIKTLEAEK---AALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKA 198

Query: 718 MEKQAYWQEVIGALDNQLGQIKATIDARRESAKAEQKACETWYKNELKSRGVDEDNILKL 777
+E + A L KA + AR+ + + + + E L
Sbjct: 199 LEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAAL 258

Query: 778 KQQIRELETKISRAEQRRSEVLRFDDWY-----QHTWLLRKPKLQTQLSDVKR-AASEID 831
+ + ELE + A + + Q+Q+ + R +
Sbjct: 259 EARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDL 318

Query: 832 QQLKAKTQEVKTRRQQLETDRKASDAAQIEASENLTKLRAVMRKLAELKLPANNEEAQGS 891
+ ++++ Q+LE K S+A++ +L R ++L E + E+ + S
Sbjct: 319 DASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQL-EAEHQKLEEQNKIS 377

Query: 892 LGERLRQGEDLLLKRD 907
R DL R+
Sbjct: 378 EASRQSLRRDLDASRE 393



Score = 34.3 bits (78), Expect = 0.004
Identities = 49/346 (14%), Positives = 114/346 (32%), Gaps = 27/346 (7%)

Query: 360 WRADMENLSERHKLQTEKHQDIEAAYNARRSKIGEQLNRELEGLHADQDKQREARDKQRE 419
+ + + K+ D+ A + N EL ++ ++ DK
Sbjct: 55 VQERADKFEIENNTLKLKNSDLSFNNKAL-----KDHNDELTEELSNAKEKLRKNDKSLS 109

Query: 420 VARADLDALEAQWRSQMDAGKASFSEQEYQFKLNAAELKLRVDGVTYTEEEKLSLAIFDE 479
+ + LEA + D KA + +A L + + ++
Sbjct: 110 EKASKIQELEA---RKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADL----EK 162

Query: 480 RIHRADEEQESCNAKVERLSSDERKLRAKRDQANEALRIASLRVNERQTALDELHHMLFP 539
+ A + +AK++ L +++ L A++ + +AL A + L
Sbjct: 163 ALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAA 222

Query: 540 QSHTL--LEFLRKEAQGWEQSLGKVIAPELLHRTDLHPSVTGTSDTLFGVHLDLKAIDVP 597
+ LE + A + + I + L +
Sbjct: 223 LAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKA------------LE 270

Query: 598 EYAASEQELRIRLSKAEEALQSAQELQAEAESQLVAINGELDKLSRELTFARTAYKNSRD 657
++ E + + +A+ E Q +N L R+L +R A K
Sbjct: 271 GAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEA 330

Query: 658 DLRRLFDEKRSEQDKINKALAERKAFAQQRLTQLDGELKQLKHQHQ 703
+ ++L ++ + + ++L +++ QL+ E ++L+ Q++
Sbjct: 331 EHQKLEEQNKI-SEASRQSLRRDLDASREAKKQLEAEHQKLEEQNK 375


11  Shewmr4_0693  Shewmr4_0698  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_0693  4       28       -0.948764  hypothetical protein
Shewmr4_0694  4       28       -1.241945  hypothetical protein
Shewmr4_0695  5       31       -1.213936  hypothetical protein
Shewmr4_0696  5       32       -1.159537  diguanylate cyclase/phosphodiesterase
Shewmr4_0697  2       23       -0.636003  hypothetical protein
Shewmr4_0698  2       28       0.270813   GumN family protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0694  BLACTAMASEA   31     0.003    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 31.3 bits (71), Expect = 0.003
Identities = 19/83 (22%), Positives = 35/83 (42%), Gaps = 6/83 (7%)

Query: 1 MRALALSAV-LMVTTMIGMPAVAKEWQENKSWNAHFSEHKTQGVVVLWNENTQQGFTNDL 59
MR + L + L+ T + + A + ++ K + S G++ + + G T
Sbjct: 1 MRYIRLCIISLLATLPLAVHASPQPLEQIKLSESQLSGRV--GMIEM---DLASGRTLTA 55

Query: 60 KRANQAFLPASTFKIPNSLIALD 82
RA++ F STFK+ L
Sbjct: 56 WRADERFPMMSTFKVVLCGAVLA 78


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0698  TCRTETOQM     540    0.0      Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 540 bits (1393), Expect = 0.0
Identities = 180/689 (26%), Positives = 300/689 (43%), Gaps = 70/689 (10%)

Query: 6 KYRNIGIFAHVDAGKTTTTERILKLTGRIHKAGETHDGESTTDFMVQEAERGITIQSAAV 65
K NIG+ AHVDAGKTT TE +L +G I + G G + TD + E +RGITIQ+
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 66 SCFWKDHRFNVIDTPGHVDFTVEVYRSLKVLDGGIAVFCGSGGVEPQSETNWRYANESEV 125
S W++ + N+IDTPGH+DF EVYRSL VLDG I + GV+ Q+ + + +
Sbjct: 62 SFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGI 121

Query: 126 ARIIFVNKLDRMGADFLRVVKQTKDVLAATPLVMVLPIGIEDEFKGVVDLLTRQAYVWDE 185
I F+NK+D+ G D V + K+ L+A ++ ++ +
Sbjct: 122 PTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIK------------------QKVEL--- 160

Query: 186 TGLPENYSIQEIPADMVDLVEEYREQLIETAVEQDDDLMEAYMEGEEPSIEDLKRCIRKG 245
P + + + +T +E +DDL+E YM G+ +L++
Sbjct: 161 --YPNMCVT-----NFTESEQ------WDTVIEGNDDLLEKYMSGKSLEALELEQEESIR 207

Query: 246 TRTMAFFPTFCGSAFKNKGMQLVLDAVVDYLPAPNEVDPQPLTDEEGNETGEYAIVSADE 305
+ FP + GSA N G+ +++ + + + T +E
Sbjct: 208 FHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSS--------THRGQSE----------- 248

Query: 306 SLKALAFKI-MDDRFGALTFVRIYSGRLKKGDTILNSATGKTERIGRMCEMLANDRIEIE 364
L FKI ++ L ++R+YSG L D++ S K +I M + + +I+
Sbjct: 249 -LCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEK-IKITEMYTSINGELCKID 306

Query: 365 SAEAGDIIAIVGMKNVQTGHTLCDVKYPCTLEAMVFPEPVISIAVAPKDKGGSEKMAIAI 424
A +G+I+ + + ++ L D K E + P P++ V P E + A+
Sbjct: 307 KAYSGEIVILQN-EFLKLNSVLGDTKLLPQRERIENPLPLLQTTVEPSKPQQREMLLDAL 365

Query: 425 GKMIAEDPSFRVETDEDSGETILKGMGELHLDIKVDILKRTYGVELIVGEPQVAYRETIT 484
++ DP R D + E IL +G++ +++ +L+ Y VE+ + EP V Y E
Sbjct: 366 LEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIKEPTVIYMERPL 425

Query: 485 QMVEDQYTHKKQSGGSGQFGKIEYIIRPGEPNSGFVFKSSVVGGNVPKEYWPAVEKGFAS 544
+ E YT + + + I + P SG ++SSV G + + + AV +G
Sbjct: 426 KKAE--YTIHIEVPPNPFWASIGLSVSPLPLGSGMQYESSVSLGYLNQSFQNAVMEGIRY 483

Query: 545 MMNTGTIAGFPVLDVEFELTDGAYHAVDSSAIAFEIAAKAAFRQSIAKAKPQLLEPIMKV 604
G + G+ V D + G Y++ S+ F + A Q + KA +LLEP +
Sbjct: 484 GCEQG-LYGWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQVLKKAGTELLEPYLSF 542

Query: 605 DVFSPDDNVGDVIGDLNRRRGMIKDQVAGVTGVRVKADVPLSEMFGYIGTLRTMTSGRGQ 664
+++P + + D + I D V + ++P + Y L T+GR
Sbjct: 543 KIYAPQEYLSRAYTDAPKYCANIVDTQLKNNEVILSGEIPARCIQEYRSDLTFFTNGRSV 602

Query: 665 FSMEFSHYSPC----------PNSVADKV 683
E Y PNS DKV
Sbjct: 603 CLTELKGYHVTTGEPVCQPRRPNSRIDKV 631


12  Shewmr4_0830  Shewmr4_0854  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_0830  2       15       -2.871834  hypothetical protein
Shewmr4_0831  1       16       -3.903452  transposase IS3/IS911 family protein
Shewmr4_0832  3       25       -7.558636  OmpA/MotB domain-containing protein
Shewmr4_0833  2       15       -5.115026  hypothetical protein
Shewmr4_0834  2       16       -5.210416  molybdate ABC transporter periplasmic
Shewmr4_0835  1       12       -1.930415  hypothetical protein
Shewmr4_0836  1       12       -0.571943  magnesium transporter
Shewmr4_0837  1       14       0.151021   HPr family phosphocarrier protein
Shewmr4_0838  -1      22       4.665242   hypothetical protein
Shewmr4_0839  2       26       5.095013   PTS IIA-like nitrogen-regulatory protein PtsN
Shewmr4_0840  2       24       4.187099   sigma 54 modulation protein / SSU ribosomal
Shewmr4_0841  2       23       4.193973   RNA polymerase factor sigma-54
Shewmr4_0842  1       20       3.724115   ABC transporter-like protein
Shewmr4_0843  1       21       3.345859   OstA family protein
Shewmr4_0844  2       21       3.147496   hypothetical protein
Shewmr4_0845  2       20       2.444439   hypothetical protein
Shewmr4_0846  2       20       1.808995   3-deoxy-D-manno-octulosonate 8-phosphate
Shewmr4_0847  0       21       0.547858   KpsF/GutQ family protein
Shewmr4_0848  0       23       -0.578083  ABC transporter-like protein
Shewmr4_0849  2       26       -1.150972  hypothetical protein
Shewmr4_0850  5       28       -1.711029  hypothetical protein
Shewmr4_0851  6       28       -1.680536  hypothetical protein
Shewmr4_0852  5       26       -1.852439  toluene tolerance family protein
Shewmr4_0853  3       27       -1.160604  hypothetical protein
Shewmr4_0854  2       22       -0.482710  SpoIIAA family protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0836  CHANLCOLICIN  29     0.006    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 28.9 bits (64), Expect = 0.006
Identities = 18/72 (25%), Positives = 32/72 (44%), Gaps = 5/72 (6%)

Query: 35 QEINEIALQQQQREKARQQAVKERQLAEYQQQQIAIQQAAEQRRIAQQNEAARIRKAEAW 94
Q + +I + + +R + E A A+Q E+ R+A+ E AR ++AEA
Sbjct: 92 QRLKDIVNEALRHNASRTPSATELAHANNA----AMQAEDERLRLAKAEEKAR-KEAEAA 146

Query: 95 RKYYIVPEDCKN 106
K + E +
Sbjct: 147 EKAFQEAEQRRK 158


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0838  BCTERIALGSPD  32     0.022    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 31.8 bits (72), Expect = 0.022
Identities = 14/71 (19%), Positives = 31/71 (43%), Gaps = 5/71 (7%)

Query: 354 SGLEPLTIDAQTLFVNVGERTN---VTGSAKFLKLIKEGKFEQALDVAREQVESGAQIID 410
+P+ + + + +TN VT + + ++ + LD+ R QV A I +
Sbjct: 298 QAAKPVAALDKNIIIKAHGQTNALIVTAAPDVMNDLE--RVIAQLDIRRPQVLVEAIIAE 355

Query: 411 INMDEGMLDGV 421
+ +G+ G+
Sbjct: 356 VQDADGLNLGI 366


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0840  PF05272       30     0.016    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.0 bits (67), Expect = 0.016
Identities = 10/58 (17%), Positives = 22/58 (37%), Gaps = 4/58 (6%)

Query: 14 ATSANMALQVSQLSWAIEGKTILSEISFALPKG----EMLGLIGPNGAGKSSLLRCLY 67
T + + + + ++ ++ + G + L G G GKS+L+ L
Sbjct: 560 KTPDDYKPRRLRYLQLVGKYILMGHVARVMEPGCKFDYSVVLEGTGGIGKSTLINTLV 617


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0848  FERRIBNDNGPP  39     1e-05    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 39.2 bits (91), Expect = 1e-05
Identities = 46/196 (23%), Positives = 74/196 (37%), Gaps = 19/196 (9%)

Query: 4 RRFI-ALGLSLALLPI---AAMAEPAKRIIALSPHAVEMLYAIGAGESIVAATDYADY-- 57
RR + A+ LS L + A A RI+AL VE+L A+G D +Y
Sbjct: 10 RRLLTAMALSPLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGI--VPYGVADTINYRL 67

Query: 58 ----PEAAKKIPSIGGYYGIQIERVLELNPDLIVVWDTGNKA--EDINQL-KSLGFKLYS 110
P + +G +E + E+ P + VW G E + ++ GF +S
Sbjct: 68 WVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFM-VWSAGYGPSPEMLARIAPGRGFN-FS 125

Query: 111 SSPKTLEDVAKEIEELGALTGRTEQASQVAADYRNQLLQLRSENAAKSE-PKVFYQLWST 169
+ L K + E+ L A A Y + + ++ + P + L
Sbjct: 126 DGKQPLAMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDP 185

Query: 170 PLMTV-AKNSWIQQII 184
M V NS Q+I+
Sbjct: 186 RHMLVFGPNSLFQEIL 201


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0852  BCTERIALGSPG  42     9e-08    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 41.8 bits (98), Expect = 9e-08
Identities = 18/55 (32%), Positives = 29/55 (52%), Gaps = 3/55 (5%)

Query: 3 LSKIKVNTGFTLIELMIAIAIVGILASIALPSYQEHVRNTRRTDARD---ALSNA 54
+ GFTL+E+M+ I I+G+LAS+ +P+ + + A AL NA
Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENA 55


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0853  BCTERIALGSPG  38     2e-06    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 38.3 bits (89), Expect = 2e-06
Identities = 17/51 (33%), Positives = 30/51 (58%)

Query: 3 TKKILGFTLTELMVVVAIVAIIAGIAAPSFASMIRENTARTQVNELLALTN 53
T K GFTL E+MVV+ I+ ++A + P+ + + V++++AL N
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALEN 54


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0854  BCTERIALGSPG  30     0.002    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 29.9 bits (67), Expect = 0.002
Identities = 13/23 (56%), Positives = 17/23 (73%), Gaps = 2/23 (8%)

Query: 4 RKQKGFSLIEIMVTSFIVAFGIL 26
KQ+GF+L+EIMV IV G+L
Sbjct: 5 DKQRGFTLLEIMVV--IVIIGVL 25


13  Shewmr4_0925  Shewmr4_0940  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_0925  2       15       1.192053   glucose-methanol-choline oxidoreductase
Shewmr4_0926  2       15       1.109134   glutathione-dependent formaldehyde-activating,
Shewmr4_0927  2       15       0.818744   hypothetical protein
Shewmr4_0928  2       12       1.419385   ABC transporter-like protein
Shewmr4_0929  2       13       0.933194   hypothetical protein
Shewmr4_0930  0       14       -0.251215  chromosome segregation ATPase
Shewmr4_0931  -1      14       -0.683565  pirin domain-containing protein
Shewmr4_0932  -1      16       -0.510941  ATP-dependent RNA helicase DbpA
Shewmr4_0933  0       17       -0.764666  glyoxalase/bleomycin resistance
Shewmr4_0934  1       22       -1.672650  aldo/keto reductase
Shewmr4_0935  2       25       -1.805917  Na(+)-translocating NADH-quinone reductase
Shewmr4_0936  4       42       -2.133650  Na(+)-translocating NADH-quinone reductase
Shewmr4_0937  3       35       -2.270008  Na(+)-translocating NADH-quinone reductase
Shewmr4_0938  2       31       -3.061773  Na(+)-translocating NADH-quinone reductase
Shewmr4_0939  2       25       -3.001729  Na(+)-translocating NADH-quinone reductase
Shewmr4_0940  2       20       -2.837234  Na(+)-translocating NADH-quinone reductase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0934  LUXSPROTEIN   272    3e-97    Bacterial autoinducer-2 (AI-2) production protein Lu...
		>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS signature.

Length = 171

Score = 272 bits (697), Expect = 3e-97
Identities = 131/168 (77%), Positives = 150/168 (89%)

Query: 2 PLLDSFTVDHTRMNAPAVRVAKHMSTPKGDAITVFDLRFCAPNKDILSERGIHTLEHLFA 61
PLLDSFTVDHTRMNAPAVRVAK M TPKGD ITVFDLRF APNKDILSE+GIHTLEHL+A
Sbjct: 1 PLLDSFTVDHTRMNAPAVRVAKTMQTPKGDTITVFDLRFTAPNKDILSEKGIHTLEHLYA 60

Query: 62 GFMRDHLNGSDVEIIDISPMGCRTGFYMSLIGEPSERQVADAWLASMEDVLKVVEQSEIP 121
GFMR+HLNG VEIIDISPMGCRTGFYMSLIG PSE+QVADAW+A+MEDVLKV Q++IP
Sbjct: 61 GFMRNHLNGDSVEIIDISPMGCRTGFYMSLIGTPSEQQVADAWIAAMEDVLKVENQNKIP 120

Query: 122 ELNEYQCGTYQMHSLEQAQDIARNIIAAGVSVNRNDDLKLSDEILGKL 169
ELNEYQCGT MHSL++A+ IA+NI+ GV+VN+ND+L L + +L +L
Sbjct: 121 ELNEYQCGTAAMHSLDEAKQIAKNILEVGVAVNKNDELALPESMLREL 168


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0935  ECOLIPORIN    30     0.031    E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 29.9 bits (67), Expect = 0.031
Identities = 55/244 (22%), Positives = 87/244 (35%), Gaps = 48/244 (19%)

Query: 417 EHVSINREVKQAFAGLDEADFSDSDWMPQLGVLYDAGDWRFSTDIRRAWTAASAGNT--- 473
E N + AFAGL D+ D+ GVLYD W TD+ + S
Sbjct: 87 EGEGANSWTRLAFAGLKFGDYGSFDYGRNYGVLYDVEGW---TDMLPEFGGDSYTYADNY 143

Query: 474 -TQEAQVSLHYQVSAQYAREGIKADLRAYVQ-----EFDNLHVDCDSYSMCADERLLTQE 527
T A Y+ + + G+ L +Q E + + + + +
Sbjct: 144 MTGRANGVATYRNTDFF---GLVDGLNFALQYQGKNESQSADDVNIGTNNRNNGDDIRYD 200

Query: 528 NIPDVLTYGVELGLGYRWDLGGVELPLGLNYQYLSAEYQTSTCTDVQ----GCVLEGDRL 583
N G G+ +D+G +G + A Y TS T+ Q G + GD+
Sbjct: 201 N-------GDGFGISTTYDIG-----MGFS---AGAAYTTSDRTNEQVNAGGTIAGGDKA 245

Query: 584 -AWLPEHQLQLSAGIKYAQYRLNLEAAYQSERDFSQFGSEQERISGQW-----RVDLAAN 637
AW +AG+KY + L Y R+ + +G + G ++ A
Sbjct: 246 DAW--------TAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVTAQ 297

Query: 638 YDFD 641
Y FD
Sbjct: 298 YQFD 301


14  Shewmr4_1020  Shewmr4_1036  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_1020  2       19       -0.891191  hypothetical protein
Shewmr4_1021  4       29       -0.549787  hypothetical protein
Shewmr4_1022  5       32       -0.901657  hypothetical protein
Shewmr4_1023  6       32       -0.368523  OsmC family protein
Shewmr4_1024  6       33       -0.151675  MarR family transcriptional regulator
Shewmr4_1025  4       26       -0.501103  FAD dependent oxidoreductase
Shewmr4_1026  5       31       -0.663694  23S rRNA methyluridine methyltransferase
Shewmr4_1027  3       24       -1.471502  periplasmic sensor signal transduction histidine
Shewmr4_1028  3       23       -1.006386  hypothetical protein
Shewmr4_1029  2       22       -1.785292  two component transcriptional regulator
Shewmr4_1030  2       27       -1.993910  hypothetical protein
Shewmr4_1031  4       40       -2.105277  hypothetical protein
Shewmr4_1032  3       34       -2.141727  MltA-interacting MipA family protein
Shewmr4_1033  3       33       -1.422445  hypothetical protein
Shewmr4_1034  2       32       -1.373845  cell wall anchor domain-containing protein
Shewmr4_1035  3       38       -1.143349  hypothetical protein
Shewmr4_1036  2       24       -0.410172  chromate transporter
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1022  adhesinb      31     0.003    Adhesin B signature.
		>adhesinb#Adhesin B signature.

Length = 310

Score = 31.4 bits (71), Expect = 0.003
Identities = 18/95 (18%), Positives = 34/95 (35%), Gaps = 16/95 (16%)

Query: 142 REARRTFEVIAEELDIVIQKNGTMAFDNAIIAY----EPLWAVGTGKSATPEQAQEVHAF 197
+EA+ F I E +++ G F AY +W + T + TP+Q + +
Sbjct: 186 KEAKEKFNNIPGEKKMIVTSEG--CFKYFSKAYNVPSAYIWEINTEEEGTPDQIKTLVEK 243

Query: 198 IRKRLSEVSPFIGENIRILYGGSVTPSNAADLFAQ 232
+RK + L+ S ++
Sbjct: 244 LRKT----------KVPSLFVESSVDDRPMKTVSK 268


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1023  SECGEXPORT    121    2e-39    Protein-export SecG membrane protein signature.
		>SECGEXPORT#Protein-export SecG membrane protein signature.

Length = 110

Score = 121 bits (305), Expect = 2e-39
Identities = 63/110 (57%), Positives = 82/110 (74%)

Query: 1 MYEVLVVVYLLVALGLIGLILIQQGKGADMGASFGAGASGTLFGSSGSGNFLTRTTAILA 60
MYE L+VV+L+VA+GL+GLI++QQGKGADMGASFGAGAS TLFGSSGSGNF+TR TA+LA
Sbjct: 1 MYEALLVVFLIVAIGLVGLIMLQQGKGADMGASFGAGASATLFGSSGSGNFMTRMTALLA 60

Query: 61 IAFFTLSLLIGNLSANHAKNEDTWKNLGVDEQVTQPVDQATEKSETKIPD 110
FF +SL++GN+++N W+NL + Q A K + IP+
Sbjct: 61 TLFFIISLVLGNINSNKTNKGSEWENLSAPAKTEQTQPAAPAKPTSDIPN 110


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1026  TCRTETOQM     69     4e-14    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 69.1 bits (169), Expect = 4e-14
Identities = 38/133 (28%), Positives = 57/133 (42%), Gaps = 18/133 (13%)

Query: 392 IMGHVDHGKTSLLDYIRRAKVAAGEAG------------------GITQHIGAYHVETDN 433
++ HVD GKT+L + + A E G GIT G + +N
Sbjct: 8 VLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQWEN 67

Query: 434 GMITFLDTPGHAAFTAMRARGAKATDIVVLVVAADDGVMPQTIEAIQHAKAGNVPLIVAV 493
+ +DTPGH F A R D +L+++A DGV QT + +P I +
Sbjct: 68 TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPTIFFI 127

Query: 494 NKMDKPEADIDRV 506
NK+D+ D+ V
Sbjct: 128 NKIDQNGIDLSTV 140


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1032  SYCDCHAPRONE  29     0.014    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 29.1 bits (65), Expect = 0.014
Identities = 11/75 (14%), Positives = 20/75 (26%), Gaps = 3/75 (4%)

Query: 69 NEQRARFHYDRGVIYDSVGLRLMARIDFMQALKLQPDLADAYNFLGIYYTQEGEYDSAYE 128
+ +RF G ++G +A + + Q+GE A
Sbjct: 66 DHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAES 125

Query: 129 AFDGVLEL---SPNY 140
EL +
Sbjct: 126 GLFLAQELIADKTEF 140


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1033  TCRTETOQM     198    3e-58    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 198 bits (504), Expect = 3e-58
Identities = 107/461 (23%), Positives = 210/461 (45%), Gaps = 47/461 (10%)

Query: 10 KRRTFAIISHPDAGKTTITEKVLLFGNALQKAGTV-KGKKSGQHAKSDWMEMEKDRGISI 68
K +++H DAGKTT+TE +L A+ + G+V KG ++D +E+ RGI+I
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGT-----TRTDNTLLERQRGITI 56

Query: 69 TTSVMQFPYGGALVNLLDTPGHEDFSEDTYRTLTAVDSCLMVIDSAKGVEDRTIKLMEVT 128
T + F + VN++DTPGH DF + YR+L+ +D +++I + GV+ +T L
Sbjct: 57 QTGITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHAL 116

Query: 129 RLRDTPIVTFMNKLDRDIRDPIELMDEVEDVLNIACAPITWPIGSGKEFKGVYHILRDEV 188
R P + F+NK+D++ D + ++++ L+ +++ +V
Sbjct: 117 RKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEI------------------VIKQKV 158

Query: 189 VLYQSGMGHTIQERRVIEGIDNPELDKAIGSYAADLR-DEMELVRGASNEFDHQAFLKGE 247
LY + E + + D + Y + + +EL + S F
Sbjct: 159 ELYPNMCVTNFTESEQWDTVIEGN-DDLLEKYMSGKSLEALELEQEES-----IRFHNCS 212

Query: 248 LTPVFFGTALGNFGVDHILDGIVEWAPKPLPRESDARMIMPDEEKFTGFVFKIQANMDPK 307
L PV+ G+A N G+D++++ I R + + G VFKI+ K
Sbjct: 213 LFPVYHGSAKNNIGIDNLIEVITNKFYSSTHR---------GQSELCGKVFKIE--YSEK 261

Query: 308 HRDRVAFMRVCSGRYEQGMKMHHVRIGKDVNVSDALTFMAGDRERAEVAYPGDIIGLHNH 367
R R+A++R+ SG + K + +++ T + G+ + + AY G+I+ L N
Sbjct: 262 -RQRLAYIRLYSGVLHLRDSVRISEKEK-IKITEMYTSINGELCKIDKAYSGEIVILQNE 319

Query: 368 GTIRIGDTFTQGEKFRFTGVPNFAPEMFR-RIRLRDPLKQKQLLKGLVQLSEEG-AVQVF 425
+++ + + + + P +++ LL L+++S+ ++ +
Sbjct: 320 F-LKLNSVLGDTKLLPQRERIENPLPLLQTTVEPSKPQQREMLLDALLEISDSDPLLRYY 378

Query: 426 RPLDTNDLIVGAVGVLQFEVVVGRLKSEYNVEAIYEGISVS 466
T+++I+ +G +Q EV L+ +Y+VE + +V
Sbjct: 379 VDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIKEPTVI 419


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1036  CHANNELTSX    82     3e-20    Nucleoside-specific channel-forming protein Tsx signa...
		>CHANNELTSX#Nucleoside-specific channel-forming protein Tsx signature.

Length = 294

Score = 81.6 bits (201), Expect = 3e-20
Identities = 77/240 (32%), Positives = 109/240 (45%), Gaps = 23/240 (9%)

Query: 60 YLEMEFGGRSGIFDLYGYVDVFNLANESSKDGDKNPGSGTSKLFMKFAPRVSIDALTGKD 119
YLE E + FD YGY+D +S K + S LFM+ PR SID LT D
Sbjct: 58 YLEYEAFAKKDWFDFYGYIDAPVFFGGNSTA--KGIWNKGSPLFMEIEPRFSIDKLTNTD 115

Query: 120 LSFGPIQEVYFSTLFNWD-GLNGEGVNSTFW-GVGADVNVPWLGKTGMNLYGYYD----- 172
LSFGP +E YF+ + +D G N ST++ G+G D++ +N+Y Y
Sbjct: 116 LSFGPFKEWYFANNYIYDMGRNDSQEQSTWYMGLGTDIDTGLPMSLSLNVYAKYQWQNYG 175

Query: 173 -MNAKEWNGYQFSANWFKPFYFFDNKSFLSFQGYIDYQFGAD------EDKTAFVPKTSN 225
N EW+GY+F +F P S LS+ G+ ++ +G+D D +TSN
Sbjct: 176 ASNENEWDGYRFKVKYFVPLTDLWGGS-LSYIGFTNFDWGSDLGDDNFYDLNGKHARTSN 234

Query: 226 --GGNIFFGL---YWHSDRYALGYGLKG-FKDVYLLEDGAGALALESTGWSHYLSATYKF 279
+ L +WH A + G + D L G G ++ STGW Y Y F
Sbjct: 235 SIASSHILALNYAHWHYSIVARYFHNGGQWADDAKLNFGDGPFSVRSTGWGGYFVVGYNF 294


15  Shewmr4_1161  Shewmr4_1175  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_1161  3       17       0.132736   hypothetical protein
Shewmr4_1162  3       17       0.296184   Na+/H+ antiporter NhaC
Shewmr4_1163  2       17       0.079829   hypothetical protein
Shewmr4_1164  2       18       0.428915   hypothetical protein
Shewmr4_1165  0       17       0.437805   FMN-binding domain-containing protein
Shewmr4_1166  -1      17       -0.496393  hypothetical protein
Shewmr4_1167  -1      16       -0.306176  ApbE family protein
Shewmr4_1168  -1      17       -0.937805  hypothetical protein
Shewmr4_1169  -1      16       -1.085007  methyl-accepting chemotaxis sensory transducer
Shewmr4_1170  -1      15       -1.399315  hypothetical protein
Shewmr4_1171  -1      20       -2.909831  hypothetical protein
Shewmr4_1172  0       22       -3.353313  AraC family transcriptional regulator
Shewmr4_1173  -2      20       -3.491929  GreA/GreB family elongation factor
Shewmr4_1174  -1      21       -4.705921  O-acetylhomoserine/O-acetylserine sulfhydrylase
Shewmr4_1175  0       18       -3.103224  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1167  DHBDHDRGNASE  52     4e-10    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 52.4 bits (125), Expect = 4e-10
Identities = 41/189 (21%), Positives = 75/189 (39%), Gaps = 10/189 (5%)

Query: 12 VLITGASSGIGLQLAKDYLAAGWHVIACGRDKAKLDALAETVLIGA---TCISFDINERS 68
ITGA+ GIG +A+ + G H+ A + KL+ + ++ A D+ + +
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSA 70

Query: 69 QVQENALRIKDLLAQCACQLDLVVLNAGGCEYIDDAKHFDDRLFERVVHTNLIAMGYCLG 128
+ E RI+ + +D++V N G D +E N +
Sbjct: 71 AIDEITARIEREMGP----IDILV-NVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 129 AFLPLMP--RGARLALMSSSATYLAFPRAEAYGASKAGVQYLAASLRLDLAQHGISVSVI 186
+ M R + + S+ + AY +SKA L L+LA++ I +++
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 187 CPGFVATPL 195
PG T +
Sbjct: 186 SPGSTETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1174  MYCMG045      25     0.035    Hypothetical mycoplasma lipoprotein (MG045) signature.
		>MYCMG045#Hypothetical mycoplasma lipoprotein (MG045) signature.

Length = 483

Score = 24.7 bits (53), Expect = 0.035
Identities = 8/30 (26%), Positives = 15/30 (50%)

Query: 1 MAKSMKNYWMLMLVFVSLSLTACSNIDFQF 30
M K +K + + V +S L++C + F
Sbjct: 1 MKKQLKYCFFSLFVSLSSILSSCGSTTFVL 30


16  Shewmr4_1224  Shewmr4_1239  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_1224  2       14       0.514942   methyl-accepting chemotaxis sensory transducer
Shewmr4_1225  1       14       0.252154   endonuclease/exonuclease/phosphatase
Shewmr4_1226  1       14       0.684372   hypothetical protein
Shewmr4_1227  3       18       1.511849   hypothetical protein
Shewmr4_1228  2       15       1.469479   putative lipoprotein
Shewmr4_1229  3       18       1.387697   hypothetical protein
Shewmr4_1230  2       18       1.399974   SAM-dependent methyltransferase
Shewmr4_1231  1       17       1.624110   ribose-5-phosphate isomerase A
Shewmr4_1232  2       15       1.674174   hypothetical protein
Shewmr4_1233  1       18       1.894419   3'-5' exonuclease
Shewmr4_1234  2       23       1.212723   hypothetical protein
Shewmr4_1235  1       23       0.894764   *hypothetical protein
Shewmr4_1236  1       23       0.911113   hypothetical protein
Shewmr4_1237  1       24       0.805518   TonB-dependent siderophore receptor
Shewmr4_1238  2       20       0.543461   hypothetical protein
Shewmr4_1239  2       14       -1.428281  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1226  SYCDCHAPRONE  32     0.001    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 32.2 bits (73), Expect = 0.001
Identities = 29/156 (18%), Positives = 57/156 (36%), Gaps = 11/156 (7%)

Query: 76 PELEDVHVAMAYYYQTVGDLVRTEQAYQDAINTKDASGDSMNNFGVFLCQQKQYDKAEKM 135
+ ++ +AM + + G + + + + + Q +Y+ A K+
Sbjct: 6 TDTQEYQLAMESFLKGGGTI-------AMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKV 58

Query: 136 FLAAIEMPKYTRTASSYEN-LGICSRDAGQTEKARQYFQMALKYDPRRSVSLLELAELGL 194
F A + Y S + LG C + GQ + A + D + AE L
Sbjct: 59 FQALCVLDHYD---SRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLL 115

Query: 195 DKGDYVDAQNQLARYHQVAAQTPESLTLGIKIEQAL 230
KG+ +A++ L ++ A E L ++ L
Sbjct: 116 QKGELAEAESGLFLAQELIADKTEFKELSTRVSSML 151


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1232  TCRTETOQM     33     0.005    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 32.5 bits (74), Expect = 0.005
Identities = 37/159 (23%), Positives = 67/159 (42%), Gaps = 35/159 (22%)

Query: 199 IKLAIIGKPNVGKSTLTNRIL----GEERVVVYDEPGTTRDSIYIPMER----------- 243
I + ++ + GK+TLT +L + D+ T D+ + +R
Sbjct: 4 INIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSF 63

Query: 244 --DGREYVIIDTAGVRRRSKVHEVIEKFSVIKTLKAVEDANVVLLIIDAREGIAEQDLGL 301
+ + IIDT G + + E V ++L ++ A +L+I A++G+ Q L
Sbjct: 64 QWENTKVNIIDTPG-----HMDFLAE---VYRSLSVLDGA---ILLISAKDGVQAQTRIL 112

Query: 302 LGFALNAGRALVIAVNKWD--GID-----QGIKDRVKSE 333
G + +NK D GID Q IK+++ +E
Sbjct: 113 FHALRKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAE 151


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1238  OMS28PORIN    31     0.019    OMS28 porin signature.
		>OMS28PORIN#OMS28 porin signature.

Length = 257

Score = 31.3 bits (70), Expect = 0.019
Identities = 32/138 (23%), Positives = 61/138 (44%), Gaps = 11/138 (7%)

Query: 138 VLDDFAKADVLFKRTEPAPFKSVNVLAEGRRAL------EVANVEMGLALAEDEIDYLVE 191
++ D AK V+ + K ++AEG + V + +++A E +L+E
Sbjct: 102 LMSDVAKGTVVASQEATIVAKCSGMVAEGANKVVEMSKKAVQETQKAVSVA-GEATFLIE 160

Query: 192 NFVRLNRNPNDIELMMFAQ--ANSEHCRHKIFNADWTIDGEAQ-PKSLFKMIKNTFETTP 248
+ LN++PN+ EL + + A E + + ++ +D Q + + M+ +
Sbjct: 161 KQIMLNKSPNNKELELTKEEFAKVEQVKETLMASERALDETVQEAQKVLNMVNGLNPSNK 220

Query: 249 DHVLSAYKDNAAVMEGSV 266
D VL A KD A + V
Sbjct: 221 DQVL-AKKDVAKAISNVV 237


17  Shewmr4_1252  Shewmr4_1274  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_1252  -1      21       -4.030115  rod shape-determining protein RodA
Shewmr4_1253  0       19       -4.520736  peptidoglycan glycosyltransferase
Shewmr4_1254  1       20       -4.888045  rRNA large subunit methyltransferase
Shewmr4_1255  0       22       -4.750875  iojap-like protein
Shewmr4_1256  -1      21       -4.168132  nicotinate-nucleotide adenylyltransferase
Shewmr4_1257  1       20       -2.917288  DNA polymerase III subunit delta
Shewmr4_1258  2       19       -1.585171  rare lipoprotein B
Shewmr4_1259  4       18       -1.047122  hypothetical protein
Shewmr4_1260  3       17       -0.625541  leucyl-tRNA synthetase
Shewmr4_1261  3       17       -0.133963  hypothetical protein
Shewmr4_1262  1       17       0.252630   methylation site containing protein
Shewmr4_1263  -1      12       1.011313   apolipoprotein N-acyltransferase
Shewmr4_1264  -2      13       -0.235995  hypothetical protein
Shewmr4_1265  -1      15       -1.247282  CBS domain-containing protein
Shewmr4_1266  -1      16       -1.516981  putative metalloprotease
Shewmr4_1267  1       22       -2.512969  PhoH family protein
Shewmr4_1268  2       25       -3.352695  (dimethylallyl)adenosine tRNA
Shewmr4_1269  2       35       -5.622222  hypothetical protein
Shewmr4_1270  1       40       -6.094758  hypothetical protein
Shewmr4_1272  1       37       -6.471541  2-octaprenyl-3-methyl-6-methoxy-1,4-benzoquinol
Shewmr4_1273  1       31       -5.328480  peptidyl-tRNA hydrolase
Shewmr4_1274  0       22       -3.851950  GTP-dependent nucleic acid-binding protein EngD
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1257  HTHFIS        63     4e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 62.5 bits (152), Expect = 4e-13
Identities = 24/128 (18%), Positives = 52/128 (40%), Gaps = 12/128 (9%)

Query: 180 HIMVIDDSAVARKQIIRALESLNLQIDTAKDGREALDKLKTIAAEMNNVAEEIPLIISDI 239
I+V DD A R + +AL + + + A + L+++D+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA---------AGDGDLVVTDV 55

Query: 240 EMPEMDGYTLTAEIRDDPKLKHIKVVLHTSLSGVFNQAMVQKVGANDFIAK-FNPDELAA 298
MP+ + + L I+ + V++ ++ + + GA D++ K F+ EL
Sbjct: 56 VMPDENAFDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIG 113

Query: 299 AVNKHLSL 306
+ + L+
Sbjct: 114 IIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1260  FLGHOOKAP1    33     3e-04    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 33.0 bits (75), Expect = 3e-04
Identities = 9/38 (23%), Positives = 18/38 (47%)

Query: 99 NVNVMEEMADMISASRSYQMNVQVAEAAKSMLQQTLGM 136
VN+ EE ++ + Y N QV + A ++ + +
Sbjct: 508 GVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545



Score = 29.9 bits (67), Expect = 0.003
Identities = 16/67 (23%), Positives = 29/67 (43%), Gaps = 6/67 (8%)

Query: 5 SIFDVAGSGMSAQSVRLNTTASNIANADSVSSSIDKTYRSRHPIFEAEMAKAQSQQQASQ 64
S+ + A SG++A LNT ++NI++ + Y + I + +
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYN------VAGYTRQTTIMAQANSTLGAGGWVGN 55

Query: 65 GVAVKGI 71
GV V G+
Sbjct: 56 GVYVSGV 62


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1262  FLGHOOKAP1    40     1e-05    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 40.3 bits (94), Expect = 1e-05
Identities = 15/35 (42%), Positives = 22/35 (62%)

Query: 2 SFNIALSGISAAQKDLNTTANNIANANTIGFKESR 36
N A+SG++AAQ LNT +NNI++ N G+
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQT 37



Score = 37.6 bits (87), Expect = 9e-05
Identities = 12/49 (24%), Positives = 25/49 (51%)

Query: 405 SISSSALEQSNIDLTTELVDLISAQRNFQANSRTLEVNNTLQQTVLQIR 453
+S+ S ++L E +L Q+ + AN++ L+ N + ++ IR
Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1264  FLGHOOKAP1    45     1e-07    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 45.3 bits (107), Expect = 1e-07
Identities = 19/119 (15%), Positives = 41/119 (34%), Gaps = 4/119 (3%)

Query: 145 DDATSITVSAEGEVSVKTPGTAENQVVGQLSMSDFINPSGLDPMGQNLYTETG---ASGT 201
D I +++E + + + Q + + +L ++ G A+
Sbjct: 427 TDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGNKTATLK 486

Query: 202 PIQGTASLDGMGAIRQGALETSNVNVTEELVNLIESQRIYEMNSKVISAVDQMLAYVNQ 260
T + + S VN+ EE NL Q+ Y N++V+ + + +
Sbjct: 487 TSSATQGNV-VTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544



Score = 35.7 bits (82), Expect = 2e-04
Identities = 9/36 (25%), Positives = 20/36 (55%)

Query: 5 LWISKTGLDAQQTDIAVISNNVANASTVGYKKSRAV 40
+ + +GL+A Q + SNN+++ + GY + +
Sbjct: 4 INNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTI 39


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1265  FLGLRINGFLGH  145    1e-45    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 145 bits (368), Expect = 1e-45
Identities = 76/220 (34%), Positives = 108/220 (49%), Gaps = 19/220 (8%)

Query: 11 LLLTACSSTSKKPIADDPFYAPVYPEAPPTKIAATGSIYQDSQAA-----SLYSDIRAHK 65
L LT C+ P+ A P P A GSI+Q +Q L+ D R
Sbjct: 17 LSLTGCAWIPSTPLVQGATSAQPVPGPTP---VANGSIFQSAQPINYGYQPLFEDRRPRN 73

Query: 66 VGDIITIVLKEATQAKKSAGNQIKKGSDLSLDPIYAGGSNVS------IGGVPLDLRYKD 119
+GD +TIVL+E A KS+ + + G V G D+
Sbjct: 74 IGDTLTIVLQENVSASKSSSANASRDGKTNF-----GFDTVPRYLQGLFGNARADVEASG 128

Query: 120 SMNTKRESDADQSNSLDGSISANVMQVLNNGNLVVRGEKWISINNGDEFIRVTGIVRSQD 179
+ A+ SN+ G+++ V QVL NGNL V GEK I+IN G EFIR +G+V +
Sbjct: 129 GNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRFSGVVNPRT 188

Query: 180 IKPDNTIDSTRMANARIQYSGTGTFADAQKVGWLSQFFMS 219
I NT+ ST++A+ARI+Y G G +AQ +GWL +FF++
Sbjct: 189 ISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLN 228


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1266  FLGPRINGFLGI  378    e-133    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 378 bits (973), Expect = e-133
Identities = 161/367 (43%), Positives = 223/367 (60%), Gaps = 14/367 (3%)

Query: 5 LILAVAMLAFSLPSQAE--RIKDIANVQGVRSNQLIGYGLVVGLPGTGEKTR---YTEQT 59
L+ + + P+QA+ RIKDIA++Q R NQLIGYGLVVGL GTG+ R +TEQ+
Sbjct: 11 LVFSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQS 70

Query: 60 FTTMLKNFGINLPDNFRPKIKNVAVVAVHADMPAFIKPGQELDVTVSSLGEAKSLRGGTL 119
ML+N GI + KN+A V V A++P F PG +DVTVSSLG+A SLRGG L
Sbjct: 71 MRAMLQNLGITTQGG-QSNAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNL 129

Query: 120 LQTFLKGVDGNVYAIAQGSLVVSGFSADGLDGSKVIQNTPTVGRIPNGAIVERSVATPFS 179
+ T L G DG +YA+AQG+L+V+GFSA G D + + Q T R+PNGAI+ER + + F
Sbjct: 130 IMTSLSGADGQIYAVAQGALIVNGFSAQG-DAATLTQGVTTSARVPNGAIIERELPSKFK 188

Query: 180 TGDYLTFNLRRSDFSTAQRMADAINDL----LGPDMARPLDATSVQVSAPRDVSQRVSFL 235
L LR DFSTA R+AD +N G +A P D+ + V PR V+ +
Sbjct: 189 DSVNLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPR-VADLTRLM 247

Query: 236 ATLENIEVEPADESAKVIVNSRTGTIVVGQNVKLLPAAVTHGGLTVTIAEATQVSQPNAL 295
A +EN+ VE D AKV++N RTGTIV+G +V++ AV++G LTV + E+ QV QP
Sbjct: 248 AEIENLTVET-DTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQPAPF 306

Query: 296 ANGQTTVTSNSTINASESNRRMFMFNPGTTLDELVRAVNLVGAAPSDVLAILEALKVAGA 355
+ GQT V + I A + ++ + G L LV +N +G ++AIL+ +K AGA
Sbjct: 307 SRGQTAVQPQTDIMAMQEGSKVAIVE-GPDLRTLVAGLNSIGLKADGIIAILQGIKSAGA 365

Query: 356 LHGELII 362
L EL++
Sbjct: 366 LQAELVL 372


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Shewmr4_1267 | FLGFLGJ | 180 | 6e-56 | Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 180 bits (457), Expect = 6e-56
Identities = 108/362 (29%), Positives = 158/362 (43%), Gaps = 79/362 (21%)

Query: 12 DLGGLDSLRAQAQKDEKGALKKVAQQFEGVFVQMLMKSMRDANAVFESDSPLNSQYTKFY 71
D L+ L+A+A +D ++ VA+Q EG+FVQM++KSMRDA D +S++T+ Y
Sbjct: 14 DAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALP---KDGLFSSEHTRLY 70

Query: 72 EQMRDQQLSVDLSDKGVLGLADMMVQQLSPESSQFTPASVLRNDGGMKLQHDAKAFNIPA 131
M QQ++ Q T G+ L P
Sbjct: 71 TS--------------------MYDQQIA---QQMTAG------KGLGLAEMMVKQMTPE 101

Query: 132 QATSAAETQTAAAPVVAAQGVPASIARPSANVDNGDGVTSSLDIDRPERLLAIDTPKPAW 191
Q E T AAP+ + ++ +
Sbjct: 102 Q--PLPEESTPAAPM--------KFPLETVVRYQNQALSQLV------------------ 133

Query: 192 SEQPLSPIEPVISGQILPTAAFRETQKTLKFGSREEFLATLYPHAEKAAKALGTQPEVLL 251
Q P S G + FLA L A+ A++ G ++L
Sbjct: 134 --QKAVPRNYDDSLP----------------GDSKAFLAQLSLPAQLASQQSGVPHHLIL 175

Query: 252 AQSALETGWGQKIVRGNNGAPSHNLFNIKADRRWQGDKANVSTLEFEQGIAVRQKADFRV 311
AQ+ALE+GWGQ+ +R NG PS+NLF +KA W+G ++T E+E G A + KA FRV
Sbjct: 176 AQAALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRV 235

Query: 312 YADFEHSFNDFVSFIAEGERYQAAKKVAASPTQFIRALQDAGYATDPKYAEKVIKVMQSI 371
Y+ + + +D+V + RY AA AAS Q +ALQDAGYATDP YA K+ ++Q +
Sbjct: 236 YSSYLEALSDYVGLLTRNPRY-AAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQM 294

Query: 372 SE 373

Sbjct: 295 KS 296


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Shewmr4_1268 | FLGHOOKAP1 | 212 | 2e-63 | Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 212 bits (542), Expect = 2e-63
Identities = 129/486 (26%), Positives = 205/486 (42%), Gaps = 20/486 (4%)

Query: 4 DLLNIARTGVLASQSQLGVTSNNIANANTAGYHRQVATQTTLDSQRLGNSFYGTGTYVSD 63
L+N A +G+ A+Q+ L SNNI++ N AGY RQ +S + G G YVS
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSG 61

Query: 64 VKRIYNDYAARELRIGQTTLSAAEASYGKLSELDQVFSQIGKIVPQSLNDLFSGLNSLAD 123
V+R Y+ + +LR QT S A Y ++S++D + S + + D F+ L +L
Sbjct: 62 VQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLVS 121

Query: 124 LPADLGIRSSTLNDAKQLANSLNQMQSTLNGQLTQTNDQITGMTKRINEISTELANLNLE 183
D R + + ++ L N L Q Q N I +IN + ++A+LN +
Sbjct: 122 NAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLNDQ 181

Query: 184 LMKSPNQDAM-----LLDKQDALVQELSQYAQVNVIPLENGAKSIMLGGAIMLVSGEV-- 236
+ + A LLD++D LV EL+Q V V + G +I + LV G
Sbjct: 182 ISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTAR 241

Query: 237 PMSISTATGDPFPNELQLMSTIGSQSVRVDPTKLGGQLGALFEYREQTLVPAGLELDQLA 296
++ ++ DP + + + G LG + +R Q L L QLA
Sbjct: 242 QLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQLA 301

Query: 297 LGVADNFNKLQAEGFDLNGQVGADIFKDINDPLMSIGRVAGFSGNTGNATLGVNIDDTSA 356
L A+ FN GFD NG G D F + V + N G+ +G + D SA
Sbjct: 302 LAFAEAFNTQHKAGFDANGDAGEDFFA------IGKPAVLQNTKNKGDVAIGATVTDASA 355

Query: 357 LSGGSYELSF--TAPATYELRDTQTGTITPLTLNGTKLEGGAGFSIDIKAGAMASGDRFA 414
+ Y++SF L T T+TP +G G A D F
Sbjct: 356 VLATDYKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFT----GTPAVNDSFT 411

Query: 415 IRPTAGAANGIEVVMTDPKGIAAAAPKITPDAANSGNTQVKVTQITNRSAANFPTTGSEL 474
++P + A ++V++TD IA A+ + D+ N N Q + +N + ++
Sbjct: 412 LKPVSDAIVNMDVLITDEAKIAMASEEDAGDSDN-RNGQALLDLQSNSKTVGGAKSFNDA 470

Query: 475 TIQLNT 480
L +
Sbjct: 471 YASLVS 476



Score = 81.5 bits (201), Expect = 2e-18
Identities = 37/102 (36%), Positives = 52/102 (50%)

Query: 536 EGDNTNALAMAKLSETKVMNGGKSTLADVFEQTKQDIGSQTKAAEVRVGAADAIYQQAYA 595
+ DN N A+ L GG + D + DIG++T + + Q
Sbjct: 442 DSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGNKTATLKTSSATQGNVVTQLSN 501

Query: 596 RVESESGVNLDEEAANLMRFQQAYQASARIMSTAQQIFDTLL 637
+ +S SGVNLDEE NL RFQQ Y A+A+++ TA IFD L+
Sbjct: 502 QQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALI 543


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Shewmr4_1269 | FLAGELLIN | 50 | 7e-09 | Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 50.4 bits (120), Expect = 7e-09
Identities = 33/233 (14%), Positives = 83/233 (35%), Gaps = 4/233 (1%)

Query: 20 QTATSKILDQLSSGKKVNTSGDDPVAALGIDNLNQRNALVDQFMKNIDYATNHLQQTESQ 79
Q++ S +++LSSG ++N++ DD + + Q +N + + Q TE
Sbjct: 21 QSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNANDGISIAQTTEGA 80

Query: 80 LGQADTLISSMKDLMLQGSNGSQTSEERQTIADDLRKSLDQLLTIANTKDESGNYLFAGN 139
L + + + +++L +Q +NG+ + + ++I D++++ L+++ ++N +G + + +
Sbjct: 81 LNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSNQTQFNGVKVLSQD 140

Query: 140 KTDTLPFQFDANGKIVYQGDSGVHSAIIASGIQLNTNVAGDTAFIKSPNAMGDYSVNYLP 199
+ + I ++ G NV G +V
Sbjct: 141 NQMKIQVGANDGETITIDLQKIDVKSLGLDG----FNVNGPKEATVGDLKSSFKNVTGYD 196

Query: 200 SQQGEFSVTSAKLDGVTPSLSDYKINFLDDGAGGINVEVTDTATPANVISAAA 252
+ + ++ D T N +
Sbjct: 197 TYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAVDL 249


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Shewmr4_1270 | FLAGELLIN | 206 | 1e-62 | Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 206 bits (524), Expect = 1e-62
Identities = 156/507 (30%), Positives = 231/507 (45%), Gaps = 25/507 (4%)

Query: 2 AISVNTNVTSMKAQNQLNGANNRLSTSMERLSSGLRINSAKDDAAGLQISNRMTSQINGI 61
A +NTN S+ QN LN + + LS+++ERLSSGLRINSAKDDAAG I+NR TS I G+
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 GVAMRNANDGISIAQTAEGAMQESTNILQRMRDLSLQSANGSNSSEDRAAMQKELNALQS 121
A RNANDGISIAQT EGA+ E N LQR+R+LS+Q+ NG+NS D ++Q E+
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 ELTRIADTTSFGGQKLLDGSYGTQSFQVGAQANETISVSLKSVAAADIGAYKSDAAGSKF 181
E+ R+++ T F G K+L QVGA ETI++ L+ + +G + G K
Sbjct: 121 EIDRVSNQTQFNGVKVLSQD-NQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 182 GGDLVTAVAGSNGSAGGNLSITQGTKTTTFATAANDTAAEVAGKINKAGTGVTATAQTTI 241
+ N + ++ + A T +K TT
Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTD 239

Query: 242 EANLTEAFDKGLTMKVDDGNSTSSLDLTGIGN-NDDLAKAINNVSGETGVSAKLENGVLT 300
+A A D T K G + + I + V+ +
Sbjct: 240 DAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKV 299

Query: 301 ITSSKGADISFSD-----------------------TATAGTGTLTLKNISADGTSSAVS 337
T+ G ++ + + G T K + S +
Sbjct: 300 STTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLE 359

Query: 338 TIDGTDTTVDNATAVGSVSLTASSAYSISGGIAAELTTEVAGVFAGVNSVDISSAAGSQS 397
+ + A+ G + +GV +N ++ + +
Sbjct: 360 ANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTAN 419

Query: 398 ALAIIDGAIASIDSSRSDLGAVQNRMSFTINNLSNIQSNVTDARSRIQDVDFASETAQLT 457
LA ID A++ +D+ RS LGA+QNR I NL N +N+ ARSRI+D D+A+E + ++
Sbjct: 420 PLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMS 479

Query: 458 KQQILSQTSSAMLAQANQLPQTALSLL 484
K QIL Q +++LAQANQ+PQ LSLL
Sbjct: 480 KAQILQQAGTSVLAQANQVPQNVLSLL 506


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Shewmr4_1273 | FLAGELLIN | 207 | 3e-63 | Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 207 bits (528), Expect = 3e-63
Identities = 156/507 (30%), Positives = 230/507 (45%), Gaps = 25/507 (4%)

Query: 2 AISVNTNVTSMRAQNQLNGANSKLSTSMERLSSGLRINSAKDDAAGLQISNRMTSQVNGI 61
A +NTN S+ QN LN + S LS+++ERLSSGLRINSAKDDAAG I+NR TS + G+
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 GVAMRNANDGISIAQTAEGAMQESTNILQRMRDLSLQSANGSNSSEDRAAMQKEVNALQS 121
A RNANDGISIAQT EGA+ E N LQR+R+LS+Q+ NG+NS D ++Q E+
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 ELTRISETTSFGGQKLLDGSYGTQSFQVGAQANETISVSLKSVAAADIGAYKSDAAGSKF 181
E+ R+S T F G K+L QVGA ETI++ L+ + +G + G K
Sbjct: 121 EIDRVSNQTQFNGVKVLSQD-NQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 182 GGDLVTAVAGSNGSAGGNLSITQGTKTTTFATAANDTAAEVAGKINKAGTGVTATAQTTI 241
+ N + ++ + A T +K TT
Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTD 239

Query: 242 EANLTEAFDKGLTMKVDDGNSTSSLDLTGIGN-NDDLAKAINNVSGETGVSAKLENGVLT 300
+A A D T K G + + I + V+ +
Sbjct: 240 DAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKV 299

Query: 301 ITSSKGADISFSD-----------------------TATAGTGTLTLKNISADGTSSAVS 337
T+ G ++ + + G T K + S +
Sbjct: 300 STTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLE 359

Query: 338 TIDGTDTTVDNATAVGSVSLTASSAYSISGGIAAELTTEVAGVFAGVNSVDLSTASGSQS 397
+ + A+ G + +GV +N + + +
Sbjct: 360 ANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTAN 419

Query: 398 ALAIIDGAIAGIDSQRADLGAVQNRMNFTINNLSNIQSNVTDARSRIQDVDFASETAQLT 457
LA ID A++ +D+ R+ LGA+QNR + I NL N +N+ ARSRI+D D+A+E + ++
Sbjct: 420 PLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMS 479

Query: 458 KQQILSQTSSAMLAQANQLPQTALSLL 484
K QIL Q +++LAQANQ+PQ LSLL
Sbjct: 480 KAQILQQAGTSVLAQANQVPQNVLSLL 506


18 | Shewmr4_1306 | Shewmr4_1340 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
Shewmr4_1306 | 0 | 24 | -3.091974 | TatD-related deoxyribonuclease
Shewmr4_1307 | 0 | 19 | -3.170970 | nucleoside transporter
Shewmr4_1308 | -1 | 17 | -3.064079 | hypothetical protein
Shewmr4_1309 | -1 | 18 | -3.562045 | nucleoside-specific channel-forming protein,
Shewmr4_1310 | -1 | 20 | -3.535553 | hypothetical protein
Shewmr4_1311 | 0 | 22 | -4.248283 | deoxyribose-phosphate aldolase
Shewmr4_1312 | -1 | 23 | -4.852626 | thymidine phosphorylase
Shewmr4_1313 | -1 | 27 | -5.177170 | phosphopentomutase
Shewmr4_1314 | -1 | 29 | -6.157829 | purine nucleoside phosphorylase
Shewmr4_1315 | 1 | 27 | -5.810048 | membrane protein
Shewmr4_1316 | 3 | 26 | -6.548728 | hypothetical protein
Shewmr4_1317 | 3 | 27 | -6.452385 | phosphoserine phosphatase
Shewmr4_1318 | 2 | 30 | -8.165819 | alpha/beta hydrolase fold domain-containing
Shewmr4_1319 | 3 | 35 | -10.909087 | type IV pilus assembly PilZ
Shewmr4_1320 | 3 | 38 | -12.285390 | DNA repair protein RadA
Shewmr4_1321 | 1 | 47 | -15.128637 | type IV pilus assembly PilZ
Shewmr4_1322 | 3 | 55 | -17.520971 | DNA-binding transcriptional regulator TorR
Shewmr4_1323 | 1 | 50 | -15.555006 | TMAO reductase system periplasmic protein TorT
Shewmr4_1324 | 1 | 49 | -13.317650 | periplasmic sensory protein associated with the
Shewmr4_1325 | 1 | 45 | -11.009082 | multi-sensor hybrid histidine kinase
Shewmr4_1326 | 0 | 41 | -9.563649 | hypothetical protein
Shewmr4_1327 | 0 | 38 | -8.207350 | chaperone protein TorD
Shewmr4_1328 | 1 | 35 | -6.130913 | trimethylamine-N-oxide reductase TorA
Shewmr4_1329 | 0 | 32 | -6.071930 | hypothetical protein
Shewmr4_1330 | 0 | 28 | -6.095764 | trimethylamine-N-oxide reductase c-type
Shewmr4_1331 | -1 | 26 | -5.640159 | hypothetical protein
Shewmr4_1332 | -1 | 28 | -5.257697 | periplasmic nitrate reductase NapE
Shewmr4_1333 | 0 | 25 | -4.309332 | hypothetical protein
Shewmr4_1335 | 0 | 22 | -5.132788 | xanthine/uracil/vitamin C permease
Shewmr4_1336 | -1 | 25 | -5.396715 | CBS domain-containing protein
Shewmr4_1337 | -1 | 26 | -5.654218 | phospholipid/glycerol acyltransferase
Shewmr4_1338 | 1 | 30 | -5.660490 | metallophosphoesterase
Shewmr4_1339 | 0 | 27 | -5.046261 | nucleoside recognition domain-containing
Shewmr4_1340 | 0 | 20 | -3.747663 | hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Shewmr4_1309 | TYPE3IMSPROT | 55 | 2e-12 | Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 55.2 bits (133), Expect = 2e-12
Identities = 18/87 (20%), Positives = 32/87 (36%), Gaps = 3/87 (3%)

Query: 8 TQQAVALGYD-GKH-APKVVASGEGLVADEIIALAKASGVYIHQDPHLSNFL-RLLELGE 64
T A+ + Y G+ P V + +A+ GV I Q L+ L +
Sbjct: 265 THIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEEEGVPILQRIPLARALYWDALVDH 324

Query: 65 EIPKELYLLIAELIAFVYMLDGKFPEQ 91
IP E AE++ ++ + +
Sbjct: 325 YIPAEQIEATAEVLRWLERQNIEKQHS 351


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Shewmr4_1311 | VACJLIPOPROT | 230 | 5e-78 | VacJ lipoprotein signature.
		>VACJLIPOPROT#VacJ lipoprotein signature.

Length = 251

Score = 230 bits (588), Expect = 5e-78
Identities = 91/257 (35%), Positives = 134/257 (52%), Gaps = 16/257 (6%)

Query: 9 LLGFALLPKVYGAEATVPDTTPKETASAVKITYDDPRDPLEGFNRAMWDFNYLYLDRYIY 68
L AL + A+ + DPLEGFNR M++FN+ LD YI
Sbjct: 5 LSALALGTTLLVGCASSGTDQQGRS------------DPLEGFNRTMYNFNFNVLDPYIV 52

Query: 69 RPIAHGYNDYLPLPAKTGINNFVQNLEEPSSLVNNALQGKWGWAANAGGRFTVNTTIGLL 128
RP+A + DY+P PA+ G++NF NLEEP+ +VN LQG RF +NT +G+
Sbjct: 53 RPVAVAWRDYVPQPARNGLSNFTGNLEEPAVMVNYFLQGDPYQGMVHFTRFFLNTILGMG 112

Query: 129 GVFDVADMMGMPRKQDE---FNEVLGYYGVPNGPYFMAPFAGPYVVRELASDWVDGLYFP 185
G DVA M ++ E F LG+YGV GPY PF G + +R+ D D LY
Sbjct: 113 GFIDVAGMANPKLQRTEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRDDGGDMADALYPV 172

Query: 186 LSELTVWQSIVKWGLKSLHARASAIDQERLVDNALDPYTFVKDAYLQHMDYKVYDGNV-P 244
LS LT S+ KW L+ + RA +D + L+ + DPY V++AY Q D+ G + P
Sbjct: 173 LSWLTWPMSVGKWTLEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGELKP 232

Query: 245 QKQEDDELLDQYMQELE 261
Q+ + + + +++++
Sbjct: 233 QENPNAQAIQDDLKDID 249


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Shewmr4_1312 | HTHFIS | 95 | 2e-23 | FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 94.9 bits (236), Expect = 2e-23
Identities = 29/101 (28%), Positives = 47/101 (46%)

Query: 7 SILLVEDDPVFRQIVATFLSGRGAEVVQACDGEQGLSIFKQQRFDIILADLSMPKLGGLD 66
+IL+ +DD R ++ LS G +V + D+++ D+ MP D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 67 MLKEMSKLEPLVPSIVISGNNVMADVVEALRVGACDYLVKP 107
+L + K P +P +V+S N ++A GA DYL KP
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKP 105


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Shewmr4_1318 | NUCEPIMERASE | 179 | 8e-56 | Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 179 bits (456), Expect = 8e-56
Identities = 85/361 (23%), Positives = 149/361 (41%), Gaps = 51/361 (14%)

Query: 1 MKILVTGGAGFIGSAVVRHIISNTQDSVINLDKLT--YAGNL-ESLVSVEASERYAFEQV 57
MK LVTG AGFIG V + ++ V+ +D L Y +L ++ + + A + F ++
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAG-HQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKI 59

Query: 58 DICDRAELDRVFSQYKPDAVMHLAAESHVDRSITGPADFIQTNIVGTYTLLEAARHYWMQ 117
D+ DR + +F+ + V V S+ P + +N+ G +LE RH +Q
Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQ 119

Query: 118 LDAERKVAFRFHHISTDEVYGDLPHPDEQEGQVVNQELPLFTETTPYAPSSPYSASKASS 177
+ S+ VYG N+++P T+ + P S Y+A+K ++
Sbjct: 120 ---------HLLYASSSSVYGL------------NRKMPFSTDDSVDHPVSLYAATKKAN 158

Query: 178 DHLVRAWLRTYGLPTIVTNCSNNYGPYHFPEKLIPLVILNALEGKQLPIYGKGDQIRDWL 237
+ + + YGLP YGP+ P+ + LEGK + +Y G RD+
Sbjct: 159 ELMAHTYSHLYGLPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFT 218

Query: 238 YVEDHARALYKVV------------------TEGQIGETYNIGGHNEKQNLEVVQTICTI 279
Y++D A A+ ++ YNIG +E++ I +
Sbjct: 219 YIDDIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNS---SPVELMDYIQAL 275

Query: 280 LDALVPKASSYAEQITYVTDRPGHDRRYAIDASKISNELNWQPQETFETGLCKTVEWYLA 339
DAL +A + + +PG + D + + + P+ T + G+ V WY
Sbjct: 276 EDALGIEA-----KKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRD 330

Query: 340 N 340

Sbjct: 331 F 331


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Shewmr4_1336 | PF06291 | 30 | 0.001 | Lambda prophage Bor protein
		>PF06291#Lambda prophage Bor protein

Length = 102

Score = 30.4 bits (68), Expect = 0.001
Identities = 17/46 (36%), Positives = 25/46 (54%), Gaps = 3/46 (6%)

Query: 13 QTVKMNMKLSQISLALLALMITACSEPAKTVANEPVAAPHQDTQTN 58
Q KM L +LA+L IT C++ TV N+P A ++T T+
Sbjct: 2 QDNKMKKMLFSAALAML---ITGCAQQTFTVGNKPTAVTPKETITH 44


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Shewmr4_1337 | PF06580 | 48 | 5e-08 | Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 47.6 bits (113), Expect = 5e-08
Identities = 35/198 (17%), Positives = 74/198 (37%), Gaps = 38/198 (19%)

Query: 266 NTMQDGLGLIERNLSRAAELV--------HNFKRTAADQSVLERERFNLKTYIFQIFSSL 317
N + + LI + ++A E++ ++ + + A Q L E + +Y+ + S
Sbjct: 177 NALNNIRALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQ-LAS-- 233

Query: 318 RPLMR-KKNIALNVELDDDIFIESYPGAIAQIFTNLVANSFRHGFPESFTGDKIITIRVQ 376
++ + + +++ I P + Q LV N +HG + G I ++
Sbjct: 234 ---IQFEDRLQFENQINPAIMDVQVPPMLVQT---LVENGIKHGIAQLPQG-GKILLKGT 286

Query: 377 KQDSNICMQYQDNGVGMTDEVKLKAFEPFFTTARKDGGTGLGMSIIYNLVTQKLHG---T 433
K + + ++ ++ G K TG G+ + + Q L+G
Sbjct: 287 KDNGTVTLEVENTGSLALKNTK--------------ESTGTGLQNVRERL-QMLYGTEAQ 331

Query: 434 IMLTSSPYQGVKVEIQIP 451
I L+ V + IP
Sbjct: 332 IKLSEKQ-GKVNAMVLIP 348


19 | Shewmr4_1399 | Shewmr4_1411 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
Shewmr4_1399 | 2 | 15 | 2.454714 | phosphatidylglycerophosphatase
Shewmr4_1400 | 2 | 16 | 3.378228 | UspA domain-containing protein
Shewmr4_1401 | 1 | 14 | 3.630788 | recombination and repair protein
Shewmr4_1402 | 1 | 12 | 3.239636 | auxin efflux carrier
Shewmr4_1403 | 0 | 12 | 1.780019 | LysR family transcriptional regulator
Shewmr4_1404 | 0 | 11 | 1.484836 | hypothetical protein
Shewmr4_1405 | -1 | 15 | 0.351824 | hypothetical protein
Shewmr4_1406 | 0 | 25 | -5.717475 | hybrid sensory histidine kinase BarA
Shewmr4_1407 | -1 | 22 | -5.827925 | 23S rRNA 5-methyluridine methyltransferase
Shewmr4_1408 | -1 | 23 | -6.526475 | (p)ppGpp synthetase I, SpoT/RelA
Shewmr4_1409 | 0 | 25 | -7.433321 | hypothetical protein
Shewmr4_1410 | -1 | 23 | -6.912166 | nucleoside triphosphate pyrophosphohydrolase
Shewmr4_1411 | 0 | 18 | -5.903452 | CTP synthetase
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Shewmr4_1402 | IGASERPTASE | 50 | 2e-08 | IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 50.4 bits (120), Expect = 2e-08
Identities = 32/194 (16%), Positives = 67/194 (34%), Gaps = 13/194 (6%)

Query: 473 NQQGQDQNDQQQGDQQSSQNDQAQDQSQEQQSQQQNNSDQADKKPSQEQSTSSEQNDPEQ 532
NQ N + Q+ + + + + A PS+ T +E + E
Sbjct: 989 NQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQES 1048

Query: 533 GAQDKQQASDENAKQDQQDAQQEQQQAEQQANQQNGADNNAEDKEDPASNEAKMQAKVE- 591
+K + ++ +E + + Q N + + ++ + E K A VE
Sbjct: 1049 KTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEK 1108

Query: 592 DDKSKAKQEQQQAVAQKADKEK-----------QAQADKKPDTAVESVEA-PPSNSEPLP 639
++K+K + E+ Q V + + QA+ ++ D V E +N+
Sbjct: 1109 EEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADT 1168

Query: 640 AEMQRALRGVSEDP 653
+ + E P
Sbjct: 1169 EQPAKETSSNVEQP 1182



Score = 44.7 bits (105), Expect = 1e-06
Identities = 32/225 (14%), Positives = 78/225 (34%), Gaps = 15/225 (6%)

Query: 423 KAKERYQAALDKQPEFPQAKANLELAEKLLNQQQSQQNADNQDKQSQGDQNQQGQDQNDQ 482
A+ P P +AE + ++ + + ++ + ++
Sbjct: 1017 IARVDEAPVPPPAPATPSETTET-VAENSKQESKTVEKNEQDATETTAQNREVAKEAKSN 1075

Query: 483 QQGDQQSSQNDQAQDQSQEQQSQQQNNSDQADKKPSQEQSTSSEQNDPEQGAQDKQQASD 542
+ + Q+++ Q+ +++E Q+ + + +K+ + T Q P+ +Q +
Sbjct: 1076 VKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQ 1135

Query: 543 ENAKQDQQDAQQE--------QQQAEQ--QANQQNGADNNAEDKEDPASNEAKMQAKVED 592
Q Q + +E + Q++ A+ + A + + E P + V
Sbjct: 1136 SETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTE----STTVNT 1191

Query: 593 DKSKAKQEQQQAVAQKADKEKQAQADKKPDTAVESVEAPPSNSEP 637
S + + A ++K + SV + P N EP
Sbjct: 1192 GNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEP 1236



Score = 42.4 bits (99), Expect = 6e-06
Identities = 26/200 (13%), Positives = 64/200 (32%), Gaps = 7/200 (3%)

Query: 480 NDQQQGDQQSSQNDQAQDQSQEQQSQQQNNSDQADKKPSQEQSTSSEQNDPEQGAQDKQQ 539
N + + Q+ + Q S+ + E + +
Sbjct: 982 NPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPS---ETTE 1038

Query: 540 ASDENAKQDQQDAQQEQQQAEQQANQQNGADNNAEDKEDPASNEAKMQAKVEDDKSKAKQ 599
EN+KQ+ + ++ +Q A + Q A+ A+ + A+ + + +
Sbjct: 1039 TVAENSKQESKTVEKNEQDATETTAQNREVAKEAK-SNVKANTQTNEVAQSGSETKETQT 1097

Query: 600 EQQQAVAQKADKEKQ-AQADKKPDTAVESVEAPPSNSEPLPAEMQRALRGVSEDPQVLLR 658
+ + A +EK + +K + + + P + +A DP V ++
Sbjct: 1098 TETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQS-ETVQPQAEPARENDPTVNIK 1156

Query: 659 NKMQLEYQKRRQNGQISRDN 678
Q + Q +++
Sbjct: 1157 EP-QSQTNTTADTEQPAKET 1175


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Shewmr4_1406 | HTHFIS | 34 | 8e-04 | FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 33.7 bits (77), Expect = 8e-04
Identities = 35/139 (25%), Positives = 53/139 (38%), Gaps = 19/139 (13%)

Query: 28 LIALIANG--HLLVEGPPGLAKT---RAVKALCDGVEGDFHRIQ---FTPDLLPADLTG- 78
++A + L++ G G K RA+ G F I DL+ ++L G
Sbjct: 152 VLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGH 211

Query: 79 -----TDIYRSQTGTFEFEAGPIFHNLILADEINRAPAKVQSALLEAMAEGQVT-VGKNS 132
T TG FE G + DEI P Q+ LL + +G+ T VG +
Sbjct: 212 EKGAFTGAQTRSTGRFEQAEG----GTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRT 267

Query: 133 YKLPPLFLVMATQNPLENE 151
+ +V AT L+
Sbjct: 268 PIRSDVRIVAATNKDLKQS 286


20 | Shewmr4_1437 | Shewmr4_1445 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
Shewmr4_1437 | 0 | 16 | 3.952073 | peptidylprolyl isomerase, FKBP-type
Shewmr4_1438 | 0 | 13 | 3.365515 | bifunctional aspartokinase I/homoserine
Shewmr4_1439 | 0 | 12 | 2.336652 | bifunctional aspartokinase I/homeserine
Shewmr4_1440 | -1 | 14 | -0.004510 | homoserine kinase
Shewmr4_1441 | 1 | 22 | -3.636269 | threonine synthase
Shewmr4_1442 | 1 | 20 | -3.366711 | hypothetical protein
Shewmr4_1443 | 0 | 26 | -5.268250 | hypothetical protein
Shewmr4_1444 | 0 | 23 | -5.617363 | extracellular solute-binding protein
Shewmr4_1445 | 0 | 19 | -5.065890 | hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Shewmr4_1442 | OMS28PORIN | 31 | 0.008 | OMS28 porin signature.
		>OMS28PORIN#OMS28 porin signature.

Length = 257

Score = 30.9 bits (69), Expect = 0.008
Identities = 44/219 (20%), Positives = 93/219 (42%), Gaps = 16/219 (7%)

Query: 239 FLDSVAVAMAQMTSAIEEVSVNASNTSLQTKDNASQMSASQGRISHTVDSIGQLSTKIGA 298
F DS + + S + E S N L KD +Q + +++ V S + G
Sbjct: 23 FADSNNANILKPQSNVLEHSDQKDNKKLDQKDQVNQALDTINKVTEDVSSKLE-----GV 77

Query: 299 AFSSVEQLSKDAAGI----DAVVTTINSISEQTNLLALNAAIEAARAGEQGRGFAVVADE 354
SS+E + + AG+ ++ ++ +++ T + + A I A +G G V +
Sbjct: 78 RESSLELVESNDAGVVKKFVGSMSLMSDVAKGTVVASQEATIVAKCSGMVAEGANKVVEM 137

Query: 355 VRTLAGRTQ-------QATVEIQAMIEGLQSGTRQLSNITTEIVDRADEGRSAIIAVGDD 407
+ TQ +AT I+ I +S + +T E + ++ + ++A
Sbjct: 138 SKKAVQETQKAVSVAGEATFLIEKQIMLNKSPNNKELELTKEEFAKVEQVKETLMASERA 197

Query: 408 VEGMAQSVNAVFDMSSQIAASAEEQSVAARDIAGQLNEI 446
++ Q V +M + + S ++Q +A +D+A ++ +
Sbjct: 198 LDETVQEAQKVLNMVNGLNPSNKDQVLAKKDVAKAISNV 236


21 | Shewmr4_1578 | Shewmr4_1592 | Y | Y | Y | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
Shewmr4_1578 | 1 | 30 | -5.144677 | acriflavin resistance protein
Shewmr4_1579 | 1 | 30 | -4.846182 | hypothetical protein
Shewmr4_1580 | 0 | 28 | -4.258477 | RND family efflux transporter MFP subunit
Shewmr4_1581 | -1 | 28 | -4.405646 | TetR family transcriptional regulator
Shewmr4_1582 | -1 | 26 | -4.344221 | hypothetical protein
Shewmr4_1583 | -1 | 26 | -4.230680 | hypothetical protein
Shewmr4_1584 | 0 | 12 | -1.963724 | hypothetical protein
Shewmr4_1585 | 0 | 13 | -1.586392 | hypothetical protein
Shewmr4_1586 | 2 | 18 | -2.835738 | hypothetical protein
Shewmr4_1587 | 4 | 20 | -1.671733 | *hypothetical protein
Shewmr4_1588 | 2 | 16 | -0.485752 | hypothetical protein
Shewmr4_1589 | 3 | 16 | -0.401752 | hypothetical protein
Shewmr4_1590 | 2 | 14 | -0.450449 | hypothetical protein
Shewmr4_1591 | 2 | 11 | 0.769689 | hypothetical protein
Shewmr4_1592 | 2 | 11 | 0.991852 | hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Shewmr4_1579 | PF06580 | 37 | 1e-04 | Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.2 bits (86), Expect = 1e-04
Identities = 21/102 (20%), Positives = 34/102 (33%), Gaps = 23/102 (22%)

Query: 356 LMENAFRLCISQ------VEVSARFNDQGDFELIVEDDGPGVEENLRQKIIQRGVRADTQ 409
L+EN + I+Q + + D G L VE+ G +N ++
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTK-DNGTVTLEVENTGSLALKNTKE------------ 309

Query: 410 SPGQGIGLA-VCDEIVSSYGGYLSIE-ESHLEGARFRITIPA 449
G GL V + + YG I+ + IP
Sbjct: 310 --STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIPG 349


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Shewmr4_1580 | HTHFIS | 79 | 4e-19 | FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 78.7 bits (194), Expect = 4e-19
Identities = 32/124 (25%), Positives = 59/124 (47%), Gaps = 1/124 (0%)

Query: 1 MSIMRILVVEDDLILSHHLKVQLSDLGNQVQVALTAKEGFFQATNYPIDVAIVDLGLPDQ 60
M+ ILV +DD + L LS G V++ A + D+ + D+ +PD+
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 DGISLIQQLREEGVKAPILILTARVNWQDKVEGLNAGADDYLVKPFQKEELVARLD-ALV 119
+ L+ ++++ P+L+++A+ + ++ GA DYL KPF EL+ + AL
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 120 RRSA 123

Sbjct: 121 EPKR 124


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Shewmr4_1583 | INTIMIN | 42 | 1e-05 | Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 42.4 bits (99), Expect = 1e-05
Identities = 45/184 (24%), Positives = 78/184 (42%), Gaps = 16/184 (8%)

Query: 54 ATPAEVQATVVDSKTGPKAGIVVTFKLDNDELGTFTPSTGTQLTDSSGVAKIKLDTGSLA 113
ATV + +A + V+F + + GT S + T+ SG A + L +
Sbjct: 575 TEAITYTATVKKNGV-AQANVPVSFNIVS---GTAVLSANSANTNGSGKATVTLKSDKP- 629

Query: 114 GAGSVTASIGTGESASMGFYSKGDGAINPGTGNKLKLSLVNAQEQAITSISSATPGIVKA 173
G V+A SA + I ++ K S+ + T++++ I
Sbjct: 630 GQVVVSAKTAEMTSA-----LNANAVIFV---DQTKASITEIKADKTTAVANGQDAITYT 681

Query: 174 LYTNSSDEPLVGKVITFTSTLGKFQPESGTALTDAQGLAKIAITAGTVAGAGKIIAKVDD 233
+ D+P+ + +TFT+TLGK + T TD G AK+ +T+ T G + A+V D
Sbjct: 682 VKVMKGDKPVSNQEVTFTTTLGK--LSNSTEKTDTNGYAKVTLTSTTP-GKSLVSARVSD 738

Query: 234 TESE 237
+
Sbjct: 739 VAVD 742



Score = 37.0 bits (85), Expect = 5e-04
Identities = 60/305 (19%), Positives = 99/305 (32%), Gaps = 48/305 (15%)

Query: 379 TGMPTTNISATQPGKVTV---ALVDKDSTPLVGKVVSFSSTLGNFLPTQGTALTDALGRA 435
T SA G + A V K+ VSF+ G + + +A T+ G+A
Sbjct: 561 TDFTADKTSAKADGTEAITYTATVKKNGVAQANVPVSFNIVSGTAVLSANSANTNGSGKA 620

Query: 436 SITLTAGSIEGAGEITATYG-----TAKAIIGFVTAGDEIDPVEASPEISFDIYDCNGVA 490
++TL + T A A+I I ++A NG
Sbjct: 621 TVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTKASITEIKADK----TTAVANGQD 676

Query: 491 AWDKALKNFEVCKTTDNITNDKPGIIGAKVTRSGSTQALQQVLISAATTIGAISPSSGTA 550
A +K + DKP + +VT + TT+G +S S+
Sbjct: 677 AITYTVKVMK---------GDKP-VSNQEVTFT--------------TTLGKLSNSTEK- 711

Query: 551 ITNAEGKAILDLYANGNVGAGEISLKVKDVTATKAFEIGRVNISLKLETSLGGNLLPAGG 610
T+ G A + L + G +S +V DV A ++ + ++ + G
Sbjct: 712 -TDTNGYAKVTLTST-TPGKSLVSARVSDV----AVDVKAPEVEFFTTLTIDDGNIEIVG 765

Query: 611 STI---LDVTVLNPDGS--LATGQPFTLVFTSECQASNKAIIDSPVITNGGKGYATYRST 665
+ + L L A+G + S A S +T KG T
Sbjct: 766 TGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIASVDASSGQVTLKEKGTTTISVI 825

Query: 666 GCETQ 670
+ Q
Sbjct: 826 SSDNQ 830


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Shewmr4_1588 | HTHFIS | 77 | 2e-18 | FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 76.8 bits (189), Expect = 2e-18
Identities = 32/112 (28%), Positives = 50/112 (44%), Gaps = 6/112 (5%)

Query: 7 KIIIADDHPLFRNALRQALSSAFEHAQWYEADSADALQSVLDS-QNVSYDLVLLDLQMPG 65
I++ADD R L QALS A Y+ ++ DLV+ D+ MP
Sbjct: 5 TILVADDDAAIRTVLNQALSRAG-----YDVRITSNAATLWRWIAAGDGDLVVTDVVMPD 59

Query: 66 SHGYSTLIHLRSHYPELPVVVISAHEDINTISRAIHYGGSGFIPKSASMETL 117
+ + L ++ P+LPV+V+SA T +A G ++PK + L
Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTEL 111


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Shewmr4_1589 | TYPE3OMGPROT | 29 | 0.010 | Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature.

Length = 607

Score = 28.7 bits (64), Expect = 0.010
Identities = 21/97 (21%), Positives = 41/97 (42%), Gaps = 21/97 (21%)

Query: 15 AIEQRINQSEARVIKAVFPSITNHHNTLFGGEALAWMDETAFIAATRFCRKTLVTVSSDR 74
+E + A+V+ P++ N +A+ ET + V V+
Sbjct: 350 LLEN---EGSAQVVSR--PTLLTQENA----QAVIDHSETYY-----------VKVTGKE 389

Query: 75 IDFKKPIPAGTLAELIARVIHVGNTSLKVEVNIFVED 111
+ K I GT+ + RV+ G+ S ++ +N+ +ED
Sbjct: 390 VAELKGITYGTMLRMTPRVLTQGDKS-EISLNLHIED 425


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Shewmr4_1592 | PHPHTRNFRASE | 301 | 7e-95 | Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 301 bits (772), Expect = 7e-95
Identities = 110/418 (26%), Positives = 188/418 (44%), Gaps = 65/418 (15%)

Query: 384 QPGDVLVTDMTDPDWEPIMK-RASAIVTNRGGRTCHAAIIARELGVPAVVGCGDVTDRIK 442
+ ++ D+T D + K T+ GGRT H+AI++R L +PAVVG +VT++I+
Sbjct: 155 EETVIIAEDLTPSDTAQLNKQFVKGFATDIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQ 214

Query: 443 NGQLVTVSCAEG---------DTGYIYEGKQEFEVVSNRVDALPELP--------MKIMM 485
+G +V V EG + E + FE L P +++
Sbjct: 215 HGDMVIVDGIEGIVIVNPTEEEVKAYEEKRAAFEKQKQEWAKLVGEPSTTKDGAHVELAA 274

Query: 486 NVGNPDRAFDFARLPNEGVGLARLEFIINRMIGIHPKALLEFNQQDAALQEEINEMIAGY 545
N+G P EG+GL R EF+ + +D EE E Y
Sbjct: 275 NIGTPKDVDGVLANGGEGIGLYRTEFL--------------YMDRDQLPTEE--EQFEAY 318

Query: 546 DSPVEFYIARLVEGIASIGSAFYPKKVIVRMSDFKSNEYANLVGGDRYEPEEENPMLGFR 605
V+ K V++R D ++ + + P+E NP LGFR
Sbjct: 319 KEVVQ---------------RMDGKPVVIRTLDIGGDKELSYL----QLPKELNPFLGFR 359

Query: 606 GASRYISESFRDCFALECEAIKRVRNEMGLKNVEVMIPFVRTVKEAEQVIGLLKEQGLER 665
+ + +D F + A+ R N++VM P + T++E Q +++E+ +
Sbjct: 360 AIRLCLEK--QDIFRTQLRALLRAS---TYGNLKVMFPMIATLEELRQAKAIMQEEKDKL 414

Query: 666 GKDG------LRVIMMCEVPSNALLADQFLEHFDGFSIGSNDLTQLSLGLDRDSGIISHL 719
+G + V +M E+PS A+ A+ F + D FSIG+NDL Q ++ DR + +S+L
Sbjct: 415 LSEGVDVSDSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYL 474

Query: 720 FDERNEAVKILLSMAIKAAKAKGAYIGICGQGPSDHADFAAWLVEQGIDTVSLNPDTV 777
+ + A+ L+ M IKAA ++G ++G+CG+ D L+ G+D S++ ++
Sbjct: 475 YQPYHPAILRLVDMVIKAAHSEGKWVGMCGEMAGDE-VAIPLLLGLGLDEFSMSATSI 531


22 | Shewmr4_1628 | Shewmr4_1641 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
Shewmr4_1628 | 2 | 20 | 0.217766 | flagellar export protein FliJ
Shewmr4_1629 | 5 | 30 | -1.016049 | flagellar hook-length control protein
Shewmr4_1630 | 6 | 37 | -0.548582 | flagellar basal body-associated protein FliL
Shewmr4_1631 | 7 | 40 | -0.139862 | flagellar motor switch protein FliM
Shewmr4_1632 | 7 | 42 | -0.282777 | flagellar motor switch protein
Shewmr4_1633 | 7 | 46 | -0.324767 | flagellar biosynthesis protein, FliO
Shewmr4_1634 | 6 | 43 | -0.268679 | hypothetical protein
Shewmr4_1635 | 4 | 40 | -0.178737 | flagellar biosynthesis protein FliP
Shewmr4_1636 | 4 | 39 | -1.402338 | flagellar biosynthesis protein FliP
Shewmr4_1637 | 2 | 29 | -1.380815 | flagellar biosynthetic protein FliQ
Shewmr4_1638 | 1 | 19 | -0.823535 | flagellar biosynthesis protein FliR
Shewmr4_1639 | 1 | 19 | -0.599167 | flagellar biosynthesis protein FlhB
Shewmr4_1640 | 2 | 18 | -1.532630 | flagellar biosynthesis protein FlhA
Shewmr4_1641 | 2 | 17 | -1.721834 | flagellar biosynthesis regulator FlhF
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Shewmr4_1628 | ACRIFLAVINRP | 666 | 0.0 | Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 666 bits (1721), Expect = 0.0
Identities = 262/1094 (23%), Positives = 474/1094 (43%), Gaps = 100/1094 (9%)

Query: 7 SVKRPVTVWMFMLAIMLFGMVGFSRLAVKLLPDLSYPTLTIRTMYDGAAPVEVEQLVSKP 66
++RP+ W+ + +M+ G + +L V P ++ P +++ Y GA V+ V++
Sbjct: 5 FIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQV 64

Query: 67 IEEAVGVVKGLRKISSISRS-GMSDVVLEFEWGTTMDMASLDVREKLDTI--ALPLDVKK 123
IE+ + + L +SS S S G + L F+ GT D+A + V+ KL LP +V++
Sbjct: 65 IEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQ 124

Query: 124 PLLLRFNPNLDPIMRLALSVPNASEAELKQMRTYAEEELKRRLEALSGVAAVRLSGGLEQ 183
+ + +M N + Y +K L L+GV V+L G +
Sbjct: 125 QGISVEKSSSSYLMVAGFVSDNPGT-TQDDISDYVASNVKDTLSRLNGVGDVQLFGA-QY 182

Query: 184 EVHIQLNQEKLSQLNLNADDIKRRINEENINLSAGKVIQGD------REYLVRTLNQFNS 237
+ I L+ + L++ L D+ ++ +N ++AG++ + +F +
Sbjct: 183 AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKN 242

Query: 238 LEELGQVIVYRDAQ-TLVRLFEVATITDAFKERSDITRIGSQESIELAIYKEGDANTVAV 296
EE G+V + ++ ++VRL +VA + + + I RI + + L I AN +
Sbjct: 243 PEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDT 302

Query: 297 AKKLRDELVKINQD-PKQNKLEVIYDQSEFIESAVSEVTSSALMGSILSMLVIYLFLRNI 355
AK ++ +L ++ P+ K+ YD + F++ ++ EV + +L LV+YLFL+N+
Sbjct: 303 AKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNM 362

Query: 356 IPTLIISISIPFSVIATFNMMYFADISLNIMSLGGIALAIGLLVDNAIVVLENIDRC-RS 414
TLI +I++P ++ TF ++ S+N +++ G+ LAIGLLVD+AIVV+EN++R
Sbjct: 363 RATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMME 422

Query: 415 EGMSKLDAAVTGTKEVAGAIFASTMTTLAVFVPLVFVDGIAGALFSDQALTVTFALLASL 474
+ + +A ++ GA+ M AVF+P+ F G GA++ ++T+ A+ S+
Sbjct: 423 DKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSV 482

Query: 475 LVALTSIPMLASREGFTALPELIKKTPKEKPTTKLGKLKHYSATVFSFPIVLIFSYLPSV 534
LVAL P L + L+K E K
Sbjct: 483 LVALILTPALCAT--------LLKPVSAEHHENK-------------------------- 508

Query: 535 LLTLALVIGRFFSWLLGLVMRPLSSGFNFVYHAIESVYHKLLAMALRKQVATLLLTIGIT 594
G FF W FN + + Y + L LL+ I
Sbjct: 509 --------GGFFGW------------FNTTFDHSVNHYTNSVGKILGSTGRYLLIYALIV 548

Query: 595 GACISLLPRLGMELIPPMNQGEFYVEILLPPGTAVGETDKVLQQLAMSI--KDRPEVKHA 652
+ L RL +P +QG F I LP G T KVL Q+ ++ V+
Sbjct: 549 AGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNEKANVESV 608

Query: 653 YSQAGSGGLMTSDTARGGENWGRLQVVL---SDHTAYHQVTQVLRDTARRIPELEAKIEQ 709
++ G + +N G V L + + + A+ KI
Sbjct: 609 FTVNGFS------FSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMEL---GKIRD 659

Query: 710 PELFSFKTPLEIEL---SGYDLHLLKRSADNLVKALSASDRFA-----------DVNTSL 755
+ F P +EL +G+D L+ ++ A ++ V +
Sbjct: 660 GFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNG 719

Query: 756 RDGQPELSIRFDHARLAALGMDAPTVANRIAQRVGGTVASQYTVRDRKIDILVRSELDER 815
+ + + D + ALG+ + I+ +GGT + + R R + V+++ R
Sbjct: 720 LEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFR 779

Query: 816 DQISDIDALIINPNSQQPIALSAVAEVSLQLGPSAINRISQQRVALVSANLAYG-DLSDA 874
D+D L + + + + SA G + R + + A G DA
Sbjct: 780 MLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDA 839

Query: 875 VAEAQQILSAQVLPASVQARFGGQNEEMEHSFQSLKIALILAVFLVYLVMASQFESLLHP 934
+A + + S LPA + + G + + S + ++ +V+L +A+ +ES P
Sbjct: 840 MALMENLASK--LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIP 897

Query: 935 LLILFAVPMALAGSVLGLYITQTHLSVVVFIGLIMLAGIVVNNAIVLVDRINQL-RTEGV 993
+ ++ VP+ + G +L + V +GL+ G+ NAI++V+ L EG
Sbjct: 898 VSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGK 957

Query: 994 DKLEAIKVAAKSRLRPIMMTTLTTTLGLLPMALGLGDGSEVRAPMAITVIFGLSLSTLLT 1053
+EA +A + RLRPI+MT+L LG+LP+A+ G GS + + I V+ G+ +TLL
Sbjct: 958 GVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLA 1017

Query: 1054 LIVIPVLYALFDRK 1067
+ +PV + + R
Sbjct: 1018 IFFVPVFFVVIRRC 1031


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1629  RTXTOXIND     41     6e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 41.0 bits (96), Expect = 6e-06
Identities = 13/53 (24%), Positives = 29/53 (54%)

Query: 72 GLIEAINVEEGDRVQKGQILAVIDAKRQQYDLDRSEAEVKIIEQELNRLKKMS 124
+++ I V+EG+ V+KG +L + A + D ++++ + E R + +S
Sbjct: 105 SIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILS 157



Score = 40.2 bits (94), Expect = 9e-06
Identities = 36/202 (17%), Positives = 80/202 (39%), Gaps = 24/202 (11%)

Query: 91 LAVIDAKRQ----QYDLDRSEAEVKIIEQELNRLK---KMSNKEFIS--ADSMAKLEYNL 141
AV++ + + +L +++++ IE E+ K ++ + F + D + + N+
Sbjct: 252 HAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNI 311

Query: 142 QAAIAKRDLAELQVKESHVVSPINGIIAKRYVKAGNMAKEFGD-LFYIV-NQDELHGIVH 199
+ E + + S + +P++ + + V + L IV D L
Sbjct: 312 GLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTAL 371

Query: 200 LPEQQLTSLRLGQEAQV-FS--NQQSKNAIHAKVLRISP--VVDPQSGT-FKVTLAVP-- 251
+ + + + +GQ A + + KV I+ + D + G F V +++
Sbjct: 372 VQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEEN 431

Query: 252 -----NQNAHLKAGMFTRVELK 268
N+N L +GM E+K
Sbjct: 432 CLSTGNKNIPLSSGMAVTAEIK 453


23  Shewmr4_1684  Shewmr4_1721  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
Shewmr4_1684  -1      24       -7.173910   GAF sensor signal transduction histidine kinase
Shewmr4_1685  -1      28       -8.550205   glucose-1-phosphate thymidylyltransferase
Shewmr4_1686  1       39       -11.219784  dTDP-4-dehydrorhamnose 3,5-epimerase
Shewmr4_1688  2       38       -11.386138  hypothetical protein
Shewmr4_1689  1       29       -7.410287   hypothetical protein
Shewmr4_1690  1       23       -6.446241   hypothetical protein
Shewmr4_1691  1       19       -2.120056   hypothetical protein
Shewmr4_1692  3       24       1.529170    putative lipoprotein
Shewmr4_1693  2       23       1.275370    hypothetical protein
Shewmr4_1694  1       24       1.295819    hypothetical protein
Shewmr4_1695  3       23       -0.931221   hypothetical protein
Shewmr4_1696  2       26       -4.679042   patatin
Shewmr4_1697  1       31       -6.289595   hypothetical protein
Shewmr4_1698  2       34       -6.458177   prolyl-tRNA synthetase
Shewmr4_1699  2       35       -7.183776   hexapaptide repeat-containing transferase
Shewmr4_1700  3       37       -7.838141   hypothetical protein
Shewmr4_1701  3       34       -7.290985   putative lipoprotein
Shewmr4_1702  3       27       -6.465180   hypothetical protein
Shewmr4_1703  2       16       -2.248408   hypothetical protein
Shewmr4_1704  2       15       0.532971    hypothetical protein
Shewmr4_1705  1       14       0.102617    amidohydrolase
Shewmr4_1706  2       14       0.838643    hypothetical protein
Shewmr4_1707  2       14       0.758160    histone family protein nucleoid-structuring
Shewmr4_1708  3       14       0.701400    electron transfer flavoprotein beta-subunit
Shewmr4_1709  1       14       -0.763306   electron transfer flavoprotein subunit alpha
Shewmr4_1710  3       20       -4.016580   Na+/H+ antiporter NhaC
Shewmr4_1711  2       22       -4.059444   peptidyl-dipeptidase Dcp
Shewmr4_1712  2       27       -5.521702   hypothetical protein
Shewmr4_1713  2       27       -5.202329   hypothetical protein
Shewmr4_1714  2       25       -4.810560   hypothetical protein
Shewmr4_1715  1       26       -4.951228   thymidine kinase
Shewmr4_1716  2       28       -4.942266   hypothetical protein
Shewmr4_1717  2       28       -5.364563   two component, sigma54 specific, Fis family
Shewmr4_1718  1       27       -5.136402   periplasmic sensor signal transduction histidine
Shewmr4_1719  0       22       -4.454412   TRAP dicarboxylate transporter, DctM subunit
Shewmr4_1720  -1      21       -4.017970   hypothetical protein
Shewmr4_1721  -2      18       -3.201925   tripartite ATP-independent periplasmic
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1696  FLGPRINGFLGI  30     0.002    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 30.3 bits (68), Expect = 0.002
Identities = 12/43 (27%), Positives = 22/43 (51%), Gaps = 1/43 (2%)

Query: 4 TIKRNQPPAMPDRFYQVIEKELANVDPKDANAITLNFRDPDYS 46
T+ + + +IE+EL + KD+ + L R+PD+S
Sbjct: 162 TLTQGVTTSARVPNGAIIERELPS-KFKDSVNLVLQLRNPDFS 203


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1701  HTHFIS        326    e-108    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 326 bits (837), Expect = e-108
Identities = 120/344 (34%), Positives = 176/344 (51%), Gaps = 28/344 (8%)

Query: 192 FDDIITLDPEMLLLKAKAQVLASHEVSVLICGESGTGKEMFARAIHNASARRDKPFVAVN 251
++ M + L +++++I GESGTGKE+ ARA+H+ RR+ PFVA+N
Sbjct: 136 GMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAIN 195

Query: 252 CGAFPSELIDSILFGHKKGAFTGAVSDKVGVFELAHSGTLFLDEFGELDSSAQVRLLRVL 311
A P +LI+S LFGH+KGAFTGA + G FE A GTLFLDE G++ AQ RLLRVL
Sbjct: 196 MAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVL 255

Query: 312 QDGKFTRLGDSKERSSNFRLITATNRDLMADVSKGRFREDLFYRVAIGVLSLPPLRSRQS 371
Q G++T +G S+ R++ ATN+DL +++G FREDL+YR+ + L LPPLR R
Sbjct: 256 QQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAE 315

Query: 372 DLDHLADQFTAMLTQEYPSLGGKKISTAAKKIISNHRWPGNIRELKATILRAALWSETAV 431
D+ L F +E L K+ A +++ H WPGN+REL+ + R V
Sbjct: 316 DIPDLVRHFVQQAEKEG--LDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDV 373

Query: 432 LEDVDILRAILSTL---------QNSESILERD----------------ISKGVDIKSII 466
+ I + S + S S+ + ++
Sbjct: 374 ITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVL 433

Query: 467 DLVERHYLERGLAFTSGNKRKTALLLGYNNHQTLNNRLKKLGLE 510
+E + L T GN+ K A LLG N TL ++++LG+
Sbjct: 434 AEMEYPLILAALTATRGNQIKAADLLGL-NRNTLRKKIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1717  MALTOSEBP     28     0.033    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 28.2 bits (62), Expect = 0.033
Identities = 21/66 (31%), Positives = 29/66 (43%), Gaps = 4/66 (6%)

Query: 46 ALGQFANNFLQAVGILKPDEDVKQLGERALQAAQDGITPEM----YDDFDDYVQALRNFE 101
L + F + GI E +L E+ Q A G P++ +D F Y Q+ E
Sbjct: 45 GLAEVGKKFEKDTGIKVTVEHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAE 104

Query: 102 IDPDKA 107
I PDKA
Sbjct: 105 ITPDKA 110


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1718  PF07299       31     0.005    Fibronectin-binding protein (FBP)
		>PF07299#Fibronectin-binding protein (FBP)

Length = 219

Score = 30.6 bits (69), Expect = 0.005
Identities = 17/68 (25%), Positives = 29/68 (42%), Gaps = 7/68 (10%)

Query: 176 NAILRRVDTEQLMSYLPALFPRVFTVIDGNELALLAGRIEPFKIANPYPEPSPETLYQLQ 235
+ + EQ L V TV + + +I P+ I P+ E + +TL +L
Sbjct: 51 IHVFENLTDEQ-----KELIDTVLTVQNREDAESFLLKINPYVI--PFQEVTAQTLKKLF 103

Query: 236 RQFKKLPI 243
+ KKL +
Sbjct: 104 PKAKKLKL 111


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1719  RTXTOXIND     35     4e-04    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 34.8 bits (80), Expect = 4e-04
Identities = 12/75 (16%), Positives = 29/75 (38%), Gaps = 10/75 (13%)

Query: 104 ELQALEQQLAQAHGANSQLEQELQQLQ----------SKLYEQQNKNLTLENALTSATAK 153
E + ++ + + + L + EQ+NK + N L ++
Sbjct: 215 ERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQ 274

Query: 154 LKQLEAQYQQAQQAL 168
L+Q+E++ A++
Sbjct: 275 LEQIESEILSAKEEY 289


24  Shewmr4_1735  Shewmr4_1742  Y  N  N  Genomic Island
LocusTag      DNBias  CDNBias  %GCBias     Product
Shewmr4_1735  2       19       -0.064206   hypothetical protein
Shewmr4_1736  2       17       -0.376317   hypothetical protein
Shewmr4_1737  3       17       -0.144172   twin-arginine translocation pathway signal
Shewmr4_1738  2       16       0.689125    hypothetical protein
Shewmr4_1739  3       16       0.765829    hypothetical protein
Shewmr4_1740  2       16       1.588837    pseudouridylate synthase
Shewmr4_1741  3       17       1.251897    redoxin domain-containing protein
Shewmr4_1742  2       15       1.199371    hypothetical protein
25  Shewmr4_1783  Shewmr4_1799  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
Shewmr4_1783  2       15       1.564210    hypothetical protein
Shewmr4_1784  2       16       1.652104    diguanylate cyclase/phosphodiesterase with
Shewmr4_1785  0       15       0.460949    Insulysin
Shewmr4_1786  0       13       1.517235    phosphohistidine phosphatase, SixA
Shewmr4_1787  1       15       1.729908    hypothetical protein
Shewmr4_1788  2       14       1.733898    N5-glutamine S-adenosyl-L-methionine-dependent
Shewmr4_1789  2       16       2.169938    chorismate synthase
Shewmr4_1790  2       15       2.407716    major facilitator transporter
Shewmr4_1791  2       14       2.682107    ATP-NAD/AcoX kinase
Shewmr4_1792  0       19       2.416360    hypothetical protein
Shewmr4_1793  0       20       2.758283    tRNA/rRNA methyltransferase (SpoU)
Shewmr4_1794  1       20       3.139900    FAD dependent oxidoreductase
Shewmr4_1795  1       20       2.583456    3-oxoacyl-[acyl-carrier-protein] synthase I
Shewmr4_1796  2       19       1.208002    D-isomer specific 2-hydroxyacid dehydrogenase,
Shewmr4_1797  1       18       0.485944    aspartate semialdehyde dehydrogenase
Shewmr4_1798  3       18       -0.278905   hypothetical protein
Shewmr4_1799  2       18       -1.987998   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1796  cdtoxinb      28     0.034    Cytolethal distending toxin B signature.
		>cdtoxinb#Cytolethal distending toxin B signature.

Length = 269

Score = 27.7 bits (61), Expect = 0.034
Identities = 18/64 (28%), Positives = 28/64 (43%), Gaps = 1/64 (1%)

Query: 93 LSKERGGQALDCQCLGIIPTEIDELDRQTLKAEGLPLPHMGWNQLTFSNPSQVHPLFAGV 152
+S E L Q G P+ + + + G+P+ + WN T S P QV+ F+ V
Sbjct: 49 ISGENAVDILAVQEAGSPPSTAVDTGTL-IPSPGIPVRELIWNLSTNSRPQQVYIYFSAV 107

Query: 153 PAGS 156
A
Sbjct: 108 DALG 111


26  Shewmr4_1840  Shewmr4_1848  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
Shewmr4_1840  2       15       0.262490    anthranilate synthase component II
Shewmr4_1841  3       17       -2.996111   anthranilate synthase component I
Shewmr4_1842  2       18       -3.182996   phosphotransferase domain-containing protein
Shewmr4_1843  4       28       -7.375981   translation factor SUA5
Shewmr4_1844  5       32       -8.291779   condensin subunit ScpA
Shewmr4_1845  4       35       -9.174732   condensin subunit ScpB
Shewmr4_1846  2       27       -7.688780   ribosomal large subunit pseudouridine synthase
Shewmr4_1847  1       22       -4.557195   short chain dehydrogenase
Shewmr4_1848  1       22       -3.824841   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1847  ISCHRISMTASE  32     0.001    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 31.9 bits (72), Expect = 0.001
Identities = 31/137 (22%), Positives = 52/137 (37%), Gaps = 23/137 (16%)

Query: 2 KSAVLVIDVQSIL---FDPEPQPFESQIVLAKINEVTKLARAKSVPVIFIQHEQPKSVIE 58
++ +L+ D+Q+ F P + A I ++ +PV++ QP S
Sbjct: 30 RAVLLIHDMQNYFVDAFTAGASPVTE--LSANIRKLKNQCVQLGIPVVYTA--QPGSQNP 85

Query: 59 YESA------GWALQSS---------LVIQTGDYFVRKTTPDSFLNTNLQSVLNELDVDS 103
+ A G L S L + D + K +F TNL ++ + D
Sbjct: 86 DDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEGRDQ 145

Query: 104 LIVCG-YASEFCVDTTI 119
LI+ G YA C+ T
Sbjct: 146 LIITGIYAHIGCLVTAC 162


27  Shewmr4_1898  Shewmr4_1933  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
Shewmr4_1898  2       23       -1.273085   chromosome segregation protein SMC
Shewmr4_1899  1       20       -1.709033   cell division protein ZipA
Shewmr4_1900  0       21       -2.307713   DNA ligase, NAD-dependent
Shewmr4_1901  1       22       -2.732195   hypothetical protein
Shewmr4_1902  2       24       -3.060800   metal dependent phosphohydrolase
Shewmr4_1903  2       24       -3.163144   NAD-dependent epimerase/dehydratase
Shewmr4_1904  -1      14       -0.920381   DSBA oxidoreductase
Shewmr4_1905  -2      12       -0.024823   hypothetical protein
Shewmr4_1906  -3      11       -0.190904   hypothetical protein
Shewmr4_1907  -3      12       -0.443167   Na+/solute symporter
Shewmr4_1908  -3      13       -0.606652   cyclic nucleotide-binding protein
Shewmr4_1909  -2      13       -0.579711   exonuclease, RNase T and DNA polymerase III
Shewmr4_1910  -1      12       -1.067626   MarR family transcriptional regulator
Shewmr4_1911  3       19       -2.036790   alcohol dehydrogenase
Shewmr4_1912  2       19       -0.686751   hypothetical protein
Shewmr4_1913  1       19       -0.336488   hypothetical protein
Shewmr4_1914  4       24       0.178528    3-oxoacyl-(acyl carrier protein) synthase III
Shewmr4_1915  3       23       -0.103754   GntR family transcriptional regulator
Shewmr4_1916  2       21       0.165403    D,D-heptose 1,7-bisphosphate phosphatase
Shewmr4_1917  0       16       0.720355    **hypothetical protein
Shewmr4_1918  1       17       -0.008848   hypothetical protein
Shewmr4_1919  2       28       -0.767598   bifunctional 2',3'-cyclic nucleotide
Shewmr4_1920  0       26       -1.371425   bifunctional 2',3'-cyclic nucleotide
Shewmr4_1921  2       29       -1.605536   peptidyl-dipeptidase Dcp
Shewmr4_1922  1       27       -2.216531   hypothetical protein
Shewmr4_1923  4       31       -2.290149   prolyl 4-hydroxylase subunit alpha
Shewmr4_1924  3       26       -1.553680   GCN5-related N-acetyltransferase
Shewmr4_1925  -1      15       -0.613240   GCN5-related N-acetyltransferase
Shewmr4_1926  -2      15       -0.398666   hypothetical protein
Shewmr4_1927  -1      12       0.019123    hypothetical protein
Shewmr4_1928  -1      13       0.309394    hypothetical protein
Shewmr4_1929  0       14       -0.275605   hypothetical protein
Shewmr4_1930  -1      13       -0.791543   hypothetical protein
Shewmr4_1931  1       12       -0.354723   hypothetical protein
Shewmr4_1932  3       16       -0.985640   PKD domain-containing protein
Shewmr4_1933  2       16       -1.166438   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1904  HTHFIS        92     9e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 91.8 bits (228), Expect = 9e-24
Identities = 36/128 (28%), Positives = 63/128 (49%), Gaps = 7/128 (5%)

Query: 2 SRILLVDDDPLFRVWLTDALKTQGHEVECAINGIEGLKRIRSFMPDIIMLDLIMPQMDGF 61
+ IL+ DDD R L AL G++V N + I + D+++ D++MP + F
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 SLLE---ARDCMTPIMMLSARDNEEDRIRCYELGADDFLTKPFSIKELLVRLHALERRLI 118
LL P++++SA++ I+ E GA D+L KPF + EL+ + R +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELI----GIIGRAL 119

Query: 119 SRPPEQMA 126
+ P + +
Sbjct: 120 AEPKRRPS 127


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1907  SYCDCHAPRONE  26     0.028    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.
Length = 168

Score = 25.7 bits (56), Expect = 0.028
Identities = 7/34 (20%), Positives = 17/34 (50%)

Query: 1 MTDINQVIDQMPEEVYERLRSAAELGKWEDGTVL 34
+ +N++ E++Y + + GK+ED +
Sbjct: 25 IAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKV 58


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1925  DNABINDINGHU  113    1e-36    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 113 bits (284), Expect = 1e-36
Identities = 31/89 (34%), Positives = 56/89 (62%), Gaps = 1/89 (1%)

Query: 2 TKSELIEKLATRQSQLSAKEVEGAIKEMLEQMATTLESGDRIEIRGFGSFSLHYRAPRTG 61
K +LI K+A ++L+ K+ A+ + +++ L G+++++ GFG+F + RA R G
Sbjct: 3 NKQDLIAKVA-EATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKG 61

Query: 62 RNPKTGSSVELEGKYVPHFKPGKELRERV 90
RNP+TG ++++ VP FK GK L++ V
Sbjct: 62 RNPQTGEEIKIKASKVPAFKAGKALKDAV 90


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1929  DHBDHDRGNASE  96     1e-25    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.
Length = 261

Score = 95.5 bits (237), Expect = 1e-25
Identities = 73/258 (28%), Positives = 120/258 (46%), Gaps = 14/258 (5%)

Query: 10 QGKNVVVVGGTSGINLAIANAFALAGANVAVASRSQDKIDAAV--LQLKQSNPDGIHLGV 67
+GK + G GI A+A A GA++A + +K++ V L+ + + +
Sbjct: 7 EGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPA-- 64

Query: 68 SFDVRDLAAVEQGFDTIASEFGFIDVLVSGAAGNFPATAAKLSANGFKAVMDIDLLGSFQ 127
DVRD AA+++ I E G ID+LV+ A P LS ++A ++ G F
Sbjct: 65 --DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 128 VLK-TAYPLLRRPQGNIIQISAPQASIAMPMQAHVCAAKAGVDMLTRTLAIEWGCEGIRI 186
+ + ++ R G+I+ + + A + A ++KA M T+ L +E IR
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 187 NSIVPGPIAGTEGFNRLAPSAALQQGVAQS-------VPLKRNGEGQDIANAAMFLGSEL 239
N + PG ++ A +Q + S +PLK+ + DIA+A +FL S
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 240 ASYITGVVLPVDGGWSLG 257
A +IT L VDGG +LG
Sbjct: 243 AGHITMHNLCVDGGATLG 260


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1932  BLACTAMASEA   31     0.015    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 30.5 bits (69), Expect = 0.015
Identities = 35/158 (22%), Positives = 62/158 (39%), Gaps = 22/158 (13%)

Query: 22 LLLGSMLSILLPTQAIAAIHPLDESTFAKQIADIAPRHS-QVALLARDLSTNTLLYSQQA 80
++ + ++ L A +QI + S +V ++ DL++ L + +A
Sbjct: 7 CIISLLATLPLAVHASPQPL--------EQIKLSESQLSGRVGMIEMDLASGRTLTAWRA 58

Query: 81 DTLFIPASTQKVLT--AVTAMASLGPDF--RYVT----ELWSDAPIRQGHIAGSVYLRFS 132
D F ST KV+ AV A G + R + +L +P+ + H+A + +
Sbjct: 59 DERFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSPVSEKHLADGMTVGEL 118

Query: 133 GDPTLTQDDLKA---LFAHLQKQGITSIEGHLYLIGDK 167
+T D A L A + G + L IGD
Sbjct: 119 CAAAITMSDNSAANLLLATV--GGPAGLTAFLRQIGDN 154


28  Shewmr4_1943  Shewmr4_1957  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
Shewmr4_1943  0       22       -4.148943   hypothetical protein
Shewmr4_1944  -1      21       -3.758648   ribosome modulation factor
Shewmr4_1945  0       17       -2.236054   3-hydroxydecanoyl-(acyl carrier protein)
Shewmr4_1946  0       17       -2.163675   ATP-dependent protease
Shewmr4_1947  3       17       -2.137714   **response regulator
Shewmr4_1948  4       18       -1.355798   excinuclease ABC subunit C
Shewmr4_1950  0       15       -1.212895   CDP-diacylglycerol--glycerol-3-phosphate
Shewmr4_1951  0       15       -0.956286   ****histone family protein DNA-binding protein
Shewmr4_1952  2       22       -2.128391   hypothetical protein
Shewmr4_1953  2       30       -2.510063   ABC transporter-like protein
Shewmr4_1954  2       33       -2.129141   hypothetical protein
Shewmr4_1955  3       33       -2.514676   hypothetical protein
Shewmr4_1956  4       37       -2.893971   hypothetical protein
Shewmr4_1957  3       34       -3.180602   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1943  SURFACELAYER  32     0.003    Lactobacillus surface layer protein signature.
		>SURFACELAYER#Lactobacillus surface layer protein signature.

Length = 439

Score = 32.3 bits (73), Expect = 0.003
Identities = 14/55 (25%), Positives = 27/55 (49%), Gaps = 1/55 (1%)

Query: 246 SEPFAAYNAVKDWLNESKITEGHLFRSISRDGKTLRPYQVSDNVT-SKSSLIRNS 299
++ YN V +N +K+ G + + +GK Y +DN+ +K +L N+
Sbjct: 331 TDKVTRYNTVTVAMNTTKLANGISYYEVIENGKATGKYINADNIDGTKRTLKHNA 385


29  Shewmr4_2068  Shewmr4_2073  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
Shewmr4_2068  -1      14       -4.391537   leucyl/phenylalanyl-tRNA--protein transferase
Shewmr4_2069  1       14       -4.422840   17 kDa surface antigen
Shewmr4_2070  2       16       -5.182313   hypothetical protein
Shewmr4_2071  1       16       -4.982617   hypothetical protein
Shewmr4_2072  0       15       -4.869451   auxin efflux carrier
Shewmr4_2073  0       12       -3.076624   methionyl-tRNA synthetase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2072  FLAGELLIN     29     0.024    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 29.2 bits (65), Expect = 0.024
Identities = 14/71 (19%), Positives = 36/71 (50%), Gaps = 6/71 (8%)

Query: 19 SINDTATQLKTSSSAVSRTLKTLRHIFEDELLTRK--NGKMELTTKAIALREKVSRIIEE 76
+ ND + +T+ A++ L+ + E L+ + NG + ++++++ + +EE
Sbjct: 66 NANDGISIAQTTEGALNEINNNLQRVRE---LSVQATNGTNSDSDLK-SIQDEIQQRLEE 121

Query: 77 IESLTENESFN 87
I+ ++ FN
Sbjct: 122 IDRVSNQTQFN 132


30  Shewmr4_2087  Shewmr4_2095  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
Shewmr4_2087  2       17       0.167157    hypothetical protein
Shewmr4_2088  2       16       0.361489    hypothetical protein
Shewmr4_2089  2       16       0.075201    paraquat-inducible protein A
Shewmr4_2090  2       15       -1.571815   paraquat-inducible protein A
Shewmr4_2091  2       15       -2.046353   YebG family protein
Shewmr4_2092  1       17       -3.760361   putative GAF sensor protein
Shewmr4_2093  3       17       -1.591156   putative solute/DNA competence effector
Shewmr4_2094  2       17       -1.335181   carboxy-terminal protease
Shewmr4_2095  2       17       -1.143177   carboxy-terminal protease
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2087  HTHFIS        69     2e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 69.1 bits (169), Expect = 2e-14
Identities = 36/199 (18%), Positives = 70/199 (35%), Gaps = 19/199 (9%)

Query: 255 KILLVDDQQSMVDYFSSLLRSHGLMVKGMTKPEQVLPTLEQFEPDLFIFDLYMPDVNGLE 314
IL+ DD ++ + L G V+ + + + + DL + D+ MPD N +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 315 LAKMIRQLDKYSSSPILVLSSDDTMQNKVSIIQAGSDDLISKQTAP--SLFVTQVISRAQ 372
L I++ P+LV+S+ +T + + G+ D + K + +
Sbjct: 65 LLPRIKKARPD--LPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 373 RGHDIRSSASRDSLTGLLNHTQILVAARRCFNLAKRINSSVCIAMLDLDHFKQVNDTYGH 432
+ + L+ + + R + + ++ I G
Sbjct: 123 KRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMIT--------------GE 168

Query: 433 SGGDKVLLAFAHLLQQSLR 451
SG K L+A A L R
Sbjct: 169 SGTGKELVARA-LHDYGKR 186



Score = 56.0 bits (135), Expect = 2e-10
Identities = 26/123 (21%), Positives = 54/123 (43%), Gaps = 1/123 (0%)

Query: 131 RIAIIEDDNNVGAMITKQLHEFGFNVQHFLNFTDFLEIQNTSPFDLILLDLILPDYTEEA 190
I + +DD + ++ + L G++V+ N DL++ D+++PD
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 191 LFTAATEFEKHNTRVFVLSSRGDFEMRLLAIRANVSEYFVKPAETTLLVRKIHQWLKMSE 250
L + + + V V+S++ F + A +Y KP + T L+ I + L +
Sbjct: 65 LLPRIKK-ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 251 KQP 253
++P
Sbjct: 124 RRP 126


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2088  HTHFIS        67     2e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 67.2 bits (164), Expect = 2e-14
Identities = 30/107 (28%), Positives = 49/107 (45%), Gaps = 7/107 (6%)

Query: 3 IKVLVVDDSALIRNLLGQMIE-ADSELSLVGMAADAYMAKDMVNQHRPDVITLDIEMPKV 61
+LV DD A IR +L Q + A ++ + AA + D++ D+ MP
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATL---WRWIAAGDGDLVVTDVVMPDE 60

Query: 62 DGLTFLDRLMKARPTAVVMISSLTEEG-ADATFNALALGAVDFIPKP 107
+ L R+ KARP V++ ++ + A GA D++PKP
Sbjct: 61 NAFDLLPRIKKARPDLPVLV--MSAQNTFMTAIKASEKGAYDYLPKP 105


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2093  PF06580       38     9e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 38.3 bits (89), Expect = 9e-05
Identities = 15/70 (21%), Positives = 32/70 (45%), Gaps = 10/70 (14%)

Query: 452 EIDKGMIEKLVDPLT--HLVRNSLDHGIEKPEKRLAAGKTEAGVLSLKASQRGGSIVIAV 509
+I+ +++ V P+ LV N + HGI + + G + LK ++ G++ + V
Sbjct: 245 QINPAIMDVQVPPMLVQTLVENGIKHGIAQ--------LPQGGKILLKGTKDNGTVTLEV 296

Query: 510 HDDGGGLNRE 519
+ G +
Sbjct: 297 ENTGSLALKN 306


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2094  HTHFIS        86     5e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 86.0 bits (213), Expect = 5e-23
Identities = 30/122 (24%), Positives = 54/122 (44%), Gaps = 3/122 (2%)

Query: 1 MSK-KILIVDDSAAIRQMVEATLKSANYQVVLAKDGREALDLCGGQRFDFILTDQNMPRM 59
M+ IL+ DD AAIR ++ L A Y V + + D ++TD MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 60 DGLTLIKSLRGMSAFMRTPIVMLTTEAGEDMKAQGRAAGATGWMVKPFDPQKLLAITAKV 119
+ L+ ++ A P+++++ + + GA ++ KPFD +L+ I +
Sbjct: 61 NAFDLLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 120 LG 121
L
Sbjct: 119 LA 120


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2095  HTHFIS        81     3e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 80.6 bits (199), Expect = 3e-18
Identities = 32/104 (30%), Positives = 51/104 (49%)

Query: 8 ILVIDDDLVTNQILTAFIHSKGWGVITCCNLEEAYEEINQQNIELILLDYYLPDGTALTL 67
ILV DDD +L + G+ V N + I + +L++ D +PD A L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 68 LERLRYREPTVPVIVISADNEYQKILSCFRLGALDFIIKPINLE 111
L R++ P +PV+V+SA N + + GA D++ KP +L
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLT 109


31  Shewmr4_2107  Shewmr4_2115  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
Shewmr4_2107  3       16       0.494108    hypothetical protein
Shewmr4_2108  3       14       -1.300198   bifunctional acetaldehyde-CoA/alcohol
Shewmr4_2109  2       13       -1.572790   hypothetical protein
Shewmr4_2110  1       13       -2.770597   methyl-accepting chemotaxis sensory transducer
Shewmr4_2111  1       17       -3.681120   hypothetical protein
Shewmr4_2112  1       25       -5.624897   *phage integrase family protein
Shewmr4_2113  1       26       -5.549299   hypothetical protein
Shewmr4_2114  2       24       -4.405489   phage transcriptional regulator, AlpA
Shewmr4_2115  1       27       -4.480839   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2111  PF07520       31     0.020    Virulence protein SrfB
		>PF07520#Virulence protein SrfB

Length = 1041

Score = 30.7 bits (69), Expect = 0.020
Identities = 21/77 (27%), Positives = 32/77 (41%), Gaps = 7/77 (9%)

Query: 266 KAKRFTPAQDQQLQKYLVRRVLIEQDDNFKD---WADNLLPEMKSDDMFERRLRWAIREQ 322
K + F D + ++R+ ++D N D W + L EM D R +I E+
Sbjct: 175 KPREFRLVSDPGAMSWFLQRLEADEDGNAVDLQLWVSDWLKEMFLDFKRAERPGRSISEE 234

Query: 323 DN----THIARYLDLLS 335
+ H ARYL L
Sbjct: 235 NLPHMFEHWARYLSYLQ 251


32  Shewmr4_2136  Shewmr4_2152  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
Shewmr4_2136  -1      17       -3.058329   hypothetical protein
Shewmr4_2137  0       21       -4.449327   zeta toxin family protein
Shewmr4_2138  0       20       -4.436401   hypothetical protein
Shewmr4_2139  0       16       -4.184035   hypothetical protein
Shewmr4_2140  -1      15       -4.700199   hypothetical protein
Shewmr4_2141  -1      15       -1.199386   ATPase-like protein
Shewmr4_2142  -1      12       -0.628278   hypothetical protein
Shewmr4_2143  -1      11       -0.717015   hypothetical protein
Shewmr4_2144  -2      11       -1.273275   hypothetical protein
Shewmr4_2145  -1      11       -1.640387   hypothetical protein
Shewmr4_2146  -1      14       -2.305633   hypothetical protein
Shewmr4_2147  -1      17       -3.504893   hypothetical protein
Shewmr4_2148  -1      21       -4.349532   type 12 methyltransferase
Shewmr4_2149  0       23       -3.692036   IS4 family transposase
Shewmr4_2150  0       25       -4.434250   L-serine ammonia-lyase
Shewmr4_2151  0       23       -3.492739   beta-hexosaminidase
Shewmr4_2152  -1      24       -3.188153   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2147  GPOSANCHOR    37     3e-04    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 36.6 bits (84), Expect = 3e-04
Identities = 21/68 (30%), Positives = 38/68 (55%), Gaps = 7/68 (10%)

Query: 594 ALEEAKIQ-QAIAEQEAIAAQAKAA--EEAALAKAKAEAEAEAERQRL----EQEEQMKA 646
ALEEA + A+ + ++K +E A +AK EAEA+A +++L E+ +++A
Sbjct: 401 ALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKALKEKLAKQAEELAKLRA 460

Query: 647 SEQSQPET 654
+ S +T
Sbjct: 461 GKASDSQT 468



Score = 34.7 bits (79), Expect = 0.001
Identities = 21/82 (25%), Positives = 33/82 (40%), Gaps = 13/82 (15%)

Query: 596 EEAKIQQAIAEQEAIAAQAKAAEEAALAKAK--------AEAEAEAERQRLEQEEQMKAS 647
EA+ AE+ + Q A + A+ + EAE Q+LE EQ K S
Sbjct: 286 LEAEKAALEAEKADLEHQ-SQVLNANRQSLRRDLDASREAKKQLEAEHQKLE--EQNKIS 342

Query: 648 EQSQPETGSQEAIATSDESLAK 669
E S+ + + S E+ +
Sbjct: 343 EASR--QSLRRDLDASREAKKQ 362



Score = 32.0 bits (72), Expect = 0.008
Identities = 26/88 (29%), Positives = 43/88 (48%), Gaps = 10/88 (11%)

Query: 592 VEA-LEEAKIQQAIAEQEAIAAQAK--AAEEAALAKAKAEAEAEAERQRLEQ-----EEQ 643
+EA ++ + Q I+E + + A+ EA KA EA ++ LE+ EE
Sbjct: 363 LEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEES 422

Query: 644 MKASEQSQPETGSQ-EAIATS-DESLAK 669
K +E+ + E ++ EA A + E LAK
Sbjct: 423 KKLTEKEKAELQAKLEAEAKALKEKLAK 450



Score = 30.8 bits (69), Expect = 0.020
Identities = 16/61 (26%), Positives = 29/61 (47%), Gaps = 4/61 (6%)

Query: 611 AAQAKAAEEAALAKAKAEAE-AEAERQRLEQEEQMKASEQSQPETGSQEAIATSD-ESLA 668
+ +AK EA K + + + +EA RQ L + + AS +++ + A S +L
Sbjct: 356 SREAKKQLEAEHQKLEEQNKISEASRQSLRR--DLDASREAKKQVEKALEEANSKLAALE 413

Query: 669 K 669
K
Sbjct: 414 K 414


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2148  HTHTETR       38     8e-06    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 38.5 bits (89), Expect = 8e-06
Identities = 27/164 (16%), Positives = 56/164 (34%), Gaps = 7/164 (4%)

Query: 21 WEQRRDYLTQVALRSLRGHKTFDLCRSHLVQVSQISKGTIYNHFTTEADLIVAVASAQYD 80
++ R ++ VALR + + + +++G IY HF ++DL +
Sbjct: 9 AQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSES 68

Query: 81 EWLCAAKQ-DVQRYPDPFSRF--LYHHCFRLHQVLSQQRFVIERIMPNQTLLFEATESCR 137
+ + DP S + H ++R ++E I+ ++ +
Sbjct: 69 NIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLME-IIFHKCEFVGEMAVVQ 127

Query: 138 QRFETLFDEYHQWNRNTISEV---GDIPGFNRTELVMDYLRGAM 178
Q L E + T+ +P T +RG +
Sbjct: 128 QAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYI 171


33  Shewmr4_2219  Shewmr4_2242  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
Shewmr4_2219  0       19       -4.786619   hypothetical protein
Shewmr4_2220  0       30       -7.084460   hypothetical protein
Shewmr4_2221  0       29       -6.455101   hypothetical protein
Shewmr4_2222  0       29       -6.412199   hypothetical protein
Shewmr4_2223  0       29       -6.766598   hypothetical protein
Shewmr4_2224  1       29       -6.408546   aromatic amino acid transporter
Shewmr4_2225  0       25       -4.343807   hypothetical protein
Shewmr4_2226  0       23       -5.037725   bifunctional phosphoribosyl-AMP
Shewmr4_2227  -1      25       -7.292894   imidazole glycerol phosphate synthase subunit
Shewmr4_2228  2       39       -9.978676   1-(5-phosphoribosyl)-5-[(5-
Shewmr4_2229  5       46       -12.531071  imidazole glycerol phosphate synthase subunit
Shewmr4_2230  6       50       -13.963424  imidazole glycerol-phosphate
Shewmr4_2231  6       53       -14.068019  histidinol-phosphate aminotransferase
Shewmr4_2232  6       54       -12.647527  histidinol dehydrogenase
Shewmr4_2234  4       49       -11.510372  ATP phosphoribosyltransferase
Shewmr4_2235  4       47       -11.062656  hypothetical protein
Shewmr4_2236  1       45       -10.706502  hypothetical protein
Shewmr4_2237  2       31       -8.021841   acyltransferase domain-containing protein
Shewmr4_2238  0       26       -6.749136   hypothetical protein
Shewmr4_2240  -1      22       -5.979391   hypothetical protein
Shewmr4_2241  0       19       -5.389021   helicase c2
Shewmr4_2242  -1      14       -3.652680   TonB-dependent receptor
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
Shewmr4_2223HTHFIS682e-15 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 67.5 bits (165), Expect = 2e-15
Identities = 28/119 (23%), Positives = 51/119 (42%), Gaps = 2/119 (1%)

Query: 10 DKATILLIDDHPMLRNGVKQLIGMADNLCIVAEASCGKDGIILATQLDPDLILLDLNMPE 69
ATIL+ DD +R + Q + A + D DL++ D+ MP+
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSN--AATLWRWIAAGDGDLVVTDVVMPD 59

Query: 70 FNGLETLTKLRECELSSRIIVFTVSNYEGDIVNAFKYGADGYLLKDMEPEDLLQSIQQA 128
N + L ++++ ++V + N + A + GA YL K + +L+ I +A
Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2225  TCRTETB       29     0.033    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 29.5 bits (66), Expect = 0.033
Identities = 17/56 (30%), Positives = 29/56 (51%), Gaps = 1/56 (1%)

Query: 126 NTTPFSIFVIIALLCGFGGANF-ASSMANISFFYPKDKQGSALGLNGGLGNLGVSV 180
+ FS+ ++ + G G A F A M ++ + PK+ +G A GL G + +G V
Sbjct: 99 GHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGV 154


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2234  FIMBRIALPAPE  36     1e-04    Escherichia coli: P pili tip fibrillum papE protein...
		>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature.

Length = 173

Score = 35.8 bits (82), Expect = 1e-04
Identities = 39/133 (29%), Positives = 58/133 (43%), Gaps = 16/133 (12%)

Query: 216 TVNVGDF---NGVGTGAGEKNFNLNVTC--QSGT-KAYISFSDAYNSSNSSDSLSTLTST 269
VN GD N V +G +K+F +++ C GT K I+ + +S L TST
Sbjct: 45 EVNWGDIEIQNLVQSGGNQKDFTVDMNCPYSLGTMKVTITSNGQTGNS----ILVPNTST 100

Query: 270 TSADGVNIQILSNNWGG---AIVLNTPFYVVGADDSAAASYTIPFSAR--YIQVDPVLTA 324
S DG+ I + ++N G A+ L + G A + I A+ Y L A
Sbjct: 101 ASGDGLLIYLYNSNNSGIGNAVTLGSQV-TPGKITGTAPARKITLYAKLGYKGNMQSLQA 159

Query: 325 GTVEAKTIVVMTY 337
GT A +V +Y
Sbjct: 160 GTFSATATLVASY 172


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2235  PF00577       617    0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 617 bits (1593), Expect = 0.0
Identities = 224/855 (26%), Positives = 388/855 (45%), Gaps = 50/855 (5%)

Query: 13 SVFPIFFILFGSFTFAKEN--EMIEFDSAFFNVNDASLI--QKFAYSNYVPSGVYPADLF 68
+ F + + +F + F+ F + ++ +F +P G Y D++
Sbjct: 24 AGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIY 83

Query: 69 VNGSFLKKVSLEVKEDGGK--SRPCIDMNILNSLGIKESILIESDNLIKD-CIVLEEVIV 125
+N ++ + + PC+ L S+G+ + + + L D C+ L +I
Sbjct: 84 LNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTSMIH 143

Query: 126 GSAVNFTMGMSKLDIQVPQIYLDKVVRGYVDPAEWERGISAAYISYDLFGV---SPNDVK 182
+ +G +L++ +PQ ++ RGY+ P W+ GI+A ++Y+ G +
Sbjct: 144 DATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGGN 203

Query: 183 NFNAYFD--SGININGWYFKHKGMYSWNEYTGE-----QYNALDSYLYRPVASIKGNVVL 235
+ AY + SG+NI W + +S+N ++ ++++L R + ++ + L
Sbjct: 204 SHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLTL 263

Query: 236 GKKNTNGRLFDTVPLLGMQIYDDQNMLADSQKGYAPEIFGVAKTNAIVKVEQNGRIIYET 295
G T G +FD + G Q+ D NML DSQ+G+AP I G+A+ A V ++QNG IY +
Sbjct: 264 GDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNS 323

Query: 296 LVSPGEFLINDLYPTGYGGNLNVIILEADGTEQRFSVPYESMAQLLRPNTSRYSFSIGQY 355
V PG F IND+Y G G+L V I EADG+ Q F+VPY S+ L R +RYS + G+Y
Sbjct: 324 TVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEY 383

Query: 356 KNDRLS-FTPMLIEGTYELGINNFVTGYWGGQANEDFLSLQGGLALGT-SLGTFSFGGTQ 413
++ P + T G+ T Y G Q + + + G+ +LG S TQ
Sbjct: 384 RSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMTQ 443

Query: 414 SYAFL--DENYSGQSYELKYSKNIFSSGTNLSLGAYKYSTKEYMDYQTAMLYLDNIRRGY 471
+ + L D + GQS Y+K++ SGTN+ L Y+YST Y ++ N
Sbjct: 444 ANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIE 503

Query: 472 L--------------YNDLYNSKNRYLVTVGQSFPEGYGNLYLSGVFENFWNETNYSRQY 517
YN YN + + +TV Q LYLSG + +W +N Q+
Sbjct: 504 TQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGR-TSTLYLSGSHQTYWGTSNVDEQF 562

Query: 518 QVSYNNQYKRLSWGVSVSRNEDQYG-QYQTNYALSLSIPLGF------EGELRDSRVRLN 570
Q N ++ ++W +S S ++ + AL+++IP + + R + +
Sbjct: 563 QAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASASYS 622

Query: 571 LNRDSDGYQQAQVGLSDVVGEEKLLSYDVSVS-----NDESSNSFNGNASYVSPIAKLSA 625
++ D +G G+ + E+ LSY V + S ++ +Y +
Sbjct: 623 MSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANI 682

Query: 626 NASKGNNYKSYGFGLSGTLVGHSGGVTLSPYSGNTFALVEAKGMEGATVSGYKGIKINNL 685
S ++ K +G+SG ++ H+ GVTL +T LV+A G + A V G++ +
Sbjct: 683 GYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGVRTDWR 742

Query: 686 GHAVVPNLRPYQVNHIHVDPNGTSKNASLSNTVNKTVPYDSAIVKLQYGTEIGFSILINS 745
G+AV+P Y+ N + +D N + N L N V VP AIV+ ++ +G +L+
Sbjct: 743 GYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLMTL 802

Query: 746 SFKGDALPFGADVIDENGLNIGNVAQGSIVFARTKSSNGILKV--NIKNGEYCSIKYDVN 803
+ LPFGA V E+ + G VA V+ G ++V + +C Y +
Sbjct: 803 THNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHCVANYQLP 862

Query: 804 KNKSKGTLNILNVDC 818
+ L L+ +C
Sbjct: 863 PESQQQLLTQLSAEC 877


34  Shewmr4_2270  Shewmr4_2277  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
Shewmr4_2270  2       16       -2.203351   hypothetical protein
Shewmr4_2271  1       17       -2.327647   glucan biosynthesis protein G
Shewmr4_2272  2       18       -3.391410   glucan biosynthesis protein G
Shewmr4_2273  2       21       -4.207115   glucosyltransferase MdoH
Shewmr4_2274  0       22       -4.969972   peptidoglycan binding domain-containing protein
Shewmr4_2275  1       22       -4.552034   hypothetical protein
Shewmr4_2276  1       23       -3.529557   hypothetical protein
Shewmr4_2277  1       24       -3.064425   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2272  HTHFIS        45     3e-07    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 44.8 bits (106), Expect = 3e-07
Identities = 19/130 (14%), Positives = 49/130 (37%), Gaps = 18/130 (13%)

Query: 180 MPGKKVLIVDDSSTARRQVRETLGQLGIEIIEASDGLQALHLLQKWRDEGKNVAKELLMM 239
M G +L+ DD + R + + L + G ++ S+ + ++
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWI---AAGDGDLV------ 51

Query: 240 ITDAEMPEMDGYKLTYEVRNDKAMAD-LFITLNTSLSGSFNHAMVE--KVGCDRFISK-F 295
+TD MP+ + + L ++ + L ++ + ++ + G ++ K F
Sbjct: 52 VTDVVMPDENAFDLLPRIKKARPDLPVLVMSAQNTF-----MTAIKASEKGAYDYLPKPF 106

Query: 296 QPDLLVEVVQ 305
L+ ++
Sbjct: 107 DLTELIGIIG 116


35  Shewmr4_2292  Shewmr4_2304  Y  N  N  Genomic Island
LocusTag      DNBias  CDNBias  %GCBias     Product
Shewmr4_2292  2       13       0.364538    hypothetical protein
Shewmr4_2293  2       15       0.802416    periplasmic nitrate reductase subunit NapC
Shewmr4_2294  2       13       -0.072718   hypothetical protein
Shewmr4_2295  1       13       -0.396390   hypothetical protein
Shewmr4_2296  1       13       0.010262    ATP-dependent OLD family endonuclease
Shewmr4_2297  -1      12       0.083216    isochorismatase hydrolase
Shewmr4_2298  -1      11       -0.731618   hypothetical protein
Shewmr4_2299  0       11       -0.047431   GCN5-related N-acetyltransferase
Shewmr4_2300  2       12       0.676947    hypothetical protein
Shewmr4_2301  1       13       0.877587    3-deoxy-manno-octulosonate cytidylyltransferase
Shewmr4_2302  1       13       1.001420    iron-containing alcohol dehydrogenase
Shewmr4_2303  3       21       1.407701    DegT/DnrJ/EryC1/StrS aminotransferase
Shewmr4_2304  2       19       1.321353    carbonate dehydratase
36  Shewmr4_2321  Shewmr4_2330  Y  N  N  Genomic Island
LocusTag      DNBias  CDNBias  %GCBias     Product
Shewmr4_2321  1       20       3.592893    hypothetical protein
Shewmr4_2322  1       19       3.597346    AraC family transcriptional regulator
Shewmr4_2323  1       19       3.548048    glucan 1,4-alpha-glucosidase
Shewmr4_2324  1       16       2.018368    hypothetical protein
Shewmr4_2325  -1      11       -1.730366   major facilitator transporter
Shewmr4_2326  -1      17       -3.414128   YCII-like protein
Shewmr4_2327  -1      20       -4.872771   alpha/beta hydrolase fold domain-containing
Shewmr4_2328  -2      20       -4.796338   hypothetical protein
Shewmr4_2329  0       24       -5.422026   alkylhydroperoxidase
Shewmr4_2330  1       24       -4.363638   AraC family transcriptional regulator
37  Shewmr4_2400  Shewmr4_2407  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
Shewmr4_2400  4       18       0.655673    hypothetical protein
Shewmr4_2401  2       13       0.472587    phosphatidylserine synthase
Shewmr4_2402  3       16       0.361147    multidrug resistance protein D
Shewmr4_2403  3       15       0.162460    beta-lactamase
Shewmr4_2404  3       15       0.182288    beta-lactamase
Shewmr4_2405  1       14       0.060233    DTW domain-containing protein
Shewmr4_2406  0       14       -1.056169   RND family efflux transporter MFP subunit
Shewmr4_2407  2       16       -1.175125   hydrophobe/amphiphile efflux-1 (HAE1) family
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2405  IGASERPTASE   52     1e-08    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 52.0 bits (124), Expect = 1e-08
Identities = 46/263 (17%), Positives = 82/263 (31%), Gaps = 16/263 (6%)

Query: 838 PAQFVPNDALGDAQDYPAPTAQTAEAAETTEAKATEVAAEPTVTSPAAVVETADVSVVAE 897
P N + Q + + + E V PA + VAE
Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAE 1042

Query: 898 TNKAVE--TEVVESVATEQVA--KLKAEVAVTEAPADTAVNNKPETSVKAEDVAVEPAKA 953
+K E E ATE A + A+ A + A+T N ++ + ++ K
Sbjct: 1043 NSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKE 1102

Query: 954 VEAVTEAIVTPAETPVAQAKTETVAVEAVAPEVEKAEVKDAVKSVASAPMAKPAPIV--- 1010
V + ET +T V V +V + + + P + P V
Sbjct: 1103 TATVEKEEKAKVET------EKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIK 1156

Query: 1011 KPQAKPVVTQAVAAPAAETVVSKPKAASRFGSMVSSEMTKPTVEVRTQVEVPKGREYDNT 1070
+PQ++ T PA ET + + + + VE + N+
Sbjct: 1157 EPQSQTNTTADTEQPAKETSSNVEQPVT---ESTTVNTGNSVVENPENTTPATTQPTVNS 1213

Query: 1071 ASAEVTAPKLKQSNSAESDMARP 1093
S+ + ++S + P
Sbjct: 1214 ESSNKPKNRHRRSVRSVPHNVEP 1236



Score = 48.9 bits (116), Expect = 1e-07
Identities = 50/304 (16%), Positives = 95/304 (31%), Gaps = 18/304 (5%)

Query: 731 RRQPRKDAAVANETVEATDAASQVEANAEVSAEKPVVENRAEAPADVTEVKTEAETDADN 790
R Q + D S N E++ PA T +T ET A+N
Sbjct: 988 RNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPP---PAPATPSET-TETVAEN 1043

Query: 791 AELSADADDKAKRESRDGQRRSRRSPRHLRAAGQRRRRDEDEQGVSAPAQFVPNDALGDA 850
++ + +K ++++ + ++R A + + + + AQ +
Sbjct: 1044 SKQESKTVEKNEQDATETTAQNR------EVAKEAKSNVKANTQTNEVAQSGSETK--ET 1095

Query: 851 QDYPAPTAQTAEAAETTEAKATEVAAEPTVTSPAAVVETADVSVVAETNKAVETEVVESV 910
Q T E E + + + P VTS + + +V + A E + ++
Sbjct: 1096 QTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNI 1155

Query: 911 ATEQVAKLKAEVAVTEAPADTAVNN--KPETSVKAEDVAVEPAKAVEAVTEAIVTPAETP 968
++ A TE PA +N +P T + + E T A P
Sbjct: 1156 --KEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNS 1213

Query: 969 VAQAKTETVAVEAVAPEVEKAEVKDAVKSVASAPMAKPAPIVKPQAKPVVTQAVAAPAAE 1028
+ K + +V V+ A S + V++ A A
Sbjct: 1214 ESSNKPKNRHRRSVRS--VPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFV 1271

Query: 1029 TVVS 1032
+
Sbjct: 1272 ALNV 1275



Score = 45.1 bits (106), Expect = 2e-06
Identities = 59/335 (17%), Positives = 94/335 (28%), Gaps = 32/335 (9%)

Query: 531 QKVEQAAAPVANKVEATEPGFFSKLFSAIGAFFSSSDKPAAEKTTETKKSDSQTANANRR 590
Q V+ N ++A P S I + P A T S+T
Sbjct: 990 QTVDTTNITTPNNIQADVPSVPSN-NEEIARVDEAPVPPPAPAT------PSETTETVAE 1042

Query: 591 NRRNDTRRT-RNNQDADKAKEGTREPRTRNAKKSADAPQAQERPAREKDENAKRTKPEPK 649
N + +++ +N QDA + RE Q E A+ E + E K
Sbjct: 1043 NSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEV-AQSGSETKETQTTETK 1101

Query: 650 SRNQAPKEVVAESQDDTPKQEVARERRQRRNMRRKVRIDNANETTEAPIQEEVVLAEVAA 709
KE A+ + + QEV + Q + + AE A
Sbjct: 1102 ETATVEKEEKAKVETEK-TQEVPKVTSQVSPKQEQSE-------------TVQPQAEPAR 1147

Query: 710 VNAANTDVATEPQAETKAPRSRRQPRKDAAVANETVEATDAASQVEANAEVSAEKPVVEN 769
N T EPQ++T QP K+ + E + + E
Sbjct: 1148 ENDP-TVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPE--NTTP 1204

Query: 770 RAEAPADVTEVKTEAETDADNAELSADADDKAKRESRDGQRRSRRSPRHLRAAGQRRRRD 829
P +E + + + S + + S + RS L
Sbjct: 1205 ATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTS-----SNDRSTVALCDLTSTNTNA 1259

Query: 830 EDEQGVSAPAQFVPNDALGDAQDYPAPTAQTAEAA 864
A AQFV + + + E
Sbjct: 1260 VLSD-ARAKAQFVALNVGKAVSQHISQLEMNNEGQ 1293


38  Shewmr4_2444  Shewmr4_2467  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
Shewmr4_2444  3       15       -1.060820   hypothetical protein
Shewmr4_2445  2       15       -0.609586   hypothetical protein
Shewmr4_2446  1       15       -0.417123   putative DNA-binding transcriptional regulator
Shewmr4_2447  1       14       0.110321    inner membrane transport protein YdhC
Shewmr4_2448  0       13       0.464576    inner membrane transport protein YdhC
Shewmr4_2449  2       15       1.418221    *hypothetical protein
Shewmr4_2450  -2      16       0.223878    ribulokinase
Shewmr4_2451  -2      16       -0.072134   L-ribulose-5-phosphate 4-epimerase
Shewmr4_2452  -3      12       -0.893261   L-arabinose isomerase
Shewmr4_2453  -3      12       -2.436298   HAD family hydrolase
Shewmr4_2454  0       16       -3.657737   GntR family transcriptional regulator
Shewmr4_2455  0       20       -3.400343   glycoside hydrolase family protein
Shewmr4_2456  1       23       -3.805866   hypothetical protein
Shewmr4_2457  1       22       -3.970039   dihydroxy-acid dehydratase
Shewmr4_2458  5       36       -5.141588   short-chain dehydrogenase/reductase SDR
Shewmr4_2459  6       39       -5.093821   aldose 1-epimerase
Shewmr4_2460  5       34       -3.903075   periplasmic binding protein/LacI transcriptional
Shewmr4_2461  4       34       -4.598386   hypothetical protein
Shewmr4_2462  5       37       -6.130355   ABC transporter-like protein
Shewmr4_2463  4       37       -6.398487   inner-membrane translocator
Shewmr4_2464  0       28       -5.198843   inner membrane ABC transporter permease YjfF
Shewmr4_2465  -2      20       -3.747616   inner membrane ABC transporter permease protein
Shewmr4_2466  -3      19       -3.515346   Alpha-N-arabinofuranosidase
Shewmr4_2467  -2      18       -3.427839   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2449  RTXTOXIND     48     2e-07    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 47.9 bits (114), Expect = 2e-07
Identities = 19/200 (9%), Positives = 53/200 (26%), Gaps = 30/200 (15%)

Query: 519 QPAVSHEDLPTDLQLQAAQDAEALALDNLNKARAEYRGLQKQLEAQQQQANDLATALGDK 578
+ L + Q + A + + R ++ + + ++ +
Sbjct: 122 DVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSE 181

Query: 579 VELSLESHSQTLERLMQQAKQADDAAQALQLLQQQIKTLQQQESTLAQQLELERE----- 633
E+ + E+ Q L + + T+ + + +E+
Sbjct: 182 EEVLRLTSLIK-EQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDF 240

Query: 634 ------------RYREQEGKVERLSGQLAEKALRIPEEYRTLDVLNQAIANNQQQLEQI- 680
EQE K +L ++ + + I + +++ + +
Sbjct: 241 SSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQ-------IESEILSAKEEYQLVT 293

Query: 681 ----KRQIDVLRTAQQQATQ 696
+D LR
Sbjct: 294 QLFKNEILDKLRQTTDNIGL 313



Score = 40.6 bits (95), Expect = 3e-05
Identities = 36/214 (16%), Positives = 72/214 (33%), Gaps = 25/214 (11%)

Query: 192 DTLKAKAADIRNLVKEQRARRDGILQTAALTSDDELTAELSRIEPEFAAATAAKEQSVAA 251
DTLK +++ ++ +++ R + L+ EL P+ E+ V
Sbjct: 135 DTLKTQSSLLQARLEQTRYQ--------ILSRSIELNKLPELKLPDEPYFQNVSEEEVLR 186

Query: 252 HLAALKQRDSAQQLFAEFTRLQELQAEALSLNEQQAQIATQTTRLEVAKQALRV------ 305
+ +K+ Q + + + L++++A+ T R+ + RV
Sbjct: 187 LTSLIKE-----QFSTWQNQKYQKELN---LDKKRAERLTVLARINRYENLSRVEKSRLD 238

Query: 306 --KPLLDNALSREQEASVAAAQRDSAQLTLDAAKLALSHAETAAQEIIPLEHKLREVEQQ 363
LL + + A L K L E+ E++L +
Sbjct: 239 DFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAK-EEYQLVTQLFK 297

Query: 364 NSHLSALVPQLAEFASLEQALAQAKEILQHTKLQ 397
N L L L LA+ +E Q + ++
Sbjct: 298 NEILDKLRQTTDNIGLLTLELAKNEERQQASVIR 331



Score = 30.2 bits (68), Expect = 0.044
Identities = 28/200 (14%), Positives = 65/200 (32%), Gaps = 16/200 (8%)

Query: 617 LQQQESTLAQQLELERERYREQEGKVERLSGQLAEKALRIPEEYRTLDVLNQAIANNQQQ 676
+ +S+L Q LE+ RY+ +E + + + + + + + ++Q
Sbjct: 136 TLKTQSSLLQAR-LEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQ 194

Query: 677 LEQIKRQIDVLRTAQQQATQQSVAAQTALSAAIERCHATVELQAQAQQTLQTALDNAGFI 736
+ Q + + + A I R ++ + L + I
Sbjct: 195 FSTWQNQKYQKELNLDKKRAERLTVL----ARINRYENLSRVEKSRLDDFSS-LLHKQAI 249

Query: 737 DRDALREALLTDEQMQTLAEGKETYHRQCALNQSQLTQLTTKLSESTSPDLDALEALLTE 796
+ A+ EQ E + + +SQL Q+ +++ + + E
Sbjct: 250 AKHAVL------EQENKYVE----AVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNE 299

Query: 797 RLAQLKNAEELWSQLNTRLT 816
L +L+ + L L
Sbjct: 300 ILDKLRQTTDNIGLLTLELA 319


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2459  SYCDCHAPRONE  29     0.019    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 29.1 bits (65), Expect = 0.019
Identities = 16/83 (19%), Positives = 30/83 (36%), Gaps = 1/83 (1%)

Query: 198 YFNQKKYKKAVGVLEVMVPLFPDDGRLWVQLAQFYLMVEDYDKSLATYDLAYRNGFLDTG 257
+ KY+ A V + + L D R ++ L + YD ++ +Y +
Sbjct: 46 QYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPR 105

Query: 258 ANITRLAQLMAQKGAPYKAAKVF 280
A+ + QKG +A
Sbjct: 106 FPFHA-AECLLQKGELAEAESGL 127


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2460  PF03544       108    2e-31    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 108 bits (272), Expect = 2e-31
Identities = 37/169 (21%), Positives = 64/169 (37%), Gaps = 10/169 (5%)

Query: 39 TPVIEITMDRQDSKAQNKPRVVPKPPPPPEQPQKPDTTPPDTSSNID----TSMSFNMGG 94
P E + K PKP P P+ P S N
Sbjct: 75 EPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAP 134

Query: 95 VEAGGHGTGGFKLGNMMTRDGDATPIVRIEPQYPIAAARDGKEGWVQLRFTINELGGVDD 154
+ + + R +PQYP A EG V+++F + G VD+
Sbjct: 135 ARPTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKFDVTPDGRVDN 194

Query: 155 VEIINAEPKRLFDKEAIRALKKWKYKPKIVDGKPLKQPGMTVQLDFTLD 203
V+I++A+P +F++E A+++W+Y+P G+ V + F ++
Sbjct: 195 VQILSAKPANMFEREVKNAMRRWRYEP------GKPGSGIVVNILFKIN 237


39  Shewmr4_2488  Shewmr4_2494  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
Shewmr4_2488  2       19       -0.644535   response regulator receiver modulated metal
Shewmr4_2489  4       21       -0.633237   hypothetical protein
Shewmr4_2490  4       23       -0.995483   hypothetical protein
Shewmr4_2491  5       28       -1.227270   cbb3-type cytochrome c oxidase subunit I
Shewmr4_2492  4       28       -1.068159   cbb3-type cytochrome c oxidase subunit II
Shewmr4_2493  2       25       -0.787094   hypothetical protein
Shewmr4_2494  2       31       -1.563021   Cbb3-type cytochrome oxidase component
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2488  HTHFIS        29     0.020    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.0 bits (65), Expect = 0.020
Identities = 13/28 (46%), Positives = 16/28 (57%)

Query: 42 TLAIVGEAGSGKSTLARILVGAEPRSGG 69
TL I GE+G+GK +AR L R G
Sbjct: 162 TLMITGESGTGKELVARALHDYGKRRNG 189


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2492  DNABINDINGHU  119    4e-39    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 119 bits (300), Expect = 4e-39
Identities = 53/88 (60%), Positives = 69/88 (78%)

Query: 2 NKSELIEKIASGADISKAAAGRALDSFIAAVTEGLKEGDKISLVGFGTFEVRERAERTGR 61
NK +LI K+A +++K + A+D+ +AV+ L +G+K+ L+GFG FEVRERA R GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 62 NPQTGEEIKIAAAKIPAFKAGKALKDAV 89
NPQTGEEIKI A+K+PAFKAGKALKDAV
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2493  PF05272       33     0.006    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 32.7 bits (74), Expect = 0.006
Identities = 15/86 (17%), Positives = 35/86 (40%), Gaps = 6/86 (6%)

Query: 286 AEATVVRSYVDWMTSVPWSQRSKIKRDLAKAQEVLDTDHYGLEKVKDRILEYLAVQSRVR 345
A+ V + DW+ + W + ++++ L D+ +++ + V
Sbjct: 527 ADMNRVHPFRDWVKAQQWDEVPRLEKWLVHVLGKTPDDYKPRRLRYLQLVGKYILMGHVA 586

Query: 346 QLKGP------ILCLVGPPGVGKTSL 365
++ P + L G G+GK++L
Sbjct: 587 RVMEPGCKFDYSVVLEGTGGIGKSTL 612


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2494  HTHFIS        30     0.017    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.2 bits (68), Expect = 0.017
Identities = 14/70 (20%), Positives = 28/70 (40%), Gaps = 13/70 (18%)

Query: 64 KLPTPHELRAHLDDYVIGQDRAKKVLSVAVYNHYKRLRNSSPKDGVELGKSNILLIGPTG 123
+ P+ E + ++G+ A +Y RL + +++ G +G
Sbjct: 124 RRPSKLEDDSQDGMPLVGRSAA----MQEIYRVLARLMQT---------DLTLMITGESG 170

Query: 124 SGKTLLAETL 133
+GK L+A L
Sbjct: 171 TGKELVARAL 180


40  Shewmr4_2504  Shewmr4_2510  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
Shewmr4_2504  2       16       -0.348944   C32 tRNA thiolase
Shewmr4_2505  3       19       -1.324921   hypothetical protein
Shewmr4_2506  4       20       -1.101932   hypothetical protein
Shewmr4_2507  4       20       -1.031369   bax protein
Shewmr4_2508  5       21       -0.753449   hypothetical protein
Shewmr4_2509  5       22       -0.892063   hypothetical protein
Shewmr4_2510  4       15       -0.029010   aromatic amino acid aminotransferase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2504  TCRTETOQM     39     6e-05    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 39.1 bits (91), Expect = 6e-05
Identities = 37/148 (25%), Positives = 58/148 (39%), Gaps = 47/148 (31%)

Query: 14 NAGKSTLFNAL---TGANQQVG---------NW------SGVTVEKKTGHFTLNGADVYL 55
+AGK+TL +L +GA ++G + G+T++ F V +
Sbjct: 13 DAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQWENTKVNI 72

Query: 56 TDLPGIYDLLPAGNSCDCSLDEQIAQQYLAEQRVDGIINLVDA-------TNIERHLYLT 108
D PG D +A+ Y + +DG I L+ A T I L
Sbjct: 73 IDTPGHMDF--------------LAEVYRSLSVLDGAILLISAKDGVQAQTRI-----LF 113

Query: 109 AQLRELSIPMVVVLNKIDAAIKRGIRVD 136
LR++ IP + +NKID GI +
Sbjct: 114 HALRKMGIPTIFFINKIDQN---GIDLS 138


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2508  INTIMIN       35     0.001    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 35.0 bits (80), Expect = 0.001
Identities = 23/114 (20%), Positives = 40/114 (35%), Gaps = 13/114 (11%)

Query: 325 PSISSASMDANGTVTVAVTLSNPSTGTV--------YSDSADKLKFISDLRVYAN----W 372
S S+ D NG V +T + P V A +++F + L +
Sbjct: 705 LSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIV 764

Query: 373 GTSFDYSTRSARSIRLPESTPVSGNNGTYTYTISGLTVPAGTEADHGGLAIQGR 426
GT + + SG NG YT+ + A +A G + ++ +
Sbjct: 765 GTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSAN-PAIASVDASSGQVTLKEK 817



Score = 33.1 bits (75), Expect = 0.004
Identities = 22/81 (27%), Positives = 34/81 (41%), Gaps = 5/81 (6%)

Query: 325 PSISSASMDANGTVTVAVTLSNPSTGTVYSDSADKLKFISDLRVYANWGTSFDYSTRSAR 384
S +SA+ + +G TV + P V SA + S L AN D + S
Sbjct: 607 LSANSANTNGSGKATVTLKSDKPGQVVV---SAKTAEMTSALN--ANAVIFVDQTKASIT 661

Query: 385 SIRLPESTPVSGNNGTYTYTI 405
I+ ++T V+ TYT+
Sbjct: 662 EIKADKTTAVANGQDAITYTV 682


41  Shewmr4_2520  Shewmr4_2531  Y  N  N  Genomic Island
LocusTag      DNBias  CDNBias  %GCBias     Product
Shewmr4_2520  2       25       -3.485602   3-methyl-2-oxobutanoate dehydrogenase
Shewmr4_2521  5       32       -4.743329   succinylglutamate desuccinylase
Shewmr4_2522  0       24       -2.281930   peptide methionine sulfoxide reductase
Shewmr4_2523  2       21       0.961528    phosphoglucomutase
Shewmr4_2524  2       21       1.204103    replication initiation regulator SeqA
Shewmr4_2525  4       18       1.540600    alpha/beta hydrolase fold domain-containing
Shewmr4_2526  2       17       2.886672    hypothetical protein
Shewmr4_2527  2       17       3.253314    LexA regulated protein
Shewmr4_2528  1       18       2.722584    flavodoxin FldA
Shewmr4_2529  1       19       2.895622    hypothetical protein
Shewmr4_2530  1       15       3.639631    elongation factor P
Shewmr4_2531  0       15       3.780269    Na+/H+ antiporter NhaC
42  Shewmr4_2569  Shewmr4_2594  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
Shewmr4_2569  -1      13       3.332309    LysR family transcriptional regulator
Shewmr4_2570  -2      15       4.645450    radical SAM domain-containing protein
Shewmr4_2571  -1      25       5.457262    sulfatase
Shewmr4_2572  1       25       5.454759    hypothetical protein
Shewmr4_2573  0       24       5.410435    hypothetical protein
Shewmr4_2574  1       21       5.721139    PepSY-associated TM helix domain-containing
Shewmr4_2575  2       17       2.573178    hypothetical protein
Shewmr4_2576  2       19       1.203764    beta-lactamase domain-containing protein
Shewmr4_2577  1       19       0.803241    hypothetical protein
Shewmr4_2578  2       19       0.063126    hypothetical protein
Shewmr4_2579  2       18       0.082890    hypothetical protein
Shewmr4_2580  2       17       -0.934407   AraC family transcriptional regulator
Shewmr4_2581  -1      14       -0.501922   diguanylate cyclase
Shewmr4_2582  -1      14       1.272205    TonB-dependent receptor
Shewmr4_2583  -1      14       1.183153    hypothetical protein
Shewmr4_2584  -1      15       2.592092    3-phytase
Shewmr4_2585  -1      15       2.229311    hypothetical protein
Shewmr4_2586  -2      15       2.410092    ABC transporter-like protein
Shewmr4_2587  0       17       3.169243    activator of Hsp90 ATPase family protein
Shewmr4_2588  -1      15       1.965817    hypothetical protein
Shewmr4_2589  0       14       2.518955    response regulator receiver modulated
Shewmr4_2590  -1      13       3.080548    response regulator receiver modulated CheB
Shewmr4_2591  -2      14       3.477556    chemoreceptor glutamine deamidase CheD
Shewmr4_2592  -2      14       3.753921    MCP methyltransferase, CheR-type
Shewmr4_2593  -2      14       3.045544    methyl-accepting chemotaxis sensory transducer
Shewmr4_2594  -2      14       3.051487    CheW protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2570  SUBTILISIN    142    3e-40    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 142 bits (359), Expect = 3e-40
Identities = 71/210 (33%), Positives = 97/210 (46%), Gaps = 24/210 (11%)

Query: 126 AGMKVCIIDSGLDSSNPDFNWNNITG----DSDPGTGNWFQNGGPHGTHVAGTIGAADNN 181
G+KV ++D+G D+ +PD I G D D G F++ HGTHVAGTI A +N
Sbjct: 41 RGVKVAVLDTGCDADHPDLKARIIGGRNFTDDDEGDPEIFKDYNGHGTHVAGTIAATENE 100

Query: 182 IGVVGMAPGVPMHIVKVFNASGWGYSSDLAYAANKCSNAGAKIISMSLGGGAANNTEKNA 241
GVVG+AP + I+KV N G G + IISMSLGG A
Sbjct: 101 NGVVGVAPEADLLIIKVLNKQGSGQYDWIIQGIYYAIEQKVDIISMSLGGPEDVPELHEA 160

Query: 242 FDAFTAAGGLVVAAAGNDGNSVRS-----YPAGYPSVMMIGANDANNNIADFSQYPSCVS 296
A+ LV+ AAGN+G+ YP Y V+ +GA + + + ++FS
Sbjct: 161 VKKAVASQILVMCAAGNEGDGDDRTDELGYPGCYNEVISVGAINFDRHASEFSNS----- 215

Query: 297 GRGKKAVNDDGICVEVTAGGVDTLSTYPAG 326
V++ A G D LST P G
Sbjct: 216 ----------NNEVDLVAPGEDILSTVPGG 235



Score = 53.3 bits (128), Expect = 8e-10
Identities = 19/70 (27%), Positives = 27/70 (38%), Gaps = 7/70 (10%)

Query: 447 YGFMSGTSMATPAVSGMAALVWSN-----HSQCTGTQIRKALKATAMDAGTVGKDNYFGY 501
Y SGTSMATP V+G AL+ T ++ L + G G
Sbjct: 237 YATFSGTSMATPHVAGALALIKQLANASFERDLTEPELYAQLIKRTIPLG--NSPKMEGN 294

Query: 502 GIVNAKAADA 511
G++ A +
Sbjct: 295 GLLYLTAVEE 304


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2573  ABC2TRNSPORT  40     1e-05    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 39.5 bits (92), Expect = 1e-05
Identities = 44/166 (26%), Positives = 78/166 (46%), Gaps = 23/166 (13%)

Query: 186 GVILTMTMVMFT----SAAIVREREQGNMEFLITTPVRPLELMLGKI----TPYVLVGFV 237
G++ T M T AA R Q E ++ T +R +++LG++ T L G
Sbjct: 72 GMVATSAMTAATFETIYAAFGRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAG 131

Query: 238 QLAIILTAGH-----LLFAVPIRGGLDSIALAAMLFICASLTLGLVISTIAKTQLQSMQM 292
+ G+ LL+A+P+ IAL + F +LG+V++ +A + +
Sbjct: 132 IGVVAAALGYTQWLSLLYALPV------IALTGLAFA----SLGMVVTALAPSYDYFIFY 181

Query: 293 TVFILLPSILLSGFMFPYEAMPVAAQWIAEALPATHFMRMSRAIVL 338
++ P + LSG +FP + +P+ Q A LP +H + + R I+L
Sbjct: 182 QTLVITPILFLSGAVFPVDQLPIVFQTAARFLPLSHSIDLIRPIML 227


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2575  RTXTOXIND     58     2e-11    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 57.9 bits (140), Expect = 2e-11
Identities = 35/180 (19%), Positives = 59/180 (32%), Gaps = 11/180 (6%)

Query: 17 PSSRGWGKLLASLLGAALLLQLTACGDESPRVLGTV--ERDRLTLTAPVGELINRINVVE 74
R + L A +L + + G + + ++ I V E
Sbjct: 55 RRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKE 114

Query: 75 GQQVQAGEVLLELDSTAAQARLGQRQAELKQA-------QAKLEEAVTGARSEDIDKARA 127
G+ V+ G+VLL+L + A+A + Q+ L QA Q E
Sbjct: 115 GESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEP 174

Query: 128 ALDGANASVKEARQNFERTQ--QLFKTKVLSQADLDAARAARDTSLAKQAEAEQSLRLLQ 185
+ + + Q K + +LD RA R T LA+ E R+ +
Sbjct: 175 YFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEK 234



Score = 49.8 bits (119), Expect = 8e-09
Identities = 27/231 (11%), Positives = 75/231 (32%), Gaps = 24/231 (10%)

Query: 84 LLELDSTAAQARLGQRQAELKQAQAKLEEAVTGARSEDIDKARAALDGANASVKEARQNF 143
+ E + + + ++ + + + + E + R+E A ++ + +
Sbjct: 179 VSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAE-RLTVLARINRYENLSRVEKSRL 237

Query: 144 ERTQQLFKTKVLS--------------QADLDAARAARDTSLAKQAEAEQSLRLLQNGTR 189
+ L + ++ +L ++ + ++ A++ +L+ +
Sbjct: 238 DDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFK 297

Query: 190 SEQLEQARAAVEAAMAGVAQEQKALKDLSLVAAK-PA---VVDTLPWRVGDRVAAGSQLI 245
+E L++ R + + K + + P V G V L+
Sbjct: 298 NEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLM 357

Query: 246 GLLAIEHPY-VRVYLPATWLDRVKAGNQVKILVDG----RTQPIAGTVRNI 291
++ + V + + + G I V+ R + G V+NI
Sbjct: 358 VIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNI 408


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2576  HTHTETR       73     9e-18    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 72.7 bits (178), Expect = 9e-18
Identities = 24/151 (15%), Positives = 55/151 (36%), Gaps = 6/151 (3%)

Query: 31 SDARQRLITAAVSLFSERSYPTVSTREIARVAEVDAALIRYYFGSKAGLFEQMVRETLEP 90
+ RQ ++ A+ LFS++ + S EIA+ A V I ++F K+ LF ++ +
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 91 VITRLREVSTAQAPNN---VGDLMQTYYRVMAPNPGLPRLIVRVLQESDGTEAYRIMLSV 147
+ E + + +++ L+ + + + ++
Sbjct: 70 IGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQA 129

Query: 148 FEQMLSLSRQWLEASF---VSAGILKEGLDP 175
+ S +E + + A +L L
Sbjct: 130 QRNLCLESYDRIEQTLKHCIEAKMLPADLMT 160


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2585  BCTLIPOCALIN  258    2e-91    Bacterial lipocalin signature.
		>BCTLIPOCALIN#Bacterial lipocalin signature.

Length = 171

Score = 258 bits (659), Expect = 2e-91
Identities = 112/171 (65%), Positives = 145/171 (84%)

Query: 1 MKKLLLLISVLVLSGCLGMPRNVEPVKDFQLERYLGKWYEIARLDHSFERGLTQVTAEYS 60
M+ + L++ ++L+GCLGMP +V+PV DF+L YLGKWYE+ARLDHSFERGL+QVTAEY
Sbjct: 1 MRAIFLILCSVLLNGCLGMPESVKPVSDFELNNYLGKWYEVARLDHSFERGLSQVTAEYR 60

Query: 61 LKADGGVKVINRGYSADTQQWKEAEGKAYFVNGDEEAYLKVSFFGPFYGSYVVFGLDQQD 120
++ DGG+ V+NRGYS + +WKEAEGKAYFVNG + YLKVSFFGPFYGSYVVF LD+++
Sbjct: 61 VRNDGGISVLNRGYSEEKGEWKEAEGKAYFVNGSTDGYLKVSFFGPFYGSYVVFELDREN 120

Query: 121 YQYAFISGPDTDYLWLLARTPTVSPEVMKQFVEMASARGFDTNSLIYVEQK 171
Y YAF+SGP+T+YLWLL+RTPTV ++ +F+EM+ RGFDTN LIYV+Q+
Sbjct: 121 YSYAFVSGPNTEYLWLLSRTPTVERGILDKFIEMSKERGFDTNRLIYVQQQ 171


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2588  FbpA_PF05833  27     0.032    Fibronectin-binding protein
		>FbpA_PF05833#Fibronectin-binding protein

Length = 577

Score = 26.8 bits (59), Expect = 0.032
Identities = 5/42 (11%), Positives = 16/42 (38%)

Query: 75 NAEDVKQLTRNHLAYVEEQISKLQNLRSQLQQMVSECQGGEQ 116
++ +K + + V I++ L + +C+ +
Sbjct: 293 KSDRLKSKSSDLQKIVMNNINRCTKKDKILNNTLKKCEDKDI 334


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2591  DHBDHDRGNASE  107    5e-30    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 107 bits (267), Expect = 5e-30
Identities = 74/263 (28%), Positives = 126/263 (47%), Gaps = 22/263 (8%)

Query: 3 LKDKVVVITGGAGGLGLAMAHNFAQAGAKLALIDVDQEKLERACADLGS-ATEVQGYALD 61
++ K+ ITG A G+G A+A A GA +A +D + EKLE+ + L + A + + D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 62 ITDEEDVVAGFAYILEDFGKINVLVNNAGILRDGMLIKAKDGKVTDRMSFDQFQSVINVN 121
+ D + A I + G I++LVN AG+LR G+ +S +++++ +VN
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGL---------IHSLSDEEWEATFSVN 116

Query: 122 LTGTFLCGREAAAAMIESGQAGVIVNISSLAKAGNVGQSNYAASKAGVAAMSVGWAKELA 181
TG F R + M++ ++ S+ A + YA+SKA + ELA
Sbjct: 117 STGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELA 176

Query: 182 RYNIRSAAVAPGVIATEMTAAMKPE----------ALERLEKLVPVGRLGQAEEIASTVR 231
YNIR V+PG T+M ++ + +LE + +P+ +L + +IA V
Sbjct: 177 EYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVL 236

Query: 232 FIIEND--YVNGRVFEVDGGIRL 252
F++ ++ VDGG L
Sbjct: 237 FLVSGQAGHITMHNLCVDGGATL 259


43  Shewmr4_2662  Shewmr4_2683  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr4_2662       3       14    2.616437  TetR family transcriptional regulator
Shewmr4_2663       2       13    2.950560  LysR family transcriptional regulator
Shewmr4_2664       2       13    3.010504  hypothetical protein
Shewmr4_2665       2       13    2.694452  hypothetical protein
Shewmr4_2666       2       13    2.676418  hypothetical protein
Shewmr4_2667       2       14    1.916900  hypothetical protein
Shewmr4_2668       2       13    2.260093  tetraheme cytochrome c
Shewmr4_2669       1       15    1.079264  hypothetical protein
Shewmr4_2670       1       16   -0.032836  flavocytochrome c
Shewmr4_2671       0       18    0.914127  hypothetical protein
Shewmr4_2672      -1       19    1.034820  alcohol dehydrogenase
Shewmr4_2673       0       18    0.812598  diguanylate cyclase/phosphodiesterase with
Shewmr4_2674      -1       16    0.021543  **hypothetical protein
Shewmr4_2675      -1       17   -2.215771  DNA polymerase III subunit epsilon
Shewmr4_2676       2       19   -3.229923  ribonuclease H
Shewmr4_2677       2       19   -3.263537  LysR family transcriptional regulator
Shewmr4_2678       2       18   -4.432130  type 11 methyltransferase
Shewmr4_2679       1       15   -4.539339  hydroxyacylglutathione hydrolase
Shewmr4_2680       1       14   -4.312774  MltD domain-containing protein
Shewmr4_2681       0       14   -3.862494  intracellular proteinase inhibitor
Shewmr4_2682       0       13   -3.434652  hypothetical protein
Shewmr4_2683       0       14   -3.345319  AsmA family protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_2663  TYPE4SSCAGA  27  0.039  Type IV secretion system CagA exotoxin signature.
		>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature.

Length = 1147

Score = 27.4 bits (60), Expect = 0.039
Identities = 19/66 (28%), Positives = 32/66 (48%), Gaps = 2/66 (3%)

Query: 91 AKVAELLGDLQKVMEQEVSFDQVASLLQKGADAAAYAKSLIEQQGVEQAMQSLKQMALAS 150
+KV + DL+ ++ + +V + A + AK+ + VEQA+ LK +
Sbjct: 758 SKVTQAKSDLENSVKDVIINQKVTDKVDNLNQAVSVAKATGDFSRVEQALADLKN--FSK 815

Query: 151 EQFAQQ 156
EQ AQQ
Sbjct: 816 EQLAQQ 821


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_2666  TONBPROTEIN  32  0.028  Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 31.9 bits (72), Expect = 0.028
Identities = 13/71 (18%), Positives = 28/71 (39%), Gaps = 1/71 (1%)

Query: 1149 VQVPIQTAPVAVAPVAVTAAKPVVPAQAPVVQALVTEPKATIAPVSEPQVPQPQVQQPQV 1208
+++P P++V V +P Q P + EP+ P + P +++P+
Sbjct: 36 IELPAPAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVV-IEKPKP 94

Query: 1209 TQSQVAQPQVQ 1219
+P +
Sbjct: 95 KPKPKPKPVKK 105


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_2674  HTHFIS  367  e-125  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 367 bits (944), Expect = e-125
Identities = 138/374 (36%), Positives = 197/374 (52%), Gaps = 35/374 (9%)

Query: 103 KPIDMSLLCETLADFAQHLVANTSTRIRPFASELDQYGLLVGSSLPMHRLYRTIRRVSAA 162
KP D+ L +A R + LVG S M +YR + R+
Sbjct: 104 KPFDL----TELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQT 159

Query: 163 ESNVLIIGESGAGKELVANTIHLASPRVNKPYIAINCGALSPELVDSELFGHVKGSFTGA 222
+ ++I GESG GKELVA +H R N P++AIN A+ +L++SELFGH KG+FTGA
Sbjct: 160 DLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGA 219

Query: 223 NRDHQGVFEQAEGGTLFLDEVTEMPLEHQVKLLRVLENNEYRPVGSPKVLKANVRIVAAT 282
G FEQAEGGTLFLDE+ +MP++ Q +LLRVL+ EY VG ++++VRIVAAT
Sbjct: 220 QTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAAT 279

Query: 283 NRDPLVAIEQGQLREDLYFRLAHFPIQVPPLRERGEDIVGLSKHFLAYRNAAEKQSKAFS 342
N+D +I QG REDLY+RL P+++PPLR+R EDI L +HF+ K F
Sbjct: 280 NKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFD 339

Query: 343 PSSLEAIAAHTWPGNVRELKHAIERAYILADHE-ITPEHLQ-----------LTPSLEKE 390
+LE + AH WPGNVREL++ + R L + IT E ++ + + +
Sbjct: 340 QEALELMKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARS 399

Query: 391 TTAEENVVIPQGMR-------------------LEELEKIAIYQALETSLGNKTDTAEQL 431
+ + + + MR L E+E I AL + GN+ A+ L
Sbjct: 400 GSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLL 459

Query: 432 GISVKTLYNKLSKY 445
G++ TL K+ +
Sbjct: 460 GLNRNTLRKKIREL 473


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_2681  NEISSPPORIN  57  5e-11  Neisseria sp. porin signature.
		>NEISSPPORIN#Neisseria sp. porin signature.

Length = 348

Score = 56.5 bits (136), Expect = 5e-11
Identities = 44/192 (22%), Positives = 86/192 (44%), Gaps = 16/192 (8%)

Query: 1 MKKVILTASVFAISALPVLADESPSVYGRLDLSVTHSELSSTVYSGTSGVKVGESGTYLE 60
MKK ++ ++ALPV A ++YG + V S V+ +G+ +
Sbjct: 1 MKKSLIA---LTLAALPVAAMADVTLYGAIKAGVQTYRSVEHTDGKVSKVE---TGSEIA 54

Query: 61 NNSSNIGVKGKSAISDGINVVYKMEFGVNNTSNRANDSSKVFSARNTYLGVETAYGTLLV 120
+ S IG KG+ + +G+ V+++E S ++ + + +++G++ +GT+
Sbjct: 55 DFGSKIGFKGQEDLGNGLKAVWQLE---QGASVAGTNTG--WGNKQSFVGLKGGFGTIRA 109

Query: 121 GRNDTVFKTAEGKVDIFGTTNADINQL-VSGQTRSAD---GVWYYSPKLFGLMDINATYL 176
G ++ K V+ + + N L +SG + V Y SP+ G + Y
Sbjct: 110 GSLNSPLKNTGANVNAWESGKFTGNVLEISGMAQREHRYLSVRYDSPEFAGFSG-SVQYA 168

Query: 177 LQDNYGADNELY 188
+DN G++ E Y
Sbjct: 169 PKDNSGSNGESY 180


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_2682  TCRTETB  35  5e-04  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 35.2 bits (81), Expect = 5e-04
Identities = 23/115 (20%), Positives = 48/115 (41%), Gaps = 7/115 (6%)

Query: 75 AFFFTYAIGKFSNGFLADYANIGRFMSVSLMLSSITCMAMGMGVAGLFFVILWGMNGWFQ 134
AF T++IG G L+D I R + ++++ + +G + +I M + Q
Sbjct: 57 AFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLI---MARFIQ 113

Query: 135 SVGSAP----SCVSIFQWYSPKQRGSVYSVWGGSRNIGEAISWILTASLVSFFGW 185
G+A V + ++ + RG + + G +GE + + + + W
Sbjct: 114 GAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHW 168


44  Shewmr4_2770  Shewmr4_2782  Y  N  Y  Genomic Island
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr4_2770      -2       20   -3.280656  pili assembly chaperone
Shewmr4_2771      -1       18   -2.505953  hypothetical protein
Shewmr4_2772      -1       18   -2.962045  fimbrial protein
Shewmr4_2773       0       14   -2.776209  hypothetical protein
Shewmr4_2774       2       18   -4.023435  fimbrial protein
Shewmr4_2775       2       19   -4.288042  IS4 family transposase
Shewmr4_2776       2       19   -3.964467  fimbrial protein
Shewmr4_2777       2       20   -5.150409  hypothetical protein
Shewmr4_2778       3       21   -5.529954  diguanylate phosphodiesterase
Shewmr4_2779       4       28   -7.700734  *diguanylate cyclase
Shewmr4_2780       4       34   -7.850197  hypothetical protein
Shewmr4_2781       0       25   -5.493042  hypothetical protein
Shewmr4_2782      -1       21   -4.164095  hypothetical protein
45  Shewmr4_2937  Shewmr4_2951  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr4_2937       1       19   -3.274478  translocation protein TolB
Shewmr4_2938       1       23   -4.288986  hypothetical protein
Shewmr4_2939       1       25   -4.814003  TolA family protein
Shewmr4_2940       2       27   -5.186869  biopolymer transport protein ExbD/TolR
Shewmr4_2941       2       28   -5.018992  MotA/TolQ/ExbB proton channel
Shewmr4_2942       3       31   -5.276629  4-hydroxybenzoyl-CoA thioesterase
Shewmr4_2943      -1       27   -6.321852  prolyl oligopeptidase
Shewmr4_2944       0       32   -6.875539  hypothetical protein
Shewmr4_2945       3       36   -7.864165  OmpA/MotB domain-containing protein
Shewmr4_2946       3       37   -7.704485  hypothetical protein
Shewmr4_2947       3       39   -8.005000  ribonuclease T
Shewmr4_2948       3       39   -7.900541  alkyl hydroperoxide reductase/ Thiol specific
Shewmr4_2949       2       36   -6.807051  Na+/H+ antiporter NhaC
Shewmr4_2950       1       34   -6.233459  hypothetical protein
Shewmr4_2951      -1       28   -5.114913  RDD domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_2938  BINARYTOXINA  30  0.009  Clostridial binary toxin A signature.
		>BINARYTOXINA#Clostridial binary toxin A signature.

Length = 454

Score = 30.4 bits (68), Expect = 0.009
Identities = 15/48 (31%), Positives = 25/48 (52%), Gaps = 2/48 (4%)

Query: 247 YFFVSPKRPELAAAILAGLENMISDGSFDEMFNRELKIDKLYRDAQFE 294
Y+F SP++ I +N IS F+E+ +E DKL++ F+
Sbjct: 133 YYFESPEKFAFNKEIRTENQNEISLEKFNEL--KETIQDKLFKQDGFK 178


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_2941  SALSPVBPROT  30  0.028  Salmonella virulence plasmid 65kDa B protein signature.
		>SALSPVBPROT#Salmonella virulence plasmid 65kDa B protein signature.

Length = 591

Score = 29.7 bits (66), Expect = 0.028
Identities = 13/38 (34%), Positives = 16/38 (42%)

Query: 369 DVLTPHYNQVTTYVWERVVDFIKLHYCISDRTDSDFWL 406
DV P VT Y F +L Y + + DFWL
Sbjct: 123 DVSFPQSYTVTRYQPRTESSFYRLEYWVGNSNGDDFWL 160


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_2947  BCTERIALGSPG  36  1e-05  Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 36.4 bits (84), Expect = 1e-05
Identities = 15/45 (33%), Positives = 26/45 (57%)

Query: 3 NKLLGFTLVELMVTIAVAAILLTIGVPSLISVYEGVRVNNNIAKI 47
+K GFTL+E+MV I + +L ++ VP+L+ E ++ I
Sbjct: 5 DKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDI 49


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_2949  BCTERIALGSPG  56  1e-12  Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 55.7 bits (134), Expect = 1e-12
Identities = 21/64 (32%), Positives = 41/64 (64%)

Query: 2 KKNRLQGFTLIEVMIAVVIVGILASIAYPSYIDYVVKSGRSEGVAAVMKVANLQEQYYLD 61
++ +GFTL+E+M+ +VI+G+LAS+ P+ + K+ + + V+ ++ + N + Y LD
Sbjct: 3 ATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLD 62

Query: 62 NRSY 65
N Y
Sbjct: 63 NHHY 66


46  Shewmr4_2960  Shewmr4_2970  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr4_2960      -2       23   -3.061547  TonB-dependent siderophore receptor
Shewmr4_2961       0       26   -3.224364  hypothetical protein
Shewmr4_2962       0       27   -3.049966  acyl-CoA dehydrogenase domain-containing
Shewmr4_2963       1       25   -3.086785  aldose 1-epimerase
Shewmr4_2964       2       27   -2.999315  integrase catalytic subunit
Shewmr4_2965       2       27   -3.785874  transposase
Shewmr4_2966       5       26   -2.802389  6-phosphogluconate dehydrogenase, NAD-binding
Shewmr4_2967       3       15   -2.341112  thioesterase superfamily protein
Shewmr4_2968       2       15   -1.486114  hypothetical protein
Shewmr4_2969       1       16   -0.513545  3-oxoacyl-[acyl-carrier-protein] synthase II
Shewmr4_2970       2       15   -0.088404  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_2963  PHPHTRNFRASE  29  0.020  Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 29.0 bits (65), Expect = 0.020
Identities = 10/36 (27%), Positives = 16/36 (44%), Gaps = 2/36 (5%)

Query: 24 RPDFLAYSQELIQVCQRLTPSDIATLMKVSDNIAGL 59
++E + + + LTPSD A L K + G
Sbjct: 147 TGSLATIAEETVIIAEDLTPSDTAQLNK--QFVKGF 180


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_2966  OMPADOMAIN  149  6e-44  OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 149 bits (377), Expect = 6e-44
Identities = 89/378 (23%), Positives = 145/378 (38%), Gaps = 58/378 (15%)

Query: 7 MKNTLK--VVLLTSMLPLAASASQELTPWYVGAGLGVNNYEHIATDNGD----DNPYAWD 60
MK T V L +A +A ++ T WY GA LG + Y N + +N
Sbjct: 1 MKKTAIAIAVALAGFATVAQAAPKDNT-WYTGAKLGWSQYHDTGFINNNGPTHENQLGAG 59

Query: 61 IFAGYMFNDYFGAEIGYRDLGSADWTYAGIGNDADVKGATLGLVGVWPLGNRWSLSAEAG 120
F GY N Y G E+GY LG + + +G L +P+ + + G
Sbjct: 60 AFGGYQVNPYVGFEMGYDWLGRMPYKGSVENGAYKAQGVQLTAKLGYPITDDLDIYTRLG 119

Query: 121 AMYYTLENNQRVGSVSTSYSENDFAPYFGAGVGYNFTDNLKLQAKYRRYENLDDNAGANA 180
M + +V + +P F GV Y T + + +Y+ N+ G
Sbjct: 120 GMVW---RADTKSNVYGKNHDTGVSPVFAGGVEYAITPEIATRLEYQWTNNI----GDAH 172

Query: 181 IVPVNADSNYWGLELSYRFGSPAAPVAAAVVAATPVDSDNDGVYDDKDQCPATPATHKVD 240
+ D+ L +SYRFG A A VVA P PA K
Sbjct: 173 TIGTRPDNGMLSLGVSYRFGQGEA---APVVAPAPA--------------PAPEVQTKHF 215

Query: 241 SVGCTIYENVKKQEDVGSIQFANDSAVVKKEYYKDIERLANYL--NKNPEFTVEIAGHAS 298
++ + F + A +K E +++L + L + +V + G+
Sbjct: 216 TLKSDVL-------------FNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTD 262

Query: 299 NVGKPDYNMTLSDKRADAVAKILVEKYGISQSRVTSNGYGITKPLVAGDS---------- 348
+G YN LS++RA +V L+ K GI ++++ G G + P V G++
Sbjct: 263 RIGSDAYNQGLSERRAQSVVDYLISK-GIPADKISARGMGESNP-VTGNTCDNVKQRAAL 320

Query: 349 KEAHAANRRIEAIVTTTE 366
+ A +RR+E V +
Sbjct: 321 IDCLAPDRRVEIEVKGIK 338


47  Shewmr4_3017  Shewmr4_3027  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr4_3017       2       16    2.360348  hypothetical protein
Shewmr4_3018       1       14    1.481769  hypothetical protein
Shewmr4_3019       2       13    1.349972  hypothetical protein
Shewmr4_3020       2       11    1.153585  GTP cyclohydrolase II
Shewmr4_3021       2       12    0.893433  hypothetical protein
Shewmr4_3022       1       11    0.694221  anaerobic ribonucleotide reductase-activating
Shewmr4_3023       0       10   -0.684478  anaerobic ribonucleoside triphosphate reductase
Shewmr4_3024      -1       13    0.647813  hypothetical protein
Shewmr4_3025       0       15    0.788792  patatin
Shewmr4_3026       2       16    1.164594  hypothetical protein
Shewmr4_3027       2       15    0.307392  DEAD/DEAH box helicase domain-containing
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_3019  HTHFIS  29  0.045  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.0 bits (65), Expect = 0.045
Identities = 15/60 (25%), Positives = 29/60 (48%), Gaps = 6/60 (10%)

Query: 274 RAPAVQ-VSTDAKAALQPDAPKGVLLLGVQGSGKSLAAKAV---AGVWQRPLLRLDMGAL 329
R+ A+Q + +Q D +++ G G+GK L A+A+ P + ++M A+
Sbjct: 142 RSAAMQEIYRVLARLMQTDLT--LMITGESGTGKELVARALHDYGKRRNGPFVAINMAAI 199


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_3020  PF06057  31  0.005  Type IV secretory pathway VirJ component
		>PF06057#Type IV secretory pathway VirJ component

Length = 243

Score = 31.3 bits (71), Expect = 0.005
Identities = 10/28 (35%), Positives = 13/28 (46%), Gaps = 2/28 (7%)

Query: 17 GCGELGKEVAIELQRLGVEVIGVD--RY 42
G L K V LQ+ G V+G +Y
Sbjct: 62 GWATLDKAVGGILQQQGWPVVGWSSLKY 89


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_3021  BCTERIALGSPC  33  0.002  Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C signature.

Length = 272

Score = 33.4 bits (76), Expect = 0.002
Identities = 26/108 (24%), Positives = 45/108 (41%), Gaps = 12/108 (11%)

Query: 348 GMIWFRLPLEGDKRVWPLSTLIAVAQQQPLAPH-IELEILSQANSESAQHEAPGSS---L 403
MI++R+ L + V + A A+QQP+ + L +S +++ +A S
Sbjct: 31 AMIFWRIGLPDNAPVSSVQITPAQARQQPVTLNDFTLFGVSPEKNKAGALDASQMSNLPP 90

Query: 404 FQLVLVNKGNLAGKLPSQLSLAAQACSGY-------DAQNGYQAKLTQ 444
L L G +AG S+ S+A + + GY AK+
Sbjct: 91 STLNLSLTGVMAGDDDSR-SIAIISKDNEQFSRGVNEEVPGYNAKIVS 137


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_3022  SYCDCHAPRONE  29  0.050  Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 28.7 bits (64), Expect = 0.050
Identities = 17/68 (25%), Positives = 33/68 (48%), Gaps = 6/68 (8%)

Query: 103 ENSELSEAQLALVNSLRDAQSLAEAEQIAAQLKESLAPALTWYSLGAMAFDAKEYDKASD 162
E ++ E QLA+ + L+ ++A +I++ E L YSL + + +Y+ A
Sbjct: 4 ETTDTQEYQLAMESFLKGGGTIAMLNEISSDTLEQL------YSLAFNQYQSGKYEDAHK 57

Query: 163 YFKKVIAL 170
F+ + L
Sbjct: 58 VFQALCVL 65


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_3025  V8PROTEASE  38  9e-05  V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 38.1 bits (88), Expect = 9e-05
Identities = 18/58 (31%), Positives = 30/58 (51%), Gaps = 1/58 (1%)

Query: 635 IDSVPVNFLS-TLDTTGGNSGSPTLNGRAELVGLLFDGVYESIIGDWAYDDNINRSIQ 691
I + + L TTGGNSGSP N + E++G+ + GV G ++N+ ++
Sbjct: 218 ITYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHWGGVPNEFNGAVFINENVRNFLK 275



Score = 30.4 bits (68), Expect = 0.025
Identities = 9/53 (16%), Positives = 15/53 (28%), Gaps = 9/53 (16%)

Query: 43 DAKSISKLTEFPMNAVISLG--------GCTASFVSPKGLVVTNHHCAYGSIQ 87
D I+ T V + + V K ++TN H +
Sbjct: 75 DRHQITDTTNGHYAPVTYIQVEAPTGTFIASG-VVVGKDTLLTNKHVVDATHG 126


48  Shewmr4_3052  Shewmr4_3058  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr4_3052      -1       15   -3.183607  hypothetical protein
Shewmr4_3053      -1       18   -4.102080  hypothetical protein
Shewmr4_3054       0       19   -4.655579  IS4 family transposase
Shewmr4_3055       1       19   -4.774933  transposase IS200-family protein
Shewmr4_3056       2       22   -5.389533  TonB-dependent receptor
Shewmr4_3057       1       20   -4.546620  hypothetical protein
Shewmr4_3058       0       18   -3.436725  porin
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_3058  PRTACTNFAMLY  35  5e-04  Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 34.6 bits (79), Expect = 5e-04
Identities = 15/33 (45%), Positives = 19/33 (57%)

Query: 126 PQPQPQPQPQPQPQPQPQPQPQNSITEPAQAQA 158
P P+P PQP PQP PQPQP+ +P +
Sbjct: 573 PAPKPAPQPGPQPPQPPQPQPEAPAPQPPAGRE 605



Score = 34.3 bits (78), Expect = 6e-04
Identities = 13/34 (38%), Positives = 16/34 (47%)

Query: 123 GTTPQPQPQPQPQPQPQPQPQPQPQNSITEPAQA 156
P+P PQP PQP PQPQP+ +
Sbjct: 572 PPAPKPAPQPGPQPPQPPQPQPEAPAPQPPAGRE 605



Score = 33.9 bits (77), Expect = 0.001
Identities = 16/35 (45%), Positives = 17/35 (48%)

Query: 121 LQGTTPQPQPQPQPQPQPQPQPQPQPQNSITEPAQ 155
L G P P+P PQP PQP PQPQ P
Sbjct: 566 LVGAKAPPAPKPAPQPGPQPPQPPQPQPEAPAPQP 600



Score = 30.8 bits (69), Expect = 0.008
Identities = 18/34 (52%), Positives = 18/34 (52%), Gaps = 1/34 (2%)

Query: 126 PQPQPQPQPQPQPQP-QPQPQPQNSITEPAQAQA 158
PQP PQP PQPQP P PQP A A A
Sbjct: 579 PQPGPQPPQPPQPQPEAPAPQPPAGRELSAAANA 612


49  Shewmr4_3112  Shewmr4_3141  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr4_3112       3       16   -0.137980  hypothetical protein
Shewmr4_3113       4       12   -0.422807  outer membrane protein precursor MtrB
Shewmr4_3114       4       12   -0.627584  hypothetical protein
Shewmr4_3115       0       14   -0.835928  hypothetical protein
Shewmr4_3116      -1       14   -0.729205  hypothetical protein
Shewmr4_3117       0       19    0.364179  phage SPO1 DNA polymerase domain-containing
Shewmr4_3118       1       19    0.796186  vibriolysin
Shewmr4_3119       2       26    0.567248  hypothetical protein
Shewmr4_3120       2       23    0.897708  CdaR family transcriptional regulator
Shewmr4_3121       0       16    2.144982  catalase domain-containing protein
Shewmr4_3122       0       12    2.060552  hypothetical protein
Shewmr4_3123       1       18    2.204271  gluconate transporter
Shewmr4_3124       1       15    2.576487  hypothetical protein
Shewmr4_3125       1       15    2.634407  glycerate kinase
Shewmr4_3126       1       16    3.066037  pyridoxal-dependent decarboxylase
Shewmr4_3127       1       16    2.700244  hypothetical protein
Shewmr4_3128       1       17    3.157787  ATPase central domain-containing protein
Shewmr4_3129       1       17    3.351893  hypothetical protein
Shewmr4_3130       0       20    2.899506  hypothetical protein
Shewmr4_3131       0       19    1.827034  hypothetical protein
Shewmr4_3132       2       19    1.139095  hypothetical protein
Shewmr4_3133      -1       11    0.834183  hypothetical protein
Shewmr4_3134      -1       11    0.478523  hypothetical protein
Shewmr4_3135      -1       11   -0.342386  streptogramin A acetyl transferase
Shewmr4_3136       0       17   -2.873400  AraC family transcriptional regulator
Shewmr4_3137       0       22   -4.025909  AzlC family protein
Shewmr4_3138       0       22   -3.943535  branched-chain amino acid transport
Shewmr4_3139       1       24   -4.388618  major facilitator transporter
Shewmr4_3140       1       24   -4.785714  N-acetylglucosamine 6-phosphate deacetylase
Shewmr4_3141       1       23   -4.454833  N-acetylglucosamine kinase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_3128  PF08280  28  0.049  M protein trans-acting positive regulator
		>PF08280#M protein trans-acting positive regulator

Length = 530

Score = 28.3 bits (63), Expect = 0.049
Identities = 14/59 (23%), Positives = 27/59 (45%), Gaps = 1/59 (1%)

Query: 140 THSQLLTRSTFTYLDSYRNPKEKGTLSCFDDVLAALLTESFERHYALGKVDFKPLNVIQ 198
H L L + + P ++ + + A LLT+SF R+++ +DF ++Q
Sbjct: 406 KHFHLFCHYVEQILRNIQPPLVVVFVA-SNFINAHLLTDSFPRYFSDKSIDFHSYYLLQ 463


50  Shewmr4_3197  Shewmr4_3223  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr4_3197       2       22   -5.278804  hypothetical protein
Shewmr4_3198       3       31   -6.987357  hypothetical protein
Shewmr4_3199       3       34   -6.793662  hypothetical protein
Shewmr4_3200       3       37   -8.813222  lysine decarboxylase transcriptional regulator,
Shewmr4_3201       3       37   -8.867879  hypothetical protein
Shewmr4_3202       3       36   -9.209710  diguanylate cyclase
Shewmr4_3203       3       42  -12.212158  FAD linked oxidase domain-containing protein
Shewmr4_3204       3       39  -11.397183  hypothetical protein
Shewmr4_3206       3       40  -11.989520  hypothetical protein
Shewmr4_3207       3       40  -11.854394  Lipocalin family protein
Shewmr4_3208       4       41  -12.380886  hypothetical protein
Shewmr4_3209       2       41  -12.499317  ABC transporter-like protein
Shewmr4_3210       3       30   -8.228394  copper-translocating P-type ATPase
Shewmr4_3212       1       35  -10.796638  MerR family transcriptional regulator
Shewmr4_3213       1       30   -8.516149  peptidase S9 prolyl oligopeptidase
Shewmr4_3214       1       22   -5.830358  hypothetical protein
Shewmr4_3215       1       19   -5.188937  hypothetical protein
Shewmr4_3216       2       18   -4.230349  3-ketoacyl-(acyl-carrier-protein) reductase
Shewmr4_3217       1       18   -1.939915  3-ketoacyl-(acyl-carrier-protein) reductase
Shewmr4_3218       1       16   -0.130379  3-hydroxyisobutyrate dehydrogenase
Shewmr4_3219       0       13   -1.103338  enoyl-CoA hydratase/isomerase
Shewmr4_3220      -1       13   -3.065160  enoyl-CoA hydratase
Shewmr4_3221      -2       12   -2.721440  acyl-CoA dehydrogenase domain-containing
Shewmr4_3222      -1       11   -2.873625  methylmalonate-semialdehyde dehydrogenase
Shewmr4_3223      -1       14   -3.730241  acetyl-CoA acetyltransferases
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_3208  SECA  49  3e-08  SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 49.5 bits (118), Expect = 3e-08
Identities = 17/28 (60%), Positives = 21/28 (75%)

Query: 517 LQLGKPKIGRNQTCNCGSGRKYKQCCGK 544
Q G+ K+GRN C CGSG+KYKQC G+
Sbjct: 872 AQTGERKVGRNDPCPCGSGKKYKQCHGR 899


51  Shewmr4_3274  Shewmr4_3294  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr4_3274       1       19    3.465986  30S ribosomal protein S2
Shewmr4_3275       1       19    3.539229  methionine aminopeptidase
Shewmr4_3277       1       15    2.779176  PII uridylyl-transferase
Shewmr4_3278       1       17    1.720657  2,3,4,5-tetrahydropyridine-2,6-carboxylate
Shewmr4_3279      -1       14    1.967062  hypothetical protein
Shewmr4_3280      -1       16    1.601713  hypothetical protein
Shewmr4_3281      -1       15    1.458948  metal dependent phosphohydrolase
Shewmr4_3282      -2       12    2.288572  formyltetrahydrofolate deformylase
Shewmr4_3283      -1       14    3.026091  PTS system, glucose-like IIB subunit
Shewmr4_3284       0       13    3.287072  flavodoxin
Shewmr4_3285       2       12    3.504190  tRNA pseudouridine synthase C
Shewmr4_3286       2       11    3.739881  hypothetical protein
Shewmr4_3287       2       12    3.528148  hypothetical protein
Shewmr4_3288       2       10    1.641969  hypothetical protein
Shewmr4_3289       1       12    0.684762  hypothetical protein
Shewmr4_3290       2       16   -1.004117  hypothetical protein
Shewmr4_3291       3       22   -2.972149  GCN5-related N-acetyltransferase
Shewmr4_3292       6       26   -4.520736  hypothetical protein
Shewmr4_3293       7       31   -6.030545  hypothetical protein
Shewmr4_3294       2       19   -1.599618  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_3278  HTHFIS  83  2e-20  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 83.0 bits (205), Expect = 2e-20
Identities = 28/134 (20%), Positives = 60/134 (44%)

Query: 23 ILLVEDEQDLAQMIMVNLTALNFRVFHAASLHQANALLQAKRIDLVLLDRMLPDGDGLLL 82
IL+ +D+ + ++ L+ + V ++ + A DLV+ D ++PD + L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 83 CQQLRNDGQQMPVMLLTARDGEADTVLGLESGADDYMTKPFSVLELRARTKALLRRHLSA 142
+++ +PV++++A++ + E GA DY+ KPF + EL L
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125

Query: 143 TPTRQLIEFEGLRI 156
+ +G+ +
Sbjct: 126 PSKLEDDSQDGMPL 139


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_3279  OUTRMMBRANEA  28  0.046  Outer membrane protein A signature.
		>OUTRMMBRANEA#Outer membrane protein A signature.

Length = 346

Score = 27.6 bits (61), Expect = 0.046
Identities = 6/16 (37%), Positives = 6/16 (37%)

Query: 27 PPPEPPAPPPVVMKSF 42
P P P V K F
Sbjct: 200 VAPAPAPAPEVQTKHF 215


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_3283  HTHFIS  65  2e-14  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 65.2 bits (159), Expect = 2e-14
Identities = 41/167 (24%), Positives = 68/167 (40%), Gaps = 8/167 (4%)

Query: 1 MSQIKVAIADDHPLFRTALTQAVLKNVNTADVLEAENFQELITLVENNPDIELIFLDLHM 60
M+ + +ADD RT L QA L DV N L + +L+ D+ M
Sbjct: 1 MTGATILVADDDAAIRTVLNQA-LSRAG-YDVRITSNAATLWRWIAAGD-GDLVVTDVVM 57

Query: 61 PGNEGFTGLTLLQNHFPDIAVIMVSSDDQPEIIRKAINFGASAFIPKSASLTQISTAIAT 120
P F L ++ PD+ V+++S+ + KA GA ++PK LT++ I
Sbjct: 58 PDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117

Query: 121 VLEGEVWLPEHTDINVDQQ-----TAAEHQRLAKQLAQLTPQQYTVL 162
L P + + +A Q + + LA+L T++
Sbjct: 118 ALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLM 164


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_3284  HTHFIS  39  9e-05  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 39.0 bits (91), Expect = 9e-05
Identities = 13/71 (18%), Positives = 29/71 (40%), Gaps = 2/71 (2%)

Query: 1048 ISVLVIDNDELMLKAISSLLLGWGCHVLTARDKTSAEQQLVQQVLPKLIIADYHLDDDQN 1107
++LV D+D + ++ L G V + + + + L++ D + D+ N
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWI-AAGDGDLVVTDVVMPDE-N 61

Query: 1108 GVDLVQSLLTH 1118
DL+ +
Sbjct: 62 AFDLLPRIKKA 72


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_3285  TCRTETB  29  0.024  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 29.5 bits (66), Expect = 0.024
Identities = 26/107 (24%), Positives = 40/107 (37%), Gaps = 4/107 (3%)

Query: 252 GIVGTIAGILYSRKQPLRLPIIRLSGLLIFLTVLGLSFGSAPWLQTLCAI-VLGFCIFLP 310
I G I GIL R+ PL + I ++ L + + W T+ + VLG F
Sbjct: 307 IIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTK 366

Query: 311 VTALVSIPHELPKMTSQKITVIFSLFWSISYLISTLVLWLFGKLVDI 357
+ L Q+ SL S+L + + G L+ I
Sbjct: 367 TVISTIVSSSL---KQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSI 410


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_3289  SURFACELAYER  30  0.044  Lactobacillus surface layer protein signature.
		>SURFACELAYER#Lactobacillus surface layer protein signature.

Length = 439

Score = 29.6 bits (66), Expect = 0.044
Identities = 23/101 (22%), Positives = 42/101 (41%), Gaps = 7/101 (6%)

Query: 400 VSALSAPVELTVRVGPEPANAAPALSNIQVSVSGQCASVTGSVVDANQNLASVTVGFSSG 459
++A + PV + + A A + V V+ S++ A + G +G
Sbjct: 21 IAATAMPVNAATTINADSAINANTNAKYDVDVT---PSISAIAAVAKSDTMPAIPGSLTG 77

Query: 460 QQVSASINGTQYSAQGCNLPGGANLATVIAMDSTQLSSQDS 500
+SAS NG Y+A NLP + AT+ ++ + +
Sbjct: 78 S-ISASYNGKSYTA---NLPKDSGNATITDSNNNTVKPAEL 114


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_3290  DHBDHDRGNASE  118  4e-34  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 118 bits (297), Expect = 4e-34
Identities = 76/259 (29%), Positives = 121/259 (46%), Gaps = 6/259 (2%)

Query: 33 GLKGKVGLITGSTSGIGLATAHVLAEQGCHLILHGLMPEAEGQRLAAEFAEQYHIHTFFS 92
G++GK+ ITG+ GIG A A LA QG H+ PE + +++ AE H F
Sbjct: 5 GIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF-- 62

Query: 93 NADLRDPESIHAFMDAGVKALGSIDILVNNAGIQHTENVAHFPIDKWNDIIAINLSSAFH 152
AD+RD +I + +G IDILVN AG+ + ++W ++N + F+
Sbjct: 63 PADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 153 TIQQAVPAMAEKRWGRIINIASVHGLVASVNKAAYCAAKHGIVGLTKVVAIECAEQGITV 212
+ M ++R G I+ + S V + AAY ++K V TK + +E AE I
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 213 NAICPGWVDTPLINK-QIEAIASNKGLSYDEAKYQLVTAKQPLPEMLDPRQIGEFVLFLC 271
N + PG +T + + + + + ++ PL ++ P I + VLFL
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGI---PLKKLAKPSDIADAVLFLV 239

Query: 272 SSAARGITGASLAMDGAWT 290
S A IT +L +DG T
Sbjct: 240 SGQAGHITMHNLCVDGGAT 258


52  Shewmr4_3322  Shewmr4_3333  Y  N  N  Genomic Island
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr4_3322       0       21   -4.696482  hypothetical protein
Shewmr4_3323       1       28   -9.092030  porin
Shewmr4_3324      -1       28   -7.265456  hypothetical protein
Shewmr4_3325       0       31   -7.373643  major facilitator transporter
Shewmr4_3326       0       30   -6.425913  putative phosphoglycerate transport regulatory
Shewmr4_3327       0       26   -5.277911  hypothetical protein
Shewmr4_3328      -1       19   -1.839512  periplasmic sensor signal transduction histidine
Shewmr4_3329       0       24    3.494944  two component, sigma54 specific, Fis family
Shewmr4_3330      -1       15    3.777962  hypothetical protein
Shewmr4_3331      -1       14    4.222970  hypothetical protein
Shewmr4_3332       1       20    4.592634  cyclic nucleotide-binding protein
Shewmr4_3333       0       20    3.581116  fumarylacetoacetate (FAA) hydrolase
53  Shewmr4_3380  Shewmr4_3393  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
Shewmr4_3380      -1       20    3.297731  hypothetical protein
Shewmr4_3381      -2       22    3.855080  thiamine biosynthesis protein ThiI
Shewmr4_3382      -1       24    4.035150  flagellar motor protein MotB
Shewmr4_3383      -1       23    3.836382  flagellar motor protein PomA
Shewmr4_3384       0       23    4.562910  exodeoxyribonuclease VII small subunit
Shewmr4_3385      -1       22    3.544355  farnesyl-diphosphate synthase
Shewmr4_3386      -1       15    2.192118  1-deoxy-D-xylulose-5-phosphate synthase
Shewmr4_3387      -1       11    0.574326  heat shock protein GrpE
Shewmr4_3388       0       12   -1.583768  inorganic polyphosphate/ATP-NAD kinase
Shewmr4_3389       0       15   -2.704388  L-lactate permease
Shewmr4_3390       0       18   -4.166395  hypothetical protein
Shewmr4_3391       1       22   -4.821365  FAD linked oxidase domain-containing protein
Shewmr4_3392       0       26   -4.496712  hypothetical protein
Shewmr4_3393       1       28   -4.280495  4Fe-4S ferredoxin iron-sulfur binding
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_3389  HTHFIS  33  0.002  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 32.9 bits (75), Expect = 0.002
Identities = 37/145 (25%), Positives = 62/145 (42%), Gaps = 17/145 (11%)

Query: 30 RTVIRSLILGLLCSGHVLLEGLPGTAKTRSVKAL------ANALAISFGRIQFTPDLLPS 83
+ + R L + +++ G GT K +AL N ++ DL+ S
Sbjct: 147 QEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIES 206

Query: 84 DVTGTE--VLHEAEGKSTLRFQP---GPVFNQIVLADEINRAPAKVQAALLEAMAEGTIT 138
++ G E A+ +ST RF+ G +F DEI P Q LL + +G T
Sbjct: 207 ELFGHEKGAFTGAQTRSTGRFEQAEGGTLF-----LDEIGDMPMDAQTRLLRVLQQGEYT 261

Query: 139 -VAGQTHVLPELFMVLATQNPIEQE 162
V G+T + ++ +V AT ++Q
Sbjct: 262 TVGGRTPIRSDVRIVAATNKDLKQS 286


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Shewmr4_3393  INTIMIN  32  0.027  Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 31.6 bits (71), Expect = 0.027
Identities = 41/175 (23%), Positives = 67/175 (38%), Gaps = 18/175 (10%)

Query: 1038 YDPNGQFNDLVKGETANEVFSYTITDEIGAT--STTEVTISVVGINAAP-VAVADTAVTT 1094
YD + N+++ ++ S I +I T ST ++ + V + D+A+ +
Sbjct: 435 YDLVQRNNNIILEYKKQDILSLNIPHDINGTERSTQKIQLIVKSKYGLDRIVWDDSALRS 494

Query: 1095 KSGSIQIDLLANDTD------------ADGDTLTITAIDVGSLKGKVTNNNDGTVTYSPN 1142
+ G IQ + D ++ +T A D G +NN T+T N
Sbjct: 495 QGGQIQHSGSQSAQDYQAILPAYVQGGSNVYKVTARAYDR---NGNSSNNVLLTITVLSN 551

Query: 1143 GQFGHLYQGQSATETFTYTISDGDAEMTASVTVTINGEGQAPVEPEKEGSSGGSL 1197
GQ T T +DG +T + TV NG QA V SG ++
Sbjct: 552 GQVVDQVGVTDFTADKTSAKADGTEAITYTATVKKNGVAQANVPVSFNIVSGTAV 606


54  Shewmr4_3434  Shewmr4_3445  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
Shewmr4_3434  -1      16       3.149796    uroporphyrin-III C/tetrapyrrole
Shewmr4_3435  -1      16       2.921788    SmpA/OmlA domain-containing protein
Shewmr4_3436  -2      17       3.195642    hypothetical protein
Shewmr4_3437  -1      17       3.120084    hypothetical protein
Shewmr4_3438  -1      18       3.369292    cyclase/dehydrase
Shewmr4_3439  -1      17       3.333100    SsrA-binding protein
Shewmr4_3440  0       19       2.664166    phage integrase family protein
Shewmr4_3441  0       19       2.510230    type III restriction enzyme, res subunit
Shewmr4_3442  0       18       1.889521    DNA methylase N-4/N-6 domain-containing protein
Shewmr4_3443  -1      13       -0.901163   phage transcriptional regulator, AlpA
Shewmr4_3444  0       13       -1.767833   resolvase domain-containing protein
Shewmr4_3445  1       16       -4.115877   phage integrase family protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3434  RTXTOXIND     61     2e-12    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 61.0 bits (148), Expect = 2e-12
Identities = 30/220 (13%), Positives = 72/220 (32%), Gaps = 25/220 (11%)

Query: 80 FELAVSQAKLALEQVRQDNAELDASLLAAKAEVNASSTTAQQKRREAKRLDALYATHGVS 139
+ S + Q + + A L A +N ++ ++ +L ++
Sbjct: 191 IKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIA 250

Query: 140 -------QQQRDQADSDAAAAEANLSAASARLEKLKVSRGLYGED------------NLR 180
+ + +A ++ ++ L + + K L +
Sbjct: 251 KHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDN 310

Query: 181 MRQARNALEQAELNLSYTQIHADQDGVVTNLQL-EVGSFASVGQPLLALV--SDKLDIIA 237
+ L + E + I A V L++ G + + L+ +V D L++ A
Sbjct: 311 IGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTA 370

Query: 238 DFREKTLRGVNASYPALIAFDGEPGRLY---RAQVSSVDA 274
+ K + +N A+I + P Y +V +++
Sbjct: 371 LVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINL 410



Score = 60.6 bits (147), Expect = 2e-12
Identities = 30/197 (15%), Positives = 65/197 (32%), Gaps = 11/197 (5%)

Query: 1 MTPDQQFARLVKIAMLGFVAV-FGYFMFADAMMPLTPQAMATRVVT------KVTPQISG 53
TP + RLV ++GF+ + F + + A A +T ++ P +
Sbjct: 50 ETPVSRRPRLVAYFIMGFLVIAFILSVLG----QVEIVATANGKLTHSGRSKEIKPIENS 105

Query: 54 KIQTINVKNNQVVAKGDLLFQVDPAPFELAVSQAKLALEQVRQDNAELDASLLAAKAEVN 113
++ I VK + V KGD+L ++ E + + +L Q R + + +
Sbjct: 106 IVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKL 165

Query: 114 ASSTTAQQKRREAKRLDALYATHGVSQQQRDQADSDAAAAEANLSAASARLEKLKVSRGL 173
+ + + + + ++Q + E NL A +
Sbjct: 166 PELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINR 225

Query: 174 YGEDNLRMRQARNALEQ 190
Y + + +
Sbjct: 226 YENLSRVEKSRLDDFSS 242


55  Shewmr4_3520  Shewmr4_3529  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
Shewmr4_3520  0       18       -3.542142   cytochrome c assembly protein
Shewmr4_3521  -1      15       -3.142481   hypothetical protein
Shewmr4_3522  -1      14       -3.195759   hypothetical protein
Shewmr4_3523  -1      19       -4.271263   hypothetical protein
Shewmr4_3524  0       18       -3.671281   transposase IS200-family protein
Shewmr4_3525  0       15       -2.436717   4'-phosphopantetheinyl transferase
Shewmr4_3526  1       17       1.821581    pyridoxine 5'-phosphate synthase
Shewmr4_3527  0       18       2.888588    DNA repair protein RecO
Shewmr4_3528  -1      18       3.208733    GTP-binding protein Era
Shewmr4_3529  0       19       3.489215    ribonuclease III
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3520  BCTERIALGSPG  48     1e-09    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 48.0 bits (114), Expect = 1e-09
Identities = 18/59 (30%), Positives = 33/59 (55%)

Query: 1 MSRLHTSKGFTLIELVVVIIILGILAVVAAPRFINLSQDAHNARAKAAFAAFTSGVKLY 59
M +GFTL+E++VVI+I+G+LA + P + + A +A + A + + +Y
Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMY 59


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3522  HTHFIS        31     0.014    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.6 bits (69), Expect = 0.014
Identities = 11/36 (30%), Positives = 18/36 (50%), Gaps = 3/36 (8%)

Query: 49 TPKNILMIGPTGVGKTEIAR---RLAKLANAPFIKV 81
T +++ G +G GK +AR K N PF+ +
Sbjct: 159 TDLTLMITGESGTGKELVARALHDYGKRRNGPFVAI 194


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3524  FLGFLIH       27     0.036    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 27.5 bits (60), Expect = 0.036
Identities = 15/45 (33%), Positives = 21/45 (46%)

Query: 146 DRDQRLAATDNPVAEARLNQLDDEFYKDLDNLDIKLESYAIQMGL 190
++ A + AR+ QL EF LD LD + S +QM L
Sbjct: 81 EQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRLMQMAL 125


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3527  DHBDHDRGNASE  43     3e-07    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 43.1 bits (101), Expect = 3e-07
Identities = 37/195 (18%), Positives = 74/195 (37%), Gaps = 29/195 (14%)

Query: 55 LEEEIKQLSQNISQLDWLINCIGMLHTEDKGPEKSLQALDGDFFLHNIQLNTLPSMMLAK 114
++E ++ + + +D L+N G+L G SL + + +N+ ++
Sbjct: 72 IDEITARIEREMGPIDILVNVAGVLRP---GLIHSLSDEE---WEATFSVNSTGVFNASR 125

Query: 115 HFEPTLKRSASARFAVVSAKVGSISDNRLGGWYSYRASKAALNMFLKTLSIEWQRSMKHC 174
+ S V + + + +Y +SKAA MF K L +E C
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMA---AYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 175 VVLALHPGTTDTPLSKP------------------FQQNVPKQKLFTPEYVAQCLVSIIA 216
+++ PG+T+T + F+ +P +KL P +A ++ +++
Sbjct: 183 NIVS--PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVS 240

Query: 217 NATPAQTGSFLAYDG 231
T L DG
Sbjct: 241 GQAGHITMHNLCVDG 255


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3529  HTHFIS        87     2e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 86.8 bits (215), Expect = 2e-22
Identities = 26/110 (23%), Positives = 57/110 (51%)

Query: 3 RLLIIEDDQALAGVLARRLTRHGFECRLSHDASNALLVAREFCPSHILLDMKLAEANGLS 62
+L+ +DD A+ VL + L+R G++ R++ +A+ ++ D+ + + N
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 63 LIVPLRNLLPKVIMVLLTGYASIATAVEAIRLGADNYLAKPVDTQTLLAA 112
L+ ++ P + +++++ + TA++A GA +YL KP D L+
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGI 114



Score = 48.3 bits (115), Expect = 3e-09
Identities = 19/58 (32%), Positives = 29/58 (50%), Gaps = 2/58 (3%)

Query: 116 NSQASALPEDEIDDSPLSPKRLEWEHIQQVLNANQGNVSATARQLGMHRRTLQRKLLK 173
S ALP + D L +E+ I L A +GN A LG++R TL++K+ +
Sbjct: 417 ASFGDALPPSGLYDRVL--AEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRE 472


56  Shewmr4_3557  Shewmr4_3570  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
Shewmr4_3557  2       18       -1.263052   glutamate synthase subunit alpha
Shewmr4_3558  2       17       -1.330728   glutamate synthase subunit beta
Shewmr4_3559  3       17       -1.313906   methylthioadenosine nucleosidase
Shewmr4_3560  3       26       -0.250602   cobalamin biosynthesis protein CbiB
Shewmr4_3561  4       24       0.235782    hypothetical protein
Shewmr4_3562  3       22       0.167169    hypothetical protein
Shewmr4_3563  2       15       1.313372    hypothetical protein
Shewmr4_3565  1       15       2.425588    hypothetical protein
Shewmr4_3566  2       16       3.037125    hypothetical protein
Shewmr4_3567  1       16       3.254215    hypothetical protein
Shewmr4_3568  1       18       2.747000    hypothetical protein
Shewmr4_3569  0       18       3.086713    hypothetical protein
Shewmr4_3570  0       18       3.036405    tyrosyl-tRNA synthetase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3559  PF02370       30     0.024    M protein repeat
		>PF02370#M protein repeat

Length = 168

Score = 29.7 bits (66), Expect = 0.024
Identities = 13/69 (18%), Positives = 32/69 (46%)

Query: 388 DERKSKLRIQQEALKQAQKIRSAREEALKVEAETNERLEQMVQERTLELEITLRELHEVN 447
D RK + + Q + + ++ + +E + E + ++ QE+ + + ++L
Sbjct: 59 DLRKREGQYQDKIEELEKERKEKQERPERREKFERQHQDKHYQEQQKKHQQEQQQLEAEK 118

Query: 448 QKLTEQSTI 456
QKL ++ I
Sbjct: 119 QKLAKEKQI 127


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3562  SECA          1320   0.0      SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 1320 bits (3417), Expect = 0.0
Identities = 662/907 (72%), Positives = 764/907 (84%), Gaps = 7/907 (0%)

Query: 1 MFGKLLTKVFGSRNDRTLKGLQKVVNKINALEADYEKLTDEQLKAKTAEFRERLAAGASL 60
M KLLTKVFGSRNDRTL+ ++KVVN INA+E + EKL+DE+LK KTAEFR RL G L
Sbjct: 1 MLIKLLTKVFGSRNDRTLRRMRKVVNIINAMEPEMEKLSDEELKGKTAEFRARLEKGEVL 60

Query: 61 ESIMAEAFATVREASKRVFEMRHFDVQLLGGMVLDSNRIAEMRTGEGKTLTATLPAYLNA 120
E+++ EAFA VREASKRVF MRHFDVQLLGGMVL+ IAEMRTGEGKTLTATLPAYLNA
Sbjct: 61 ENLIPEAFAVVREASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNA 120

Query: 121 LTGKGVHVITVNDYLARRDAENNRPLFEFLGLTVGINVAGLGQQAKKDAYNADITYGTNN 180
LTGKGVHV+TVNDYLA+RDAENNRPLFEFLGLTVGIN+ G+ AK++AY ADITYGTNN
Sbjct: 121 LTGKGVHVVTVNDYLAQRDAENNRPLFEFLGLTVGINLPGMPAPAKREAYAADITYGTNN 180

Query: 181 EFGFDYLRDNMAFSPQERVQRPLHYALIDEVDSILIDEARTPLIISGAAEDSSELYIKIN 240
E+GFDYLRDNMAFSP+ERVQR LHYAL+DEVDSILIDEARTPLIISG AEDSSE+Y ++N
Sbjct: 181 EYGFDYLRDNMAFSPEERVQRKLHYALVDEVDSILIDEARTPLIISGPAEDSSEMYKRVN 240

Query: 241 TLIPNLIRQDKEDSEEYVGEGDYSIDEKAKQVHFTERGQEKVENLLIERGMLAEGDSLYS 300
+IP+LIRQ+KEDSE + GEG +S+DEK++QV+ TERG +E LL++ G++ EG+SLYS
Sbjct: 241 KIIPHLIRQEKEDSETFQGEGHFSVDEKSRQVNLTERGLVLIEELLVKEGIMDEGESLYS 300

Query: 301 AANISLLHHVNAALRAHTLFERDVDYIVQDGEVIIVDEHTGRTMPGRRWSEGLHQAVEAK 360
ANI L+HHV AALRAH LF RDVDYIV+DGEVIIVDEHTGRTM GRRWS+GLHQAVEAK
Sbjct: 301 PANIMLMHHVTAALRAHALFTRDVDYIVKDGEVIIVDEHTGRTMQGRRWSDGLHQAVEAK 360

Query: 361 EGVRIQNENQTLASITFQNYFRLYEKLAGMTGTADTEAFEFQHIYGLDTVVVPTNRPMVR 420
EGV+IQNENQTLASITFQNYFRLYEKLAGMTGTADTEAFEF IY LDTVVVPTNRPM+R
Sbjct: 361 EGVQIQNENQTLASITFQNYFRLYEKLAGMTGTADTEAFEFSSIYKLDTVVVPTNRPMIR 420

Query: 421 KDMADLVYLTANEKYQAIIKDIKDCRERGQPVLVGTVSIEQSELLARLMVKEKIPHQVLN 480
KD+ DLVY+T EK QAII+DIK+ +GQPVLVGT+SIE+SEL++ + K I H VLN
Sbjct: 421 KDLPDLVYMTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSELVSNELTKAGIKHNVLN 480

Query: 481 AKFHEKEAEIVAQAGRTGAVTIATNMAGRGTDIVLGGNWNMEIEALENPTAEQKAKIKAD 540
AKFH EA IVAQAG AVTIATNMAGRGTDIVLGG+W E+ ALENPTAEQ KIKAD
Sbjct: 481 AKFHANEAAIVAQAGYPAAVTIATNMAGRGTDIVLGGSWQAEVAALENPTAEQIEKIKAD 540

Query: 541 WQERHDAVVAAGGLHILGTERHESRRIDNQLRGRAGRQGDAGSSRFYLSMEDSLMRIFAS 600
WQ RHDAV+ AGGLHI+GTERHESRRIDNQLRGR+GRQGDAGSSRFYLSMED+LMRIFAS
Sbjct: 541 WQVRHDAVLEAGGLHIIGTERHESRRIDNQLRGRSGRQGDAGSSRFYLSMEDALMRIFAS 600

Query: 601 DRVSGMMKKLGMEEGEAIEHPWVSRAIENAQRKVEARNFDIRKQLLEFDDVANDQRQVVY 660
DRVSGMM+KLGM+ GEAIEHPWV++AI NAQRKVE+RNFDIRKQLLE+DDVANDQR+ +Y
Sbjct: 601 DRVSGMMRKLGMKPGEAIEHPWVTKAIANAQRKVESRNFDIRKQLLEYDDVANDQRRAIY 660

Query: 661 AQRNELMDAESIEDTIKNIQDDVISAVIDQYIPPQSVEELWDVPGLEQRLQQEFMLKLPI 720
+QRNEL+D + +TI +I++DV A ID YIPPQS+EE+WD+PGL++RL+ +F L LPI
Sbjct: 661 SQRNELLDVSDVSETINSIREDVFKATIDAYIPPQSLEEMWDIPGLQERLKNDFDLDLPI 720

Query: 721 QEWLDKEDDLHEETLRERIITSWSDAYKAKEEMVGAPVLRQFEKAVMLQTLDGLWKEHLA 780
EWLDKE +LHEETLRERI+ + Y+ KEE+VGA ++R FEK VMLQTLD LWKEHLA
Sbjct: 721 AEWLDKEPELHEETLRERILAQSIEVYQRKEEVVGAEMMRHFEKGVMLQTLDSLWKEHLA 780

Query: 781 AMDHLRQGIHLRGYAQKNPKQEYKRESFELFQQLLSTLKHDVISVLSKVQVQAQSDVEEM 840
AMD+LRQGIHLRGYAQK+PKQEYKRESF +F +L +LK++VIS LSKVQV+ +VEE+
Sbjct: 781 AMDYLRQGIHLRGYAQKDPKQEYKRESFSMFAAMLESLKYEVISTLSKVQVRMPEEVEEL 840

Query: 841 EARRREEDAKIQRDYQHAAAEALVGGDDGSDEMMAHTPMIRDGDKVGRNDPCPCGSGRKY 900
E +RR E A + D A KVGRNDPCPCGSG+KY
Sbjct: 841 EQQRRMEAE-------RLAQMQQLSHQDDDSAAAAALAAQTGERKVGRNDPCPCGSGKKY 893

Query: 901 KQCHGKL 907
KQCHG+L
Sbjct: 894 KQCHGRL 900


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3567  SHAPEPROTEIN  69     6e-15    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 68.6 bits (168), Expect = 6e-15
Identities = 51/221 (23%), Positives = 91/221 (41%), Gaps = 20/221 (9%)

Query: 150 SGMRMEAKVHIVTC----ANDMAKNITK-SVERCGLKVDDLVFSGIASADAVLTFDEKDL 204
S M ++ C A + + + S + G + L+ +A+A +
Sbjct: 100 SNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEAT 159

Query: 205 GVCIVDIGGGTTDIAVYTNGALRHCAVVPVAGNQVTNDIAKIFR------TPSSHAEQIK 258
G +VDIGGGTT++AV + + + + V + G++ I R + AE+IK
Sbjct: 160 GSMVVDIGGGTTEVAVISLNGVVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIK 219

Query: 259 VQFACARSSMVSREDSIEVPS---VGGRPSR-SMSRHTLAEVVEPRYQELFELVLKELKD 314
+ A IEV G P +++ + + E ++ + V+ L+
Sbjct: 220 HEIGSAYPG--DEVREIEVRGRNLAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQ 277

Query: 315 SGLE---DQIAAGIVLTGGTASIQGVVDIAEATFGMPVRVA 352
E D G+VLTGG A ++ + + G+PV VA
Sbjct: 278 CPPELASDISERGMVLTGGGALLRNLDRLLMEETGIPVVVA 318


57  Shewmr4_3646  Shewmr4_3660  Y  N  N  Genomic Island
LocusTag      DNBias  CDNBias  %GCBias     Product
Shewmr4_3646  2       18       0.837414    hypothetical protein
Shewmr4_3647  0       16       0.798600    hypothetical protein
Shewmr4_3648  -1      13       -1.280702   tryptophan halogenase
Shewmr4_3649  -2      13       -0.750178   TonB-dependent receptor
Shewmr4_3650  -1      13       -0.819283   hypothetical protein
Shewmr4_3651  -2      11       -0.812521   LacI family transcription regulator
Shewmr4_3652  -2      13       -2.912796   NADH dehydrogenase
Shewmr4_3653  0       13       -3.166518   hypothetical protein
Shewmr4_3654  0       14       -2.016051   nitrogen regulatory protein P-II
Shewmr4_3655  1       20       -2.419895   methylation site containing protein
Shewmr4_3656  2       21       -2.542998   methylation site containing protein
Shewmr4_3657  3       21       -2.165180   hypothetical protein
Shewmr4_3658  3       21       -0.969862   hypothetical protein
Shewmr4_3659  3       20       -0.361076   hypothetical protein
Shewmr4_3660  2       21       -0.459075   methylation site containing protein
58  Shewmr4_3716  Shewmr4_3722  Y  N  Y  Genomic Island
LocusTag      DNBias  CDNBias  %GCBias     Product
Shewmr4_3716  2       23       1.072442    hypothetical protein
Shewmr4_3717  3       24       0.956439    ATPase
Shewmr4_3718  3       27       1.052819    hypothetical protein
Shewmr4_3719  4       34       1.209442    ribosomal large subunit pseudouridine synthase
Shewmr4_3720  4       34       1.176743    putative lipoprotein
Shewmr4_3721  4       32       1.066902    ***methyl-accepting chemotaxis sensory transducer
Shewmr4_3722  2       24       0.274586    globin
59  Shewmr4_3759  Shewmr4_3777  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
Shewmr4_3759  2       17       1.817665    hypothetical protein
Shewmr4_3760  2       15       0.531189    transposase IS116/IS110/IS902 family protein
Shewmr4_3761  1       14       -1.648476   dipeptidyl-peptidase 7
Shewmr4_3762  1       13       -0.172634   hypothetical protein
Shewmr4_3763  1       12       0.283014    aromatic amino acid permease
Shewmr4_3764  2       14       0.031902    hypothetical protein
Shewmr4_3765  1       16       0.873157    N-acetylglucosamine-binding protein A
Shewmr4_3766  0       14       1.260143    N-acetylglucosamine-binding protein A
Shewmr4_3767  0       12       2.212830    hypothetical protein
Shewmr4_3768  -1      12       1.564192    catalase
Shewmr4_3769  2       10       1.939099    hypothetical protein
Shewmr4_3770  2       10       2.089672    hypothetical protein
Shewmr4_3771  3       13       1.121375    hypothetical protein
Shewmr4_3772  3       14       1.269275    endonuclease/exonuclease/phosphatase
Shewmr4_3773  4       15       1.127800    hypothetical protein
Shewmr4_3774  5       16       0.221211    peptidylprolyl isomerase, FKBP-type
Shewmr4_3775  1       15       -2.018341   hypothetical protein
Shewmr4_3776  3       21       -3.208722   WD-40 repeat-containing protein
Shewmr4_3777  2       19       -1.034897   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3759  PF03544       120    4e-36    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 120 bits (303), Expect = 4e-36
Identities = 47/183 (25%), Positives = 76/183 (41%), Gaps = 22/183 (12%)

Query: 34 PQPLSGQASDAPQINI-LMSERTPIPPKEN-------RKPEPPKPIPARERVTAPGESNQ 85
P + Q P + E P PPKE + PKP P ++ +
Sbjct: 61 EPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKP 120

Query: 86 LVDFNTQTFTPEMPSQTTLFTQSAMTAE--------ALPVVQVSPRYPIDAAQNGKEGYV 137
+ F P++ T T +A T++ + + P+YP A EG V
Sbjct: 121 VESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQV 180

Query: 138 VVGFDITADGSVSNVRVLDANPKRVFDKAALSAVQNWKYKPKFDSGKAVPQLNQQVQLDF 197
V FD+T DG V NV++L A P +F++ +A++ W+Y+P V V + F
Sbjct: 181 KVKFDVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIV------VNILF 234

Query: 198 KLD 200
K++
Sbjct: 235 KIN 237


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3774  IGASERPTASE   71     6e-15    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 71.2 bits (174), Expect = 6e-15
Identities = 41/263 (15%), Positives = 75/263 (28%), Gaps = 11/263 (4%)

Query: 17 DEVVEQTPVSTPSQTEQAEALAKQHAEEARLAAEKAAAEQALADKLAAEKAEAEAQRV-A 75
++ V+ T ++TP+ + + EE +A A A +E E A+
Sbjct: 989 NQTVDTTNITTPNNIQADVPSVPSNNEE--IARVDEAPVPPPAPATPSETTETVAENSKQ 1046

Query: 76 EEQAARIAEQQAAEAAHLAAEQALAEQL----AAEQAQAERVAAEQAAKAQAEAEAEALR 131
E + EQ A E E A + + + + +E E + A
Sbjct: 1047 ESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATV 1106

Query: 132 IAEEQAARLAEQQAAEAAHLAAEQALAEQLAAEQAETERVAAEQAAKAQAEAEAEAEAEA 191
EE+A E+ + EQ Q + E A + E +++
Sbjct: 1107 EKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAE--PARENDPTVNIKEPQSQTNT 1164

Query: 192 EAEAE--AQRIAEEQAARLAEQQAAEVARLAAEQAQAEQLAAEQAEAERVAAEAQAEAER 249
A+ E A+ + + E E + A Q ++ R
Sbjct: 1165 TADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHR 1224

Query: 250 VAAAQVQAEQPLEQQPEPQAKPA 272
+ V
Sbjct: 1225 RSVRSVPHNVEPATTSSNDRSTV 1247



Score = 68.6 bits (167), Expect = 4e-14
Identities = 37/210 (17%), Positives = 72/210 (34%), Gaps = 18/210 (8%)

Query: 66 KAEAEAQRVAEEQAARIAEQQAAEAAHLAAEQALAEQLAAEQAQAERVAAEQAAKAQAEA 125
+ E Q V QA + + E++A A E
Sbjct: 984 EVEKRNQTVDTTNITTPNNIQADVPS----VPSNNEEIARVDEAPVPPPAPATPSETTET 1039

Query: 126 EAEALRIAEEQAARLAEQQAAEAAHLAAEQALAEQLAAEQAETERVAAEQAAKAQAEAEA 185
AE + E + EQ A E + A++A++ A Q +
Sbjct: 1040 VAENSK-QESKTVEKNEQDATETT-------AQNREVAKEAKSNVKANTQTNEVAQSGSE 1091

Query: 186 EAEAEAEAEAEAQRIAEEQAARLAEQQAAEVARLAAEQAQAEQLAAEQAEAERVAAEAQA 245
E + E + +E+ A++ ++ EV ++ + Q++ +Q ++E V +A+
Sbjct: 1092 TKETQTTETKETATVEKEEKAKVETEKTQEVPKVTS------QVSPKQEQSETVQPQAEP 1145

Query: 246 EAERVAAAQVQAEQPLEQQPEPQAKPAKES 275
E ++ Q +PAKE+
Sbjct: 1146 ARENDPTVNIKEPQSQTNTTADTEQPAKET 1175



Score = 59.7 bits (144), Expect = 2e-11
Identities = 41/238 (17%), Positives = 75/238 (31%), Gaps = 17/238 (7%)

Query: 18 EVVEQTPVSTPSQTEQAEALAKQHAEEARLAAEKAAAEQALADKLAAEKAEAEAQRVAEE 77
E EQ T +Q + AK + + E A + K E V +E
Sbjct: 1052 EKNEQDATETTAQNREVAKEAKSNVKANTQTNEVA--QSGSETKETQTTETKETATVEKE 1109

Query: 78 QAARI-AEQQAAEAAHLAAEQALAEQLAAEQAQAERVAAEQAAKAQAEAEAEALRIAEEQ 136
+ A++ E+ + EQ Q QAE E +++ A+
Sbjct: 1110 EKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTAD-- 1167

Query: 137 AARLAEQQAAEAAHLAAEQALAEQLAAEQAETERVAAEQAAKAQAEAEAEAEAEAEAEAE 196
+Q A+ EQ + E + E A + +E+ + +
Sbjct: 1168 -----TEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNR 1222

Query: 197 AQRIAEEQAARLAEQQAAEVARLAAEQAQAEQLA-AEQAEAERVAAEAQAEAERVAAA 253
+R + + E A ++ L V ++A+A+A+ VA
Sbjct: 1223 HRR------SVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALN 1274


60  Shewmr4_3790  Shewmr4_3797  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
Shewmr4_3790  2       16       2.586314    diguanylate cyclase with PAS/PAC sensor
Shewmr4_3791  2       15       1.990234    hypothetical protein
Shewmr4_3792  2       16       1.249597    phosphate transporter
Shewmr4_3793  2       17       1.325941    hypothetical protein
Shewmr4_3794  2       18       1.226132    glucan biosynthesis protein D
Shewmr4_3795  2       20       0.668130    carbon starvation protein CstA
Shewmr4_3796  2       17       1.083581    GCN5-related N-acetyltransferase
Shewmr4_3797  2       18       1.688590    LrgB family protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3791  PF05616       31     0.009    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 31.3 bits (70), Expect = 0.009
Identities = 19/73 (26%), Positives = 33/73 (45%), Gaps = 7/73 (9%)

Query: 344 IAAVVTYERNAWGNNTGDA--VQAKDVDAHKSGGTNSEPVATTPPPATTDAPKPATEPAA 401
+ V T+ R++ GN T D + D+ + N++P+ P + A PA PA
Sbjct: 289 VQVVATFGRDSQGNTTVDVQVIPRPDLTPGSAEAPNAQPL-----PEVSPAENPANNPAP 343

Query: 402 SVDPASLPTLSHD 414
+ +P + P D
Sbjct: 344 NENPGTRPNPEPD 356


61  Shewmr4_3886  Shewmr4_3892  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
Shewmr4_3886  -2      19       -5.518740   Ion transport 2 domain-containing protein
Shewmr4_3887  0       24       -7.956982   adenylate cyclase
Shewmr4_3888  3       35       -12.506240  hypothetical protein
Shewmr4_3889  2       31       -11.253685  phosphate transporter
Shewmr4_3890  3       31       -11.056342  SH3 domain-containing protein
Shewmr4_3891  2       25       -8.299455   bifunctional proline
Shewmr4_3892  2       21       -4.238758   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3886  NUCEPIMERASE  566    0.0      Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 566 bits (1460), Expect = 0.0
Identities = 232/333 (69%), Positives = 265/333 (79%), Gaps = 1/333 (0%)

Query: 1 MKYLVTGAAGFIGAKVSERLCAQGHEVVGIDNLNDYYDVGLKLARLAPLEALSNFRFIKL 60
MKYLVTGAAGFIG VS+RL GH+VVGIDNLNDYYDV LK ARL L F+F K+
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQ-PGFQFHKI 59

Query: 61 DLADRDGIAALFAEQGFQRVIHLAAQAGVRYSLDNPLAYADSNLVGHLTILEGCRHHKIE 120
DLADR+G+ LFA F+RV + VRYSL+NP AYADSNL G L ILEGCRH+KI+
Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQ 119

Query: 121 HLVYASSSSVYGLNQKMPFSTEDSVDHPISLYAATKKANELMSHTYSHLYQLPTTGLRFF 180
HL+YASSSSVYGLN+KMPFST+DSVDHP+SLYAATKKANELM+HTYSHLY LP TGLRFF
Sbjct: 120 HLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFF 179

Query: 181 TVYGPWGRPDMALFKFTKAILAGDTIDVYNHGDLSRDFTYIDDIVEGIIRVQDKPPRPTP 240
TVYGPWGRPDMALFKFTKA+L G +IDVYN+G + RDFTYIDDI E IIR+QD P
Sbjct: 180 TVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHADT 239

Query: 241 DWRVETGTPANSSAPYRVFNIGNGSPVQLLDFITALESALGIEAKKQFLPMQPGDVHSTW 300
W VETGTPA S APYRV+NIGN SPV+L+D+I ALE ALGIEAKK LP+QPGDV T
Sbjct: 240 QWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQPGDVLETS 299

Query: 301 ADTEDLFKAVGYKPQVDINTGVSRFVEWYRAFY 333
ADT+ L++ +G+ P+ + GV FV WYR FY
Sbjct: 300 ADTKALYEVIGFTPETTVKDGVKNFVNWYRDFY 332


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3888  LPSBIOSNTHSS  226    3e-79    Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature.

Length = 166

Score = 226 bits (578), Expect = 3e-79
Identities = 80/155 (51%), Positives = 113/155 (72%)

Query: 5 AIYPGTFDPITNGHADLIERAAKLFKHVIIGIAANPSKQPRFTLEERVELVNRVTAHLDN 64
AIYPG+FDPIT GH D+IER +LF V + + NP+KQP F+++ER+E + + AHL N
Sbjct: 3 AIYPGSFDPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPMFSVQERLEQIAKAIAHLPN 62

Query: 65 VEVVGFSGLLVDFAKEQRASVLVRGLRAVSDFEYEFQLANMNRRLSPDLESVFLTPAEEN 124
+V F GL V++A++++A ++RGLR +SDFE E Q+AN N+ L+ DLE+VFLT + E
Sbjct: 63 AQVDSFEGLTVNYARQRQAGAILRGLRVLSDFELELQMANTNKTLASDLETVFLTTSTEY 122

Query: 125 SFISSTLVKEVALHGGDVSQFVHPEVASALAAKLN 159
SF+SS+LVKEVA GG+V FV VA+AL + +
Sbjct: 123 SFLSSSLVKEVARFGGNVEHFVPSHVAAALYDQFH 157


62  Shewmr4_3920  Shewmr4_3928  Y  N  N  Genomic Island
LocusTag      DNBias  CDNBias  %GCBias     Product
Shewmr4_3920  2       21       -1.140738   hypoxanthine phosphoribosyltransferase
Shewmr4_3921  3       24       -1.493916   aromatic acid decarboxylase
Shewmr4_3922  4       33       -1.523802   UDP-N-acetylmuramate
Shewmr4_3923  6       39       -0.410550   hypothetical protein
Shewmr4_3924  9       47       -1.212264   hypothetical protein
Shewmr4_3925  9       50       -1.221712   sterol desaturase family protein
Shewmr4_3926  6       42       -1.939366   hypothetical protein
Shewmr4_3927  5       40       -2.434260   OmpA domain-containing protein
Shewmr4_3928  3       27       -3.535316   hypothetical protein
63  Shewmr4_0053  Shewmr4_0062  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
Shewmr4_0053  -2      17       1.747748    rhodanese domain-containing protein
Shewmr4_0054  -2      14       0.161842    preprotein translocase subunit SecB
Shewmr4_0055  -1      13       -0.330195   NAD(P)H-dependent glycerol-3-phosphate
Shewmr4_0056  -2      13       -1.442411   NAD(P)H-dependent glycerol-3-phosphate
Shewmr4_0057  1       14       0.132636    hypothetical protein
Shewmr4_0058  1       13       0.171821    hypothetical protein
Shewmr4_0059  1       13       -0.223367   hypothetical protein
Shewmr4_0060  2       15       0.519356    TrkH family potassium uptake protein
Shewmr4_0061  0       10       0.234671    TrkA domain-containing protein
Shewmr4_0062  0       12       0.185973    two component transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0053  HTHFIS        92     9e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 92.2 bits (229), Expect = 9e-24
Identities = 37/112 (33%), Positives = 60/112 (53%), Gaps = 1/112 (0%)

Query: 4 KILVVDDEPQIHTFMRISLEAEGFEYLSATSIATALKQYQSHQPHLIVLDLGLPDGDGIE 63
ILV DD+ I T + +L G++ ++ AT + + L+V D+ +PD + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 64 LLHGLRQQDK-TPVLVLTARDQEEEKIRLLEAGANDYLSKPFGIRELIVRIK 114
LL +++ PVLV++A++ I+ E GA DYL KPF + ELI I
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0056  PF06580       202    4e-62    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 202 bits (514), Expect = 4e-62
Identities = 59/205 (28%), Positives = 109/205 (53%), Gaps = 13/205 (6%)

Query: 351 EQLQEMTRKAEFTALQSKINPHFLFNALNAISSLIRIRPQQARELIANLADYLRYNLDKG 410
++ M ++A+ AL+++INPHF+FNALN I +LI P +ARE++ +L++ +RY+L
Sbjct: 152 WKMASMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKAREMLTSLSELMRYSLRYS 211

Query: 411 D-ELIDIQEEVKQVRDYVAIEQARYGDKLEVVFDVDD--VHFCVPCLLLQPLVENAILHG 467
+ + + +E+ V Y+ + ++ D+L+ ++ + VP +L+Q LVEN I HG
Sbjct: 212 NARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHG 271

Query: 468 IQPRSAPGRVTIEVKKLDAGIRVAVRDTGYGISQEVIDGVAAGRIESSSIGLTNVHQRVK 527
I G++ ++ K + + + V +TG ES+ GL NV +R++
Sbjct: 272 IAQLPQGGKILLKGTKDNGTVTLEVENTG--------SLALKNTKESTGTGLQNVRERLQ 323

Query: 528 LLYGE--GLQLKRLNPGTEVSFYLP 550
+LYG ++L +P
Sbjct: 324 MLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
Shewmr4_0057HTHFIS683e-15 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 67.5 bits (165), Expect = 3e-15
Identities = 24/132 (18%), Positives = 59/132 (44%), Gaps = 6/132 (4%)

Query: 3 KAIIVEDEYLAREELE-YLVKSHSEIDIVASFEDGLEAFKYLQDHEVDVVFLDIQIPSID 61
++ +D+ R L L ++ ++ I ++ + + D+V D+ +P +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRW---IAAGDGDLVVTDVVMPDEN 61

Query: 62 GLLLAKNLHKSTHPPHVVFVTAHKEF--AVEAFELEAFDYILKPYNEPRIISLLQKIEQV 119
L + K+ V+ ++A F A++A E A+DY+ KP++ +I ++ +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 120 GRQAPKPQHEAA 131
++ P + +
Sbjct: 122 PKRRPSKLEDDS 133


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0058  TCRTETA       33     0.002    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 33.3 bits (76), Expect = 0.002
Identities = 65/350 (18%), Positives = 127/350 (36%), Gaps = 42/350 (12%)

Query: 43 PVSQVAFVFGLL----SLSLAVASSMAGKLQERFGVRNVTLGAGVLLGLGFLLTAQASNL 98
+ V +G+L +L + + G L +RFG R V L + + + + A A L
Sbjct: 37 HSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFL 96

Query: 99 MMLYLCAGILVGFADGTGY--------LMTLSNCVKWFPERKGLISALAIGAYGLGSLGF 150
+LY+ I+ G TG + + F G +SA +G G +
Sbjct: 97 WVLYI-GRIVAGITGATGAVAGAYIADITDGDERARHF----GFMSA----CFGFGMVAG 147

Query: 151 KYINMLLLENTGLETTFQLWGLIAMALVLCGGMLMKDA------PAQSAASQQAESRDFT 204
+ L+ F + L G L+ ++ P + A S F
Sbjct: 148 PVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLAS--FR 204

Query: 205 LAEAMSKPQYWMLALMFLSACMSG----LYVIGVAKDIGEKMVDLPVLVAANAVAVIAMA 260
A M ++A+ F+ + L+VI + + +AA + +
Sbjct: 205 WARGM-TVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGI----LH 259

Query: 261 NLCGRLVLGILSDKIPRIRVISLAQIITLVGMVLLLFVPLNANLFFVAVACVAFSFGGTI 320
+L ++ G ++ ++ R + L I G +LL F + F + +A G +
Sbjct: 260 SLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFA-TRGWMAFPIMVLLAS-GGIGM 317

Query: 321 TVYPSLVSDFFGLNNLTKNYGVIYLGFGIGSIIGS-IVASLFGGFIATFN 369
+++S + G + + SI+G + +++ I T+N
Sbjct: 318 PALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASITTWN 367


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0062  TCRTETB       118    4e-31    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 118 bits (298), Expect = 4e-31
Identities = 76/400 (19%), Positives = 159/400 (39%), Gaps = 20/400 (5%)

Query: 30 FLAAVDQTLLATATPAIVEDLGGLR-QASWITIGYMLAMAASVPIYGWLGDNYGRAKILM 88
F + +++ +L + P I D +W+ +ML + +YG L D G ++L+
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 89 IALVVFALGSIVSA-SAGTMDHMIAGRILQGLGGGGLMSLSQSLVGELVPIRQRARFQGY 147
+++ GS++ +I R +QG G +L +V +P R + G
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 148 FAAMFTLASVGGPVIGGFVVHAYSWHWLFWANIPLV-MLAVWRLNRLHKQSVKPVRQGRF 206
++ + GP IGG + H W +L IP++ ++ V L +L K+ V+ +G F
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPMITIITVPFLMKLLKKEVR--IKGHF 199

Query: 207 DLLGVLLFPTIITALLYWLSVAGQDFAWLSATSLGFMGFICVGALVLLWWERRRESPFLP 266
D+ G++L I + + + F +S S + + R+ PF+
Sbjct: 200 DIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFL----------IFVKHIRKVTDPFVD 249

Query: 267 LDLLANKAIYMPLFTAALFAACLFAMIFFLPIYLQVGLHTNPAKTGLLLM-PMTFGIVTG 325
L N + + + + + +P ++ + A+ G +++ P T ++
Sbjct: 250 PGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIF 309

Query: 326 STIAGRLLSRDVAPKWLPTFGMGLAFIGLLLIGLVPPNANLIGALGV-LVGIGLGTVMPS 384
I G L+ R P ++ G+ + L + + + + V GL
Sbjct: 310 GYIGGILVDR-RGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTV 368

Query: 385 VQLVVQSVSGKARLSQITAMVSLSRSMGAAIGTALFSLLL 424
+ +V S + ++++ + + G A+ LL
Sbjct: 369 ISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLL 408


64  Shewmr4_0155  Shewmr4_0162  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
Shewmr4_0155  0       16       0.872290    peptidase M6, immune inhibitor A
Shewmr4_0156  0       18       1.163644    hypothetical protein
Shewmr4_0157  -1      19       1.558542    chorismate lyase
Shewmr4_0158  -1      17       0.453695    flagellar basal body-associated protein
Shewmr4_0159  -1      16       0.284703    flagellar basal body-associated protein
Shewmr4_0160  -2      14       0.872332    putative SAM-dependent methyltransferase
Shewmr4_0161  -1      12       0.453297    hypothetical protein
Shewmr4_0162  -1      13       0.653455    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0155  BCTERIALGSPC  182    3e-58    Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C signature.

Length = 272

Score = 182 bits (463), Expect = 3e-58
Identities = 72/295 (24%), Positives = 137/295 (46%), Gaps = 33/295 (11%)

Query: 8 IAKAAGIPHKPLSQIVFWFGFILSLLLAAQITWKLVPTTSSPTAWSPTAVTTTGKGAGQI 67
I+K + + +I+F+ +L A I W++ ++P + +V T A Q
Sbjct: 3 ISKLPPLSPSVIRRILFYLLMLLFCQQLAMIFWRIGLPDNAPVS----SVQITPAQARQ- 57

Query: 68 DMAGLQQLALFGKADAKSDKPKVEVVETVTDAPKTSLSIQLTGVVASTADQKGLAVIESS 127
L LFG + K+ ++ +++ P ++L++ LTGV+A D + +A+I
Sbjct: 58 QPVTLNDFTLFGVSPEKNKAGALDA-SQMSNLPPSTLNLSLTGVMAGDDDSRSIAIISKD 116

Query: 128 GSQETYSLGDKIKGTSASLKEVYADRIIITNAGRYETLMLDGLVYTSQSPANQQLQKAKS 187
Q + + +++ G +A + + DR+++ GRYE L L +
Sbjct: 117 NEQFSRGVNEEVPGYNAKIVSIRPDRVVLQYQGRYEVLGLYSQEDSGSDGVPG------- 169

Query: 188 EKAEVVSRVDQRKNTEISQELAESRSELLADPSKITDYIAISPVRQGENVVGYRLNPGKD 247
A+V ++ QR + ++DY++ SP+ + GYRLNPG
Sbjct: 170 --AQVNEQLQQR------------------ASTTMSDYVSFSPIMNDNKLQGYRLNPGPK 209

Query: 248 VNLFKQAGFKPNDLAKSINGYDLTVMSQALEMMSQLPELTEVSIMVEREGQLVEI 302
+ F + G + ND+A ++NG DL QA + M ++ ++ ++ VER+GQ +I
Sbjct: 210 SDSFYRVGLQDNDMAVALNGLDLRDAEQAKKAMERMADVHNFTLTVERDGQRQDI 264


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0156  BCTERIALGSPD  598    0.0      Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 598 bits (1542), Expect = 0.0
Identities = 326/678 (48%), Positives = 444/678 (65%), Gaps = 31/678 (4%)

Query: 6 IRRKLIAGVVAGATMLTSQFVWSEQYAANFKGTDIQEFINIVGKNLNKTIIVDPTIRGKI 65
IR + ++ A + +E+++A+FKGTDIQEFIN V KNLNKT+I+DP++RG I
Sbjct: 7 IRSFSLTLLIFAALLFRP--AAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTI 64

Query: 66 NVRSYDLLNDEQYYQFFLNVLQVYGYAIVEMENNVIKVIKDKDAKTAAIRVANDNDPGLG 125
VRSYD+LN+EQYYQFFL+VL VYG+A++ M N V+KV++ KDAKTAA+ VA+D PG+G
Sbjct: 65 TVRSYDMLNEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIG 124

Query: 126 DEMVTRIVALYNTEAKQLAPLLRQLNDNAGGGNVVNYDPSNVLMLSGRAAVVNKLVEIVR 185
DE+VTR+V L N A+ LAPLLRQLNDNAG G+VV+Y+PSNVL+++GRAAV+ +L+ IV
Sbjct: 125 DEVVTRVVPLTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVE 184

Query: 186 RVDKQGDTSVQVVPLEYASAGEMVRIIDTLYRATANQAQLPGQAPKVVADERINAVVVSG 245
RVD GD SV VPL +ASA ++V+++ L + T+ A VVADER NAV+VSG
Sbjct: 185 RVDNAGDRSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSG 244

Query: 246 DEKSRQRVVELIHRLDAEQASTGNTKVRYLRYAKAEDLVEVLTGFAQKLEGEKDPSAQAG 305
+ SRQR++ +I +LD +QA+ GNTKV YL+YAKA DLVEVLTG + ++ EK +
Sbjct: 245 EPNSRQRIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPVA 304

Query: 306 GKRRNEINIMAHTDTNALVISAEPDQMRTIESVINQLDIRRAQVLVEAIIVEVAEGDNVG 365
+N I I AH TNAL+++A PD M +E VI QLDIRR QVLVEAII EV + D +
Sbjct: 305 ALDKN-IIIKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLN 363

Query: 366 FGVQWAAKAGGGTQFNNLGPTIGEIGAGIWQAQGEDGTTVCTENGTCTENPDSRGDVTLL 425
G+QWA K G TQF N G I AG Q + + + L
Sbjct: 364 LGIQWANKNAGMTQFTNSGLPISTAIAGANQYNKDGTVS------------------SSL 405

Query: 426 AQALGKVNGMAWGVAMGDFGALIQAVSADTNSNVLATPSITTLDNQEASFIVGDEVPILT 485
A AL NG+A G G++ L+ A+S+ T +++LATPSI TLDN EA+F VG EVP+LT
Sbjct: 406 ASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLT 465

Query: 486 GSTASSNNSNPFQTVERKEVGVKLKVVPQINEGNAVKLAIEQEVSGVNG-----NTGVDI 540
GS +++ N F TVERK VG+KLKV PQINEG++V L IEQEVS V ++ +
Sbjct: 466 GSQ-TTSGDNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGA 524

Query: 541 SFATRRLTTTVMADSGQIVVLGGLINEEVQESVQKVPFLGDIPVLGHLFKSSSSKKTKKN 600
+F TR + V+ SG+ VV+GGL+++ V ++ KVP LGDIPV+G LF+S+S K +K+N
Sbjct: 525 TFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRN 584

Query: 601 LMIFIKPTIIRDGVTMEGIAGRKYNYFRALQLEQ--QERGVNLMPNTQVPILEEWNQSEY 658
LM+FI+PT+IRD + +Y F Q +Q +E ++ + I Q
Sbjct: 585 LMLFIRPTVIRDRDEYRQASSGQYTAFNDAQSKQRGKENNDAMLNQDLLEIYP--RQDTA 642

Query: 659 LPPEVNDILDRYKEGKGL 676
+V+ +D + G L
Sbjct: 643 AFRQVSAAIDAFNLGGNL 660


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0158  BCTERIALGSPF  504    0.0      Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 504 bits (1299), Expect = 0.0
Identities = 228/407 (56%), Positives = 305/407 (74%), Gaps = 1/407 (0%)

Query: 1 MPAFEYKALDAKGKQLKGVIEADTARHARSQLRDQRMMPLEILPVTEKEAKAKGTGFS-P 59
M + Y+ALDA+GK+ +G EAD+AR AR LR++ ++PL + + K+ TG S
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 60 FKRGISVAELALITRQIATLVAAGLPIEECLKAVGQQCEKARLASMIMAVRSRVVEGYSL 119
K +S ++LAL+TRQ+ATLVAA +P+EE L AV +Q EK L+ ++ AVRS+V+EG+SL
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 120 ADSLAEFPHIFDDLYRAMVASGEKSGHLEVVLNRLADYTERRQQLKSKLQQAMIYPIMLT 179
AD++ FP F+ LY AMVA+GE SGHL+ VLNRLADYTE+RQQ++S++QQAMIYP +LT
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 180 IVAIGVVSVLLAAVVPKVVGQFEHMGAELPATTRFLIAASDFVQSYGLLVVLIIGILLVV 239
+VAI VVS+LL+ VVPKVV QF HM LP +TR L+ SD V+++G ++L + +
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240

Query: 240 FQRLLKSPIFKMKYHTFLLKMPVVGRVSKGLNTARFARTLSILSASSVPLLDGMRIASEV 299
F+ +L+ ++ +H LL +P++GR+++GLNTAR+ARTLSIL+AS+VPLL MRI+ +V
Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 300 LQNVRVRAAVDDATARVREGTSLSTALTNTKLFPAMMLYMIASGEKSGQLEDMLERAADN 359
+ N R + AT VREG SL AL T LFP MM +MIASGE+SG+L+ MLERAADN
Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360

Query: 360 QDREFESNVTLALGVFEPALVVSMAGVVLFIVMAILQPILALNNLIS 406
QDREF S +TLALG+FEP LVVSMA VVLFIV+AILQPIL LN L+S
Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLMS 407


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0159  BCTERIALGSPG  230    2e-81    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 230 bits (588), Expect = 2e-81
Identities = 98/144 (68%), Positives = 118/144 (81%)

Query: 1 MQMNKKHKGFTLLEVMVVIVILGILASMVVPNLMGNKDKADQQKAVSDIVALENALDMYK 60
M+ K +GFTLLE+MVVIVI+G+LAS+VVPNLMGNK+KAD+QKAVSDIVALENALDMYK
Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYK 60

Query: 61 LDNGIYPTTEQGLEALVQKPTISPEPRNYRDEGYVKRLPQDPWRNNYLLLSPGENSKLDI 120
LDN YPTT QGLE+LV+ PT+ P NY EGY+KRLP DPW N+Y+L++PGE+ D+
Sbjct: 61 LDNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDL 120

Query: 121 FSAGPDGQPGTEDDIGNWNLQNFQ 144
SAGPDG+ GTEDDI NW L +
Sbjct: 121 LSAGPDGEMGTEDDITNWGLSKKK 144


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0160  BCTERIALGSPH  61     2e-14    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 61.5 bits (149), Expect = 2e-14
Identities = 35/154 (22%), Positives = 56/154 (36%), Gaps = 39/154 (25%)

Query: 1 MGLTAAAVTMSIGNSGPQQALEKTAQQFIAATELVLDETVLSGQFIGIVVEKTSYQFVYY 60
MG++A V ++ S A + T +F A V + +GQF G+ V +QF+
Sbjct: 18 MGVSAGMVLLAFPASRDDSAAQ-TLARFEAQLRFVQQRGLQTGQFFGVSVHPDRWQFLVL 76

Query: 61 KDG---------------KWNPLEKDRILSEKQMEPGVVINLVLDGLPLVQEDEQDESWF 105
+ +W PL R+ + + G L Q E+W
Sbjct: 77 EARDGADPAPADDGWSGYRWLPLRAGRVATSGS----------IAGGKLNLAFAQGEAW- 125

Query: 106 DEPLIEPSAEDKKKHPEPQILLFPSGEMSAFELS 139
P +L+FP GEM+ F L+
Sbjct: 126 ------------TPGDNPDVLIFPGGEMTPFRLT 147


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0161  PilS_PF08805  31     0.001    PilS N terminal
		>PilS_PF08805#PilS N terminal

Length = 185

Score = 30.7 bits (69), Expect = 0.001
Identities = 11/32 (34%), Positives = 17/32 (53%)

Query: 5 KGMTLLEVIVALAVFAIAAVSITKSLGEQMAN 36
KG TL+EV++ + V + A S K +N
Sbjct: 26 KGATLMEVLLVVGVIVVLAASAYKLYSMVQSN 57


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0162  BCTERIALGSPG  36     6e-05    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 35.6 bits (82), Expect = 6e-05
Identities = 15/41 (36%), Positives = 28/41 (68%), Gaps = 3/41 (7%)

Query: 3 LKQTNAQKGFTLLEMLIAIAIFAMLGLAANAVLSTVLTNDE 43
++ T+ Q+GFTLLE+++ I I +G+ A+ V+ ++ N E
Sbjct: 1 MRATDKQRGFTLLEIMVVIVI---IGVLASLVVPNLMGNKE 38


65  Shewmr4_0248  Shewmr4_0261  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_0248  -1      13       2.803405   50S ribosomal protein L23
Shewmr4_0249  0       14       2.886672   50S ribosomal protein L2
Shewmr4_0250  0       14       2.246622   30S ribosomal protein S19
Shewmr4_0251  -1      14       0.397297   50S ribosomal protein L22
Shewmr4_0252  -2      14       0.832936   30S ribosomal protein S3
Shewmr4_0253  -2      14       1.387664   50S ribosomal protein L16
Shewmr4_0254  -2      16       0.993668   50S ribosomal protein L29
Shewmr4_0255  -1      16       0.605468   30S ribosomal protein S17
Shewmr4_0256  -2      16       0.545178   50S ribosomal protein L14
Shewmr4_0257  -1      18       1.113809   50S ribosomal protein L24
Shewmr4_0258  0       17       -0.443955  50S ribosomal protein L5
Shewmr4_0259  0       17       -1.351745  30S ribosomal protein S14
Shewmr4_0260  -1      16       -1.457294  30S ribosomal protein S8
Shewmr4_0261  -1      17       -0.689318  50S ribosomal protein L6
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0248  DHBDHDRGNASE  93     8e-25    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 93.2 bits (231), Expect = 8e-25
Identities = 71/257 (27%), Positives = 116/257 (45%), Gaps = 16/257 (6%)

Query: 6 IALITGASRGLGKNAVLTLAAQGVDIILTYQSNAAAAAEVVAEIEWHGRKAVALPLDVGN 65
IA ITGA++G+G+ TLA+QG I N +VV+ ++ R A A P DV +
Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHI-AAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 66 SQSFSDFSQRVKTALEQTWQRDSFNYLVNNAGIGIHVPMAETSMEQFDTLMNIHVKGPFF 125
S + + + R++ + + LVN AG+ + S E+++ +++ G F
Sbjct: 69 SAAIDEITARIEREMGP------IDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 126 LTQALLPLLAD--NGSIINVSTGLTRFAVPGFGAYATMKGAVETMTKYWAKELGPRGIRV 183
++++ + D +GSI+ V + AYA+ K A TK EL IR
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 184 NVLAPGAIETDF-------GGGAVRDNRQMNEFLAQQTALGRVGLPEDIGGAISVLLSPA 236
N+++PG+ ETD GA + + E L ++ P DI A+ L+S
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 237 AAWINAQRIEASGGMFL 253
A I + GG L
Sbjct: 243 AGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0250  HTHFIS        343    e-114    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 343 bits (882), Expect = e-114
Identities = 121/375 (32%), Positives = 200/375 (53%), Gaps = 17/375 (4%)

Query: 259 FHRDSALHVQTQALALTQTKSTRTLQDKPSNQLGVRFRDPLLERAWQQANKVITKQIPLL 318
F + + +ALA + + ++ D + R ++ ++ +++ + L+
Sbjct: 106 FDLTELIGIIGRALAEPKRRPSKLEDDSQDGM-PLVGRSAAMQEIYRVLARLMQTDLTLM 164

Query: 319 VLGETGVGKEQFVKKLHAQSARRAQPLVAVNCAALPAELVESELFGYQAGAFTGANRTGF 378
+ GE+G GKE + LH RR P VA+N AA+P +L+ESELFG++ GAFTGA
Sbjct: 165 ITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRS- 223

Query: 379 IGKIRQAHGGFLFLDEIGEMPLAAQSRLLRVLQEREVVPVGSNQSFKVDIQIIAATHMDL 438
G+ QA GG LFLDEIG+MP+ AQ+RLLRVLQ+ E VG + D++I+AAT+ DL
Sbjct: 224 TGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDL 283

Query: 439 ESLVAQGLFRQDLFYRLNGLQVRLPALRERQ-DIERIIH---KLHRRHRSSAQTLCTELL 494
+ + QGLFR+DL+YRLN + +RLP LR+R DI ++ + + + E L
Sbjct: 284 KQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEAL 343

Query: 495 AQLMRYDWPGNLRELDNLMQVACLMAEGEAVLEINHLPDYLAQKLMNLAFEPQTLTEVVD 554
+ + WPGN+REL+NL++ + + V+ + + L ++ + E
Sbjct: 344 ELMKAHPWPGNVRELENLVRRLTALYPQD-VITREIIENELRSEIPDSPIEKAAARSGSL 402

Query: 555 AETTAHPHELSESSSTTIDSLHGTINLN----------VLQAYRACEGNVSQCAKRLGIS 604
+ + A + + ++ D+L + + +L A A GN + A LG++
Sbjct: 403 SISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLN 462

Query: 605 RNALYRKLKQLGVKD 619
RN L +K+++LGV
Sbjct: 463 RNTLRKKIRELGVSV 477


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0251  PF06580       38     4e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 38.3 bits (89), Expect = 4e-05
Identities = 25/121 (20%), Positives = 52/121 (42%), Gaps = 12/121 (9%)

Query: 274 IAYEAEQLEKLIAELLELSRVKLSTNETKVRLGLAESLSQVLDDAEFEADQQGKKIT--I 331
I + + +++ L EL R L + + + LA+ L+ V + + Q ++
Sbjct: 186 ILEDPTKAREMLTSLSELMRYSLRYSNARQ-VSLADELTVVDSYLQLASIQFEDRLQFEN 244

Query: 332 DIDEAIELGHYPKSLSRAIENLLRNAIRYA------QSDIHLRASQTNGQVQITIKDDGP 385
I+ AI P L ++ L+ N I++ I L+ ++ NG V + +++ G
Sbjct: 245 QINPAIMDVQVPPML---VQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGS 301

Query: 386 G 386

Sbjct: 302 L 302


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0252  HTHFIS        97     2e-25    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 96.8 bits (241), Expect = 2e-25
Identities = 46/163 (28%), Positives = 77/163 (47%), Gaps = 3/163 (1%)

Query: 2 SRILLIDDDLGLSELLGQLLELEGFKLTLAYDGKQGLELALAGDYDLILLDVMLPKLNGF 61
+ IL+ DDD + +L Q L G+ + + + AGD DL++ DV++P N F
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 EVLRALRQH-KQTPVLMLTARGDEIDRVVGLEIGADDYLPKPFNDRELIARIRAIIRRSH 120
++L +++ PVL+++A+ + + E GA DYLPKPF+ ELI I +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 121 LTAQEIHATPAQEFGDLRLDPSRQEAYCNEQLIILTGTEFTLL 163
++ + + QE Y L L T+ TL+
Sbjct: 124 RRPSKLEDDSQDGMPLVGRSAAMQEIY--RVLARLMQTDLTLM 164


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0255  PF01206       27     0.008    SirA family protein
		>PF01206#SirA family protein

Length = 76

Score = 27.0 bits (60), Expect = 0.008
Identities = 6/35 (17%), Positives = 14/35 (40%)

Query: 90 PLLMWRSRVTCAQSGKVVIVECLDERKRRSLIRWC 124
P+L + + +G+V+ V D + +
Sbjct: 18 PILKAKKTLATMNAGEVLYVMATDPGSVKDFESFS 52


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0256  OMPADOMAIN    69     2e-16    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 69.2 bits (169), Expect = 2e-16
Identities = 50/202 (24%), Positives = 69/202 (34%), Gaps = 28/202 (13%)

Query: 1 MKKLSLVAVTLLSALVAGQASAATDNTGFYVGGAL-------NRVTVDAFDDSETGTGFG 53
MKK +A+ + A A A AA + +Y G L + E G G
Sbjct: 1 MKKT-AIAIAVALAGFATVAQAAPKDNTWYTGAKLGWSQYHDTGFINNNGPTHENQLGAG 59

Query: 54 VYGGYNFNEWFGLEANFF----ATADLGDSDVDISAGALTFTPKFTLQINDMFSAYAKVG 109
+GGY N + G E + + A + T K I D Y ++G
Sbjct: 60 AFGGYQVNPYVGFEMGYDWLGRMPYKGSVENGAYKAQGVQLTAKLGYPITDDLDIYTRLG 119

Query: 110 VA----SMAVNVDGLGFDEDFTGFGWTYGVGVNAAVTEHLNVRLSYDITT--GDLDADHS 163
NV G D TG + GV A+T + RL Y T GD
Sbjct: 120 GMVWRADTKSNVYGKNHD---TGVSPVFAGGVEYAITPEIATRLEYQWTNNIGDAHTI-- 174

Query: 164 YLNMKDIDTDMKQLAIGVHYQF 185
D L++GV Y+F
Sbjct: 175 -----GTRPDNGMLSLGVSYRF 191


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0257  HTHFIS        557    0.0      FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 557 bits (1438), Expect = 0.0
Identities = 198/474 (41%), Positives = 294/474 (62%), Gaps = 12/474 (2%)

Query: 7 VWILDDDSSIRWVLEKALQGAKLSTASFAAAESLWQALEISQPRVIVSDIRMPGTDGLTL 66
+ + DDD++IR VL +AL A + A +LW+ + ++V+D+ MP + L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 67 LERLQIHYPHIPVIIMTAHSDLDSAVSAYQAGAFEYLPKPFDIDEAISLVERALTHATEQ 126
L R++ P +PV++M+A + +A+ A + GA++YLPKPFD+ E I ++ RAL +
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE--PK 123

Query: 127 SPSPAPQETQVKTPEIIGEAPAMQEVFRAIGRLSRSSISVLINGQSGTGKELVAGALHKH 186
++ ++G + AMQE++R + RL ++ ++++I G+SGTGKELVA ALH +
Sbjct: 124 RRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDY 183

Query: 187 SPRKDKPFIALNMAAIPKDLIESELFGHEKGAFTGAANVRQGRFEQANGGTLFLDEIGDM 246
R++ PF+A+NMAAIP+DLIESELFGHEKGAFTGA GRFEQA GGTLFLDEIGDM
Sbjct: 184 GKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDM 243

Query: 247 PLDVQTRLLRVLADGQFYRVGGHSAVQVDVRIIAATHQDLEQLVLKGGFREDLFHRLNVI 306
P+D QTRLLRVL G++ VGG + ++ DVRI+AAT++DL+Q + +G FREDL++RLNV+
Sbjct: 244 PMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVV 303

Query: 307 RVHLPPLSQRREDIPQLATHFLASAAKEIGVEAKILTKETAAKLSQLPWPGNVRQLENTC 366
+ LPPL R EDIP L HF+ A KE G++ K +E + PWPGNVR+LEN
Sbjct: 304 PLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALELMKAHPWPGNVRELENLV 362

Query: 367 RWLTVMASGQEILPQDLPPELLKEPASINPMAKGSQDWQSALTEWIDQKLSE-------- 418
R LT + I + + EL E ++ ++++ +++ + +
Sbjct: 363 RRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDA 422

Query: 419 -GNSDLLTEVQPAFERILLETALRHTQGHKQEAAKRLGWGRNTLTRKLKELSMD 471
S L V E L+ AL T+G++ +AA LG RNTL +K++EL +
Sbjct: 423 LPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0258  PF06580       41     5e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 40.6 bits (95), Expect = 5e-06
Identities = 35/187 (18%), Positives = 72/187 (38%), Gaps = 33/187 (17%)

Query: 167 LIIEQADRLRSLVDRL-------LGPQRPTQHSLHNIHQVVQKVYKLVEMALPTNIQLKR 219
LI+E + R ++ L L Q SL + VV +L + +Q +
Sbjct: 185 LILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFEN 244

Query: 220 DYDPSIPDIEMDPDQMQQAVLNILQNAVQALEHTGGEILIRTRTQHQVTIGSQRHKLVLT 279
+P+I D+++ P +Q V N +++ + L GG+IL++ + +T
Sbjct: 245 QINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQ-GGKILLKGTKDNGT----------VT 293

Query: 280 LSIIDNGPGIPPELMDTLFYPMVTSREQGSGLGLSIAHNIARLHSG---RIDCVSSPGHT 336
L + + G + + ++ +G GL ++ G +I G
Sbjct: 294 LEVENTGSL------------ALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKV 341

Query: 337 EFIISLP 343
++ +P
Sbjct: 342 NAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0261  HTHTETR       59     5e-13    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 58.9 bits (142), Expect = 5e-13
Identities = 25/68 (36%), Positives = 32/68 (47%)

Query: 1 MKTETQSTRQHILDVGYSLIVKQGFSCLGLAQLLKAAQVPKGSFYHYFKSKEQFGEALLT 60
K E Q TRQHILDV L +QG S L ++ KAA V +G+ Y +FK K +
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 61 GYFEQYQA 68

Sbjct: 65 LSESNIGE 72


66  Shewmr4_0507  Shewmr4_0510  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_0507  2       19       3.699476   Ppx/GppA phosphatase
Shewmr4_0508  2       18       3.539718   hypothetical protein
Shewmr4_0509  1       18       2.613767   hypothetical protein
Shewmr4_0510  2       18       2.040383   mutator MutT protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0507  IGASERPTASE   33     0.001    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 32.7 bits (74), Expect = 0.001
Identities = 20/80 (25%), Positives = 30/80 (37%), Gaps = 2/80 (2%)

Query: 145 PMAYDDTPVAVSPPVRVTTSMQYSPSEGRMVSNMPTNSATVISQTGASTARASTASAEQI 204
P + +T P + T+S P N NS + T ++E
Sbjct: 1158 PQSQTNTTADTEQPAKETSSNVEQPVTESTTVN-TGNSVVENPENTTPATTQPTVNSES- 1215

Query: 205 ANVPRARAARSVSSLPSNAR 224
+N P+ R RSV S+P N
Sbjct: 1216 SNKPKNRHRRSVRSVPHNVE 1235


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0508  TCRTETA       52     2e-09    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 52.1 bits (125), Expect = 2e-09
Identities = 61/322 (18%), Positives = 107/322 (33%), Gaps = 22/322 (6%)

Query: 50 VAHVSYAISAYALGVVVGSPIIMVLGVRIKRRTLLIALAAMMAVANGLSALAPSLNWLVF 109
AH ++ YAL +P++ L R RR +L+ A AV + A AP L L
Sbjct: 42 TAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYI 101

Query: 110 FRFLSGLPHGAYFGVAMLLAASLVPPDMKARAVSRVIIGLTLATIVGVPFATWMGQTVGW 169
R ++G+ GA VA A + D +AR + + G MG
Sbjct: 102 GRIVAGIT-GATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGG-FSP 159

Query: 170 RSGIGIVAIIAAVTAVMLYFLAPNVAVPQNASPKKELQTLKNREVWLTLGIAAIGFGGIF 229
+ A + + + FL P + + N +
Sbjct: 160 HAPFFAAAALNGLNFLTGCFLLPE---SHKGERRPLRREALNPLASFRWARGMTVVAALM 216

Query: 230 CVYTYLAETLIQVTQV------------EPFKIPVMIAVFGI-GATLGTLVCGWAADK-S 275
V+ ++ + + QV + I + +A FGI + ++ G A +
Sbjct: 217 AVF-FIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLG 275

Query: 276 ALAAAFWSLVLSTLVLALYPSLTGSYWALMPI-VFFVGSGIGLATIVQARLMDVAPDGQA 334
A ++ L + W PI V GIG+ + V + Q
Sbjct: 276 ERRALMLGMIADGTGYILL-AFATRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQG 334

Query: 335 MTGALVQCAFNLANAIGPWVGS 356
+ +L + +GP + +
Sbjct: 335 QLQGSLAALTSLTSIVGPLLFT 356


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0509  DHBDHDRGNASE  52     2e-10    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 52.4 bits (125), Expect = 2e-10
Identities = 41/183 (22%), Positives = 80/183 (43%), Gaps = 5/183 (2%)

Query: 2 ILITGASSGLGAALAALYAKDNQALTLTGRDANRLHTVANALSPFSTQSISAISADLADE 61
ITGA+ G+G A+A A + + +L V ++L + + A AD+ D
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHA-EAFPADVRDS 69

Query: 62 ASVEALFDGL---TDTPNTVIHCAGSGYFGALETQGTSEIQALLNNNVTSTILLVRELVK 118
A+++ + + + +++ AG G + + E +A + N T R + K
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSK 129

Query: 119 RYK-QQAVKVVVVMSTAALAAKAGESTYCAAKWAVRGFIESVRLELKQSQMKLIAVYPGG 177
+++ +V V S A + + Y ++K A F + + LEL + ++ V PG
Sbjct: 130 YMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGS 189

Query: 178 MDT 180
+T
Sbjct: 190 TET 192


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0510  RTXTOXIND     29     0.007    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.0 bits (65), Expect = 0.007
Identities = 9/23 (39%), Positives = 11/23 (47%)

Query: 126 GIVGAIWVKDGDDVAFDQPLFTL 148
IV I VK+G+ V L L
Sbjct: 105 SIVKEIIVKEGESVRKGDVLLKL 127


67  Shewmr4_0516  Shewmr4_0527  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_0516  0       21       5.137384   type II secretion system protein
Shewmr4_0517  -1      20       4.583830   type IV-A pilus assembly ATPase PilB
Shewmr4_0518  -2      20       4.264405   O-antigen polymerase
Shewmr4_0519  0       18       2.213040   methylation site containing protein
Shewmr4_0520  -1      17       1.896527   hypothetical protein
Shewmr4_0521  -1      18       1.883681   O-antigen polymerase
Shewmr4_0522  -3      17       1.155844   nicotinate-nucleotide pyrophosphorylase
Shewmr4_0523  -2      16       0.119069   N-acetyl-anhydromuranmyl-L-alanine amidase
Shewmr4_0524  1       13       -0.949626  regulatory protein AmpE
Shewmr4_0525  3       15       -1.859214  transcriptional regulator PdhR
Shewmr4_0526  1       13       -1.885280  pyruvate dehydrogenase subunit E1
Shewmr4_0527  1       13       -1.867099  pyruvate dehydrogenase complex dihydrolipoamide
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0516  RTXTOXIND     31     0.015    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 30.6 bits (69), Expect = 0.015
Identities = 21/167 (12%), Positives = 52/167 (31%), Gaps = 17/167 (10%)

Query: 80 EVQAQIARQQQAELAIAAADRAVYNPEL-GLNYQNADTDTYSLGLSQTIDWGDKRGVATR 138
+ + QA L + EL L + Y +S+ + +
Sbjct: 135 DTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQ 194

Query: 139 LAQLEAQILLADISLERSQMLAERLLALAEQAQSNKALTFAEQQLRFTQAQLNIAEQRFA 198
+ + Q +++L++ + AE+ + E R +++L+
Sbjct: 195 FSTWQNQKYQKELNLDKKR---------AERLTVLARINRYENLSRVEKSRLDDFSSLLH 245

Query: 199 AGDLS-----DVELQLLKL--ELASNTADYAMAEQAALVADGKVIEL 238
++ + E + ++ EL + E L A + +
Sbjct: 246 KQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLV 292


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0517  RTXTOXIND     55     2e-10    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 55.2 bits (133), Expect = 2e-10
Identities = 31/138 (22%), Positives = 56/138 (40%), Gaps = 9/138 (6%)

Query: 157 EVAKAQAEYINAAAEWNRVRR---MSESAVSVSRRMQAQVDAELKRAILEAIKMTAEQIR 213
V + + +Y+ A E + ES + ++ V K IL+ ++ T + I
Sbjct: 253 AVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIG 312

Query: 214 TLE----STPEAIGSYQLLAPIDGRVQQ-DIAMLGQVFTAGTPLMQLT-DESHLWVEAQL 267
L E + + AP+ +VQQ + G V T LM + ++ L V A +
Sbjct: 313 LLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALV 372

Query: 268 TPAQAANVNAGGPALVQV 285
+N G A+++V
Sbjct: 373 QNKDIGFINVGQNAIIKV 390



Score = 42.9 bits (101), Expect = 2e-06
Identities = 28/149 (18%), Positives = 55/149 (36%), Gaps = 5/149 (3%)

Query: 101 SLSNLNLDTMATATLVVDRDRTATLAPQLDVRVQARHVVPGQEVKKGEPLLTLGG----A 156
L + + A L R+ + P + V+ V G+ V+KG+ LL L A
Sbjct: 76 VLGQVEIVATANGKLTHS-GRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEA 134

Query: 157 EVAKAQAEYINAAAEWNRVRRMSESAVSVSRRMQAQVDAELKRAILEAIKMTAEQIRTLE 216
+ K Q+ + A E R + +S S D + + E + + +
Sbjct: 135 DTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQ 194

Query: 217 STPEAIGSYQLLAPIDGRVQQDIAMLGQV 245
+ YQ +D + + + +L ++
Sbjct: 195 FSTWQNQKYQKELNLDKKRAERLTVLARI 223


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0518  ACRIFLAVINRP  653    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 653 bits (1686), Expect = 0.0
Identities = 224/1077 (20%), Positives = 440/1077 (40%), Gaps = 72/1077 (6%)

Query: 9 AIKNRLLVVLALLAVVAASVAMLPKLNLDAFPDVTNVQVTINTAAEGLAAEEVEKLISYP 68
I+ + + + ++ A + +L + +P + V+++ G A+ V+ ++
Sbjct: 5 FIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQV 64

Query: 69 VESAMYALPAVTEVRSLS-RTGLSIVTVVFAEGTDIYFARQQVFEQLQAAREMIPSGVGV 127
+E M + + + S S G +T+ F GTD A+ QV +LQ A ++P V
Sbjct: 65 IEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQ 124

Query: 128 PEIGPNTSGLGQIYQYILRADPSSGINAAELRSLNDYLVKLIMMPVGGVTDVLSFGGEVR 187
I S + +D + G ++ VK + + GV DV FG +
Sbjct: 125 QGISVEKSSSSYLMVAGFVSD-NPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQ-Y 182

Query: 188 QYQVQVDPNKLRAYGLSMAQVSEALESNNRNAGGWFMDQGQEQLVVRGYGMLPAGEAGLA 247
++ +D + L Y L+ V L+ N + G L + +
Sbjct: 183 AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLG-GTPALPGQQLNASIIAQTRFK 241

Query: 248 AIAQIPLTEVR----GTPVRVGDIAKVDFGSEIRVGAVTMTRRDEAGQAQALGEVVAGVV 303
+ +R G+ VR+ D+A+V+ G E + G+ +
Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARIN-----GK-----PAAGLGI 291

Query: 304 LKRMGANTKATIDDIDARINLIEQALPKGVSFEVFYDQADLVDKAVTTVRDALLMAFVFI 363
GAN T I A++ ++ P+G+ YD V ++ V L A + +
Sbjct: 292 KLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLV 351

Query: 364 VVILALFLVNIRATLLVLLSIPVSIGLALMVMSYYGLSANLMSLGGLAVAIGMLVDGSVV 423
+++ LFL N+RATL+ +++PV + +++ +G S N +++ G+ +AIG+LVD ++V
Sbjct: 352 FLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIV 411

Query: 424 MVENIFKHLTQPDRRHLAQARSRADGEVDPYHGDEDGSVRAVEADNSMAVRIMLAAKEVC 483
+VEN+ + + + + EA + ++
Sbjct: 412 VVENVERVM-------------------------MEDKLPPKEA-------TEKSMSQIQ 439

Query: 484 SPIFFATAIIIVVFAPLFALEGVEGKLFQPMAVSIILAMISALLVALIAVPALAVYLFK- 542
+ ++ VF P+ G G +++ +++I+ AM ++LVALI PAL L K
Sbjct: 440 GALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKP 499

Query: 543 ---------RGVVLKESVVLVPLDSAYRKLLSATLARPKLVMISALLMFAMSMVLLPRLG 593
G + + Y + L ++ L+ A +VL RL
Sbjct: 500 VSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLP 559

Query: 594 TEFVPELEEGTINLRVTLAPTASLGTSLDVAPKLEAMLLEFPEVEYALSRIGAPELGGDP 653
+ F+PE ++G + L A+ + V ++ L+ + S
Sbjct: 560 SSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNEKANVE-SVFTVNGFSFSG 618

Query: 654 EPVSNIEVYIGLKPIEEWQSASSRLE--LQRLMEEKLSVFPGLLLTFSQPIATRVDELLS 711
+ + ++ LKP EE + E + R E + G ++ F+ P + EL +
Sbjct: 619 QAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGFVIPFNMP---AIVELGT 675

Query: 712 GVKAQLA-IKLFGPDLDVLSEKGQVLTDLVAKIPGAV-DVSLEQVSGEAQLVVRPDRSQL 769
I G D L++ L + A+ P ++ V + AQ + D+ +
Sbjct: 676 ATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKA 735

Query: 770 ARYGISVDQVMSLVSQGIGGASAGQVIDGNARYDINLRLAAEYRSSPDVIKDLLLSGSNG 829
G+S+ + +S +GG ID + ++ A++R P+ + L + +NG
Sbjct: 736 QALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANG 795

Query: 830 ATVRLGEVASVEVEMAPPNIRRDDVQRRVVVQANVA-GRDMGSVVKDIYELVPQADLPAG 888
V + P + R + + +Q A G G + + L + LPAG
Sbjct: 796 EMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMALMENLASK--LPAG 853

Query: 889 YTVIVGGQYENQQRAQQKLMLVVPISIALIALLLYFSFGAVKQVLLIMANVPLALIGGIV 948
G ++ + + +V IS ++ L L + + + +M VPL ++G ++
Sbjct: 854 IGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLL 913

Query: 949 ALFVSGTYLSVPSSIGFITLFGVAVLNGVVLVDSINQ-RRQSGESLYDSVYEGTVGRLRP 1007
A + V +G +T G++ N +++V+ + G+ + ++ RLRP
Sbjct: 914 AATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRP 973

Query: 1008 VLMTALTSALGLIPILVSSGVGSEIQKPLAVVIIGGLFSSTALTLLVLPTLYRWLYR 1064
+LMT+L LG++P+ +S+G GS Q + + ++GG+ S+T L + +P + + R
Sbjct: 974 ILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRR 1030



Score = 109 bits (273), Expect = 4e-26
Identities = 85/550 (15%), Positives = 186/550 (33%), Gaps = 68/550 (12%)

Query: 10 IKNRLLVVLALLAVVAASVAMLPKLNLDAFPDVTNVQVTIN-TAAEGLAAEEVEKLI--- 65
+ + +L +VA V + +L P+ G E +K++
Sbjct: 534 LGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQV 593

Query: 66 -SYPVESAMYALPAVTEVRSLSRTGLS----IVTVVFAEGTDIYFARQQVFEQLQAAREM 120
Y +++ + +V V S +G + + V + + A+
Sbjct: 594 TDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKME 653

Query: 121 ---IPSGVGVPEIGPNTSGLGQIYQYILRADPSSGINAAELRSLNDYLVKLIMMPVGGVT 177
I G +P P LG + +G+ L + L+ + +
Sbjct: 654 LGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLV 713

Query: 178 DVLSFGGEVR-QYQVQVDPNKLRAYGLSMAQVSEALES--NNRNAGGWFMDQGQEQLVVR 234
V G E Q++++VD K +A G+S++ +++ + + + ++L V+
Sbjct: 714 SVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQ 773

Query: 235 GYGMLPAGEA-GLAAIAQIPLTEVRGTPVRVGDIAKVDFGSEIRVGAVTMTRRDEAGQAQ 293
A + ++ + G V + G+ + R + +
Sbjct: 774 ----ADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVY----GSPRLERYNGLPSME 825

Query: 294 ALGEVVAGVVLKRMGANTKATIDDIDARINLIEQALPKGVSFEVFYDQADLVDKAVTTVR 353
GE G D A + + LP G+ ++ + + +
Sbjct: 826 IQGEAAPGTSS-----------GDAMALMENLASKLPAGIGYD-WTGMSYQERLSGNQAP 873

Query: 354 DALLMAFVFIVVILALFLVNIRATLLVLLSIPVSIGLALMVMSYYGLSANLMSLGGLAVA 413
+ ++FV + + LA + + V+L +P+ I L+ + + ++ + GL
Sbjct: 874 ALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTT 933

Query: 414 IGMLVDGSVVMVENIFKHLTQPDRRHLAQARSRADGEVDPYHGDEDGSVRAVEADNSMAV 473
IG+ ++++VE A+ +G+ VEA
Sbjct: 934 IGLSAKNAILIVE-------------FAKDLMEKEGK------------GVVEA------ 962

Query: 474 RIMLAAKEVCSPIFFATAIIIVVFAPLFALEGVEGKLFQPMAVSIILAMISALLVALIAV 533
++A + PI + I+ PL G + + ++ M+SA L+A+ V
Sbjct: 963 -TLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFV 1021

Query: 534 PALAVYLFKR 543
P V + +
Sbjct: 1022 PVFFVVIRRC 1031



Score = 98.8 bits (246), Expect = 5e-23
Identities = 66/347 (19%), Positives = 140/347 (40%), Gaps = 16/347 (4%)

Query: 735 VLTDLVAKIPGAVDVSLEQVSGEAQLVVRPDRSQLARYGISVDQVMSLVSQGIGGASAGQ 794
+ D ++++ G DV L + + + D L +Y ++ V++ + +AGQ
Sbjct: 161 NVKDTLSRLNGVGDVQL--FGAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQ 218

Query: 795 VIDGNARYDINLRLA----AEYRSSPDVIKDLLLSGSNGATVRLGEVASVEVEMAPPNIR 850
+ A L + +++ + K L S+G+ VRL +VA VE+ N+
Sbjct: 219 LGGTPALPGQQLNASIIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVI 278

Query: 851 -RDDVQRRVVVQANVA-GRDMGSVVKDIYELVP--QADLPAGYTVIVGGQYENQQRAQQK 906
R + + + +A G + K I + Q P G V+ Y+ Q
Sbjct: 279 ARINGKPAAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLY--PYDTTPFVQLS 336

Query: 907 LMLVVP---ISIALIALLLYFSFGAVKQVLLIMANVPLALIGGIVALFVSGTYLSVPSSI 963
+ VV +I L+ L++Y ++ L+ VP+ L+G L G ++ +
Sbjct: 337 IHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMF 396

Query: 964 GFITLFGVAVLNGVVLVDSINQRRQS-GESLYDSVYEGTVGRLRPVLMTALTSALGLIPI 1022
G + G+ V + +V+V+++ + ++ + ++ A+ + IP+
Sbjct: 397 GMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPM 456

Query: 1023 LVSSGVGSEIQKPLAVVIIGGLFSSTALTLLVLPTLYRWLYRHDKSP 1069
G I + ++ I+ + S + L++ P L L + +
Sbjct: 457 AFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAE 503


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0521  MECHCHANNEL   174    1e-59    Bacterial mechano-sensitive ion channel signature.
		>MECHCHANNEL#Bacterial mechano-sensitive ion channel signature.

Length = 136

Score = 174 bits (443), Expect = 1e-59
Identities = 89/136 (65%), Positives = 111/136 (81%), Gaps = 1/136 (0%)

Query: 1 MSLIKEFKAFASRGNVIDMAVGIIIGAAFGKIVSSFVADIIMPPIGIILGGVNFSDLSIV 60
MS+IKEF+ FA RGNV+D+AVG+IIGAAFGKIVSS VADIIMPP+G+++GG++F ++
Sbjct: 1 MSIIKEFREFAMRGNVVDLAVGVIIGAAFGKIVSSLVADIIMPPLGLLIGGIDFKQFAVT 60

Query: 61 LQAAQGDAPAVVIAYGKFIQTVIDFTIIAFAIFMGLKAINSLKRKQEEAPKAPPAPTKDQ 120
L+ AQGD PAVV+ YG FIQ V DF I+AFAIFM +K IN L RK+EE P A PAPTK++
Sbjct: 61 LRDAQGDIPAVVMHYGVFIQNVFDFLIVAFAIFMAIKLINKLNRKKEE-PAAAPAPTKEE 119

Query: 121 ELLSEIRDLLKAQQEK 136
LL+EIRDLLK Q +
Sbjct: 120 VLLTEIRDLLKEQNNR 135


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0523  RTXTOXIND     95     2e-23    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 94.5 bits (235), Expect = 2e-23
Identities = 38/290 (13%), Positives = 91/290 (31%), Gaps = 28/290 (9%)

Query: 71 LAQLEDNQFSAKVSQAEASLGSAKADLQTLAAKVELQHALISQASAGVVAAQADKLRAEQ 130
+ + + S + + + ++ + A A + + +
Sbjct: 176 FQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKS 235

Query: 131 QLTRAKKLKVSNYSSQDDVDQLQAGFDSAAAGLDEAKA--------LLVAKERELAVFN- 181
+L L ++ V + + + A L K+ +L AKE V
Sbjct: 236 RLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQL 295

Query: 182 ------AQLNQAGSVVEQSNAALELAKIQLNDTRVTAPFSGVIGKRGAM-VGQYVQPGQA 234
+L Q + L + + + + AP S + + G V +
Sbjct: 296 FKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET 355

Query: 235 LYSLVPDGAV-WITANFKETQIQHMQPGQSVQVSLDAFPDKTF---TGVIDSLSPASGAK 290
L +VP+ +TA + I + GQ+ + ++AFP + G + +++ +
Sbjct: 356 LMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDA--- 412

Query: 291 FSLLPAENATGNFTKIVQRIPVRIRLDLSEEEARVVPGLSAVVKVDTASH 340
+ G ++ I + + G++ ++ T
Sbjct: 413 ----IEDQRLGLVFNVIISIEENCLST-GNKNIPLSSGMAVTAEIKTGMR 457



Score = 57.5 bits (139), Expect = 2e-11
Identities = 24/128 (18%), Positives = 48/128 (37%), Gaps = 2/128 (1%)

Query: 59 VTDNQHVRKGELLAQLEDNQFSAKVSQAEASLGSAKADLQTLAAKVELQHALISQASAGV 118
V + + VRKG++L +L A + ++SL A+ + L +
Sbjct: 112 VKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSR-SIELNKLPELKL 170

Query: 119 VAAQADKLRAEQQLTRAKKLKVSNYSS-QDDVDQLQAGFDSAAAGLDEAKALLVAKEREL 177
+ +E+++ R L +S+ Q+ Q + D A A + E
Sbjct: 171 PDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLS 230

Query: 178 AVFNAQLN 185
V ++L+
Sbjct: 231 RVEKSRLD 238


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0524  TCRTETB       123    1e-32    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 123 bits (310), Expect = 1e-32
Identities = 89/421 (21%), Positives = 177/421 (42%), Gaps = 19/421 (4%)

Query: 18 SEYERGSRRSWIAVFGGLIGAFMAILDIQITNASMKEIQGSLGATLEEGSWISTAYLVAE 77
+ Y + + R + I +F ++L+ + N S+ +I +W++TA+++
Sbjct: 3 TSYSQSNLRHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTF 62

Query: 78 MIAIPLSGWLSTGLSVRRYLLWTTAAFIFASILCSISWN-LEAMIAFRALQGFFGGALIP 136
I + G LS L ++R LL+ F S++ + + +I R +QG A
Sbjct: 63 SIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPA 122

Query: 137 LAFRLILEFLPENKRAVGMALFGVTATFAPSIGPTLGGWLTEHFSWHYLFYINVPPGLLV 196
L ++ ++P+ R L G +GP +GG + + W YL +P ++
Sbjct: 123 LVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPMITII 180

Query: 197 MAMLAYGLEKRPVVWDKLKNADLAGIVTMALGMGCLEVVLEEGNRKDWFGSDLIRNLAII 256
L K+ V + D+ GI+ M++G+ + F + + I+
Sbjct: 181 TVPFLMKLLKKEVRIKG--HFDIKGIILMSVGIVFFML----------FTTSYSISFLIV 228

Query: 257 AAVNLVLFVWIQLKRKDPLVNLRLLGKRDFVLSTIAYFLLGMALFGAIYLIPLYLSQVHD 316
+ ++ ++FV K DP V+ L F++ + ++ + G + ++P + VH
Sbjct: 229 SVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQ 288

Query: 317 YTPLEIGEVIMWMGFPQLLVL-PLVPRLMQRFDGRYLAAFGFFMFALSYYMNSQMTADYA 375
+ EIG VI++ G +++ + L+ R Y+ G ++S+ S +
Sbjct: 289 LSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFL--LET 346

Query: 376 GPQMIASQVVRALG-QPFILVPIGMLATAHLKPHENPSASTVLNVMRNLGGAFGIALVAT 434
+ +V LG F I + ++ LK E + ++LN L GIA+V
Sbjct: 347 TSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGG 406

Query: 435 L 435
L
Sbjct: 407 L 407


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0525  SACTRNSFRASE  43     8e-08    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 43.0 bits (101), Expect = 8e-08
Identities = 21/72 (29%), Positives = 30/72 (41%), Gaps = 5/72 (6%)

Query: 81 ASIGRVVVSPAGRGKGLAMPLMQRAIDAALSTWPAAGIQIGAQDYLKS---FYQKLGFSA 137
A I + V+ R KG+ L+ +AI+ A G+ + QD S FY K F
Sbjct: 90 ALIEDIAVAKDYRKKGVGTALLHKAIEWAKEN-HFCGLMLETQDINISACHFYAKHHFII 148

Query: 138 CS-DMYLEDGIP 148
+ D L P
Sbjct: 149 GAVDTMLYSNFP 160


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0527  HTHFIS        355    e-121    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 355 bits (913), Expect = e-121
Identities = 137/479 (28%), Positives = 228/479 (47%), Gaps = 48/479 (10%)

Query: 18 LLVLDPEQSLPE-CSEELKQAAWNCLKAVSAAEALVLLQKYDLRVAIAFIN--DTNQVLL 74
+LV D + ++ ++ L +A ++ +AA + D + + + D N L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 75 ANEIAIIQTEYPSLHWIAVTD-STLEQHCSWLSAANFIDYYHRPFDWGRFADTLGHAWGM 133
+ I+ P L + ++ +T + DY +PFD +G A
Sbjct: 66 ---LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAY-DYLPKPFDLTELIGIIGRALAE 121

Query: 134 AQLTVAKKGKSAPTEALTTIKGDHPLLQQLRQRLHKFSLSDDTVLLSGETGSGKGLCAKT 193
+ +K + + G +Q++ + L + +D T++++GE+G+GK L A+
Sbjct: 122 PKRRPSKLEDDSQDGM--PLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARA 179

Query: 194 LHSLSKRRDGPFITVNCGALPIGLIHSALFGHEKGAFTDADKRYIGHLEQANGGTLFLDE 253
LH KRR+GPF+ +N A+P LI S LFGHEKGAFT A R G EQA GGTLFLDE
Sbjct: 180 LHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDE 239

Query: 254 IADLPLDLQVNLLHVLDDKQIMRIGGNVPIKVDCRLLFASHQDLEVAIDEGRFREDLYHR 313
I D+P+D Q LL VL + +GG PI+ D R++ A+++DL+ +I++G FREDLY+R
Sbjct: 240 IGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYR 299

Query: 314 INVLRLHVPSLRQYSDEVMLLAEDFLQE-NTDSNVQFHFSDDARCAMKHYNWPGNVRELR 372
+NV+ L +P LR ++++ L F+Q+ + F +A MK + WPGNVREL
Sbjct: 300 LNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVRELE 359

Query: 373 NRIRRAMVLSDDSKITAQLLGLDQLPSRAGQDLARCRV---------------------- 410
N +RR L IT +++ + + +
Sbjct: 360 NLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASF 419

Query: 411 ---------------EHEAEVLLKAISDHKHNISAAARSLNISRATFYRLLKKCQIKMP 454
E E ++L A++ + N AA L ++R T + +++ + +
Sbjct: 420 GDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVSVY 478


68  Shewmr4_0543  Shewmr4_0549  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_0543  0       14       -2.686863  short chain dehydrogenase
Shewmr4_0544  1       14       -2.157551  hypothetical protein
Shewmr4_0545  1       14       -1.929772  hypothetical protein
Shewmr4_0546  1       16       -0.732857  hypothetical protein
Shewmr4_0547  1       18       -1.403794  phosphoribosylamine--glycine ligase
Shewmr4_0548  0       20       -2.692488  bifunctional
Shewmr4_0549  -1      18       -2.960252  zinc-responsive transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0543  PF06580       41     2e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 40.6 bits (95), Expect = 2e-05
Identities = 21/107 (19%), Positives = 37/107 (34%), Gaps = 22/107 (20%)

Query: 608 LVIRNLISNAIKH---HDKGEGVIKVICETSNHHYWFSVIDDGPGISSRFHGKVFQMFQT 664
++++ L+ N IKH G I + N V + G
Sbjct: 258 MLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGS---------------- 301

Query: 665 LRPRDEVEGSGLGLSLVKKTVESLGGK---IQLESEGRGCCFRFSWP 708
L ++ E +G GL V++ ++ L G I+L + P
Sbjct: 302 LALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0544  HTHFIS        44     3e-08    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 44.4 bits (105), Expect = 3e-08
Identities = 21/108 (19%), Positives = 41/108 (37%), Gaps = 8/108 (7%)

Query: 11 TILLVDDDDVDYMAVQRAMKQLRLLNPLVRARDGLEALAILTHSDAIKGSYLILLDLNMP 70
TIL+ DDD + +A+ + + + + L++ D+ MP
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGY--DVRITSNAATLWRWIAA----GDGDLVVTDVVMP 58

Query: 71 RMNGFEFLEHIRANPALSSSVVFMLTTSSTDEDRMRAYSHHVAGYMVK 118
N F+ L I+ A V +++ +T ++A Y+ K
Sbjct: 59 DENAFDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPK 104


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0545  HTHFIS        61     4e-12    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.4 bits (149), Expect = 4e-12
Identities = 27/102 (26%), Positives = 48/102 (47%), Gaps = 3/102 (2%)

Query: 3 LLLIDDDDVDRTAIIRALRQSKLAFNVIEANCAFDGLNLALERHFDGILLDYLLPDANGL 62
+L+ DDD RT + +AL S+ ++V + A D ++ D ++PD N
Sbjct: 6 ILVADDDAAIRTVLNQAL--SRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 63 EVLIKLNSMTQEQTVVLMLSRYEDEKLAQRCIELGAQDFLLK 104
++L ++ + VL++S A + E GA D+L K
Sbjct: 64 DLLPRIKKA-RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPK 104


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0547  OMPADOMAIN    33     7e-04    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 33.0 bits (75), Expect = 7e-04
Identities = 35/200 (17%), Positives = 66/200 (33%), Gaps = 54/200 (27%)

Query: 47 VAIQGGIDYSHDSGFYAGTWASNVDFGDDTSYELDLYAGYGGNITEDLSYDIGYLYYAYP 106
+ G HD+GF +N + + GY + + +++GY +
Sbjct: 30 TGAKLGWSQYHDTGFI-----NNNGPTHENQLGAGAFGGY--QVNPYVGFEMGYDWLGRM 82

Query: 107 DAEGSID-------------------------FGELHGAITWKWFELSYSHVINAGDDVA 141
+GS++ + L G + W + S+V D
Sbjct: 83 PYKGSVENGAYKAQGVQLTAKLGYPITDDLDIYTRLGGMV---WRADTKSNVYGKNHDTG 139

Query: 142 AEPLDNKDMSYLAATASFPLTDKLSLSLHYGYSSGDVVESWFDEDNYADYNVTLSADTSM 201
P+ A + +T +++ L Y +++ N D + T+
Sbjct: 140 VSPV-------FAGGVEYAITPEIATRLEYQWTN-----------NIGDAH-TIGTRPDN 180

Query: 202 GTVSFMVSDTDLQGDDAKVV 221
G +S VS QG+ A VV
Sbjct: 181 GMLSLGVSYRFGQGEAAPVV 200


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0548  DNABINDINGHU  109    2e-35    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 109 bits (275), Expect = 2e-35
Identities = 44/88 (50%), Positives = 66/88 (75%)

Query: 2 NKTELIAKIAENADITKAEAARALKSFEAAITESMKNGDKISIVGFGSFETSTRAARTGR 61
NK +LIAK+AE ++TK ++A A+ + +A++ + G+K+ ++GFG+FE RAAR GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 62 NPQTGKEIQIAEATVPKFKAGKTLRDSV 89
NPQTG+EI+I + VP FKAGK L+D+V
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0549  HTHFIS        61     6e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.4 bits (149), Expect = 6e-13
Identities = 24/107 (22%), Positives = 43/107 (40%), Gaps = 3/107 (2%)

Query: 146 RVLVVDDSRMARNVIKRTIGNLGIKQITEAEDGAQAIELMRNNMFDLIITDYNMPSVDGL 205
+LV DD R V+ + + G + A + DL++TD MP +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 206 ALTQFIRNESQQSHVPILMVSSEANDAHLSNVSQAGVNALCDKPFEP 252
L I+ + +P+L++S++ S+ G KPF+
Sbjct: 64 DLLPRIKKA--RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDL 108



Score = 47.9 bits (114), Expect = 2e-08
Identities = 29/155 (18%), Positives = 58/155 (37%), Gaps = 6/155 (3%)

Query: 10 SILIVEPSETQRRIIIKRLQQEGIISIQNAASLTQARELIARHKPDLIASAMYFEDGTAT 69
+IL+ + R ++ + L + G ++ ++ IA DL+ + + D A
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGY-DVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 70 EFLSYLRTKSEYKDIQFMLVSSECRREQLEIFRQSGVVAILPKPFNAEHLGKALNATIDL 129
+ L ++ D+ +++S++ + G LPKPF+ L + +
Sbjct: 64 DLLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 130 LSHDELDLSHFDVHDVRVLVVDDSRM--ARNVIKR 162
L D D LV + M V+ R
Sbjct: 122 PKRRPSKLED-DSQDGMPLVGRSAAMQEIYRVLAR 155


69  Shewmr4_0651  Shewmr4_0658  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_0651  -2      13       1.366345   large-conductance mechanosensitive channel
Shewmr4_0652  -1      13       1.607417   LysR family transcriptional regulator
Shewmr4_0653  -1      12       0.874871   secretion protein HlyD family protein
Shewmr4_0654  0       10       -0.168092  hypothetical protein
Shewmr4_0655  1       13       -0.919783  EmrB/QacA family drug resistance transporter
Shewmr4_0656  1       12       -1.158542  GCN5-related N-acetyltransferase
Shewmr4_0657  1       14       -1.104820  antibiotic biosynthesis monooxygenase
Shewmr4_0658  1       13       -0.960485  sigma-54 dependent trancsriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0651  HTHFIS        83     2e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 82.6 bits (204), Expect = 2e-20
Identities = 31/131 (23%), Positives = 61/131 (46%), Gaps = 1/131 (0%)

Query: 1 MQNPHILIVEDEAVTRNTLRSIFEAEGYVVTEANDGAEMHKAMQENKINLVVMDINLPGK 60
M IL+ +D+A R L GY V ++ A + + + +LVV D+ +P +
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 NGLLLARELREIN-NIGLIFLTGRDNEVDKILGLEIGADDYITKPFNPRELTIRARNLLT 119
N L +++ ++ ++ ++ ++ + I E GA DY+ KPF+ EL L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 120 RVNSAGNEVEE 130
+++E+
Sbjct: 121 EPKRRPSKLED 131


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0653  CARBMTKINASE  31     0.009    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 30.9 bits (70), Expect = 0.009
Identities = 18/81 (22%), Positives = 27/81 (33%), Gaps = 5/81 (6%)

Query: 202 DYSAALLAEALKASAVEIWTDVAGIYTTDPRLAPNAHPIAEISFNEAAEMATFGAKVLHP 261
D + LAE + A I TDV G + E+ E + G
Sbjct: 216 DLAGEKLAEEVNADIFMILTDVNGAALY--YGTEKEQWLREVKVEELRKYYEEGH--FKA 271

Query: 262 ATILPAVRQQIQVFVGSSKEP 282
++ P V I+ F+ E
Sbjct: 272 GSMGPKVLAAIR-FIEWGGER 291


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0657  HTHFIS        64     3e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 63.7 bits (155), Expect = 3e-14
Identities = 27/159 (16%), Positives = 62/159 (38%), Gaps = 9/159 (5%)

Query: 6 SVLVVDDHPLLRKGICQLIASDPDFSLFGEVGGGLDALSAVATDEPDIVLLDLNMKGMTG 65
++LV DD +R + Q S + + +A + D+V+ D+ M
Sbjct: 5 TILVADDDAAIRTVLNQ-ALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 66 LDTLNAMRQEGVTSRIVILTVSDAKQDVIRLLRAGADGYLLKDTEPDLLLDKLKNAMLGH 125
D L +++ +++++ + I+ GA YL K + L+ + A+
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL--- 119

Query: 126 RVISDEVEEYLYELKNAADEQEWISSLTPRELQILQQLA 164
E + +L++ + + + + +I + LA
Sbjct: 120 ----AEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLA 154


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0658  PF06580       42     4e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 42.2 bits (99), Expect = 4e-06
Identities = 27/148 (18%), Positives = 56/148 (37%), Gaps = 19/148 (12%)

Query: 405 NQLTEINEGVSTAYVQLRELL----STFRLTIKEPNLKN-AMEAMLEQLRANTDI----- 454
N L I + + RE+L R +++ N + ++ L + + +
Sbjct: 177 NALNNIRALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQF 236

Query: 455 --KIHLDYKLSPQWLEAKQHIHILQITREATLNAIKHANASR----VIIRCYKDDNGMVN 508
++ + +++P ++ + ++Q E N IKH A I+ DNG V
Sbjct: 237 EDRLQFENQINPAIMDVQVPPMLVQTLVE---NGIKHGIAQLPQGGKILLKGTKDNGTVT 293

Query: 509 ISVSDNGIGIGYLKERDQHFGIGIMHER 536
+ V + G + G+ + ER
Sbjct: 294 LEVENTGSLALKNTKESTGTGLQNVRER 321


70  Shewmr4_0709  Shewmr4_0716  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_0709  1       18       2.832423   hypothetical protein
Shewmr4_0710  -2      13       1.894813   hypothetical protein
Shewmr4_0711  -2      13       1.935170   hypothetical protein
Shewmr4_0712  -2      13       2.149270   periplasmic solute binding protein
Shewmr4_0713  -2      10       1.627855   hypothetical protein
Shewmr4_0714  -3      9        1.416645   ABC-3 protein
Shewmr4_0715  -2      10       0.958156   1-acyl-sn-glycerol-3-phosphate acyltransferase
Shewmr4_0716  1       18       0.650718   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0709  BCTERIALGSPG  41     2e-07    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 41.0 bits (96), Expect = 2e-07
Identities = 16/31 (51%), Positives = 23/31 (74%)

Query: 2 ASRTNAGFTLVELMVAIAIIGILASIALPSY 32
A+ GFTL+E+MV I IIG+LAS+ +P+
Sbjct: 3 ATDKQRGFTLLEIMVVIVIIGVLASLVVPNL 33


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0710  BCTERIALGSPG  54     3e-12    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 53.7 bits (129), Expect = 3e-12
Identities = 19/64 (29%), Positives = 38/64 (59%)

Query: 5 RKGFTLIELMIAVAIIGILAAIAIPSFNEYLKQGRRFDAQQYLVSSAQALERHYSRNGLY 64
++GFTL+E+M+ + IIG+LA++ +P+ ++ + A +V+ AL+ + N Y
Sbjct: 7 QRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDNHHY 66

Query: 65 PASQ 68
P +
Sbjct: 67 PTTN 70


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0715  HTHFIS        78     9e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 78.3 bits (193), Expect = 9e-17
Identities = 34/139 (24%), Positives = 61/139 (43%), Gaps = 5/139 (3%)

Query: 1283 SILVADDNATARDIMRTTLESMGFRVDTVRSGEEAVTRCSQQEYAVALIDWKMPNLDGIE 1342
+ILVADD+A R ++ L G+ V + + + + + D MP+ + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 1343 TAKQIKQLAKNAPRILMVSAHATQEFLSQIEAL--GLAGYISKPISASRLLDGIMNSLGR 1400
+IK+ + P ++M SA T + I+A G Y+ KP + L+ I +L
Sbjct: 65 LLPRIKKARPDLPVLVM-SAQNTFM--TAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 1401 AGVLPVRRNSESIDPKLLL 1419
P + +S D L+
Sbjct: 122 PKRRPSKLEDDSQDGMPLV 140



Score = 70.6 bits (173), Expect = 3e-14
Identities = 29/130 (22%), Positives = 49/130 (37%), Gaps = 5/130 (3%)

Query: 1425 RILLVEDNEMNLEVATEFLEQVGIILSIATNGQIALDKLAQQSFDLVLMDCQMPVMDGYQ 1484
IL+ +D+ V + L + G + I +N +A DLV+ D MP + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 1485 ATQAIRKRPELAELPVIAMTANAMAGDKEMCLKAGMNDHIAKP---IEVNLLYQTLLKYL 1541
I+K +LPV+ M+A + G D++ KP E+ + L
Sbjct: 65 LLPRIKKA--RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 1542 GAGVLPTEAA 1551
E
Sbjct: 123 KRRPSKLEDD 132


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0716  HTHFIS        90     4e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 90.3 bits (224), Expect = 4e-22
Identities = 39/159 (24%), Positives = 65/159 (40%), Gaps = 6/159 (3%)

Query: 1 MDKATILVVDDTPENIDILVGILG-EDYKVKVAIDGPRALALVAKTLPDLILLDVMMPGM 59
M ATILV DD +L L Y V++ + +A DL++ DV+MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 60 NGYEVCKLLKQEPLTCHIPVIFVTALSEVADETQGFELGAVDYITKPVSAPVVKARVRTH 119
N +++ +K+ +PV+ ++A + + E GA DY+ KP + +
Sbjct: 61 NAFDLLPRIKKARP--DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 120 LALYDQKRLLEQQVKERTQEL--EETRF-EIIRRLGRAA 155
LA ++ + + L EI R L R
Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLM 157


71  Shewmr4_0723  Shewmr4_0730  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_0723  -1      12       2.271678   PhoH family protein
Shewmr4_0724  0       16       2.413421   multi-sensor hybrid histidine kinase
Shewmr4_0725  1       17       2.937789   hypothetical protein
Shewmr4_0726  0       17       2.424607   hypothetical protein
Shewmr4_0727  0       14       2.815322   serine/threonine protein kinase
Shewmr4_0728  0       13       2.372029   hypothetical protein
Shewmr4_0729  -2      13       2.258449   thiopurine S-methyltransferase
Shewmr4_0730  -2      15       1.328506   BFD/(2Fe-2S)-binding domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0723  SUBTILISIN    204    2e-61    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 204 bits (520), Expect = 2e-61
Identities = 103/298 (34%), Positives = 140/298 (46%), Gaps = 39/298 (13%)

Query: 128 WGMNNTGQNGGTVDADIDAPEAWEITTGSSDVVIGVIDTGVDYNHPDLQTNMWVNGGEIP 187
N G + I AP W T G V + V+DTG D +HPDL+ + I
Sbjct: 16 EQQVNEIPRGVEM---IQAPAVWNQTRGR-GVKVAVLDTGCDADHPDLKARI------IG 65

Query: 188 GNGIDDDGNGVIDDVHGYSAVNNNGNPMDGNGHGTHVSGTIGAKGNNGVGVVGVNWDVKI 247
G DD G + D NGHGTHV+GTI A N GVVGV + +
Sbjct: 66 GRNFTDDDEGDPEI------------FKDYNGHGTHVAGTIAA-TENENGVVGVAPEADL 112

Query: 248 AACQFLDADGYGSTAGAIACLDYFTDLKVNHGVDIKATNNSWGGGSFSQALKDAIEAGGE 307
+ L+ G G I + Y + VDI + S GG L +A++
Sbjct: 113 LIIKVLNKQGSGQYDWIIQGIYYA----IEQKVDI--ISMSLGGPEDVPELHEAVKKAVA 166

Query: 308 AGILFVAAAGNDAVDND--ASPHYPSSYDSDVVLSIASTDRNDRMSDFSQWGLTSVDMGA 365
+ IL + AAGN+ +D YP Y+ V+S+ + + + S+FS VD+ A
Sbjct: 167 SQILVMCAAGNEGDGDDRTDELGYPGCYNE--VISVGAINFDRHASEFSNSN-NEVDLVA 223

Query: 366 PGTAILSTIPGGGYATYSGTSMATPHVTGAAALVWSLNP-----DLSPVEMKSLLMAS 418
PG ILST+PGG YAT+SGTSMATPHV GA AL+ L DL+ E+ + L+
Sbjct: 224 PGEDILSTVPGGKYATFSGTSMATPHVAGALALIKQLANASFERDLTEPELYAQLIKR 281


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0725  LPSBIOSNTHSS  30     0.007    Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein

signature.
Length = 166

Score = 29.8 bits (67), Expect = 0.007
Identities = 7/26 (26%), Positives = 14/26 (53%)

Query: 34 HQGHITLVKEAAKKCDHVVVSIFVNP 59
GH+ +++ + D V V++ NP
Sbjct: 13 TFGHLDIIERGCRLFDQVYVAVLRNP 38


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0729  PF04605       32     0.001    Virulence-associated protein D (VapD)
		>PF04605#Virulence-associated protein D (VapD)

Length = 125

Score = 31.8 bits (72), Expect = 0.001
Identities = 9/52 (17%), Positives = 17/52 (32%), Gaps = 2/52 (3%)

Query: 65 AADDILRTLEAFGFEWDDEVLYQSDRT--EAYQAKLDELLAEDNAYFCQCSR 114
I + + GFE Y S E ++ L + + +C +
Sbjct: 27 PYSLIKKFMLENGFEHRQYSGYTSKEPINERRVIRIVNKLTKKFTWLGECVK 78


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0730  INVEPROTEIN   27     0.031    Salmonella/Shigella invasion protein E (InvE) signat...
		>INVEPROTEIN#Salmonella/Shigella invasion protein E (InvE)

signature.
Length = 372

Score = 27.0 bits (59), Expect = 0.031
Identities = 22/81 (27%), Positives = 33/81 (40%), Gaps = 13/81 (16%)

Query: 24 GEEYMNAKQLGHFKTILEAWRNQLREEVDRTLSHMQDEAANFPDPVDRAAQEEEFSLELR 83
E AKQ+ ++ L + + + S FPDP D E LR
Sbjct: 88 DEALPKAKQILKLISVH---GGALEDFLRQARSL-------FPDPSDLVLVLREL---LR 134

Query: 84 ARDRERKLIKKIEKTLQKIEE 104
+D E + KK+E L+ +EE
Sbjct: 135 RKDLEEIVRKKLESLLKHVEE 155


72  Shewmr4_0788  Shewmr4_0792  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_0788  0       12       0.565009   TonB-dependent receptor
Shewmr4_0789  -1      11       0.191150   hypothetical protein
Shewmr4_0790  0       12       0.455714   hypothetical protein
Shewmr4_0791  0       13       0.139427   methyl-accepting chemotaxis sensory transducer
Shewmr4_0792  -1      12       1.119893   molydopterin dinucleotide-binding region
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0788  ACRIFLAVINRP  498    e-161    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 498 bits (1285), Expect = e-161
Identities = 210/1043 (20%), Positives = 444/1043 (42%), Gaps = 49/1043 (4%)

Query: 11 FARNSVAANLLMWALLIGGLFSTVLINKEVFPSFNLNLLSITVAYPGAAPQEIEEGINIK 70
F R + A +L L++ G + + + +P+ +S++ YPGA Q +++ +
Sbjct: 5 FIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQV 64

Query: 71 IEEAIQDINGIKKVTSVA-SEGVGAITVEVEDDYDVQTVLDEAKLRLDAI-STFPVNIEK 128
IE+ + I+ + ++S + S G IT+ + D + + +L P +++
Sbjct: 65 IEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQ 124

Query: 129 PQIFKIEPENNVIWV----SVYGDMSLHDMKELAKS-VRDDLTQLPAVTRAKVTGVRDYE 183
I + ++ + V S + D+ + S V+D L++L V ++ G Y
Sbjct: 125 QGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG-AQYA 183

Query: 184 IGIEVSEDKLREYGLTFSQVALAVQNSSIDLPGGSIRAEDG------DILLRTKGQAYTG 237
+ I + D L +Y LT V ++ + + G + + + + +
Sbjct: 184 MRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNP 243

Query: 238 DDFANIVVTTRADGSRVMLPQVATIKDDFEERLEYTRFNGKPAAIIEVTSVNDQNALDIA 297
++F + + +DGS V L VA ++ E R NGKPAA + + NALD A
Sbjct: 244 EEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDTA 303

Query: 298 QQVKDYVEKRRATLPANAKLDTWGDLTHYLKGRLNMMMSNMFYGALLVFVILALFL-DLK 356
+ +K + + + P K+ D T +++ ++ ++ +F +LVF+++ LFL +++
Sbjct: 304 KAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMR 363

Query: 357 LAFWVMMGLPVCFLGTMLIMPLEPFSMTINMLTLFAFILVLGIVVDDAIVIGESAYSE-V 415
+ +PV LGT I+ F +IN LT+F +L +G++VDDAIV+ E+ +
Sbjct: 364 ATLIPTIAVPVVLLGTFAILAA--FGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421

Query: 416 ERHGHSIDNVIRGAQKVAMPATFGVLTTIAAFIPMLMVSGPMGIIWKSIGMVVIMCLAFS 475
E + + ++ + A FIPM G G I++ + ++ +A S
Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481

Query: 476 LVESKFILPAHLAHM-KFRKPGE---PTGFFGRFKDRFNNRVQHFIHHSYRNFLERCIQH 531
++ + + PA A + K GFFG F F++ V H Y N + + +
Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNH-----YTNSVGKILGS 536

Query: 532 RYNVVAAFIGVLILSVALVVSGKVRWVFFPDIPSDFIQVQLEMDEGSSEQNTLKVVQDIE 591
+ + ++ V L + ++ F P+ +++ G++++ T KV+ +
Sbjct: 537 TGRYLLIYALIVAGMVVLFL--RLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVT 594

Query: 592 EALYKMNAKMEKDNGSEVVKHSFINMSSRTSAFIFAELTKGEDREVDGET---IAAAWRE 648
+ K + + V SF ++ + F L E+R D + + +
Sbjct: 595 DYYLKNEKANVESVFT-VNGFSFSGQ-AQNAGMAFVSLKPWEERNGDENSAEAVIHRAKM 652

Query: 649 QLPELLSVKKLDFNAS-----GNGGGGGDISFRLTSSDLEELSAAARELKQKLATY-EGV 702
+L ++ + FN G G + L+ A +L A + +
Sbjct: 653 ELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASL 712

Query: 703 YDIADNFSSGSHEIRLKI-RPEAEALGLTLSDLARQVRYGFYGYEAQRILRNKEEIKVMV 761
+ N + + +L++ + +A+ALG++LSD+ + + G + K+ V
Sbjct: 713 VSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYV 772

Query: 762 RYPLEQRRTVGYLENMLIRTPQGKSVPFSTVAEVEKGESYASITRVDGKRAITIIANANK 821
+ + R ++ + +R+ G+ VPFS + R +G ++ I
Sbjct: 773 QADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGE--- 829

Query: 822 HKVEPSKVVNEIQKDFLPQLQAKYPK-IQTTLDGGSLDEQNAMVGLMQGFFFALFTIYAL 880
P + + L +K P I G S E+ + + ++
Sbjct: 830 --AAPGTSSGDAM-ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLC 886

Query: 881 MAVPLKSYSQPLIIMSVIPFGIIGALFGHLIQGLAMSVLSLCGIVALAGVVVNDSLILVD 940
+A +S+S P+ +M V+P GI+G L + V + G++ G+ +++++V+
Sbjct: 887 LAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVE 946

Query: 941 FVNRARE-QGLSIKQAAVDSGCYRFRAIILTSLTTFVGLVPIILERSLQAQIVIPMATSL 999
F E +G + +A + + R R I++TSL +G++P+ + + + +
Sbjct: 947 FAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGV 1006

Query: 1000 AFGILFSTVVTLILVPLLYIILD 1022
G++ +T++ + VP+ ++++
Sbjct: 1007 MGGMVSATLLAIFFVPVFFVVIR 1029


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0789  RTXTOXIND     42     3e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 42.1 bits (99), Expect = 3e-06
Identities = 24/121 (19%), Positives = 47/121 (38%), Gaps = 7/121 (5%)

Query: 106 DYEADLMQAEATLAQATAALNEEIARGEVAKIEFKGYDKGLPPELGLRIPQLKKEQANVK 165
+ E ++A L + L + + AK E++ + E+ + +L++ N+
Sbjct: 256 EQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEI---LDKLRQTTDNIG 312

Query: 166 YAQAALARAQRNLERTVIRAPFDGIIKARNV-DLGQYVTLGTNLGELY---DTRIAEIRL 221
LA+ + + +VIRAP ++ V G VT L + DT +
Sbjct: 313 LLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALV 372

Query: 222 P 222

Sbjct: 373 Q 373



Score = 38.3 bits (89), Expect = 5e-05
Identities = 25/125 (20%), Positives = 53/125 (42%), Gaps = 11/125 (8%)

Query: 62 GVVTPKYKTQLVTEVQGRMLSISPQFVA-GGIVKKGDQLAQIEPSDYEADLMQAEATLAQ 120
G +T +++ + ++ + + V G V+KGD L ++ EAD ++ +++L Q
Sbjct: 88 GKLTHSGRSKEIKPIENSI--VKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQ 145

Query: 121 ATA------ALNEEIARGEVAKIEFKGYD--KGLPPELGLRIPQLKKEQANVKYAQAALA 172
A L+ I ++ +++ + + E LR+ L KEQ + Q
Sbjct: 146 ARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQK 205

Query: 173 RAQRN 177
+
Sbjct: 206 ELNLD 210


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0791  HTHFIS        65     7e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 65.2 bits (159), Expect = 7e-13
Identities = 27/138 (19%), Positives = 47/138 (34%), Gaps = 5/138 (3%)

Query: 785 TLLVVDDIQQNIDLLSVWLTRQGHKVITARDGEQALLRMQKADIDITLMDLQMPVMDGLT 844
T+LV DD +L+ L+R G+ V + + D D+ + D+ MP +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 845 AAKMRREQEAESQLPHMPIIALTASVLEQDKSAAEQAGMDGFANKPIDFALLTREIARVL 904
+ P +P++ ++A A + G + KP D L I R L
Sbjct: 65 L-----LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119

Query: 905 QLNPPQAEAESLQPLGSQ 922
+
Sbjct: 120 AEPKRRPSKLEDDSQDGM 137


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0792  HTHFIS        79     1e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.5 bits (196), Expect = 1e-18
Identities = 31/169 (18%), Positives = 60/169 (35%), Gaps = 23/169 (13%)

Query: 10 TLLLVDDEPVNLRVLKQVLHQ-DYHLIFAKSGEEALRLAQTELPSLILLDIMMPNMTGLE 68
T+L+ DD+ VL Q L + Y + + R L++ D++MP+ +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 69 VCQLLKNIPETQSIPVIFVTALNDEHDEAAGFAVGGVDYIVKPISATIVKARVKTHLSLV 128
+ +K +PV+ ++A N G DY+ KP T + + L+
Sbjct: 65 LLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 129 QADELRRTR---------------LQVIQRLGRAAEYKDN-----ETGT 157
+ + ++ + L R + E+GT
Sbjct: 123 KRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGT 171


73  Shewmr4_0848  Shewmr4_0855  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_0848  0       23       -0.578083  ABC transporter-like protein
Shewmr4_0849  2       26       -1.150972  hypothetical protein
Shewmr4_0850  5       28       -1.711029  hypothetical protein
Shewmr4_0851  6       28       -1.680536  hypothetical protein
Shewmr4_0852  5       26       -1.852439  toluene tolerance family protein
Shewmr4_0853  3       27       -1.160604  hypothetical protein
Shewmr4_0854  2       22       -0.482710  SpoIIAA family protein
Shewmr4_0855  -1      18       0.124503   BolA family protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0848  FERRIBNDNGPP  39     1e-05    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 39.2 bits (91), Expect = 1e-05
Identities = 46/196 (23%), Positives = 74/196 (37%), Gaps = 19/196 (9%)

Query: 4 RRFI-ALGLSLALLPI---AAMAEPAKRIIALSPHAVEMLYAIGAGESIVAATDYADY-- 57
RR + A+ LS L + A A RI+AL VE+L A+G D +Y
Sbjct: 10 RRLLTAMALSPLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGI--VPYGVADTINYRL 67

Query: 58 ----PEAAKKIPSIGGYYGIQIERVLELNPDLIVVWDTGNKA--EDINQL-KSLGFKLYS 110
P + +G +E + E+ P + VW G E + ++ GF +S
Sbjct: 68 WVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFM-VWSAGYGPSPEMLARIAPGRGFN-FS 125

Query: 111 SSPKTLEDVAKEIEELGALTGRTEQASQVAADYRNQLLQLRSENAAKSE-PKVFYQLWST 169
+ L K + E+ L A A Y + + ++ + P + L
Sbjct: 126 DGKQPLAMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDP 185

Query: 170 PLMTV-AKNSWIQQII 184
M V NS Q+I+
Sbjct: 186 RHMLVFGPNSLFQEIL 201


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0852  BCTERIALGSPG  42     9e-08    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 41.8 bits (98), Expect = 9e-08
Identities = 18/55 (32%), Positives = 29/55 (52%), Gaps = 3/55 (5%)

Query: 3 LSKIKVNTGFTLIELMIAIAIVGILASIALPSYQEHVRNTRRTDARD---ALSNA 54
+ GFTL+E+M+ I I+G+LAS+ +P+ + + A AL NA
Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENA 55


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0853  BCTERIALGSPG  38     2e-06    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 38.3 bits (89), Expect = 2e-06
Identities = 17/51 (33%), Positives = 30/51 (58%)

Query: 3 TKKILGFTLTELMVVVAIVAIIAGIAAPSFASMIRENTARTQVNELLALTN 53
T K GFTL E+MVV+ I+ ++A + P+ + + V++++AL N
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALEN 54


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0854  BCTERIALGSPG  30     0.002    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 29.9 bits (67), Expect = 0.002
Identities = 13/23 (56%), Positives = 17/23 (73%), Gaps = 2/23 (8%)

Query: 4 RKQKGFSLIEIMVTSFIVAFGIL 26
KQ+GF+L+EIMV IV G+L
Sbjct: 5 DKQRGFTLLEIMVV--IVIIGVL 25


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_0855  BCTERIALGSPG  33     4e-04    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 32.6 bits (74), Expect = 4e-04
Identities = 12/46 (26%), Positives = 23/46 (50%), Gaps = 2/46 (4%)

Query: 12 QTGFTLIELMISLT-LGLVVMLGASQIFVSVNKAYVETQRFSQLQG 56
Q GFTL+E+M+ + +G++ L + + KA + S +
Sbjct: 7 QRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAV-SDIVA 51


74  Shewmr4_1015  Shewmr4_1026  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_1015  0       13       0.819369   hypothetical protein
Shewmr4_1016  1       16       0.456058   hypothetical protein
Shewmr4_1017  1       15       0.207034   type 12 methyltransferase
Shewmr4_1018  1       16       0.220156   hypothetical protein
Shewmr4_1019  1       16       -0.171896  hypothetical protein
Shewmr4_1020  2       19       -0.891191  hypothetical protein
Shewmr4_1021  4       29       -0.549787  hypothetical protein
Shewmr4_1022  5       32       -0.901657  hypothetical protein
Shewmr4_1023  6       32       -0.368523  OsmC family protein
Shewmr4_1024  6       33       -0.151675  MarR family transcriptional regulator
Shewmr4_1025  4       26       -0.501103  FAD dependent oxidoreductase
Shewmr4_1026  5       31       -0.663694  23S rRNA methyluridine methyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1015  SECFTRNLCASE  78     1e-17    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 78.0 bits (192), Expect = 1e-17
Identities = 31/172 (18%), Positives = 82/172 (47%), Gaps = 4/172 (2%)

Query: 442 VTIVEERTIGPTLGAENIQNGFAALGLGMGITLLFMALWYR-RLGWVANVALIANMVILF 500
+ I ++GP + E + +L + + ++ + + + A VAL+ ++++
Sbjct: 135 LKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVLLTV 194

Query: 501 GLLALIPGAVLTLPGIAGLVLTVGMAVDTNVLIFERIKDKLKEGRSFALA--IDTGFDSA 558
GL A++ L +A L+ G +++ V++F+R+++ L + ++ L ++ +
Sbjct: 195 GLFAVL-QLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVNET 253

Query: 559 FSTIFDANFTTMITAVVLYSIGNGPIQGFALTLGLGLLTSMFTGIFASRALI 610
S TT++ V + G I+GF + G+ T ++ ++ ++ ++
Sbjct: 254 LSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNIV 305


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1016  SECFTRNLCASE  239    8e-80    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 239 bits (611), Expect = 8e-80
Identities = 90/305 (29%), Positives = 154/305 (50%), Gaps = 20/305 (6%)

Query: 2 KNLNLTKWRYVSSAISLFLMLASLTIIGMKGFNWGLDFTGGVVTEVQLDRRITSSELQPL 61
N + +W++ + ++ +M+AS+ + + G N+G+DF GG + I +
Sbjct: 12 TNFDFFRWQWATFGAAIVMMIASVILPLVIGLNFGIDFKGGTTIRTESTTAIDVGVYRAA 71

Query: 62 LNAAYQQEVTVISASEP--------------------GRWVLRYADTAQSNVNIQETLAP 101
L +V + +P G N A
Sbjct: 72 LEPLELGDVIISEVRDPSFREDQHVAMIRIQMQEDGQGAEGQGAQGQELVNKVETALTAV 131

Query: 102 LGEVQVLNTSIVGPQVGKELAEQGGLALLVAMLCILGYLSYRFEWRLASGALFALVHDVI 161
+++ + VGP+V EL +LL A + I+ Y+ RFEW+ A GA+ ALVHDV+
Sbjct: 132 DPALKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVL 191

Query: 162 FVLAFFSLTQMEFNLTVLAAVLAILGYSLNDSIIIADRIRELLIAKPKLAIQEINNQAIV 221
+ F++ Q++F+LT +AA+L I GYS+ND++++ DR+RE LI + ++++ N ++
Sbjct: 192 LTVGLFAVLQLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVN 251

Query: 222 ATFSRTMVTSGTTLMTVGALWIMGGGPLEGFSIAMFIGILTGTFSSISVGTSLPEFLGLT 281
T SRT++T TTL+ + + I GG + GF AM G+ TGT+SS+ V ++ F+GL
Sbjct: 252 ETLSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNIVLFIGLD 311

Query: 282 PEHYK 286
K
Sbjct: 312 RNKEK 316


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr4_1019  HTHFIS  34     0.002    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 34.0 bits (78), Expect = 0.002
Identities = 23/82 (28%), Positives = 32/82 (39%), Gaps = 18/82 (21%)

Query: 198 VLMVGPPGTGKTLLAKAIAGESK---VPFFT-----ISGSDFVEMFVGV------GASRV 243
+++ G GTGK L+A+A+ K PF I G GA
Sbjct: 163 LMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTR 222

Query: 244 RD-MFEQAKKSAPCIIFIDEID 264
FEQA+ +F+DEI
Sbjct: 223 STGRFEQAEGGT---LFLDEIG 241


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits      Score  E-value  Comments
Shewmr4_1022  adhesinb  31     0.003    Adhesin B signature.
		>adhesinb#Adhesin B signature.

Length = 310

Score = 31.4 bits (71), Expect = 0.003
Identities = 18/95 (18%), Positives = 34/95 (35%), Gaps = 16/95 (16%)

Query: 142 REARRTFEVIAEELDIVIQKNGTMAFDNAIIAY----EPLWAVGTGKSATPEQAQEVHAF 197
+EA+ F I E +++ G F AY +W + T + TP+Q + +
Sbjct: 186 KEAKEKFNNIPGEKKMIVTSEG--CFKYFSKAYNVPSAYIWEINTEEEGTPDQIKTLVEK 243

Query: 198 IRKRLSEVSPFIGENIRILYGGSVTPSNAADLFAQ 232
+RK + L+ S ++
Sbjct: 244 LRKT----------KVPSLFVESSVDDRPMKTVSK 268


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Shewmr4_1023  SECGEXPORT  121    2e-39    Protein-export SecG membrane protein signature.
		>SECGEXPORT#Protein-export SecG membrane protein signature.

Length = 110

Score = 121 bits (305), Expect = 2e-39
Identities = 63/110 (57%), Positives = 82/110 (74%)

Query: 1 MYEVLVVVYLLVALGLIGLILIQQGKGADMGASFGAGASGTLFGSSGSGNFLTRTTAILA 60
MYE L+VV+L+VA+GL+GLI++QQGKGADMGASFGAGAS TLFGSSGSGNF+TR TA+LA
Sbjct: 1 MYEALLVVFLIVAIGLVGLIMLQQGKGADMGASFGAGASATLFGSSGSGNFMTRMTALLA 60

Query: 61 IAFFTLSLLIGNLSANHAKNEDTWKNLGVDEQVTQPVDQATEKSETKIPD 110
FF +SL++GN+++N W+NL + Q A K + IP+
Sbjct: 61 TLFFIISLVLGNINSNKTNKGSEWENLSAPAKTEQTQPAAPAKPTSDIPN 110


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Shewmr4_1026  TCRTETOQM  69     4e-14    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 69.1 bits (169), Expect = 4e-14
Identities = 38/133 (28%), Positives = 57/133 (42%), Gaps = 18/133 (13%)

Query: 392 IMGHVDHGKTSLLDYIRRAKVAAGEAG------------------GITQHIGAYHVETDN 433
++ HVD GKT+L + + A E G GIT G + +N
Sbjct: 8 VLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQWEN 67

Query: 434 GMITFLDTPGHAAFTAMRARGAKATDIVVLVVAADDGVMPQTIEAIQHAKAGNVPLIVAV 493
+ +DTPGH F A R D +L+++A DGV QT + +P I +
Sbjct: 68 TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPTIFFI 127

Query: 494 NKMDKPEADIDRV 506
NK+D+ D+ V
Sbjct: 128 NKIDQNGIDLSTV 140


75  Shewmr4_1072  Shewmr4_1084  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_1072  -2      12       1.164775   hypothetical protein
Shewmr4_1073  -2      14       1.119102   hypothetical protein
Shewmr4_1074  -1      15       0.811835   hypothetical protein
Shewmr4_1075  1       19       -0.103391  hypothetical protein
Shewmr4_1076  2       16       0.378792   methylation site containing protein
Shewmr4_1077  2       16       0.258121   hypothetical protein
Shewmr4_1078  -3      12       1.432660   methylation site containing protein
Shewmr4_1079  -3      13       1.309848   hypothetical protein
Shewmr4_1080  -2      13       0.962213   type IV pilus modification protein PilV
Shewmr4_1081  -3      13       0.356916   hypothetical protein
Shewmr4_1082  -1      9        0.209827   prepilin-type cleavage/methylation-like protein
Shewmr4_1083  0       10       0.248610   hypothetical protein
Shewmr4_1084  0       13       -0.432157  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr4_1072  HTHTETR  54     2e-11    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 54.2 bits (130), Expect = 2e-11
Identities = 33/160 (20%), Positives = 59/160 (36%), Gaps = 5/160 (3%)

Query: 23 LAKALEVFWRKGFEGTSLTDLTQAMGINKPSLYAAFGNKEQLFLKAIELYEQLPCAFFYP 82
L AL +F ++G TSL ++ +A G+ + ++Y F +K LF + EL E
Sbjct: 17 LDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELE 76

Query: 83 SLEK--ETAYQVAESMLYGAATNLVDKNHPQGCLIVQGALACSEAGQAIKETLITRRRDG 140
K V +L + V + + + + C G+ R
Sbjct: 77 YQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHK-CEFVGEMAVVQQAQRNLCL 135

Query: 141 E--QALCQRLQRAKDEGDLPADADPLLLSRYIGTVLQGMA 178
E + Q L+ + LPAD + + + G+
Sbjct: 136 ESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLM 175


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Shewmr4_1073  RTXTOXIND  45     3e-07    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 45.2 bits (107), Expect = 3e-07
Identities = 23/123 (18%), Positives = 48/123 (39%), Gaps = 5/123 (4%)

Query: 64 SVTLVPRVSGYIASVNFKEGALVKKGDVLFHIDASVFEAEVARLKADLASALSAE---QL 120
S + P + + + KEG V+KGDVL + A EA+ + ++ L A + Q+
Sbjct: 96 SKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQI 155

Query: 121 ATNDLERARKLFAQKAVSAELLDTRESNKRQTTAAVASVKAALLR--AELDLDYTQVRAP 178
+ +E + + + E + T+ + + + +L+ + RA
Sbjct: 156 LSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAE 215

Query: 179 IDG 181

Sbjct: 216 RLT 218



Score = 39.4 bits (92), Expect = 2e-05
Identities = 21/102 (20%), Positives = 41/102 (40%), Gaps = 10/102 (9%)

Query: 101 EAEVARLKADLASALSAEQLATNDLERARKLFAQKAVSAELLDTRESNKRQTTAAVASVK 160
E+ K+ L S A + + +LF + + +L RQTT + +
Sbjct: 265 VNELRVYKSQLEQIESEILSAKEEYQLVTQLF-KNEILDKL--------RQTTDNIGLLT 315

Query: 161 AALLRAELDLDYTQVRAPIDGRASYANV-TAGNYVSAGQSVL 201
L + E + +RAP+ + V T G V+ ++++
Sbjct: 316 LELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLM 357


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1074  ACRIFLAVINRP  1035   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1035 bits (2679), Expect = 0.0
Identities = 419/1043 (40%), Positives = 640/1043 (61%), Gaps = 18/1043 (1%)

Query: 2 LSQFFIKRPIFAAVLSLLFFITGAIAVWQLPITEYPEVVPPTVVVTANYPGANPKVIAET 61
++ FFI+RPIFA VL+++ + GA+A+ QLP+ +YP + PP V V+ANYPGA+ + + +T
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 62 VASPLEQEINGVEDMLYMSSQATSDGRMTLTITFAIGTDVDRAQTQVQSRVDRAMPRLPQ 121
V +EQ +NG+++++YMSS + S G +T+T+TF GTD D AQ QVQ+++ A P LPQ
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 122 EVQRLGIVTEKSSPDLTMVVHLLSPDNRYDMLYLSNYAALNVKDELARIKGVGAVRLFGA 181
EVQ+ GI EKSS MV +S + +S+Y A NVKD L+R+ GVG V+LFG
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG- 179

Query: 182 GEYSLRIWLDPNKVSALGMSPAEIIAAVREQNQQAAAGSLGAQPSGNA-DFQLLINVKGR 240
+Y++RIWLD + ++ ++P ++I ++ QN Q AAG LG P+ I + R
Sbjct: 180 AQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 241 LTELSEFEDIIIKVGQNGEVIRLKDVARVELGATSYALRSLLDNKDAVAIPVFQASGSNA 300
EF + ++V +G V+RLKDVARVELG +Y + + ++ K A + + A+G+NA
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 301 IQISDDVRAEMARLAKSFPEGLQYEIVYDPTVFVRGSIHAVVKTLLEAVLLVVLVVVLFL 360
+ + ++A++A L FP+G++ YD T FV+ SIH VVKTL EA++LV LV+ LFL
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 361 QTWRASIIPLVAVPVSLVGTFAFMHLMGFSLNALSLFGLVLAIGIVVDDAIVVVENVERN 420
Q RA++IP +AVPV L+GTFA + G+S+N L++FG+VLAIG++VDDAIVVVENVER
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 421 IAS-GLSPIAATQKAMKEVTGPIVATTLVLAAVFIPTAFMSGLTGQFYKQFALTITISTF 479
+ L P AT+K+M ++ G +V +VL+AVFIP AF G TG Y+QF++TI +
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 480 ISAINSLTLSPALSALLLKGHDAPKDALTRLMDKLFGGWLFTPFNRLFNRASEGYGYLVR 539
+S + +L L+PAL A LLK A G F FN F+ + Y V
Sbjct: 480 LSVLVALILTPALCATLLKPVSAE--------HHENKGGFFGWFNTTFDHSVNHYTNSVG 531

Query: 540 KVIRFGGIIGLVYLGMVALTGVQFVNTPTGYVPGQDKQYLVAFAQLPDAASLERTDAVIK 599
K++ G L+Y +VA V F+ P+ ++P +D+ + QLP A+ ERT V+
Sbjct: 532 KILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLD 591

Query: 600 KMSDIALNH--PGVAHSIAFPGLSINGFTNSPNSGVVFVALDDFELRKSPELSANAIAGQ 657
+++D L + V G S +G + N+G+ FV+L +E R E SA A+ +
Sbjct: 592 QVTDYYLKNEKANVESVFTVNGFSFSG--QAQNAGMAFVSLKPWEERNGDENSAEAVIHR 649

Query: 658 LNQQFAGIQDAFIAIFPPPPVQGLGTIGGFRLQIQDRANLGYEALYQVTQQVMYKAWADP 717
+ I+D F+ F P + LGT GF ++ D+A LG++AL Q Q++ A P
Sbjct: 650 AKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHP 709

Query: 718 -QLAGIFSSYQVNVPQLELDIDRTKAKQQAVSLDQIFQTLQTYMGSTYVNDFNRFGRTYQ 776
L + + + Q +L++D+ KA+ VSL I QT+ T +G TYVNDF GR +
Sbjct: 710 ASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKK 769

Query: 777 VNMQADEAFRQSPQQISQLKVPNVNGDMIPLGSFINVSQSAGPDRVMHYNGFTTAEINGG 836
+ +QAD FR P+ + +L V + NG+M+P +F G R+ YNG + EI G
Sbjct: 770 LYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGE 829

Query: 837 PAPGVSSGQAQAAIEKILAETLPIGMTYEWTELTYQQILAGNTGLLVFPLVILLVFMVLA 896
APG SSG A A +E + ++ LP G+ Y+WT ++YQ+ L+GN + + ++VF+ LA
Sbjct: 830 AAPGTSSGDAMALMENLASK-LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLA 888

Query: 897 AQYESLSLPLAIILIIPMTLLSALSGVLIYGGDNNIFTQIGLIVLVGLATKNAILIVEFA 956
A YES S+P++++L++P+ ++ L ++ N+++ +GL+ +GL+ KNAILIVEFA
Sbjct: 889 ALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFA 948

Query: 957 KEKQDH-GMEVMESILEAARLRLRPILMTSIAFIMGVVPMVFSTGAGAEMRQAMGVAVFA 1015
K+ + G V+E+ L A R+RLRPILMTS+AFI+GV+P+ S GAG+ + A+G+ V
Sbjct: 949 KDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMG 1008

Query: 1016 GMIGVTLFGLILTPLFYYALAKR 1038
GM+ TL + P+F+ + +
Sbjct: 1009 GMVSATLLAIFFVPVFFVVIRRC 1031


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr4_1079  PF05272  30     0.003    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.4 bits (68), Expect = 0.003
Identities = 21/74 (28%), Positives = 27/74 (36%), Gaps = 4/74 (5%)

Query: 6 KFLAHLTTALLITALPISTVLLAGCAQSD--PNTPSPSANASPTTLYQALGDDS--GVSA 61
+ LA+ TA + A S AG A P PSA A GDD
Sbjct: 372 RVLAYFGTARALLADVSSPTAAAGGAGGGEPPKKRDPSAGAGTDPGGPGGGDDGEDPFGE 431

Query: 62 IVDGLLARIARDPR 75
+D +AR+ R
Sbjct: 432 WLDDEVARLRLRGR 445


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr4_1082  TCRTETB  59     1e-11    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 58.7 bits (142), Expect = 1e-11
Identities = 51/230 (22%), Positives = 105/230 (45%), Gaps = 15/230 (6%)

Query: 7 LWLCVLLMMFPQIMETIYSPALPNIAENFAVSVTSASQTLSVYFIAFAVGVFCWGRLADI 66
+WLC+L + E + + +LP+IA +F S + + + + F++G +G+L+D
Sbjct: 17 IWLCILSFFSV-LNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75

Query: 67 IGRRNAMLAGLVCYAIGSAFALM-ISDFTLLLLARILSAFGAA----VGSVITQTMMRDS 121
+G + +L G++ GS + S F+LL++AR + GAA + V+ +
Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKE 135

Query: 122 YSGEELAKVFSVMGMSLGISPIIGLLLGSLLSAYWGYQGVFVALMVSAIVLLFLSLKSLP 181
G+ + S++ M G+ P IG ++ + +W Y + + + I+ + +K L
Sbjct: 136 NRGKAFGLIGSIVAMGEGVGPAIGGMIAHYI--HWSY---LLLIPMITIITVPFLMKLLK 190

Query: 182 ETRPAHTQKIAIVELAIKMLTDSGIIKNTLLVASFNLMWFSYFSLAPFMF 231
+ I + L GI+ L S+++ + L+ +F
Sbjct: 191 KEV-RIKGHFDIKGII---LMSVGIVFFMLFTTSYSISFLIVSVLSFLIF 236


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1083  ACRIFLAVINRP  778    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 778 bits (2011), Expect = 0.0
Identities = 308/1032 (29%), Positives = 517/1032 (50%), Gaps = 28/1032 (2%)

Query: 3 LSDVSVKRPVVAIVLSLLLCVFGLVSFTKLSVREMPDVESPVVTVSTSYSGASAAIMESQ 62
+++ ++RP+ A VL+++L + G ++ +L V + P + P V+VS +Y GA A ++
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 63 ITKTLEDELTGISGIDEITSTT-RNGSSRITVKFLLGWNLTEGVSDVRDAVARAQRRLPE 121
+T+ +E + GI + ++ST+ GS IT+ F G + V++ + A LP+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 122 DANDPVVSKDNGSGEPSVYVNLSSSVMDRTQ--LTDYAQRVLEDRFSLISGVSSISISGG 179
+ +S + S + S TQ ++DY ++D S ++GV + + G
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 180 LYKVMYVKLRPEQMAGRNVTVTDIINALRKENVETPGGQVRNDTTV------MSVRTKRL 233
Y + + L + + +T D+IN L+ +N + GQ+ + S+ +
Sbjct: 181 QYAMR-IWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 234 YYTPKDFDYLVVRTASDGTPIYLKDVADVAVGAQNENSTFKSDGIVNLSLGVITQSDANP 293
+ P++F + +R SDG+ + LKDVA V +G +N N + +G LG+ + AN
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 294 LIVAQEVHKEVDRIQDFLPEGTSLVVDFDSTVFIDRSINEVYNTLYVTGALVVLVLYIFI 353
L A+ + ++ +Q F P+G ++ +D+T F+ SI+EV TL+ LV LV+Y+F+
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 354 GQTRATLIPAVTVPVSLISAFIAANMFGYSINLLTLMALILAIGLVVDDAIVVVENIFHH 413
RATLIP + VPV L+ F FGYSIN LT+ ++LAIGL+VDDAIVVVEN+
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 414 I-ERGEEPLLAAYKGTREVGFAVVATTAVLVMVFLPISFMEGMVGLLFTEFSVMLAVSVL 472
+ E P A K ++ A+V VL VF+P++F G G ++ +FS+ + ++
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 473 FSSLIALTLTPVLSSKLLKANVK-----PNRFNRFVDSGFARMEKVYRVGVTQAIRFKWL 527
S L+AL LTP L + LLK F + ++ F Y V + +
Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 528 APLVILACVGGSAWLMQQVPSQLAPQEDRGVLFAFVKGAEGTSYNRMTANMDIVEDRLMP 587
L+ V G L ++PS P+ED+GV ++ G + R +D V D +
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 588 LLGQGVLRSFSVQAPAFGGRAGDQTGFVIMQLEDWEHRHVTAQQALGIIS---NALKDIP 644
V F+V +F G+ G + L+ WE R+ A +I L I
Sbjct: 600 NEKANVESVFTVNGFSFSGQ-AQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIR 658

Query: 645 DVMVRPM-MPGFRGQ-SSEPVQFVL---GGSDYAELFKWAQVLKEEANASP-MMEGADLD 698
D V P MP ++ F L G + L + L A P + +
Sbjct: 659 DGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPN 718

Query: 699 YAETTPELIVTVDKERAAELGISVDEVSQTLEVMLGGRKETTYVDRGEEYDVYLRGDENS 758
E T + + VD+E+A LG+S+ +++QT+ LGG ++DRG +Y++ D
Sbjct: 719 GLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKF 778

Query: 759 FNNVGDLSQIYMRSAKGELVTLDTLTHIEEVASAQKLSHTNKQKSITLKANISKGYTLGE 818
D+ ++Y+RSA GE+V T V + +L N S+ ++ + G + G+
Sbjct: 779 RMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGD 838

Query: 819 ALKFLDNKAIELLPKDISIGYTGESKDFKENQSSILIVFGLALLVAYLVLAAQFESFINP 878
A+ ++N A + LP I +TG S + + + + ++ +V +L LAA +ES+ P
Sbjct: 839 AMALMENLASK-LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIP 897

Query: 879 LVVMFTVPMGVFGGFLGLLVTSQGINIYSQIGMIMLIGMVTKNGILIVEFANQLRDR-GL 937
+ VM VP+G+ G L + +Q ++Y +G++ IG+ KN ILIVEFA L ++ G
Sbjct: 898 VSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGK 957

Query: 938 ALDKAIIDASTRRLRPILMTAFTTLVGAVPLIFSSGAGSESRIAVGTVVFFGMAFATFVT 997
+ +A + A RLRPILMT+ ++G +PL S+GAGS ++ AVG V GM AT +
Sbjct: 958 GVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLA 1017

Query: 998 LFVIPAMYRLIS 1009
+F +P + +I
Sbjct: 1018 IFFVPVFFVVIR 1029


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Shewmr4_1084  RTXTOXIND  51     3e-09    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 51.4 bits (123), Expect = 3e-09
Identities = 21/108 (19%), Positives = 46/108 (42%), Gaps = 3/108 (2%)

Query: 50 PLTQSISLIGKLA-ADRAVVIAPQVTGKIKQIAVTSNQAVKKGQLLIELDDMKAQAAVAE 108
+ + GKL + R+ I P +K+I V ++V+KG +L++L + A+A +
Sbjct: 79 QVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLK 138

Query: 109 ANAFLNDETRKLREFEKLISRNAITQTEIDAQKASVDIARARLASAQA 156
+ L +L + I +I ++ K + ++ +
Sbjct: 139 TQSSL--LQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEV 184


76  Shewmr4_1151  Shewmr4_1159  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_1151  -2      14       0.290681   nucleotidyl transferase
Shewmr4_1152  -2      14       0.109472   Dna-J like membrane chaperone protein
Shewmr4_1153  -1      14       -0.025401  hypothetical protein
Shewmr4_1154  -2      9        0.078536   hypothetical protein
Shewmr4_1155  -2      13       -0.235469  D-isomer specific 2-hydroxyacid dehydrogenase,
Shewmr4_1156  -2      13       -0.104886  transposase IS116/IS110/IS902 family protein
Shewmr4_1157  -1      13       -0.072344  hypothetical protein
Shewmr4_1158  -1      14       -0.286491  OmpA/MotB domain-containing protein
Shewmr4_1159  0       15       -0.272481  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr4_1151  HTHFIS  77     2e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 76.8 bits (189), Expect = 2e-18
Identities = 19/115 (16%), Positives = 52/115 (45%), Gaps = 2/115 (1%)

Query: 18 IRVGLVEDQQLVRQGIASLIAISQHIEVSWQAENGQEALKRLQTDAVDVLLSDIRMPVLD 77
+ + +D +R + ++ + + N + + D++++D+ MP +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRI--TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 78 GLSLLKQLRAAQNSIPVIMLTTFDDSELFLNSLQAGANGFLLKDVSLDKLLEAIE 132
LL +++ A+ +PV++++ + + + + GA +L K L +L+ I
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr4_1152  PF06580  45     2e-07    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 45.2 bits (107), Expect = 2e-07
Identities = 57/377 (15%), Positives = 137/377 (36%), Gaps = 84/377 (22%)

Query: 45 HIALYSLFILFFLLLTTLDFPRKNLNLSRLLSTALVGSVFGLMLLS--------QHPLLP 96
+ ++ L +L K ++ ++ +L+G V S + +
Sbjct: 16 QGIGWGVYTLTGFGFASLYGSPKLHSMIFNIAISLMGLVLTHAYRSFIKRQGWLKLNMGQ 75

Query: 97 ILLVIWAS----------------LLPEFFNRRVAIAQILLAN-LGYYLILHAQTSNSVM 139
I+L + + L F N + + LA + + +++ + +
Sbjct: 76 IILRVLPACVVIGMVWFVANTSIWRLLAFINTKPVAFTLPLALSIIFNVVVVTFMWSLLY 135

Query: 140 INVLIYMGFQLFAYSSSQARLSERESR------RIQEH-LNQQLIATRALLSQTSEQQER 192
+ ++ + +E++ +I H + L RAL+ + +
Sbjct: 136 FGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKARE 195

Query: 193 LRISRDLHDILGHQLTALSLQLELLSHQAPTELKPTVNQSKSLAKELLESIRAVVRAQRV 252
+ LT+LS EL+ + L+ + + SLA EL + + ++ +
Sbjct: 196 M-------------LTSLS---ELMRYS----LRYSNARQVSLADEL-TVVDSYLQLASI 234

Query: 253 NIGLDLTPPLDAIVSRLPNVSLQYESLMPLQSTELAQALLLVLQEGISNAVRHG-----K 307
+ + LQ+E+ + + Q +++Q + N ++HG +
Sbjct: 235 ---------------QFED-RLQFENQIN-PAIMDVQVPPMLVQTLVENGIKHGIAQLPQ 277

Query: 308 ANQLTLSMQEEQAELIICLKDNGQGI--SQSPSQGVGLSSMQERLSPFHGSARLQANQAG 365
++ L ++ + + +++ G + S G GL +++ERL +G +A
Sbjct: 278 GGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLYG------TEAQ 331

Query: 366 VDSSRTQGC-SLMIRLP 381
+ S QG + M+ +P
Sbjct: 332 IKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1153  ISCHRISMTASE  38     1e-05    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 37.7 bits (87), Expect = 1e-05
Identities = 34/125 (27%), Positives = 52/125 (41%), Gaps = 18/125 (14%)

Query: 8 KTALLIIDMQQ---GLFYADAPPFNREQVLNNINLLIAKAREAGAPIWAVRHTG---PE- 60
+ LLI DMQ F A A P ++ NI L + + G P+ G P+
Sbjct: 30 RAVLLIHDMQNYFVDAFTAGASPV--TELSANIRKLKNQCVQLGIPVVYTAQPGSQNPDD 87

Query: 61 --------GSPIAAGTANWQLIESLAINPQLDNIFDKTKPSCFYQTGLAEALAHEGVSEL 112
G + +G ++I LA D + K + S F +T L E + EG +L
Sbjct: 88 RALLTDFWGPGLNSGPYEEKIITELAPEDD-DLVLTKWRYSAFKRTNLLEMMRKEGRDQL 146

Query: 113 VIVGM 117
+I G+
Sbjct: 147 IITGI 151


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr4_1157  HTHTETR  53     4e-11    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 53.5 bits (128), Expect = 4e-11
Identities = 34/170 (20%), Positives = 65/170 (38%), Gaps = 9/170 (5%)

Query: 2 RNAEFDRAQVLRGAMAAFMHKGYTKTSMQDLTQATGLHPGSIYCAFSNKRGLLIAAIEQY 61
+ A+ R +L A+ F +G + TS+ ++ +A G+ G+IY F +K L E
Sbjct: 7 QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELS 66

Query: 62 QQDRNEQFKLIFSNSRPVLGNLKTYLDNIVVECLSCDSQQA--CLLTKALNEIAEQDDEI 119
+ + E L G+ + L I++ L + LL + + E E+
Sbjct: 67 ESNIGE---LELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEM 123

Query: 120 QNIIS---QNLMLWQNALTAQFELAASQGMLQGELNSEQRAQYLMMGIYG 166
+ + + + + ML +L + RA +M G
Sbjct: 124 AVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTR-RAAIIMRGYIS 172


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr4_1159  HTHFIS  32     0.012    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 31.7 bits (72), Expect = 0.012
Identities = 13/39 (33%), Positives = 17/39 (43%), Gaps = 6/39 (15%)

Query: 339 LFGYVENATFRGTVFTDFSLIRPGSLHKANGGVLLMDAI 377
LFG+ + A FT G +A GG L +D I
Sbjct: 208 LFGHEKGA------FTGAQTRSTGRFEQAEGGTLFLDEI 240


77  Shewmr4_1243  Shewmr4_1248  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_1243  -3      15       1.483922   lipoyl synthase
Shewmr4_1244  -2      16       1.574908   lipoate-protein ligase B
Shewmr4_1245  -2      16       1.673868   hypothetical protein
Shewmr4_1246  -3      16       0.084483   penicillin-binding protein 6
Shewmr4_1247  -3      17       -0.781710  hypothetical protein
Shewmr4_1248  -2      17       -1.008648  rare lipoprotein A
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
Shewmr4_1243  IGASERPTASE  34     0.002    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 34.3 bits (78), Expect = 0.002
Identities = 35/264 (13%), Positives = 85/264 (32%), Gaps = 35/264 (13%)

Query: 393 QASVQSIEQQASKAQRIAKQNGEEAQALMQQTDQIATAIEEMSTSIRDVANHAQDGANQS 452
+V+ EQ A++ ++ +EA++ ++ Q AQ G+
Sbjct: 1048 SKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEV--------------AQSGSETK 1093

Query: 453 QQVDLAAKEGQQQQTQVVQDLLKLSQQLSSSHQAVEKVSQE-SEAISKVTEVINSIAEQT 511
+ KE + + + Q + QE SE + E +
Sbjct: 1094 ETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAE-----PARE 1148

Query: 512 NLLALNAAIEAARAGEQGRGFAVVADEVRTLAQRTQSSI---LEISQTIDKLQSQVKTTT 568
N +N ++ + A+ T S++ + S T++ S V+
Sbjct: 1149 NDPTVNIKEPQSQTNTTA--------DTEQPAKETSSNVEQPVTESTTVNTGNSVVENPE 1200

Query: 569 SQMAQSHQLGIASANQGEETGKQLEEITRRIGELAISSRNIASATEQQSSVAQEITHNLH 628
+ + Q + S + + + + + +++ +S+VA + +
Sbjct: 1201 NTTPATTQPTVNSESSNKPKNRHRRSVRSV----PHNVEPATTSSNDRSTVALCDLTSTN 1256

Query: 629 QISELANEGEHRAAETVNSANDLS 652
+ L++ +N +S
Sbjct: 1257 TNAVLSDARAKAQFVALNVGKAVS 1280


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1245  ACRIFLAVINRP  376    e-116    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 376 bits (967), Expect = e-116
Identities = 208/1050 (19%), Positives = 421/1050 (40%), Gaps = 60/1050 (5%)

Query: 1 MIKAFVENGRLVSLVIALLLVAGFGAISSLPRTEDPHITNRFASVITPYPGASAERVEAL 60
M F+ ++ +L++AG AI LP + P I SV YPGA A+ V+
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTEVLENQLRRLEEIKLIQSTS-RPGISVIQLELKDTVKDTDPVWSR--ARDLLADARNT 117
VT+V+E + ++ + + STS G I L + TDP ++ ++ L A
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQS---GTDPDIAQVQVQNKLQLATPL 117

Query: 118 LPDGVQT---STLDDQVGYAYTAILSLVWNNSSQPRVDMLNRYAKE-LQSRLRLLSGTDF 173
LP VQ S Y A ++Q D ++ Y ++ L L+G
Sbjct: 118 LPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQ---DDISDYVASNVKDTLSRLNGVGD 174

Query: 174 VKLYGAPEEEILVQLDGYKMSQLQLTPGTIAKILSSADSKIAAGEINN------NNFRAF 227
V+L+GA + + + LD +++ +LTP + L + +IAAG++ A
Sbjct: 175 VQLFGA-QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNAS 233

Query: 228 VEVSGELDSQSRIRQVPLKVDTQGQIIRLGDIAHISRQPKTPADSIALVDGEQGVFVAAR 287
+ + +V L+V++ G ++RL D+A + + + IA ++G+ + +
Sbjct: 234 IIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENY-NVIARINGKPAAGLGIK 292

Query: 288 MLNNTRVDIWQGQVKQLVDEFNQELPANIKVQWLFEQNSYTSDRLGGLIINLLQGFVIIL 347
+ +K + E P +KV + ++ + + ++ L + +++
Sbjct: 293 LATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVF 352

Query: 348 AVLLLTLG-LRNAIIVALSLPLTALFTLACMKYIGLPIHQMSVTGLVVALGIMVDNAIVI 406
V+ L L +R +I +++P+ L T A + G I+ +++ G+V+A+G++VD+AIV+
Sbjct: 353 LVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVV 412

Query: 407 VDAIAQRRQ-QGMSRLRAVSETLHHLWLPLAGSTITTILAFAPIVLMPGAAGEFVGGIAM 465
V+ + + + A +++ + L G + F P+ G+ G ++
Sbjct: 413 VENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSI 472

Query: 466 SVMFALLGSYVISHTLIAGLAGRF--SLEGKHP-------VWYQHGINVPLVSGYFQASL 516
+++ A+ S +++ L L + +H W+ + V+ Y S+
Sbjct: 473 TIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFD-HSVNHY-TNSV 530

Query: 517 RFALNRPLLSATFIGIIPLLGFYASGKMTEQFFPPSDRDMFQIELYLAPHVSLENTLNQV 576
L +I ++ F P D+ +F + L + E T +
Sbjct: 531 GKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVL 590

Query: 577 QLMDKQL--HQIEGITQVDWVVGGNTPSFYYNLTQRQQGATNYAQAMVK-----ASDFER 629
+ ++ + V V G + Q A +K D
Sbjct: 591 DQVTDYYLKNEKANVESVFTVNGFSFSG--------QAQNAGMAFVSLKPWEERNGDENS 642

Query: 630 ANALIPELQQTLDK---AFPEAQVLVRKLEQGPPFNAPVELM-IFGPNLETLRSLGDEVR 685
A A+I + L K F + +E G EL+ G + L +++
Sbjct: 643 AEAVIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLL 702

Query: 686 NILAATP-DVLHTRATLSAGAPKVWLQVNEDASLISGLTLTDIARQVQMATTGVIGGSVL 744
+ A P ++ R + L+V+++ + G++L+DI + + A G +
Sbjct: 703 GMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFI 762

Query: 745 EQTESLPIRVRLGDTSREQASRLSEIQLVTPSGTAVPLSALSHNEVQVSRGAIPRRNGQR 804
++ + V+ R + ++ + + +G VP SA + + + R NG
Sbjct: 763 DRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLP 822

Query: 805 VNTIEAYIVSGVLPAQVLNDVKDKVAGISLPAGYRIEIGGESAKRNEAIGNLLSNLILVV 864
I+ G + +++ + LPAG + G S + + + + +
Sbjct: 823 SMEIQGEAAPGTSSGDAMALMENLAS--KLPAGIGYDWTGMSYQERLSGNQAPALVAISF 880

Query: 865 TLLLATVVLSFNSFRLTAIILLSALQSAGLGLLAVYVFGYPFGFPVIIALLGLMGLAINA 924
++ + + S+ + ++L LLA +F ++ LL +GL+
Sbjct: 881 VVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKN 940

Query: 925 AIVILAELEDTDNARA-GDKEVIITTVSGCGRHITSTTITTVGGFIPLII---AGGGFWP 980
AI+I+ +D G E + V R I T++ + G +PL I AG G
Sbjct: 941 AILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQN 1000

Query: 981 PFAIAIAGGTLLTTLLSLVWVPTMYLLLMK 1010
I + GG + TLL++ +VP ++++ +
Sbjct: 1001 AVGIGVMGGMVSATLLAIFFVPVFFVVIRR 1030


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Shewmr4_1246  RTXTOXIND  42     3e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 41.7 bits (98), Expect = 3e-06
Identities = 25/106 (23%), Positives = 45/106 (42%), Gaps = 5/106 (4%)

Query: 75 SGKLSELTVDSGARVTQGQVLAKLDTRLLDAEHQEIQASLAQTQADVDLATSTLNRNLEL 134
+ + E+ V G V +G VL KL +A+ + Q+SL Q + + L+R++EL
Sbjct: 104 NSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQ-TRYQILSRSIEL 162

Query: 135 KKSGYVSEQLLDENRTQLASLEAGKKRLLASLQANQLKRDKSQLLA 180
K +L + ++ + L SL Q ++Q
Sbjct: 163 NK----LPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQ 204



Score = 39.8 bits (93), Expect = 1e-05
Identities = 29/166 (17%), Positives = 58/166 (34%), Gaps = 22/166 (13%)

Query: 101 RLLDAEHQ--EIQASLAQTQADVDLATSTLNR---NLELKKSGYVSEQL--LDENRTQLA 153
+L+ E++ E L ++ ++ S + +L + +E L L + +
Sbjct: 253 AVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIG 312

Query: 154 SLEAGKKRLLASLQANQLKRDKSQLLAPFNGIISQRQ-HNLGEVVEAGSPVFILVGSVNT 212
L L N+ ++ S + AP + + Q + H G VV + ++V +T
Sbjct: 313 LLTL-------ELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDT 365

Query: 213 -EAYIGVPVAVAQQFVNGQNVTV--SVHNQQ----FTAKIAGISAE 251
E V GQN + K+ I+ +
Sbjct: 366 LEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLD 411


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr4_1247  HTHTETR  74     1e-18    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 74.3 bits (182), Expect = 1e-18
Identities = 40/197 (20%), Positives = 68/197 (34%), Gaps = 5/197 (2%)

Query: 11 RSEQKKQQVLVAAIDLFCRQGFPHTSMDEVAKQAGVSKQTVYSHYGSKDDLFVAAIE--S 68
+++ +Q +L A+ LF +QG TS+ E+AK AGV++ +Y H+ K DLF E
Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSE 67

Query: 69 KCVGHNLNADLLSDPSQPEATLTEFALQFGEMIVSPEAITVFKACVAQSESHP---EVSR 125
+G P P + L E + E V+ E + + V +
Sbjct: 68 SNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127

Query: 126 LFFEAGPQHMLAMLTKYLGAVEALGVYRFSQPHHCAVRLCLMLFGELKLRLELGLETESL 185
+ + L + A + L ++ L
Sbjct: 128 QAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDL 187

Query: 186 LGEREQYIRGCAEMFLK 202
E Y+ EM+L
Sbjct: 188 KKEARDYVAILLEMYLL 204


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1248  ECOLNEIPORIN  29     0.032    E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 28.6 bits (64), Expect = 0.032
Identities = 29/122 (23%), Positives = 42/122 (34%), Gaps = 32/122 (26%)

Query: 79 DKQWS-----IGMRVGYDRLDYDWRNIRLTGANNAAGLFSDAG--ETWENIDRYRAGLSL 131
D W IG++ G+ +L G N + D G W++ Y +
Sbjct: 88 DSGWGNRQSFIGLKGGFGKLRV--------GRLN--SVLKDTGDINPWDSKSDYLGVNKI 137

Query: 132 SYRMDKHWS--------FMLSPQLQYAYADTASASNAQSYGVVASAMYAFESGNMLGFGV 183
+ + S LS +QYA D A N++SY A Y GF V
Sbjct: 138 AEPEARLISVRYDSPEFAGLSGSVQYALNDNAGRHNSESYH--AGFNYKNG-----GFFV 190

Query: 184 AY 185
Y
Sbjct: 191 QY 192


78  Shewmr4_1257  Shewmr4_1273  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_1257   1      20       -2.917288  DNA polymerase III subunit delta
Shewmr4_1258   2      19       -1.585171  rare lipoprotein B
Shewmr4_1259   4      18       -1.047122  hypothetical protein
Shewmr4_1260   3      17       -0.625541  leucyl-tRNA synthetase
Shewmr4_1261   3      17       -0.133963  hypothetical protein
Shewmr4_1262   1      17        0.252630  methylation site containing protein
Shewmr4_1263  -1      12        1.011313  apolipoprotein N-acyltransferase
Shewmr4_1264  -2      13       -0.235995  hypothetical protein
Shewmr4_1265  -1      15       -1.247282  CBS domain-containing protein
Shewmr4_1266  -1      16       -1.516981  putative metalloprotease
Shewmr4_1267   1      22       -2.512969  PhoH family protein
Shewmr4_1268   2      25       -3.352695  (dimethylallyl)adenosine tRNA
Shewmr4_1269   2      35       -5.622222  hypothetical protein
Shewmr4_1270   1      40       -6.094758  hypothetical protein
Shewmr4_1272   1      37       -6.471541  2-octaprenyl-3-methyl-6-methoxy-1,4-benzoquinol
Shewmr4_1273   1      31       -5.328480  peptidyl-tRNA hydrolase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr4_1257  HTHFIS  63     4e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 62.5 bits (152), Expect = 4e-13
Identities = 24/128 (18%), Positives = 52/128 (40%), Gaps = 12/128 (9%)

Query: 180 HIMVIDDSAVARKQIIRALESLNLQIDTAKDGREALDKLKTIAAEMNNVAEEIPLIISDI 239
I+V DD A R + +AL + + + A + L+++D+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA---------AGDGDLVVTDV 55

Query: 240 EMPEMDGYTLTAEIRDDPKLKHIKVVLHTSLSGVFNQAMVQKVGANDFIAK-FNPDELAA 298
MP+ + + L I+ + V++ ++ + + GA D++ K F+ EL
Sbjct: 56 VMPDENAFDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIG 113

Query: 299 AVNKHLSL 306
+ + L+
Sbjct: 114 IIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Shewmr4_1260  FLGHOOKAP1  33     3e-04    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 33.0 bits (75), Expect = 3e-04
Identities = 9/38 (23%), Positives = 18/38 (47%)

Query: 99 NVNVMEEMADMISASRSYQMNVQVAEAAKSMLQQTLGM 136
VN+ EE ++ + Y N QV + A ++ + +
Sbjct: 508 GVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545



Score = 29.9 bits (67), Expect = 0.003
Identities = 16/67 (23%), Positives = 29/67 (43%), Gaps = 6/67 (8%)

Query: 5 SIFDVAGSGMSAQSVRLNTTASNIANADSVSSSIDKTYRSRHPIFEAEMAKAQSQQQASQ 64
S+ + A SG++A LNT ++NI++ + Y + I + +
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYN------VAGYTRQTTIMAQANSTLGAGGWVGN 55

Query: 65 GVAVKGI 71
GV V G+
Sbjct: 56 GVYVSGV 62


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Shewmr4_1262  FLGHOOKAP1  40     1e-05    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 40.3 bits (94), Expect = 1e-05
Identities = 15/35 (42%), Positives = 22/35 (62%)

Query: 2 SFNIALSGISAAQKDLNTTANNIANANTIGFKESR 36
N A+SG++AAQ LNT +NNI++ N G+
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQT 37



Score = 37.6 bits (87), Expect = 9e-05
Identities = 12/49 (24%), Positives = 25/49 (51%)

Query: 405 SISSSALEQSNIDLTTELVDLISAQRNFQANSRTLEVNNTLQQTVLQIR 453
+S+ S ++L E +L Q+ + AN++ L+ N + ++ IR
Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Shewmr4_1264  FLGHOOKAP1  45     1e-07    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 45.3 bits (107), Expect = 1e-07
Identities = 19/119 (15%), Positives = 41/119 (34%), Gaps = 4/119 (3%)

Query: 145 DDATSITVSAEGEVSVKTPGTAENQVVGQLSMSDFINPSGLDPMGQNLYTETG---ASGT 201
D I +++E + + + Q + + +L ++ G A+
Sbjct: 427 TDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGNKTATLK 486

Query: 202 PIQGTASLDGMGAIRQGALETSNVNVTEELVNLIESQRIYEMNSKVISAVDQMLAYVNQ 260
T + + S VN+ EE NL Q+ Y N++V+ + + +
Sbjct: 487 TSSATQGNV-VTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544



Score = 35.7 bits (82), Expect = 2e-04
Identities = 9/36 (25%), Positives = 20/36 (55%)

Query: 5 LWISKTGLDAQQTDIAVISNNVANASTVGYKKSRAV 40
+ + +GL+A Q + SNN+++ + GY + +
Sbjct: 4 INNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTI 39


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1265  FLGLRINGFLGH  145    1e-45    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 145 bits (368), Expect = 1e-45
Identities = 76/220 (34%), Positives = 108/220 (49%), Gaps = 19/220 (8%)

Query: 11 LLLTACSSTSKKPIADDPFYAPVYPEAPPTKIAATGSIYQDSQAA-----SLYSDIRAHK 65
L LT C+ P+ A P P A GSI+Q +Q L+ D R
Sbjct: 17 LSLTGCAWIPSTPLVQGATSAQPVPGPTP---VANGSIFQSAQPINYGYQPLFEDRRPRN 73

Query: 66 VGDIITIVLKEATQAKKSAGNQIKKGSDLSLDPIYAGGSNVS------IGGVPLDLRYKD 119
+GD +TIVL+E A KS+ + + G V G D+
Sbjct: 74 IGDTLTIVLQENVSASKSSSANASRDGKTNF-----GFDTVPRYLQGLFGNARADVEASG 128

Query: 120 SMNTKRESDADQSNSLDGSISANVMQVLNNGNLVVRGEKWISINNGDEFIRVTGIVRSQD 179
+ A+ SN+ G+++ V QVL NGNL V GEK I+IN G EFIR +G+V +
Sbjct: 129 GNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRFSGVVNPRT 188

Query: 180 IKPDNTIDSTRMANARIQYSGTGTFADAQKVGWLSQFFMS 219
I NT+ ST++A+ARI+Y G G +AQ +GWL +FF++
Sbjct: 189 ISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLN 228


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1266  FLGPRINGFLGI  378    e-133    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 378 bits (973), Expect = e-133
Identities = 161/367 (43%), Positives = 223/367 (60%), Gaps = 14/367 (3%)

Query: 5 LILAVAMLAFSLPSQAE--RIKDIANVQGVRSNQLIGYGLVVGLPGTGEKTR---YTEQT 59
L+ + + P+QA+ RIKDIA++Q R NQLIGYGLVVGL GTG+ R +TEQ+
Sbjct: 11 LVFSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQS 70

Query: 60 FTTMLKNFGINLPDNFRPKIKNVAVVAVHADMPAFIKPGQELDVTVSSLGEAKSLRGGTL 119
ML+N GI + KN+A V V A++P F PG +DVTVSSLG+A SLRGG L
Sbjct: 71 MRAMLQNLGITTQGG-QSNAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNL 129

Query: 120 LQTFLKGVDGNVYAIAQGSLVVSGFSADGLDGSKVIQNTPTVGRIPNGAIVERSVATPFS 179
+ T L G DG +YA+AQG+L+V+GFSA G D + + Q T R+PNGAI+ER + + F
Sbjct: 130 IMTSLSGADGQIYAVAQGALIVNGFSAQG-DAATLTQGVTTSARVPNGAIIERELPSKFK 188

Query: 180 TGDYLTFNLRRSDFSTAQRMADAINDL----LGPDMARPLDATSVQVSAPRDVSQRVSFL 235
L LR DFSTA R+AD +N G +A P D+ + V PR V+ +
Sbjct: 189 DSVNLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPR-VADLTRLM 247

Query: 236 ATLENIEVEPADESAKVIVNSRTGTIVVGQNVKLLPAAVTHGGLTVTIAEATQVSQPNAL 295
A +EN+ VE D AKV++N RTGTIV+G +V++ AV++G LTV + E+ QV QP
Sbjct: 248 AEIENLTVET-DTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQPAPF 306

Query: 296 ANGQTTVTSNSTINASESNRRMFMFNPGTTLDELVRAVNLVGAAPSDVLAILEALKVAGA 355
+ GQT V + I A + ++ + G L LV +N +G ++AIL+ +K AGA
Sbjct: 307 SRGQTAVQPQTDIMAMQEGSKVAIVE-GPDLRTLVAGLNSIGLKADGIIAILQGIKSAGA 365

Query: 356 LHGELII 362
L EL++
Sbjct: 366 LQAELVL 372


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr4_1267  FLGFLGJ  180    6e-56    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 180 bits (457), Expect = 6e-56
Identities = 108/362 (29%), Positives = 158/362 (43%), Gaps = 79/362 (21%)

Query: 12 DLGGLDSLRAQAQKDEKGALKKVAQQFEGVFVQMLMKSMRDANAVFESDSPLNSQYTKFY 71
D L+ L+A+A +D ++ VA+Q EG+FVQM++KSMRDA D +S++T+ Y
Sbjct: 14 DAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALP---KDGLFSSEHTRLY 70

Query: 72 EQMRDQQLSVDLSDKGVLGLADMMVQQLSPESSQFTPASVLRNDGGMKLQHDAKAFNIPA 131
M QQ++ Q T G+ L P
Sbjct: 71 TS--------------------MYDQQIA---QQMTAG------KGLGLAEMMVKQMTPE 101

Query: 132 QATSAAETQTAAAPVVAAQGVPASIARPSANVDNGDGVTSSLDIDRPERLLAIDTPKPAW 191
Q E T AAP+ + ++ +
Sbjct: 102 Q--PLPEESTPAAPM--------KFPLETVVRYQNQALSQLV------------------ 133

Query: 192 SEQPLSPIEPVISGQILPTAAFRETQKTLKFGSREEFLATLYPHAEKAAKALGTQPEVLL 251
Q P S G + FLA L A+ A++ G ++L
Sbjct: 134 --QKAVPRNYDDSLP----------------GDSKAFLAQLSLPAQLASQQSGVPHHLIL 175

Query: 252 AQSALETGWGQKIVRGNNGAPSHNLFNIKADRRWQGDKANVSTLEFEQGIAVRQKADFRV 311
AQ+ALE+GWGQ+ +R NG PS+NLF +KA W+G ++T E+E G A + KA FRV
Sbjct: 176 AQAALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRV 235

Query: 312 YADFEHSFNDFVSFIAEGERYQAAKKVAASPTQFIRALQDAGYATDPKYAEKVIKVMQSI 371
Y+ + + +D+V + RY AA AAS Q +ALQDAGYATDP YA K+ ++Q +
Sbjct: 236 YSSYLEALSDYVGLLTRNPRY-AAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQM 294

Query: 372 SE 373

Sbjct: 295 KS 296


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Shewmr4_1268  FLGHOOKAP1  212    2e-63    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 212 bits (542), Expect = 2e-63
Identities = 129/486 (26%), Positives = 205/486 (42%), Gaps = 20/486 (4%)

Query: 4 DLLNIARTGVLASQSQLGVTSNNIANANTAGYHRQVATQTTLDSQRLGNSFYGTGTYVSD 63
L+N A +G+ A+Q+ L SNNI++ N AGY RQ +S + G G YVS
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSG 61

Query: 64 VKRIYNDYAARELRIGQTTLSAAEASYGKLSELDQVFSQIGKIVPQSLNDLFSGLNSLAD 123
V+R Y+ + +LR QT S A Y ++S++D + S + + D F+ L +L
Sbjct: 62 VQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLVS 121

Query: 124 LPADLGIRSSTLNDAKQLANSLNQMQSTLNGQLTQTNDQITGMTKRINEISTELANLNLE 183
D R + + ++ L N L Q Q N I +IN + ++A+LN +
Sbjct: 122 NAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLNDQ 181

Query: 184 LMKSPNQDAM-----LLDKQDALVQELSQYAQVNVIPLENGAKSIMLGGAIMLVSGEV-- 236
+ + A LLD++D LV EL+Q V V + G +I + LV G
Sbjct: 182 ISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTAR 241

Query: 237 PMSISTATGDPFPNELQLMSTIGSQSVRVDPTKLGGQLGALFEYREQTLVPAGLELDQLA 296
++ ++ DP + + + G LG + +R Q L L QLA
Sbjct: 242 QLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQLA 301

Query: 297 LGVADNFNKLQAEGFDLNGQVGADIFKDINDPLMSIGRVAGFSGNTGNATLGVNIDDTSA 356
L A+ FN GFD NG G D F + V + N G+ +G + D SA
Sbjct: 302 LAFAEAFNTQHKAGFDANGDAGEDFFA------IGKPAVLQNTKNKGDVAIGATVTDASA 355

Query: 357 LSGGSYELSF--TAPATYELRDTQTGTITPLTLNGTKLEGGAGFSIDIKAGAMASGDRFA 414
+ Y++SF L T T+TP +G G A D F
Sbjct: 356 VLATDYKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFT----GTPAVNDSFT 411

Query: 415 IRPTAGAANGIEVVMTDPKGIAAAAPKITPDAANSGNTQVKVTQITNRSAANFPTTGSEL 474
++P + A ++V++TD IA A+ + D+ N N Q + +N + ++
Sbjct: 412 LKPVSDAIVNMDVLITDEAKIAMASEEDAGDSDN-RNGQALLDLQSNSKTVGGAKSFNDA 470

Query: 475 TIQLNT 480
L +
Sbjct: 471 YASLVS 476



Score = 81.5 bits (201), Expect = 2e-18
Identities = 37/102 (36%), Positives = 52/102 (50%)

Query: 536 EGDNTNALAMAKLSETKVMNGGKSTLADVFEQTKQDIGSQTKAAEVRVGAADAIYQQAYA 595
+ DN N A+ L GG + D + DIG++T + + Q
Sbjct: 442 DSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGNKTATLKTSSATQGNVVTQLSN 501

Query: 596 RVESESGVNLDEEAANLMRFQQAYQASARIMSTAQQIFDTLL 637
+ +S SGVNLDEE NL RFQQ Y A+A+++ TA IFD L+
Sbjct: 502 QQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALI 543


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Shewmr4_1269  FLAGELLIN  50     7e-09    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 50.4 bits (120), Expect = 7e-09
Identities = 33/233 (14%), Positives = 83/233 (35%), Gaps = 4/233 (1%)

Query: 20 QTATSKILDQLSSGKKVNTSGDDPVAALGIDNLNQRNALVDQFMKNIDYATNHLQQTESQ 79
Q++ S +++LSSG ++N++ DD + + Q +N + + Q TE
Sbjct: 21 QSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNANDGISIAQTTEGA 80

Query: 80 LGQADTLISSMKDLMLQGSNGSQTSEERQTIADDLRKSLDQLLTIANTKDESGNYLFAGN 139
L + + + +++L +Q +NG+ + + ++I D++++ L+++ ++N +G + + +
Sbjct: 81 LNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSNQTQFNGVKVLSQD 140

Query: 140 KTDTLPFQFDANGKIVYQGDSGVHSAIIASGIQLNTNVAGDTAFIKSPNAMGDYSVNYLP 199
+ + I ++ G NV G +V
Sbjct: 141 NQMKIQVGANDGETITIDLQKIDVKSLGLDG----FNVNGPKEATVGDLKSSFKNVTGYD 196

Query: 200 SQQGEFSVTSAKLDGVTPSLSDYKINFLDDGAGGINVEVTDTATPANVISAAA 252
+ + ++ D T N +
Sbjct: 197 TYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAVDL 249


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Shewmr4_1270  FLAGELLIN  206    1e-62    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 206 bits (524), Expect = 1e-62
Identities = 156/507 (30%), Positives = 231/507 (45%), Gaps = 25/507 (4%)

Query: 2 AISVNTNVTSMKAQNQLNGANNRLSTSMERLSSGLRINSAKDDAAGLQISNRMTSQINGI 61
A +NTN S+ QN LN + + LS+++ERLSSGLRINSAKDDAAG I+NR TS I G+
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 GVAMRNANDGISIAQTAEGAMQESTNILQRMRDLSLQSANGSNSSEDRAAMQKELNALQS 121
A RNANDGISIAQT EGA+ E N LQR+R+LS+Q+ NG+NS D ++Q E+
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 ELTRIADTTSFGGQKLLDGSYGTQSFQVGAQANETISVSLKSVAAADIGAYKSDAAGSKF 181
E+ R+++ T F G K+L QVGA ETI++ L+ + +G + G K
Sbjct: 121 EIDRVSNQTQFNGVKVLSQD-NQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 182 GGDLVTAVAGSNGSAGGNLSITQGTKTTTFATAANDTAAEVAGKINKAGTGVTATAQTTI 241
+ N + ++ + A T +K TT
Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTD 239

Query: 242 EANLTEAFDKGLTMKVDDGNSTSSLDLTGIGN-NDDLAKAINNVSGETGVSAKLENGVLT 300
+A A D T K G + + I + V+ +
Sbjct: 240 DAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKV 299

Query: 301 ITSSKGADISFSD-----------------------TATAGTGTLTLKNISADGTSSAVS 337
T+ G ++ + + G T K + S +
Sbjct: 300 STTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLE 359

Query: 338 TIDGTDTTVDNATAVGSVSLTASSAYSISGGIAAELTTEVAGVFAGVNSVDISSAAGSQS 397
+ + A+ G + +GV +N ++ + +
Sbjct: 360 ANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTAN 419

Query: 398 ALAIIDGAIASIDSSRSDLGAVQNRMSFTINNLSNIQSNVTDARSRIQDVDFASETAQLT 457
LA ID A++ +D+ RS LGA+QNR I NL N +N+ ARSRI+D D+A+E + ++
Sbjct: 420 PLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMS 479

Query: 458 KQQILSQTSSAMLAQANQLPQTALSLL 484
K QIL Q +++LAQANQ+PQ LSLL
Sbjct: 480 KAQILQQAGTSVLAQANQVPQNVLSLL 506


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Shewmr4_1273  FLAGELLIN  207    3e-63    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 207 bits (528), Expect = 3e-63
Identities = 156/507 (30%), Positives = 230/507 (45%), Gaps = 25/507 (4%)

Query: 2 AISVNTNVTSMRAQNQLNGANSKLSTSMERLSSGLRINSAKDDAAGLQISNRMTSQVNGI 61
A +NTN S+ QN LN + S LS+++ERLSSGLRINSAKDDAAG I+NR TS + G+
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 GVAMRNANDGISIAQTAEGAMQESTNILQRMRDLSLQSANGSNSSEDRAAMQKEVNALQS 121
A RNANDGISIAQT EGA+ E N LQR+R+LS+Q+ NG+NS D ++Q E+
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 ELTRISETTSFGGQKLLDGSYGTQSFQVGAQANETISVSLKSVAAADIGAYKSDAAGSKF 181
E+ R+S T F G K+L QVGA ETI++ L+ + +G + G K
Sbjct: 121 EIDRVSNQTQFNGVKVLSQD-NQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 182 GGDLVTAVAGSNGSAGGNLSITQGTKTTTFATAANDTAAEVAGKINKAGTGVTATAQTTI 241
+ N + ++ + A T +K TT
Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTD 239

Query: 242 EANLTEAFDKGLTMKVDDGNSTSSLDLTGIGN-NDDLAKAINNVSGETGVSAKLENGVLT 300
+A A D T K G + + I + V+ +
Sbjct: 240 DAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKV 299

Query: 301 ITSSKGADISFSD-----------------------TATAGTGTLTLKNISADGTSSAVS 337
T+ G ++ + + G T K + S +
Sbjct: 300 STTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLE 359

Query: 338 TIDGTDTTVDNATAVGSVSLTASSAYSISGGIAAELTTEVAGVFAGVNSVDLSTASGSQS 397
+ + A+ G + +GV +N + + +
Sbjct: 360 ANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTAN 419

Query: 398 ALAIIDGAIAGIDSQRADLGAVQNRMNFTINNLSNIQSNVTDARSRIQDVDFASETAQLT 457
LA ID A++ +D+ R+ LGA+QNR + I NL N +N+ ARSRI+D D+A+E + ++
Sbjct: 420 PLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMS 479

Query: 458 KQQILSQTSSAMLAQANQLPQTALSLL 484
K QIL Q +++LAQANQ+PQ LSLL
Sbjct: 480 KAQILQQAGTSVLAQANQVPQNVLSLL 506


79  Shewmr4_1278  Shewmr4_1303  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_1278  -1      10        0.298889  hypothetical protein
Shewmr4_1279  -1      12        0.592754  hypothetical protein
Shewmr4_1280  -1      11        1.213871  hypothetical protein
Shewmr4_1281   0      10        1.017602  transcription elongation factor GreA
Shewmr4_1282   0      12        0.758195  hypothetical protein
Shewmr4_1283   0      13        0.606625  preprotein translocase subunit SecD
Shewmr4_1284   0      13        0.465231  preprotein translocase subunit SecF
Shewmr4_1285   0      13        0.642556  preprotein translocase subunit SecF
Shewmr4_1286   1      14       -0.536263  hypothetical protein
Shewmr4_1287  -1      14       -0.707089  23S rRNA methyltransferase J
Shewmr4_1288  -1      17       -0.602085  membrane protease FtsH catalytic subunit
Shewmr4_1289  -1      15       -0.645621  hypothetical protein
Shewmr4_1290  -2      17        0.270540  dihydropteroate synthase
Shewmr4_1291  -2      14       -0.230307  phosphoglucosamine mutase
Shewmr4_1292  -1      13       -0.405317  triosephosphate isomerase
Shewmr4_1293  -1      13       -0.276555  preprotein translocase subunit SecG
Shewmr4_1294  -1      14       -0.736103  **hypothetical protein
Shewmr4_1295  -1      15       -1.150972  transcription elongation factor NusA
Shewmr4_1296   0      18       -1.991814  translation initiation factor IF-2
Shewmr4_1297   2      17       -1.458124  ribosome-binding factor A
Shewmr4_1298   1      18       -1.236422  tRNA pseudouridine synthase B
Shewmr4_1299   1      19       -1.151972  30S ribosomal protein S15
Shewmr4_1300   0      17       -0.538156  diguanylate cyclase/phosphodiesterase
Shewmr4_1301   0      20       -0.711803  hypothetical protein
Shewmr4_1302  -1      19       -0.449530  polynucleotide phosphorylase/polyadenylase
Shewmr4_1303  -3      23       -1.028231  lipoprotein NlpI
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr4_1278  HTHFIS  448    e-157    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 448 bits (1155), Expect = e-157
Identities = 172/480 (35%), Positives = 263/480 (54%), Gaps = 19/480 (3%)

Query: 7 RILLVGTPSERLSRLCCIFEFLGEQIEII-STEKLSSCLQDTRYRALVLTTDNM----SV 61
IL+ + + L G + I + L + LV+T M +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGD-LVVTDVVMPDENAF 63

Query: 62 EALKSLANQYPWQPILL---FGNVGDLQVSNVLG---QIEEPLNYPQLTELLHFCQVYGQ 115
+ L + P P+L+ ++ G + +P + +L ++ +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 116 VKRPQVPTSANQTKLFRSLVGRSEGIANVRHLISQVATSDATVLVLGQSGTGKEVVARNI 175
+ ++ + LVGRS + + +++++ +D T+++ G+SGTGKE+VAR +
Sbjct: 124 RRPSKLEDDSQD---GMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARAL 180

Query: 176 HYLSERRDGPFIPVNCGAIPPELLESELFGHEKGSFTGAISSRKGRFELAEGGTLFLDEI 235
H +RR+GPF+ +N AIP +L+ESELFGHEKG+FTGA + GRFE AEGGTLFLDEI
Sbjct: 181 HDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEI 240

Query: 236 GDMPLQMQVKLLRVLQERVFERVGGTKTINADVRVVAATHRDLESMITSNEFREDLYYRL 295
GDMP+ Q +LLRVLQ+ + VGG I +DVR+VAAT++DL+ I FREDLYYRL
Sbjct: 241 GDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRL 300

Query: 296 NVFPIEMPALSDRKDDVPLLLQELVSRVYNEGRGKVRFTQRAIESLKEHAWSGNVRELSN 355
NV P+ +P L DR +D+P L++ V + EG RF Q A+E +K H W GNVREL N
Sbjct: 301 NVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVRELEN 360

Query: 356 LVERLTILYPGGLVDVNDLPIKYRHIDVPEYSIELSEEQQERDALASIFTSEEPVEIPET 415
LV RLT LYP ++ + + R ++P+ IE + + +++ EE +
Sbjct: 361 LVRRLTALYPQDVITREIIENELRS-EIPDSPIEKAAARSGSLSISQAV--EENMRQYFA 417

Query: 416 RFPSELPPEGVNLKDLLAELEIDMIRQALEQQDNVVARAAEMLGIRRTTLVEKMRKYGMT 475
F LPP G+ +LAE+E +I AL +AA++LG+ R TL +K+R+ G++
Sbjct: 418 SFGDALPPSGL-YDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr4_1280  HTHFIS  463    e-163    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 463 bits (1194), Expect = e-163
Identities = 171/483 (35%), Positives = 251/483 (51%), Gaps = 43/483 (8%)

Query: 1 MSEAKLLLVEDDASLREALLDTLMLAQYDCIDVASGEEAIIALKQHQFDLVISDVQMQGI 60
M+ A +L+ +DDA++R L L A YD ++ + DLV++DV M
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 GGLGLLNYLQQHHPKLPVLLMTAYATIGSAVSAIKLGAVDYLAKPFAPEVLLNQVSRYLP 120
LL +++ P LPVL+M+A T +A+ A + GA DYL KPF L+ + R L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 121 LKQSSDQPVVAD-----------EKSLALLSLAQRVAASDASVMILGPSGSGKEVLARYI 169
+ + D + + R+ +D ++MI G SG+GKE++AR +
Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARAL 180

Query: 170 HQHSSRANQAFVAINCAAIPENMLEATLFGYEKGAFTGAYQACPGKFEQAQGGTLLLDEI 229
H + R N FVAIN AAIP +++E+ LFG+EKGAFTGA G+FEQA+GGTL LDEI
Sbjct: 181 HDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEI 240

Query: 230 SEMDLGLQAKLLRVLQEREVERLGGRKTIKLDVRVLATSNRDLKAVVAAGQFREDLYYRI 289
+M + Q +LLRVLQ+ E +GGR I+ DVR++A +N+DLK + G FREDLYYR+
Sbjct: 241 GDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRL 300

Query: 290 NVFPLTWPALNQRPADILPLARHLLAKHAKALNVLDLPEFDEAACRRLLSHRWPGNVREL 349
NV PL P L R DI L RH + + K LD+ FD+ A + +H WPGNVREL
Sbjct: 301 NVVPLRLPPLRDRAEDIPDLVRHFVQQAEK--EGLDVKRFDQEALELMKAHPWPGNVREL 358

Query: 350 DNVVQRALILRAGAVITANDIIIDAQDV---------------------------PLLSD 382
+N+V+R L VIT I + + +
Sbjct: 359 ENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFAS 418

Query: 383 DAEYVNEPEGLGEELKAQEHVIILETLAQCQGSRKLVAEKLGISARTLRYKMARMRDMGI 442
+ + L E+ +IL L +G++ A+ LG++ TLR K +R++G+
Sbjct: 419 FGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKK---IRELGV 475

Query: 443 QLP 445
+
Sbjct: 476 SVY 478


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
Shewmr4_1281  FLGHOOKFLIE  60     3e-15    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.

Length = 103

Score = 60.1 bits (145), Expect = 3e-15
Identities = 32/101 (31%), Positives = 55/101 (54%)

Query: 12 MQSLKGEIAPSFGISPNNIVQQVNNTSGADFGQLLSQAIGNVSGLQSTSSNLATRLEMGD 71
+Q ++G I+ + + Q+ F L A+ +S Q+ + A + +G+
Sbjct: 3 IQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTLGE 62

Query: 72 TTVSLSDTVIAREKASVAFEATVQVRNKLVEAYKEIMSMPV 112
V+L+D + +KASV+ + +QVRNKLV AY+E+MSM V
Sbjct: 63 PGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1282  FLGMRINGFLIF  297    4e-96    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 297 bits (762), Expect = 4e-96
Identities = 160/567 (28%), Positives = 263/567 (46%), Gaps = 57/567 (10%)

Query: 26 LGGVDMMRQITMILALAICLALAVFVMIWAQEPEYRPL-GKMETQEMVQVLDVLDKNKIK 84
L + +I +I+A + +A+ V +++WA+ P+YR L + Q+ ++ L + I
Sbjct: 16 LNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNIP 75

Query: 85 YQIDVD--VVKVPEDKYQEVKMMLSRAGIDSAAASSKDFLTQDSGFGVSQRMEQARLKHS 142
Y+ ++VP DK E+++ L++ G+ A + L Q FG+SQ EQ + +
Sbjct: 76 YRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQ-EKFGISQFSEQVNYQRA 134

Query: 143 QEENLARAIEQLQSVSRAKVILALPKENVFARNTAQPSATVVINTRRG-GLGQGEVDAIV 201
E LAR IE L V A+V LA+PK ++F R PSA+V + G L +G++ A+V
Sbjct: 135 LEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQISAVV 194

Query: 202 DIVASAVQGLEPSRVTVTDSNGRLLNSGSQDGVSARARRELELVQQKEAEYRTKIDSILS 261
+V+SAV GL P VT+ D +G LL + G +L+ E+ + +I++ILS
Sbjct: 195 HLVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDL-NDAQLKFANDVESRIQRRIEAILS 253

Query: 262 PILGPDNFTSQVDVSMDFTAVEQTAKRFNPDLPSLRSEMTVENNST-----GGSTGGIPG 316
PI+G N +QV +DF EQT + ++P+ + ++ + + G GG+PG
Sbjct: 254 PIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPGGVPG 313

Query: 317 ALSNQPP---------------MESNIPQEA-DKATESVTAGNSHREATRNFELDTTISH 360
ALSNQP N PQ + + S ++ R T N+E+D TI H
Sbjct: 314 ALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSNYEVDRTIRH 373

Query: 361 TRQQIGVVRRVSVSVAVDFKPGAAGENGQVARVARTEQELTNIRRLLEGAVGFSAQRGDV 420
T+ +G + R+SV+V V++K A G+ + T ++ I L A+GFS +RGD
Sbjct: 374 TKMNVGDIERLSVAVVVNYKTLADGKP-----LPLTADQMKQIEDLTREAMGFSDKRGDT 428

Query: 421 LEVVTVPFMDQLVEDVPAPELWEQPWFWRAVKLGVGALVILV----LILAVVRPMLKRLI 476
L VV PF + W+Q F + L++LV L VRP L R +
Sbjct: 429 LNVVNSPF-SAVDNTGGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVRPQLTRRV 487

Query: 477 YPDNVNMPEDSRLGNELAEIEDQYAADTLGMLNTKEAEYSYADDGSIL---IPNLHKDDD 533
E E+ E + D + +
Sbjct: 488 E----EAKAAQEQAQVRQETEE-------------AVEVRLSKDEQLQQRRANQRLGAEV 530

Query: 534 MIKAIRALVANEPELSTQVVKNWLQDN 560
M + IR + N+P + V++ W+ ++
Sbjct: 531 MSQRIREMSDNDPRVVALVIRQWMSND 557


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1283  FLGMOTORFLIG  290    9e-99    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 290 bits (743), Expect = 9e-99
Identities = 109/348 (31%), Positives = 194/348 (55%), Gaps = 5/348 (1%)

Query: 1 MAENKSKDAAEAPSFNIKDLSGIEKTAILLLSLSEADAASILKHLEPKQVQKVGMAMAAM 60
M E K K+ + + L+G +K AILL+S+ ++ + K+L ++++ + +A +
Sbjct: 1 MEEKKEKEILD-----VSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKL 55

Query: 61 EDFGQEKVIGVHKLFLDDIQKYSSIGFNSEEFVRKALTAALGEDKAGNLIEQIIMGSGAK 120
E E V F + + I ++ R+ L +LG KA ++I + ++
Sbjct: 56 ETITSELKDNVLLEFKELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSR 115

Query: 121 GLDSLKWMDARQVATIIQNEHPQIQTIVLSYLEPDQAAEIFGQFPENTRLDLMMRIANLE 180
+ ++ D + IQ EHPQ ++LSYL+P +A+ I P + ++ RIA ++
Sbjct: 116 PFEFVRRADPANILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMD 175

Query: 181 EVQPAALQELNDIMEKQFAGQGGAQAAKMGGLKAAANIMNYLDTGVESQLMETMRETDEE 240
P ++E+ ++EK+ A GG+ I+N D E ++E++ E D E
Sbjct: 176 RTSPEVVREVERVLEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPE 235

Query: 241 MAQQIQDLMFVFENLIDVDDRGIQTLLREVQQDVLMKALKGTDDQLKDKILGNMSKRAAE 300
+A++I+ MFVFE+++ +DDR IQ +LRE+ L KALK D +++KI NMSKRAA
Sbjct: 236 LAEEIKKKMFVFEDIVLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAAS 295

Query: 301 LLRDDLEAMGPIRISEVEIAQKEILSIARRLSDSGEIMLGGGGGDEFL 348
+L++D+E +GP R +VE +Q++I+S+ R+L + GEI++ GG ++ L
Sbjct: 296 MLKEDMEFLGPTRRKDVEESQQKIVSLIRKLEEQGEIVISRGGEEDVL 343


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr4_1284  FLGFLIH  86     5e-22    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 86.4 bits (213), Expect = 5e-22
Identities = 58/201 (28%), Positives = 103/201 (51%), Gaps = 4/201 (1%)

Query: 54 APKAVAAETIAPPTMAEIEDIRAQAEEEGFN---EGKTQGYAEGLEQGRLEGLEQGHTEG 110
AP I P IE+ E++ + QGY G+ +GR +G +QG+ EG
Sbjct: 16 APPQAEFVPIVEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGIAEGRQQGHKQGYQEG 75

Query: 111 LVQGHEQGLEAGLAEAKTLIQRFEGLLSQFEKPLQLLDGDIEHSLMTLTMALAKSVIGHE 170
L QG EQGL ++ + R + L+S+F+ L LD I LM + + A+ VIG
Sbjct: 76 LAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRLMQMALEAARQVIGQT 135

Query: 171 LKTHPEQILSALRLGVESLPIKEQSVSIRMHPDDVALVEQLYTSTQLNRNQWQLEAEPSL 230
++ ++ ++ P+ +R+HPDD+ V+ + +T L+ + W+L +P+L
Sbjct: 136 PTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGAT-LSLHGWRLRGDPTL 194

Query: 231 NPGDCIISSQRSLVDLTLSSR 251
+PG C +S+ +D ++++R
Sbjct: 195 HPGGCKVSADEGDLDASVATR 215


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr4_1286  FLGFLIJ  43     6e-08    Flagellar FliJ protein signature.
		>FLGFLIJ#Flagellar FliJ protein signature.

Length = 147

Score = 42.9 bits (100), Expect = 6e-08
Identities = 36/145 (24%), Positives = 71/145 (48%)

Query: 1 MANADPLLLVLKLALDAEEQAALLLKSAQLECQKRQNQLDALNNYRLDYMKQMQSQQGQA 60
MA L + LA E AA LL + CQ+ + QL L +Y+ +Y + S
Sbjct: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60

Query: 61 ISASHYHQFHRFIRQIDEAIAQQNRVVADGEKQKNYRQQHWLDKQKKRKAVELLLDNKEK 120
I+++ + + +FI+ +++AI Q + + ++ + W +K+++ +A + L + +
Sbjct: 61 ITSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120

Query: 121 KRQALELKKEQKMTDEFASQQFFRR 145
E + +QK DEFA + R+
Sbjct: 121 AALLAENRLDQKKMDEFAQRAAMRK 145


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
Shewmr4_1287  FLGHOOKFLIK  51     9e-09    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 50.6 bits (120), Expect = 9e-09
Identities = 37/132 (28%), Positives = 63/132 (47%), Gaps = 5/132 (3%)

Query: 460 MKQQLVTMVSQGIQQAEIRLDPPELGHMLVKVQVHGDQTQVQFHVTQAQTRDVVEQAIPR 519
+ Q + QG Q AE+RL P +LG + + ++V +Q Q+Q R +E A+P
Sbjct: 244 LSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAALPV 303

Query: 520 LRELLQEQGLQLADSHVSQGDQGQRREGGFGEAGGSSGGNVDDFSAEELD-----LGLNQ 574
LR L E G+QL S++S +++ + N + + E+ D + L
Sbjct: 304 LRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVSLQG 363

Query: 575 ATSLNSGIDYYA 586
+ NSG+D +A
Sbjct: 364 RVTGNSGVDIFA 375


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1289  FLGMOTORFLIM  249    7e-83    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 249 bits (637), Expect = 7e-83
Identities = 88/327 (26%), Positives = 165/327 (50%), Gaps = 12/327 (3%)

Query: 1 MSDLLSQDEIDALLHGVDDVEEDNELDAAGLEARS----YDFSSQDRIVRGRMPTLEIVN 56
M+++LSQDEID LL + + E DA + YDF D+ + +M TL +++
Sbjct: 1 MTEVLSQDEIDQLLTAISSGDASIE-DARPISDTRKITLYDFRRPDKFSKEQMRTLSLMH 59

Query: 57 ERFARHLRISMFNMMRRAAEVSINGVQMLKFGEYVHTLFVPTSLNMVRFHPLKGTALITM 116
E FAR S+ +R V + V L + E++ ++ P++L ++ PLKG A++ +
Sbjct: 60 ETFARLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEV 119

Query: 117 EARLVFILVDNFFGGDGRFHAKIEGREFTPTERRIVQLLLKIIFEDYKDAWAPVMDVEFD 176
+ + F ++D FGG G+ R+ T E +++ ++ I + +++W V+D+
Sbjct: 120 DPSITFSIIDRLFGGTGQAAKVQ--RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPR 177

Query: 177 YLDSEVNPAMANIVSPTEVVVINSFHIEVDGGGGDFHITMPYSMIEPIRELLDAG--VQS 234
E NP A IV P+E+VV+ + +V G + +PY IEPI L + S
Sbjct: 178 LGQIETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSS 237

Query: 235 DKQDTDMRWSQALHDEIMDVKVGFDATVVEHELTLKDVMNFKAGDIIPIE---LPEYIMM 291
++ + ++ L D++ V + A V L+++D++ + GDII + + + ++
Sbjct: 238 VRRSSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVL 297

Query: 292 KIEDLPTYRCKMGRSRDNLALKIYEKI 318
I + + C+ G +A +I E+I
Sbjct: 298 SIGNRKKFLCQPGVVGKKIAAQILERI 324


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1290  FLGMOTORFLIN  111    9e-35    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 111 bits (278), Expect = 9e-35
Identities = 54/122 (44%), Positives = 81/122 (66%)

Query: 2 STDDDWAAAMAEQALEEANAIDLDELVDDSQPISKAEAAKLDTILDIPVTISMEVGRSYI 61
+ DD WA A+ EQ + +D I+DIPV +++E+GR+ +
Sbjct: 14 ALDDLWADALNEQKATTTKSAADAVFQQLGGGDVSGAMQDIDLIMDIPVKLTVELGRTRM 73

Query: 62 SIRNLLQLNQGSVVELDRVAGEPLDVMVNGTLIAHGEVVVVNDKFGIRLTDVISQTERIK 121
+I+ LL+L QGSVV LD +AGEPLD+++NG LIA GEVVVV DK+G+R+TD+I+ +ER++
Sbjct: 74 TIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGVRITDIITPSERMR 133

Query: 122 KL 123
+L
Sbjct: 134 RL 135


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1292  FLGBIOSNFLIP  275    4e-96    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 275 bits (705), Expect = 4e-96
Identities = 122/240 (50%), Positives = 176/240 (73%)

Query: 8 LIGVSTLLFAASVGAADGVLPAVTVKTAADGSTEYSVTMQILLLMTSLSFIPAMVIMLTS 67
L+ V+ +L A LP +T + G +S+ +Q L+ +TSL+FIPA+++M+TS
Sbjct: 4 LLSVAPVLLWLITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLMMTS 63

Query: 68 FTRIIVVLSILRQAIGLQQTPSNQVLIGMSLFMTFFIMAPVFDKIYDQGVKPYIDEQLTL 127
FTRII+V +LR A+G P NQVL+G++LF+TFFIM+PV DKIY +P+ +E++++
Sbjct: 64 FTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEKISM 123

Query: 128 QQAFDKGKEPLRAFMLGQVRTTDLKTFIDISGYQNINSPEEAPMSVLVPAFITSELKTAF 187
Q+A +KG +PLR FML Q R DL F ++ + PE PM +L+PA++TSELKTAF
Sbjct: 124 QEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELKTAF 183

Query: 188 QIGFMLFVPFLVLDLVVASILMAMGMMMLSPMIVSLPFKIMLFVLVDGWGLVMGTLANSF 247
QIGF +F+PFL++DLV+AS+LMA+GMMM+ P ++LPFK+MLFVLVDGW L++G+LA SF
Sbjct: 184 QIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLAQSF 243


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1293  TYPE3IMQPROT  47     1e-10    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 47.1 bits (112), Expect = 1e-10
Identities = 20/73 (27%), Positives = 39/73 (53%)

Query: 4 EALIDIFREALAVIVMMVSAIVLPGLGIGLIVAVFQAATSINEQTLSFLPRLIVTLLALM 63
+ L+ +AL +++++ + IGL+V +FQ T + EQTL F +L+ L L
Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61

Query: 64 VMGHWLVQTLMDF 76
++ W + L+ +
Sbjct: 62 LLSGWYGEVLLSY 74


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1294  TYPE3IMRPROT  124    1e-36    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 124 bits (314), Expect = 1e-36
Identities = 95/243 (39%), Positives = 142/243 (58%), Gaps = 1/243 (0%)

Query: 15 YMWPLFRVTSMLMVMVVFGATTTPTRVRLLLAVAITLAIAPVLPPVKDAELFSLSAVFIT 74
Y WPL RV +++ + + P RV+L LA+ IT AIAP LP D +FS A+++
Sbjct: 16 YFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLP-ANDVPVFSFFALWLA 74

Query: 75 AQQIIIGVAMGFVTQMVMQTFVLTGQIIGMQTSLGFASMVDPGSGQQTPVIGNFFLLLAT 134
QQI+IG+A+GF Q G+IIG+Q L FA+ VDP S PV+ +LA
Sbjct: 75 VQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLARIMDMLAL 134

Query: 135 LIFLAVDGHLLMIRMLVASFETLPISNQGLTLTSYRALAEWGSYMFGAALTMSLSAIIAL 194
L+FL +GHL +I +LV +F TLPI + L ++ AL + GS +F L ++L I L
Sbjct: 135 LLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIFLNGLMLALPLITLL 194

Query: 195 LLVNLSFGVMTRAAPQLNIFSIGFPITMIGGLLILWLTLTPVMAHFEEVWASAQLLLCDI 254
L +NL+ G++ R APQL+IF IGFP+T+ G+ ++ + + E +++ LL DI
Sbjct: 195 LTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFCEHLFSEIFNLLADI 254

Query: 255 LGL 257
+
Sbjct: 255 ISE 257


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1295  TYPE3IMSPROT  338    e-117    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 338 bits (868), Expect = e-117
Identities = 97/347 (27%), Positives = 178/347 (51%), Gaps = 2/347 (0%)

Query: 6 SGERSEEPTGRRLEQAREKGQVARSKELGTATVLLSAATGLYMLGPGIAKALSNVFERVF 65
SGE++E+PT +++ AR+KGQVA+SKE+ + ++++ + L L + S + +
Sbjct: 2 SGEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLM--LI 59

Query: 66 TMDRAAIFDTNQMFNVWGVVGSEIGWPLLKIMLLIVVVAFIGNVSLGGMNFSTQAMMPKA 125
+++ + + + V V E + ++ + ++A +V G S +A+ P
Sbjct: 60 PAEQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDI 119

Query: 126 SKMSPIAGFKRMFGVQALVELTKGIAKFSVVAIAAYLLLSHYFNDILLLSADHLPGNVHH 185
K++PI G KR+F +++LVE K I K +++I ++++ +L L +
Sbjct: 120 KKINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPL 179

Query: 186 ALDLLVWMFILLCSSVLVIVVIDVPFQIWNHNKQLKMTKQEVKDEYKDTEGKPEVKGRVR 245
+L + ++ +VI + D F+ + + K+LKM+K E+K EYK+ EG PE+K + R
Sbjct: 180 LGQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRR 239

Query: 246 QMQRELAQRRMMAEVPNADVIVVNPEHYAVAIKYDVKRSAAPFVIAKGVDEVAFKIREVA 305
Q +E+ R M V + V+V NP H A+ I Y + P V K D +R++A
Sbjct: 240 QFHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIA 299

Query: 306 RAHNIAIVSAPPLARAIYHTTKLEQQIPEGLFTAVAQVLAYVFQLRQ 352
+ I+ PLARA+Y ++ IP A A+VL ++ +
Sbjct: 300 EEEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNI 346


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr4_1296  HTHFIS  31     0.027    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.6 bits (69), Expect = 0.027
Identities = 25/158 (15%), Positives = 51/158 (32%), Gaps = 19/158 (12%)

Query: 485 VVDAATVVATHISQILTNNAAKLLGYEEVQQLMDMLAKHSPKLVDGFIPDV-MPLGNVVK 543
V D + T ++Q L+ + L +A LV + DV MP N
Sbjct: 8 VADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLV---VTDVVMPDENAFD 64

Query: 544 VMQNLLNEGVSVR--------DLRTIVQTL----LEYGTKSNDTEVLTAAVRIAL---KR 588
++ + + T ++ +Y K D L + AL KR
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 589 MIVQEISGPELEIPVITLAPELEQMLHQSMQATGGDGP 626
+ + +P++ + ++++ + D
Sbjct: 125 RPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLT 162


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr4_1297  PF05272  31     0.013    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.8 bits (69), Expect = 0.013
Identities = 9/25 (36%), Positives = 12/25 (48%)

Query: 238 VKQGGVVALVGPTGVGKTTSLAKLA 262
K V L G G+GK+T + L
Sbjct: 593 CKFDYSVVLEGTGGIGKSTLINTLV 617


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr4_1300  HTHFIS  90     3e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.9 bits (223), Expect = 3e-24
Identities = 33/105 (31%), Positives = 51/105 (48%), Gaps = 3/105 (2%)

Query: 6 KILIVDDFSTMRRIIKNLLRDLGFNNTQEADDGSTALPMLQKGDFDFVVTDWNMPGMQGI 65
IL+ DD + +R ++ L G++ + +T + GD D VVTD MP
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 66 DLLKAIRADDSLKHLPVLMVTAEAKREQIIAAAQAGVNGYVVKPF 110
DLL I+ LPVL+++A+ I A++ G Y+ KPF
Sbjct: 64 DLLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF 106


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr4_1302  PF06580  44     2e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 44.1 bits (104), Expect = 2e-06
Identities = 19/105 (18%), Positives = 37/105 (35%), Gaps = 23/105 (21%)

Query: 452 TLNKEIDLVLV---------GEETDLDKNLVEALADPLVH------LVRNSVDHGIEMPN 496
+L E+ +V + + + A+ D V LV N + HGI
Sbjct: 217 SLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGIA--- 273

Query: 497 DREASGKPRTGTITLSASQEGDHILLKIEDDGAGMDPEKLKQIAI 541
P+ G I L +++ + L++E+ G+ +
Sbjct: 274 -----QLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGT 313


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr4_1303  HTHFIS  64     2e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 64.5 bits (157), Expect = 2e-13
Identities = 28/135 (20%), Positives = 56/135 (41%), Gaps = 7/135 (5%)

Query: 2 AIKVLVVDDSSFFRRRVSEIVNQDPELEVIATASNGAEAVKMAAELNPQVITMDIEMPVM 61
+LV DD + R +++ +++ + SN A + A + ++ D+ MP
Sbjct: 3 GATILVADDDAAIRTVLNQALSR--AGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 62 DGITAVREIMAKCP-TPILMFSSLTHDGAKATLDALDAGALDFLPKRF--EDIATNKDDA 118
+ + I P P+L+ S+ + + A + GA D+LPK F ++ A
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSA--QNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 119 ILLLQQRVKALGRRR 133
+ ++R L
Sbjct: 119 LAEPKRRPSKLEDDS 133


80  Shewmr4_1505  Shewmr4_1511  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_1505   1      11        1.212125  hypothetical protein
Shewmr4_1506   1      11        1.463580  hypothetical protein
Shewmr4_1507  -1      14        1.010976  hypothetical protein
Shewmr4_1508   0      16        0.771996  hypothetical protein
Shewmr4_1509  -1      15       -0.441729  pyrroline-5-carboxylate reductase
Shewmr4_1510  -2      13       -0.423418  hypothetical protein
Shewmr4_1511  -2      15       -0.441134  pilus retraction ATPase PilT
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr4_1505  PF06580  28     0.045    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 27.9 bits (62), Expect = 0.045
Identities = 15/103 (14%), Positives = 35/103 (33%), Gaps = 7/103 (6%)

Query: 21 GFGLIKRKGLRTFVFIPLMINLVLFAAVIYVAIGQLDVLFTWMNAQLPEYLSWLNF---- 76
G+G+ L F F L + L + + +AI + ++ T + WL
Sbjct: 19 GWGVY---TLTGFGFASLYGSPKLHSMIFNIAISLMGLVLTHAYRSFIKRQGWLKLNMGQ 75

Query: 77 LLWPLAVTTMLVMLAFVFSSVMNWLAAPFNGLLAEKVEQLLTG 119
++ + +++ + + ++ W F L
Sbjct: 76 IILRVLPACVVIGMVWFVANTSIWRLLAFINTKPVAFTLPLAL 118


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Shewmr4_1506  GPOSANCHOR  44     4e-06    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 43.5 bits (102), Expect = 4e-06
Identities = 41/293 (13%), Positives = 103/293 (35%), Gaps = 9/293 (3%)

Query: 616 AKQDNSQSLVQLSKEQTELSEAIADCEQAKAIQQAKLDELAQQLAQVRDSLSQGTKRLHQ 675
A + + +L ++ + + + + L ++ + LS ++L +
Sbjct: 44 ATRSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRK 103

Query: 676 LQLDKATKSTQLNNAEIQAKQREAKRGQLAETLARTHAELAELAEQLILLAEQEDELAEA 735
+ K++++ E + E A++ L + LA ++ +L +A
Sbjct: 104 NDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKA 163

Query: 736 LEVSLEQQQQQSQDAQGDMARHQALKAQIGDAERRLTSLNASLQSIATRMAVSTEQIELQ 795
LE ++ S + A AL+A+ + E+ L + + ++ +
Sbjct: 164 LEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAAL 223

Query: 796 RVRVSELVHSKETLSA--QLANVAAQEGDQQTAQLSEQLAQLLNQQQSQQQALKSLRSQQ 853
R ++L + E + + + + A L + A+L + + ++
Sbjct: 224 AARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKI 283

Query: 854 SSLTETLNSIGLKQKQELGKLEGLTQSLSTLKLRREGLKGQADSQLAALSEQQ 906
+L + E LE +Q L+ R+ L+ D+ A + +
Sbjct: 284 KTLEAEKA----ALEAEKADLEHQSQVLNA---NRQSLRRDLDASREAKKQLE 329



Score = 42.7 bits (100), Expect = 7e-06
Identities = 50/283 (17%), Positives = 96/283 (33%), Gaps = 12/283 (4%)

Query: 227 KTHAELLVMRYQELQSQMASLSEQISSLELQQAAAQSLAQTGELESTELQLTLSHLAEQE 286
K L + L+ L+E++S+ + + + EL+ + L +
Sbjct: 70 KLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKAL 129

Query: 287 QQAVEAYYLTGTEIAKLEQQLQSQKLRDAQLHNQLEQLSEQIIQNQAKLAAYQASFQALE 346
+ A+ +I LE + + R A L LE + AK+ +A ALE
Sbjct: 130 EGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALE 189

Query: 347 AELSQLGPQHELQQEMLDELQAQWEMSVSRSEAQSESARVLAAAVAQHKLQLELHRSKLA 406
A ++L E A+ + + A + L A+ +K+
Sbjct: 190 ARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIK 249

Query: 407 HQQQLNAHKTLLHQEQQQELASLNAHALEDSSASLNDEIAQLEQALAEQVEINQEFESTL 466
+ A + +Q EL E + + A+++ AE+ + E
Sbjct: 250 TLEAEKAAL----EARQAELEKAL----EGAMNFSTADSAKIKTLEAEKAALEAE----K 297

Query: 467 AADTHALDLARGEFEQLSQRLTSMRARFELVEQWLAKQEELSD 509
A H + + L + L + R + +E K EE +
Sbjct: 298 ADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNK 340



Score = 40.4 bits (94), Expect = 4e-05
Identities = 29/182 (15%), Positives = 68/182 (37%), Gaps = 7/182 (3%)

Query: 188 RENLERLGDIRSELAKQLEKLSQQAKAAKQYRELKQAERKTHAELLVMRYQELQSQMASL 247
+ +LE+ + + + +A K E +QAE + E + +++ +L
Sbjct: 157 KADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTL 216

Query: 248 SEQISSLELQQAAAQSLAQTGELESTELQLTLSHLAEQEQQAVEAYYLTGTEIAKLEQQL 307
+ ++L ++A + + ST + L ++ A+LE+ L
Sbjct: 217 EAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEA-------RQAELEKAL 269

Query: 308 QSQKLRDAQLHNQLEQLSEQIIQNQAKLAAYQASFQALEAELSQLGPQHELQQEMLDELQ 367
+ +++ L + +A+ A + Q L A L + +E +L+
Sbjct: 270 EGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLE 329

Query: 368 AQ 369
A+
Sbjct: 330 AE 331



Score = 39.3 bits (91), Expect = 1e-04
Identities = 43/265 (16%), Positives = 92/265 (34%), Gaps = 15/265 (5%)

Query: 146 QGTISRLIESKPQDLRTFIEEAAGISRYKERRRETENRIRHTRENLERLGDIRSELAKQL 205
+S E ++ ++ E+A+ I + R+ + E + L +
Sbjct: 91 TEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEK 150

Query: 206 EKLSQQAKAAKQYRELKQAERKTHAELLVMRYQELQSQMASLSEQISSLELQQAAAQSLA 265
L+ + ++ E + + + L+++ A+L + + LE A + +
Sbjct: 151 AALAARKADLEKALEGAMNFSTADSA----KIKTLEAEKAALEARQAELEKALEGAMNFS 206

Query: 266 QTGELESTELQLTLSHLAEQEQQAVEAYYLTGTEIAKLEQQLQSQKLRDAQLHNQLEQLS 325
+ L+ + LA ++ +A ++++ + A L + +L
Sbjct: 207 TADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELE 266

Query: 326 EQIIQNQAKLAAYQASFQALEAELSQLGPQHELQQEMLDELQAQWEMSVSRSEAQSESAR 385
+ + A A + LEAE + L+ +A E A +S R
Sbjct: 267 KALEGAMNFSTADSAKIKTLEAE-----------KAALEAEKADLEHQSQVLNANRQSLR 315

Query: 386 VLAAAVAQHKLQLELHRSKLAHQQQ 410
A + K QLE KL Q +
Sbjct: 316 RDLDASREAKKQLEAEHQKLEEQNK 340


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr4_1508  HTHTETR  35     4e-04    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 35.4 bits (81), Expect = 4e-04
Identities = 16/106 (15%), Positives = 38/106 (35%), Gaps = 7/106 (6%)

Query: 504 MLDRMGMKSATNLAQAIEAAKTTTLPRFLYALGIREVGEATASNLAT---HFGSLEALRV 560
M + ++ ++ A + + + + E+ +A HF L
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 561 ATIEQLIQVEDIGEVVAQHVAHFFAQPHNL--EVIDALIAAGVNWP 604
E +IGE+ ++ A F P ++ E++ ++ + V
Sbjct: 61 EIWEL--SESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEE 104


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1511  DHBDHDRGNASE  28     0.039    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 28.1 bits (62), Expect = 0.039
Identities = 10/51 (19%), Positives = 19/51 (37%), Gaps = 1/51 (1%)

Query: 1 MELSVVNQKSVAVVGCG-WFGFALAKHLVQAGYRVTGAKRHTEELAPLTEA 50
M + K + G G A+A+ L G + + E+L + +
Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSS 51


81  Shewmr4_1622  Shewmr4_1629  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_1622  -2      14        0.371866  two component, sigma54 specific, Fis family
Shewmr4_1623  -2      12        0.703066  flagellar hook-basal body complex subunit FliE
Shewmr4_1624  -1      11        0.218969  flagellar MS-ring protein
Shewmr4_1625  -1      10       -0.055152  flagellar motor switch protein G
Shewmr4_1626  -1      11       -0.026501  flagellar assembly protein H
Shewmr4_1627   0      14        0.499238  flagellum-specific ATP synthase
Shewmr4_1628   2      20        0.217766  flagellar export protein FliJ
Shewmr4_1629   5      30       -1.016049  flagellar hook-length control protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr4_1622  TCRTETB  65     2e-13    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 64.9 bits (158), Expect = 2e-13
Identities = 67/395 (16%), Positives = 140/395 (35%), Gaps = 37/395 (9%)

Query: 31 SKQRDTRLMWALCVASVVVYINLYLMQGMLPLIAEHFAVSGSKATLILSVTSFSLAFSLL 90
S R +++ LC+ S +N ++ LP IA F + + + + +
Sbjct: 8 SNLRHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTA 67

Query: 91 IYAVVSDRIGRHTPIVVSLWLLALSNLL-LIWAGDFNALVYVRFLQGVLLAAVPAIAMAY 149
+Y +SD++G ++ + + +++ + F+ L+ RF+QG AA PA+ M
Sbjct: 68 VYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVV 127

Query: 150 FKEQLSPSTMLKAAGIYIMANSIGGIVGRLLGGVMSQFLSWQESMWLLFLVTLAGVALTS 209
+ KA G+ ++G VG +GG+++ ++ W + L+ ++T+ V
Sbjct: 128 VARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS-YLLLIPMITIITVPFLM 186

Query: 210 YLLPSGA---------DAQAVSGGQTTSPTLSKRARLL-------------QDIYGFSHH 247
LL +S G + + + I +
Sbjct: 187 KLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDP 246

Query: 248 LTDPQM------RLAYAIGGITFMMMVNQFSFIQLHLMAAPYEWSRFQA--TLIFLCYSS 299
DP + + GGI F + S + +M ++ S + +IF S
Sbjct: 247 FVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPY-MMKDVHQLSTAEIGSVIIFPGTMS 305

Query: 300 GTVASYFTAKWLAKFGQHKLYQWSWCLMLLGSL---LTLFDTPVTICLGFLMTACGFFLT 356
+ Y + + G + + + L L T + + + G T
Sbjct: 306 VIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFT 365

Query: 357 HSCCNSFVAMRAS-RDRAKATSLYLCCYYLGAALG 390
+ ++ V+ ++ SL +L G
Sbjct: 366 KTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTG 400


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1627  ACRIFLAVINRP  503    e-164    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 503 bits (1298), Expect = e-164
Identities = 215/1047 (20%), Positives = 451/1047 (43%), Gaps = 72/1047 (6%)

Query: 3 LTRLAIKRPVTTSMFFFAILLFGLASSRLLPLEMFPGIDIPQIVVQVPYKGSTPAEVERD 62
+ I+RP+ + +++ G + LP+ +P I P + V Y G+ V+
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 63 ITKVLEESLATMGGIDELESESSQEG-AEIEINMKWGENVATKSLEAREKIDAVRHLLPK 121
+T+V+E+++ + + + S S G I + + G + ++ + K+ LLP+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 122 DVERVFIRQFSTADMPVLTIRISSDRELSGAFDLLD---KQLKRPLERVEGVSKVNLYGV 178
+V++ I ++ ++ SD + D+ D +K L R+ GV V L+G
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG- 179

Query: 179 EQKQIEVRINANRLAASGYSATELQTRLGRENFVLSAGTL------RESNLVYQVSPKGE 232
Q + + ++A+ L + ++ +L +N ++AG L L + +
Sbjct: 180 AQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 233 FRNLEDIKALVLLPGLT-----LGDVANVQFALPERVEGRHLDKHYAVGLDVFKESGANL 287
F+N E+ + L L DVA V+ ++ A GL + +GAN
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 288 VEVSDRVLKVIELAKQDQQF-QGIRLFIMEDQASGVKSSLSDLLLSGLIGALLSFIVLYL 346
++ + + +LA+ F QG+++ D V+ S+ +++ + +L F+V+YL
Sbjct: 300 LDTAKAIKA--KLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYL 357

Query: 347 FLRNFKMTLIVVSSVPISIGMTLAAMYLLGYSLNILSMMGLLLAVGMLIDNAVVVTESVL 406
FL+N + TLI +VP+ + T A + GYS+N L+M G++LA+G+L+D+A+VV E+V
Sbjct: 358 FLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVE 417

Query: 407 QEKQGKDTKSAEDNENAVMTGVDKVSLAVLAGTMTTAIVFLPNIFGVKVELTIFLEHVAI 466
+ E A + ++ A++ M + VF+P F I+ + +I
Sbjct: 418 RVMMEDKLPPKE----ATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQ-FSI 472

Query: 467 AICISLAASLLVAKTLIPLMLTKFHFDIAPEKAPGK-------------LQNFYNRSLNW 513
I ++A S+LVA L P + ++ E K N Y S+
Sbjct: 473 TIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGK 532

Query: 514 VLLRPWRSGLISVAILVSTALPLSMVKQDQEDSQSKERIYINYQVEGRHNLNVTEAMVSQ 573
+L R LI I+ + + + + Q+ T+ ++ Q
Sbjct: 533 ILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQ 592

Query: 574 MEEYLYNNKEEFHIDSVYSYY-------APDDASSVILLK--KDLPMPLDELKKKIRSGF 624
+ +Y N++ +++SV++ A + + + LK ++ + + I
Sbjct: 593 VTDYYLKNEKA-NVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAK 651

Query: 625 PKFSIAKPQFGWGNDNSGVRVTLTGRST--------------SELIHLSEQVLPLLS-NI 669
+ K + G+ + + G +T L Q+L + + +
Sbjct: 652 MEL--GKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHP 709

Query: 670 KGLVDVRSEVNGAQQEVVIRINRQMAARLDLKLNEVASSISMALRGSPLRSFRHDPNGEL 729
LV VR + + ++++ A L + L+++ +IS AL G+ + F D
Sbjct: 710 ASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFI-DRGRVK 768

Query: 730 RIEMAYEKEWQKSLEKLKQLPIVRIDQRLYTLDNLASIEILPRFDTIKHYNRQTSLSIGA 789
++ + + +++ E + +L + + + + + ++ YN S+ I
Sbjct: 769 KLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQG 828

Query: 790 NLDK-LTTEEAQTKIKQVMENVRFPNGYNYSLRGGFERQDEDESVMAINMLLAIAMIYIV 848
++ +A ++ + + P G Y G ++ + + ++ ++++
Sbjct: 829 EAAPGTSSGDAMALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLC 886

Query: 849 MAALFESLLLPTAIITSILFSITGVFWALLLTGTPMSVMAMIGILILMGIVVNNGIVLVD 908
+AAL+ES +P +++ + I GV A L V M+G+L +G+ N I++V+
Sbjct: 887 LAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVE 946

Query: 909 QINQVTPELDK-LSDTIREVCITRLRPVLMTVGTTVLGLVPLAMGETQIGGGGPPYSPMA 967
+ + K + + RLRP+LMT +LG++PLA+ G G + +
Sbjct: 947 FAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAIS---NGAGSGAQNAVG 1003

Query: 968 IAIIGGLSFSTVTSLYLVPLCYQLLYR 994
I ++GG+ +T+ +++ VP+ + ++ R
Sbjct: 1004 IGVMGGMVSATLLAIFFVPVFFVVIRR 1030


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1628  ACRIFLAVINRP  666    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 666 bits (1721), Expect = 0.0
Identities = 262/1094 (23%), Positives = 474/1094 (43%), Gaps = 100/1094 (9%)

Query: 7 SVKRPVTVWMFMLAIMLFGMVGFSRLAVKLLPDLSYPTLTIRTMYDGAAPVEVEQLVSKP 66
++RP+ W+ + +M+ G + +L V P ++ P +++ Y GA V+ V++
Sbjct: 5 FIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQV 64

Query: 67 IEEAVGVVKGLRKISSISRS-GMSDVVLEFEWGTTMDMASLDVREKLDTI--ALPLDVKK 123
IE+ + + L +SS S S G + L F+ GT D+A + V+ KL LP +V++
Sbjct: 65 IEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQ 124

Query: 124 PLLLRFNPNLDPIMRLALSVPNASEAELKQMRTYAEEELKRRLEALSGVAAVRLSGGLEQ 183
+ + +M N + Y +K L L+GV V+L G +
Sbjct: 125 QGISVEKSSSSYLMVAGFVSDNPGT-TQDDISDYVASNVKDTLSRLNGVGDVQLFGA-QY 182

Query: 184 EVHIQLNQEKLSQLNLNADDIKRRINEENINLSAGKVIQGD------REYLVRTLNQFNS 237
+ I L+ + L++ L D+ ++ +N ++AG++ + +F +
Sbjct: 183 AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKN 242

Query: 238 LEELGQVIVYRDAQ-TLVRLFEVATITDAFKERSDITRIGSQESIELAIYKEGDANTVAV 296
EE G+V + ++ ++VRL +VA + + + I RI + + L I AN +
Sbjct: 243 PEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDT 302

Query: 297 AKKLRDELVKINQD-PKQNKLEVIYDQSEFIESAVSEVTSSALMGSILSMLVIYLFLRNI 355
AK ++ +L ++ P+ K+ YD + F++ ++ EV + +L LV+YLFL+N+
Sbjct: 303 AKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNM 362

Query: 356 IPTLIISISIPFSVIATFNMMYFADISLNIMSLGGIALAIGLLVDNAIVVLENIDRC-RS 414
TLI +I++P ++ TF ++ S+N +++ G+ LAIGLLVD+AIVV+EN++R
Sbjct: 363 RATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMME 422

Query: 415 EGMSKLDAAVTGTKEVAGAIFASTMTTLAVFVPLVFVDGIAGALFSDQALTVTFALLASL 474
+ + +A ++ GA+ M AVF+P+ F G GA++ ++T+ A+ S+
Sbjct: 423 DKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSV 482

Query: 475 LVALTSIPMLASREGFTALPELIKKTPKEKPTTKLGKLKHYSATVFSFPIVLIFSYLPSV 534
LVAL P L + L+K E K
Sbjct: 483 LVALILTPALCAT--------LLKPVSAEHHENK-------------------------- 508

Query: 535 LLTLALVIGRFFSWLLGLVMRPLSSGFNFVYHAIESVYHKLLAMALRKQVATLLLTIGIT 594
G FF W FN + + Y + L LL+ I
Sbjct: 509 --------GGFFGW------------FNTTFDHSVNHYTNSVGKILGSTGRYLLIYALIV 548

Query: 595 GACISLLPRLGMELIPPMNQGEFYVEILLPPGTAVGETDKVLQQLAMSI--KDRPEVKHA 652
+ L RL +P +QG F I LP G T KVL Q+ ++ V+
Sbjct: 549 AGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNEKANVESV 608

Query: 653 YSQAGSGGLMTSDTARGGENWGRLQVVL---SDHTAYHQVTQVLRDTARRIPELEAKIEQ 709
++ G + +N G V L + + + A+ KI
Sbjct: 609 FTVNGFS------FSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMEL---GKIRD 659

Query: 710 PELFSFKTPLEIEL---SGYDLHLLKRSADNLVKALSASDRFA-----------DVNTSL 755
+ F P +EL +G+D L+ ++ A ++ V +
Sbjct: 660 GFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNG 719

Query: 756 RDGQPELSIRFDHARLAALGMDAPTVANRIAQRVGGTVASQYTVRDRKIDILVRSELDER 815
+ + + D + ALG+ + I+ +GGT + + R R + V+++ R
Sbjct: 720 LEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFR 779

Query: 816 DQISDIDALIINPNSQQPIALSAVAEVSLQLGPSAINRISQQRVALVSANLAYG-DLSDA 874
D+D L + + + + SA G + R + + A G DA
Sbjct: 780 MLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDA 839

Query: 875 VAEAQQILSAQVLPASVQARFGGQNEEMEHSFQSLKIALILAVFLVYLVMASQFESLLHP 934
+A + + S LPA + + G + + S + ++ +V+L +A+ +ES P
Sbjct: 840 MALMENLASK--LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIP 897

Query: 935 LLILFAVPMALAGSVLGLYITQTHLSVVVFIGLIMLAGIVVNNAIVLVDRINQL-RTEGV 993
+ ++ VP+ + G +L + V +GL+ G+ NAI++V+ L EG
Sbjct: 898 VSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGK 957

Query: 994 DKLEAIKVAAKSRLRPIMMTTLTTTLGLLPMALGLGDGSEVRAPMAITVIFGLSLSTLLT 1053
+EA +A + RLRPI+MT+L LG+LP+A+ G GS + + I V+ G+ +TLL
Sbjct: 958 GVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLA 1017

Query: 1054 LIVIPVLYALFDRK 1067
+ +PV + + R
Sbjct: 1018 IFFVPVFFVVIRRC 1031


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Shewmr4_1629  RTXTOXIND  41     6e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 41.0 bits (96), Expect = 6e-06
Identities = 13/53 (24%), Positives = 29/53 (54%)

Query: 72 GLIEAINVEEGDRVQKGQILAVIDAKRQQYDLDRSEAEVKIIEQELNRLKKMS 124
+++ I V+EG+ V+KG +L + A + D ++++ + E R + +S
Sbjct: 105 SIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILS 157



Score = 40.2 bits (94), Expect = 9e-06
Identities = 36/202 (17%), Positives = 80/202 (39%), Gaps = 24/202 (11%)

Query: 91 LAVIDAKRQ----QYDLDRSEAEVKIIEQELNRLK---KMSNKEFIS--ADSMAKLEYNL 141
AV++ + + +L +++++ IE E+ K ++ + F + D + + N+
Sbjct: 252 HAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNI 311

Query: 142 QAAIAKRDLAELQVKESHVVSPINGIIAKRYVKAGNMAKEFGD-LFYIV-NQDELHGIVH 199
+ E + + S + +P++ + + V + L IV D L
Sbjct: 312 GLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTAL 371

Query: 200 LPEQQLTSLRLGQEAQV-FS--NQQSKNAIHAKVLRISP--VVDPQSGT-FKVTLAVP-- 251
+ + + + +GQ A + + KV I+ + D + G F V +++
Sbjct: 372 VQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEEN 431

Query: 252 -----NQNAHLKAGMFTRVELK 268
N+N L +GM E+K
Sbjct: 432 CLSTGNKNIPLSSGMAVTAEIK 453


82  Shewmr4_1936  Shewmr4_1943  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_1936   0      14        0.800692  hypothetical protein
Shewmr4_1937   1      12        0.814732  hypothetical protein
Shewmr4_1938   1      12        0.446107  heat shock protein DnaJ domain-containing
Shewmr4_1939   1      13       -0.008150  ****23S rRNA m(2)G2445 methyltransferase
Shewmr4_1940   1      13       -0.397381  glutaredoxin 2
Shewmr4_1941   0      15       -1.851212  ABC transporter ATPase
Shewmr4_1942   0      17       -2.819586  hypothetical protein
Shewmr4_1943   0      22       -4.148943  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr4_1936  TCRTETB  66     9e-14    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 65.7 bits (160), Expect = 9e-14
Identities = 47/196 (23%), Positives = 78/196 (39%), Gaps = 16/196 (8%)

Query: 24 QLLFMLVFMVACGQMAQTIFVPALPLIAQGLAVDASKLQALMACYLLAYGLCQFIYGPLS 83
Q+L L + + + + +LP IA + + ++L + + +YG LS
Sbjct: 14 QILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLS 73

Query: 84 DRVGRKIPLLIGIGIFIVGAFMAA-EATSFSQLIFASLLQGLGTA-----SAGALCRSIP 137
D++G K LL GI I G+ + + FS LI A +QG G A + R IP
Sbjct: 74 DQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIP 133

Query: 138 RDHYFGDNLVKFNSYVSMAVVFLPLVAPFLGSFASAYFGWQAVYWVLFGFGSFVWLLICF 197
+++ G S V+M P + G + Y W Y +L + + +
Sbjct: 134 KENR-GKAFGLIGSIVAMGEGVGPAI----GGMIAHYIHWS--YLLLIPMITIITVPFLM 186

Query: 198 GFKESLPKEKREQQPI 213
L KE R +
Sbjct: 187 KL---LKKEVRIKGHF 199


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Shewmr4_1939  RTXTOXIND  45     3e-07    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 45.2 bits (107), Expect = 3e-07
Identities = 28/147 (19%), Positives = 63/147 (42%), Gaps = 10/147 (6%)

Query: 111 VNRLKAKLSSQQAIVDKAERDVKRLKPLYEQDAASQLDYDNALSTLAQARSNLTASRAEV 170
+L ++ +++ E ++ K Y+ +QL + L L Q N+ E+
Sbjct: 261 YVEAVNELRVYKSQLEQIESEILSAKEEYQL--VTQLFKNEILDKLRQTTDNIGLLTLEL 318

Query: 171 EEAELELSYTEIKAPIAGLVSRSEV-DIGALVGSKGQSLLTRVKQVDPIYVSFNMSALDY 229
+ E + I+AP++ V + +V G +V + ++L+ V + D + V+ + D
Sbjct: 319 AKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTT-AETLMVIVPEDDTLEVTALVQNKDI 377

Query: 230 ------LNAQRRLTSYSAKKEAEVEGK 250
NA ++ ++ + + GK
Sbjct: 378 GFINVGQNAIIKVEAFPYTRYGYLVGK 404



Score = 37.1 bits (86), Expect = 1e-04
Identities = 21/176 (11%), Positives = 60/176 (34%), Gaps = 14/176 (7%)

Query: 73 EVRARVDGFVEEKRFVEGSAVKAGELLYQIDNKPYVAVVNRLKAKLSSQ-------QAIV 125
E++ + V+E EG +V+ G++L ++ A + ++ L Q +
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILS 157

Query: 126 DKAERDVKRLKPLYEQDAASQLDYDNALSTLAQARSNLTASRAEVEEAELELSYTEIKAP 185
E + L ++ + + L + + + + + + EL+ + +A
Sbjct: 158 RSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQK--YQKELNLDKKRAE 215

Query: 186 IAGLVSRSEVDIGALVGSKGQSLLTRVKQVDPIYVSFNMSALDYLNAQRRLTSYSA 241
+++R I + +R+ + ++ L + +
Sbjct: 216 RLTVLAR----INRYENLS-RVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVN 266


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1940  ACRIFLAVINRP  965    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 965 bits (2497), Expect = 0.0
Identities = 426/1028 (41%), Positives = 618/1028 (60%), Gaps = 9/1028 (0%)

Query: 1 MAQFFINRPIFASVISIVIVLLGVIAMFKLPVDQYPYITPPQVTISASYPGASSTTAAES 60
MA FFI RPIFA V++I++++ G +A+ +LPV QYP I PP V++SA+YPGA + T ++
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VATPLEQEVNGVPNMIYMSSKSTNSGSTSVTITFDVGTNADLAAVDVQNSAQQASGGLPI 120
V +EQ +NG+ N++YMSS S ++GS ++T+TF GT+ D+A V VQN Q A+ LP
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 DVQTEGVTVSKDASVELLKLALTSNDERFDEIYLSNYATINIESALKRIPGVGRTRNTGS 180
+VQ +G++V K +S L+ S++ + +S+Y N++ L R+ GVG + G+
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 RSYAMRIWLKPDAMAGYSLTTTDVINAIKAQNKESPAGTIGTQPNNDDISLTLPISVAGR 240
+ YAMRIWL D + Y LT DVIN +K QN + AG +G P L I R
Sbjct: 181 Q-YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 241 LSSVQAFNEIIVRANPDGSIIRLRDIAGVELGSSAYTLQSQLNGENATILQVYLLPGANA 300
+ + F ++ +R N DGS++RL+D+A VELG Y + +++NG+ A L + L GANA
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 301 LEVTHKVKQAMAELSQKFPQGMKWEVFYDASIFIQESIDEVIHTLIEALVLVVLVVYLFL 360
L+ +K +AEL FPQGMK YD + F+Q SI EV+ TL EA++LV LV+YLFL
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 361 QNVRATLIPAIAVPVSLIGTLAAMLAFGFTINTVSLLALVLAIGIVVDDAIVVVENVERL 420
QN+RATLIP IAVPV L+GT A + AFG++INT+++ +VLAIG++VDDAIVVVENVER+
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 421 IHEKGMSAIDATRIAMKELSGALVATSLVLCAVFVPVSFLAGITGIMYREFAVAITVAVL 480
+ E + +AT +M ++ GALV ++VL AVF+P++F G TG +YR+F++ I A+
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 481 ISTLVALTLSPALCALLLKPSKAP----QRGFFHWLNRKLDLGTNQYVGLVALTNKYAKR 536
+S LVAL L+PALCA LLKP A + GFF W N D N Y V R
Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 537 SYLAFAIMFGGTYFIMSHLPSSFMPDEDQGRFFIDMTLPDGSTVNRTEAILKKAEQYVRA 596
L +A++ G + LPSSF+P+EDQG F + LP G+T RT+ +L + Y
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 597 NPAV-AYSFTLAGENRRSGANQANGQFEVVLKPWAEREASHATVQSVMKAIDKDLKNVLE 655
N S SG Q G V LKPW ER + ++V+ +L + +
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659

Query: 656 AEFNLYLPSAVPGLGNGSGVEMQLQDTSGTHFDGLIETANELVEQLKLQP-EVASASVSL 714
+ A+ LG +G + +L D +G D L + N+L+ P + S +
Sbjct: 660 GFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNG 719

Query: 715 QSAIPQLHLTVDEAKAMAIGVNVGDIYSTIKTLTDSSTVNDFNLFGRVYRVKIQAEESYR 774
Q L VD+ KA A+GV++ DI TI T + VNDF GRV ++ +QA+ +R
Sbjct: 720 LEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFR 779

Query: 775 QFPHQIKDYYVRSSNGAMVPIGVLAKYDYTVGPSSVTHYNLFSSASINVTPATGYATGDV 834
P + YVRS+NG MVP + G + YN S I A G ++GD
Sbjct: 780 MLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDA 839

Query: 835 IQAIERVATPILPDEFKYEWTGITYQEVQSANQTGIAIGLALLFVFLFLAALYESWSIPV 894
+ +E +A+ LP Y+WTG++YQE S NQ + ++ + VFL LAALYESWSIPV
Sbjct: 840 MALMENLASK-LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPV 898

Query: 895 AVLLIAPIALLGAAVTTLISGMQSNLFFQVAFIALIGMAAKNAILIVEFANQLH-QQGRT 953
+V+L+ P+ ++G + + +++++F V + IG++AKNAILIVEFA L ++G+
Sbjct: 899 SVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKG 958

Query: 954 RISAALEAATMRFRPILMTSMAFILGVLPLVLSEGPGAVSRQSISLPILGGMVLATTIGI 1013
+ A L A MR RPILMTS+AFILGVLPL +S G G+ ++ ++ + ++GGMV AT + I
Sbjct: 959 VVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAI 1018

Query: 1014 VFVPLFFV 1021
FVP+FFV
Sbjct: 1019 FFVPVFFV 1026



Score = 110 bits (276), Expect = 1e-26
Identities = 74/513 (14%), Positives = 184/513 (35%), Gaps = 37/513 (7%)

Query: 5 FINRPIFASVISIVIVLLGVIAMFKLPVDQYPYITPPQVTISASYPGASSTTAAESVATP 64
+ +I +IV V+ +LP P P ++ + V
Sbjct: 533 ILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQ 592

Query: 65 LEQ-----EVNGVPNMIYMSSKSTNSGSTSVTITF------DVGTNADLAAVDVQNSAQQ 113
+ E V ++ ++ S + + + + F + + +A V + A+
Sbjct: 593 VTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKM 652

Query: 114 ASGGLP--------IDVQTEGVTVSKDASVELLKLALTSNDERFDEIYLSNYATINIESA 165
G + + E T + EL+ A +D L+ + A
Sbjct: 653 ELGKIRDGFVIPFNMPAIVELGTATG-FDFELIDQAGLGHDA------LTQARNQLLGMA 705

Query: 166 LKRIPGVGRTRNTGS-RSYAMRIWLKPDAMAGYSLTTTDVINAIKAQNKESPAGTIGTQP 224
+ + R G + ++ + + ++ +D+ I + +
Sbjct: 706 AQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRG 765

Query: 225 NNDDISLTLPISVAGRLSSVQAFNEIIVRANPDGSIIRLRDIAGVELGSSAYTLQSQLNG 284
+ + A + +++ VR + +G ++ + L + NG
Sbjct: 766 RVKKLYVQAD---AKFRMLPEDVDKLYVR-SANGEMVPFSAFTTSHWVYGSPRL-ERYNG 820

Query: 285 ENATILQVYLLPGANALEVTHKVKQAMAELSQKFPQGMKWEVFYDASIFIQESIDEVIHT 344
+ +Q PG ++ + ++ ++L P G+ + S + S ++
Sbjct: 821 LPSMEIQGEAAPGTSSGDAMALMENLASKL----PAGI-GYDWTGMSYQERLSGNQAPAL 875

Query: 345 LIEALVLVVLVVYLFLQNVRATLIPAIAVPVSLIGTLAAMLAFGFTINTVSLLALVLAIG 404
+ + V+V L + ++ + + VP+ ++G L A F + ++ L+ IG
Sbjct: 876 VAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIG 935

Query: 405 IVVDDAIVVVENVERLIHEKGMSAIDATRIAMKELSGALVATSLVLCAVFVPVSFLAGIT 464
+ +AI++VE + L+ ++G ++AT +A++ ++ TSL +P++ G
Sbjct: 936 LSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAG 995

Query: 465 GIMYREFAVAITVAVLISTLVALTLSPALCALL 497
+ + ++ +TL+A+ P ++
Sbjct: 996 SGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVI 1028



Score = 74.9 bits (184), Expect = 1e-15
Identities = 79/501 (15%), Positives = 173/501 (34%), Gaps = 37/501 (7%)

Query: 539 LAFAIMFGGTYFIMSHLPSSFMPDEDQGRFFIDMTLPDGSTVNRTEAILKKAEQYVRANP 598
LA +M G I+ LP + P + P + + + EQ +
Sbjct: 15 LAIILMMAGALAILQ-LPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQVIEQNMNGID 73

Query: 599 AVAYSFTLAGENRRSGANQANGQFEVVLKP-WAEREASHATVQSVMKAIDKDLKNVLEAE 657
+ Y ++ + +G+ F+ P A+ + +Q + ++++
Sbjct: 74 NLMY---MSSTSDSAGSVTITLTFQSGTDPDIAQVQVQ-NKLQLATPLLPQEVQ------ 123

Query: 658 FNLYLPSAVPGLGNGSGVEMQLQDTSGTHFDGLIETANELVEQLKLQPEVAS--ASVSLQ 715
+ + S M S + ++ + +K + V L
Sbjct: 124 -----QQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLF 178

Query: 716 SAIPQLHLTVDEAKAMAIGVNVGDIYSTIKTLTDSSTVNDFNLFGRVYRVKIQA---EES 772
A + + +D + D+ + +K D + ++ A ++
Sbjct: 179 GAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQT 238

Query: 773 YRQFPHQIKDYYVRSS-NGAMVPIGVLAK-YDYTVGPSSVTHYNLFSSASINVTPATGYA 830
+ P + +R + +G++V + +A+ + + N +A + + ATG
Sbjct: 239 RFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGAN 298

Query: 831 TGDVIQAI-ERVAT--PILPDEFKYEWTGITYQEVQSANQT-----GIAIGLALLFVFLF 882
D +AI ++A P P K + T VQ + AI L L ++LF
Sbjct: 299 ALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLF 358

Query: 883 LAALYESWSIPVAVLLIAPIALLGAAVTTLISGMQSNLFFQVAFIALIGMAAKNAILIVE 942
L ++ + + P+ LLG G N + IG+ +AI++VE
Sbjct: 359 L----QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVE 414

Query: 943 FANQLHQQGRTRISAALEAATMRFR-PILMTSMAFILGVLPLVLSEGPGAVSRQSISLPI 1001
++ + + A E + + + ++ +M +P+ G + S+ I
Sbjct: 415 NVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITI 474

Query: 1002 LGGMVLATTIGIVFVPLFFVT 1022
+ M L+ + ++ P T
Sbjct: 475 VSAMALSVLVALILTPALCAT 495


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_1943  SURFACELAYER  32     0.003    Lactobacillus surface layer protein signature.
		>SURFACELAYER#Lactobacillus surface layer protein signature.

Length = 439

Score = 32.3 bits (73), Expect = 0.003
Identities = 14/55 (25%), Positives = 27/55 (49%), Gaps = 1/55 (1%)

Query: 246 SEPFAAYNAVKDWLNESKITEGHLFRSISRDGKTLRPYQVSDNVT-SKSSLIRNS 299
++ YN V +N +K+ G + + +GK Y +DN+ +K +L N+
Sbjct: 331 TDKVTRYNTVTVAMNTTKLANGISYYEVIENGKATGKYINADNIDGTKRTLKHNA 385


83  Shewmr4_2087  Shewmr4_2102  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_2087  2       17       0.167157   hypothetical protein
Shewmr4_2088  2       16       0.361489   hypothetical protein
Shewmr4_2089  2       16       0.075201   paraquat-inducible protein A
Shewmr4_2090  2       15       -1.571815  paraquat-inducible protein A
Shewmr4_2091  2       15       -2.046353  YebG family protein
Shewmr4_2092  1       17       -3.760361  putative GAF sensor protein
Shewmr4_2093  3       17       -1.591156  putative solute/DNA competence effector
Shewmr4_2094  2       17       -1.335181  carboxy-terminal protease
Shewmr4_2095  2       17       -1.143177  carboxy-terminal protease
Shewmr4_2096  1       17       0.675803   aminopeptidase N
Shewmr4_2097  1       17       0.711010   hypothetical protein
Shewmr4_2098  1       18       0.771788   hypothetical protein
Shewmr4_2099  0       15       -0.468550  hypothetical protein
Shewmr4_2100  -1      14       -0.603033  hypothetical protein
Shewmr4_2101  0       16       -0.408983  hypothetical protein
Shewmr4_2102  0       16       -0.338602  BNR repeat-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr4_2087  HTHFIS  69     2e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 69.1 bits (169), Expect = 2e-14
Identities = 36/199 (18%), Positives = 70/199 (35%), Gaps = 19/199 (9%)

Query: 255 KILLVDDQQSMVDYFSSLLRSHGLMVKGMTKPEQVLPTLEQFEPDLFIFDLYMPDVNGLE 314
IL+ DD ++ + L G V+ + + + + DL + D+ MPD N +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 315 LAKMIRQLDKYSSSPILVLSSDDTMQNKVSIIQAGSDDLISKQTAP--SLFVTQVISRAQ 372
L I++ P+LV+S+ +T + + G+ D + K + +
Sbjct: 65 LLPRIKKARPD--LPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 373 RGHDIRSSASRDSLTGLLNHTQILVAARRCFNLAKRINSSVCIAMLDLDHFKQVNDTYGH 432
+ + L+ + + R + + ++ I G
Sbjct: 123 KRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMIT--------------GE 168

Query: 433 SGGDKVLLAFAHLLQQSLR 451
SG K L+A A L R
Sbjct: 169 SGTGKELVARA-LHDYGKR 186



Score = 56.0 bits (135), Expect = 2e-10
Identities = 26/123 (21%), Positives = 54/123 (43%), Gaps = 1/123 (0%)

Query: 131 RIAIIEDDNNVGAMITKQLHEFGFNVQHFLNFTDFLEIQNTSPFDLILLDLILPDYTEEA 190
I + +DD + ++ + L G++V+ N DL++ D+++PD
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 191 LFTAATEFEKHNTRVFVLSSRGDFEMRLLAIRANVSEYFVKPAETTLLVRKIHQWLKMSE 250
L + + + V V+S++ F + A +Y KP + T L+ I + L +
Sbjct: 65 LLPRIKK-ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 251 KQP 253
++P
Sbjct: 124 RRP 126


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr4_2088  HTHFIS  67     2e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 67.2 bits (164), Expect = 2e-14
Identities = 30/107 (28%), Positives = 49/107 (45%), Gaps = 7/107 (6%)

Query: 3 IKVLVVDDSALIRNLLGQMIE-ADSELSLVGMAADAYMAKDMVNQHRPDVITLDIEMPKV 61
+LV DD A IR +L Q + A ++ + AA + D++ D+ MP
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATL---WRWIAAGDGDLVVTDVVMPDE 60

Query: 62 DGLTFLDRLMKARPTAVVMISSLTEEG-ADATFNALALGAVDFIPKP 107
+ L R+ KARP V++ ++ + A GA D++PKP
Sbjct: 61 NAFDLLPRIKKARPDLPVLV--MSAQNTFMTAIKASEKGAYDYLPKP 105


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr4_2093  PF06580  38     9e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 38.3 bits (89), Expect = 9e-05
Identities = 15/70 (21%), Positives = 32/70 (45%), Gaps = 10/70 (14%)

Query: 452 EIDKGMIEKLVDPLT--HLVRNSLDHGIEKPEKRLAAGKTEAGVLSLKASQRGGSIVIAV 509
+I+ +++ V P+ LV N + HGI + + G + LK ++ G++ + V
Sbjct: 245 QINPAIMDVQVPPMLVQTLVENGIKHGIAQ--------LPQGGKILLKGTKDNGTVTLEV 296

Query: 510 HDDGGGLNRE 519
+ G +
Sbjct: 297 ENTGSLALKN 306


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr4_2094  HTHFIS  86     5e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 86.0 bits (213), Expect = 5e-23
Identities = 30/122 (24%), Positives = 54/122 (44%), Gaps = 3/122 (2%)

Query: 1 MSK-KILIVDDSAAIRQMVEATLKSANYQVVLAKDGREALDLCGGQRFDFILTDQNMPRM 59
M+ IL+ DD AAIR ++ L A Y V + + D ++TD MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 60 DGLTLIKSLRGMSAFMRTPIVMLTTEAGEDMKAQGRAAGATGWMVKPFDPQKLLAITAKV 119
+ L+ ++ A P+++++ + + GA ++ KPFD +L+ I +
Sbjct: 61 NAFDLLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 120 LG 121
L
Sbjct: 119 LA 120


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr4_2095  HTHFIS  81     3e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 80.6 bits (199), Expect = 3e-18
Identities = 32/104 (30%), Positives = 51/104 (49%)

Query: 8 ILVIDDDLVTNQILTAFIHSKGWGVITCCNLEEAYEEINQQNIELILLDYYLPDGTALTL 67
ILV DDD +L + G+ V N + I + +L++ D +PD A L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 68 LERLRYREPTVPVIVISADNEYQKILSCFRLGALDFIIKPINLE 111
L R++ P +PV+V+SA N + + GA D++ KP +L
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLT 109


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr4_2099  TCRTETB  133    7e-36    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 133 bits (335), Expect = 7e-36
Identities = 85/390 (21%), Positives = 173/390 (44%), Gaps = 15/390 (3%)

Query: 51 LDTTIANVALPHMQGSMGATQDQISWVLTSYIVAAAIFMPLTGFLTARLGRKRVFMWAVV 110
L+ + NV+LP + +WV T++++ +I + G L+ +LG KR+ ++ ++
Sbjct: 28 LNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGII 87

Query: 111 GFTIASMLCGAAQNLEQIVLF-RLLQGVFGASLVPLSQSVLLDSYPPERHGSAMALWGVG 169
S++ + +++ R +QG A+ L V+ P E G A L G
Sbjct: 88 INCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSI 147

Query: 170 VMVGPILGPSLGGWLTEYYNWRWVFYINLPFGLLAWFGLAAYVKETPLDHSRKFDLLGFA 229
V +G +GP++GG + Y +W + + +P + + + + FD+ G
Sbjct: 148 VAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGII 205

Query: 230 MLSLAIGALQMLLDRGESLDWFSSREIVIEAIIAGMAFYLFVAHIFTHKHPFIEPGLFKD 289
++S+ I + F++ + I++ ++F +FV HI PF++PGL K+
Sbjct: 206 LMSVGIVFFML----------FTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKN 255

Query: 290 RNFSVGLIFIFIIGIILLATMALLPPFMQNLLGYPVIDVGY-LLAPRGVGTMIAMMTVGK 348
F +G++ II + ++++P M+++ ++G ++ P + +I G
Sbjct: 256 IPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGI 315

Query: 349 LAGKVDVRYQIFLGLMLTILSLWEMTGFNTNITGWDIVRTGVIQGLGLGFIFVPLSTITF 408
L + Y + +G+ +S F T W + V GL F +STI
Sbjct: 316 LVDRRGPLYVLNIGVTFLSVSFLTA-SFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVS 374

Query: 409 ATLAAKYRNEGTALFSLMRNIGSSIGISVV 438
++L + G +L + + GI++V
Sbjct: 375 SSLKQQEAGAGMSLLNFTSFLSEGTGIAIV 404


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Shewmr4_2100  RTXTOXIND  87     4e-21    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 87.2 bits (216), Expect = 4e-21
Identities = 44/286 (15%), Positives = 96/286 (33%), Gaps = 26/286 (9%)

Query: 71 VENQRVEKGQVLFRLDDAMFKVMVDKASAKLAQVKTDLAVLKASYHEKQAEITLAETKLT 130
V + V + L + + ++ + L + + + + A + + + +++L
Sbjct: 179 VSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRL- 237

Query: 131 FAEKEQKRQENLIGKHFV--SESQLEDARQNTDIARQNIQTLQKDLHRIAESLGGSP-DF 187
+ + I KH V E++ +A + + ++ ++ ++ E F
Sbjct: 238 -DDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLF 296

Query: 188 PIEQHPSYLEALAQLNE-------AKLDLSRVEIKAPVSGVVSQLP--KLGQYVNVGAIA 238
E + + + I+APVS V QL G V
Sbjct: 297 KNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETL 356

Query: 239 LALV-ADHALWIEANFTETDLTHVKPGQKVNIHIDTFPDNRW---QGTVESLSPATGAEF 294
+ +V D L + A D+ + GQ I ++ FP R+ G V++++
Sbjct: 357 MVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDA---- 412

Query: 295 SLIPAQNATGNWVKIAQRVPVRIAIDTVLPEAPLRAGLSAVVDIDT 340
G + + PL +G++ +I T
Sbjct: 413 ---IEDQRLGLVFNVIISIEENCLST-GNKNIPLSSGMAVTAEIKT 454



Score = 56.8 bits (137), Expect = 5e-11
Identities = 22/138 (15%), Positives = 43/138 (31%), Gaps = 12/138 (8%)

Query: 50 VKADKVPVSAQVAGNVDSLYVVENQRVEKGQVLFRLDDAMFKVMVDKASAKLAQVKTDLA 109
+ V + V E + V KG VL +L + K + L Q + +
Sbjct: 92 HSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQT 151

Query: 110 VLKASY----HEKQAEITLAETK--LTFAEKEQKRQENLIGKHFVSESQLEDARQNTDIA 163
+ K E+ L + +E+E R +LI + Q +
Sbjct: 152 RYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLI------KEQFSTWQNQKYQK 205

Query: 164 RQNIQTLQKDLHRIAESL 181
N+ + + + +
Sbjct: 206 ELNLDKKRAERLTVLARI 223


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Shewmr4_2102  RTXTOXINA  36     2e-04    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 36.1 bits (83), Expect = 2e-04
Identities = 16/53 (30%), Positives = 25/53 (47%), Gaps = 7/53 (13%)

Query: 95 LINSLISKISQLTQGTEAFSSTLADFGLQLQSTPDVCTLNQLVGGVIDEMQNL 147
LIN L+ ++ L +FS L G L +T + GV +++QNL
Sbjct: 187 LINQLVDTVASLNNNVNSFSQQLNTLGSVLSNTKH-------LNGVGNKLQNL 232


84  Shewmr4_2147  Shewmr4_2154  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_2147  -1      17       -3.504893  hypothetical protein
Shewmr4_2148  -1      21       -4.349532  type 12 methyltransferase
Shewmr4_2149  0       23       -3.692036  IS4 family transposase
Shewmr4_2150  0       25       -4.434250  L-serine ammonia-lyase
Shewmr4_2151  0       23       -3.492739  beta-hexosaminidase
Shewmr4_2152  -1      24       -3.188153  hypothetical protein
Shewmr4_2153  -1      22       -2.597393  acylphosphatase
Shewmr4_2154  -1      20       -2.008527  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Shewmr4_2147  GPOSANCHOR  37     3e-04    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 36.6 bits (84), Expect = 3e-04
Identities = 21/68 (30%), Positives = 38/68 (55%), Gaps = 7/68 (10%)

Query: 594 ALEEAKIQ-QAIAEQEAIAAQAKAA--EEAALAKAKAEAEAEAERQRL----EQEEQMKA 646
ALEEA + A+ + ++K +E A +AK EAEA+A +++L E+ +++A
Sbjct: 401 ALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKALKEKLAKQAEELAKLRA 460

Query: 647 SEQSQPET 654
+ S +T
Sbjct: 461 GKASDSQT 468



Score = 34.7 bits (79), Expect = 0.001
Identities = 21/82 (25%), Positives = 33/82 (40%), Gaps = 13/82 (15%)

Query: 596 EEAKIQQAIAEQEAIAAQAKAAEEAALAKAK--------AEAEAEAERQRLEQEEQMKAS 647
EA+ AE+ + Q A + A+ + EAE Q+LE EQ K S
Sbjct: 286 LEAEKAALEAEKADLEHQ-SQVLNANRQSLRRDLDASREAKKQLEAEHQKLE--EQNKIS 342

Query: 648 EQSQPETGSQEAIATSDESLAK 669
E S+ + + S E+ +
Sbjct: 343 EASR--QSLRRDLDASREAKKQ 362



Score = 32.0 bits (72), Expect = 0.008
Identities = 26/88 (29%), Positives = 43/88 (48%), Gaps = 10/88 (11%)

Query: 592 VEA-LEEAKIQQAIAEQEAIAAQAK--AAEEAALAKAKAEAEAEAERQRLEQ-----EEQ 643
+EA ++ + Q I+E + + A+ EA KA EA ++ LE+ EE
Sbjct: 363 LEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEES 422

Query: 644 MKASEQSQPETGSQ-EAIATS-DESLAK 669
K +E+ + E ++ EA A + E LAK
Sbjct: 423 KKLTEKEKAELQAKLEAEAKALKEKLAK 450



Score = 30.8 bits (69), Expect = 0.020
Identities = 16/61 (26%), Positives = 29/61 (47%), Gaps = 4/61 (6%)

Query: 611 AAQAKAAEEAALAKAKAEAE-AEAERQRLEQEEQMKASEQSQPETGSQEAIATSD-ESLA 668
+ +AK EA K + + + +EA RQ L + + AS +++ + A S +L
Sbjct: 356 SREAKKQLEAEHQKLEEQNKISEASRQSLRR--DLDASREAKKQVEKALEEANSKLAALE 413

Query: 669 K 669
K
Sbjct: 414 K 414


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr4_2148  HTHTETR  38     8e-06    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 38.5 bits (89), Expect = 8e-06
Identities = 27/164 (16%), Positives = 56/164 (34%), Gaps = 7/164 (4%)

Query: 21 WEQRRDYLTQVALRSLRGHKTFDLCRSHLVQVSQISKGTIYNHFTTEADLIVAVASAQYD 80
++ R ++ VALR + + + +++G IY HF ++DL +
Sbjct: 9 AQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSES 68

Query: 81 EWLCAAKQ-DVQRYPDPFSRF--LYHHCFRLHQVLSQQRFVIERIMPNQTLLFEATESCR 137
+ + DP S + H ++R ++E I+ ++ +
Sbjct: 69 NIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLME-IIFHKCEFVGEMAVVQ 127

Query: 138 QRFETLFDEYHQWNRNTISEV---GDIPGFNRTELVMDYLRGAM 178
Q L E + T+ +P T +RG +
Sbjct: 128 QAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYI 171


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr4_2153  HTHFIS  29     0.049    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.0 bits (65), Expect = 0.049
Identities = 10/31 (32%), Positives = 15/31 (48%), Gaps = 1/31 (3%)

Query: 39 KWDKEVEVLIVGSGFAGLAAGIEAIRKGAKD 69
K ++ VL++ S I+A KGA D
Sbjct: 71 KARPDLPVLVM-SAQNTFMTAIKASEKGAYD 100


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2154  BCTERIALGSPD  30     0.014    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 29.9 bits (67), Expect = 0.014
Identities = 22/140 (15%), Positives = 42/140 (30%), Gaps = 24/140 (17%)

Query: 4 ALVLTQEDSRINARVAQISETDLPEGEVLVDVAYSSLNYKDGLAVTGTGKIIRQFPMVPG 63
AL++T +N I++ D+ +VLV+ + + DGL + G
Sbjct: 320 ALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNL--------------G 365

Query: 64 IDFAGVVNQSNDPRYQAGDQVILTGWGVGENHWGGMAQKARVKADWLVSMPQNCDPAKAM 123
I +A + Q +G + G S+ +
Sbjct: 366 IQWAN--------KNAGMTQFTNSGLPISTAIAGANQYNK--DGTVSSSLASALSSFNGI 415

Query: 124 MIGTAGLTAMLCVQALEQAG 143
G + + AL +
Sbjct: 416 AAGFYQGNWAMLLTALSSST 435


85  Shewmr4_2201  Shewmr4_2208  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_2201  -1      18       3.461862   hypothetical protein
Shewmr4_2202  1       18       2.152446   exodeoxyribonuclease V subunit alpha
Shewmr4_2203  0       18       1.913176   exodeoxyribonuclease V subunit beta
Shewmr4_2204  1       16       1.273191   exodeoxyribonuclease V subunit gamma
Shewmr4_2205  0       16       1.070146   transglutaminase domain-containing protein
Shewmr4_2206  -1      16       1.369268   hypothetical protein
Shewmr4_2207  -1      17       0.182262   ATPase
Shewmr4_2208  -1      17       -0.114605  transposase IS116/IS110/IS902 family protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Shewmr4_2201  OMPADOMAIN  72     1e-16    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 71.9 bits (176), Expect = 1e-16
Identities = 27/92 (29%), Positives = 42/92 (45%), Gaps = 2/92 (2%)

Query: 142 ELALGMNVQFRTGSSELESHFLPQLDNVAKVMKRSSESN--LELKGYADRRGDLAYNQAL 199
L +V F + L+ LD + + + + + GY DR G AYNQ L
Sbjct: 214 HFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGL 273

Query: 200 SEQRLLEVRGYLIKQGVAPERITTQAFGARMP 231
SE+R V YLI +G+ ++I+ + G P
Sbjct: 274 SERRAQSVVDYLISKGIPADKISARGMGESNP 305


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr4_2202  HTHFIS  63     7e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 63.3 bits (154), Expect = 7e-14
Identities = 29/134 (21%), Positives = 54/134 (40%), Gaps = 5/134 (3%)

Query: 3 RIAIVEDEAAIRENYKDVLQQHGYSVQTYADRPSAMLAFNTRLPDLAIIDIGLGNEIDGG 62
I + +D+AAIR L + GY V+ ++ + DL + D+ + +E
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE--NA 62

Query: 63 FMLCQSLRAMSNTLPIIFLTARDSDFDTVCGLRLGADDYLSKEVSFPH---LTARLAALF 119
F L ++ LP++ ++A+++ + GA DYL K + R A
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 120 RRSELATSQTPQEN 133
+R Q+
Sbjct: 123 KRRPSKLEDDSQDG 136


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2206  SHAPEPROTEIN  42     3e-06    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 42.4 bits (100), Expect = 3e-06
Identities = 25/81 (30%), Positives = 41/81 (50%), Gaps = 11/81 (13%)

Query: 191 AAKRAGFVDVAFLFEPLAAGMDYEASLSADQTVLVVDVGGGTTDCSVVKMGPKHQASFDR 250
+A+ AG +V + EP+AA + +S +VVD+GGGTT+ +V+ +
Sbjct: 129 SAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLN--------- 179

Query: 251 SADCLGHSGQRIGGNDLDIAL 271
+ S RIGG+ D A+
Sbjct: 180 --GVVYSSSVRIGGDRFDEAI 198


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2208  SHAPEPROTEIN  29     0.039    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 29.3 bits (66), Expect = 0.039
Identities = 16/36 (44%), Positives = 24/36 (66%)

Query: 137 NIVIDIGGGSTEVVLGQKNTPTHLSSLRCGCVSFNE 172
++V+DIGGG+TEV + N + SS+R G F+E
Sbjct: 161 SMVVDIGGGTTEVAVISLNGVVYSSSVRIGGDRFDE 196


86  Shewmr4_2333  Shewmr4_2338  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_2333  0       13       1.019431   hypothetical protein
Shewmr4_2334  0       12       0.596224   NUDIX hydrolase
Shewmr4_2335  0       11       0.899896   hemolysin III family channel protein
Shewmr4_2336  0       12       0.853053   hypothetical protein
Shewmr4_2337  0       14       1.452531   thiamine biosynthesis protein ThiC
Shewmr4_2338  -1      13       0.003125   thiamine-phosphate pyrophosphorylase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2333  BCTERIALGSPF  28     0.034    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 28.3 bits (63), Expect = 0.034
Identities = 16/90 (17%), Positives = 34/90 (37%), Gaps = 16/90 (17%)

Query: 59 RFRKEIEYPANLNLKTLCLIAIAGGC-----------LGAAILLSTSEQVFANLV----- 102
R ++ + YP L + + +++I + A+ LST + +
Sbjct: 168 RIQQAMIYPCVLTVVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFG 227

Query: 103 PWLILLATLAFIGGPWLLKKRLANASITRM 132
PW++L F+ +L++ S R
Sbjct: 228 PWMLLALLAGFMAFRVMLRQEKRRVSFHRR 257


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2335  SACTRNSFRASE  35     6e-05    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 34.5 bits (79), Expect = 6e-05
Identities = 25/111 (22%), Positives = 35/111 (31%), Gaps = 24/111 (21%)

Query: 33 LKQYLDAPKRSDEI-YLAESEGVVFGLISLIFFDYFPSQQKICRITA------------L 79
KQY D ++ Y+ E F Y+ I RI +
Sbjct: 47 FKQYEDD---DMDVSYVEEEGKAAFL--------YYLENNCIGRIKIRSNWNGYALIEDI 95

Query: 80 VVTEASRGLGVGTQLIDFAKDRASERGCHQLEVTTSMRREQTQVYYESIGF 130
V + R GVGT L+ A + A E L + T +Y F
Sbjct: 96 AVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2337  ACRIFLAVINRP  843    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 843 bits (2179), Expect = 0.0
Identities = 324/1039 (31%), Positives = 555/1039 (53%), Gaps = 38/1039 (3%)

Query: 3 LTDLSVKRPVFASVISLLLVAFGLVAFDKLPLREYPNIDPPIVSIETNYRGASAAVVESR 62
+ + ++RP+FA V++++L+ G +A +LP+ +YP I PP VS+ NY GA A V+
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 63 ITQLIEDRISGVEGIRHVSSSS-SDGRSQVTLEFDISRNIEDAANDVRDRISGLLDNLPE 121
+TQ+IE ++G++ + ++SS+S S G +TL F + + A V++++ LP+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 122 EADPPEVQKANGGDEVIMWLNLVSD--QMTTLELTDYTRRYLSDRLSVVDGVSMIRIGGG 179
E + +M VSD T +++DY + D LS ++GV +++ G
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 180 KVYAMRVWLDRQALASRSLTVADVEAALRAENVELPAGSL------ESKERHFTVRLERS 233
+ YAMR+WLD L LT DV L+ +N ++ AG L ++ + ++ +
Sbjct: 181 Q-YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 234 YRTAEDFANLVISQGEDGYLVKLGDVAKVEIGSEEERIMFRGNKEAMIGLGVSKQSTANT 293
++ E+F + + DG +V+L DVA+VE+G E ++ R N + GLG+ + AN
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 294 LEVARAVNALVDKINPTLPAGMSIKRSYDSSVFIEASIKEVYQTLFTAMVLVIIVIYLFL 353
L+ A+A+ A + ++ P P GM + YD++ F++ SI EV +TLF A++LV +V+YLFL
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 354 GSVRAMLIPAITVPVSLLGTFIVLYALGYTINLLTLLAMILAIGMVVDDAIVMLENIHRR 413
++RA LIP I VPV LLGTF +L A GY+IN LT+ M+LAIG++VDDAIV++EN+ R
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 414 I-EEGDSPLKAAFLGAREVAFAVIATTLVLVAVFMPITFLEGDLGKLFKEFAVAMSAAVI 472
+ E+ P +A ++ A++ +VL AVF+P+ F G G ++++F++ + +A+
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 473 FSSIVALTLSPMMCSKLLKPASQD---------SWLVRKVDSIMTGISRGYQSSLEKAMA 523
S +VAL L+P +C+ LLKP S + W D + + L
Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKIL----G 535

Query: 524 RPVLMSILVLIALGSSVLLAQKVPQEFAPQEDRGSLFLMVNGPQGASYEYIESYMNEVEN 583
++ + + V+L ++P F P+ED+G M+ P GA+ E + +++V +
Sbjct: 536 STGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTD 595

Query: 584 RLMPLVDSGDIKRLLIRAPRGFGRAADFSNGMAIIVLEDWGQRRPMKE----VIGDINKR 639
+ + +++ + F A + GMA + L+ W +R + VI
Sbjct: 596 YYLK-NEKANVESVFTVNGFSFSGQAQ-NAGMAFVSLKPWEERNGDENSAEAVIHRAKME 653

Query: 640 LADL--AGVQAFPVMRQA-FGRGVGKPVQFV-IGGPSYEELARWRDIMMEKAAENP-KLL 694
L + V F + G G + + G ++ L + R+ ++ AA++P L+
Sbjct: 654 LGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLV 713

Query: 695 GLDHDYKETKPQLRVVIDRDRAASLGVSISNIGRTLESMLGSRLVTTFMRDGEEYDVIVE 754
+ + E Q ++ +D+++A +LGVS+S+I +T+ + LG V F+ G + V+
Sbjct: 714 SVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQ 773

Query: 755 GERSNQNTAADLQNIYVRSERTKELIPLSNLVTVEEFADASSLNRYNRMRAITIEASLAD 814
+ + D+ +YVRS E++P S T + L RYN + ++ I+ A
Sbjct: 774 ADAKFRMLPEDVDKLYVRS-ANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAP 832

Query: 815 GYSLGEALDYLNQVARAYLPAEAVISYKGQSLDYQESGSSMYFVFLLALGIVFLVLAAQF 874
G S G+A+ + +A LPA + G S + SG+ + ++ +VFL LAA +
Sbjct: 833 GTSSGDAMALMENLASK-LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALY 891

Query: 875 ESYIHPMVIMLTVPLATVGALIGLWFTGQSLNIYSQIGIIMLVGLAAKNGILIVEFANQL 934
ES+ P+ +ML VPL VG L+ Q ++Y +G++ +GL+AKN ILIVEFA L
Sbjct: 892 ESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDL 951

Query: 935 RDK-GVDFDRAIIQASCQRLRPILMTGITTAAGAVPLVLAAGAGAETRFVIGVVVLSGIM 993
+K G A + A RLRPILMT + G +PL ++ GAG+ + +G+ V+ G++
Sbjct: 952 MEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMV 1011

Query: 994 LATLFTIFVIPTAYGLFAR 1012
ATL IF +P + + R
Sbjct: 1012 SATLLAIFFVPVFFVVIRR 1030



Score = 88.4 bits (219), Expect = 8e-20
Identities = 51/327 (15%), Positives = 125/327 (38%), Gaps = 19/327 (5%)

Query: 706 QLRVVIDRDRAASLGVSISNIGRTLES----MLGSRLVTTFMRDGEEYDVIVEGERSNQN 761
+R+ +D D ++ ++ L+ + +L T G++ + + + +
Sbjct: 183 AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQ-TRFK 241

Query: 762 TAADLQNIYVRSERTKELIPLSNLVTVEE-FADASSLNRYNRMRAITIEASLADGYSLGE 820
+ + +R ++ L ++ VE + + + R N A + LA G
Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATG---AN 298

Query: 821 ALDYLNQVA------RAYLPA--EAVISYKGQSLDYQESGSSMYFVFLLALGIVFLVLAA 872
ALD + + + P + + Y + Q S + A+ +VFLV+
Sbjct: 299 ALDTAKAIKAKLAELQPFFPQGMKVLYPYD-TTPFVQLSIHEVVKTLFEAIMLVFLVMYL 357

Query: 873 QFESYIHPMVIMLTVPLATVGALIGLWFTGQSLNIYSQIGIIMLVGLAAKNGILIVE-FA 931
++ ++ + VP+ +G L G S+N + G+++ +GL + I++VE
Sbjct: 358 FLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVE 417

Query: 932 NQLRDKGVDFDRAIIQASCQRLRPILMTGITTAAGAVPLVLAAGAGAETRFVIGVVVLSG 991
+ + + A ++ Q ++ + +A +P+ G+ + ++S
Sbjct: 418 RVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSA 477

Query: 992 IMLATLFTIFVIPTAYGLFARNSGSPE 1018
+ L+ L + + P + +
Sbjct: 478 MALSVLVALILTPALCATLLKPVSAEH 504


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Shewmr4_2338  RTXTOXIND  38     4e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 38.3 bits (89), Expect = 4e-05
Identities = 23/116 (19%), Positives = 40/116 (34%), Gaps = 6/116 (5%)

Query: 37 VVIATAAMAPVRDEVEAIGTNKAY-ESVTITPKVTDVVTSLKFDDGDIVKKGDLLVQLQN 95
+ + + V A G S I P +V + +G+ V+KGD+L++L
Sbjct: 70 IAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTA 129

Query: 96 AEQIAKVKVAQVKVSDNQRELARISSLVTSRTVAELERDRLQTLIDTTRAELEQAQ 151
A Q + + E R L S +E ++L L +
Sbjct: 130 LGAEADTLKTQSSLLQARLEQTRYQILSRS-----IELNKLPELKLPDEPYFQNVS 180


87  Shewmr4_2487  Shewmr4_2498  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_2487  1       16       -0.432744  hypothetical protein
Shewmr4_2488  2       19       -0.644535  response regulator receiver modulated metal
Shewmr4_2489  4       21       -0.633237  hypothetical protein
Shewmr4_2490  4       23       -0.995483  hypothetical protein
Shewmr4_2491  5       28       -1.227270  cbb3-type cytochrome c oxidase subunit I
Shewmr4_2492  4       28       -1.068159  cbb3-type cytochrome c oxidase subunit II
Shewmr4_2493  2       25       -0.787094  hypothetical protein
Shewmr4_2494  2       31       -1.563021  Cbb3-type cytochrome oxidase component
Shewmr4_2495  1       27       -1.601662  cytochrome c oxidase, cbb3-type subunit III
Shewmr4_2496  1       22       -1.506327  hypothetical protein
Shewmr4_2497  -1      14       -1.195008  hypothetical protein
Shewmr4_2498  1       16       -1.741479  heavy metal translocating P-type ATPase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr4_2487  HTHFIS  31     0.010    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.6 bits (69), Expect = 0.010
Identities = 9/16 (56%), Positives = 14/16 (87%)

Query: 38 LVGESGSGRSLLARAI 53
+ GESG+G+ L+ARA+
Sbjct: 165 ITGESGTGKELVARAL 180


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr4_2488  HTHFIS  29     0.020    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.0 bits (65), Expect = 0.020
Identities = 13/28 (46%), Positives = 16/28 (57%)

Query: 42 TLAIVGEAGSGKSTLARILVGAEPRSGG 69
TL I GE+G+GK +AR L R G
Sbjct: 162 TLMITGESGTGKELVARALHDYGKRRNG 189


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2492  DNABINDINGHU  119    4e-39    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 119 bits (300), Expect = 4e-39
Identities = 53/88 (60%), Positives = 69/88 (78%)

Query: 2 NKSELIEKIASGADISKAAAGRALDSFIAAVTEGLKEGDKISLVGFGTFEVRERAERTGR 61
NK +LI K+A +++K + A+D+ +AV+ L +G+K+ L+GFG FEVRERA R GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 62 NPQTGEEIKIAAAKIPAFKAGKALKDAV 89
NPQTGEEIKI A+K+PAFKAGKALKDAV
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr4_2493  PF05272  33     0.006    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 32.7 bits (74), Expect = 0.006
Identities = 15/86 (17%), Positives = 35/86 (40%), Gaps = 6/86 (6%)

Query: 286 AEATVVRSYVDWMTSVPWSQRSKIKRDLAKAQEVLDTDHYGLEKVKDRILEYLAVQSRVR 345
A+ V + DW+ + W + ++++ L D+ +++ + V
Sbjct: 527 ADMNRVHPFRDWVKAQQWDEVPRLEKWLVHVLGKTPDDYKPRRLRYLQLVGKYILMGHVA 586

Query: 346 QLKGP------ILCLVGPPGVGKTSL 365
++ P + L G G+GK++L
Sbjct: 587 RVMEPGCKFDYSVVLEGTGGIGKSTL 612


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr4_2494  HTHFIS  30     0.017    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.2 bits (68), Expect = 0.017
Identities = 14/70 (20%), Positives = 28/70 (40%), Gaps = 13/70 (18%)

Query: 64 KLPTPHELRAHLDDYVIGQDRAKKVLSVAVYNHYKRLRNSSPKDGVELGKSNILLIGPTG 123
+ P+ E + ++G+ A +Y RL + +++ G +G
Sbjct: 124 RRPSKLEDDSQDGMPLVGRSAA----MQEIYRVLARLMQT---------DLTLMITGESG 170

Query: 124 SGKTLLAETL 133
+GK L+A L
Sbjct: 171 TGKELVARAL 180


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2498  ENTEROTOXINA  30     0.025    Heat-labile enterotoxin A chain signature.
		>ENTEROTOXINA#Heat-labile enterotoxin A chain signature.

Length = 258

Score = 29.6 bits (66), Expect = 0.025
Identities = 21/71 (29%), Positives = 29/71 (40%), Gaps = 15/71 (21%)

Query: 275 NFFTIRDVLGHYDPET----------VRYFLLSGHYRSQINYSEENLKQARAALERLYTA 324
N F + DVLG Y P + Y + G YR +E L + R +R Y
Sbjct: 111 NMFNVNDVLGVYSPHPYEQEVSALGGIPYSQIYGWYRVNFGVIDERLHRNREYRDRYYR- 169

Query: 325 IKDVDLTVAPA 335
+L +APA
Sbjct: 170 ----NLNIAPA 176


88  Shewmr4_2570  Shewmr4_2576  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_2570  -2      15       4.645450   radical SAM domain-containing protein
Shewmr4_2571  -1      25       5.457262   sulfatase
Shewmr4_2572  1       25       5.454759   hypothetical protein
Shewmr4_2573  0       24       5.410435   hypothetical protein
Shewmr4_2574  1       21       5.721139   PepSY-associated TM helix domain-containing
Shewmr4_2575  2       17       2.573178   hypothetical protein
Shewmr4_2576  2       19       1.203764   beta-lactamase domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Shewmr4_2570  SUBTILISIN  142    3e-40    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 142 bits (359), Expect = 3e-40
Identities = 71/210 (33%), Positives = 97/210 (46%), Gaps = 24/210 (11%)

Query: 126 AGMKVCIIDSGLDSSNPDFNWNNITG----DSDPGTGNWFQNGGPHGTHVAGTIGAADNN 181
G+KV ++D+G D+ +PD I G D D G F++ HGTHVAGTI A +N
Sbjct: 41 RGVKVAVLDTGCDADHPDLKARIIGGRNFTDDDEGDPEIFKDYNGHGTHVAGTIAATENE 100

Query: 182 IGVVGMAPGVPMHIVKVFNASGWGYSSDLAYAANKCSNAGAKIISMSLGGGAANNTEKNA 241
GVVG+AP + I+KV N G G + IISMSLGG A
Sbjct: 101 NGVVGVAPEADLLIIKVLNKQGSGQYDWIIQGIYYAIEQKVDIISMSLGGPEDVPELHEA 160

Query: 242 FDAFTAAGGLVVAAAGNDGNSVRS-----YPAGYPSVMMIGANDANNNIADFSQYPSCVS 296
A+ LV+ AAGN+G+ YP Y V+ +GA + + + ++FS
Sbjct: 161 VKKAVASQILVMCAAGNEGDGDDRTDELGYPGCYNEVISVGAINFDRHASEFSNS----- 215

Query: 297 GRGKKAVNDDGICVEVTAGGVDTLSTYPAG 326
V++ A G D LST P G
Sbjct: 216 ----------NNEVDLVAPGEDILSTVPGG 235



Score = 53.3 bits (128), Expect = 8e-10
Identities = 19/70 (27%), Positives = 27/70 (38%), Gaps = 7/70 (10%)

Query: 447 YGFMSGTSMATPAVSGMAALVWSN-----HSQCTGTQIRKALKATAMDAGTVGKDNYFGY 501
Y SGTSMATP V+G AL+ T ++ L + G G
Sbjct: 237 YATFSGTSMATPHVAGALALIKQLANASFERDLTEPELYAQLIKRTIPLG--NSPKMEGN 294

Query: 502 GIVNAKAADA 511
G++ A +
Sbjct: 295 GLLYLTAVEE 304


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2573  ABC2TRNSPORT  40     1e-05    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 39.5 bits (92), Expect = 1e-05
Identities = 44/166 (26%), Positives = 78/166 (46%), Gaps = 23/166 (13%)

Query: 186 GVILTMTMVMFT----SAAIVREREQGNMEFLITTPVRPLELMLGKI----TPYVLVGFV 237
G++ T M T AA R Q E ++ T +R +++LG++ T L G
Sbjct: 72 GMVATSAMTAATFETIYAAFGRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAG 131

Query: 238 QLAIILTAGH-----LLFAVPIRGGLDSIALAAMLFICASLTLGLVISTIAKTQLQSMQM 292
+ G+ LL+A+P+ IAL + F +LG+V++ +A + +
Sbjct: 132 IGVVAAALGYTQWLSLLYALPV------IALTGLAFA----SLGMVVTALAPSYDYFIFY 181

Query: 293 TVFILLPSILLSGFMFPYEAMPVAAQWIAEALPATHFMRMSRAIVL 338
++ P + LSG +FP + +P+ Q A LP +H + + R I+L
Sbjct: 182 QTLVITPILFLSGAVFPVDQLPIVFQTAARFLPLSHSIDLIRPIML 227


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Shewmr4_2575  RTXTOXIND  58     2e-11    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 57.9 bits (140), Expect = 2e-11
Identities = 35/180 (19%), Positives = 59/180 (32%), Gaps = 11/180 (6%)

Query: 17 PSSRGWGKLLASLLGAALLLQLTACGDESPRVLGTV--ERDRLTLTAPVGELINRINVVE 74
R + L A +L + + G + + ++ I V E
Sbjct: 55 RRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKE 114

Query: 75 GQQVQAGEVLLELDSTAAQARLGQRQAELKQA-------QAKLEEAVTGARSEDIDKARA 127
G+ V+ G+VLL+L + A+A + Q+ L QA Q E
Sbjct: 115 GESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEP 174

Query: 128 ALDGANASVKEARQNFERTQ--QLFKTKVLSQADLDAARAARDTSLAKQAEAEQSLRLLQ 185
+ + + Q K + +LD RA R T LA+ E R+ +
Sbjct: 175 YFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEK 234



Score = 49.8 bits (119), Expect = 8e-09
Identities = 27/231 (11%), Positives = 75/231 (32%), Gaps = 24/231 (10%)

Query: 84 LLELDSTAAQARLGQRQAELKQAQAKLEEAVTGARSEDIDKARAALDGANASVKEARQNF 143
+ E + + + ++ + + + + E + R+E A ++ + +
Sbjct: 179 VSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAE-RLTVLARINRYENLSRVEKSRL 237

Query: 144 ERTQQLFKTKVLS--------------QADLDAARAARDTSLAKQAEAEQSLRLLQNGTR 189
+ L + ++ +L ++ + ++ A++ +L+ +
Sbjct: 238 DDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFK 297

Query: 190 SEQLEQARAAVEAAMAGVAQEQKALKDLSLVAAK-PA---VVDTLPWRVGDRVAAGSQLI 245
+E L++ R + + K + + P V G V L+
Sbjct: 298 NEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLM 357

Query: 246 GLLAIEHPY-VRVYLPATWLDRVKAGNQVKILVDG----RTQPIAGTVRNI 291
++ + V + + + G I V+ R + G V+NI
Sbjct: 358 VIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNI 408


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr4_2576  HTHTETR  73     9e-18    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 72.7 bits (178), Expect = 9e-18
Identities = 24/151 (15%), Positives = 55/151 (36%), Gaps = 6/151 (3%)

Query: 31 SDARQRLITAAVSLFSERSYPTVSTREIARVAEVDAALIRYYFGSKAGLFEQMVRETLEP 90
+ RQ ++ A+ LFS++ + S EIA+ A V I ++F K+ LF ++ +
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 91 VITRLREVSTAQAPNN---VGDLMQTYYRVMAPNPGLPRLIVRVLQESDGTEAYRIMLSV 147
+ E + + +++ L+ + + + ++
Sbjct: 70 IGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQA 129

Query: 148 FEQMLSLSRQWLEASF---VSAGILKEGLDP 175
+ S +E + + A +L L
Sbjct: 130 QRNLCLESYDRIEQTLKHCIEAKMLPADLMT 160


89  Shewmr4_2616  Shewmr4_2623  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_2616  2       16       1.972320   diguanylate phosphodiesterase
Shewmr4_2617  2       15       1.545603   hypothetical protein
Shewmr4_2618  1       16       1.679875   hypothetical protein
Shewmr4_2619  0       15       1.282568   IS4 family transposase
Shewmr4_2620  0       15       1.659886   hypothetical protein
Shewmr4_2621  -1      15       2.209540   transposase
Shewmr4_2622  -2      16       2.654135   integrase catalytic subunit
Shewmr4_2623  -2      16       2.839893   RNA-directed DNA polymerase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2616  ACETATEKNASE  29     0.034    Acetate kinase family signature.
		>ACETATEKNASE#Acetate kinase family signature.

Length = 400

Score = 28.6 bits (64), Expect = 0.034
Identities = 10/44 (22%), Positives = 19/44 (43%), Gaps = 1/44 (2%)

Query: 217 ALVDAGDEMAIAAFDRYMDRLARSLAHVINMLDP-DAIVLGGGM 259
A GD+ A A + + R+ +++ + D IV G+
Sbjct: 289 AAFKNGDKRAQLALNVFAYRVKKTIGSYAAAMGGVDVIVFTAGI 332


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2618  NUCEPIMERASE  38     4e-05    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 38.2 bits (89), Expect = 4e-05
Identities = 50/247 (20%), Positives = 82/247 (33%), Gaps = 71/247 (28%)

Query: 12 ITVLLTGA----DSQLSKALLR------VLAKAANRFDGRV--FRVHALSHA-----QLD 54
+ L+TGA +SK LL + + +D + R+ L+ ++D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 55 IADKQSVAAAFARFKPDWVINCAAYNAVDRAETAAEEAY-RVNSLGPELL---ARECALT 110
+AD++ + FA + V AV R AY N G + R +
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAV-RYSLENPHAYADSNLTGFLNILEGCRHNKIQ 119

Query: 111 GARLVHISSDYVFSGEPAGAAVQALNQNGLTDSRLIESGLTER--GLTESATPAPLSVYG 168
L++ SS V+ GL + T+ + P+S+Y
Sbjct: 120 --HLLYASSSSVY-------------------------GLNRKMPFSTDDSVDHPVSLYA 152

Query: 169 QSKLAGEQAVLCILAERAIVIRTAWL-----YGVDG------HNFVKTMLRLMATMPEGQ 217
+K A E ++ + L YG G F K ML EG+
Sbjct: 153 ATKKANE--LMAHTYSHLYGLPATGLRFFTVYGPWGRPDMALFKFTKAML-------EGK 203

Query: 218 PLTVIND 224
+ V N
Sbjct: 204 SIDVYNY 210


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr4_2621  PF06340  30     0.041    Vibrio cholerae toxin co-regulated pilus biosynthesis pr...
		>PF06340#Vibrio cholerae toxin co-regulated pilus biosynthesis protein F (TcpF)

Length = 338

Score = 29.6 bits (66), Expect = 0.041
Identities = 25/83 (30%), Positives = 36/83 (43%), Gaps = 18/83 (21%)

Query: 216 YSSFKTKLADTQISADEQARLLAQAKANIESDVKPAYRLFIDYFTALQAKAGTDDGYWAL 275
YSS ++ A T+ D ARLLA +++ +L+ID++ A T D W +
Sbjct: 83 YSSTESDGAKTRTKEDFSARLLAGDYDSLQ-------KLYIDFYLA----QTTFD--WEI 129

Query: 276 PNGD-----VAYEQLLKFFTTTN 293
P D V Y K T N
Sbjct: 130 PTRDQIETLVNYANEGKLSTALN 152


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr4_2623  PF07520  30     0.028    Virulence protein SrfB
		>PF07520#Virulence protein SrfB

Length = 1041

Score = 30.3 bits (68), Expect = 0.028
Identities = 25/152 (16%), Positives = 41/152 (26%), Gaps = 26/152 (17%)

Query: 520 IPESLEGSLMLVSQVLHQCGVPLAR-ILKRLESERRNHYQFLHGFFSGTETDF---TLES 575
I + ++ Q + VPLA IL E + + E
Sbjct: 676 IGGQEQQTVQRRRQFSIRVLVPLAEAILSACEDAEEADR--IDIPVADVLGLVPTPVGEE 733

Query: 576 LHAVLLHRGADAVGKQVADI-----------DWELLRVELRAIRRSGQEIEHPASDWIFR 624
+ V ++ D W L + L A R I +
Sbjct: 734 GDEEGHEDASPQVTDEILDYLEKPATQLGAEGWRLADMVLSASREDLDAIAREVFQKVLG 793

Query: 625 A---------GDILLIVGKPRRLEKAEAKLLH 647
D++L+ G+P RL A +
Sbjct: 794 NMCEVIDHLGCDVVLLTGRPSRLPAVRAIVEE 825


90  Shewmr4_2708  Shewmr4_2711  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_2708  -1      9        -1.674791  2OG-Fe(II) oxygenase
Shewmr4_2709  0       10       -2.305937  ribosomal biogenesis GTPase
Shewmr4_2710  0       10       -2.474554  hypothetical protein
Shewmr4_2711  0       13       -1.923484  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr4_2708  PF06580  33     0.003    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 32.5 bits (74), Expect = 0.003
Identities = 20/105 (19%), Positives = 34/105 (32%), Gaps = 26/105 (24%)

Query: 327 LISNAIRY----TEPGGKITVQWRSVATGGLFSVTDTGEGIAPQHINRLTERFYRVDSAR 382
L+ N I++ GGKI ++ V +TG
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN---------------- 306

Query: 383 SRQTGGSGLGLAIVKHALSHHHSE---LNISSELGKGSTFSFVIP 424
+G GL V+ L + + +S + GK + +IP
Sbjct: 307 --TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAM-VLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr4_2709  HTHFIS  91     2e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 91.4 bits (227), Expect = 2e-23
Identities = 34/130 (26%), Positives = 64/130 (49%), Gaps = 4/130 (3%)

Query: 3 ARILIVEDELAIREMLTFVMEQHGFTTSAAEDFDSAIALLKEPYPDLILLDWMFPGGSGI 62
A IL+ +D+ AIR +L + + G+ + + + DL++ D + P +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 63 QLAKRLKQDEFTRQIPIIMLTARGEEEDKVKGLEVGADDYITKPFSPKELVARIKAVL-- 120
L R+K+ +P+++++A+ +K E GA DY+ KPF EL+ I L
Sbjct: 64 DLLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 121 RRSAPTRLEE 130
+ P++LE+
Sbjct: 122 PKRRPSKLED 131


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2710  ECOLNEIPORIN  81     1e-19    E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 81.4 bits (201), Expect = 1e-19
Identities = 78/335 (23%), Positives = 127/335 (37%), Gaps = 33/335 (9%)

Query: 7 KTLLASALASATLASAYAAEPLTVYGKLNV---TAQSNDEKGDST------TTIQSNASR 57
K+L+A LA+ +A A +T+YG + T++S G T I S+
Sbjct: 3 KSLIALTLAALPVA---AMADVTLYGTIKAGVETSRSVAHNGAQAASVETGTGIVDLGSK 59

Query: 58 FGVKGDFELSSSLEAFYTVEYEVDTGAATSDNFKARNQFVGLKGAFGSFSVGRNDTLLKI 117
G KG +L + L+A + VE + A T + R F+GLKG FG VGR +++LK
Sbjct: 60 IGFKGQEDLGNGLKAIWQVEQKASI-AGTDSGWGNRQSFIGLKGGFGKLRVGRLNSVLK- 117

Query: 118 SQGNVDQFNDLSGDL--KSLFKGENRLGQTATYLSPSIGGFVFGATYAAEGDADQQAQDG 175
G+++ ++ S L + + E RL + Y SP G YA +A + +
Sbjct: 118 DTGDINPWDSKSDYLGVNKIAEPEARL-ISVRYDSPEFAGLSGSVQYALNDNAGRHNSES 176

Query: 176 FSLAAMYGDAKLKKSPFYAAIAYDSDVKGYEILRASVQGKIADLTLGGMYQQQEQTYKNA 235
+ Y + A + + I + + ++ +Y ++A
Sbjct: 177 YHAGFNYKNGGFFVQYGGAYKRHHQVQENVNIEKYQIHRLVSGYDNDALYASVAVQQQDA 236

Query: 236 LPV----NTDSVNGYLFSAAYDINAVTLKAQY----------QDMEDLGDSWSVGADYSL 281
V + +S + AY VT + Y + + D VGA+Y
Sbjct: 237 KLVEENYSHNSQTEVAATLAYRFGNVTPRVSYAHGFKGSFDATNYNNDYDQVVVGAEYDF 296

Query: 282 GKPTKVFAFYT--NRSMEASNDDDKYIAVGLEHKF 314
K T S VGL HKF
Sbjct: 297 SKRTSALVSAGWLQEGKGESKFVSTAGGVGLRHKF 331


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits  Score  E-value  Comments
Shewmr4_2711  SECA  30     0.010    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 30.2 bits (68), Expect = 0.010
Identities = 10/41 (24%), Positives = 24/41 (58%), Gaps = 1/41 (2%)

Query: 81 ESLEEKVALIEDEENRKLAKKEKDALKD-EIITSLLPRAFS 120
++E ++ + DEE + + + L+ E++ +L+P AF+
Sbjct: 29 NAMEPEMEKLSDEELKGKTAEFRARLEKGEVLENLIPEAFA 69


91  Shewmr4_2801  Shewmr4_2809  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_2801  -1      14       1.718662   TraR/DksA family transcriptional regulator
Shewmr4_2802  -2      12       1.961857   TonB-dependent receptor
Shewmr4_2803  -2      12       2.071391   hypothetical protein
Shewmr4_2804  -2      11       2.033768   bifunctional UDP-sugar hydrolase/5'-nucleotidase
Shewmr4_2805  -2      11       1.736223   bifunctional UDP-sugar hydrolase/5'-nucleotidase
Shewmr4_2806  0       13       0.899550   carboxylesterase
Shewmr4_2807  0       13       0.721728   hypothetical protein
Shewmr4_2808  0       14       -0.301617  peptidase M1, membrane alanine aminopeptidase
Shewmr4_2809  1       13       -1.245418  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr4_2801  HTHTETR  55     9e-12    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 55.4 bits (133), Expect = 9e-12
Identities = 20/134 (14%), Positives = 45/134 (33%), Gaps = 4/134 (2%)

Query: 13 DKRQQLISTAFKLFYFQSVHGVGINQILQESAIAKKTLYHHFASKDELVEAVVLYRDHVF 72
+ RQ ++ A +LF Q V + +I + + + + +Y HF K +L + +
Sbjct: 11 ETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNI 70

Query: 73 YQWLSER-VQAAETGKAGIRALFMALDDWFNQRVPQLCEFRGCFFINASAEFTDASHPVH 131
+ E + + +R + + + + + F EF V
Sbjct: 71 GELELEYQAKFPGDPLSVLREILIHVLESTVTE-ERRRLLMEIIF--HKCEFVGEMAVVQ 127

Query: 132 RLCAEHKQRVADLM 145
+ D +
Sbjct: 128 QAQRNLCLESYDRI 141


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2804  INFPOTNTIATR  140    1e-44    Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 140 bits (355), Expect = 1e-44
Identities = 67/132 (50%), Positives = 89/132 (67%), Gaps = 2/132 (1%)

Query: 25 KAAQENIRLGNEFLAQNKTQEGVKTTASGLQYQVLKQGTGTVHPKASDTVTVHYHGTLID 84
K A+EN G+ FL+ NK++ G+ SGLQY+++ GTG P SDTVTV Y GTLID
Sbjct: 99 KKAEENKAKGDAFLSANKSKPGIVVLPSGLQYKIIDAGTGA-KPGKSDTVTVEYTGTLID 157

Query: 85 GTVFDSSVERGEPIAFPLNRVIPGWTEGVQLMVEGDKYRFFIPSELAYGNRST-GKIGGG 143
GTVFDS+ + G+P F +++VIPGWTE +QLM G + F+P++LAYG RS G IG
Sbjct: 158 GTVFDSTEKAGKPATFQVSQVIPGWTEALQLMPAGSTWEVFVPADLAYGPRSVGGPIGPN 217

Query: 144 SVLIFDVELLKV 155
LIF + L+ V
Sbjct: 218 ETLIFKIHLISV 229


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
Shewmr4_2805  MICOLLPTASE  46     8e-07    Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 45.9 bits (108), Expect = 8e-07
Identities = 40/211 (18%), Positives = 77/211 (36%), Gaps = 19/211 (9%)

Query: 542 WEIDADNGDILNAMHEGLGHGEGTTPPANKAPIANAGTDVTVTGTLDVTLNGSASRDPEN 601
++D + + + + G+ T NK P A +D +V ++ +G+ S+D +
Sbjct: 744 HKVDGNGNYVYDVVFHGMNTDTNTDVHVNKEPKAVIKSDSSVIVEEEINFDGTESKDEDG 803

Query: 602 AALSYQWSQVSGPSLSITNADMANAVVQLSATASDVVYVFSLRVTDPEGLSSTDTVTLTH 661
+Y+W G ++ A A + + T Y L VTD G +T++ +
Sbjct: 804 EIKAYEWDFGDG-----EKSNEAKATHKYNKTGE---YEVKLTVTDNNGGINTESKKIKV 855

Query: 662 KAETVNQAPVV--TVPASVTVEAGQSVSINATAT---DADGDSLTYAWTVPS----GVAA 712
+ V+ + P + +A Q N + S Y + V +
Sbjct: 856 VED--KPVEVINESEPNNDFEKANQIAKSNMLVKGTLSEEDYSDKYYFDVAKKGNVKITL 913

Query: 713 SGQNSATLVVTAPAVTQSTQYSLSVLVSDGS 743
+ NS + T Y L +DG+
Sbjct: 914 NNLNSVGITWTLYKEGDLNNYVLYATGNDGT 944


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Shewmr4_2808  FLAGELLIN  32     0.006    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 31.9 bits (72), Expect = 0.006
Identities = 14/87 (16%), Positives = 33/87 (37%), Gaps = 4/87 (4%)

Query: 282 QLASAMEEMSSTIAEVAQNTQLTSTSINTAYDLCLKSSANMKANTQKVEQLAKSVADAAN 341
+A+ + + ++N + T + + N Q+V +L+ + N
Sbjct: 48 AIANRFTSNIKGLTQASRNANDGISIAQTTE----GALNEINNNLQRVRELSVQATNGTN 103

Query: 342 NAHQLNKEAERVASAMGEIDSIAEQTN 368
+ L + + + EID ++ QT
Sbjct: 104 SDSDLKSIQDEIQQRLEEIDRVSNQTQ 130


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits  Score  E-value  Comments
Shewmr4_2809  SECA  31     0.007    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 31.4 bits (71), Expect = 0.007
Identities = 37/173 (21%), Positives = 67/173 (38%), Gaps = 42/173 (24%)

Query: 220 SQVVYPVEQRRKRELLSELIGK-KNWQQVLVFTATRDAADTLVKELNLDGIPSEVVHGEK 278
+VY E + + ++ ++ + Q VLV T + + ++ + EL GI V++
Sbjct: 424 PDLVYMTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSELVSNELTKAGIKHNVLN--- 480

Query: 279 AQGSRRRALREFMSGKV-RVLVATEVAARGLDI---------------PSLEYVVNFDLP 322
A+ A +G V +AT +A RG DI P+ E +
Sbjct: 481 AKFHANEAAIVAQAGYPAAVTIATNMAGRGTDIVLGGSWQAEVAALENPTAEQIEKIKAD 540

Query: 323 FLAED---------YV-----H---RI-----GRTGRAGKSGVAISFVSREEE 353
+ ++ H RI GR+GR G +G + ++S E+
Sbjct: 541 WQVRHDAVLEAGGLHIIGTERHESRRIDNQLRGRSGRQGDAGSSRFYLSMEDA 593



Score = 29.8 bits (67), Expect = 0.022
Identities = 47/256 (18%), Positives = 95/256 (37%), Gaps = 41/256 (16%)

Query: 23 KMTPIQQQAIPAIRRGQDVLASAQTGTGKTAAFALPI-LQKMAENPSETLKSNARVLILT 81
M Q + + + +A +TG GKT LP L + V ++T
Sbjct: 80 GMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNALTGKG---------VHVVT 130

Query: 82 PTRELAAQVADNVEAYSKYLNFSVLTIYGGVKVETQAQKLKR---GADIIVATPGRLLEH 138
LA + A+N ++L +V G+ + KR ADI T
Sbjct: 131 VNDYLAQRDAENNRPLFEFLGLTV-----GINLPGMPAPAKREAYAADITYGTNNEYGFD 185

Query: 139 LTACNLSLSSID-------FLVLDEADRMLDMGFNADIQKILQAVNKKRQNLLFSATFSS 191
N++ S + + ++DE D +L +++ R L+ S
Sbjct: 186 YLRDNMAFSPEERVQRKLHYALVDEVDSIL--------------IDEARTPLIISGPAED 231

Query: 192 AVKKLANEMMVKPQVISADKQNTTADTVSQVVYPVEQRRKRELLSELIGKKNWQQVLVFT 251
+ + + P +I +K+++ + + V+++ ++ L+E G +++LV
Sbjct: 232 SSEMYKRVNKIIPHLIRQEKEDSETFQG-EGHFSVDEKSRQVNLTER-GLVLIEELLVKE 289

Query: 252 ATRDAADTLVKELNLD 267
D ++L N+
Sbjct: 290 GIMDEGESLYSPANIM 305


92  Shewmr4_2930  Shewmr4_2938  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_2930  0       9        1.922110   glutaredoxin 1
Shewmr4_2931  -1      12       1.989459   hypothetical protein
Shewmr4_2932  -2      10       2.176047   hypothetical protein
Shewmr4_2933  -2      10       2.076106   *******hypothetical protein
Shewmr4_2934  -2      10       1.254201   hypothetical protein
Shewmr4_2935  -2      11       0.354014   OmpA/MotB domain-containing protein
Shewmr4_2936  -2      13       -1.088592  hypothetical protein
Shewmr4_2937  1       19       -3.274478  translocation protein TolB
Shewmr4_2938  1       23       -4.288986  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr4_2930  PF00577  33     0.001    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 32.5 bits (74), Expect = 0.001
Identities = 14/101 (13%), Positives = 37/101 (36%), Gaps = 3/101 (2%)

Query: 104 VSYDVTLN--RYNYSGESDLGYFEVTAGVEFSGFRV-AYWYTNDYGGSDLDYHYGEINYS 160
++Y+ + N + G S Y + +G+ +R+ + + +
Sbjct: 187 LNYNFSGNSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHI 246

Query: 161 YEFVENWSLDLHYGYNVGDALDDGEGFDSYSDYSVGVSTEF 201
++E + L +GD G+ FD + ++++
Sbjct: 247 NTWLERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDD 287


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2932  VACCYTOTOXIN  30     0.021    Helicobacter pylori vacuolating cytotoxin signature.
		>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature.

Length = 1291

Score = 30.0 bits (67), Expect = 0.021
Identities = 19/82 (23%), Positives = 35/82 (42%), Gaps = 8/82 (9%)

Query: 2 STTAPGSVA---NTGVNAQDAAAKKSQSKPRLMSLDALRGFDMFWILGGEALFGALLILT 58
+ A G+V+ G+ + A K ++ + A +GF+ + L+ +LL
Sbjct: 50 TGAAVGTVSGLLGWGLKQAEEANKTPDKPDKVWRIQAGKGFNE-FPNKEYDLYKSLLSSK 108

Query: 59 GWAGWQWGDTQMHH----SEWN 76
GW WG+ H+ +WN
Sbjct: 109 IDGGWDWGNAARHYWVKDGQWN 130


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr4_2933  UREASE  30     0.019    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 29.7 bits (67), Expect = 0.019
Identities = 13/26 (50%), Positives = 18/26 (69%)

Query: 334 PAEFLGIAKNVGRLAVGQRADLVLLD 359
PA G++ +G L VG+RADLVL +
Sbjct: 413 PAIAHGLSHEIGSLEVGKRADLVLWN 438


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2936  TRNSINTIMINR  30     0.035    Translocated intimin receptor (Tir) signature.
		>TRNSINTIMINR#Translocated intimin receptor (Tir) signature.

Length = 549

Score = 30.5 bits (68), Expect = 0.035
Identities = 18/45 (40%), Positives = 25/45 (55%), Gaps = 1/45 (2%)

Query: 5 IAATAILLALGLTACSDVP-KTEAVPSSSTAEQAKPNQLTQAQLQ 48
+AAT I AL LT D P T+ +++ AE A +QLTQ +
Sbjct: 248 LAATGIAQALALTPEPDDPTTTDPDQAANAAESATKDQLTQEAFK 292


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2938  BINARYTOXINA  30     0.009    Clostridial binary toxin A signature.
		>BINARYTOXINA#Clostridial binary toxin A signature.

Length = 454

Score = 30.4 bits (68), Expect = 0.009
Identities = 15/48 (31%), Positives = 25/48 (52%), Gaps = 2/48 (4%)

Query: 247 YFFVSPKRPELAAAILAGLENMISDGSFDEMFNRELKIDKLYRDAQFE 294
Y+F SP++ I +N IS F+E+ +E DKL++ F+
Sbjct: 133 YYFESPEKFAFNKEIRTENQNEISLEKFNEL--KETIQDKLFKQDGFK 178


93  Shewmr4_2947  Shewmr4_2953  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_2947  3       39       -8.005000  ribonuclease T
Shewmr4_2948  3       39       -7.900541  alkyl hydroperoxide reductase/ Thiol specific
Shewmr4_2949  2       36       -6.807051  Na+/H+ antiporter NhaC
Shewmr4_2950  1       34       -6.233459  hypothetical protein
Shewmr4_2951  -1      28       -5.114913  RDD domain-containing protein
Shewmr4_2952  -1      21       -2.990907  uracil phosphoribosyltransferase
Shewmr4_2953  0       17       -1.529901  phosphoribosylaminoimidazole synthetase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2947  BCTERIALGSPG  36     1e-05    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 36.4 bits (84), Expect = 1e-05
Identities = 15/45 (33%), Positives = 26/45 (57%)

Query: 3 NKLLGFTLVELMVTIAVAAILLTIGVPSLISVYEGVRVNNNIAKI 47
+K GFTL+E+MV I + +L ++ VP+L+ E ++ I
Sbjct: 5 DKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDI 49


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2949  BCTERIALGSPG  56     1e-12    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 55.7 bits (134), Expect = 1e-12
Identities = 21/64 (32%), Positives = 41/64 (64%)

Query: 2 KKNRLQGFTLIEVMIAVVIVGILASIAYPSYIDYVVKSGRSEGVAAVMKVANLQEQYYLD 61
++ +GFTL+E+M+ +VI+G+LAS+ P+ + K+ + + V+ ++ + N + Y LD
Sbjct: 3 ATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLD 62

Query: 62 NRSY 65
N Y
Sbjct: 63 NHHY 66


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr4_2952  PF05307  31     0.006    Bundlin
		>PF05307#Bundlin

Length = 193

Score = 30.5 bits (68), Expect = 0.006
Identities = 37/143 (25%), Positives = 61/143 (42%), Gaps = 11/143 (7%)

Query: 17 QRGLSLVELMVAMVIGLFLTAGVFTMFSMSSSNVTTTSQFNQLQENGRIALAILERDITQ 76
++GLSL+E + + + +TAGV MF S++ + SQ N + E AI I Q
Sbjct: 11 EKGLSLIESAMVLALAATVTAGV--MFYYQSASDSNKSQ-NAISEVMSATSAINGLYIGQ 67

Query: 77 LAFMG---DMTGTDFIIGTNTTDEVPTLSSDCIGAGLNNGTLPNTQSAHFRRLWGYESVS 133
++ G ++ I N D ++ G L G NT ++ L G +
Sbjct: 68 TSYTGLNSNILLNTSAIPDNYKDTKNNKITNPFGGELEVGPAANTSFGYYLTLTGLDKA- 126

Query: 134 SESLSCISSSNVYAGIDKKGTDV 156
+C+S + + G KG V
Sbjct: 127 ----ACVSLATLNLGTSAKGYGV 145


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_2953  BCTERIALGSPG  32     5e-04    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 32.2 bits (73), Expect = 5e-04
Identities = 10/24 (41%), Positives = 18/24 (75%), Gaps = 2/24 (8%)

Query: 13 QRGFSLIEVLVALVIL--VIGLIG 34
QRGF+L+E++V +VI+ + L+
Sbjct: 7 QRGFTLLEIMVVIVIIGVLASLVV 30


94  Shewmr4_3016  Shewmr4_3025  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_3016  0       15       1.598581   hypothetical protein
Shewmr4_3017  2       16       2.360348   hypothetical protein
Shewmr4_3018  1       14       1.481769   hypothetical protein
Shewmr4_3019  2       13       1.349972   hypothetical protein
Shewmr4_3020  2       11       1.153585   GTP cyclohydrolase II
Shewmr4_3021  2       12       0.893433   hypothetical protein
Shewmr4_3022  1       11       0.694221   anaerobic ribonucleotide reductase-activating
Shewmr4_3023  0       10       -0.684478  anaerobic ribonucleoside triphosphate reductase
Shewmr4_3024  -1      13       0.647813   hypothetical protein
Shewmr4_3025  0       15       0.788792   patatin
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Shewmr4_3016  RTXTOXIND  60     6e-12    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 59.8 bits (145), Expect = 6e-12
Identities = 51/320 (15%), Positives = 95/320 (29%), Gaps = 80/320 (25%)

Query: 66 ITPAVKGLVISVDVKPNTPIKQGDVLFRIDPTPFEAVVKRKRAALLAA------------ 113
I P +V + VK +++GDVL ++ EA + +++LL A
Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSR 158

Query: 114 ----------------------EQEVPQLEAAWESAKANVARVAADRERNKSAYDRYEQG 151
E+EV +L + + + +E N
Sbjct: 159 SIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLT 218

Query: 152 H----RKGGANSPFTELELDNRRQLF----------LASEAQLTAAQAE----------- 186
+ S + LD+ L L E + A E
Sbjct: 219 VLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQI 278

Query: 187 -----ELRARLA-----YESNVDG----VNSKVAGLQGDLESALYNLEQTVVRAPADGIV 232
+ +++ + + L +L + +V+RAP V
Sbjct: 279 ESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKV 338

Query: 233 TQMALR-PGAMAVPLPLRPVMSFIPDEQRYFAGAFWQNSLL-RLQEGDEAEVVLDAAPGQ 290
Q+ + G V +M +P++ A QN + + G A + ++A P
Sbjct: 339 QQLKVHTEG--GVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYT 396

Query: 291 ---VFKGKVAKVLPAMAEGE 307
GKV + E +
Sbjct: 397 RYGYLVGKVKNINLDAIEDQ 416


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr4_3019  HTHFIS  29     0.045    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.0 bits (65), Expect = 0.045
Identities = 15/60 (25%), Positives = 29/60 (48%), Gaps = 6/60 (10%)

Query: 274 RAPAVQ-VSTDAKAALQPDAPKGVLLLGVQGSGKSLAAKAV---AGVWQRPLLRLDMGAL 329
R+ A+Q + +Q D +++ G G+GK L A+A+ P + ++M A+
Sbjct: 142 RSAAMQEIYRVLARLMQTDLT--LMITGESGTGKELVARALHDYGKRRNGPFVAINMAAI 199


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr4_3020  PF06057  31     0.005    Type IV secretory pathway VirJ component
		>PF06057#Type IV secretory pathway VirJ component

Length = 243

Score = 31.3 bits (71), Expect = 0.005
Identities = 10/28 (35%), Positives = 13/28 (46%), Gaps = 2/28 (7%)

Query: 17 GCGELGKEVAIELQRLGVEVIGVD--RY 42
G L K V LQ+ G V+G +Y
Sbjct: 62 GWATLDKAVGGILQQQGWPVVGWSSLKY 89


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3021  BCTERIALGSPC  33     0.002    Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C signature.

Length = 272

Score = 33.4 bits (76), Expect = 0.002
Identities = 26/108 (24%), Positives = 45/108 (41%), Gaps = 12/108 (11%)

Query: 348 GMIWFRLPLEGDKRVWPLSTLIAVAQQQPLAPH-IELEILSQANSESAQHEAPGSS---L 403
MI++R+ L + V + A A+QQP+ + L +S +++ +A S
Sbjct: 31 AMIFWRIGLPDNAPVSSVQITPAQARQQPVTLNDFTLFGVSPEKNKAGALDASQMSNLPP 90

Query: 404 FQLVLVNKGNLAGKLPSQLSLAAQACSGY-------DAQNGYQAKLTQ 444
L L G +AG S+ S+A + + GY AK+
Sbjct: 91 STLNLSLTGVMAGDDDSR-SIAIISKDNEQFSRGVNEEVPGYNAKIVS 137


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3022  SYCDCHAPRONE  29     0.050    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 28.7 bits (64), Expect = 0.050
Identities = 17/68 (25%), Positives = 33/68 (48%), Gaps = 6/68 (8%)

Query: 103 ENSELSEAQLALVNSLRDAQSLAEAEQIAAQLKESLAPALTWYSLGAMAFDAKEYDKASD 162
E ++ E QLA+ + L+ ++A +I++ E L YSL + + +Y+ A
Sbjct: 4 ETTDTQEYQLAMESFLKGGGTIAMLNEISSDTLEQL------YSLAFNQYQSGKYEDAHK 57

Query: 163 YFKKVIAL 170
F+ + L
Sbjct: 58 VFQALCVL 65


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Shewmr4_3025  V8PROTEASE  38     9e-05    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 38.1 bits (88), Expect = 9e-05
Identities = 18/58 (31%), Positives = 30/58 (51%), Gaps = 1/58 (1%)

Query: 635 IDSVPVNFLS-TLDTTGGNSGSPTLNGRAELVGLLFDGVYESIIGDWAYDDNINRSIQ 691
I + + L TTGGNSGSP N + E++G+ + GV G ++N+ ++
Sbjct: 218 ITYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHWGGVPNEFNGAVFINENVRNFLK 275



Score = 30.4 bits (68), Expect = 0.025
Identities = 9/53 (16%), Positives = 15/53 (28%), Gaps = 9/53 (16%)

Query: 43 DAKSISKLTEFPMNAVISLG--------GCTASFVSPKGLVVTNHHCAYGSIQ 87
D I+ T V + + V K ++TN H +
Sbjct: 75 DRHQITDTTNGHYAPVTYIQVEAPTGTFIASG-VVVGKDTLLTNKHVVDATHG 126


95  Shewmr4_3271  Shewmr4_3290  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_3271  -2      14       1.798179   ribosome recycling factor
Shewmr4_3272  0       18       1.767505   uridylate kinase
Shewmr4_3273  1       18       2.352117   elongation factor Ts
Shewmr4_3274  1       19       3.465986   30S ribosomal protein S2
Shewmr4_3275  1       19       3.539229   methionine aminopeptidase
Shewmr4_3277  1       15       2.779176   PII uridylyl-transferase
Shewmr4_3278  1       17       1.720657   2,3,4,5-tetrahydropyridine-2,6-carboxylate
Shewmr4_3279  -1      14       1.967062   hypothetical protein
Shewmr4_3280  -1      16       1.601713   hypothetical protein
Shewmr4_3281  -1      15       1.458948   metal dependent phosphohydrolase
Shewmr4_3282  -2      12       2.288572   formyltetrahydrofolate deformylase
Shewmr4_3283  -1      14       3.026091   PTS system, glucose-like IIB subunit
Shewmr4_3284  0       13       3.287072   flavodoxin
Shewmr4_3285  2       12       3.504190   tRNA pseudouridine synthase C
Shewmr4_3286  2       11       3.739881   hypothetical protein
Shewmr4_3287  2       12       3.528148   hypothetical protein
Shewmr4_3288  2       10       1.641969   hypothetical protein
Shewmr4_3289  1       12       0.684762   hypothetical protein
Shewmr4_3290  2       16       -1.004117  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Shewmr4_3271  V8PROTEASE  77     1e-17    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 77.0 bits (189), Expect = 1e-17
Identities = 42/192 (21%), Positives = 70/192 (36%), Gaps = 34/192 (17%)

Query: 89 RGLGSGVIIDADKGYIVTNNHVIDGADDIQVGLH------------DGREVKAKLIGTDS 136
+ SGV++ K ++TN HV+D L +G ++
Sbjct: 101 TFIASGVVVG--KDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSG 158

Query: 137 ESDIALLQIEA--------KNLVAIKTSDSDELRVGDFAVAIGNPFGLGQTVTSGIVSAL 188
E D+A+++ + + S++ E +V G P V+ +
Sbjct: 159 EGDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKP-------VATM 211

Query: 189 GRSGLGIEMLEN-FIQTDAAINSGNSGGALVNLKGELIGINTAIVAPGGGNVGIGFAIPA 247
S I L+ +Q D + GNSG + N K E+IGI+ G N G
Sbjct: 212 WESKGKITYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHWG----GVPNEFNGAVFIN 267

Query: 248 NMVKNLVAQIAE 259
V+N + Q E
Sbjct: 268 ENVRNFLKQNIE 279


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3272  V8PROTEASE    74     6e-17    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 74.3 bits (182), Expect = 6e-17
Identities = 35/177 (19%), Positives = 66/177 (37%), Gaps = 33/177 (18%)

Query: 81 GSLQGLGSGVIMSKEGYILTNYHVIKKADEIVVALQ------------DGRKFTSEVVGF 128
+ + SGV++ K +LTN HV+ AL+ +G ++ +
Sbjct: 98 PTGTFIASGVVVGK-DTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKY 156

Query: 129 DPETDLSVLKIE--------GDNLPTVPVNLDSPPQVGDVVLAIGNPYNLGQTITQGIIS 180
E DL+++K G+ + ++ ++ QV + G P + T
Sbjct: 157 SGEGDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGD-KPVATMW--E 213

Query: 181 ATGRNGLSSGYLDFLQTDAAINAGNSGGALIDTNGSLIGINTAAFQVGGEGGGHGIN 237
+ G+ G +Q D + GNSG + + +IGI+ G + N
Sbjct: 214 SKGKITYLKGE--AMQYDLSTTGGNSGSPVFNEKNEVIGIHWG-------GVPNEFN 261


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3278  HTHFIS        83     2e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 83.0 bits (205), Expect = 2e-20
Identities = 28/134 (20%), Positives = 60/134 (44%)

Query: 23 ILLVEDEQDLAQMIMVNLTALNFRVFHAASLHQANALLQAKRIDLVLLDRMLPDGDGLLL 82
IL+ +D+ + ++ L+ + V ++ + A DLV+ D ++PD + L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 83 CQQLRNDGQQMPVMLLTARDGEADTVLGLESGADDYMTKPFSVLELRARTKALLRRHLSA 142
+++ +PV++++A++ + E GA DY+ KPF + EL L
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125

Query: 143 TPTRQLIEFEGLRI 156
+ +G+ +
Sbjct: 126 PSKLEDDSQDGMPL 139


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3279  OUTRMMBRANEA  28     0.046    Outer membrane protein A signature.
		>OUTRMMBRANEA#Outer membrane protein A signature.

Length = 346

Score = 27.6 bits (61), Expect = 0.046
Identities = 6/16 (37%), Positives = 6/16 (37%)

Query: 27 PPPEPPAPPPVVMKSF 42
P P P V K F
Sbjct: 200 VAPAPAPAPEVQTKHF 215


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3283  HTHFIS        65     2e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 65.2 bits (159), Expect = 2e-14
Identities = 41/167 (24%), Positives = 68/167 (40%), Gaps = 8/167 (4%)

Query: 1 MSQIKVAIADDHPLFRTALTQAVLKNVNTADVLEAENFQELITLVENNPDIELIFLDLHM 60
M+ + +ADD RT L QA L DV N L + +L+ D+ M
Sbjct: 1 MTGATILVADDDAAIRTVLNQA-LSRAG-YDVRITSNAATLWRWIAAGD-GDLVVTDVVM 57

Query: 61 PGNEGFTGLTLLQNHFPDIAVIMVSSDDQPEIIRKAINFGASAFIPKSASLTQISTAIAT 120
P F L ++ PD+ V+++S+ + KA GA ++PK LT++ I
Sbjct: 58 PDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117

Query: 121 VLEGEVWLPEHTDINVDQQ-----TAAEHQRLAKQLAQLTPQQYTVL 162
L P + + +A Q + + LA+L T++
Sbjct: 118 ALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLM 164


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3284  HTHFIS        39     9e-05    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 39.0 bits (91), Expect = 9e-05
Identities = 13/71 (18%), Positives = 29/71 (40%), Gaps = 2/71 (2%)

Query: 1048 ISVLVIDNDELMLKAISSLLLGWGCHVLTARDKTSAEQQLVQQVLPKLIIADYHLDDDQN 1107
++LV D+D + ++ L G V + + + + L++ D + D+ N
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWI-AAGDGDLVVTDVVMPDE-N 61

Query: 1108 GVDLVQSLLTH 1118
DL+ +
Sbjct: 62 AFDLLPRIKKA 72


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3285  TCRTETB       29     0.024    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 29.5 bits (66), Expect = 0.024
Identities = 26/107 (24%), Positives = 40/107 (37%), Gaps = 4/107 (3%)

Query: 252 GIVGTIAGILYSRKQPLRLPIIRLSGLLIFLTVLGLSFGSAPWLQTLCAI-VLGFCIFLP 310
I G I GIL R+ PL + I ++ L + + W T+ + VLG F
Sbjct: 307 IIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTK 366

Query: 311 VTALVSIPHELPKMTSQKITVIFSLFWSISYLISTLVLWLFGKLVDI 357
+ L Q+ SL S+L + + G L+ I
Sbjct: 367 TVISTIVSSSL---KQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSI 410


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3289  SURFACELAYER  30     0.044    Lactobacillus surface layer protein signature.
		>SURFACELAYER#Lactobacillus surface layer protein signature.

Length = 439

Score = 29.6 bits (66), Expect = 0.044
Identities = 23/101 (22%), Positives = 42/101 (41%), Gaps = 7/101 (6%)

Query: 400 VSALSAPVELTVRVGPEPANAAPALSNIQVSVSGQCASVTGSVVDANQNLASVTVGFSSG 459
++A + PV + + A A + V V+ S++ A + G +G
Sbjct: 21 IAATAMPVNAATTINADSAINANTNAKYDVDVT---PSISAIAAVAKSDTMPAIPGSLTG 77

Query: 460 QQVSASINGTQYSAQGCNLPGGANLATVIAMDSTQLSSQDS 500
+SAS NG Y+A NLP + AT+ ++ + +
Sbjct: 78 S-ISASYNGKSYTA---NLPKDSGNATITDSNNNTVKPAEL 114


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3290  DHBDHDRGNASE  118    4e-34    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 118 bits (297), Expect = 4e-34
Identities = 76/259 (29%), Positives = 121/259 (46%), Gaps = 6/259 (2%)

Query: 33 GLKGKVGLITGSTSGIGLATAHVLAEQGCHLILHGLMPEAEGQRLAAEFAEQYHIHTFFS 92
G++GK+ ITG+ GIG A A LA QG H+ PE + +++ AE H F
Sbjct: 5 GIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF-- 62

Query: 93 NADLRDPESIHAFMDAGVKALGSIDILVNNAGIQHTENVAHFPIDKWNDIIAINLSSAFH 152
AD+RD +I + +G IDILVN AG+ + ++W ++N + F+
Sbjct: 63 PADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 153 TIQQAVPAMAEKRWGRIINIASVHGLVASVNKAAYCAAKHGIVGLTKVVAIECAEQGITV 212
+ M ++R G I+ + S V + AAY ++K V TK + +E AE I
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 213 NAICPGWVDTPLINK-QIEAIASNKGLSYDEAKYQLVTAKQPLPEMLDPRQIGEFVLFLC 271
N + PG +T + + + + + ++ PL ++ P I + VLFL
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGI---PLKKLAKPSDIADAVLFLV 239

Query: 272 SSAARGITGASLAMDGAWT 290
S A IT +L +DG T
Sbjct: 240 SGQAGHITMHNLCVDGGAT 258


96  Shewmr4_3334  Shewmr4_3341  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_3334  -1      19       2.118153   succinylglutamate desuccinylase/aspartoacylase
Shewmr4_3335  -1      19       1.564148   MarR family transcriptional regulator
Shewmr4_3336  -1      18       2.302794   PhnA protein
Shewmr4_3337  -1      16       1.861368   hypothetical protein
Shewmr4_3338  -1      17       0.929557   TetR family transcriptional regulator
Shewmr4_3339  -2      10       1.249182   glutathione S-transferase domain-containing
Shewmr4_3340  0       14       2.085240   glutathione S-transferase domain-containing
Shewmr4_3341  0       14       2.561871   Fmu (Sun) domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3334  2FE2SRDCTASE  28     0.021    Ferric iron reductase signature.
		>2FE2SRDCTASE#Ferric iron reductase signature.

Length = 262

Score = 28.1 bits (62), Expect = 0.021
Identities = 18/73 (24%), Positives = 29/73 (39%), Gaps = 9/73 (12%)

Query: 73 FQDSVARLSDFDFGFMPLLPEEEEPLSQRVEALSLWTQSFLTGIAIIQPKLNKASAEVRE 132
+A SD + P++ E +PL +SLW Q + I ++ P L A +
Sbjct: 66 LSSLLAVYSDHIYRNQPMMIRENKPL------ISLWAQWY---IGLMVPPLMLALLTQEK 116

Query: 133 VIKDLAEIAQVEF 145
+ E EF
Sbjct: 117 ALDVSPEHFHAEF 129


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3337  DHBDHDRGNASE  46     3e-08    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 46.2 bits (109), Expect = 3e-08
Identities = 52/257 (20%), Positives = 97/257 (37%), Gaps = 31/257 (12%)

Query: 5 IIITGVGKRIGYALAKHLLAQGHKVIG-----TYRSHYPSIDELQSLGATLIQCDFYDNT 59
ITG + IG A+A+ L +QG + S + ++ A D D+
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSA 70

Query: 60 QLQTLIEQL-SQYPKIRAIIHNASDWLPDNSPSLAAHEVMQRMMQVHVSVPYQMNLALAS 118
+ + ++ + I +++ A P SL+ E + V+ + + + +++
Sbjct: 71 AIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEW-EATFSVNSTGVFNASRSVSK 129

Query: 119 QLRAGAEGEIG--ASDIIHFTDYVAEKGSAKHMAYAASKAALDNLTLSFAAKLAPE-VKV 175
+ G I S+ V A AYA+SKAA T +LA ++
Sbjct: 130 YMMDRRSGSIVTVGSNPAG----VPRTSMA---AYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 176 NAIAPAMI-------LFNPSDDEAYRQKTLAKAI-----LPKEAGNQEIIALVDYLLASR 223
N ++P L+ + K + L K A +I V +L++ +
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 224 --YVTGRSHNVDGGRHL 238
++T + VDGG L
Sbjct: 243 AGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3340  ARGREPRESSOR  147    2e-48    Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 147 bits (372), Expect = 2e-48
Identities = 43/150 (28%), Positives = 71/150 (47%), Gaps = 5/150 (3%)

Query: 6 NQDDLVRIFKAILKEERFGSQSEIVAALQAEGFSNINQSKVSRMLSKFGAVRTRNAKQEM 65
N+ + I+ +Q E+V L+ +G+ N+ Q+ VSR + + V+
Sbjct: 2 NKGQRHIKIREIITANEIETQDELVDILKKDGY-NVTQATVSRDIKELHLVKVPTNNGSY 60

Query: 66 VYCLPAELGVPTAGSPLKNLV---LDVDHNQAMIVVRTSPGAAQLIARLLDSIGKPEGIL 122
Y LPA+ ++L+ + +D +IV++T PG AQ I L+D++ E I+
Sbjct: 61 KYSLPADQRFNPLSKLKRSLMDAFVKIDSASHLIVLKTMPGNAQAIGALMDNLDWEE-IM 119

Query: 123 GTIAGDDTIFICPSSIQDIADTLETIKSLF 152
GTI GDDTI I + D + I L
Sbjct: 120 GTICGDDTILIICRTHDDTKVVQKKILELL 149


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3341  NUCEPIMERASE  36     6e-05    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 35.9 bits (83), Expect = 6e-05
Identities = 28/123 (22%), Positives = 43/123 (34%), Gaps = 23/123 (18%)

Query: 1 MKIAVLGASGWIGGTIFNEARSRGHEVVAL-----VRDPS-------KLGETEAEVRSVD 48
MK V GA+G+IG + GH+VV + D S L + + +D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 49 LT-QPLEADTFA--GVDVVI---AAVGARAEQNHGIVAKTVN-----NLLAVLPQAKVPR 97
L + D FA + V + R + N N+L K+
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 98 LLW 100
LL+
Sbjct: 121 LLY 123


97  Shewmr4_3401  Shewmr4_3408  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_3401  -1      12       1.062053   Ion transport 2 domain-containing protein
Shewmr4_3402  -2      12       1.142429   hypothetical protein
Shewmr4_3403  -2      14       1.782091   hypothetical protein
Shewmr4_3404  -3      15       1.996504   multi anti extrusion protein MatE
Shewmr4_3405  -2      14       1.809832   major facilitator transporter
Shewmr4_3406  -2      12       2.337567   hypothetical protein
Shewmr4_3407  -2      10       2.505828   hypothetical protein
Shewmr4_3408  -1      13       2.406317   ABC transporter periplasmic substrate-binding
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3401  RTXTOXIND     42     3e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 41.7 bits (98), Expect = 3e-06
Identities = 44/269 (16%), Positives = 91/269 (33%), Gaps = 49/269 (18%)

Query: 25 SSVQATAIRPVKLFEVVQLEGGDFRTFPAR--VSANSRAELSFRISGELTDLALVEGQ-- 80
+V A R L V + DF + + ++ ++ E + + +L + + Q
Sbjct: 217 LTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLE 276

Query: 81 QIRQGSLLAKLDDRDAHNNLMTREAEHELLAADFQRKTELLKRKLISQAEFDSTQAQLKS 140
QI L AK E++L+ F+ + + Q +
Sbjct: 277 QIESEILSAKE--------------EYQLVTQLFKNEI----LDKLRQT-----TDNIGL 313

Query: 141 AKAALAAARDQLSYTKLIAPFSGTVAKRLVDNH-QIVQANQGILTL-QNNNLLDVSIQVP 198
LA ++ + + AP S V + V +V + ++ + ++ L+V+ V
Sbjct: 314 LTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQ 373

Query: 199 EAMAASLNTYVQQQNFTAKVRFSALAGMEF---DAKFKEYSTQVTPGTQ---AYEVVFSL 252
+ A ++ A + K K + + + V+ S+
Sbjct: 374 NKDIGFI-----NVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISI 428

Query: 253 PQP------KDIQLLPGMSAELTLALVKT 275
+ K+I L GM+ A +KT
Sbjct: 429 EENCLSTGNKNIPLSSGMAVT---AEIKT 454



Score = 28.6 bits (64), Expect = 0.037
Identities = 11/84 (13%), Positives = 30/84 (35%), Gaps = 5/84 (5%)

Query: 68 SGELTDLALVEGQQIRQGSLLAKLDDRDAHNNLMTREAEHELLAADFQRKTELLKRKLIS 127
+ + ++ + EG+ +R+G +L KL A + + ++ + R + L
Sbjct: 104 NSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTR-----YQILSR 158

Query: 128 QAEFDSTQAQLKSAKAALAAARDQ 151
E + + ++
Sbjct: 159 SIELNKLPELKLPDEPYFQNVSEE 182


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3402  ACRIFLAVINRP  502    e-163    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 502 bits (1294), Expect = e-163
Identities = 207/1046 (19%), Positives = 443/1046 (42%), Gaps = 53/1046 (5%)

Query: 3 IAEYSIRHKVISWMFVLLLLVGGGVSFTGLGQLEFPEFTIKEALVITAYPGASPEQVEEE 62
+A + IR + +W+ ++L++ G ++ L ++P V YPGA + V++
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 63 VTLPLEDALQQLDAVKHVTSI-NSAGLSQIQIEIKENYDKTSLPQVWDEVRRKVNDTAGQ 121
VT +E + +D + +++S +SAG I + + T +V+ K+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQ---SGTDPDIAQVQVQNKLQLATPL 117

Query: 122 LPPGTSTPKVFDDFGD---VYGILFNLSGPDYSNRELSNYAD-YLRREIVLVPGVKKVSV 177
LP + + + F P + ++S+Y ++ + + GV V +
Sbjct: 118 LPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQL 177

Query: 178 AGSVTEQVVIEISQQRLSALGLDQSYIYGLINNQNVVSNAGSLVVGDN------RIRIHP 231
G+ + I + L+ L + + QN AG L I
Sbjct: 178 FGA-QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIA 236

Query: 232 TGEFSSVQDLARLIVSPPGSTELIYLGDIANIEKDYDETPDVLYHNRGETALSLGISFSS 291
F + ++ ++ + ++ L D+A +E + + N G+ A LGI ++
Sbjct: 237 QTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARIN-GKPAAGLGIKLAT 295

Query: 292 GVNVVEVGKSVSQRLAELESQRPIGMNLDTVYNQSLAVDDTVNGFLINLLESIAIVIAVL 351
G N ++ K++ +LAEL+ P GM + Y+ + V +++ + L E+I +V V+
Sbjct: 296 GANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVM 355

Query: 352 LLFMG-LRSGLLMGLILLLTILGTFIVMKVLGIELQLISLGALIIALGMLVDNAIVVTEG 410
LF+ +R+ L+ + + + +LGTF ++ G + +++ +++A+G+LVD+AIVV E
Sbjct: 356 YLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVEN 415

Query: 411 ILIGLRRGKTR-LEAAKQIVSQTQWPLLGATVIAIIAFAPIGLSQNAAGEFCRSLFQVLM 469
+ + K EA ++ +SQ Q L+G ++ F P+ + G R ++
Sbjct: 416 VERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIV 475

Query: 470 ISLFISWITAITLTPFFCHLLFKDAPADDEEEQDPYKGWF-------FSLYRVSLTFALR 522
++ +S + A+ LTP C L K A+ E + + GWF + Y S+ L
Sbjct: 476 SAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILG 535

Query: 523 FRLASIVLVGVMLVSAVIGFGHIKNVFFPASNTPIFFVDIWMPEGTDIKGTERFTADIEK 582
+++ +++ V+ F + + F P + +F I +P G + T++ +
Sbjct: 536 STGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTD 595

Query: 583 LLLKQAEEQHSGLKHLTSVIG-------QGAQRFILPYQPEKGYPAYAQLIIEMEDLASL 635
LK + ++ + +V G Q A + +P + +
Sbjct: 596 YYLKNEKA---NVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSA--EAVIHRA 650

Query: 636 KVYMPELETLLNQRFPQAQYRFKNMENGPSPAAKIEARFYGDDPEVLRALGAQAEAIFNA 695
K+ + ++ F + ++ + + +A
Sbjct: 651 KMELGKIRDGFVIPFNMP--AIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQH 708

Query: 696 EPSMDGIRHDWRNQVPLIRPQLENAQARETGISKQDLDNALLINFSGKQIGLYRETSHLL 755
S+ +R + + +++ +A+ G+S D++ + G + + + +
Sbjct: 709 PASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVK 768

Query: 756 PIVARAPAEERLQADSLWKLQIWSTEHNTFVPATQVVSQFETQWENPLVKRRDRMRMLAV 815
+ +A A+ R+ + + KL + + + VP + + + +P ++R + + + +
Sbjct: 769 KLYVQADAKFRMLPEDVDKLYV-RSANGEMVPFSAFTT-SHWVYGSPRLERYNGLPSMEI 826

Query: 816 LADPKLGSD-ETADSVLHKVKDKVEAISLPTGYHLEWGGEYETAGEAQTAVFSSIPMGYL 874
+ G+ A +++ + K LP G +W G + + + + ++
Sbjct: 827 QGEAAPGTSSGDAMALMENLASK-----LPAGIGYDWTGMSYQERLSGNQAPALVAISFV 881

Query: 875 VMFLITVFLFNSVRQPLVIWFTVPLALIGVSAGLLLFDAPFSFMALLGLLSLSGMVIKNG 934
V+FL L+ S P+ + VPL ++GV LF+ ++GLL+ G+ KN
Sbjct: 882 VVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNA 941

Query: 935 IVLVDQIN-LELGEGKPAYAALVDSSVSRVRPVLMAAITTMLGMIPLIPDAFFGS----- 988
I++V+ L EGK A + + R+RP+LM ++ +LG++PL GS
Sbjct: 942 ILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNA 1001

Query: 989 MAITIIFGLGFASLLTLIVLPVMYSL 1014
+ I ++ G+ A+LL + +PV + +
Sbjct: 1002 VGIGVMGGMVSATLLAIFFVPVFFVV 1027



Score = 73.3 bits (180), Expect = 3e-15
Identities = 43/219 (19%), Positives = 93/219 (42%), Gaps = 13/219 (5%)

Query: 814 AVLADPKLGSDETADSVLHKVKDKVEAI--SLPTGYHLEWGGEYETAGEAQTA---VFSS 868
A KL + A +K K+ + P G ++ Y+T Q + V +
Sbjct: 286 AAGLGIKLATGANALDTAKAIKAKLAELQPFFPQG--MKVLYPYDTTPFVQLSIHEVVKT 343

Query: 869 IPMGYLVMFLITVFLFNSVRQPLVIWFTVPLALIGVSAGLLLFDAPFSFMALLGLLSLSG 928
+ +++FL+ ++R L+ VP+ L+G A L F + + + G++ G
Sbjct: 344 LFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIG 403

Query: 929 MVIKNGIVLVDQINLELGEGKPAYAALVDSSVSRVR-PVLMAAITTMLGMIPL-----IP 982
+++ + IV+V+ + + E K + S+S+++ ++ A+ IP+
Sbjct: 404 LLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGST 463

Query: 983 DAFFGSMAITIIFGLGFASLLTLIVLPVMYSLAFNIKPN 1021
A + +ITI+ + + L+ LI+ P + +
Sbjct: 464 GAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSA 502


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3407  FLGHOOKAP1    35     5e-04    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 35.3 bits (81), Expect = 5e-04
Identities = 21/110 (19%), Positives = 40/110 (36%), Gaps = 15/110 (13%)

Query: 412 GEILGIE---HKQELVDLHRANGRNVVQGDAADTDFWEKLDKAPNLELVLLAMPHHTGNL 468
+I+G+E ++ ANG ++VQG A + + + +A T
Sbjct: 209 NQIVGVEVSVQDGGTYNITMANGYSLVQGSTARQ--LAAVPSSADPSRTTVAYVDGTAG- 265

Query: 469 FAVEQLKKLNYQGKLSAIV--------QYGDDAASLRTSGVHSVYNLYEA 510
+E +KL G L I+ Q + L + + ++A
Sbjct: 266 -NIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQLALAFAEAFNTQHKA 314


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3408  RTXTOXINA     29     0.031    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 29.2 bits (65), Expect = 0.031
Identities = 11/41 (26%), Positives = 21/41 (51%), Gaps = 4/41 (9%)

Query: 124 LAAGLSSSGALVVAFGTAISDSSQLHLSPMAVAQLAQRGEY 164
A GLS+S A +A++ L +SP++ +A + +
Sbjct: 296 AAQGLSTSAAAAGLIASAVT----LAISPLSFLSIADKFKR 332


98  Shewmr4_3471  Shewmr4_3486  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_3471  1       17       -1.087791  HPP family protein
Shewmr4_3472  1       17       -1.927528  hypothetical protein
Shewmr4_3473  1       21       -2.690494  short chain fatty acid transporter
Shewmr4_3474  1       27       -3.630673  hypothetical protein
Shewmr4_3475  1       27       -3.745107  peptidylprolyl isomerase, FKBP-type
Shewmr4_3476  1       27       -4.062116  hypothetical protein
Shewmr4_3477  1       23       -2.595157  glycosyl hydrolase family chitinase
Shewmr4_3478  -1      19       -0.695385  hypothetical protein
Shewmr4_3479  -2      16       0.773481   ROK family protein
Shewmr4_3480  -2      15       1.868940   peptidase M24
Shewmr4_3481  -3      14       1.789334   methyl-accepting chemotaxis sensory transducer
Shewmr4_3482  -3      14       2.049313   DEAD/DEAH box helicase domain-containing
Shewmr4_3483  -3      14       1.686505   hypothetical protein
Shewmr4_3484  -1      15       1.098260   redoxin domain-containing protein
Shewmr4_3485  -2      15       0.443104   hypothetical protein
Shewmr4_3486  -2      16       -0.155656  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3471  PF05616       32     0.005    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 31.6 bits (71), Expect = 0.005
Identities = 23/79 (29%), Positives = 34/79 (43%), Gaps = 10/79 (12%)

Query: 268 PSPD---SGVTLPNQLPVP---AADHPAEEAKPDAGTGST-NPQGAAANASAPSNAAPNT 320
P PD PN P+P A++PA P+ G+ NP+ + +A P+T
Sbjct: 311 PRPDLTPGSAEAPNAQPLPEVSPAENPANNPAPNENPGTRPNPE---PDPDLNPDANPDT 367

Query: 321 TAPNTTVPNTGATPSNANG 339
T P++ A P NG
Sbjct: 368 DGQPGTRPDSPAVPDRPNG 386


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3472  SHAPEPROTEIN  557    0.0      Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 557 bits (1437), Expect = 0.0
Identities = 315/348 (90%), Positives = 333/348 (95%), Gaps = 1/348 (0%)

Query: 1 MFKKLRGIFSNDLSIDLGTANTLIYVRDEGIVLNEPSVVAIRGERSSSGQKSVAAVGTEA 60
M KK RG+FSNDLSIDLGTANTLIYV+ +GIVLNEPSVVAIR +R+ S KSVAAVG +A
Sbjct: 1 MLKKFRGMFSNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGS-PKSVAAVGHDA 59

Query: 61 KQMLGRTPGNIQAIRPMKDGVIADFYVTEKMLQHFIKQVHNNSFFRPSPRVLVCVPVGAT 120
KQMLGRTPGNI AIRPMKDGVIADF+VTEKMLQHFIKQVH+NSF RPSPRVLVCVPVGAT
Sbjct: 60 KQMLGRTPGNIAAIRPMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGAT 119

Query: 121 QVERRAIRESAMGAGAREVYLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAIISLN 180
QVERRAIRESA GAGAREV+LIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVA+ISLN
Sbjct: 120 QVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLN 179

Query: 181 GVVYSSSVRIGGDKFDDAIINYVRRNYGSLIGEATAERIKHTIGTAYPGDEVLEIEVRGR 240
GVVYSSSVRIGGD+FD+AIINYVRRNYGSLIGEATAERIKH IG+AYPGDEV EIEVRGR
Sbjct: 180 GVVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGR 239

Query: 241 NLAEGVPRSFTLNSNEILEALQEPLSGIVSAVMVALEQSPPELASDISERGMVLTGGGAL 300
NLAEGVPR FTLNSNEILEALQEPL+GIVSAVMVALEQ PPELASDISERGMVLTGGGAL
Sbjct: 240 NLAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGAL 299

Query: 301 LRDLDRLLMQETGIPVMVADDPLTCVARGGGKALEMIDMHGGDLFSEE 348
LR+LDRLLM+ETGIPV+VA+DPLTCVARGGGKALEMIDMHGGDLFSEE
Sbjct: 300 LRNLDRLLMEETGIPVVVAEDPLTCVARGGGKALEMIDMHGGDLFSEE 347


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3475  BCTERIALGSPG  35     1e-04    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 34.9 bits (80), Expect = 1e-04
Identities = 12/24 (50%), Positives = 19/24 (79%)

Query: 8 RMARSKRGFTLVEMVTVILILGIL 31
R +RGFTL+E++ VI+I+G+L
Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVL 25


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3476  BCTERIALGSPH  38     8e-06    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 37.6 bits (87), Expect = 8e-06
Identities = 15/58 (25%), Positives = 32/58 (55%), Gaps = 4/58 (6%)

Query: 21 KQQGFTLIELVVGMLVIAIAIVM-LSSMLFPQADRAAKTLHRVKSA-ELA--HSVMNE 74
+Q+GFTL+E+++ +L++ ++ M L + + D AA+TL R ++ +
Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTG 59


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3477  BCTERIALGSPH  42     2e-07    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 41.9 bits (98), Expect = 2e-07
Identities = 26/80 (32%), Positives = 41/80 (51%), Gaps = 1/80 (1%)

Query: 8 RQFGFTLVELVTTIILIGILSVAVLPRLFSQSSYSAFSLRNEFMAELRQVQQKALNNTDR 67
RQ GFTL+E++ ++L+G+ + VL + SA F A+LR VQQ+ L +
Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQT-GQ 60

Query: 68 CYRVVVSAMGYQVSQFASRD 87
+ V V +Q +RD
Sbjct: 61 FFGVSVHPDRWQFLVLEARD 80


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3478  BCTERIALGSPG  50     2e-10    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 49.9 bits (119), Expect = 2e-10
Identities = 20/53 (37%), Positives = 31/53 (58%), Gaps = 4/53 (7%)

Query: 1 MKKQSGFTLIELVVVIIILGILAVTAAPKFINLQSDARA----STVKGLESAI 49
KQ GFTL+E++VVI+I+G+LA P + + A S + LE+A+
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENAL 56


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3479  BCTERIALGSPG  45     1e-08    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 44.9 bits (106), Expect = 1e-08
Identities = 14/31 (45%), Positives = 23/31 (74%)

Query: 2 MKRQQGFTLIELVVVIIILGILAVTAAPKFI 32
+Q+GFTL+E++VVI+I+G+LA P +
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLM 34


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3480  BCTERIALGSPG  43     1e-07    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 42.6 bits (100), Expect = 1e-07
Identities = 15/36 (41%), Positives = 27/36 (75%)

Query: 4 KQDGFSLIELVIVIVILGLLAATAIPRFLNVTDDAE 39
KQ GF+L+E+++VIVI+G+LA+ +P + + A+
Sbjct: 6 KQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKAD 41


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3482  BCTERIALGSPF  303    e-102    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 303 bits (777), Expect = e-102
Identities = 118/407 (28%), Positives = 208/407 (51%), Gaps = 6/407 (1%)

Query: 1 MPVYQYRGRSGQGQAVTGQLDAASEGAAADMLLARGIIPLEVKV----AKETKSFTLAQL 56
M Y Y+ QG+ G +A S A +L RG++PL V +++ S L+
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 57 FKRKVGLDELQIFTRQMYSLTRSGIPILRAIAGLSETAHSQRMKDALNDISEQLTAGRPL 116
K ++ +L + TRQ+ +L + +P+ A+ +++ + + + + ++ G L
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 117 SSAMNQHPDVFDSLFVSMVHVGENTGKLEDAFIQLSGYIEREQETRRRIKAAMRYPIFVL 176
+ AM P F+ L+ +MV GE +G L+ +L+ Y E+ Q+ R RI+ AM YP VL
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPC-VL 179

Query: 177 IAIALAMV-ILNIMVIPKFAEMFARFGADLPWATKVLIGTSNLFVNYWPLMLIILLGTVI 235
+A+A+V IL +V+PK E F LP +T+VL+G S+ + P ML+ LL +
Sbjct: 180 TVVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFM 239

Query: 236 GIRYWHHTEKGEKQWDQWKLHIPAVGSIIERSTLSRYCRSFSMMLSAGVPMTQALSLVAD 295
R EK + + LH+P +G I +RY R+ S++ ++ VP+ QA+ + D
Sbjct: 240 AFRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGD 299

Query: 296 AVDNAYMHDKIVGMRRGIESGESMLRVSNQSKLFTPLVLQMVAVGEETGQIDQLLNDAAD 355
+ N Y ++ + G S+ + Q+ LF P++ M+A GE +G++D +L AAD
Sbjct: 300 VMSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAAD 359

Query: 356 FYEGEVDYDLKNLTAKLEPILIGIVAVIVLVLALGIYLPMWDMLNVV 402
+ E + EP+L+ +A +VL + L I P+ + ++
Sbjct: 360 NQDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3484  IGASERPTASE   37     1e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 37.4 bits (86), Expect = 1e-04
Identities = 28/168 (16%), Positives = 57/168 (33%), Gaps = 13/168 (7%)

Query: 89 IDTSPLETETTTSTEPTAEM-----------AQVSPSSTDAPAKDMSTSAESAPRVAQSP 137
+DT+ + T + + A V P + P++ T AE++ + +++
Sbjct: 992 VDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTV 1051

Query: 138 RVNAAQVEPTSEPTTESVAAHTSSDTPNKAALTQESVQTQAESQQVAVKANQADVNASQS 197
N T+ E S+ N ++ + Q A V +
Sbjct: 1052 EKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEK-EE 1110

Query: 198 EVKITQTEAKASEPVVPAGAQVSSQASTQASSQASAQAQSTGKMAIRE 245
+ K+ +TE P V + + S QA ++ + I+E
Sbjct: 1111 KAKV-ETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKE 1157



Score = 35.4 bits (81), Expect = 5e-04
Identities = 22/138 (15%), Positives = 45/138 (32%), Gaps = 5/138 (3%)

Query: 103 EPTAEMAQVSPSSTDAP---AKDMSTSAESAPRVAQSPRVNAAQVEPTSEPTTESVAAHT 159
+ Q P+ + P K+ + + Q + ++ VE +T ++
Sbjct: 1135 QSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNS 1194

Query: 160 SSDTPNKAALTQESVQTQAESQQVAVKANQADVNASQSEVKITQTEAKASEPVVPAGAQV 219
+ P +ES ++ V + V+ T + V A +
Sbjct: 1195 VVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTV--ALCDL 1252

Query: 220 SSQASTQASSQASAQAQS 237
+S + S A A+AQ
Sbjct: 1253 TSTNTNAVLSDARAKAQF 1270


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3486  BCTERIALGSPD  187    7e-54    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 187 bits (476), Expect = 7e-54
Identities = 78/317 (24%), Positives = 144/317 (45%), Gaps = 28/317 (8%)

Query: 237 ELKETLSAIIGDTGGGRQVVVT--PQAGLVTIRAYPNELRQVRAFLNSAESHLQRQVILE 294
++ A + +++ Q + + A P+ + + + + + QV++E
Sbjct: 292 TMQSEKQAAKPVAALDKNIIIKAHGQTNALIVTAAPDVMNDLERVIAQLDIR-RPQVLVE 350

Query: 295 AKILEVTLSDGYQQGIQWDNVLGHV---GNTNINFGTSAGAGLS----DKITASLGGVTS 347
A I EV +DG GIQW N + N+ + T+ +++SL S
Sbjct: 351 AIIAEVQDADGLNLGIQWANKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLASALS 410

Query: 348 ------LSIKGSDFNTMISLLDTQGDVDVLSSPRVTASNNQKAVIKVGTDEYFVTDVSST 401
++ +++ L + D+L++P + +N +A VG + +T S
Sbjct: 411 SFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLT--GSQ 468

Query: 402 TVAGTTPVTTPQVELTPFFSGIALDVTPQIDKDGNVLLHVHPSVIDVKEQTKDIKVSSES 461
T +G T + + GI L V PQI++ +VLL + V V + SS S
Sbjct: 469 TTSGDNIFNTVERKTV----GIKLKVKPQINEGDSVLLEIEQEVSSVADAA-----SSTS 519

Query: 462 LELPLAQSEIRESDTVIRAASGDVVVIGGLMKSENTEVVSQVPLLGDIPLVGELFKNRSK 521
+L + R + + SG+ VV+GGL+ ++ +VPLLGDIP++G LF++ SK
Sbjct: 520 SDLGATFN-TRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSK 578

Query: 522 QKKKTELIIMLKPTVVG 538
+ K L++ ++PTV+
Sbjct: 579 KVSKRNLMLFIRPTVIR 595


99  Shewmr4_3515  Shewmr4_3534  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_3515  0       15       -1.545375  tRNA (guanine-N(1)-)-methyltransferase
Shewmr4_3516  -2      15       0.570783   16S rRNA-processing protein RimM
Shewmr4_3517  -1      20       0.137899   30S ribosomal protein S16
Shewmr4_3518  -1      19       0.483300   signal recognition particle protein
Shewmr4_3520  0       18       -3.542142  cytochrome c assembly protein
Shewmr4_3521  -1      15       -3.142481  hypothetical protein
Shewmr4_3522  -1      14       -3.195759  hypothetical protein
Shewmr4_3523  -1      19       -4.271263  hypothetical protein
Shewmr4_3524  0       18       -3.671281  transposase IS200-family protein
Shewmr4_3525  0       15       -2.436717  4'-phosphopantetheinyl transferase
Shewmr4_3526  1       17       1.821581   pyridoxine 5'-phosphate synthase
Shewmr4_3527  0       18       2.888588   DNA repair protein RecO
Shewmr4_3528  -1      18       3.208733   GTP-binding protein Era
Shewmr4_3529  0       19       3.489215   ribonuclease III
Shewmr4_3530  -1      18       2.540623   signal peptidase I
Shewmr4_3531  -2      17       0.687296   GTP-binding protein LepA
Shewmr4_3532  -1      16       -0.079185  positive regulator of sigma(E), RseC/MucC
Shewmr4_3533  -2      18       -0.862343  sigma E regulatory protein, MucB/RseB
Shewmr4_3534  -2      18       -0.362622  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3515  PF03544       60     6e-14    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 60.4 bits (146), Expect = 6e-14
Identities = 16/79 (20%), Positives = 33/79 (41%), Gaps = 1/79 (1%)

Query: 54 RVHPEYPIEAAKNGISGCVALVVGINSSGKPSGYKVKKSYPEGVFDNYATAALANWRWKA 113
R P+YP A I G V + + G+ ++ + P +F+ A+ WR++
Sbjct: 162 RNQPQYPARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEP 221

Query: 114 TDKNSDKKPVLTIIQMDFS 132
K V + +++ +
Sbjct: 222 G-KPGSGIVVNILFKINGT 239


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3517  HTHFIS        86     7e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 85.7 bits (212), Expect = 7e-22
Identities = 37/170 (21%), Positives = 61/170 (35%), Gaps = 16/170 (9%)

Query: 3 TKLQLYLVDDDEAILDSLGFMLGQFGYQVQTFSSGRDFLAQCPLTQAGCVILDSRMPEIT 62
T + + DDD AI L L + GY V+ S+ V+ D MP+
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 63 GQEVQQKLLETQSPLGVIFLTGHGDLPMALSAFRKGACDFFQKPVSGKALVQAIEKAHKE 122
++ ++ + + L V+ ++ A+ A KGA D+ KP L+ I +A E
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 123 SQASFEQLYLQQKFAQLTEREQQVLAHVVQGMTNKQISEAMYLSLRTIEV 172
+ R ++ GM S AM R +
Sbjct: 122 PKR----------------RPSKLEDDSQDGMPLVGRSAAMQEIYRVLAR 155


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3520  BCTERIALGSPG  48     1e-09    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 48.0 bits (114), Expect = 1e-09
Identities = 18/59 (30%), Positives = 33/59 (55%)

Query: 1 MSRLHTSKGFTLIELVVVIIILGILAVVAAPRFINLSQDAHNARAKAAFAAFTSGVKLY 59
M +GFTL+E++VVI+I+G+LA + P + + A +A + A + + +Y
Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMY 59


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr4_3522  HTHFIS  31     0.014    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.6 bits (69), Expect = 0.014
Identities = 11/36 (30%), Positives = 18/36 (50%), Gaps = 3/36 (8%)

Query: 49 TPKNILMIGPTGVGKTEIAR---RLAKLANAPFIKV 81
T +++ G +G GK +AR K N PF+ +
Sbjct: 159 TDLTLMITGESGTGKELVARALHDYGKRRNGPFVAI 194


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr4_3524  FLGFLIH  27     0.036    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 27.5 bits (60), Expect = 0.036
Identities = 15/45 (33%), Positives = 21/45 (46%)

Query: 146 DRDQRLAATDNPVAEARLNQLDDEFYKDLDNLDIKLESYAIQMGL 190
++ A + AR+ QL EF LD LD + S +QM L
Sbjct: 81 EQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRLMQMAL 125


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3527  DHBDHDRGNASE  43     3e-07    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 43.1 bits (101), Expect = 3e-07
Identities = 37/195 (18%), Positives = 74/195 (37%), Gaps = 29/195 (14%)

Query: 55 LEEEIKQLSQNISQLDWLINCIGMLHTEDKGPEKSLQALDGDFFLHNIQLNTLPSMMLAK 114
++E ++ + + +D L+N G+L G SL + + +N+ ++
Sbjct: 72 IDEITARIEREMGPIDILVNVAGVLRP---GLIHSLSDEE---WEATFSVNSTGVFNASR 125

Query: 115 HFEPTLKRSASARFAVVSAKVGSISDNRLGGWYSYRASKAALNMFLKTLSIEWQRSMKHC 174
+ S V + + + +Y +SKAA MF K L +E C
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMA---AYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 175 VVLALHPGTTDTPLSKP------------------FQQNVPKQKLFTPEYVAQCLVSIIA 216
+++ PG+T+T + F+ +P +KL P +A ++ +++
Sbjct: 183 NIVS--PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVS 240

Query: 217 NATPAQTGSFLAYDG 231
T L DG
Sbjct: 241 GQAGHITMHNLCVDG 255


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Shewmr4_3529  HTHFIS  87     2e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 86.8 bits (215), Expect = 2e-22
Identities = 26/110 (23%), Positives = 57/110 (51%)

Query: 3 RLLIIEDDQALAGVLARRLTRHGFECRLSHDASNALLVAREFCPSHILLDMKLAEANGLS 62
+L+ +DD A+ VL + L+R G++ R++ +A+ ++ D+ + + N
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 63 LIVPLRNLLPKVIMVLLTGYASIATAVEAIRLGADNYLAKPVDTQTLLAA 112
L+ ++ P + +++++ + TA++A GA +YL KP D L+
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGI 114



Score = 48.3 bits (115), Expect = 3e-09
Identities = 19/58 (32%), Positives = 29/58 (50%), Gaps = 2/58 (3%)

Query: 116 NSQASALPEDEIDDSPLSPKRLEWEHIQQVLNANQGNVSATARQLGMHRRTLQRKLLK 173
S ALP + D L +E+ I L A +GN A LG++R TL++K+ +
Sbjct: 417 ASFGDALPPSGLYDRVL--AEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRE 472


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3531  NUCEPIMERASE  71     5e-16    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 70.6 bits (173), Expect = 5e-16
Identities = 37/158 (23%), Positives = 62/158 (39%), Gaps = 20/158 (12%)

Query: 3 NIMVTGATGLLGRAVVKQLTAAGHRVIA---------TGFSRAEAGI--------HRLDL 45
+VTGA G +G V K+L AGH+V+ +A + H++DL
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 46 TQAAEVEAFIAREQPEVIVHCAAERRPDVSERSPEHALALNLSASQTLAEAAKNHQ-AWL 104
+ A E + S +P NL+ + E ++++ L
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121

Query: 105 LYISTDYVF-DGTTPPYAEDAEPN-PVNFYGASKLQGE 140
LY S+ V+ P++ D + PV+ Y A+K E
Sbjct: 122 LYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANE 159


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Shewmr4_3534  RTXTOXIND  39     1e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 39.4 bits (92), Expect = 1e-05
Identities = 45/221 (20%), Positives = 81/221 (36%), Gaps = 36/221 (16%)

Query: 109 AQIHELEKQLSQLELNNLSLNAEILTQLQQRIDVAAEGVTRQNGLLDSFERYQRKGVVPT 168
Q + Q Q ELN AE LT L + ++ LD F K +
Sbjct: 193 EQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSR-LDDFSSLLHKQAIAK 251

Query: 169 ----------ADMAAVLQAHTASKMALE----QAKVDLMQARQAQKTELLAGPIAQSKYN 214
+ L+ + + +E AK + Q K E+L + Q+ N
Sbjct: 252 HAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEIL-DKLRQTTDN 310

Query: 215 ---VELQLARLKAQESQLDIKALTPTRVVDV-------LVQAGEHIVEDRPLVLLSGREA 264
+ L+LA+ + ++ I+A +V + +V E ++ P +
Sbjct: 311 IGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPE-----DDT 365

Query: 265 AVIFAYLEPKYLEYTAIGQEATIKLP--NGTR---LRGEIS 300
+ A ++ K + + +GQ A IK+ TR L G++
Sbjct: 366 LEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVK 406


100  Shewmr4_3695  Shewmr4_3703  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Shewmr4_3695   0      16       -2.491570  phosphoribosylaminoimidazole carboxylase ATPase
Shewmr4_3696   0      17       -2.910430  diguanylate cyclase with GAF sensor
Shewmr4_3697   1      19       -4.402994  hypothetical protein
Shewmr4_3698   1      17       -4.700447  hypothetical protein
Shewmr4_3699   0      14       -2.886350  hypothetical protein
Shewmr4_3700  -1      13       -1.160521  hypothetical protein
Shewmr4_3701   0      15       -0.518394  hypothetical protein
Shewmr4_3702   0      16       -0.182153  glutamate--cysteine ligase
Shewmr4_3703  -1      17        0.584370  peptidase M16 domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3695  TYPE3IMSPROT  34     6e-04    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 33.6 bits (77), Expect = 6e-04
Identities = 20/90 (22%), Positives = 31/90 (34%), Gaps = 17/90 (18%)

Query: 164 IGYEKAFEQIRTGDVIYCDPP-------YAPLSTTASFTTYVGAGFSLDDQALLARYSRH 216
I E ++ V+ +P Y T T+ D Q R
Sbjct: 245 IQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYT----DAQVQ---TVRK 297

Query: 217 MALEQRIPVVISNHDIPLTRELYRGAHLAK 246
+A E+ +P++ IPL R LY A +
Sbjct: 298 IAEEEGVPIL---QRIPLARALYWDALVDH 324


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr4_3696  PF05272  30     0.024    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.0 bits (67), Expect = 0.024
Identities = 16/65 (24%), Positives = 24/65 (36%)

Query: 14 ALVERLHHVASYSDQLLVLVGAHGSGKTTLLTALATDFDESNAALVICPMHADNAEIRRK 73
V R+ D +VL G G GK+TL+ L S+ I +I
Sbjct: 583 GHVARVMEPGCKFDYSVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGI 642

Query: 74 ILVQL 78
+ +L
Sbjct: 643 VAYEL 647


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Shewmr4_3698  PF05272  31     0.002    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 31.2 bits (70), Expect = 0.002
Identities = 16/64 (25%), Positives = 25/64 (39%), Gaps = 8/64 (12%)

Query: 9 LVGPMGAGKSTIGRHLAQML-----HLEFHDSDQEIEQRTGADIAWVFDVEGEEGFRRRE 63
L G G GKST+ L + H + EQ G +++ FRR +
Sbjct: 601 LEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAG---IVAYELSEMTAFRRAD 657

Query: 64 AQVI 67
A+ +
Sbjct: 658 AEAV 661


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3699  BCTERIALGSPD  248    3e-75    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 248 bits (635), Expect = 3e-75
Identities = 99/411 (24%), Positives = 188/411 (45%), Gaps = 38/411 (9%)

Query: 306 GDITLRLDDVPWDQALDLILQTKGLDKRIEGNILMVAPSEELAIRESQNLKNKQEVKELA 365
GD ++ + W A D++ L+K + L + + E N
Sbjct: 190 GDRSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSR 249

Query: 366 PLYSEYLQ----------------INYAKATDIAELLKGADSSLLSPRG----------- 398
++ + YAKA+D+ E+L G S++ S +
Sbjct: 250 QRIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPVAALDKN 309

Query: 399 -SVAVDERTNTVLVKDTAEIIENIHRLVEVLDIPIRQVLIESRMVTVKDDVSEDLGIRWG 457
+ +TN ++V +++ ++ R++ LDI QVL+E+ + V+D +LGI+W
Sbjct: 310 IIIKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWA 369

Query: 458 VTDQQGSKGTSGTLEGAGSIANGTVPTLDNRLNVNLPAAVTNPTSIAFHVAKLADGTILD 517
+ ++ T+ L + +IA D ++ +L +A+++ IA +
Sbjct: 370 NKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQGN----WA 425

Query: 518 LELSALEQENKGEIIASPRITTSNQKAAYIEQGVEIPYV-----QSTSSGATSVTFKKAV 572
+ L+AL K +I+A+P I T + A G E+P + S + +V K
Sbjct: 426 MLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGSQTTSGDNIFNTVERKTVG 485

Query: 573 LSLRVTPQITPDNRVILDLEITQDSQGKT-VDTPTGPAVAIDTQRIGTQVLVDNGETIVL 631
+ L+V PQI + V+L++E S T + +T+ + VLV +GET+V+
Sbjct: 486 IKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGATFNTRTVNNAVLVGSGETVVV 545

Query: 632 GGIYQQNLISRVSKVPILGDIPLVGFLFRNTTDKNERQELLIFVTPKIVNE 682
GG+ +++ KVP+LGDIP++G LFR+T+ K ++ L++F+ P ++ +
Sbjct: 546 GGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRD 596



Score = 46.8 bits (111), Expect = 2e-07
Identities = 33/175 (18%), Positives = 75/175 (42%), Gaps = 14/175 (8%)

Query: 275 SLNFQNISVRTVLQIIADYNNFNLVTSDTVEGDITLR-LDDVPWDQALDLILQTKGLDKR 333
S +F+ ++ + ++ N ++ +V G IT+R D + +Q L
Sbjct: 31 SASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDMLNEEQYYQFFLSVL----D 86

Query: 334 IEGNILMVAPSEELAIRESQNLKNKQEVK--ELAP-----LYSEYLQINYAKATDIAELL 386
+ G ++ + L + S++ K + AP + + + + A D+A LL
Sbjct: 87 VYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVVTRVVPLTNVAARDLAPLL 146

Query: 387 KGADSSLLSPRGSVAVDERTNTVLVKDTAEIIENIHRLVEVLDIPIRQVLIESRM 441
+ + + + GSV E +N +L+ A +I+ + +VE +D + ++ +
Sbjct: 147 RQLNDN--AGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGDRSVVTVPL 199


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Shewmr4_3703  SHAPEPROTEIN  44     5e-07    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 44.0 bits (104), Expect = 5e-07
Identities = 32/156 (20%), Positives = 58/156 (37%), Gaps = 34/156 (21%)

Query: 199 VDIGANMTTFSVVESGETTFIREQAFGGELFTQSILSFYGMSY------EQAEKAKIE-- 250
VDIG T +V+ + GG+ F ++I+++ +Y AE+ K E
Sbjct: 164 VDIGGGTTEVAVISLNGVVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIG 223

Query: 251 -------------------GDLPRNY------MFEVLSPFQTQLLQQIKRTLQIYCTSSG 285
+PR + + E L T ++ + L+
Sbjct: 224 SAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELA 283

Query: 286 KDKVDY-LVLCGGTSKLEGMANLLTNELGVHTIIAD 320
D + +VL GG + L + LL E G+ ++A+
Sbjct: 284 SDISERGMVLTGGGALLRNLDRLLMEETGIPVVVAE 319



 
Contact Sachin Pundhir for Bugs/Comments.