PredictBias

identification of genomic and pathogenicity islands in prokaryotic genome
 
A) Input parameters
Genome: NC_003210.gbk
Threshold dinucleotide bias: 10
Threshold codon bias: 10
Threshold %GC bias: 10
E-value (RPSBlast): 0.01
Genome (non-pathogenic):
 
 
C) Potential GIs and PAIs in NC_003210
S.No | Start | End | Bias | Virulence | Insertion elements | Prediction
1 | lmo0062 | lmo0081 | Y | Y | Y | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
lmo0062 | 3 | 18 | -0.937085 | hypothetical protein
lmo0063 | 3 | 19 | -1.071494 | hypothetical protein
lmo0064 | 4 | 18 | -0.998772 | hypothetical protein
lmo0065 | 5 | 21 | -2.290624 | hypothetical protein
lmo0066 | 6 | 21 | -2.378956 | toxin
lmo0067 | 10 | 28 | -6.321081 | dinitrogenase reductase ADP-ribosylation
lmo0068 | 10 | 33 | -6.779997 | hypothetical protein
lmo0069 | 11 | 31 | -7.904113 | hypothetical protein
lmo0070 | 6 | 27 | -4.101524 | hypothetical protein
lmo0071 | 2 | 25 | -0.011159 | hypothetical protein
lmo0072 | 3 | 25 | 0.094486 | hypothetical protein
lmo0073 | 3 | 21 | 1.606205 | hypothetical protein
lmo0074 | 0 | 20 | 1.323752 | hypothetical protein
lmo0075 | 0 | 21 | 1.932657 | carboxyphosphonoenolpyruvate phosphonomutase
lmo0076 | 0 | 19 | -0.347416 | O6-methylguanine-DNA methyltransferase
lmo0077 | 0 | 19 | -1.584224 | hypothetical protein
lmo0078 | 1 | 19 | -1.426925 | phosphoglycerate dehydrogenase
lmo0079 | 4 | 20 | -1.859339* | hypothetical protein
lmo0080 | 6 | 21 | -3.205768 | hypothetical protein
lmo0081 | 1 | 14 | -3.341637 | hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
lmo0066 | BINARYTOXINA | 74 | 4e-16 | Clostridial binary toxin A signature.
		>BINARYTOXINA#Clostridial binary toxin A signature.

Length = 454

Score = 73.9 bits (181), Expect = 4e-16
Identities = 53/191 (27%), Positives = 95/191 (49%), Gaps = 18/191 (9%)

Query: 429 ESEKWGIDGFSVWRNSLSSREIQAIRDYTDIWHYGNMNGYLR--GSVEKLAPDNAERIKN 486
+ + WG + +S W N L+ E+ + DY Y +N YL G + P+ ++ N
Sbjct: 260 KGDLWGKENYSDWSNKLTPNELADVNDYMR-GGYTAINNYLISNGPLNNPNPELDSKVNN 318

Query: 487 LSSALEKAELPDNIILYRGTSSEILD--------NFLDLKNLN--YQNLVGKTIEEKGFM 536
+ +AL+ +P N+I+YR + + +F ++N++ + GK I F+
Sbjct: 319 IENALKLTPIPSNLIVYRRSGPQEFGLTLTSPEYDFNKIENIDAFKEKWEGKVITYPNFI 378

Query: 537 STT--TISNQTFSGN-VTMKINAPKGSKGAYLAHFSETPEEAEVLFNIGQKMLIKEVTEL 593
ST+ +++ F+ + ++IN PK S GAYL+ E EVL N G K I +V
Sbjct: 379 STSIGSVNMSAFAKRKIILRINIPKDSPGAYLSAIPGYAGEYEVLLNHGSKFKINKVDSY 438

Query: 594 -NGKI-EIIVD 602
+G + ++I+D
Sbjct: 439 KDGTVTKLILD 449


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
lmo0070 | TYPE3IMSPROT | 28 | 0.030 | Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 28.2 bits (63), Expect = 0.030
Identities = 15/90 (16%), Positives = 34/90 (37%), Gaps = 21/90 (23%)

Query: 62 LVLLALLCYIILYPQKMTIRFQNLQYLLYICCFQFLVFMVIRYFYSNLIYGIQNMVSLTA 121
+VLL++L +II+ + +++ + + +
Sbjct: 147 VVLLSILIWIIIKG---------------------NLVTLLQLPTCGIECITPLLGQILR 185

Query: 122 QTLVASYVFLLVLWILALIFFYFHFRKKLR 151
Q +V V +V+ I F Y+ + K+L+
Sbjct: 186 QLMVICTVGFVVISIADYAFEYYQYIKELK 215
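
The hit summaries in this report mix fixed-point and exponent notation for E-values (for example 0.030 and 4e-16). The following is a minimal Python sketch, not part of PredictBias itself, showing how such rows could be parsed and ranked. The `Hit` record, the `parse_hit` and `filter_hits` helpers, and the 0.01 cutoff are illustrative assumptions; the two rows are copied from the island 1 hits above.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    """One row of a virulence-factor hit summary (hypothetical record type)."""
    locus_tag: str
    signature: str
    score: float
    e_value: float


def parse_hit(fields):
    """Convert (locus tag, hit name, score, E-value) strings into a Hit.

    float() accepts both '0.030' and exponent forms such as '4e-16'.
    """
    locus_tag, signature, score, e_value = fields
    return Hit(locus_tag, signature, float(score), float(e_value))


def filter_hits(hits, max_e_value):
    """Keep hits at or below the cutoff, most significant (lowest E-value) first."""
    kept = [h for h in hits if h.e_value <= max_e_value]
    return sorted(kept, key=lambda h: h.e_value)


# Rows copied from the island 1 hit summaries.
rows = [
    ("lmo0066", "BINARYTOXINA", "74", "4e-16"),
    ("lmo0070", "TYPE3IMSPROT", "28", "0.030"),
]
hits = [parse_hit(r) for r in rows]

# The 0.01 cutoff is an assumption made for illustration only.
strong = filter_hits(hits, max_e_value=0.01)
print([h.locus_tag for h in strong])
```

With this cutoff only the lmo0066 hit is retained; a looser cutoff would also keep the weaker lmo0070 match.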


2 | lmo0106 | lmo0115 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
lmo0106 | 2 | 16 | 1.402034 | transcriptional regulator
lmo0107 | 2 | 16 | 1.118083 | ABC transporter ATP-binding protein
lmo0108 | 2 | 14 | 0.265618 | ABC transporter ATP-binding protein
lmo0109 | 2 | 16 | -0.469429 | AraC family transcriptional regulator
lmo0110 | 2 | 18 | -0.796636 | lipase
lmo0111 | 2 | 21 | -2.537118 | hypothetical protein
lmo0112 | 2 | 21 | -2.678173 | Fnr/Crp family transcriptional regulator
lmo0113 | 1 | 20 | -1.636455 | hypothetical protein
lmo0114 | 2 | 21 | -1.460536 | repressor C1
lmo0115 | 3 | 21 | -1.481448 | hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
lmo0107 | FLGBIOSNFLIP | 33 | 0.004 | Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 32.5 bits (74), Expect = 0.004
Identities = 20/117 (17%), Positives = 46/117 (39%), Gaps = 8/117 (6%)

Query: 134 IRVVNYINMLSDLLSNGLINLISDILSVIVTLGFM------LMIDPVLTLYSLALIPVLF 187
++ + +I L+ ++ +++ +I+ G + P L LAL F
Sbjct: 42 VQTLVFITSLT--FIPAILLMMTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFF 99

Query: 188 VIVMVIKTAQRKAYQVLSNKQSNMNAYIHESIAGIKVTQSFSREEENFEIFTEVSNE 244
++ VI AYQ S ++ +M + + ++ E + +F ++N
Sbjct: 100 IMSPVIDKIYVDAYQPFSEEKISMQEALEKGAQPLREFMLRQTREADLGLFARLANT 156


3 | lmo0132 | lmo0146 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
lmo0132 | 0 | 21 | 3.055013 | inosine 5-monophosphate dehydrogenase
lmo0133 | 0 | 20 | 2.482782 | hypothetical protein
lmo0134 | -1 | 19 | 2.500200 | hypothetical protein
lmo0135 | -1 | 18 | 2.254843 | peptide ABC transporter substrate-binding
lmo0136 | 2 | 22 | 2.013169 | peptide ABC transporter permease
lmo0137 | 0 | 20 | 0.861756 | peptide ABC transporter permease
lmo0138 | 1 | 21 | -2.033127 | hypothetical protein
lmo0139 | 2 | 23 | -3.084623 | hypothetical protein
lmo0140 | 3 | 23 | -5.414311 | hypothetical protein
lmo0141 | 13 | 30 | -11.222633 | hypothetical protein
lmo0142 | 13 | 28 | -11.016886 | hypothetical protein
lmo0142a | 12 | 29 | -10.930523 | hypothetical protein
lmo0143 | 7 | 27 | -8.012029 | hypothetical protein
lmo0144 | 8 | 26 | -8.156557 | hypothetical protein
lmo0146 | 8 | 25 | -7.406934 | hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
lmo0134 | SACTRNSFRASE | 30 | 9e-04 | Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 29.9 bits (67), Expect = 9e-04
Identities = 13/66 (19%), Positives = 25/66 (37%), Gaps = 1/66 (1%)

Query: 1 MEYKNGENR-IYAVNDEGVEVGEVTFVPTGEDMFIIDHTGVDDAARGQGIAQELVKRAVE 59
+ Y E + + E +G + +I+ V R +G+ L+ +A+E
Sbjct: 57 VSYVEEEGKAAFLYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIE 116

Query: 60 KAKSEG 65
AK
Sbjct: 117 WAKENH 122


4 | lmo0158 | lmo0175 | Y | Y | Y | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
lmo0158 | 2 | 13 | 1.198270 | hypothetical protein
lmo0159 | 2 | 14 | 1.515336 | peptidoglycan binding protein
lmo0160 | -2 | 11 | 0.932137 | peptidoglycan binding protein
lmo0161 | -1 | 14 | 1.327274 | hypothetical protein
lmo0162 | 0 | 16 | 2.265945 | DNA polymerase III subunit delta'
lmo0163 | 2 | 14 | 1.451706 | hypothetical protein
lmo0164 | 1 | 15 | 0.330715 | DNA replication initiation control protein YabA
lmo0165 | -1 | 12 | -0.424576 | hypothetical protein
lmo0166 | -1 | 11 | -1.046354 | GIY-YIG nuclease
lmo0167 | 0 | 12 | -1.199552 | hypothetical protein
lmo0168 | 4 | 14 | -1.931359 | AbrB family transcriptional regulator
lmo0169 | 4 | 13 | -1.722109 | glucose transporter
lmo0170 | 2 | 12 | -1.397330 | hypothetical protein
lmo0171 | 2 | 10 | -1.374062 | internalin
lmo0174 | 2 | 14 | -1.060086 | transposase
lmo0175 | 2 | 14 | -0.974335 | peptidoglycan-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
lmo0159 | TONBPROTEIN | 44 | 1e-06 | Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 43.8 bits (103), Expect = 1e-06
Identities = 20/88 (22%), Positives = 24/88 (27%), Gaps = 8/88 (9%)

Query: 673 TTPVSFEIVAGETDPIVKVTKENTLVPPTPVPPTPVPPTPVPPTPVPPTPLPPVPYEPTV 732
P+S +V P P V P P P P PV E
Sbjct: 42 AQPISVTMVTPADLE--------PPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPK 93

Query: 733 PPTKPEVPVTPKKTENSEDSPKTTPIRI 760
P KP+ K E + K R
Sbjct: 94 PKPKPKPKPVKKVQEQPKRDVKPVESRP 121


5 | lmo0297 | lmo0346 | Y | Y | Y | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
lmo0297 | 2 | 30 | 2.774705 | transcriptional antiterminator BglG
lmo0298 | 2 | 24 | 2.673228 | PTS beta-glucoside transporter subunit IIC
lmo0299 | 0 | 14 | -0.964318 | PTS beta-glucoside transporter subunit IIB
lmo0300 | 1 | 16 | -0.161714 | phospho-beta-galactosidase
lmo0301 | 2 | 23 | -2.878119 | PTS beta-glucoside transporter subunit IIA
lmo0302 | 3 | 21 | -4.594522 | hypothetical protein
lmo0303 | 5 | 23 | -6.466642 | putative secreted, lysin rich protein
lmo0304 | 6 | 26 | -7.024502 | hypothetical protein
lmo0305 | 7 | 29 | -7.031018 | L-allo-threonine aldolase
lmo0306 | 10 | 31 | -9.173996 | hypothetical protein
lmo0307 | 11 | 32 | -9.069055 | hypothetical protein
lmo0309 | 8 | 27 | -7.252702 | hypothetical protein
lmo0310 | 5 | 29 | -4.203380 | hypothetical protein
lmo0311 | 0 | 26 | -0.477564 | hypothetical protein
lmo0312 | 2 | 25 | 1.761920 | hypothetical protein
lmo0313 | 4 | 28 | 5.283533 | hypothetical protein
lmo0314 | 2 | 27 | 6.967899 | hypothetical protein
lmo0315 | -1 | 19 | 5.725077 | thiamin biosynthesis protein
lmo0316 | -1 | 19 | 4.923349 | hydroxyethylthiazole kinase
lmo0317 | -2 | 18 | 3.180734 | phosphomethylpyrimidine kinase
lmo0318 | -1 | 19 | 2.152886 | thiamine-phosphate pyrophosphorylase
lmo0319 | 0 | 18 | 0.953159 | phospho-beta-glucosidase
lmo0320 | 0 | 16 | -1.064723 | peptidoglycan-bound surface protein
lmo0321 | 0 | 18 | -1.402771 | hypothetical protein
lmo0322 | 1 | 13 | -1.434273 | hypothetical protein
lmo0323 | 1 | 13 | -1.544757 | hypothetical protein
lmo0324 | 1 | 15 | -1.921310 | hypothetical protein
lmo0325 | 2 | 15 | -1.969560 | transcriptional regulator
lmo0326 | 4 | 15 | -2.150432 | transcriptional regulator
lmo0327 | 6 | 16 | -2.721831 | cell surface protein
lmo0328 | 2 | 15 | -3.526525 | hypothetical protein
lmo0329 | 3 | 16 | -3.591627 | transposase
lmo0330 | 3 | 16 | -3.340478 | transposase
lmo0331 | 3 | 16 | -3.058972 | rli25
lmo0332 | 0 | 16 | -2.089823 | internalin
lmo0333 | 0 | 16 | -2.189623 | hypothetical protein
lmo0334 | 3 | 25 | -1.259855 | internalin
lmo0335 | 4 | 24 | 0.054211 | hypothetical protein
lmo0336 | 5 | 22 | -0.532323 | hypothetical protein
lmo0337 | 3 | 18 | 2.994177 | hypothetical protein
lmo0338 | 3 | 19 | 3.583117 | hypothetical protein
lmo0339 | 3 | 23 | 4.541736 | hypothetical protein
lmo0340 | 2 | 23 | 4.807993 | hypothetical protein
lmo0341 | 2 | 23 | 4.704909 | hypothetical protein
lmo0342 | 3 | 22 | 5.382714 | hypothetical protein
lmo0343 | 0 | 19 | 3.609166 | transketolase
lmo0344 | 0 | 17 | 3.822118 | transaldolase
lmo0345 | 0 | 14 | 3.649345 | short chain dehydrogenase
lmo0346 | -1 | 14 | 3.369873 | sugar-phosphate isomerase
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
lmo0297 | PF05043 | 50 | 2e-08 | Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 49.6 bits (118), Expect = 2e-08
Identities = 50/231 (21%), Positives = 97/231 (41%), Gaps = 15/231 (6%)

Query: 4 SERQRSLLEKLNDSQKTVTAKALSEMLGVSSKTVRNDIMQINQSFSSTIIASKAGKGYFL 63
S RQ LLE L + ++ L+E+L + + V++D+ + +F I S +
Sbjct: 9 SHRQLELLELLFEHKRWFHRSELAELLNCTERAVKDDLSHVKSAFPDLIFHSSTNGIRII 68

Query: 64 MPNEQLSQMNLTK-NNENLHFELLRHIIEQDHTNFYDLADQFFISESTLARIIKELNIVI 122
++ +M + HF +L I + + +F+IS S+L RII ++N VI
Sbjct: 69 NTDDSDIEMVYHHFFKHSTHFSILEFIFFNEGCQAESICKEFYISSSSLYRIISQINKVI 128

Query: 123 AEKDESLCIIRKNNELLTEGGEEEKRRIFNLFLNQEIENHQLSLDKYADYFDYCNLKQLS 182
+ + + G E R F E + ++ F+ + + LS
Sbjct: 129 KRQFQFEVSLTPV----QIIGNERDIRYFFAQYFSE----KYYFLEWP--FENFSSEPLS 178

Query: 183 ELIIAYHKKHEFFMNDFSTISFILHIAVLIERISMGSYIERTALLEQDKTS 233
+L+ +K+ F MN + L + + RI G ++E +++D +
Sbjct: 179 QLLELVYKETSFPMNLSTHRMLKLLLVTNLYRIKFGHFME----VDKDSFN 225


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
lmo0300 | NUCEPIMERASE | 30 | 0.025 | Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 29.8 bits (67), Expect = 0.025
Identities = 16/78 (20%), Positives = 33/78 (42%), Gaps = 8/78 (10%)

Query: 31 DYYLHEAGLENGDVASDHYHRYEEDIRMMKEGGQNSYRFSLSWPRIIKNRQGDINLKGIE 90
+ H+ L + + +D + + R+ + + R+SL P D NL G
Sbjct: 53 GFQFHKIDLADREGMTDLFASGHFE-RVFISPHRLAVRYSLENPHAYA----DSNLTG-- 105

Query: 91 FYQNLLDTCKKYDIEPFV 108
+ N+L+ C+ I+ +
Sbjct: 106 -FLNILEGCRHNKIQHLL 122


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
lmo0310 | GPOSANCHOR | 30 | 0.016 | Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 30.0 bits (67), Expect = 0.016
Identities = 17/89 (19%), Positives = 25/89 (28%)

Query: 253 HNEKKRMELHFRKLEKQKLELEKKNKELISENYNLDLEIKNKHTVIEKLSKNKMEVEANK 312
N + LE +K L + +L I+ L K +EA +
Sbjct: 133 MNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQ 192

Query: 313 ELLKICKIENGNLIKKVSALNFELVKMKE 341
L+ N SA L K
Sbjct: 193 AELEKALEGAMNFSTADSAKIKTLEAEKA 221


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
lmo0318 | ALARACEMASE | 29 | 0.011 | Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 29.4 bits (66), Expect = 0.011
Identities = 14/70 (20%), Positives = 23/70 (32%), Gaps = 7/70 (10%)

Query: 132 GVGPIFPTISKADAEPVSGTAILEE---IRRAGITIPIVGIGGINETNSAEVLTAGADGV 188
G+ I+ I D LEE +R G PI+ + G E+
Sbjct: 42 GIERIWSAIGATDG---FALLNLEEAITLRERGWKGPILMLEGFFHAQDLEIYDQ-HRLT 97

Query: 189 SVISAITQSD 198
+ + + Q
Sbjct: 98 TCVHSNWQLK 107


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
lmo0320 | TONBPROTEIN | 36 | 1e-04 | Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 36.1 bits (83), Expect = 1e-04
Identities = 23/106 (21%), Positives = 35/106 (33%), Gaps = 9/106 (8%)

Query: 268 PVTPPKNDPEPDNPEEPVTPVDPATPIPDEPSTPTDPATPEKPEITTPENPESTVVEADS 327
P P +PEP+ P P + I P KP E P+ V +S
Sbjct: 63 PPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKP---KPVKKVQEQPKRDVKPVES 119

Query: 328 SENEPEKSADSKIVNNPIQITSQATKTATKQAKSSATKTTLPLPKA 373
P ++ P ++TS AT + +S L +
Sbjct: 120 RPASPFEN------TAPARLTSSTATAATSKPVTSVASGPRALSRN 159


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
lmo0333 | MICOLLPTASE | 36 | 0.002 | Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 36.2 bits (83), Expect = 0.002
Identities = 33/119 (27%), Positives = 55/119 (46%), Gaps = 10/119 (8%)

Query: 1015 VTVNKDPAPIISA------KTEITYDKFSKKTEAAFLDDIDADTNDGSIVTSNFATAVN- 1067
V VNK+P +I + + EI +D K E + + D DG SN A A +
Sbjct: 769 VHVNKEPKAVIKSDSSVIVEEEINFDGTESKDEDGEIKAYEWDFGDGE--KSNEAKATHK 826

Query: 1068 LDKAGDYTVTLNSINSDGVAGTPTAIIVHVEKEKIATISTNTAQQ-YEKYAKINETQFL 1125
+K G+Y V L +++G T + I VE + + I+ + +EK +I ++ L
Sbjct: 827 YNKTGEYEVKLTVTDNNGGINTESKKIKVVEDKPVEVINESEPNNDFEKANQIAKSNML 885



Score = 33.9 bits (77), Expect = 0.007
Identities = 44/195 (22%), Positives = 80/195 (41%), Gaps = 21/195 (10%)

Query: 1313 TNFKTAMSYTVTLNAVNEDGISAEPVAVTVTINKEPAAALKADA------EVSYAKNEAV 1366
N K + + V G++ + V +NKEP A +K+D+ E+++ E+
Sbjct: 742 VNHKVDGNGNYVYDVVFH-GMNTDTNT-DVHVNKEPKAVIKSDSSVIVEEEINFDGTESK 799

Query: 1367 TESDFFKDVHLE-GTEAPST-AKATSNFDSVVDRSKTGDYTVTINATNEDGAVSTPIEVI 1424
E K + G S AKAT ++ KTG+Y V + T+ +G ++T + I
Sbjct: 800 DEDGEIKAYEWDFGDGEKSNEAKATHKYN------KTGEYEVKLTVTDNNGGINTESKKI 853

Query: 1425 VHIEAESAPVITANA-EVKYNKHEQTDERRFL----YDSEAKIDEANVEIKTDFAEKVDI 1479
+E + VI + + K Q + L E D+ ++ K+ +
Sbjct: 854 KVVEDKPVEVINESEPNNDFEKANQIAKSNMLVKGTLSEEDYSDKYYFDVAKKGNVKITL 913

Query: 1480 NKVGTYTVTLTATNE 1494
N + + +T T E
Sbjct: 914 NNLNSVGITWTLYKE 928


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
lmo0344 | DHBDHDRGNASE | 135 | 4e-41 | 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 135 bits (342), Expect = 4e-41
Identities = 83/254 (32%), Positives = 126/254 (49%), Gaps = 12/254 (4%)

Query: 12 ITDKVAVVTGAASGIGKAMAELFSEKGAYVVLLDIKED--VKDVAAKINPSRTL-ALQVD 68
I K+A +TGAA GIG+A+A + +GA++ +D + K V++ +R A D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 69 ITKKENIEKVVAEIKKVYPKIDILANSAGVALLEKAEDLPEEYWDKTMELNLKGSFLMAQ 128
+ I+++ A I++ IDIL N AGV L +E W+ T +N G F ++
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 129 IIGREMIATGGGKIVNMASQASVIALDKHVAYCASKAAIVSMTQVLAMEWAPYNINVNAI 188
+ + M+ G IV + S + + AY +SKAA V T+ L +E A YNI N +
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 189 SPTVILTELGKKAWAGQVGED---------MKKLIPAGRFGYPEEVAACALFLVSDAASL 239
SP T++ WA + G + K IP + P ++A LFLVS A
Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245

Query: 240 ITGENLIIDGGYTI 253
IT NL +DGG T+
Sbjct: 246 ITMHNLCVDGGATL 259


6 | lmo0364 | lmo0419 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
lmo0364 | 2 | 17 | 1.275359 | peptidase E
lmo0365 | 2 | 17 | 2.650750 | transcriptional regulator
lmo0366 | 3 | 17 | 2.501942 | hypothetical protein
lmo0367 | 3 | 15 | 2.072035 | hypothetical protein
lmo0368 | 1 | 16 | 0.293639 | hypothetical protein
lmo0369 | 1 | 17 | 0.255968 | hypothetical protein
lmo0370 | 1 | 17 | 0.097358 | hypothetical protein
lmo0371 | 1 | 16 | 0.577372 | hypothetical protein
lmo0372 | 1 | 17 | 0.700883 | GntR family transcriptional regulator
lmo0373 | 1 | 15 | -0.491766 | beta-glucosidase
lmo0374 | -1 | 19 | -1.821476 | PTS beta-glucoside transporter subunit IIC
lmo0375 | 0 | 20 | -4.028483 | PTS beta-glucoside transporter subunit IIB
lmo0376 | 1 | 19 | -4.582439 | hypothetical protein
lmo0377 | 3 | 21 | -5.200463 | transcriptional regulator
lmo0378 | 2 | 19 | -2.211488 | hypothetical protein
lmo0379 | 3 | 21 | 0.134975 | hypothetical protein
lmo0380 | 3 | 19 | 1.455900 | hypothetical protein
lmo0381 | 3 | 21 | 4.938882 | hypothetical protein
lmo0382 | 4 | 21 | 5.421731 | hypothetical protein
lmo0383 | 4 | 21 | 5.326628 | transcriptional regulator
lmo0384 | 3 | 22 | 4.063812 | methylmalonate-semialdehyde dehydrogenase
lmo0385 | 3 | 21 | 3.626407 | IolB protein
lmo0386 | 1 | 21 | 3.461649 | IolC protein
lmo0387 | 0 | 15 | 0.814507 | IolD protein
lmo0388 | 0 | 14 | 0.980492 | hypothetical protein
lmo0389 | 2 | 13 | 0.327635 | hypothetical protein
lmo0390 | 1 | 17 | 0.571846 | low temperature requirement protein A
lmo0391 | 1 | 16 | 0.608949 | uracil-DNA glycosylase
lmo0392 | 1 | 14 | 0.596961 | hypothetical protein
lmo0393 | 1 | 13 | -0.229395 | hypothetical protein
lmo0394 | 0 | 13 | 0.420417 | hypothetical protein
lmo0395 | -2 | 8 | 2.448116 | P60 protein
lmo0396 | 1 | 15 | 2.339948 | blasticidin S-acetyltransferase
lmo0397 | 2 | 17 | 1.715411 | pyrroline-5-carboxylate reductase
lmo0398 | 2 | 16 | 1.345050 | hypothetical protein
lmo0399 | 2 | 16 | 0.918226 | PTS sugar transporter subunit IIA
lmo0400 | 2 | 15 | 0.842915 | PTS fructose transporter subunit IIB
lmo0401 | 2 | 14 | 0.200622 | PTS fructose transporter subunit IIC
lmo0402 | 1 | 14 | -1.088751 | alpha-mannosidase
lmo0403 | 0 | 17 | -1.892894 | transcriptional antiterminator BglG
lmo0404 | 2 | 15 | -1.435113 | hypothetical protein
lmo0405 | 1 | 13 | -0.394301 | hypothetical protein
lmo0406 | 0 | 12 | -0.437411 | phosphate transporter
lmo0407 | 0 | 12 | -0.272102 | hypothetical protein
lmo0408 | 1 | 11 | 0.464913 | hypothetical protein
lmo0409 | 1 | 11 | 0.145055 | hypothetical protein
lmo0411 | 1 | 12 | 0.809909 | internalin
lmo0412 | 2 | 14 | -0.631894 | phosphoenolpyruvate synthase
lmo0413 | 1 | 15 | -1.426838 | hypothetical protein
lmo0414 | -1 | 14 | -2.169766 | hypothetical protein
lmo0415 | 0 | 15 | -2.606466 | hypothetical protein
lmo0416 | 3 | 17 | -2.472173 | endo-1,4-beta-xylanase
lmo0417 | 3 | 17 | -2.870897 | transcriptional regulator
lmo0418 | 3 | 17 | -2.462726 | hypothetical protein
lmo0419 | 2 | 17 | -1.604425 | hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
lmo0392 | IGASERPTASE | 30 | 0.014 | IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.0 bits (67), Expect = 0.014
Identities = 34/138 (24%), Positives = 51/138 (36%), Gaps = 9/138 (6%)

Query: 150 TQITVQSNIERIVGGAGEDTV-IARVGEAVVSTVG---ETREHTDVLENPNSISK--KVQ 203
T IT +NI+ V + IARV EA V + V EN SK +
Sbjct: 995 TNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKN 1054

Query: 204 EQGLGDGTAYTILSIDIAEMRIGDNIKAKLDIEKANADMEVAQAAASKRKAEAIALEQEN 263
EQ + TA A+ + N + E A + E + ++ K A ++E
Sbjct: 1055 EQDATETTAQNREVAKEAKSNVKANTQTN---EVAQSGSETKETQTTETKETATVEKEEK 1111

Query: 264 RAAVVAAEAEVPRALSRA 281
EVP+ S+
Sbjct: 1112 AKVETEKTQEVPKVTSQV 1129


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
lmo0395 | SACTRNSFRASE | 43 | 7e-08 | Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 42.6 bits (100), Expect = 7e-08
Identities = 20/96 (20%), Positives = 37/96 (38%), Gaps = 9/96 (9%)

Query: 50 EMVGGVTAKISYGE-LHVSLLSVDPSTQGSGVGTELMAQIERYGRANSCHHISLTTFSYQ 108
+G + + ++ + ++V + GVGT L+ + + + N + L T
Sbjct: 75 NCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDIN 134

Query: 109 AP--EFYRKCGFTELGRV-----KDFPIKGEEKYFF 137
FY K F +G V +FP E F+
Sbjct: 135 ISACHFYAKHHFI-IGAVDTMLYSNFPTANEIAIFW 169


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
lmo0402 | PF08280 | 33 | 0.004 | M protein trans-acting positive regulator
		>PF08280#M protein trans-acting positive regulator

Length = 530

Score = 32.9 bits (75), Expect = 0.004
Identities = 29/157 (18%), Positives = 59/157 (37%), Gaps = 22/157 (14%)

Query: 21 SHLSAQKLSQDLHISERTIRTDIAKLTEFLESHGATITLTRGAGYKIEILDPTVFQAFQA 80
S L ++++ ++ + +L F + I +
Sbjct: 57 SSLPITEVAEKTGLTFLQLNHYCEELNAFFPDS-----------LSMTIQKRMI----SC 101

Query: 81 EKNKPKNADYF-DLDNPEERVKYEIFLLLSSADYIKLEDLADTIFASRATISNDMKQVRK 139
+ P Y L ++ FL+ + + L D A + F S ++ + +
Sbjct: 102 QFTHPSKETYLYQLYASSNVLQLLAFLIKNGSHSRPLTDFARSHFLSNSSAYRMREALIP 161

Query: 140 VIASYDLTLVSKPGSGVKIVGDEEKMRYALTALIASK 176
++ +++L L KIVG+E ++RY L AL+ SK
Sbjct: 162 LLRNFELKLSKN-----KIVGEEYRIRY-LIALLYSK 192


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
lmo0412 | IGASERPTASE | 34 | 5e-04 | IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 34.3 bits (78), Expect = 5e-04
Identities = 26/142 (18%), Positives = 48/142 (33%), Gaps = 22/142 (15%)

Query: 121 ENGAFSLSVNELDKAQDAVLSIVIDGQTKEQTLKLALTPAYETKIAEKAEAERV------ 174
NG + L E++K V + I Q A P+ + E A +
Sbjct: 974 VNGRYDLYNPEVEKRNQTVDTTNITTPNNIQ----ADVPSVPSNNEEIARVDEAPVPPPA 1029

Query: 175 -------AAEKAEAERVERERVAAEEKRAADAKIAAEKKAEEAR--VAAEKKAAEEKRVA 225
AE + E + V E+ A + + A+EA+ V A + E +
Sbjct: 1030 PATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSG 1089

Query: 226 AERSKA---AAAQPDTSNEQGQ 244
+E + + T ++ +
Sbjct: 1090 SETKETQTTETKETATVEKEEK 1111


7 | lmo0430 | lmo0435 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
lmo0430 | 5 | 16 | -2.765550 | sugar hydrolase
lmo0431 | 5 | 16 | -2.962248 | LysR family transcriptional regulator
lmo0432 | 4 | 15 | -2.816565 | acetyltransferase
lmo0433 | 4 | 17 | -3.240145 | oxidoreductase
lmo0434 | 2 | 15 | -3.346535 | internalin A
lmo0435 | 2 | 16 | -3.751985 | internalin B
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
lmo0432 | DHBDHDRGNASE | 95 | 2e-25 | 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 94.7 bits (235), Expect = 2e-25
Identities = 60/223 (26%), Positives = 102/223 (45%), Gaps = 6/223 (2%)

Query: 3 IKNKVIIITGASSGIGKATALLLAEKGAKLVLAARRVEKLEKIVQTIKANSGEAIFAKTD 62
I+ K+ ITGA+ GIG+A A LA +GA + EKLEK+V ++KA + A D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 63 VTKREDNKKLVELAIERYGKVDAIFLNAGIMPNSPLSALKEDEWEQMIDINIKGVLNGIA 122
V ++ G +D + AG++ + +L ++EWE +N GV N
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 123 AVLPSFIAQKSGHIIATSSVAGLKAYPGGAVYGATKWAVRDLMEVLRMESAQEGTNIRTV 182
+V + ++SG I+ S A Y ++K A + L +E A NIR
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELA--EYNIRCN 183

Query: 183 TIYPAAINTELLETI--TDKETEQGMTSLYKQY--GITPDRIA 221
+ P + T++ ++ + EQ + + + GI ++A
Sbjct: 184 IVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLA 226


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
lmo0435 | MICOLLPTASE | 34 | 0.007 | Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 34.3 bits (78), Expect = 0.007
Identities = 30/109 (27%), Positives = 46/109 (42%), Gaps = 12/109 (11%)

Query: 1594 NKPGKYEVTITATDTKGNQTTKEITVQVSKDKPV---ITADPKISYQGKIEVTEANFLSG 1650
NK G+YEV +T TD G T+ ++V +DKPV ++P ++ ++ ++N L
Sbjct: 828 NKTGEYEVKLTVTDNNGGINTESKKIKVVEDKPVEVINESEPNNDFEKANQIAKSNMLVK 887

Query: 1651 VHAEVTDELDGDVKITSDFAEKVDFNKVGTYTVTLNAKDEYGNTAEPVK 1699
D D D K G +TLN + G T K
Sbjct: 888 GTLSEEDYSDK---------YYFDVAKKGNVKITLNNLNSVGITWTLYK 927


8 | lmo0458 | lmo0482 | Y | Y | Y | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
lmo0458 | 3 | 17 | -0.870947 | hypothetical protein
lmo0459 | 7 | 23 | -3.178431 | hydantoinase
lmo0460 | 8 | 23 | -2.131353 | transcriptional regulator
lmo0461 | 10 | 25 | -3.708403 | membrane associated lipoprotein
lmo0462 | 10 | 26 | -6.043601 | hypothetical protein
lmo0463 | 10 | 27 | -6.390880 | hypothetical protein
lmo0464 | 9 | 28 | -7.083950 | hypothetical protein
lmo0464a | 9 | 29 | -7.744016 | transposase
lmo0465 | 8 | 29 | -7.225185 | hypothetical protein
lmo0466 | 8 | 28 | -6.908880 | hypothetical protein
lmo0467 | 7 | 27 | -6.591847 | hypothetical protein
lmo0468 | 5 | 24 | -4.913517 | hypothetical protein
lmo0469 | 5 | 23 | -5.187556 | hypothetical protein
lmo0470 | 4 | 21 | -4.669284 | hypothetical protein
lmo0471 | 3 | 22 | -3.255611 | hypothetical protein
lmo0472 | 3 | 23 | -2.848697 | hypothetical protein
lmo0473 | 3 | 22 | -1.848077 | hypothetical protein
lmo0474 | 4 | 28 | -3.427774 | hypothetical protein
lmo0475 | 1 | 30 | -1.860135 | hypothetical protein
lmo0476 | 2 | 24 | 0.038581 | hypothetical protein
lmo0477 | 1 | 17 | 0.184066 | oxetanocin A resistance protein OxrB
lmo0478 | 1 | 16 | 1.048081 | secreted protein
lmo0479 | 1 | 18 | 1.960990 | secreted protein
lmo0480 | 2 | 18 | 2.540359 | secreted protein
lmo0481 | 2 | 19 | 3.059050 | transcriptional regulator
lmo0482 | 2 | 19 | 3.565250 | hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
lmo0459 | PF05043 | 58 | 3e-11 | Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 58.4 bits (141), Expect = 3e-11
Identities = 54/247 (21%), Positives = 102/247 (41%), Gaps = 38/247 (15%)

Query: 1 MREYLDSKSQKKVALLEKIF--YAENHTSTQEELLN-----------DLNITYPTLISTI 47
MR+ L KS +++ LLE +F H S ELLN + +P LI
Sbjct: 1 MRDLLSKKSHRQLELLELLFEHKRWFHRSELAELLNCTERAVKDDLSHVKSAFPDLIFHS 60

Query: 48 KTINFDIERFGYKAFSIVHSAPNLSYTLKISDNCSIQLIINAYIRESPKFQILETLLLAS 107
T I+++ D+ I+++ + + + S F ILE +
Sbjct: 61 ST----------NGIRIINT-----------DDSDIEMVYHHFFKHSTHFSILEFIFFNE 99

Query: 108 FPNLQALAKKVHVSYSGIKKEIKELNEELRER-NLSISTGNQVEITGDEFSLRIFYAFLF 166
+++ K+ ++S S + + I ++N+ ++ + +S V+I G+E +R F+A F
Sbjct: 100 GCQAESICKEFYISSSSLYRIISQINKVIKRQFQFEVSLTP-VQIIGNERDIRYFFAQYF 158

Query: 167 LVTYSGDRWPFSFVQYDEITDLLESCPKEIYRANSIDKGMMIHYYVAMHLLRDRMN--CQ 224
Y WPF + ++ LLE KE ++ M+ + +L R + +
Sbjct: 159 SEKYYFLEWPFENFSSEPLSQLLELVYKETSFPMNLSTHRMLKLLLVTNLYRIKFGHFME 218

Query: 225 IDTTRQF 231
+D
Sbjct: 219 VDKDSFN 225


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
lmo0480 | HTHTETR | 43 | 2e-07 | TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 42.7 bits (100), Expect = 2e-07
Identities = 14/48 (29%), Positives = 22/48 (45%)

Query: 7 TKKAIAGGLMELCQHKRFEKISIADITNICGLNRQTFYYHFTDKYDLL 54
T++ I + L + S+ +I G+ R Y+HF DK DL
Sbjct: 12 TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLF 59


9 | lmo0518 | lmo0534 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
lmo0518 | 2 | 23 | 2.268043 | encapsulation protein CapA
lmo0519 | 1 | 16 | 1.869115 | phosphoglycerate mutase
lmo0520 | 0 | 13 | 1.099227 | hypothetical protein
lmo0521 | -2 | 12 | 0.209098 | multidrug resistance protein
lmo0522 | 1 | 14 | -0.874663 | transcriptional regulator
lmo0523 | 0 | 11 | -0.895139 | 6-phospho-beta-glucosidase
lmo0524 | 1 | 12 | -0.165945 | transcriptional regulator
lmo0525 | 1 | 12 | -0.372646 | hypothetical protein
lmo0526 | 2 | 15 | -0.543284 | sulfate transporter
lmo0527 | 2 | 15 | -0.079888 | hypothetical protein
lmo0528 | 0 | 16 | 0.224845 | transcriptional regulator
lmo0529 | 0 | 15 | 1.910610 | transmembrane protein
lmo0530 | 1 | 15 | 2.001885 | hypothetical protein
lmo0531 | 2 | 14 | 2.658561 | glucosaminyltransferase
lmo0532 | 1 | 16 | 3.731990 | hypothetical protein
lmo0533 | 1 | 20 | 4.814641 | hypothetical protein
lmo0534 | 0 | 17 | 4.153166 | hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
lmo0519 | TCRTETB | 129 | 6e-35 | Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 129 bits (325), Expect = 6e-35
Identities = 92/400 (23%), Positives = 168/400 (42%), Gaps = 14/400 (3%)

Query: 25 FIGLFSETALNMALSDLIQVFDISSATVQWLTTGYLLTLGILVPISGLLLQWFTTRGLFF 84
F + +E LN++L D+ F+ A+ W+ T ++LT I + G L + L
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 85 TAVSFSIAGTLIAALSPTFAMLMI-GRVVQAVGTALLLPLMFNTILLIFPEHKRGSAMGM 143
+ + G++I + +F L+I R +Q G A L+ + P+ RG A G+
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 144 IGLVIMFAPAVGPTISGLILENLTWNWIFWISLPFLIIALLFGMKFMQNVSVVTKPKIDI 203
IG ++ VGP I G+I + W+++ I + II + F MK ++ + DI
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM-ITIITVPFLMKLLKKEVRIKGH-FDI 201

Query: 204 LSIILSTLGFGGVVFAFSSAGESGWGSATVLVSIIVGGIALGLFVWRQLTMEKPLMDLKV 263
IIL G+VF V V ++ +FV + P +D +
Sbjct: 202 KGIILM---SVGIVFFMLFTTSYSISFLIVSV------LSFLIFVKHIRKVTDPFVDPGL 252

Query: 264 FKYPMFTLGLILVFISFMMILSTMILLPLYLQNSLALAAFSAG-LVLLPGGVLNGLMSPF 322
K F +G++ I F + + ++P +++ L+ G +++ PG + +
Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312

Query: 323 TGRLFDAYGPRALVIPGFIVAVVALFFLTRIEVGTSALTIIVLHSVLMIGISMVMMPAQT 382
G L D GP ++ G V+ + + TS I++ VL G+S T
Sbjct: 313 GGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLG-GLSFTKTVIST 371

Query: 383 NGLNQLPPKLYPDGTAIMNTLQQVSGAIGTAVAITIMSAG 422
+ L + G +++N +S G A+ ++S
Sbjct: 372 IVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIP 411


10 | lmo0612 | lmo0622 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
lmo0612 | 0 | 12 | -3.226773 | internalin
lmo0613 | 0 | 12 | -2.989835 | azoreductase
lmo0614 | 2 | 14 | -3.849155 | MarR family transcriptional evidence
lmo0615 | 1 | 13 | -3.631195 | oxidoreductase
lmo0616 | 1 | 14 | -3.417005 | hypothetical protein
lmo0617 | 1 | 18 | -3.364373 | hypothetical protein
lmo0618 | 2 | 18 | -2.785972 | glycerophosphoryl diester phosphodiesterase
lmo0619 | 2 | 20 | -3.398325 | hypothetical protein
lmo0620 | 2 | 19 | -2.732909 | protein kinase
lmo0621 | 4 | 14 | -2.745376 | hypothetical protein
lmo0622 | 2 | 11 | -2.025313 | hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
lmo0614 | SACTRNSFRASE | 46 | 1e-08 | Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 45.7 bits (108), Expect = 1e-08
Identities = 28/107 (26%), Positives = 44/107 (41%), Gaps = 14/107 (13%)

Query: 52 EEAEYIEESDKNPGSVMLLCFIDDELASISQLIGHIKKRELHTSELA---ISIRKKYWGL 108
+ Y+EE K L ++++ IG IK R I++ K Y
Sbjct: 55 MDVSYVEEEGK----AAFLYYLENNC------IGRIKIRSNWNGYALIEDIAVAKDYRKK 104

Query: 109 GIGTICMEELIKYAKSSEYLKLIYLEVVTENKRAINLYKKFGFIEAG 155
G+GT + + I++AK + + L LE N A + Y K FI
Sbjct: 105 GVGTALLHKAIEWAKENHFCGL-MLETQDINISACHFYAKHHFIIGA 150


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
lmo0616 | TYPE3IMSPROT | 32 | 0.005 | Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 32.0 bits (73), Expect = 0.005
Identities = 35/206 (16%), Positives = 65/206 (31%), Gaps = 46/206 (22%)

Query: 59 GEVFSSPVAIIMLLILALLILLFVYYELGFFIMMAIYQLRGESYTFFKIIQRLNVKAKYF 118
G+V S + LI+AL +L + F + + E + Y
Sbjct: 21 GQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAEQ-----SYLPFSQALSYV 75

Query: 119 LSYQAIYFLLYFFLLLPIAGLSL-------------------------PITITENLYLPH 153
+ + F F LL +A L PI + ++
Sbjct: 76 VDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKKINPIEGAKRIFSIK 135

Query: 154 FITDELMKTTTGTWLYVIAIAIIFYISARLVFALPYFIEDKSLKISGAIRKSWKYPQKHL 213
+ E +K+ L V+ ++I+ +I + L G L
Sbjct: 136 SLV-EFLKSI----LKVVLLSILIWI-----IIKGNLVTLLQLPTCGI------ECITPL 179

Query: 214 FFMLLKWVLIIVAIGFLVSIIATIIM 239
+L+ +++I +GF+V IA
Sbjct: 180 LGQILRQLMVICTVGFVVISIADYAF 205


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
lmo0620 | TATBPROTEIN | 26 | 0.045 | Bacterial sec-independent translocation TatB protein...
		>TATBPROTEIN#Bacterial sec-independent translocation TatB protein signature.

Length = 171

Score = 25.8 bits (56), Expect = 0.045
Identities = 12/74 (16%), Positives = 28/74 (37%), Gaps = 3/74 (4%)

Query: 53 QVESKLNGVSMPISEEISRDKLKDAIKQAQAGKIDFEIFIKLAGLAGVRLWEADLSAMKV 112
+ S V +++E+ + +D++K+ + + A + +R MK
Sbjct: 38 ALRSLATTVQNELTQELKLQEFQDSLKKVEKASLTNLTPELKASMDELRQAAES---MKR 94

Query: 113 TYIDNTGNDLVIEP 126
+Y+ N E
Sbjct: 95 SYVANDPEKASDEA 108


11  lmo0670  lmo0718  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
lmo0670        1       18  -3.375078  ABC transporter permease
lmo0671       -1       16  -2.889730  oxidoreductase
lmo0672        0       18  -2.188800  hypothetical protein
lmo0673        0       20  -1.902767  hypothetical protein
lmo0674       -1       24  -0.453727  hypothetical protein
lmo0675        2       31   1.478962  hypothetical protein
lmo0676        1       31   2.229582  hypothetical protein
lmo0677        1       30   2.653470  hypothetical protein
lmo0678        1       27   2.383520  flagellar biosynthesis protein FliP
lmo0679        1       25   2.557797  flagellar biosynthesis protein FliQ
lmo0680        1       22   2.771434  flagellar biosynthesis protein FliR
lmo0681        1       20   2.839688  flagellar biosynthesis protein FlhB
lmo0682        1       20   2.164971  flagellar biosynthesis protein FlhA
lmo0683        3       16   1.294716  flagellar biosynthesis regulator FlhF
lmo0684        3       16   1.150930  flagellar basal body rod protein FlgG
lmo0685        2       15   1.082342  chemotaxis protein CheR
lmo0686        2       17   0.625415  hypothetical protein
lmo0687        2       15   0.719824  flagellar motor protein MotA
lmo0688        2       15   1.021329  flagellar motor rotation MotB
lmo0689        2       15   1.214053  hypothetical protein
lmo0690        3       18   2.253140  hypothetical protein
lmo0691        4       22   2.627035  chemotaxis protein CheV
lmo0692        2       20   2.859843  flagellin
lmo0693        0       20   2.969593  chemotaxis response regulator CheY
lmo0694        0       19   2.694730  two-component sensor histidine kinase CheA
lmo0695        0       19   2.864884  flagellar motor switch protein FliY
lmo0696        0       16   2.086798  hypothetical protein
lmo0697        0       14   1.319939  hypothetical protein
lmo0698        2       15   0.279805  flagellar basal body rod modification protein
lmo0699        2       15  -0.008847  flagellar hook protein FlgE
lmo0700        2       16   0.558936  flagellar motor switch protein
lmo0701        1       15  -0.037221  flagellar motor switch protein FliM
lmo0702        2       17  -0.058519  flagellar motor switch protein FliY
lmo0703        2       18   0.149872  hypothetical protein
lmo0704        1       19   0.455287  hypothetical protein
lmo0705        2       21   0.991246  hypothetical protein
lmo0706        2       23   0.214231  hypothetical protein
lmo0707        3       22   0.500680  flagellar hook-associated protein FlgK
lmo0708        2       28   1.743568  flagellar hook-associated protein FlgL
lmo0709        2       30   2.579293  flagellar capping protein FliD
lmo0710        2       32   2.610471  flagellar protein
lmo0711        3       30   3.041956  hypothetical protein
lmo0712        2       30   3.076636  flagellar basal-body rod protein FlgB
lmo0713        3       31   2.722908  flagellar basal body rod protein FlgC
lmo0714        4       29   2.338795  flagellar hook-basal body protein FliE
lmo0715        2       28   2.118471  flagellar MS-ring protein FliF
lmo0716        2       21   2.099313  flagellar motor switch protein FliG
lmo0717        1       24   2.216690  flagellar assembly protein H
lmo0718        2       27   1.938759  flagellum-specific ATP synthase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0676   FLGBIOSNFLIP  167    1e-53    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 167 bits (425), Expect = 1e-53
Identities = 79/239 (33%), Positives = 132/239 (55%)

Query: 14 FIVIFAISLVVFWPGVNVHAESWMDSLGVNGTDGVNSSVALFVLVTVLSLSASIVLMFTH 73
+ + + L + P G + V V +T L+ +I+LM T
Sbjct: 4 LLSVAPVLLWLITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLMMTS 63

Query: 74 FTYCIIVLGLTRQGLGATNLPPNQVLVGLALFLSLFMMQPLITAWYDDVYKPSQKEEWSA 133
FT IIV GL R LG + PPNQVL+GLALFL+ F+M P+I Y D Y+P +E+ S
Sbjct: 64 FTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEKISM 123

Query: 134 SKVWDETQPLLTKYVAENTYKHDINMMLKAEGEDPVTKKEDAPLMALMPAFILTQITQGF 193
+ ++ L +++ T + D+ + + P+ E P+ L+PA++ +++ F
Sbjct: 124 QEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELKTAF 183

Query: 194 LTGMFIYLAFIFIDLIVSTLLMYLGMMMVPPMTISLPFKILVFIFIGGYGLITNMIFQT 252
G I++ F+ IDL+++++LM LGMMMVPP TI+LPFK+++F+ + G+ L+ + Q+
Sbjct: 184 QIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLAQS 242


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0677   TYPE3IMQPROT  43     5e-09    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 42.8 bits (101), Expect = 5e-09
Identities = 15/76 (19%), Positives = 34/76 (44%)

Query: 6 ITQIFQDFFYSGLALILPVSLICIVVVIVVAILMAMMQIQDQSLTFLPKIVAFVVALFIL 65
+ Y L L +++ ++ ++V + + Q+Q+Q+L F K++ + LF+L
Sbjct: 4 LVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLL 63

Query: 66 GPWMFEHMTDLFVGIF 81
W E + +
Sbjct: 64 SGWYGEVLLSYGRQVI 79


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0678   TYPE3IMRPROT  94     3e-25    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 94.0 bits (234), Expect = 3e-25
Identities = 51/230 (22%), Positives = 107/230 (46%), Gaps = 1/230 (0%)

Query: 12 VFSRVASFLFFFPLLKGRNIPNSVKVVFGMAISIPVATWVDVSGITTLPD-LLLRVTSEV 70
RV + + P+L R++P VK+ M I+ +A + + + L ++
Sbjct: 19 PLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPANDVPVFSFFALWLAVQQI 78

Query: 71 VFGLALAKLVEIIAVIPKMAGFMIDYDLGFSQVNLIDPSYGTQNSITAAILDTFFVVIFL 130
+ G+AL ++ + AG +I +G S +DP+ + A I+D +++FL
Sbjct: 79 LIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLARIMDMLALLLFL 138

Query: 131 SLQGMDYLIYYLMKSFEFTASVSILFEKGFIDLLLGTLGFALASAVSIALPIMGSIFIVN 190
+ G +LI L+ +F L + + +ALP++ + +N
Sbjct: 139 TFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIFLNGLMLALPLITLLLTLN 198

Query: 191 IILAFISKSAPQINIFMNAFIIKITFGIFILACAVPILSTVFKNLTDEMI 240
+ L +++ APQ++IF+ F + +T GI ++A +P+++ ++L E+
Sbjct: 199 LALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFCEHLFSEIF 248


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0679   TYPE3IMSPROT  275    1e-92    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 275 bits (704), Expect = 1e-92
Identities = 96/340 (28%), Positives = 183/340 (53%), Gaps = 2/340 (0%)

Query: 4 DNKTEKATPRRIKKARNEGNVAKSKELNNAFSLLIVAGLLYFFGEMFIKNTIQAFVALLK 63
KTE+ TP++I+ AR +G VAKSKE+ + ++ ++ +L + + ++ + + +
Sbjct: 3 GEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAE 62

Query: 64 QP--PKLANMESYSLFYLMEFGKVLMPIMVMVVIFGLMNYGVQVGILFSAKAVKPQFKRL 121
Q P + L+EF + P++ + + + ++ VQ G L S +A+KP K++
Sbjct: 63 QSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKKI 122

Query: 122 NPANYFKRVFSVKGIVEVVKALLLITLLSYVAYIGFRDHLDTLISYTGQNWLYSLGQIFA 181
NP KR+FS+K +VE +K++L + LLS + +I + +L TL+ +
Sbjct: 123 NPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLGQ 182

Query: 182 LFKNEFLALFLVIAVIGLLDFFYQRYDYKKGLRMSKQEIKDEMKDSEGRPEVKQRQRSIA 241
+ + + + VI + D+ ++ Y Y K L+MSK EIK E K+ EG PE+K ++R
Sbjct: 183 ILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQFH 242

Query: 242 RGLLQGSITKKMADATFVVNNPTHISVVMRYDKTKDHAPKLLVKGEDELALFIRQVADTD 301
+ + ++ + + ++ VV NPTHI++ + Y + + P + K D +R++A+ +
Sbjct: 243 QEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEEE 302

Query: 302 GVPMITNRQLARSIYYTTNPDEYIQEDLYKDVIEVMKELM 341
GVP++ LAR++Y+ D YI + + EV++ L
Sbjct: 303 GVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLE 342


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0681   GPOSANCHOR    31     0.007    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 31.2 bits (70), Expect = 0.007
Identities = 24/182 (13%), Positives = 60/182 (32%), Gaps = 9/182 (4%)

Query: 7 EKMEIFKGNSKREIHKKIQLVTNEPYKITDERVTKLGIFKKQYEVTAVIMSEVAIADGRM 66
+E + ++ + + T + KI K + ++ ++ + + +
Sbjct: 186 AALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADS 245

Query: 67 DFQETFQKSVVKTRPKTDDLLKKEKLLEMLAAGAELAQST------PLLEERKTQEEELS 120
+T + + +L K + + T L E+ E +
Sbjct: 246 AKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQ 305

Query: 121 SMRLELAALNRELAVKMREEREQNSDFVKFLKGRGISDTYVADF---MQAGRKQFKQVET 177
+ +L R+L +++ ++ K + IS+ + A R+ KQ+E
Sbjct: 306 VLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEA 365

Query: 178 AH 179
H
Sbjct: 366 EH 367


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0682   FLGHOOKAP1    28     0.036    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 28.4 bits (63), Expect = 0.036
Identities = 5/32 (15%), Positives = 13/32 (40%)

Query: 3 GLYIGAAGMMNYMQHIQVHSNNVANAQTPGFK 34
+ +G+ + SNN+++ G+
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYT 34


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0685   SECFTRNLCASE  28     0.032    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 28.3 bits (63), Expect = 0.032
Identities = 13/50 (26%), Positives = 25/50 (50%), Gaps = 8/50 (16%)

Query: 3 ITTIIGLVLAVIVIAGSFMIQNISLAMLFSAEALIVIILGTITAVMMAHP 52
+TT+ L L ++I G +I+ AM++ + GT ++V +A
Sbjct: 262 MTTL--LALVPMLIWGGDVIRGFVFAMVWG------VFTGTYSSVYVAKN 303


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0686   OMPADOMAIN    58     1e-11    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 57.6 bits (139), Expect = 1e-11
Identities = 36/129 (27%), Positives = 58/129 (44%), Gaps = 16/129 (12%)

Query: 148 ITIRDDILFQSGSAEL-SAGKREIAKEIGELFAQGKGTMEGIVSGHTDNVPISTSIYSSN 206
T++ D+LF A L G+ + + +L +V G+TD I + Y N
Sbjct: 215 FTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDR--IGSDAY--N 270

Query: 207 WELSVARAVNFMEAIIQENSEVNPGEFSARGYGEFRPVAKNDIAANREK---------NR 257
LS RA + ++ +I + + + SARG GE PV N +++ +R
Sbjct: 271 QGLSERRAQSVVDYLISKG--IPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDR 328

Query: 258 RVEIMVRPI 266
RVEI V+ I
Sbjct: 329 RVEIEVKGI 337


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0688   SYCDCHAPRONE  29     0.039    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 28.7 bits (64), Expect = 0.039
Identities = 21/130 (16%), Positives = 48/130 (36%), Gaps = 8/130 (6%)

Query: 165 IYHYGYMSEIVEKQDKSDRNLRLLEKEVKNNKNSGFVHFNIGQEMNRLGNKKEALKEFSE 224
+Y + K + + + + L V ++ +S F +G +G A+ +S
Sbjct: 39 LYSLAFNQYQSGKYEDAHKVFQALC--VLDHYDSRF-FLGLGACRQAMGQYDLAIHSYSY 95

Query: 225 AFRLRDHNHYIWAKLSAYHIAELLEQEKRYDESLAIIEEARVIWPNVPEFPLKKANILYV 284
+ +H AE L Q+ E+ + + A+ + + EF + +
Sbjct: 96 GAIMDIKEPR-----FPFHAAECLLQKGELAEAESGLFLAQELIADKTEFKELSTRVSSM 150

Query: 285 NHQLEDAKEI 294
++ KE+
Sbjct: 151 LEAIKLKKEM 160


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0689   HTHFIS        43     9e-07    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 42.9 bits (101), Expect = 9e-07
Identities = 30/114 (26%), Positives = 47/114 (41%), Gaps = 13/114 (11%)

Query: 175 TIFIAEDSQMLRQLLEDTLHEAGYTNLQFFANGREAQEHIFKLLKEQKEQTFENVNLLIT 234
TI +A+D +R +L L AGY +N I +L++T
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRIT-SNAATLWRWI-------AAGDG---DLVVT 53

Query: 235 DIEMPQMDGHHLTKVIKEDEIGRELPVVIFSSLITEDLEHKGAGVGADAQVSKP 288
D+ MP + L IK + +LPV++ S+ T K + GA + KP
Sbjct: 54 DVVMPDENAFDLLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKP 105


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0690   FLAGELLIN     127    2e-35    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 127 bits (319), Expect = 2e-35
Identities = 84/277 (30%), Positives = 129/277 (46%), Gaps = 9/277 (3%)

Query: 1 MKVNTNIISLKTQEYLRKNNEGMTQAQERLASGKRINSSLDDAAGLAVVTRMNVKSTGLD 60
+NTN +SL TQ L K+ ++ A ERL+SG RINS+ DDAAG A+ R GL
Sbjct: 2 QVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLT 61

Query: 61 AASKNSSMGIDLLQTADSALSSMSSILQRMRQLAVQSSNGSFSDEDRKQYTAEFGSLIKE 120
AS+N++ GI + QT + AL+ +++ LQR+R+L+VQ++NG+ SD D K E ++E
Sbjct: 62 QASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEE 121

Query: 121 LDHVADTTNYNNIKLLDQTATGAATQVSIQASDKANDLINIDLFNAKGLSAGTITLGSGS 180
+D V++ T +N +K+L Q Q+ IQ + I IDL S G G
Sbjct: 122 IDRVSNQTQFNGVKVLSQD-----NQMKIQVGANDGETITIDLQKIDVKSLG----LDGF 172

Query: 181 TVAGYSALSVADADSSQQATEAIDELINNISNGRALLGAGMSRLSYNVSNVNNQSIATKA 240
V G +V D SS + D + R + +G V ++ A
Sbjct: 173 NVNGPKEATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAA 232

Query: 241 SASSIEDADMAAEMSEMTKYKILTQTSISMLSQANQT 277
+ D ++ K T + + A
Sbjct: 233 NGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAI 269



Score = 75.9 bits (186), Expect = 1e-17
Identities = 51/294 (17%), Positives = 103/294 (35%), Gaps = 16/294 (5%)

Query: 4 NTNIISLKTQEYLRKNNEGMTQAQERLASGKRINSSLDDAAGLAVVTRMNVKSTGLDAAS 63
+T ++ + Y+ N +T + + + AG A + G
Sbjct: 217 DTTAPTVPDKVYVNAANGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGD 276

Query: 64 KNSSMGIDLLQTADSALSSMSSILQRMRQLAVQSSNGSFSDEDRKQYTAEFGSLIKELDH 123
G+ + + + V + + ++ +
Sbjct: 277 TFDYKGVTFTIDTKTGNDGNGKVSTTINGEKVTLTVAD----ITAGAANVDAATLQSSKN 332

Query: 124 VADTTNYNNIKLLDQTATGAATQVSIQASDKANDLINIDLFNAKGLSAGTITLGSGSTVA 183
V + D+T +A ++A++ I + A+ + + +
Sbjct: 333 VYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKT 392

Query: 184 GYSA------------LSVADADSSQQATEAIDELINNISNGRALLGAGMSRLSYNVSNV 231
+ + A S+ +ID ++ + R+ LGA +R ++N+
Sbjct: 393 MFIDKTASGVSTLINEDAAAAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNL 452

Query: 232 NNQSIATKASASSIEDADMAAEMSEMTKYKILTQTSISMLSQANQTPQMLTQLI 285
N ++ S IEDAD A E+S M+K +IL Q S+L+QANQ PQ + L+
Sbjct: 453 GNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLL 506


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0691   HTHFIS        93     9e-26    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 93.4 bits (232), Expect = 9e-26
Identities = 27/115 (23%), Positives = 53/115 (46%), Gaps = 1/115 (0%)

Query: 3 KLLIVDDAMFMRTMIKNIVKDSDFEVVAEAENGLEAVKKYDEVKPDIVTLDITMPEMDGL 62
+L+ DD +RT++ + + ++V N + D+V D+ MP+ +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 63 EALAQIMAKDPSAKVIMCSAMGQQGMVVDAIKKGAKDFIVKPFQADRVLEALEKA 117
+ L +I P V++ SA + A +KGA D++ KPF ++ + +A
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0692   PF06580       36     5e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 35.6 bits (82), Expect = 5e-04
Identities = 14/67 (20%), Positives = 22/67 (32%), Gaps = 9/67 (13%)

Query: 353 LIRNSVDHGAETVEVRRKNGKNETATINLKAFHSGNNVVIEIADDGAGINKRKVLEKAIA 412
L+ N + HG + I LK V +E+ + G+ K
Sbjct: 263 LVENGIKHGIA--------QLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTG 314

Query: 413 -KNVVTR 418
+NV R
Sbjct: 315 LQNVRER 321


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0693   FLGMOTORFLIN  60     5e-15    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 59.9 bits (145), Expect = 5e-15
Identities = 20/81 (24%), Positives = 46/81 (56%), Gaps = 3/81 (3%)

Query: 21 GREKGSIRQVD---NIGVNLIVRLGKKEMPVGDIAELSIGDVLEVEKKPGHKVEIFLDEK 77
G G+++ +D +I V L V LG+ M + ++ L+ G V+ ++ G ++I ++
Sbjct: 45 GDVSGAMQDIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGY 104

Query: 78 KVGIGEAILMDENFGIVISEI 98
+ GE +++ + +G+ I++I
Sbjct: 105 LIAQGEVVVVADKYGVRITDI 125


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0695   IGASERPTASE   29     0.041    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 28.9 bits (64), Expect = 0.041
Identities = 33/202 (16%), Positives = 56/202 (27%), Gaps = 19/202 (9%)

Query: 48 EADNEEQATIPLKEIAPSLVSAKLLDSEPETKLPSAPLELKEVKETLAAIAKQAIDQPKI 107
A N E A + + + ++ S ETK + E KE T+ K ++ K
Sbjct: 1062 TAQNREVAKEAKSNVKANTQTNEVAQSGSETK-ETQTTETKE-TATVEKEEKAKVETEKT 1119

Query: 108 DSAPQVAQ--PP------------EMNTPKEPT---KNTTREQQPPPELIMPTKDSPKLA 150
P+V P E +PT K + + P K++
Sbjct: 1120 QEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNV 1179

Query: 151 ENVAKNQPALAKLPQEKEAVQLFKASIKEPVTAKEEVAVKKPAESSNIWHDTTKQLTPAA 210
E + E + + +P E K ++
Sbjct: 1180 EQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATT 1239

Query: 211 KVEVPVTLKQLDKTITDQIEQL 232
T+ D T T+ L
Sbjct: 1240 SSNDRSTVALCDLTSTNTNAVL 1261


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0697   FLGHOOKAP1    45     4e-07    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 44.9 bits (106), Expect = 4e-07
Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 5 MYTAISGMNAFQQALSVTSNNIANANTTGYKKQSVV 40
+ A+SG+NA Q AL+ SNNI++ N GY +Q+ +
Sbjct: 4 INNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTI 39



Score = 38.4 bits (89), Expect = 4e-05
Identities = 12/47 (25%), Positives = 25/47 (53%)

Query: 363 ISGSSLEGSNVDLSREFVNLMTYQSGFQGNTKVIRVADDVMKQIVNL 409
+S S V+L E+ NL +Q + N +V++ A+ + ++N+
Sbjct: 499 LSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0698   FLGMOTORFLIN  51     4e-12    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 51.4 bits (123), Expect = 4e-12
Identities = 22/68 (32%), Positives = 37/68 (54%)

Query: 7 IPLRIDFELGRTKQPVGSLLDVKKGTVFRLEDSTANVVKITISGKCIGYGEILTKDGKMF 66
IP+++ ELGRT+ + LL + +G+V L+ + I I+G I GE++ K
Sbjct: 60 IPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYG 119

Query: 67 VKITKLGE 74
V+IT +
Sbjct: 120 VRITDIIT 127


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0699   FLGMOTORFLIM  145    4e-43    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 145 bits (368), Expect = 4e-43
Identities = 81/334 (24%), Positives = 166/334 (49%), Gaps = 9/334 (2%)

Query: 1 MSDKLSQEQIDALLSQMSEGKV-VDESTEIGDFGRFHPYDFHKPEKFGAEHLESLKTIAS 59
M++ LSQ++ID LL+ +S G ++++ I D + YDF +P+KF E + +L +
Sbjct: 1 MTEVLSQDEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHE 60

Query: 60 AFTKKSMEFVSQRIRIPIHTEATLADQVSFASGYIETMPNDSYIFCIIDLGNPELGQIII 119
F + + +S ++R +H DQ+++ +I ++P S +I + +P G ++
Sbjct: 61 TFARLTTTSLSAQLRSMVHVHVASVDQLTYEE-FIRSIPTPS-TLAVITM-DPLKGNAVL 117

Query: 120 ELDLAYIIYIHECLSGGNPKRKLSERRLLSVFEELTLKSILEKFCEALKDSFKSVHPISP 179
E+D + I + L GG + +R L+ E ++ ++ + +++S+ V + P
Sbjct: 118 EVDPSITFSIIDRLFGGT-GQAAKVQRDLTDIENSVMEGVIVRILANVRESWTQVIDLRP 176

Query: 180 EIVNIETNPALLRVTSPNDMMALVSVDIKSEFWISTMRIGVPFFSVEEIMNKLEN---VV 236
+ IETNP ++ P++M+ LV+++ K M +P+ ++E I++KL +
Sbjct: 177 RLGQIETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFS 236

Query: 237 EYTFDKRRNFDAEVEQELHQVEKEARIRVGEIKTTWKELNKLEVGDVL-LTETHIRDTLK 295
+ + +L V+ + VG ++ + +++ L VGD++ L +TH+ D
Sbjct: 237 SVRRSSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFV 296

Query: 296 GYVTEKWKFECYMGKSGNQKAVKFMRHTGRTEQE 329
+ + KF C G G + A + + T QE
Sbjct: 297 LSIGNRKKFLCQPGVVGKKIAAQILERIESTSQE 330


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0700   FLGMOTORFLIN  51     4e-10    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 51.4 bits (123), Expect = 4e-10
Identities = 21/71 (29%), Positives = 42/71 (59%)

Query: 444 ILEDIPVTLEVVFGTAKVKLEKFISWCEKDVIILKESMNEPLVLALNGVTIGKGILVRVD 503
++ DIPV L V G ++ +++ + + V+ L EPL + +NG I +G +V V
Sbjct: 56 LIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVA 115

Query: 504 DHFGIQMTELV 514
D +G+++T+++
Sbjct: 116 DKYGVRITDII 126


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0705   FLGHOOKAP1    135    2e-36    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 135 bits (340), Expect = 2e-36
Identities = 112/556 (20%), Positives = 215/556 (38%), Gaps = 66/556 (11%)

Query: 4 SDFNTSLSGMSAAQIANMVAQQNISNMNTPGYIRQAVDQTAVYGDGGLLGGKQTGYGVKV 63
S N ++SG++AAQ A A NIS+ N GY RQ + L G G GV V
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTT--IMAQANSTLGAGGWVGNGVYV 59

Query: 64 TDIKRLTNTALTTQYNNQIAKQSASLYQSGALNQALNLFGTPGKNTPSDNLDNFFTAWAA 123
+ ++R + +T Q + S + +++ N+ T + + + +FFT+
Sbjct: 60 SGVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLAT-QMQDFFTSLQT 118

Query: 124 LAKNPDQATNTTALLSSMSIFTDQLNQLHSGLKELETTIAADTDAAIQDLNSLIKKLGSI 183
L N + AL+ +Q L++ + + A++ +N+ K++ S+
Sbjct: 119 LVSNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASL 178

Query: 184 NKAI----GNAGSNPPNDLLNQRDQLLSTMAGYAGISVSAHPNNPDVYDVTIG-GRLVVQ 238
N I G PN+LL+QRDQL+S + G+ VS Y++T+ G +VQ
Sbjct: 179 NDQISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDG--GTYNITMANGYSLVQ 236

Query: 239 GDETTEITS-------TRTATGFEFSVDGQKLNMPE-----GSIIASVRVNQNEIKSYQE 286
G ++ + +RT + + +PE GS+ + ++ +
Sbjct: 237 GSTARQLAAVPSSADPSRTTVAY-VDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRN 295

Query: 287 KIETFSNGLAKALDDIQV------KNVNKTMDDLQK------------------INDALQ 322
+ + A+A + + + + K + DA
Sbjct: 296 TLGQLALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASA 355

Query: 323 ANPNDEKLLSNRDELLRQLEKFPGVTRSGDTLTIGGVDHPVDTLGTSTYVTDVNDFSIPI 382
D K+ + ++ Q+ + T T G T T VND S +
Sbjct: 356 VLATDYKISFDNNQW--QVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVND-SFTL 412

Query: 383 FAQSSGKWILNPAIT-------------SNADNKPFLGVIAADIASLKTDKNIQGTTFPS 429
S ++ IT ++DN+ ++ S +F
Sbjct: 413 KPVSDAIVNMDVLITDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKTVGGA---KSFND 469

Query: 430 FMDGIITEVATDASKSSATATADTQALSSLTESKSSLEGVNIDEEMTNIMQYQSYYVANT 489
+++++ + ++ ++ L+ + S+ GVN+DEE N+ ++Q YY+AN
Sbjct: 470 AYASLVSDIGNKTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANA 529

Query: 490 KAMNTVNDMMKALLAM 505
+ + T N + AL+ +
Sbjct: 530 QVLQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0706   FLAGELLIN     45     1e-07    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 45.4 bits (107), Expect = 1e-07
Identities = 37/238 (15%), Positives = 75/238 (31%), Gaps = 1/238 (0%)

Query: 1 MRISTNQQASSIINQLNNVSGNLAKYQLQVSSGKKYESMSENPGATAQILSYNHVLSQLN 60
I+TN + N LN +L+ ++SSG + S ++ A + + L
Sbjct: 2 QVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLT 61

Query: 61 REKTDVTEAKSLLNTAETSLSSMSTSMNRVNALVLQAINGTSDKDNMSQSAEEIKGLLDV 120
+ + + S+ T E +L+ ++ ++ RV L +QA NGT+ ++ +EI+ L+
Sbjct: 62 QASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEE 121

Query: 121 LISVANSED-DGRYVFSGSSTSVKPFTTDKTTGEIIYNGTTENKKFRVTDTLEVEVFHDG 179
+ V+N +G V S + + I + K +
Sbjct: 122 IDRVSNQTQFNGVKVLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEAT 181

Query: 180 SAMTDVFNNIQKIVDAMKTGDKDALSALQETNSKNIEIITNSMTNIGGQKNGVTAYDN 237
D G + + +
Sbjct: 182 VGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTD 239


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0711   FLGHOOKAP1    33     3e-04    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 32.6 bits (74), Expect = 3e-04
Identities = 27/120 (22%), Positives = 49/120 (40%), Gaps = 14/120 (11%)

Query: 4 GINTSGSALNAAKQWMEVSSNNIANADSSAAPGETPFLRKRVVLSEITPFETALTGTKGV 63
IN + S LNAA+ + +SNNI++ + + + R+ ++++ A G G
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAG------YTRQTTIMAQANSTLGA-GGWVGN 55

Query: 64 KVSEISSDTGSVKRVYDPTHPNANEAGYVNYANVDMTAEMTNLMVGQKMYAANTSALQAN 123
V V+R YD N+ + +TA + M + +TS+L
Sbjct: 56 GV-----YVSGVQREYDAFI--TNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQ 108



Score = 28.4 bits (63), Expect = 0.008
Identities = 18/72 (25%), Positives = 30/72 (41%), Gaps = 4/72 (5%)

Query: 65 VSEISSDTGSVKRVYDPTHPNANEAGY---VNYANVDMTAEMTNLMVGQKMYAANTSALQ 121
VS+I + T ++K T N + + V++ E NL Q+ Y AN LQ
Sbjct: 475 VSDIGNKTATLKTSSA-TQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQ 533

Query: 122 ANEKMMEKDLEI 133
+ + + I
Sbjct: 534 TANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0712   FLGHOOKFLIE   31     2e-04    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.

Length = 103

Score = 31.2 bits (70), Expect = 2e-04
Identities = 20/65 (30%), Positives = 37/65 (56%), Gaps = 1/65 (1%)

Query: 35 QMLDSMSDTQSNAQTSVSNLLTTGEG-NASDVLIQMKKAESEMKTAAVIRDNVIESYKQL 93
LD +SDTQ+ A+T G +DV+ M+KA M+ +R+ ++ +Y+++
Sbjct: 39 AALDRISDTQTAARTQAEKFTLGEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEV 98

Query: 94 LNMQV 98
++MQV
Sbjct: 99 MSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0713   FLGMRINGFLIF  171    7e-49    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 171 bits (435), Expect = 7e-49
Identities = 110/584 (18%), Positives = 219/584 (37%), Gaps = 83/584 (14%)

Query: 9 SKLKNWHKGAILVGLFVVVTVLL---LYMNTPKTEVTLYKNLSETSQQQVTDQLAKMGVD 65
++L+ + ++V V +++ L+ TP TL+ NLS+ + QL +M +
Sbjct: 17 NRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYR-TLFSNLSDQDGGAIVAQLTQMNIP 75

Query: 66 YTVDK-SGNILVDEKVETLVRDKFADLGIPYTGQDGNDILLNSSLGASEEDKKMQEKVGT 124
Y SG I V +R + A G+P G G ++L G S+ +++ +
Sbjct: 76 YRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQVNYQRAL 135

Query: 125 KVNLEKEIVQSYGTTIDSASVQLTLPESSSIFEEASQKGTAAVTLKTKNNQTLTSEQVLG 184
+ L + I + SA V L +P+ S F + +A+VT+ + + L Q+
Sbjct: 136 EGELARTIETLGP--VKSARVHLAMPKPSL-FVREQKSPSASVTVTLEPGRALDEGQISA 192

Query: 185 IQRTVSAAVPNVASDDVAIIDTKNGVISEADTSKEEGSSAYKNEVDIQNAIGKNVKTDIE 244
+ VS+AV + +V ++D ++++++TS + + A ++ N + ++ IE
Sbjct: 193 VVHLVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLNDA---QLKFANDVESRIQRRIE 249

Query: 245 GTLSSIFALDNFRVNTNVAVNFDEIKQNTEHY-PNDGKVRSNQKDTSTDTSKGSANTTES 303
LS I N ++F +Q EHY PN ++ + + S+
Sbjct: 250 AILSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPG 309

Query: 304 ---GTASN--------------------ADVPNYTEQNGDDTNTYTSEKSSETTNYELDS 340
G SN + P + ++ S + +ET+NYE+D
Sbjct: 310 GVPGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSNYEVDR 369

Query: 341 TIQEIKKHPA-LAKTNVVVWVDQNALNK------NGVDMAEFTKAIGVSAGLTPNMTTEE 393
TI+ K + + + +V V V+ L M + + G +
Sbjct: 370 TIRHTKMNVGDIERLSVAVVVNYKTLADGKPLPLTADQMKQIEDLTREAMGFSDK----- 424

Query: 394 AGADGEAAAAPTFEGTFQNGD-VTIMPIQFLDNATPAEKDTTEKAEPASKAWIWW----L 448
GD + ++ F + D T P + +
Sbjct: 425 ------------------RGDTLNVVNSPF------SAVDNTGGELPFWQQQSFIDQLLA 460

Query: 449 AGGLLFAVIAAGIITYIILLKRKEQLEEALEPEEKDYIPAEEAIINPEEHPDFNFQTDAF 508
AG L ++ A I+ + + + E + ++ + + E +
Sbjct: 461 AGRWLLVLVVAWILWRKAVRPQLTRRVEEAKAAQE-----QAQVRQETEEAVEVRLSKDE 515

Query: 509 DLSE--PELKARKESLKNKLGEMAKEDPGRAAAVIQKWLNERQE 550
L + + E + ++ EM+ DP A VI++W++ E
Sbjct: 516 QLQQRRANQRLGAEVMSQRIREMSDNDPRVVALVIRQWMSNDHE 559


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0714   FLGMOTORFLIG  193    8e-61    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 193 bits (491), Expect = 8e-61
Identities = 112/335 (33%), Positives = 189/335 (56%), Gaps = 4/335 (1%)

Query: 34 SGISRREKAALIIWSLDEQIATEVVDLLPDASKQRLAREMAKMKEMDGGAVEEATREFLG 93
S ++ ++KAA+++ S+ +I+++V L + L E+AK++ + + EF
Sbjct: 13 SALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFK- 71

Query: 94 ELELLSGGIAKLDREHLQRLFPDMTTEELNQLIYGVEAESRIGETALDILREIDDVDSLF 153
EL + I K ++ + L + I S + + +R D ++
Sbjct: 72 ELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIIN-NLGSALQSRPFEFVRRADPA-NIL 129

Query: 154 TIISDESPQTIAMIASYMKPEEASKLLALLPEEKMINTVIGIASLEQFDSEVMQNVSNLL 213
I E PQTIA+I SY+ P++AS +L+ LP E N IA +++ EV++ V +L
Sbjct: 130 NFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVL 189

Query: 214 RIKLDTMSNSSLNKTDGIKNVANILNNVTRGLERTIFEHLDAEQAELSERIKEKMFMFED 273
KL ++S+ G+ NV I+N R E+ I E L+ E EL+E IK+KMF+FED
Sbjct: 190 EKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFED 249

Query: 274 IILLDNMTLQQVLAEIQDNNKIARALKNEKEELKEKILSCVSKNRRDMITEELEVLGPIR 333
I+LLD+ ++Q+VL EI D ++A+ALK+ ++EKI +SK M+ E++E LGP R
Sbjct: 250 IVLLDDRSIQRVLREI-DGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTR 308

Query: 334 LSDVEQAQQDIANVVKNLEKDGKIVIQRGEQDVLI 368
DVE++QQ I ++++ LE+ G+IVI RG ++ ++
Sbjct: 309 RKDVEESQQKIVSLIRKLEEQGEIVISRGGEEDVL 343


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0715   GPOSANCHOR    32     0.002    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 32.0 bits (72), Expect = 0.002
Identities = 15/79 (18%), Positives = 33/79 (41%)

Query: 24 YLDDIEETEEIESPYSKELEQLESHQKELEKHLSAIEIEQQKLANEKAALQAERQAIEEL 83
I+ E ++ +LE + +A + + L EKAAL+AE+ +E
Sbjct: 244 DSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQ 303

Query: 84 RRDAEAEIEANKQAFEKEK 102
+ A ++ ++ + +
Sbjct: 304 SQVLNANRQSLRRDLDASR 322


12  lmo0739  lmo0756  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
lmo0739        0       12  -3.554165  hypothetical protein
lmo0740        0       13  -3.625420  PTS beta-glucoside transporter subunit IIABC
lmo0741        0       13  -3.814979  6-phospho-beta-glucosidase
lmo0742        0       13  -4.112999  transcriptional regulator
lmo0743        1       15  -4.282496  GntR family transcriptional regulator
lmo0744        0       16  -4.252571  ABC transporter ATP-binding protein
lmo0745       -1       18  -5.428276  hypothetical protein
lmo0746        1       23  -5.872982  ABC transporter ATP-binding protein
lmo0747        2       20  -3.093297  hypothetical protein
lmo0748        0       18  -2.909838  hypothetical protein
lmo0749        0       21  -1.802164  hypothetical protein
lmo0750        0       21  -0.880644  hypothetical protein
lmo0751        0       20   0.648354  hypothetical protein
lmo0752       -1       19   1.633177  hypothetical protein
lmo0753        1       19   2.012813  hypothetical protein
lmo0754        2       18   2.944227  hypothetical protein
lmo0755        3       19   3.084402  Crp/Fnr family transcriptional regulator
lmo0756        2       16   2.932502  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0752   BINARYTOXINA  30     0.011    Clostridial binary toxin A signature.
		>BINARYTOXINA#Clostridial binary toxin A signature.

Length = 454

Score = 30.0 bits (67), Expect = 0.011
Identities = 35/159 (22%), Positives = 57/159 (35%), Gaps = 41/159 (25%)

Query: 99 ILGSSSGSIVAMHVLKNHPEVVKKIAFHEPPINTFLPDSE---MWQEANEKIVQTALTKN 155
IL SS S L+ + AF E P + FL D E W++ + V+ L
Sbjct: 17 ILTSSFPSYTYAQDLQIASNYITDRAFIERPED-FLKDKENAIQWEKKEAERVEKNLDTL 75

Query: 156 MAEAMQLFGETLHIAPIDAESMSKPAVT--------IDEVTKDSTTQQM----------- 196
EA++L+ + D+E +S + T I+ ++ + +
Sbjct: 76 EKEALELYKK-------DSEQISNYSQTRQYFYDYQIESNPREKEYKNLRNAISKNKIDK 128

Query: 197 -----------KYWFTYEIRQYTSSNISLDDFKPYVHQI 224
K+ F EIR + ISL+ F I
Sbjct: 129 PINVYYFESPEKFAFNKEIRTENQNEISLEKFNELKETI 167


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0754   NUCEPIMERASE  28     0.031    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 27.8 bits (62), Expect = 0.031
Identities = 14/52 (26%), Positives = 21/52 (40%), Gaps = 11/52 (21%)

Query: 63 EASKKMTNPAPHKIYNTQVWIKNDRAVAIMQATIQTRTIINGVEMELNSDAK 114
E + AP+++YN I N V +M I +E L +AK
Sbjct: 244 ETGTPAASIAPYRVYN----IGNSSPVELM-------DYIQALEDALGIEAK 284


13  lmo0765  lmo0779  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
lmo0765        3       16  -3.248464  hypothetical protein
lmo0766        4       16  -3.668426  lipoate-protein ligase
lmo0767        2       15  -4.047228  hypothetical protein
lmo0768        0       17  -3.187646  sugar ABC transporter permease
lmo0769       -1       15  -2.123316  ABC transporter permease
lmo0770       -1       16  -1.793465  sugar ABC transporter substrate-binding protein
lmo0771       -1       14  -0.458762  alpha-1,6-mannanase
lmo0772       -2       15   0.845189  LacI family transcriptional regulator
lmo0773       -2       17   1.542147  hypothetical protein
lmo0774       -3       17   1.382735  transcriptional regulator
lmo0775       -2       18  -0.664159  alcohol dehydrogenase
lmo0776        2       19   0.664311  hypothetical protein
lmo0777        4       22   1.038224  hypothetical protein
lmo0778        4       21   0.850767  transcriptional regulator
lmo0779        4       22   1.428765  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0768   MALTOSEBP     64     2e-13    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 64.4 bits (156), Expect = 2e-13
Identities = 80/314 (25%), Positives = 127/314 (40%), Gaps = 21/314 (6%)

Query: 65 KVKYVMQENVEEKLLTGIAGGELPDIIMWDRYQTALYAPKGVLEPLDKLVKEDNLKMDDF 124
KV + +EEK A G+ PDII W + YA G+L + D D
Sbjct: 60 KVTVEHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAE----ITPDKAFQDKL 115

Query: 125 YEESVKEMTYSDKLYGLPLLNDNRILFYNKKLLQEAGVKPPTTWDELATAAQKTTKWDGN 184
Y + + Y+ KL P+ + L YNK LL PP TW+E+ A K K G
Sbjct: 116 YPFTWDAVRYNGKLIAYPIAVEALSLIYNKDLLP----NPPKTWEEIP-ALDKELKAKGK 170

Query: 185 KMTQAGMSLQDVGLFNLYLMQAGG------ELVTSDNKETAFNSEQGLEVLNYW-DKMQN 237
+LQ+ F L+ A G E D K+ ++ L + D ++N
Sbjct: 171 SALM--FNLQE-PYFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKN 227

Query: 238 DLKVYQRGFDDGSDAFAAGKEAMTYNGPWALADYNKVEDLDYGVVEPPKGPNGDKGAIMG 297
+ AF G+ AMT NGPWA ++ + ++YGV P +G
Sbjct: 228 KHMNADTDYSIAEAAFNKGETAMTINGPWAWSNIDT-SKVNYGVTVLPTFKGQPSKPFVG 286

Query: 298 GFGLVMPKQAEHKDGAWDFMKWWTTKPENGVEFAKISGWLPANKIAAEDEYFTKDPNYSV 357
+ + +K+ A +F++ + E G+E L A + + +E KDP +
Sbjct: 287 VLSAGINAASPNKELAKEFLENYLLTDE-GLEAVNKDKPLGAVALKSYEEELAKDPRIAA 345

Query: 358 FVNTMKYAKIRPTV 371
+ + +I P +
Sbjct: 346 TMENAQKGEIMPNI 359


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo0773  NUCEPIMERASE  29  0.033  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 28.6 bits (64), Expect = 0.033
Identities = 18/71 (25%), Positives = 30/71 (42%)

Query: 152 KIAVSGATGGVGSLSSAILSKRGFSVVASSGKSDAKEFLEKFGVSEIVSREAFQPEKVRA 211
K V+GA G +G S L + G VV +D + K E++++ FQ K+
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 212 LDKQLYAGAID 222
D++
Sbjct: 62 ADREGMTDLFA 72


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo0778  HOKGEFTOXIC  24  0.043  Hok/Gef cell toxic protein family signature.
		>HOKGEFTOXIC#Hok/Gef cell toxic protein family signature.

Length = 52

Score = 24.4 bits (53), Expect = 0.043
Identities = 9/34 (26%), Positives = 17/34 (50%), Gaps = 1/34 (2%)

Query: 1 MKKKWLLLILSVIVLCAIIFGIKWLLYRDNLVEM 34
MK L+ V+++C + +L R +L E+
Sbjct: 1 MKLPRSSLVWCVLIVCLTLLIFTYLT-RKSLCEI 33


14  lmo1051  lmo1061  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
lmo1051   2       35       2.199900   molybdenum cofactor biosynthesis protein B
lmo1052   2       31       1.678558   molybdopterin biosynthesis protein MoeB
lmo1053   0       27       1.537717   hypothetical protein
lmo1054   1       24       0.924825   peptide deformylase
lmo1055   1       15       -0.269331  pyruvate dehydrogenase subunit E1 alpha
lmo1056   0       13       -3.835506  pyruvate dehydrogenase subunit E1 beta
lmo1057   1       13       -3.988229  dihydrolipoamide acetyltransferase
lmo1058   0       14       -4.524215  dihydrolipoamide dehydrogenase
lmo1059   -1      12       -4.539974  hypothetical protein
lmo1060   0       12       -4.357224  L-lactate dehydrogenase
lmo1061   0       12       -3.724089  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo1054  RTXTOXIND  36  3e-04  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 36.3 bits (84), Expect = 3e-04
Identities = 17/80 (21%), Positives = 27/80 (33%), Gaps = 3/80 (3%)

Query: 42 DKSVEEITSPVSGTIKEIKVAEGTVATVGQVLVTFDGVEGHEDDAEEESAAPKAESTEST 101
+EI + +KEI V EG G VL+ + +A+
Sbjct: 93 SGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGA---EADTLKTQSSLLQARLE 149

Query: 102 PAPAQASGKGIFEFKLPDIG 121
Q + I KLP++
Sbjct: 150 QTRYQILSRSIELNKLPELK 169



Score = 32.5 bits (74), Expect = 0.004
Identities = 11/36 (30%), Positives = 16/36 (44%)

Query: 152 DKSVEEITSPVDGTVKDILVSEGTVATVGQVLVTFE 187
+EI + VK+I+V EG G VL+
Sbjct: 93 SGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLT 128



Score = 31.7 bits (72), Expect = 0.008
Identities = 10/34 (29%), Positives = 17/34 (50%), Gaps = 1/34 (2%)

Query: 152 DKSVEEITSPVDGTVKDILV-SEGTVATVGQVLV 184
+ I +PV V+ + V +EG V T + L+
Sbjct: 324 RQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLM 357



Score = 31.7 bits (72), Expect = 0.009
Identities = 17/59 (28%), Positives = 28/59 (47%), Gaps = 2/59 (3%)

Query: 18 EIVKWFVQPGDKIEE-DESLFEVQNDKSVEEITSPVSGTIKEIKV-AEGTVATVGQVLV 74
EI+ Q D I L + + + I +PVS ++++KV EG V T + L+
Sbjct: 299 EILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLM 357


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo1060  HTHFIS  90  7e-23  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.5 bits (222), Expect = 7e-23
Identities = 35/126 (27%), Positives = 60/126 (47%), Gaps = 5/126 (3%)

Query: 3 KILIAEDDSAILGVITAFLTEAGYQVMTAKNGIEAYHLFQKETFDLIIMDIMMPSMDGYT 62
IL+A+DD+AI V+ L+ AGY V N + DL++ D++MP + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 63 LTELIRST-STTPILMMTALSEEDDELKGFDLGADDYIQKPFSYLVLLKRVQVLLRRVNQ 121
L I+ P+L+M+A + +K + GA DY+ KPF L + ++ R
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFD----LTELIGIIGRALA 120

Query: 122 SETKKQ 127
++
Sbjct: 121 EPKRRP 126


15  lmo1071  lmo1087  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
lmo1071   2       10       -1.939125  hypothetical protein
lmo1072   1       12       -2.696635  hypothetical protein
lmo1073   1       14       -4.445386  hypothetical protein
lmo1074   2       13       -4.813119  cell division protein FtsW
lmo1075   2       14       -4.914017  pyruvate carboxylase
lmo1076   1       14       -4.648240  metal ABC transporter substrate-binding protein
lmo1077   0       15       -5.046419  teichoic acid translocation permease TagG
lmo1078   1       17       -4.767051  teichoic acid ABC transporter ATP-binding
lmo1079   2       17       -4.965855  autolysin
lmo1080   1       17       -5.200853  teichoic acid biosynthesis protein B
lmo1081   1       17       -5.037614  UDP-glucose pyrophosphorylase
lmo1082   0       17       -4.781535  hypothetical protein
lmo1083   -1      17       -5.235969  teichoic acid biosynthesis protein GgaB
lmo1084   -1      16       -5.232701  glucose-1-phosphate thymidyl transferase
lmo1085   -1      16       -5.261059  dTDP-sugar epimerase
lmo1086   -1      16       -4.876479  dTDP-D-glucose 4,6-dehydratase
lmo1087   -2      14       -3.321573  DTDP-L-rhamnose synthetase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo1072  RTXTOXIND  36  0.001  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 36.0 bits (83), Expect = 0.001
Identities = 8/33 (24%), Positives = 19/33 (57%)

Query: 1114 TIQAPFDGEVSSIYVSDGDTIESGDLLIEVNRI 1146
I+ + V I V +G+++ GD+L+++ +
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTAL 130



Score = 31.0 bits (70), Expect = 0.028
Identities = 13/35 (37%), Positives = 20/35 (57%)

Query: 1083 TGSVIQVVVKKGDSVKKGDPLLITEAMKMETTIQA 1117
V +++VK+G+SV+KGD LL A+ E
Sbjct: 104 NSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLK 138


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo1073  FERRIBNDNGPP  37  6e-05  Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 36.8 bits (85), Expect = 6e-05
Identities = 28/130 (21%), Positives = 45/130 (34%), Gaps = 9/130 (6%)

Query: 50 PTKIVSLIPSNTEILFALGLGD-EVKGVSAYDDYPKEAQKIEKV----TSTSVDTEKIIA 104
P +IV+L E+L ALG+ V Y + E + V T + E +
Sbjct: 35 PNRIVALEWLPVELLLALGIVPYGVADTINYRLWVSEPPLPDSVIDVGLRTEPNLELLTE 94

Query: 105 LKPDLVLGHESMLATEKDAYKILTDAGINVFVVPDATN-LKEVEKSIATIGDLTGTEKEA 163
+KP ++ + + +I A F D L KS+ + DL + A
Sbjct: 95 MKPSFMVWSAGYGPSPEMLARI---APGRGFNFSDGKQPLAMARKSLTEMADLLNLQSAA 151

Query: 164 TKVTDSMEKQ 173
E
Sbjct: 152 ETHLAQYEDF 161


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo1076  FLGFLGJ  71  2e-15  Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 70.9 bits (173), Expect = 2e-15
Identities = 57/228 (25%), Positives = 105/228 (46%), Gaps = 26/228 (11%)

Query: 43 EVAREEMPPESEEPVFSLEQNR-------DDAMAALVVPQTRNSFLRAASTPTFQQTFIN 95
++ E+ PE P ++ + A++ LV ++ S P + F+
Sbjct: 97 QMTPEQPLPEESTPAAPMKFPLETVVRYQNQALSQLVQKAVPRNYDD--SLPGDSKAFLA 154

Query: 96 SISTQAMDLCKKYNLYPSVMIAQAALESNWGRSELGKA---PNYNLFGIK--GSYNGKSV 150
+S A ++ + +++AQAALES WG+ ++ + P+YNLFG+K G++ G
Sbjct: 155 QLSLPAQLASQQSGVPHHLILAQAALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVT 214

Query: 151 TMKTWEYSDSKGWYQINANFAKYPSHKESLEDNAKKLRNGPSWDSSYYKGAWRENAKTYK 210
+ T EY + + ++ A F Y S+ E+L D L P + + + + A+ +
Sbjct: 215 EITTTEYENGEA-KKVKAKFRVYSSYLEALSDYVGLLTRNPRYAAVTTAASAEQGAQALQ 273

Query: 211 DATAWLQGRYATDNTYASKLNTLISSYNLTQYDTLYDTIKQQKNVSED 258
DA YATD YA KL +I Q ++ D + + +++ D
Sbjct: 274 DAG------YATDPHYARKLTNMIQ-----QMKSISDKVSKTYSMNID 310


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo1083  NUCEPIMERASE  185  2e-58  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 185 bits (471), Expect = 2e-58
Identities = 81/333 (24%), Positives = 138/333 (41%), Gaps = 28/333 (8%)

Query: 1 MNLLVTGGAGFIGSNFVHHILNKHDDYKVVNLDLLT-YAGT---MSNLEDIKENPNHVFV 56
M LVTG AGFIG + +L VV +D L Y + LE + P F
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQ--VVGIDNLNDYYDVSLKQARLE-LLAQPGFQFH 57

Query: 57 EGNICDYDLVKKLVTDHKIDTIVNFAAESHVDRSIINPGIFIETNVQGTLNLLNVAKELN 116
+ ++ D + + L + + V S+ NP + ++N+ G LN+L +
Sbjct: 58 KIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNK 117

Query: 117 VAKYLQVSTDEVYGSLGETGYFTEETPIA-PNSPYSASKASADLLVRSYFETYGLNVNIT 175
+ L S+ VYG L F+ + + P S Y+A+K + +L+ +Y YGL
Sbjct: 118 IQHLLYASSSSVYG-LNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGL 176

Query: 176 RCSNNYGPHHFPEKLIPLMITNGLDGENLPIYGDGKNIRDWLHVSDHCAAIDLVIHNGKS 235
R YGP P+ + L+G+++ +Y GK RD+ ++ D AI +
Sbjct: 177 RFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPH 236

Query: 236 ------------------GEVYNVGGHNERTNNEIVHIIVDDLNLSKDKIVYVEDRLGHD 277
VYN+G + + + + D L + + K + + G
Sbjct: 237 ADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGI-EAKKNMLPLQPGDV 295

Query: 278 LRYAIDPKKIETELGWEPKYTFDTGIKETIEWY 310
L + D K + +G+ P+ T G+K + WY
Sbjct: 296 LETSADTKALYEVIGFTPETTVKDGVKNFVNWY 328


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo1084  NUCEPIMERASE  64  9e-14  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 63.6 bits (155), Expect = 9e-14
Identities = 53/244 (21%), Positives = 89/244 (36%), Gaps = 47/244 (19%)

Query: 1 MSILVTGANGQLGTELVQLLKEHNLTVTEWD----------KDS--------------VD 36
M LVTGA G +G + + L E V D K + +D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 37 IVDKAAVKKAMLDLKPEWIIHCAAFTNVEAA-EDELKNVNWEVNVDGTENISEAAEIVGA 95
+ D+ + E + V + E+ + N+ G NI E
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYA--DSNLTGFLNILEGCRHNKI 118

Query: 96 K-LVYISTDYVFDGTKKEAYLPDDKTN-PLNQYGIAKLAGEKVALEKNSQTYVIRTS--- 150
+ L+Y S+ V+ +K + DD + P++ Y K A E +A S Y + +
Sbjct: 119 QHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTY-SHLYGLPATGLR 177

Query: 151 --WVFGKYGN------NFVYSMLKLAETHKELKVVNDQLGRPTYTY--DLADFIRFVIEK 200
V+G +G F +ML+ K + V N + +TY D+A+ I + +
Sbjct: 178 FFTVYGPWGRPDMALFKFTKAMLE----GKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDV 233

Query: 201 NPAY 204
P
Sbjct: 234 IPHA 237


16  lmo1096  lmo1123  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
lmo1096   4       18       -2.279484  NAD synthetase
lmo1097   7       24       -5.296059  hypothetical protein
lmo1097a  7       26       -5.969212  PTS cellbiose transporter subunit IIB
lmo1098   7       23       -5.699778  GMP synthase
lmo1099   7       21       -4.105635  integrase
lmo1100   7       18       -2.564745  hypothetical protein
lmo1101   7       20       -1.240393  hypothetical protein
lmo1102   7       21       -0.662057  hypothetical protein
lmo1103   7       21       -0.101735  cadmium resistance protein
lmo1104   6       21       0.548229   lipoprotein signal peptidase
lmo1105   6       22       0.244197   cadmium efflux system accessory protein
lmo1106   4       23       0.346110   hypothetical protein
lmo1107   6       27       0.476829   P60 protein
lmo1108   6       29       0.657255   hypothetical protein
lmo1109   6       28       0.385823   hypothetical protein
lmo1110   8       21       -1.188872  hypothetical protein
lmo1111   7       20       -1.910500  hypothetical protein
lmo1112   8       20       -2.728945  hypothetical protein
lmo1113   7       21       -5.559217  hypothetical protein
lmo1114   7       21       -6.507589  hypothetical protein
lmo1115   7       21       -6.593081  hypothetical protein
lmo1116   9       28       -9.133568  hypothetical protein
lmo1117   9       27       -9.318019  hypothetical protein
lmo1118   9       28       -9.763515  fibrinogen-binding protein
lmo1119   8       28       -8.631977  regulatory protein
lmo1120   6       28       -7.931806  hypothetical protein
lmo1121   7       28       -7.484704  hypothetical protein
lmo1122   4       27       -6.401049  methylase
lmo1123   1       19       -3.712396  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo1105  IGASERPTASE  39  7e-05  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 39.3 bits (91), Expect = 7e-05
Identities = 37/226 (16%), Positives = 75/226 (33%), Gaps = 16/226 (7%)

Query: 505 HERQNENGNESTANRPTSTKQNRSAGTRAGEKMGDLMEAKNRMVDKAGDMKDTIKNAPTN 564
+ + E N++ +T N A + + + + + T
Sbjct: 981 YNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETV 1040

Query: 565 AKYAVHKGKRDLVRNVSEFSQSFADTRNLQQQERATKRNEKKATVAQRRQEMDKAKAEKS 624
A+ + + K + + ++ + K N + VAQ E + + ++
Sbjct: 1041 AENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTET 1100

Query: 625 NLEPYASMQKRQRDYEERKQPVPRTAPTPAP-----QASTPKATPERTASTNQQHERPLQ 679
+++ + E+ Q VP+ +P + P+A P R + P Q
Sbjct: 1101 KETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEP-Q 1159

Query: 680 KQKNTTKEQVKANKR----------ENTSTTSKNSKTENKITKTTT 715
Q NTT + + K E+T+ + NS EN T
Sbjct: 1160 SQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPA 1205


17  lmo1151  lmo1159  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias    Product
lmo1151   0       14       3.287114   cobalamin (5'-phosphatase) synthetase
lmo1152   0       14       2.848604   alpha-ribazole-5'-phosphatase
lmo1153   -1      13       2.848498   transcriptional regulator PocR
lmo1154   0       15       3.081653   PduA protein
lmo1155   1       16       3.105574   PduB protein
lmo1156   0       16       3.912078   propanediol dehydratase subunit alpha
lmo1157   -1      18       3.179880   diol dehydratase subunit gamma
lmo1158   0       18       3.382956   diol dehydratase subunit gamma
lmo1159   0       20       3.057760   diol dehydratase-reactivating factor large
18  lmo1175  lmo1185  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias    Product
lmo1175   4       29       1.744768   two-component response regulator
lmo1176   5       25       0.905020   two-component sensor histidine kinase
lmo1177   5       24       0.868471   ethanolamine utilization protein EutA
lmo1178   3       21       0.227694   ethanolamine ammonia-lyase large subunit
lmo1179   1       21       0.840567   ethanolamine ammonia-lyase small subunit
lmo1180   -1      16       0.657967   carboxysome structural protein EutL
lmo1181   -1      14       0.166138   carboxysome structural protein
lmo1182   1       18       0.428636   alcohol dehydrogenase
lmo1183   2       14       -1.813532  carboxysome structural protein
lmo1184   2       13       -2.504572  cobalamin adenosyl transferase
lmo1185   2       13       -2.609716  PduL protein
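
Among the islands listed in this output, every row flagged Y for both Bias and Virulence is labelled "Pathogenicity Island (biased-composition)", while rows flagged Y for Bias but N for Virulence are labelled "Genomic Island". The Python sketch below encodes only that observed pattern. It is an inference from the rows shown here, not PredictBias's documented decision procedure, and the function name classify_island is hypothetical.

```python
def classify_island(bias: str, virulence: str) -> str:
    """Label a region from its Bias and Virulence flags ("Y" or "N").

    Assumption: this mirrors the pattern visible in the listing only;
    the tool may apply further rules (for example for regions without
    compositional bias) that are not visible here.
    """
    if bias == "Y" and virulence == "Y":
        return "Pathogenicity Island (biased-composition)"
    if bias == "Y":
        return "Genomic Island"
    # Flag combinations that do not occur in this listing are left undecided.
    return "unclassified"


# Flags copied from island 16 (Y, Y) and island 18 (Y, N).
print(classify_island("Y", "Y"))
print(classify_island("Y", "N"))
```

The Insertion elements flag is deliberately ignored: islands with Y and with N in that column carry the same label in this listing.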
19  lmo1260  lmo1267  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
lmo1260   1       15       -3.041896  hypothetical protein
lmo1261   0       14       -3.407530  hypothetical protein
lmo1262   2       15       -2.972401  gamma-glutamyl phosphate reductase
lmo1263   2       14       -2.388912  gamma-glutamyl kinase
lmo1264   2       14       -2.437085  hypothetical protein
lmo1265   2       13       -2.001217  transcriptional regulator
lmo1266   2       13       -1.598038  transcriptional regulator
lmo1267   2       15       -1.094318  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo1260  CARBMTKINASE  50  4e-09  Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 49.8 bits (119), Expect = 4e-09
Identities = 32/148 (21%), Positives = 64/148 (43%), Gaps = 25/148 (16%)

Query: 124 TLESLLERGI---------IPIVNENDTVAVEELEHVTKYGDNDLLSAIVAKLVQADLLI 174
T++ L+ERG+ +P++ E+ E++ V D DL +A+ V AD+ +
Sbjct: 178 TIKKLVERGVIVIASGGGGVPVILEDG-----EIKGVEAVIDKDLAGEKLAEEVNADIFM 232

Query: 175 MLSDIDGFYGSNPSTDPDAVMFSEINQITPEIEALAGGKGSKFGTGGMLTKLSAAS-YCM 233
+L+D++G T + ++ E E + F G M K+ AA +
Sbjct: 233 ILTDVNGAA-LYYGT-EKE---QWLREVKVE-ELRKYYEEGHFKAGSMGPKVLAAIRFIE 286

Query: 234 NANQKMILTNGKNPTIIFDIMQGEQIGT 261
++ I+ + + ++G+ GT
Sbjct: 287 WGGERAIIA---HLEKAVEALEGK-TGT 310


20  lmo1330  lmo1343  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
lmo1330   2       19       -1.031070  ribosome-binding factor A
lmo1331   2       19       -1.140598  tRNA pseudouridine synthase B
lmo1332   3       22       -3.358034  riboflavin kinase
lmo1333   2       17       -3.773814  30S ribosomal protein S15
lmo1333a  1       18       -4.124260  polynucleotide phosphorylase
lmo1334   1       14       -2.900992  GTPase EngC
lmo1335   -1      15       -3.660270  hypothetical protein
lmo1336   0       15       -3.904865  hypothetical protein
lmo1337   0       14       -4.016559  hypothetical protein
lmo1338   0       15       -5.014066  50S ribosomal protein L33
lmo1339   0       14       -4.935260  5-formyltetrahydrofolate cyclo-ligase
lmo1340   2       18       -6.406907  hypothetical protein
lmo1341   1       17       -6.224200  hypothetical protein
lmo1342   1       18       -4.598276  glucose kinase
lmo1343   1       15       -3.382087  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo1332  PF05272  33  0.002  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 32.7 bits (74), Expect = 0.002
Identities = 16/65 (24%), Positives = 27/65 (41%), Gaps = 7/65 (10%)

Query: 175 ALESDLKPNSTLILLGSSGVGKSSFINSLAGTDLMKTAGIREDDSKGKHTTTHREMHLLS 234
+E K + +++L G+ G+GKS+ IN+L G D D GK + +
Sbjct: 588 VMEPGCKFDYSVVLEGTGGIGKSTLINTLVGLDFFSDTHF--DIGTGKDSY----EQIAG 641

Query: 235 NGWIV 239

Sbjct: 642 I-VAY 645


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo1333  56KDTSANTIGN  27  0.038  Rickettsia 56kDa type-specific antigen protein sign...
		>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature.

Length = 533

Score = 26.8 bits (59), Expect = 0.038
Identities = 13/38 (34%), Positives = 21/38 (55%)

Query: 52 KNKYEKLLAQQEVDKATEAEAKKKAEEDAKKKAEEAKK 89
+ EKL AQQE D + + K ++ A +K++E K
Sbjct: 391 RKAMEKLAAQQEEDAKNQGKGDCKQQQGASEKSKEGKV 428


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo1337  SYCDCHAPRONE  34  8e-04  Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 33.7 bits (77), Expect = 8e-04
Identities = 22/114 (19%), Positives = 40/114 (35%), Gaps = 4/114 (3%)

Query: 401 NVAVQQYLQEQNKKEATKITDNLISSGSADGYSYTYAASIALQD-KQIDKAETMAKEAIN 459
++A QY + ++A K+ L D + Q Q D A
Sbjct: 41 SLAFNQYQSGK-YEDAHKVFQALCVLDHYD-SRFFLGLGACRQAMGQYDLAIHSYSYGAI 98

Query: 460 IDKDIPEAHYYLSVCYRIKGDMPNAIKEANSARELSSN-PFFDSYYDELEKIKE 512
+D P ++ + C KG++ A A+EL ++ F + + E
Sbjct: 99 MDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEFKELSTRVSSMLE 152


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo1343  BCTERIALGSPG  34  2e-05  Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 34.5 bits (79), Expect = 2e-05
Identities = 16/46 (34%), Positives = 27/46 (58%)

Query: 1 MNKINGFSLVESMVSLLLFAMVCSFLLPTAMTIFEKLDHQKETSRV 46
+K GF+L+E MV +++ ++ S ++P M EK D QK S +
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDI 49


21  lmo1545  lmo1551  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
lmo1545   1       19       -4.145000  50S ribosomal protein L21
lmo1546   1       19       -3.906969  ribonuclease G
lmo1547   3       17       -3.195038  septum formation inhibitor MinD
lmo1548   4       17       -2.548720  septum formation inhibitor MinC
lmo1549   3       16       -2.719977  cell-shape determining protein MreD
lmo1550   3       15       -2.852737  rod shape-determining protein MreC
lmo1551   3       14       -1.884922  rod shape-determining protein MreB
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo1547  GPOSANCHOR  28  0.042  Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 28.1 bits (62), Expect = 0.042
Identities = 14/44 (31%), Positives = 27/44 (61%), Gaps = 1/44 (2%)

Query: 68 DLKNTYTENQHLKERLE-ELAQLESEVADLKKENKDLKESLDIT 110
L+ ++ K+++E L + S++A L+K NK+L+ES +T
Sbjct: 383 SLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKKLT 426


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo1548  SHAPEPROTEIN  470  e-170  Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 470 bits (1210), Expect = e-170
Identities = 192/338 (56%), Positives = 247/338 (73%), Gaps = 6/338 (1%)

Query: 1 MFGFGNKDIGIDLGTANTLVYMKGKGIVLREPSVVAMKKD----TQEIVAVGSDAKNMIG 56
G + D+ IDLGTANTL+Y+KG+GIVL EPSVVA+++D + + AVG DAK M+G
Sbjct: 5 FRGMFSNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKSVAAVGHDAKQMLG 64

Query: 57 RTPGNIVAIRPMKDGVIADYDTTAAMMKYYIQKA-GKSVNASKPRVMICVPSGITGVEKR 115
RTPGNI AIRPMKDGVIAD+ T M++++I++ S PRV++CVP G T VE+R
Sbjct: 65 RTPGNIAAIRPMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQVERR 124

Query: 116 AVIDATRQAGAKDAFTIEEPFAAAIGAGLPVGEPTGSMVVDIGGGTTEVAVISLGGIVTS 175
A+ ++ + AGA++ F IEEP AAAIGAGLPV E TGSMVVDIGGGTTEVAVISL G+V S
Sbjct: 125 AIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNGVVYS 184

Query: 176 RSVRTAGDDLDEVIINYIRKKYNLLIGDRTAEAIKMEIGSASPKGLDLSPFSIRGRDLVT 235
SVR GD DE IINY+R+ Y LIG+ TAE IK EIGSA P G ++ +RGR+L
Sbjct: 185 SSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYP-GDEVREIEVRGRNLAE 243

Query: 236 GLPKTIEITPEEISEALADTVAAIIDAVKGTLENTPPELSADIMDKGIVLTGGGALLRNL 295
G+P+ + EI EAL + + I+ AV LE PPEL++DI ++G+VLTGGGALLRNL
Sbjct: 244 GVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGALLRNL 303

Query: 296 DTVISEETKMPVIIADEPLDCVAIGTGKALENMDMYKR 333
D ++ EET +PV++A++PL CVA G GKALE +DM+
Sbjct: 304 DRLLMEETGIPVVVAEDPLTCVARGGGKALEMIDMHGG 341


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo1550  PREPILNPTASE  94  3e-25  Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 94.1 bits (234), Expect = 3e-25
Identities = 48/206 (23%), Positives = 98/206 (47%), Gaps = 14/206 (6%)

Query: 35 SECNYCQKKLAFYHIIPIFSFLFFRGKSRCCERPIPIIYFLMELVTPIYILLLYIQFSFS 94
S C +C + IP+ S+L+ RG+ R C+ PI Y L+EL+T + + + + +
Sbjct: 72 SCCPHCNHPITALENIPLLSWLWLRGRCRGCQAPISARYPLVELLTALLSVAVAMTLAPG 131

Query: 95 YSFLLYYIIYYFLAFFFITDIFYLYVPNSILIVFFCVLATIAILYNQ-----SLMDLIYS 149
+ L ++ + L D+ + +P+ + L +L+N SL D +
Sbjct: 132 WGTLAALLLTWVLVALTFIDLDKMLLPDQLT----LPLLWGGLLFNLLGGFVSLGDAVIG 187

Query: 150 G----GISCLFYLLFFIIFRK-GIGLGDIKILIILSTFLGFKIGYYIFFLAIIMGTIILL 204
+ Y F ++ K G+G GD K+L L +LG++ + L+ ++G + +
Sbjct: 188 AMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGI 247

Query: 205 TALMLKKVKKNKQVPFVPYIFVSFLL 230
++L+ ++K +PF PY+ ++ +
Sbjct: 248 GLILLRNHHQSKPIPFGPYLAIAGWI 273


22  lmo1792  lmo1804  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
lmo1792   2       18       1.764094   hypothetical protein
lmo1793   0       15       -0.214162  hypothetical protein
lmo1794   7       19       6.218576   hypothetical protein
lmo1795   6       18       5.776838   tRNA (guanine-N(1)-)-methyltransferase
lmo1796   5       19       5.394494   16S rRNA-processing protein RimM
lmo1797   5       18       4.886562   hypothetical protein
lmo1798   4       19       4.920175   hypothetical protein
lmo1799   4       18       4.659532   hypothetical protein
lmo1800   3       17       1.046357   30S ribosomal protein S16
lmo1801   4       18       1.266743   hypothetical protein
lmo1802   5       16       1.019321   peptidoglycan binding protein
lmo1803   4       16       1.518210   protein-tyrosine phosphatase
lmo1804   3       16       1.061871   signal recognition particle protein Ffh
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo1804  GPOSANCHOR  52  1e-08  Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 52.0 bits (124), Expect = 1e-08
Identities = 33/262 (12%), Positives = 87/262 (33%), Gaps = 3/262 (1%)

Query: 676 KHELGQLAEKIAELNESTREMESAVQLAKDSMAKKREELEETRVIGENLRLQEKELLGKL 735
EL + + E +A ++ ++ L + +L + +
Sbjct: 115 IQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARK---ADLEKALEGAMNFS 171

Query: 736 DRETENLERFNKQLQLYDIEKADGSEELNKLLERKETLLQEQVEIAKQIEVTDEEIKAMT 795
++ ++ + + +A+ + L + + + + +
Sbjct: 172 TADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLE 231

Query: 796 SSSKALESKRAADLESLSSLKAQIAAKREQLQSATEAVERVTTTLHENYEQKEAAEQKLA 855
+ + + AD + +L+A+ AA + +A+E + + + E + A
Sbjct: 232 KALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKA 291

Query: 856 SLKTNLTSVHTSEETARKSIEELRKDKTETSEKLTQTRQTRAELQEKLELLEAELTQKNN 915
+L+ + + + + LR+D + E Q +L+E+ ++ EA
Sbjct: 292 ALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRR 351

Query: 916 QISFYMEQKNNAEISIGRLEVD 937
+ E K E +LE
Sbjct: 352 DLDASREAKKQLEAEHQKLEEQ 373



Score = 48.5 bits (115), Expect = 1e-07
Identities = 42/248 (16%), Positives = 98/248 (39%), Gaps = 10/248 (4%)

Query: 669 KSSILTRKHELGQLAEKIAELNESTREMESAVQLAKDSMAKKREELEETRVIGENLRLQE 728
K+++ R+ EL + E + + ++ K ++A ++ +LE+ N +
Sbjct: 185 KAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTAD 244

Query: 729 KELLGKLDRETENLERFNKQLQLYDIEKADGSEELNKLLERKETLLQEQVEIAKQIEVTD 788
+ L+ E LE +L+ + +TL E+ + + +
Sbjct: 245 SAKIKTLEAEKAALEARQAELEK---ALEGAMNFSTADSAKIKTLEAEKAALEAEKADLE 301

Query: 789 EEIKAMTSSSKALESKRAADLESLSSLKAQIAAKREQLQSATEAVERVTTTLHENYEQKE 848
+ + + ++ ++L A E+ L+A+ EQ + + + + + L + E K+
Sbjct: 302 HQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKK 361

Query: 849 AAEQKLASLKTNLTSVHTSE-------ETARKSIEELRKDKTETSEKLTQTRQTRAELQE 901
E + L+ S + +R++ +++ K E + KL + EL+E
Sbjct: 362 QLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEE 421

Query: 902 KLELLEAE 909
+L E E
Sbjct: 422 SKKLTEKE 429



Score = 37.7 bits (87), Expect = 3e-04
Identities = 26/225 (11%), Positives = 60/225 (26%)

Query: 145 KIDEILNSKPEERRSIFEEAAGVLKYKHRKKQAENKLFETEENLNRVQDILYELEGQLEP 204
++ + + + + G + + L + L Q L +
Sbjct: 145 TLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMN 204

Query: 205 LEMQASIAKDYLFQQEELEKYEVTLLASEISSLTEKLAEVRKEFGENQTVLIKLREELHA 264
S L ++ L + + + L
Sbjct: 205 FSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAE 264

Query: 265 EEAVISREKQALNETDIALDKLQERLLVETEKLEQLEGERNLQLERKKHSSENEQVYAET 324
E + + L+ + LE + + ++ + E
Sbjct: 265 LEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREA 324

Query: 325 LAVITEKITALEEQKEVLSSSKLEKETALEIAIKSKKVLEATLAK 369
+ + LEEQ ++ +S+ L+ + ++KK LEA K
Sbjct: 325 KKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQK 369



Score = 37.0 bits (85), Expect = 5e-04
Identities = 42/285 (14%), Positives = 86/285 (30%), Gaps = 17/285 (5%)

Query: 760 SEELNKLLERKETLLQEQVEIAKQIEVTDEEIKAMTSSSKALESKRAADLESLSSLKAQI 819
E +K TL + +++ + + +T + K L ++
Sbjct: 56 QERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEK----LRKNDKSLSEK 111

Query: 820 AAKREQLQSATEAVERVTTTLHENYEQKEAAEQKLASLKTNLTSVHTSEETARKSIEELR 879
A+K ++L++ +E+ A + L + K L + E A +
Sbjct: 112 ASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFS 171

Query: 880 KDKTETSEKLTQTRQTRAELQEKLELLEAELTQKNNQISFYMEQKNNAEISIGRLEVDIA 939
+ + L+ + LEA + + M I LE + A
Sbjct: 172 TADSAKIK----------TLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKA 221

Query: 940 NRIDRLQEAYLLTPEQAEEKILSEVNTEQARSKVRLLKRSIDELGIVNIGAIEEFERIQE 999
R + + ++ L+ EL GA+
Sbjct: 222 ALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSA 281

Query: 1000 RFDFLTGQQADL---LAAKETLFKVMDEMDEEMKIRFSESFEAIK 1041
+ L ++A L A E +V++ + ++ S EA K
Sbjct: 282 KIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKK 326



Score = 32.7 bits (74), Expect = 0.008
Identities = 46/236 (19%), Positives = 90/236 (38%), Gaps = 14/236 (5%)

Query: 678 ELGQLAEKIAELNESTREMESAVQLAKDSMAKKREELEETRVIGENLRLQEKELLGKLDR 737
++ L + A L E+E A++ A + +++ L ++ +L +
Sbjct: 247 KIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQV 306

Query: 738 ETENLERFNKQLQLYDIEKADGSEELNKLLERKETLLQEQVEIAKQIEVTDEEIKAMTSS 797
N + + L K E KL E+ + + + + ++ + E K + +
Sbjct: 307 LNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAE 366

Query: 798 SKALESKRAADLESLSSLKAQIAAKREQLQSATEAVERVTTTLHENYEQKEAAEQKLASL 857
+ LE + S SL+ + A RE A + VE+ E A KLA+L
Sbjct: 367 HQKLEEQNKISEASRQSLRRDLDASRE----AKKQVEK----------ALEEANSKLAAL 412

Query: 858 KTNLTSVHTSEETARKSIEELRKDKTETSEKLTQTRQTRAELQEKLELLEAELTQK 913
+ + S++ K EL+ ++ L + +AE KL +A +Q
Sbjct: 413 EKLNKELEESKKLTEKEKAELQAKLEAEAKALKEKLAKQAEELAKLRAGKASDSQT 468


23  lmo1824  lmo1844  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
lmo1824   2       16       2.590752   phosphoprotein phosphatase
lmo1825   2       13       2.342036   RNA-binding Sun protein
lmo1826   2       13       1.855955   methionyl-tRNA formyltransferase
lmo1827   1       13       2.290683   primosome assembly protein PriA
lmo1828   0       13       2.990435   pantothenate metabolism flavoprotein
lmo1829   -1      14       3.722539   DNA-directed RNA polymerase subunit omega
lmo1830   -1      14       4.053757   guanylate kinase
lmo1831   -1      15       4.066673   hypothetical protein
lmo1832   -1      15       4.158297   fibronectin-binding proteins
lmo1833   -1      14       3.993827   short-chain dehydrogenase
lmo1834   -1      15       3.773441   orotate phosphoribosyltransferase
lmo1835   -1      14       3.688380   orotidine 5'-phosphate decarboxylase
lmo1836   -1      12       3.019498   dihydroorotate dehydrogenase
lmo1837   -2      11       1.676909   dihydroorotate dehydrogenase electron transfer
lmo1838   -2      12       1.189955   carbamoyl-phosphate synthetase
lmo1839   -2      10       0.490818   carbamoyl phosphate synthase small subunit
lmo1840   -1      11       0.073105   dihydroorotase
lmo1841   0       11       0.458355   aspartate carbamoyltransferase
lmo1842   0       11       0.474832   uracil permease
lmo1843   1       13       0.922419   bifunctional pyrimidine regulatory protein PyrR
lmo1844   2       15       1.071724   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo1829  FbpA_PF05833  708  0.0  Fibronectin-binding protein
		>FbpA_PF05833#Fibronectin-binding protein

Length = 577

Score = 708 bits (1830), Expect = 0.0
Identities = 226/572 (39%), Positives = 346/572 (60%), Gaps = 11/572 (1%)

Query: 1 MAFDAMFLKAMTEELAEHGESGRIMKIHQPFSHELVLYIRKNRENKRLLISSHPSYARIQ 60
MA D +FL ++ +EL +G+I K++QP E++L IRK R + +LLISS +Y RI
Sbjct: 1 MALDGIFLYSIIDELKNTIINGKIDKVNQPEKDEIILNIRKGRLSFKLLISSSSNYPRIH 60

Query: 61 WTDDIPENPATPPMFCMLLRKYLEGAIIESITQLPNERILQFSIRGKDDIGENRFCDLFV 120
TD NP PMFCM+LRKY+ A I I Q+ +RI+ D++G N L +
Sbjct: 61 LTDLTKPNPIKAPMFCMVLRKYISNAKIVDIHQINQDRIVVIDFESTDELGFNSIYSLII 120

Query: 121 EIMGRHSNITLVDRAKNVIVDCIKHVSPAQNSYRTLLPGATYVLPPATDKLNPFEVTSEQ 180
EIMGRHSN+TL+ + N+I+D IKH++P N+YR++ PG YV PP + KLNPF+ + +
Sbjct: 121 EIMGRHSNMTLIRKRDNIIMDSIKHITPDINTYRSIYPGIEYVYPPKSPKLNPFDFSYDM 180

Query: 181 ILDRLDFSAGRIDKQ-LVQNFAGFSPLLAREIVFRAGNLTADSLVAAFFEVMGLVND--- 236
I + ++ +++ + F G S L+ EI FR N + D ++ E++ + D
Sbjct: 181 IENFTKENSLQLNDNIFSKIFTGVSKTLSSEICFRLKNNSIDLSLSNLKEIVEVCKDLFK 240

Query: 237 HLGSAAVPNEWRIQNKEDYYFFSLRHV---DAEITEFANLSTLLDHFYIGKARRDRVHQF 293
+ S +N F+ L + D + ++ + S LL++FY K + DR+
Sbjct: 241 EIQSNKFEFNCYTKNNSFVGFYCLNLMSKEDYKKIQYDSSSKLLENFYYAKDKSDRLKSK 300

Query: 294 AHDLEKLLSNELARSRLKIEKLENTLLETEKADVYRIQGELLTANLHLMERGMEEITVEN 353
+ DL+K++ N + R K + L NTL + E D++++ GELLTAN++ +++G+ I + N
Sbjct: 301 SSDLQKIVMNNINRCTKKDKILNNTLKKCEDKDIFKLYGELLTANIYALKKGLSHIELAN 360

Query: 354 FYDD-MKKMTIPLDTRKTPSANAQSYFSRYQKLRNAVEVVKEQIALTKEEITYLESVESQ 412
+Y + + I LD KTPS N QSY+ +Y KL+ + E EQ+ +EE+ YL SV +
Sbjct: 361 YYSENYDTVKITLDENKTPSQNVQSYYKKYNKLKKSEEAANEQLLQNEEELNYLYSVLTN 420

Query: 413 LETSGPQD-VEEIRQELAEQGYLRYKQKKGSRKKATLPAPEKYTSSTGLTILVGKNNKQN 471
+ + D +EEI++EL E GY+++K+ S+K T P + S G+ I VGKNN QN
Sbjct: 421 INNADNYDEIEEIKKELIETGYIKFKKIYKSKKSKTSK-PMHFISKDGIDIYVGKNNIQN 479

Query: 472 DYLTNKLARNNEYWFHVKDLPGSHVVIQSN-NPDETSITEAAMIAAYYSKARLSATVPVD 530
DYLT K A ++ WFH K++PGSHV++++ + E+++ EAA +AAYYSK++ S+ VPVD
Sbjct: 480 DYLTLKFANKHDIWFHTKNIPGSHVIVKNIMDIPESTLLEAANLAAYYSKSQNSSNVPVD 539

Query: 531 GTLVKHVKKPNGAKPGYVIYDNQTTYFVTPDE 562
T VK+VKKPNGAKPG VIY T +VTP
Sbjct: 540 YTEVKNVKKPNGAKPGMVIYSTNQTIYVTPTN 571


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo1835  TYPE4SSCAGA  31  0.029  Type IV secretion system CagA exotoxin signature.
		>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature.

Length = 1147

Score = 31.2 bits (70), Expect = 0.029
Identities = 18/72 (25%), Positives = 30/72 (41%), Gaps = 21/72 (29%)

Query: 977 ASIPVSQVKKIGENQETLIDY----------------IRNGQVTLVVNTLTTGKRPERDG 1020
S+P+S+ KIG NQ+ + DY +G + N +T
Sbjct: 1069 GSVPLSEYDKIGFNQKNMKDYSDSFKFSTKLNNAVKDTNSGFTQFLTNAFSTASY----- 1123

Query: 1021 FQIRRESVENGI 1032
+ + RE+ E+GI
Sbjct: 1124 YCLARENAEHGI 1135


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo1837  UREASE  36  3e-04  Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 35.9 bits (83), Expect = 3e-04
Identities = 21/61 (34%), Positives = 33/61 (54%), Gaps = 8/61 (13%)

Query: 4 LKNGQVLNASGKLESKDVLIQNGKVNLIADSIEVTSGEEFDATGKLITPGFIDVHVHLRE 63
LK+G++ A GK + D +Q G ++ EV +GE GK++T G +D H+H
Sbjct: 90 LKDGRIA-AIGKAGNPD--MQPGVTIIVGPGTEVIAGE-----GKIVTAGGMDSHIHFIC 141

Query: 64 P 64
P
Sbjct: 142 P 142


24  lmo1979  lmo1988  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias    Product
lmo1979   -1      13       3.234581   oxidoreductase
lmo1980   0       15       3.985361   hypothetical protein
lmo1981   0       17       3.816674   glucose-6-phosphate 1-dehydrogenase
lmo1982   -1      18       4.204912   hypothetical protein
lmo1983   0       19       4.606799   hypothetical protein
lmo1984   0       18       4.505482   hypothetical protein
lmo1985   0       19       3.952603   hypothetical protein
lmo1986   -1      16       3.537139   dihydroxy-acid dehydratase
lmo1987   -1      15       3.565357   acetolactate synthase
lmo1988   -1      14       3.304799   acetolactate synthase small subunit
25  lmo2191  lmo2196  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
lmo2191   2       22       -1.684948  oligoendopeptidase
lmo2192   2       21       -2.053859  competence protein CoiA
lmo2193   2       19       -2.100451  adaptor protein
lmo2194   2       20       -2.288937  ArsC family transcriptional regulator
lmo2195   2       19       -2.364026  peptide ABC transporter ATP-binding protein
lmo2196   2       22       -1.750925  peptide ABC transporter ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo2196  PF06872  30  0.018  EspG protein
		>PF06872#EspG protein

Length = 398

Score = 30.5 bits (68), Expect = 0.018
Identities = 17/66 (25%), Positives = 33/66 (50%), Gaps = 3/66 (4%)

Query: 466 LDLFVTDGAQNRMSYS--NKDYDKILNDASVTYAADDQKRWDEMVKAEKILLTDDVAI-Q 522
+ +FV D + + N + ++ DA++ +D +K W+EM AEK+L + +
Sbjct: 5 IKIFVIDETERAFMLNGLNNNSASLVLDATIKINSDYKKPWNEMTCAEKLLKILTLGLWN 64

Query: 523 PLYQRS 528
P Y +
Sbjct: 65 PKYSQD 70


26  lmo2213  lmo2221  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
lmo2213   -1      12       -3.159964  hypothetical protein
lmo2214   -2      12       -2.881529  ferrochelatase
lmo2215   -2      14       -2.567241  uroporphyrinogen decarboxylase
lmo2216   2       12       -3.214068  hypothetical protein
lmo2217   2       11       -2.950320  ABC transporter permease
lmo2218   3       12       -2.992341  ABC transporter ATP-binding protein
lmo2219   3       12       -2.738515  histidine triad (HIT) protein
lmo2220   2       13       -2.203520  hypothetical protein
lmo2221   2       13       -2.134365  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo2221  GPOSANCHOR  32  0.014  Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 32.0 bits (72), Expect = 0.014
Identities = 35/272 (12%), Positives = 80/272 (29%), Gaps = 19/272 (6%)

Query: 482 VFLEQKKIRQEWQQLLNEMDIIASQIAELRATENKLNETIYQHTMELKQLFSDLG----- 536
+E ++ + L + EL + E + ++ L + S +
Sbjct: 62 FEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEAR 121

Query: 537 IKHNPDANWAYELAVYKKNSEKAQLAMELISKLEPLAEKQEVYKARLANLELPGKYTDTE 596
A +++ L E K A K ++ KA + +
Sbjct: 122 KADLEKALEGAMNFSTADSAKIKTLEAE---KAALAARKADLEKALEGAMNFSTADSAKI 178

Query: 597 EKITFLRQGLLYYRNHLTENAKLAEKLEQVTMQLDLVKQDLLLIKKEKADLLASANAKNE 656
+ + + L + L + + A + + L A
Sbjct: 179 KTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAM 238

Query: 657 EEFRMAAIRVKEEQNWRERLVLIEAQLEPEKRIALNQYEN-----------QATIKEKEL 705
+ ++K + + L +A+LE A+N +A ++ ++
Sbjct: 239 NFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKA 298

Query: 706 QLEETLRQIELEQEKIHASLAAQNHAIHKLEE 737
LE + + ++ + L A A +LE
Sbjct: 299 DLEHQSQVLNANRQSLRRDLDASREAKKQLEA 330


27  lmo2256  lmo2288  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias    Product
lmo2256   2       20       -2.059436  phosphoglucomutase
lmo2257   2       21       -2.662138  hypothetical protein
lmo2258   2       16       -1.226385  hypothetical protein
lmo2259   3       16       -1.834807  hypothetical protein
lmo2260   4       16       -2.338320  hypothetical protein
lmo2261   4       14       -1.698333  hypothetical protein
lmo2262   1       14       -2.705476  PTS beta-glucoside transporter subunit IIA
lmo2263   2       15       -3.092512  hypothetical protein
lmo2264   3       16       -3.549992  hypothetical protein
lmo2265   3       16       -3.686254  hypothetical protein
lmo2266   3       17       -3.721884  hypothetical protein
lmo2267   4       17       -3.941805  hypothetical protein
lmo2268   5       17       -4.394100  hypothetical protein
lmo2269   6       22       -6.817406  hypothetical protein
lmo2270   7       24       -7.049072  ATP-dependent deoxyribonuclease subunit A
lmo2271   6       22       -6.093851  ATP-dependent deoxyribonuclease subunit B
lmo2272   3       20       -6.456765  hypothetical protein
lmo2273   2       17       -4.648467  competence protein ComK
lmo2274   2       20       -3.828156  hypothetical protein
lmo2275   1       21       -3.500545  hypothetical protein
lmo2276   0       22       -2.957988  hypothetical protein
lmo2277   -1      23       -3.003078  protein gp29
lmo2278   -1      18       -1.261387  protein gp28
lmo2279   1       19       -2.253690  hypothetical protein
lmo2280   0       19       -3.114829  hypothetical protein
lmo2281   0       19       -2.685673  L-alanoyl-D-glutamate peptidase
lmo2282   5       18       -0.316628  holin
lmo2283   5       17       -0.364893  protein gp23
lmo2284   6       17       -0.560750  protein gp22
lmo2285   6       17       -0.110169  protein gp21
lmo2286   6       16       0.735847   protein gp20
lmo2287   7       17       0.683010   protein gp19
lmo2288   2       17       -0.858469  protein gp18
28  lmo2300  lmo2347  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
lmo2300   1       22       -3.953382  scaffolding protein
lmo2301   3       25       -4.956909  protein gp4
lmo2302   3       29       -5.056403  portal protein
lmo2303   6       27       -2.427353  terminase large subunit from bacteriophage A118
lmo2304   7       26       -2.048196  terminase
lmo2305   6       27       -0.888555  hypothetical protein
lmo2306   5       23       -1.116511  hypothetical protein
lmo2307   4       26       -2.124810  hypothetical protein
lmo2308   2       23       -1.888172  hypothetical protein
lmo2309   3       27       -2.607652  hypothetical protein
lmo2310   -1      22       -2.069564  hypothetical protein
lmo2311   0       23       -2.021675  single-stranded DNA-binding protein
lmo2312   -1      23       -1.444518  hypothetical protein
lmo2313   -1      20       -0.943795  hypothetical protein
lmo2314   1       22       -1.379357  hypothetical protein
lmo2315   1       21       -1.615999  hypothetical protein
lmo2316   2       25       -1.758471  hypothetical protein
lmo2317   2       28       -2.086667  hypothetical protein
lmo2318   6       29       -2.246609  hypothetical protein
lmo2319   7       32       -3.428710  site-specific DNA-methyltransferase
lmo2320   3       28       -3.777734  hypothetical protein
lmo2321   3       28       -3.551240  hypothetical protein
lmo2322   3       26       -4.227711  hypothetical protein
lmo2323   2       26       -4.508514  hypothetical protein
lmo2324   3       27       -5.443393  hypothetical protein
lmo2325   2       26       -5.330895  gp44
lmo2325a  4       27       -6.132185  hypothetical protein
lmo2326   5       23       -6.540933  anti-repressor
lmo2327   3       25       -6.981686  hypothetical protein
lmo2328   3       20       -6.233228  hypothetical protein
lmo2329   3       20       -5.828389  hypothetical protein
lmo2329a  2       19       -5.799869  hypothetical protein
lmo2330   1       19       -3.310850  XRE family transcriptional regulator
lmo2331   1       16       -2.413878  hypothetical protein
lmo2332   1       16       -1.978752  hypothetical protein
lmo02333  2       16       -1.646444  hypothetical protein
lmo2334   2       15       -1.327847  hypothetical protein
lmo2335   2       14       -0.757315  integrase
lmo2336   -1      12       -0.576552  hypothetical protein
lmo2337   -2      12       0.033235   transcriptional regulator
lmo2338   -2      11       1.607543   PTS fructose transporter subunit IIABC
lmo2339   -1      19       2.930652   fructose-1-phosphate kinase
lmo2340   0       18       3.093278   DeoR family transcriptional regulator
lmo2341   0       17       3.224762   aminopeptidase
lmo2342   1       18       3.208644   hypothetical protein
lmo2343   1       17       2.681679   hypothetical protein
lmo2344   0       16       1.254360   sugar kinase
lmo2345   0       17       1.756407   16S pseudouridylate synthase
lmo2346   1       19       1.460426   nitrilotriacetate monooxygenase
lmo2347   2       19       1.273678   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo2301  GPOSANCHOR  28  0.033  Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 28.5 bits (63), Expect = 0.033
Identities = 33/261 (12%), Positives = 76/261 (29%), Gaps = 1/261 (0%)

Query: 3 TEEKYKIFAKTYVMNGFNGKEAAISAGYSTKTAEQQASRLLRNVKVLELIDEEMKLLSKR 62
T E + ++ +E A T + + S L N K L+ ++E+
Sbjct: 37 TNEVSAVATRSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSN 96

Query: 63 MQDDASKIYAELWKQVRMIDDKMAKHEEASRKLSITDARKITAIADINNLKAKIRRTESK 122
++ K L ++ I + A+ + + L A I L+A+ ++
Sbjct: 97 AKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAAR 156

Query: 123 IKKMDGRKADEGKFKKELLEEYDELKIQLEELEDSVSEIYEENSTSKRDLLWHKDWKEIL 182
++ F + L+ + LE +E+ + + + L
Sbjct: 157 KADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTL 216

Query: 183 SLRAQILQDLFDRSGYKETKDMQDRRVALLDAQINKLELEAKKDCKDSGFATIIMSNVDE 242
L K + + A +A + + + + ++
Sbjct: 217 EAEKAALAARKADLE-KALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNF 275

Query: 243 MQAYLDKKAGGTDERDDTQTT 263
A K E+ +
Sbjct: 276 STADSAKIKTLEAEKAALEAE 296


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo2306  BLACTAMASEA  27  0.015  Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 27.4 bits (61), Expect = 0.015
Identities = 14/43 (32%), Positives = 19/43 (44%), Gaps = 1/43 (2%)

Query: 63 VLQESFRKNGKLYRAIKYKADFLVRYSDGHEELIDIKGMLTKE 105
VL + +L R I Y+ LV YS E+ + GM E
Sbjct: 76 VLARVDAGDEQLERKIHYRQQDLVDYSPVSEKHLA-DGMTVGE 117


29  lmo2390  lmo2405  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
lmo2390   2       15       3.753504   hypothetical protein
lmo2391   4       20       1.025725   hypothetical protein
lmo2392   6       19       0.371898   NADH dehydrogenase
lmo2393   5       20       0.900976   hypothetical thioredoxine reductase
lmo2394   3       15       1.091108   hypothetical protein
lmo2395   3       14       0.868851   hypothetical protein
lmo2396   3       14       0.846281   hypothetical protein
lmo2397   2       15       1.907840   hypothetical protein
lmo2398   1       17       2.441005   hypothetical protein
lmo2399   0       17       2.707817   internalin
lmo2400   1       17       2.581026   NifU protein
lmo2401   2       21       3.012190   hypothetical protein
lmo2402   2       23       2.634344   hypothetical protein
lmo2403   1       22       2.295238   acetyltransferase
lmo2404   3       21       -0.207709  hypothetical protein
lmo2405   2       23       -3.136417  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo2391  NUCEPIMERASE  39  8e-06  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 38.6 bits (90), Expect = 8e-06
Identities = 34/163 (20%), Positives = 60/163 (36%), Gaps = 31/163 (19%)

Query: 1 MNVLVIGANGKIGRLLVEKLAMEKGFFVRA---------MVRKAEQVSELEKLGAKPIIA 51
M LV GA G IG + ++L +E G V + K ++ L + G +
Sbjct: 1 MKYLVTGAAGFIGFHVSKRL-LEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKI 59

Query: 52 DLK-KDF---HYAYDEIEAVIFTAGSGGHTPASE----TVNIDQNGAIKAIETAKEKGVR 103
DL ++ +A E V + + E + + G + +E + ++
Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQ 119

Query: 104 RFIIVSS---YGA--------DNPENGPESLAHYLKAKQAADE 135
+ SS YG D+ + P SL Y K+A +
Sbjct: 120 HLLYASSSSVYGLNRKMPFSTDDSVDHPVSL--YAATKKANEL 160


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo2400  SACTRNSFRASE  33  2e-04  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 33.0 bits (75), Expect = 2e-04
Identities = 18/50 (36%), Positives = 24/50 (48%), Gaps = 2/50 (4%)

Query: 79 KEHQGKGYGGDALEQIIEMVKNLPEQPARLRLSYEPDNIVAEKFYAKYGF 128
K+++ KG G L + IE K L L + NI A FYAK+ F
Sbjct: 99 KDYRKKGVGTALLHKAIEWAKE--NHFCGLMLETQDINISACHFYAKHHF 146


30  lmo2453  lmo2466  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
lmo2453   2       35       1.672320   carboxylesterase
lmo2454   5       42       1.777752   preprotein translocase subunit SecG
lmo2455   3       38       2.186543   carboxylesterase
lmo2456   2       29       1.740189   epoxide hydrolase
lmo2457   1       24       1.476873   hypothetical protein
lmo2458   -1      16       1.408176   phosphopyruvate hydratase
lmo2459   -2      12       0.988539   phosphoglyceromutase
lmo2460   -1      10       0.931914   triosephosphate isomerase
lmo2461   -2      10       0.075736   phosphoglycerate kinase
lmo2462   -1      8        0.078816   glyceraldehyde-3-phosphate dehydrogenase
lmo2463   -1      9        -0.008437  transcriptional regulator
lmo2464   3       13       -0.973773  RNA polymerase factor sigma-54
lmo2465   4       13       -1.462611  dipeptidase
lmo2466   3       14       -0.882426  multidrug transporter
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo2463  ACRIFLAVINRP  57  3e-10  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 56.8 bits (137), Expect = 3e-10
Identities = 58/297 (19%), Positives = 117/297 (39%), Gaps = 39/297 (13%)

Query: 227 SLLIGTVLLVLVFLLVIYRSPILALIPLIAVGFAYLVITPILGLLAKEGIITYGSQGLSI 286
+L +L+ LV + + ++ LIP IAV +V+ +LA G ++
Sbjct: 343 TLFEAIMLVFLV-MYLFLQNMRATLIPTIAVP---VVLLGTFAILAAFGY------SINT 392

Query: 287 MT----VLLFGAGTDYCLFLISRFRSHLHTEK-NRFQAFKEAFSGTAGAIALSGLTVMAA 341
+T VL G D + ++ + +K +A +++ S GA+ + + A
Sbjct: 393 LTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAV 452

Query: 342 LL--LLLAAEYGS-FHNFAVPFSLAIFIMMISSLTLVPALLGIFGRVSFWPFVPRTVEME 398
+ G+ + F++ A+ + ++ +L L PAL + P + E
Sbjct: 453 FIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLK-------PVSAEHH 505

Query: 399 ETRAKKKGKTPK--HHKENRFWHKIGEMSAKHPVRILIITLIILIGCGIFTTQVKYTYDT 456
E + G H N + + +G++ R L+I +I+ G + ++ ++
Sbjct: 506 ENKGGFFGWFNTTFDHSVNHYTNSVGKI-LGSTGRYLLIYALIVAGMVVLFLRLPSSF-- 562

Query: 457 LSTFPEDMPSREGFTLISDHFGAG-MLAPMEVVVNSKES--MKSSLENVNGVASVTG 510
PE+ +G L AG + V++ +K+ NV V +V G
Sbjct: 563 ---LPEE---DQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNEKANVESVFTVNG 613


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo2464  HTHTETR  85  7e-23  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 85.4 bits (211), Expect = 7e-23
Identities = 46/208 (22%), Positives = 84/208 (40%), Gaps = 10/208 (4%)

Query: 1 MEKKRTRAEELGITRRKILDTARDLFMEKGYRAVSTREIAKIANITQPALYHHFEDKESL 60
K + A+E TR+ ILD A LF ++G + S EIAK A +T+ A+Y HF+DK L
Sbjct: 2 ARKTKQEAQE---TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDL 58

Query: 61 YIEVVRELTQNI-QVEMHPIMQTNKAKEEQLHDMLIMLIE--EHPTNILLMIHDILNEMK 117
+ E+ NI ++E+ + L ++LI ++E L++ I ++ +
Sbjct: 59 FSEIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCE 118

Query: 118 PENQFLLYKLWQKTYLEPFQQFFERL----ENAGELRNGISAETAARYCLSTISPLFSGK 173
+ + + Q+ E+ A L + AA IS L
Sbjct: 119 FVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENW 178

Query: 174 GSFAQKQTTTEQIDELINLMMFGICKKE 201
Q ++ + + +++
Sbjct: 179 LFAPQSFDLKKEARDYVAILLEMYLLCP 206


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo2466  BCTERIALGSPF  30  0.002  Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 29.8 bits (67), Expect = 0.002
Identities = 8/33 (24%), Positives = 15/33 (45%), Gaps = 1/33 (3%)

Query: 74 LFGHWIKWLLLTIITIGIYGFWVFIKLEDWKVK 106
+ W+LL ++ G F V ++ E +V
Sbjct: 222 AVRTFGPWMLLALL-AGFMAFRVMLRQEKRRVS 253


31  lmo2504  lmo2527  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
lmo2504   2       13       -0.438096  two-component response phosphate regulator
lmo2505   0       12       -0.187785  hypothetical protein
lmo2506   0       11       -0.471557  cardiolipin synthase
lmo2507   -1      10       -0.606144  cell wall-binding protein
lmo2508   -1      10       -0.526032  peptidoglycan lytic protein P45
lmo2509   0       9        -0.576107  cell division protein FtsX
lmo2510   0       10       -0.766475  cell division protein FtsE
lmo2511   1       16       -1.265753  hypothetical protein
lmo2512   2       17       -1.278615  peptide chain release factor 2
lmo2513   2       15       -1.072744  preprotein translocase subunit SecA
lmo2514   1       15       -1.935961  hypothetical protein
lmo2515   2       15       -1.948506  competence protein ComFC
lmo2516   2       17       -1.785762  competence protein comFA
lmo2517   2       18       -2.276701  hypothetical protein
lmo2518   2       21       -2.222855  two-component response regulator DegU
lmo2519   4       25       -2.289328  hypothetical protein
lmo2520   3       25       -1.941872  hypothetical protein
lmo2521   3       22       -1.476364  LytR family transcriptional regulator
lmo2522   3       24       -2.532254  teichoic acid linkage unit synthesis protein
lmo2523   2       22       -1.513184  O-succinylbenzoate-CoA synthase
lmo2524   2       21       -0.028564  polyglycerol phosphate biosynthesis protein
lmo2525   3       22       -0.013817  cell wall-binding protein
lmo2526   3       23       1.162567   single-strand DNA-binding protein
lmo2527   2       24       0.965960   (3R)-hydroxymyristoyl-ACP dehydratase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo2504  GPOSANCHOR  46  2e-07  Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 46.2 bits (109), Expect = 2e-07
Identities = 38/250 (15%), Positives = 86/250 (34%), Gaps = 8/250 (3%)

Query: 24 HADTINDMQKRQNEIEQKKSEIDKNIDSKNSELNHLESAEKDAAKELESLMKSLDDTNKK 83
+ ++++ + E+E +K++++K ++ + + K E +L D K
Sbjct: 104 NDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKA 163

Query: 84 LKEQEDKVSSENEKLKKLQKEMEKLRNDIRDRQKVLDNRARAIQTTGTATSYLDMIFEAD 143
L+ + ++++ K+K L+ E L + +K L+ L+
Sbjct: 164 LEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLE------ 217

Query: 144 DFKELVDRVTVVSAIVKADQNIMQDQKDDQDKLKVAESTSEKKLENLKVLAVELEVSKNN 203
E + + KA + M D K+K E+ L LE + N
Sbjct: 218 --AEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNF 275

Query: 204 MESQKQEKNDLVMALANKKDLTKSEQTLLASEQGALTDEEKRLASNIAGEKAKQEAAIKA 263
+ + L A + + + L ++ +K + K
Sbjct: 276 STADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKL 335

Query: 264 AEEKRMQEAA 273
E+ ++ EA+
Sbjct: 336 EEQNKISEAS 345



Score = 31.6 bits (71), Expect = 0.006
Identities = 33/189 (17%), Positives = 75/189 (39%), Gaps = 5/189 (2%)

Query: 22 GAHADTINDMQKRQNEIEQKKSEIDKNIDSKNSELNHLESAEKDAAKELESLMKSLDDTN 81
++ RQ E+E+ + ++++ LE+ + E L N
Sbjct: 249 KTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLN 308

Query: 82 KKLKEQEDKVSSENEKLKKLQKEMEKLRNDIRDRQKVLDNRARAIQTTGTATSYLDMIFE 141
+ + + E K+L+ E +KL + + + R + + A L+
Sbjct: 309 ANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEA--- 365

Query: 142 ADDFKELVDRVTVVSAIVKADQNIMQDQKDDQDKLKVAESTSEKKLENLKVLAVELEVSK 201
+ ++L ++ + A ++ + + ++ + +++ A + KL L+ L ELE SK
Sbjct: 366 --EHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESK 423

Query: 202 NNMESQKQE 210
E +K E
Sbjct: 424 KLTEKEKAE 432


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo2505  GPOSANCHOR  39  4e-05  Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 38.5 bits (89), Expect = 4e-05
Identities = 35/217 (16%), Positives = 80/217 (36%), Gaps = 6/217 (2%)

Query: 27 ADVNTDIQNQDKKINDIKSKKTDLQSDLSGLVADLEKAQEKAKSLQGEFDKTGKELKKLN 86
A++ ++ +K L+++ + L A ++ + ++K L
Sbjct: 193 AELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLE 252

Query: 87 EDIKSINERIKERETVLKERARAMQKTSNSNAYLEVILDAENLSDLVGRVSAVNQLVD-S 145
+ ++ R E E L+ S LE + L + +Q+++ +
Sbjct: 253 AEKAALEARQAELEKALEGAMNFSTADSAKIKTLEA--EKAALEAEKADLEHQSQVLNAN 310

Query: 146 DKSILEDQQNDEKALKTKQTAVKKKQEDQATAIHEYEAQQNKIEAQKAEK---EAIVAQL 202
+S+ D +A K + +K +E + ++ + ++A + K EA +L
Sbjct: 311 RQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKL 370

Query: 203 ASDQASAENAKAGLVSERDKAAKEATARATALREATS 239
+E ++ L + D + + AL EA S
Sbjct: 371 EEQNKISEASRQSLRRDLDASREAKKQVEKALEEANS 407


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo2510  SECA  1187  0.0  SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 1187 bits (3073), Expect = 0.0
Identities = 431/899 (47%), Positives = 581/899 (64%), Gaps = 65/899 (7%)

Query: 1 MAGLLKKIFESG-KKDVKYLERKADEIIALADETAALSDDALREKTVEFKERVQKGETLD 59
+ LL K+F S + ++ + + + I A+ E LSD+ L+ KT EF+ R++KGE L+
Sbjct: 2 LIKLLTKVFGSRNDRTLRRMRKVVNIINAMEPEMEKLSDEELKGKTAEFRARLEKGEVLE 61

Query: 60 DLLVEAFAVAREGAKRALGLYPFKVQLMGGIVLHEGNIAEMKTGEGKTLTATLPVYLNAL 119
+L+ EAFAV RE +KR G+ F VQL+GG+VL+E IAEM+TGEGKTLTATLP YLNAL
Sbjct: 62 NLIPEAFAVVREASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNAL 121

Query: 120 SGEGVHVVTVNEYLAHRDAEEMGVLYNFLGLSVGLNLNALSSTEKREAYACDITYSTNNE 179
+G+GVHVVTVN+YLA RDAE L+ FLGL+VG+NL + + KREAYA DITY TNNE
Sbjct: 122 TGKGVHVVTVNDYLAQRDAENNRPLFEFLGLTVGINLPGMPAPAKREAYAADITYGTNNE 181

Query: 180 LGFDYLRDNMVVYKEEMVQRPLAFAVIDEVDSILVDEARTPLIISGEAEKSTILYVRANT 239
GFDYLRDNM EE VQR L +A++DEVDSIL+DEARTPLIISG AE S+ +Y R N
Sbjct: 182 YGFDYLRDNMAFSPEERVQRKLHYALVDEVDSILIDEARTPLIISGPAEDSSEMYKRVNK 241

Query: 240 FVRTLTEEE-----------DYTVDIKTKSVQLTEDGMTKGENYF-------DVENLFDL 281
+ L +E ++VD K++ V LTE G+ E + E+L+
Sbjct: 242 IIPHLIRQEKEDSETFQGEGHFSVDEKSRQVNLTERGLVLIEELLVKEGIMDEGESLYSP 301

Query: 282 ENTVILHHIAQALKANYTMSLDVDYVVQDDEVLIVDQFTGRIMKGRRFSEGLHQALEAKE 341
N +++HH+ AL+A+ + DVDY+V+D EV+IVD+ TGR M+GRR+S+GLHQA+EAKE
Sbjct: 302 ANIMLMHHVTAALRAHALFTRDVDYIVKDGEVIIVDEHTGRTMQGRRWSDGLHQAVEAKE 361

Query: 342 GVTIQNESKTMATITFQNYFRMYKKLAGMTGTAKTEEEEFRDIYNMRVIEIPTNKVIIRD 401
GV IQNE++T+A+ITFQNYFR+Y+KLAGMTGTA TE EF IY + + +PTN+ +IR
Sbjct: 362 GVQIQNENQTLASITFQNYFRLYEKLAGMTGTADTEAFEFSSIYKLDTVVVPTNRPMIRK 421

Query: 402 DRPDLIYTTIEAKFNAVVEDIAERHAKGQPVLVGTVAIETSELISSKLKRKGIKHDVLNA 461
D PDL+Y T K A++EDI ER AKGQPVLVGT++IE SEL+S++L + GIKH+VLNA
Sbjct: 422 DLPDLVYMTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSELVSNELTKAGIKHNVLNA 481

Query: 462 KQHEREADIIKHAGERGAVVIATNMAGRGTDIKLG------------------------- 496
K H EA I+ AG AV IATNMAGRGTDI LG
Sbjct: 482 KFHANEAAIVAQAGYPAAVTIATNMAGRGTDIVLGGSWQAEVAALENPTAEQIEKIKADW 541

Query: 497 ----EGTIEAGGLAVIGTERHESRRIDNQLRGRSGRQGDPGVTQFYLSMEDELMRRFGSD 552
+ +EAGGL +IGTERHESRRIDNQLRGRSGRQGD G ++FYLSMED LMR F SD
Sbjct: 542 QVRHDAVLEAGGLHIIGTERHESRRIDNQLRGRSGRQGDAGSSRFYLSMEDALMRIFASD 601

Query: 553 NMKSMMERFGMAED-AIQSKMVSRAVESAQRRVEGNNFDSRKQVLQYDDVLRQQREVIYK 611
+ MM + GM AI+ V++A+ +AQR+VE NFD RKQ+L+YDDV QR IY
Sbjct: 602 RVSGMMRKLGMKPGEAIEHPWVTKAIANAQRKVESRNFDIRKQLLEYDDVANDQRRAIYS 661

Query: 612 QRYEVINAENSLREIIEQMIQRTVNFIVSSNASSHEPEEAWNLQGIIDYVDANLLPEGTI 671
QR E+++ + + E I + + + + EE W++ G+ + + + + I
Sbjct: 662 QRNELLDVSD-VSETINSIREDVFKATIDAYIPPQSLEEMWDIPGLQERLKNDFDLDLPI 720

Query: 672 T--LEDLQNRTSEDIQNLILDKIKAAYDEKETLLPPEEFNEFEKVVLLRVVDTKWVDHID 729
L+ E ++ IL + Y KE ++ E FEK V+L+ +D+ W +H+
Sbjct: 721 AEWLDKEPELHEETLRERILAQSIEVYQRKEEVVGAEMMRHFEKGVMLQTLDSLWKEHLA 780

Query: 730 AMDHLRDGIHLRAYGQIDPLREYQSEGFEMFEAMVSSIDEDVARYIMKAEIR-------- 781
AMD+LR GIHLR Y Q DP +EY+ E F MF AM+ S+ +V + K ++R
Sbjct: 781 AMDYLRQGIHLRGYAQKDPKQEYKRESFSMFAAMLESLKYEVISTLSKVQVRMPEEVEEL 840

Query: 782 ---QNLEREQVAKGEAINPAEGKPEAKRQPIRK--DQHIGRNDPCPCGSGKKYKNCHGK 835
+ +E E++A+ + ++ + A + ++ +GRNDPCPCGSGKKYK CHG+
Sbjct: 841 EQQRRMEAERLAQMQQLSHQDDDSAAAAALAAQTGERKVGRNDPCPCGSGKKYKQCHGR 899


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo2515  HTHFIS  76  4e-18  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 75.6 bits (186), Expect = 4e-18
Identities = 27/114 (23%), Positives = 48/114 (42%), Gaps = 2/114 (1%)

Query: 4 KIMIVDDHQLFREGIKRILELEDSFEVVAEAENGKNIVAKVREYKPDIVLMDINMPTVNG 63
I++ DD R + + L ++V N + + D+V+ D+ MP N
Sbjct: 5 TILVADDDAAIRTVLNQALSRAG-YDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 64 LDATEMLVRQFPSIKVIVLTIHDTDEYVTEALRAGAVGYLLKEMDAHELVEAVK 117
D + + P + V+V++ +T +A GA YL K D EL+ +
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo2525  SHAPEPROTEIN  435  e-156  Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 435 bits (1120), Expect = e-156
Identities = 162/335 (48%), Positives = 238/335 (71%), Gaps = 5/335 (1%)

Query: 2 AKDVGIDLGTANVLIHVKGRGIVVNEPAVVAINNKTG----QVLAVGTEARDMVGRTPGD 57
+ D+ IDLGTAN LI+VKG+GIV+NEP+VVAI V AVG +A+ M+GRTPG+
Sbjct: 10 SNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKSVAAVGHDAKQMLGRTPGN 69

Query: 58 ITAIKPMKDGVIADFDIVQEMLRFFIQKLNLKTFFS-RPRILICCPTNITSVEQKAIREV 116
I AI+PMKDGVIADF + ++ML+ FI++++ +F PR+L+C P T VE++AIRE
Sbjct: 70 IAAIRPMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRES 129

Query: 117 AEKSGGKQVFLEEEPKVAAIGAGMEIFEPSGNMIIDIGGGTADVAVLSMGDIVTSQSVKV 176
A+ +G ++VFL EEP AAIGAG+ + E +G+M++DIGGGT +VAV+S+ +V S SV++
Sbjct: 130 AQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNGVVYSSSVRI 189

Query: 177 AGNKWDADILNYVKRKYNLLIGERTAENIKVTIGTACQGAKEEKMEIRGRDLVSGLPKTI 236
G+++D I+NYV+R Y LIGE TAE IK IG+A G + ++E+RGR+L G+P+
Sbjct: 190 GGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRGF 249

Query: 237 SITSSEVEEAIHDSLHLMVLAAKQVLEQTPPELSADIIDRGIIMTGGGSLLHGLDELMSE 296
++ S+E+ EA+ + L +V A LEQ PPEL++DI +RG+++TGGG+LL LD L+ E
Sbjct: 250 TLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGALLRNLDRLLME 309

Query: 297 QLKVPVLITENPLDVVALGTGILLDSLTNKKRNRF 331
+ +PV++ E+PL VA G G L+ + + F
Sbjct: 310 ETGIPVVVAEDPLTCVARGGGKALEMIDMHGGDLF 344


32  lmo2581  lmo2594  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
lmo2581   -1      18       3.668890   hypothetical protein
lmo2582   0       19       4.082490   hypothetical protein
lmo2583   1       21       4.146317   ABC transporter ATP-binding protein
lmo2584   1       26       3.899464   hypothetical protein
lmo2585   0       23       3.252288   histidine kinase
lmo2586   -1      17       2.558690   DNA-binding response regulator
lmo2587   0       13       1.028253   formate dehydrogenase accessory protein
lmo2588   0       15       0.669511   hypothetical protein
lmo2589   0       13       -0.024012  formate dehydrogenase subunit alpha
lmo2590   -1      15       -0.914856  hypothetical protein
lmo2591   0       13       -0.538780  multidrug transporter
lmo2592   1       18       -0.007828  TetR family transcriptional regulator
lmo2593   2       17       -0.089824  ****ATP-binding protein
lmo2594   2       16       -0.167534  N-acetylmuramoyl-L-alanine amidase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo2582  PF06580  35  6e-04  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 34.8 bits (80), Expect = 6e-04
Identities = 22/101 (21%), Positives = 42/101 (41%), Gaps = 20/101 (19%)

Query: 359 NLLTNAIKFTPQGGNIQVRLYEDTTNVFVEVQDSGVGISKVDMTKIFDRFYKANESRTRE 418
N + + I PQGG I ++ +D V +EV+++G K
Sbjct: 266 NGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALK------------------NT 307

Query: 419 EGSSGLGLS-ICQKIITLHHGEVTVQ-SSLEKGTTFTVKLP 457
+ S+G GL + +++ L+ E ++ S + V +P
Sbjct: 308 KESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo2583  HTHFIS  99  1e-26  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 99 bits (249), Expect = 1e-26
Identities = 39/123 (31%), Positives = 65/123 (52%), Gaps = 1/123 (0%)

Query: 4 ILVVDDDRHILKLVGHYLRAEGFHVLEASDGVEAEKIVETEQVHLAVIDVMMPNMDGFEL 63
ILV DDD I ++ L G+ V S+ + + L V DV+MP+ + F+L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 64 CQKMRASYPDIPVIMLTAKDALADKSRGFEVGTDDYVTKPFEPEELIFRI-RALLRRSNQ 122
+++ + PD+PV++++A++ + E G DY+ KPF+ ELI I RAL +
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125

Query: 123 ASE 125
S+
Sbjct: 126 PSK 128


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo2588  TCRTETB  139  3e-38  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 139 bits (351), Expect = 3e-38
Identities = 107/418 (25%), Positives = 198/418 (47%), Gaps = 19/418 (4%)

Query: 16 SYSRSLL-----VVTMIIGAFVAILNQTLLATALPMIMDDLHITAATGQWLTTAFLLTNG 70
SYS+S L ++ + I +F ++LN+ +L +LP I +D + A+ W+ TAF+LT
Sbjct: 4 SYSQSNLRHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFS 63

Query: 71 IMIPITALLIEKISSKTLFITAMTVFTIGTIIASVAGS-FPILLTGRIVQAAGAGIMMPL 129
I + L +++ K L + + + G++I V S F +L+ R +Q AGA L
Sbjct: 64 IGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPAL 123

Query: 130 LQTIFLLIFPREKRGAAMGLMGLVIAFAPAIGPTLSGWIVDSYDWRVLFLILIPIAVIDI 189
+ + P+E RG A GL+G ++A +GP + G I W L LI + I +I +
Sbjct: 124 VMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM-ITIITV 182

Query: 190 ILAFFGMKKVVKLTDTKIDFLSIVMSSIGFGALLYGFSSAGNDGWGDTTVITMLIVGVVV 249
+KK V++ D I++ S+G + +S I+ LIV V+
Sbjct: 183 PFLMKLLKKEVRIKGH-FDIKGIILMSVGIVFFMLFTTSY---------SISFLIVSVLS 232

Query: 250 IALFVWRQLVIDNPMLELHVFKYPVFSLSVILGSIVTMAMIGAEIVLPLYIQTIRGESAL 309
+FV + +P ++ + K F + V+ G I+ + G ++P ++ + S
Sbjct: 233 FLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTA 292

Query: 310 QSG-LLLLPGAIIMGIMSPITGIIFDKIGAKWLTITGVTILTIGTIPFMFLTMDTPLWYI 368
+ G +++ PG + + I I GI+ D+ G ++ GVT L++ + FL T +
Sbjct: 293 EIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMT 352

Query: 369 VVFYAVRFFGISMAMMPVSTAGMNALPNHLINHGSAVNNTIRQIAGSIGTAVLITVLT 426
++ V G+S +ST ++L G ++ N ++ G A++ +L+
Sbjct: 353 IIIVFV-LGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo2589  HTHTETR  68  8e-16  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 68.1 bits (166), Expect = 8e-16
Identities = 20/74 (27%), Positives = 38/74 (51%)

Query: 2 KEKKQRIIKSAKEVFQKQGYLKTSVQDMVDAAGISKGTFYNYFTSKEELAIVIFKQEYSV 61
+E +Q I+ A +F +QG TS+ ++ AAG+++G Y +F K +L I++ S
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 62 LHQRLEYTMAQDGA 75
+ + A+
Sbjct: 70 IGELELEYQAKFPG 83


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo2591  FLGFLGJ  85  3e-20  Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 84.8 bits (209), Expect = 3e-20
Identities = 56/174 (32%), Positives = 83/174 (47%), Gaps = 15/174 (8%)

Query: 32 RTAQVNLTTSQQAFIDEILPAAQDGYRDGKLLTSVTLAQAILESNWGESGL----SQNSK 87
R +L +AF+ ++ AQ + + + LAQA LES WG+ + + S
Sbjct: 139 RNYDDSLPGDSKAFLAQLSLPAQLASQQSGVPHHLILAQAALESGWGQRQIRRENGEPSY 198

Query: 88 NLFGIK--GTYKGKSVSMGTMEASGSTT----ANFRVYPSWKESIEDHTALITENARYQD 141
NLFG+K G +KG + T E A FRVY S+ E++ D+ L+T N RY
Sbjct: 199 NLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYLEALSDYVGLLTRNPRYAA 258

Query: 142 AVDETDYRKALQAIKDGGYATDPDYVSKLVAIIERYNLDKYDVIYDKIESNQSL 195
+ QA++D GYATDP Y KL +I+ + I DK+ S+
Sbjct: 259 VTTAASAEQGAQALQDAGYATDPHYARKLTNMIQ-----QMKSISDKVSKTYSM 307


33  lmo2673  lmo2694  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
lmo2673   -1      18       3.570350   hypothetical protein
lmo2674   -1      16       4.344690   hypothetical protein
lmo2675   -1      16       4.375219   AraC family transcriptional regulator
lmo2676   1       19       4.665936   hypothetical protein
lmo2677   0       22       5.184314   ribose-5-phosphate isomerase B
lmo2678   1       23       5.123805   hypothetical protein
lmo2679   0       19       4.663960   DNA polymerase IV
lmo2680   1       20       4.169794   esterase
lmo2681   0       20       3.225538   XRE family transcriptional regulator
lmo2682   1       19       1.757312   histidine kinase
lmo2683   2       16       0.714663   potassium-transporting ATPase subunit C
lmo2684   -2      14       1.450798   potassium-transporting ATPase subunit B
lmo2685   -1      20       1.112373   potassium-transporting ATPase subunit A
lmo2686   -1      21       0.797675   PTS cellbiose transporter subunit IIB
lmo2687   -1      15       1.552114   PTS cellbiose transporter subunit IIC
lmo2688   -1      14       1.958267   PTS cellbiose transporter subunit IIA
lmo2689   -2      15       1.902618   hypothetical protein
lmo2689a  -1      19       2.641585   cell division protein FtsW
lmo2690   -1      17       3.419276   cell division protein FtsW
lmo2691   -1      18       4.252050   magnesium-translocating P-type ATPase
lmo2692   -1      16       4.945097   hypothetical protein
lmo2693   0       17       3.711847   TetR family transcriptional regulator
lmo2694   0       17       3.179919   autolysin
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo2678  HTHFIS  90  8e-23  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.5 bits (222), Expect = 8e-23
Identities = 34/131 (25%), Positives = 64/131 (48%), Gaps = 1/131 (0%)

Query: 3 SKRLVLIVEDEDGISNFISAVLTASDYSVIKAVNGKEALEQTASHSPDVVLLDLGLPDME 62
+ +L+ +D+ I ++ L+ + Y V N A+ D+V+ D+ +PD
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 63 GLDVLRDIR-VWSKVPIIVVSARDHEREKVTALDLGADDYITKPFGTSELLARIRTALRH 121
D+L I+ +P++V+SA++ + A + GA DY+ KPF +EL+ I AL
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 122 IQPSSKESPND 132
+ + +D
Sbjct: 122 PKRRPSKLEDD 132


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
lmo2679  PF06580  40  4e-05  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 39.8 bits (93), Expect = 4e-05
Identities = 28/131 (21%), Positives = 53/131 (40%), Gaps = 11/131 (8%)

Query: 708 NLIRG-IKDDSGWLIRMVENLLSVTRISEGLVSLERAPEAVE-EIVGEAVGRIKKRFRDR 765
N IR I +D M+ +L + R S + + A E +V + +F DR
Sbjct: 180 NNIRALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDR 239

Query: 766 -TIHVKVPRDLLMVPMDGTLIEQVLINLMENALRHG----GTGAEVWVDVTKTKQSAIFS 820
++ ++ V + ++ Q L+ EN ++HG G ++ + TK +
Sbjct: 240 LQFENQINPAIMDVQVP-PMLVQTLV---ENGIKHGIAQLPQGGKILLKGTKDNGTVTLE 295

Query: 821 VRDNGKGIPEN 831
V + G +N
Sbjct: 296 VENTGSLALKN 306


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo2681   TYPE4SSCAGA   32     0.008    Type IV secretion system CagA exotoxin signature.
		>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature.

Length = 1147

Score = 32.4 bits (73), Expect = 0.008
Identities = 17/81 (20%), Positives = 36/81 (44%), Gaps = 13/81 (16%)

Query: 73 NFAEAIAEGRGRAQADSLKMARKDV-------------LARKLKNVDDKTDVIEVASNDL 119
NF +A+A+ + D +K A+KD+ + +KL++ + +E +
Sbjct: 590 NFNKAVADAKNTGNYDEVKKAQKDLEKSLRKREHLEKEVEKKLESKSGNKNKMEAKAQAN 649

Query: 120 KKGDIVYVLANEQIPMDGEVI 140
+ D ++ L N++ D I
Sbjct: 650 SQKDEIFALINKEANRDARAI 670


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo2690   HTHTETR       58     1e-12    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 57.7 bits (139), Expect = 1e-12
Identities = 32/194 (16%), Positives = 78/194 (40%), Gaps = 12/194 (6%)

Query: 3 TNESIMDATLCMMAKHGIKGSTTRQLAEAAGINEATIFKKFKSKDNLIHMTLEVQFESMK 62
T + I+D L + ++ G+ ++ ++A+AAG+ I+ FK K +L E+ ++
Sbjct: 12 TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIG 71

Query: 63 AEINQFFDKDFESAKVFLRQAS--QFISDIYEKYRDFMVISV--REMGSKDMEFID---P 115
++ K LR+ S + E+ R ++ + + +M +
Sbjct: 72 ELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQR 131

Query: 116 SIVEYLYERVNEKVKEMVPSKNSAQ--EADAISLILNSVILLIMVEKVRDDIYKRPPTIT 173
++ Y+R+ + +K + +K ++I+ I +M + + +
Sbjct: 132 NLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAP---QSFDLK 188

Query: 174 TTADSLADVLLKLL 187
A +LL++
Sbjct: 189 KEARDYVAILLEMY 202


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo2691   FLGFLGJ       59     2e-11    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 58.6 bits (141), Expect = 2e-11
Identities = 56/235 (23%), Positives = 95/235 (40%), Gaps = 30/235 (12%)

Query: 82 TAKQTVGPQQTETKEQTKTPEEKQAATNQVEKAPAEPATVSNPDNATSSSTPATYNLLQK 141
T +Q + + T E NQ + A N D++ +
Sbjct: 99 TPEQPLPEESTPAAPMKFPLETVVRYQNQALSQLVQKAVPRNYDDSLPGDS--------- 149

Query: 142 SALRSGATVQSFIQTIQASSSQIAAENDLYASVMIAQAILESAYGTSELGSA---PNYNL 198
++F+ + + + ++ + +++AQA LES +G ++ P+YNL
Sbjct: 150 ---------KAFLAQLSLPAQLASQQSGVPHHLILAQAALESGWGQRQIRRENGEPSYNL 200

Query: 199 FGIK--GAYNGQSYTKQTLEDDGKGNYYTITAKFRKYPSYHQSLEDYAQVIRKGPSWNPN 256
FG+K G + G T E + G + AKFR Y SY ++L DY ++ + NP
Sbjct: 201 FGVKASGNWKGPVTEITTTEYE-NGEAKKVKAKFRVYSSYLEALSDYVGLLTR----NPR 255

Query: 257 YYSKAWKSNTTSYKDATKALTGTYATDTAYATKLNDLISRYNLTQYDSGKTTGGN 311
Y A + ++ + A YATD YA KL ++I + KT N
Sbjct: 256 Y--AAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDKVSKTYSMN 308
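
The percentages in the "Identities", "Positives" and "Gaps" fields of these alignment headers appear to be the count/length ratio truncated to a whole percent (for example, 56/235 is reported as 23%, not 24%). The following is a minimal Python sketch for checking that reading against a statistics line. It assumes the truncation rule inferred from the values shown here; `parse_blast_stats` and `truncated_percent` are hypothetical helper names, not part of PredictBias or BLAST.

```python
import re

def parse_blast_stats(line):
    """Parse 'Name = count/length (pct%)' fields into {name: (count, length, pct)}."""
    stats = {}
    for name, count, length, pct in re.findall(r"(\w+) = (\d+)/(\d+) \((\d+)%\)", line):
        stats[name] = (int(count), int(length), int(pct))
    return stats

def truncated_percent(count, length):
    """Whole-number percentage, truncated (assumed rule, not rounded)."""
    return (100 * count) // length

# Statistics line copied from the lmo2691 / FLGFLGJ alignment above.
line = "Identities = 56/235 (23%), Positives = 95/235 (40%), Gaps = 30/235 (12%)"
stats = parse_blast_stats(line)

for name, (count, length, reported) in stats.items():
    # Print the recomputed value next to the reported one for comparison.
    print(name, truncated_percent(count, length), reported)
```

The same check can be run on any other "Identities = ..." line in this report by replacing the `line` string.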


34  lmo2802  lmo2813  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
lmo2802   2       24       -0.333485  PTS mannitol transporter subunit IIBC
lmo2802a  7       24       -1.958486  dehydrogenase
lmo2803   9       24       -2.823214  N-acetylmannosamine-6-phosphate 2-epimerase
lmo2804   8       24       -3.364081  16S rRNA methyltransferase GidB
lmo2805   0       19       -2.071072  hypothetical protein
lmo2806   -3      13       0.222923   hypothetical protein
lmo2807   -3      13       1.203588   hypothetical protein
lmo2808   -3      12       1.195744   hypothetical protein
lmo2809   -2      12       1.913792   hypothetical protein
lmo2810   -1      13       3.258503   hypothetical protein
lmo2811   0       21       2.533970   hypothetical protein
lmo2812   2       20       1.953480   hypothetical protein
lmo2813   2       17       2.039105   tRNA uridine 5-carboxymethylaminomethyl
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo2803   ADHESNFAMILY  27     0.023    Adhesin family signature.
		>ADHESNFAMILY#Adhesin family signature.

Length = 309

Score = 26.7 bits (59), Expect = 0.023
Identities = 19/100 (19%), Positives = 39/100 (39%), Gaps = 18/100 (18%)

Query: 31 EASQEKWRFHQELENLSEQIRYIYQKRDYDASEDLPKAYHLISSIQEEGE----WM-VKN 85
E W L E + K + S+ + Y L ++ E W+ ++N
Sbjct: 93 ETGGNAW-----FTKLVENAKKTENKDYFAVSDGVDVIY-LEGQNEKGKEDPHAWLNLEN 146

Query: 86 AVTHLENESE-------EHQTLYKKQVTAYEEELHQLKKE 118
+ +N ++ ++ Y+K + Y ++L +L KE
Sbjct: 147 GIIFAKNIAKQLSAKDPNNKEFYEKNLKEYTDKLDKLDKE 186


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo2812   BLACTAMASEA   42     9e-07    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 42.1 bits (99), Expect = 9e-07
Identities = 35/158 (22%), Positives = 66/158 (41%), Gaps = 17/158 (10%)

Query: 1 MKIHKLTWVLLIGLLLLSACSTEQPNLYLSAN--------AAAVYSVENGEALYEQNADK 52
M+ +L + L+ L L+ ++ QP + + + +G L AD+
Sbjct: 1 MRYIRLCIISLLATLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADE 60

Query: 53 VMPIASLSKLMTAFLVLEAVDNNELSWDEKLDLVRLDDPSAVSLYAITQKR---TWSVRD 109
P+ S K++ VL VD + + K+ + D V +++K +V +
Sbjct: 61 RFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQD---LVDYSPVSEKHLADGMTVGE 117

Query: 110 LYSAMLTMSANDAAETLGDRLDGADFPKEMNNQAKKLG 147
L +A +TMS N AA L + G P + +++G
Sbjct: 118 LCAAAITMSDNSAANLLLATVGG---PAGLTAFLRQIG 152


35  lmo0676  lmo0715  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
lmo0676   1       31       2.229582   hypothetical protein
lmo0677   1       30       2.653470   hypothetical protein
lmo0678   1       27       2.383520   flagellar biosynthesis protein FliP
lmo0679   1       25       2.557797   flagellar biosynthesis protein FliQ
lmo0680   1       22       2.771434   flagellar biosynthesis protein FliR
lmo0681   1       20       2.839688   flagellar biosynthesis protein FlhB
lmo0682   1       20       2.164971   flagellar biosynthesis protein FlhA
lmo0683   3       16       1.294716   flagellar biosynthesis regulator FlhF
lmo0684   3       16       1.150930   flagellar basal body rod protein FlgG
lmo0685   2       15       1.082342   chemotaxis protein CheR
lmo0686   2       17       0.625415   hypothetical protein
lmo0687   2       15       0.719824   flagellar motor protein MotA
lmo0688   2       15       1.021329   flagellar motor rotation MotB
lmo0689   2       15       1.214053   hypothetical protein
lmo0690   3       18       2.253140   hypothetical protein
lmo0691   4       22       2.627035   chemotaxis protein CheV
lmo0692   2       20       2.859843   flagellin
lmo0693   0       20       2.969593   chemotaxis response regulator CheY
lmo0694   0       19       2.694730   two-component sensor histidine kinase CheA
lmo0695   0       19       2.864884   flagellar motor switch protein FliY
lmo0696   0       16       2.086798   hypothetical protein
lmo0697   0       14       1.319939   hypothetical protein
lmo0698   2       15       0.279805   flagellar basal body rod modification protein
lmo0699   2       15       -0.008847  flagellar hook protein FlgE
lmo0700   2       16       0.558936   flagellar motor switch protein
lmo0701   1       15       -0.037221  flagellar motor switch protein FliM
lmo0702   2       17       -0.058519  flagellar motor switch protein FliY
lmo0703   2       18       0.149872   hypothetical protein
lmo0704   1       19       0.455287   hypothetical protein
lmo0705   2       21       0.991246   hypothetical protein
lmo0706   2       23       0.214231   hypothetical protein
lmo0707   3       22       0.500680   flagellar hook-associated protein FlgK
lmo0708   2       28       1.743568   flagellar hook-associated protein FlgL
lmo0709   2       30       2.579293   flagellar capping protein FliD
lmo0710   2       32       2.610471   flagellar protein
lmo0711   3       30       3.041956   hypothetical protein
lmo0712   2       30       3.076636   flagellar basal-body rod protein FlgB
lmo0713   3       31       2.722908   flagellar basal body rod protein FlgC
lmo0714   4       29       2.338795   flagellar hook-basal body protein FliE
lmo0715   2       28       2.118471   flagellar MS-ring protein FliF
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0676   FLGBIOSNFLIP  167    1e-53    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 167 bits (425), Expect = 1e-53
Identities = 79/239 (33%), Positives = 132/239 (55%)

Query: 14 FIVIFAISLVVFWPGVNVHAESWMDSLGVNGTDGVNSSVALFVLVTVLSLSASIVLMFTH 73
+ + + L + P G + V V +T L+ +I+LM T
Sbjct: 4 LLSVAPVLLWLITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLMMTS 63

Query: 74 FTYCIIVLGLTRQGLGATNLPPNQVLVGLALFLSLFMMQPLITAWYDDVYKPSQKEEWSA 133
FT IIV GL R LG + PPNQVL+GLALFL+ F+M P+I Y D Y+P +E+ S
Sbjct: 64 FTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEKISM 123

Query: 134 SKVWDETQPLLTKYVAENTYKHDINMMLKAEGEDPVTKKEDAPLMALMPAFILTQITQGF 193
+ ++ L +++ T + D+ + + P+ E P+ L+PA++ +++ F
Sbjct: 124 QEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELKTAF 183

Query: 194 LTGMFIYLAFIFIDLIVSTLLMYLGMMMVPPMTISLPFKILVFIFIGGYGLITNMIFQT 252
G I++ F+ IDL+++++LM LGMMMVPP TI+LPFK+++F+ + G+ L+ + Q+
Sbjct: 184 QIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLAQS 242


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0677   TYPE3IMQPROT  43     5e-09    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 42.8 bits (101), Expect = 5e-09
Identities = 15/76 (19%), Positives = 34/76 (44%)

Query: 6 ITQIFQDFFYSGLALILPVSLICIVVVIVVAILMAMMQIQDQSLTFLPKIVAFVVALFIL 65
+ Y L L +++ ++ ++V + + Q+Q+Q+L F K++ + LF+L
Sbjct: 4 LVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLL 63

Query: 66 GPWMFEHMTDLFVGIF 81
W E + +
Sbjct: 64 SGWYGEVLLSYGRQVI 79


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0678   TYPE3IMRPROT  94     3e-25    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 94.0 bits (234), Expect = 3e-25
Identities = 51/230 (22%), Positives = 107/230 (46%), Gaps = 1/230 (0%)

Query: 12 VFSRVASFLFFFPLLKGRNIPNSVKVVFGMAISIPVATWVDVSGITTLPD-LLLRVTSEV 70
RV + + P+L R++P VK+ M I+ +A + + + L ++
Sbjct: 19 PLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPANDVPVFSFFALWLAVQQI 78

Query: 71 VFGLALAKLVEIIAVIPKMAGFMIDYDLGFSQVNLIDPSYGTQNSITAAILDTFFVVIFL 130
+ G+AL ++ + AG +I +G S +DP+ + A I+D +++FL
Sbjct: 79 LIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLARIMDMLALLLFL 138

Query: 131 SLQGMDYLIYYLMKSFEFTASVSILFEKGFIDLLLGTLGFALASAVSIALPIMGSIFIVN 190
+ G +LI L+ +F L + + +ALP++ + +N
Sbjct: 139 TFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIFLNGLMLALPLITLLLTLN 198

Query: 191 IILAFISKSAPQINIFMNAFIIKITFGIFILACAVPILSTVFKNLTDEMI 240
+ L +++ APQ++IF+ F + +T GI ++A +P+++ ++L E+
Sbjct: 199 LALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFCEHLFSEIF 248


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0679   TYPE3IMSPROT  275    1e-92    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 275 bits (704), Expect = 1e-92
Identities = 96/340 (28%), Positives = 183/340 (53%), Gaps = 2/340 (0%)

Query: 4 DNKTEKATPRRIKKARNEGNVAKSKELNNAFSLLIVAGLLYFFGEMFIKNTIQAFVALLK 63
KTE+ TP++I+ AR +G VAKSKE+ + ++ ++ +L + + ++ + + +
Sbjct: 3 GEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAE 62

Query: 64 QP--PKLANMESYSLFYLMEFGKVLMPIMVMVVIFGLMNYGVQVGILFSAKAVKPQFKRL 121
Q P + L+EF + P++ + + + ++ VQ G L S +A+KP K++
Sbjct: 63 QSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKKI 122

Query: 122 NPANYFKRVFSVKGIVEVVKALLLITLLSYVAYIGFRDHLDTLISYTGQNWLYSLGQIFA 181
NP KR+FS+K +VE +K++L + LLS + +I + +L TL+ +
Sbjct: 123 NPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLGQ 182

Query: 182 LFKNEFLALFLVIAVIGLLDFFYQRYDYKKGLRMSKQEIKDEMKDSEGRPEVKQRQRSIA 241
+ + + + VI + D+ ++ Y Y K L+MSK EIK E K+ EG PE+K ++R
Sbjct: 183 ILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQFH 242

Query: 242 RGLLQGSITKKMADATFVVNNPTHISVVMRYDKTKDHAPKLLVKGEDELALFIRQVADTD 301
+ + ++ + + ++ VV NPTHI++ + Y + + P + K D +R++A+ +
Sbjct: 243 QEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEEE 302

Query: 302 GVPMITNRQLARSIYYTTNPDEYIQEDLYKDVIEVMKELM 341
GVP++ LAR++Y+ D YI + + EV++ L
Sbjct: 303 GVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLE 342


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0681   GPOSANCHOR    31     0.007    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 31.2 bits (70), Expect = 0.007
Identities = 24/182 (13%), Positives = 60/182 (32%), Gaps = 9/182 (4%)

Query: 7 EKMEIFKGNSKREIHKKIQLVTNEPYKITDERVTKLGIFKKQYEVTAVIMSEVAIADGRM 66
+E + ++ + + T + KI K + ++ ++ + + +
Sbjct: 186 AALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADS 245

Query: 67 DFQETFQKSVVKTRPKTDDLLKKEKLLEMLAAGAELAQST------PLLEERKTQEEELS 120
+T + + +L K + + T L E+ E +
Sbjct: 246 AKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQ 305

Query: 121 SMRLELAALNRELAVKMREEREQNSDFVKFLKGRGISDTYVADF---MQAGRKQFKQVET 177
+ +L R+L +++ ++ K + IS+ + A R+ KQ+E
Sbjct: 306 VLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEA 365

Query: 178 AH 179
H
Sbjct: 366 EH 367


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0682   FLGHOOKAP1    28     0.036    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 28.4 bits (63), Expect = 0.036
Identities = 5/32 (15%), Positives = 13/32 (40%)

Query: 3 GLYIGAAGMMNYMQHIQVHSNNVANAQTPGFK 34
+ +G+ + SNN+++ G+
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYT 34


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0685   SECFTRNLCASE  28     0.032    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 28.3 bits (63), Expect = 0.032
Identities = 13/50 (26%), Positives = 25/50 (50%), Gaps = 8/50 (16%)

Query: 3 ITTIIGLVLAVIVIAGSFMIQNISLAMLFSAEALIVIILGTITAVMMAHP 52
+TT+ L L ++I G +I+ AM++ + GT ++V +A
Sbjct: 262 MTTL--LALVPMLIWGGDVIRGFVFAMVWG------VFTGTYSSVYVAKN 303


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0686   OMPADOMAIN    58     1e-11    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 57.6 bits (139), Expect = 1e-11
Identities = 36/129 (27%), Positives = 58/129 (44%), Gaps = 16/129 (12%)

Query: 148 ITIRDDILFQSGSAEL-SAGKREIAKEIGELFAQGKGTMEGIVSGHTDNVPISTSIYSSN 206
T++ D+LF A L G+ + + +L +V G+TD I + Y N
Sbjct: 215 FTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDR--IGSDAY--N 270

Query: 207 WELSVARAVNFMEAIIQENSEVNPGEFSARGYGEFRPVAKNDIAANREK---------NR 257
LS RA + ++ +I + + + SARG GE PV N +++ +R
Sbjct: 271 QGLSERRAQSVVDYLISKG--IPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDR 328

Query: 258 RVEIMVRPI 266
RVEI V+ I
Sbjct: 329 RVEIEVKGI 337


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0688   SYCDCHAPRONE  29     0.039    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 28.7 bits (64), Expect = 0.039
Identities = 21/130 (16%), Positives = 48/130 (36%), Gaps = 8/130 (6%)

Query: 165 IYHYGYMSEIVEKQDKSDRNLRLLEKEVKNNKNSGFVHFNIGQEMNRLGNKKEALKEFSE 224
+Y + K + + + + L V ++ +S F +G +G A+ +S
Sbjct: 39 LYSLAFNQYQSGKYEDAHKVFQALC--VLDHYDSRF-FLGLGACRQAMGQYDLAIHSYSY 95

Query: 225 AFRLRDHNHYIWAKLSAYHIAELLEQEKRYDESLAIIEEARVIWPNVPEFPLKKANILYV 284
+ +H AE L Q+ E+ + + A+ + + EF + +
Sbjct: 96 GAIMDIKEPR-----FPFHAAECLLQKGELAEAESGLFLAQELIADKTEFKELSTRVSSM 150

Query: 285 NHQLEDAKEI 294
++ KE+
Sbjct: 151 LEAIKLKKEM 160


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0689   HTHFIS        43     9e-07    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 42.9 bits (101), Expect = 9e-07
Identities = 30/114 (26%), Positives = 47/114 (41%), Gaps = 13/114 (11%)

Query: 175 TIFIAEDSQMLRQLLEDTLHEAGYTNLQFFANGREAQEHIFKLLKEQKEQTFENVNLLIT 234
TI +A+D +R +L L AGY +N I +L++T
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRIT-SNAATLWRWI-------AAGDG---DLVVT 53

Query: 235 DIEMPQMDGHHLTKVIKEDEIGRELPVVIFSSLITEDLEHKGAGVGADAQVSKP 288
D+ MP + L IK + +LPV++ S+ T K + GA + KP
Sbjct: 54 DVVMPDENAFDLLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKP 105


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0690   FLAGELLIN     127    2e-35    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 127 bits (319), Expect = 2e-35
Identities = 84/277 (30%), Positives = 129/277 (46%), Gaps = 9/277 (3%)

Query: 1 MKVNTNIISLKTQEYLRKNNEGMTQAQERLASGKRINSSLDDAAGLAVVTRMNVKSTGLD 60
+NTN +SL TQ L K+ ++ A ERL+SG RINS+ DDAAG A+ R GL
Sbjct: 2 QVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLT 61

Query: 61 AASKNSSMGIDLLQTADSALSSMSSILQRMRQLAVQSSNGSFSDEDRKQYTAEFGSLIKE 120
AS+N++ GI + QT + AL+ +++ LQR+R+L+VQ++NG+ SD D K E ++E
Sbjct: 62 QASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEE 121

Query: 121 LDHVADTTNYNNIKLLDQTATGAATQVSIQASDKANDLINIDLFNAKGLSAGTITLGSGS 180
+D V++ T +N +K+L Q Q+ IQ + I IDL S G G
Sbjct: 122 IDRVSNQTQFNGVKVLSQD-----NQMKIQVGANDGETITIDLQKIDVKSLG----LDGF 172

Query: 181 TVAGYSALSVADADSSQQATEAIDELINNISNGRALLGAGMSRLSYNVSNVNNQSIATKA 240
V G +V D SS + D + R + +G V ++ A
Sbjct: 173 NVNGPKEATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAA 232

Query: 241 SASSIEDADMAAEMSEMTKYKILTQTSISMLSQANQT 277
+ D ++ K T + + A
Sbjct: 233 NGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAI 269



Score = 75.9 bits (186), Expect = 1e-17
Identities = 51/294 (17%), Positives = 103/294 (35%), Gaps = 16/294 (5%)

Query: 4 NTNIISLKTQEYLRKNNEGMTQAQERLASGKRINSSLDDAAGLAVVTRMNVKSTGLDAAS 63
+T ++ + Y+ N +T + + + AG A + G
Sbjct: 217 DTTAPTVPDKVYVNAANGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGD 276

Query: 64 KNSSMGIDLLQTADSALSSMSSILQRMRQLAVQSSNGSFSDEDRKQYTAEFGSLIKELDH 123
G+ + + + V + + ++ +
Sbjct: 277 TFDYKGVTFTIDTKTGNDGNGKVSTTINGEKVTLTVAD----ITAGAANVDAATLQSSKN 332

Query: 124 VADTTNYNNIKLLDQTATGAATQVSIQASDKANDLINIDLFNAKGLSAGTITLGSGSTVA 183
V + D+T +A ++A++ I + A+ + + +
Sbjct: 333 VYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKT 392

Query: 184 GYSA------------LSVADADSSQQATEAIDELINNISNGRALLGAGMSRLSYNVSNV 231
+ + A S+ +ID ++ + R+ LGA +R ++N+
Sbjct: 393 MFIDKTASGVSTLINEDAAAAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNL 452

Query: 232 NNQSIATKASASSIEDADMAAEMSEMTKYKILTQTSISMLSQANQTPQMLTQLI 285
N ++ S IEDAD A E+S M+K +IL Q S+L+QANQ PQ + L+
Sbjct: 453 GNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLL 506


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0691   HTHFIS        93     9e-26    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 93.4 bits (232), Expect = 9e-26
Identities = 27/115 (23%), Positives = 53/115 (46%), Gaps = 1/115 (0%)

Query: 3 KLLIVDDAMFMRTMIKNIVKDSDFEVVAEAENGLEAVKKYDEVKPDIVTLDITMPEMDGL 62
+L+ DD +RT++ + + ++V N + D+V D+ MP+ +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 63 EALAQIMAKDPSAKVIMCSAMGQQGMVVDAIKKGAKDFIVKPFQADRVLEALEKA 117
+ L +I P V++ SA + A +KGA D++ KPF ++ + +A
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0692   PF06580       36     5e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 35.6 bits (82), Expect = 5e-04
Identities = 14/67 (20%), Positives = 22/67 (32%), Gaps = 9/67 (13%)

Query: 353 LIRNSVDHGAETVEVRRKNGKNETATINLKAFHSGNNVVIEIADDGAGINKRKVLEKAIA 412
L+ N + HG + I LK V +E+ + G+ K
Sbjct: 263 LVENGIKHGIA--------QLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTG 314

Query: 413 -KNVVTR 418
+NV R
Sbjct: 315 LQNVRER 321


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0693   FLGMOTORFLIN  60     5e-15    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 59.9 bits (145), Expect = 5e-15
Identities = 20/81 (24%), Positives = 46/81 (56%), Gaps = 3/81 (3%)

Query: 21 GREKGSIRQVD---NIGVNLIVRLGKKEMPVGDIAELSIGDVLEVEKKPGHKVEIFLDEK 77
G G+++ +D +I V L V LG+ M + ++ L+ G V+ ++ G ++I ++
Sbjct: 45 GDVSGAMQDIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGY 104

Query: 78 KVGIGEAILMDENFGIVISEI 98
+ GE +++ + +G+ I++I
Sbjct: 105 LIAQGEVVVVADKYGVRITDI 125


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0695   IGASERPTASE   29     0.041    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 28.9 bits (64), Expect = 0.041
Identities = 33/202 (16%), Positives = 56/202 (27%), Gaps = 19/202 (9%)

Query: 48 EADNEEQATIPLKEIAPSLVSAKLLDSEPETKLPSAPLELKEVKETLAAIAKQAIDQPKI 107
A N E A + + + ++ S ETK + E KE T+ K ++ K
Sbjct: 1062 TAQNREVAKEAKSNVKANTQTNEVAQSGSETK-ETQTTETKE-TATVEKEEKAKVETEKT 1119

Query: 108 DSAPQVAQ--PP------------EMNTPKEPT---KNTTREQQPPPELIMPTKDSPKLA 150
P+V P E +PT K + + P K++
Sbjct: 1120 QEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNV 1179

Query: 151 ENVAKNQPALAKLPQEKEAVQLFKASIKEPVTAKEEVAVKKPAESSNIWHDTTKQLTPAA 210
E + E + + +P E K ++
Sbjct: 1180 EQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATT 1239

Query: 211 KVEVPVTLKQLDKTITDQIEQL 232
T+ D T T+ L
Sbjct: 1240 SSNDRSTVALCDLTSTNTNAVL 1261


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0697   FLGHOOKAP1    45     4e-07    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 44.9 bits (106), Expect = 4e-07
Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 5 MYTAISGMNAFQQALSVTSNNIANANTTGYKKQSVV 40
+ A+SG+NA Q AL+ SNNI++ N GY +Q+ +
Sbjct: 4 INNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTI 39



Score = 38.4 bits (89), Expect = 4e-05
Identities = 12/47 (25%), Positives = 25/47 (53%)

Query: 363 ISGSSLEGSNVDLSREFVNLMTYQSGFQGNTKVIRVADDVMKQIVNL 409
+S S V+L E+ NL +Q + N +V++ A+ + ++N+
Sbjct: 499 LSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0698   FLGMOTORFLIN  51     4e-12    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 51.4 bits (123), Expect = 4e-12
Identities = 22/68 (32%), Positives = 37/68 (54%)

Query: 7 IPLRIDFELGRTKQPVGSLLDVKKGTVFRLEDSTANVVKITISGKCIGYGEILTKDGKMF 66
IP+++ ELGRT+ + LL + +G+V L+ + I I+G I GE++ K
Sbjct: 60 IPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYG 119

Query: 67 VKITKLGE 74
V+IT +
Sbjct: 120 VRITDIIT 127


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0699   FLGMOTORFLIM  145    4e-43    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 145 bits (368), Expect = 4e-43
Identities = 81/334 (24%), Positives = 166/334 (49%), Gaps = 9/334 (2%)

Query: 1 MSDKLSQEQIDALLSQMSEGKV-VDESTEIGDFGRFHPYDFHKPEKFGAEHLESLKTIAS 59
M++ LSQ++ID LL+ +S G ++++ I D + YDF +P+KF E + +L +
Sbjct: 1 MTEVLSQDEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHE 60

Query: 60 AFTKKSMEFVSQRIRIPIHTEATLADQVSFASGYIETMPNDSYIFCIIDLGNPELGQIII 119
F + + +S ++R +H DQ+++ +I ++P S +I + +P G ++
Sbjct: 61 TFARLTTTSLSAQLRSMVHVHVASVDQLTYEE-FIRSIPTPS-TLAVITM-DPLKGNAVL 117

Query: 120 ELDLAYIIYIHECLSGGNPKRKLSERRLLSVFEELTLKSILEKFCEALKDSFKSVHPISP 179
E+D + I + L GG + +R L+ E ++ ++ + +++S+ V + P
Sbjct: 118 EVDPSITFSIIDRLFGGT-GQAAKVQRDLTDIENSVMEGVIVRILANVRESWTQVIDLRP 176

Query: 180 EIVNIETNPALLRVTSPNDMMALVSVDIKSEFWISTMRIGVPFFSVEEIMNKLEN---VV 236
+ IETNP ++ P++M+ LV+++ K M +P+ ++E I++KL +
Sbjct: 177 RLGQIETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFS 236

Query: 237 EYTFDKRRNFDAEVEQELHQVEKEARIRVGEIKTTWKELNKLEVGDVL-LTETHIRDTLK 295
+ + +L V+ + VG ++ + +++ L VGD++ L +TH+ D
Sbjct: 237 SVRRSSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFV 296

Query: 296 GYVTEKWKFECYMGKSGNQKAVKFMRHTGRTEQE 329
+ + KF C G G + A + + T QE
Sbjct: 297 LSIGNRKKFLCQPGVVGKKIAAQILERIESTSQE 330


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0700   FLGMOTORFLIN  51     4e-10    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 51.4 bits (123), Expect = 4e-10
Identities = 21/71 (29%), Positives = 42/71 (59%)

Query: 444 ILEDIPVTLEVVFGTAKVKLEKFISWCEKDVIILKESMNEPLVLALNGVTIGKGILVRVD 503
++ DIPV L V G ++ +++ + + V+ L EPL + +NG I +G +V V
Sbjct: 56 LIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVA 115

Query: 504 DHFGIQMTELV 514
D +G+++T+++
Sbjct: 116 DKYGVRITDII 126


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0705   FLGHOOKAP1    135    2e-36    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 135 bits (340), Expect = 2e-36
Identities = 112/556 (20%), Positives = 215/556 (38%), Gaps = 66/556 (11%)

Query: 4 SDFNTSLSGMSAAQIANMVAQQNISNMNTPGYIRQAVDQTAVYGDGGLLGGKQTGYGVKV 63
S N ++SG++AAQ A A NIS+ N GY RQ + L G G GV V
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTT--IMAQANSTLGAGGWVGNGVYV 59

Query: 64 TDIKRLTNTALTTQYNNQIAKQSASLYQSGALNQALNLFGTPGKNTPSDNLDNFFTAWAA 123
+ ++R + +T Q + S + +++ N+ T + + + +FFT+
Sbjct: 60 SGVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLAT-QMQDFFTSLQT 118

Query: 124 LAKNPDQATNTTALLSSMSIFTDQLNQLHSGLKELETTIAADTDAAIQDLNSLIKKLGSI 183
L N + AL+ +Q L++ + + A++ +N+ K++ S+
Sbjct: 119 LVSNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASL 178

Query: 184 NKAI----GNAGSNPPNDLLNQRDQLLSTMAGYAGISVSAHPNNPDVYDVTIG-GRLVVQ 238
N I G PN+LL+QRDQL+S + G+ VS Y++T+ G +VQ
Sbjct: 179 NDQISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDG--GTYNITMANGYSLVQ 236

Query: 239 GDETTEITS-------TRTATGFEFSVDGQKLNMPE-----GSIIASVRVNQNEIKSYQE 286
G ++ + +RT + + +PE GS+ + ++ +
Sbjct: 237 GSTARQLAAVPSSADPSRTTVAY-VDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRN 295

Query: 287 KIETFSNGLAKALDDIQV------KNVNKTMDDLQK------------------INDALQ 322
+ + A+A + + + + K + DA
Sbjct: 296 TLGQLALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASA 355

Query: 323 ANPNDEKLLSNRDELLRQLEKFPGVTRSGDTLTIGGVDHPVDTLGTSTYVTDVNDFSIPI 382
D K+ + ++ Q+ + T T G T T VND S +
Sbjct: 356 VLATDYKISFDNNQW--QVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVND-SFTL 412

Query: 383 FAQSSGKWILNPAIT-------------SNADNKPFLGVIAADIASLKTDKNIQGTTFPS 429
S ++ IT ++DN+ ++ S +F
Sbjct: 413 KPVSDAIVNMDVLITDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKTVGGA---KSFND 469

Query: 430 FMDGIITEVATDASKSSATATADTQALSSLTESKSSLEGVNIDEEMTNIMQYQSYYVANT 489
+++++ + ++ ++ L+ + S+ GVN+DEE N+ ++Q YY+AN
Sbjct: 470 AYASLVSDIGNKTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANA 529

Query: 490 KAMNTVNDMMKALLAM 505
+ + T N + AL+ +
Sbjct: 530 QVLQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0706   FLAGELLIN     45     1e-07    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 45.4 bits (107), Expect = 1e-07
Identities = 37/238 (15%), Positives = 75/238 (31%), Gaps = 1/238 (0%)

Query: 1 MRISTNQQASSIINQLNNVSGNLAKYQLQVSSGKKYESMSENPGATAQILSYNHVLSQLN 60
I+TN + N LN +L+ ++SSG + S ++ A + + L
Sbjct: 2 QVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLT 61

Query: 61 REKTDVTEAKSLLNTAETSLSSMSTSMNRVNALVLQAINGTSDKDNMSQSAEEIKGLLDV 120
+ + + S+ T E +L+ ++ ++ RV L +QA NGT+ ++ +EI+ L+
Sbjct: 62 QASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEE 121

Query: 121 LISVANSED-DGRYVFSGSSTSVKPFTTDKTTGEIIYNGTTENKKFRVTDTLEVEVFHDG 179
+ V+N +G V S + + I + K +
Sbjct: 122 IDRVSNQTQFNGVKVLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEAT 181

Query: 180 SAMTDVFNNIQKIVDAMKTGDKDALSALQETNSKNIEIITNSMTNIGGQKNGVTAYDN 237
D G + + +
Sbjct: 182 VGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTD 239


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0711   FLGHOOKAP1    33     3e-04    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 32.6 bits (74), Expect = 3e-04
Identities = 27/120 (22%), Positives = 49/120 (40%), Gaps = 14/120 (11%)

Query: 4 GINTSGSALNAAKQWMEVSSNNIANADSSAAPGETPFLRKRVVLSEITPFETALTGTKGV 63
IN + S LNAA+ + +SNNI++ + + + R+ ++++ A G G
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAG------YTRQTTIMAQANSTLGA-GGWVGN 55

Query: 64 KVSEISSDTGSVKRVYDPTHPNANEAGYVNYANVDMTAEMTNLMVGQKMYAANTSALQAN 123
V V+R YD N+ + +TA + M + +TS+L
Sbjct: 56 GV-----YVSGVQREYDAFI--TNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQ 108



Score = 28.4 bits (63), Expect = 0.008
Identities = 18/72 (25%), Positives = 30/72 (41%), Gaps = 4/72 (5%)

Query: 65 VSEISSDTGSVKRVYDPTHPNANEAGY---VNYANVDMTAEMTNLMVGQKMYAANTSALQ 121
VS+I + T ++K T N + + V++ E NL Q+ Y AN LQ
Sbjct: 475 VSDIGNKTATLKTSSA-TQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQ 533

Query: 122 ANEKMMEKDLEI 133
+ + + I
Sbjct: 534 TANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0712   FLGHOOKFLIE   31     2e-04    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.

Length = 103

Score = 31.2 bits (70), Expect = 2e-04
Identities = 20/65 (30%), Positives = 37/65 (56%), Gaps = 1/65 (1%)

Query: 35 QMLDSMSDTQSNAQTSVSNLLTTGEG-NASDVLIQMKKAESEMKTAAVIRDNVIESYKQL 93
LD +SDTQ+ A+T G +DV+ M+KA M+ +R+ ++ +Y+++
Sbjct: 39 AALDRISDTQTAARTQAEKFTLGEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEV 98

Query: 94 LNMQV 98
++MQV
Sbjct: 99 MSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0713   FLGMRINGFLIF  171    7e-49    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 171 bits (435), Expect = 7e-49
Identities = 110/584 (18%), Positives = 219/584 (37%), Gaps = 83/584 (14%)

Query: 9 SKLKNWHKGAILVGLFVVVTVLL---LYMNTPKTEVTLYKNLSETSQQQVTDQLAKMGVD 65
++L+ + ++V V +++ L+ TP TL+ NLS+ + QL +M +
Sbjct: 17 NRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYR-TLFSNLSDQDGGAIVAQLTQMNIP 75

Query: 66 YTVDK-SGNILVDEKVETLVRDKFADLGIPYTGQDGNDILLNSSLGASEEDKKMQEKVGT 124
Y SG I V +R + A G+P G G ++L G S+ +++ +
Sbjct: 76 YRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQVNYQRAL 135

Query: 125 KVNLEKEIVQSYGTTIDSASVQLTLPESSSIFEEASQKGTAAVTLKTKNNQTLTSEQVLG 184
+ L + I + SA V L +P+ S F + +A+VT+ + + L Q+
Sbjct: 136 EGELARTIETLGP--VKSARVHLAMPKPSL-FVREQKSPSASVTVTLEPGRALDEGQISA 192

Query: 185 IQRTVSAAVPNVASDDVAIIDTKNGVISEADTSKEEGSSAYKNEVDIQNAIGKNVKTDIE 244
+ VS+AV + +V ++D ++++++TS + + A ++ N + ++ IE
Sbjct: 193 VVHLVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLNDA---QLKFANDVESRIQRRIE 249

Query: 245 GTLSSIFALDNFRVNTNVAVNFDEIKQNTEHY-PNDGKVRSNQKDTSTDTSKGSANTTES 303
LS I N ++F +Q EHY PN ++ + + S+
Sbjct: 250 AILSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPG 309

Query: 304 ---GTASN--------------------ADVPNYTEQNGDDTNTYTSEKSSETTNYELDS 340
G SN + P + ++ S + +ET+NYE+D
Sbjct: 310 GVPGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSNYEVDR 369

Query: 341 TIQEIKKHPA-LAKTNVVVWVDQNALNK------NGVDMAEFTKAIGVSAGLTPNMTTEE 393
TI+ K + + + +V V V+ L M + + G +
Sbjct: 370 TIRHTKMNVGDIERLSVAVVVNYKTLADGKPLPLTADQMKQIEDLTREAMGFSDK----- 424

Query: 394 AGADGEAAAAPTFEGTFQNGD-VTIMPIQFLDNATPAEKDTTEKAEPASKAWIWW----L 448
GD + ++ F + D T P + +
Sbjct: 425 ------------------RGDTLNVVNSPF------SAVDNTGGELPFWQQQSFIDQLLA 460

Query: 449 AGGLLFAVIAAGIITYIILLKRKEQLEEALEPEEKDYIPAEEAIINPEEHPDFNFQTDAF 508
AG L ++ A I+ + + + E + ++ + + E +
Sbjct: 461 AGRWLLVLVVAWILWRKAVRPQLTRRVEEAKAAQE-----QAQVRQETEEAVEVRLSKDE 515

Query: 509 DLSE--PELKARKESLKNKLGEMAKEDPGRAAAVIQKWLNERQE 550
L + + E + ++ EM+ DP A VI++W++ E
Sbjct: 516 QLQQRRANQRLGAEVMSQRIREMSDNDPRVVALVIRQWMSNDHE 559


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo0714   FLGMOTORFLIG  193    8e-61    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 193 bits (491), Expect = 8e-61
Identities = 112/335 (33%), Positives = 189/335 (56%), Gaps = 4/335 (1%)

Query: 34 SGISRREKAALIIWSLDEQIATEVVDLLPDASKQRLAREMAKMKEMDGGAVEEATREFLG 93
S ++ ++KAA+++ S+ +I+++V L + L E+AK++ + + EF
Sbjct: 13 SALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFK- 71

Query: 94 ELELLSGGIAKLDREHLQRLFPDMTTEELNQLIYGVEAESRIGETALDILREIDDVDSLF 153
EL + I K ++ + L + I S + + +R D ++
Sbjct: 72 ELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIIN-NLGSALQSRPFEFVRRADPA-NIL 129

Query: 154 TIISDESPQTIAMIASYMKPEEASKLLALLPEEKMINTVIGIASLEQFDSEVMQNVSNLL 213
I E PQTIA+I SY+ P++AS +L+ LP E N IA +++ EV++ V +L
Sbjct: 130 NFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVL 189

Query: 214 RIKLDTMSNSSLNKTDGIKNVANILNNVTRGLERTIFEHLDAEQAELSERIKEKMFMFED 273
KL ++S+ G+ NV I+N R E+ I E L+ E EL+E IK+KMF+FED
Sbjct: 190 EKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFED 249

Query: 274 IILLDNMTLQQVLAEIQDNNKIARALKNEKEELKEKILSCVSKNRRDMITEELEVLGPIR 333
I+LLD+ ++Q+VL EI D ++A+ALK+ ++EKI +SK M+ E++E LGP R
Sbjct: 250 IVLLDDRSIQRVLREI-DGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTR 308

Query: 334 LSDVEQAQQDIANVVKNLEKDGKIVIQRGEQDVLI 368
DVE++QQ I ++++ LE+ G+IVI RG ++ ++
Sbjct: 309 RKDVEESQQKIVSLIRKLEEQGEIVISRGGEEDVL 343


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
lmo0715   GPOSANCHOR  32     0.002    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 32.0 bits (72), Expect = 0.002
Identities = 15/79 (18%), Positives = 33/79 (41%)

Query: 24 YLDDIEETEEIESPYSKELEQLESHQKELEKHLSAIEIEQQKLANEKAALQAERQAIEEL 83
I+ E ++ +LE + +A + + L EKAAL+AE+ +E
Sbjct: 244 DSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQ 303

Query: 84 RRDAEAEIEANKQAFEKEK 102
+ A ++ ++ + +
Sbjct: 304 SQVLNANRQSLRRDLDASR 322


36  lmo1343  lmo1346  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
lmo1343   1       15       -3.382087  hypothetical protein
lmo1344   0       14       -2.434034  competence protein ComG
lmo1345   -1      13       -1.394742  competence protein ComGF
lmo1346   -1      13       -1.349216  competence protein ComGE
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo1343   BCTERIALGSPG  34     2e-05    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 34.5 bits (79), Expect = 2e-05
Identities = 16/46 (34%), Positives = 27/46 (58%)

Query: 1 MNKINGFSLVESMVSLLLFAMVCSFLLPTAMTIFEKLDHQKETSRV 46
+K GF+L+E MV +++ ++ S ++P M EK D QK S +
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDI 49


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo1344   BCTERIALGSPH  45     1e-08    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 44.6 bits (105), Expect = 1e-08
Identities = 19/78 (24%), Positives = 34/78 (43%), Gaps = 1/78 (1%)

Query: 1 MKINGFTLLEMLLVLTISFTLITLTIFPISSTLSTLREKQLLEEIKASIYYAQINAVATN 60
M+ GFTLLEM+L+L + + + ++ Q L +A + + Q + T
Sbjct: 1 MRQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDD-SAAQTLARFEAQLRFVQQRGLQTG 59

Query: 61 QDTFISFDPTKNQLITYT 78
Q +S P + Q +
Sbjct: 60 QFFGVSVHPDRWQFLVLE 77


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo1345   BCTERIALGSPG  47     5e-10    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 47.2 bits (112), Expect = 5e-10
Identities = 22/92 (23%), Positives = 46/92 (50%), Gaps = 4/92 (4%)

Query: 9 RDERGFTLVEMLIVLLVVSVLLLLTIPNIVSQSKSINDKGCEAFISMVQGQVQSYQLDKN 68
+RGFTL+E+++V++++ VL L +PN++ + + + + I ++ + Y+LD +
Sbjct: 5 DKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDNH 64

Query: 69 SIPS----VADLVSGGYLKANQKSCPNGNSIK 96
P+ + LV L + IK
Sbjct: 65 HYPTTNQGLESLVEAPTLPPLAANYNKEGYIK 96


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo1346   BCTERIALGSPF  74     1e-16    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 73.7 bits (181), Expect = 1e-16
Identities = 65/352 (18%), Positives = 146/352 (41%), Gaps = 21/352 (5%)

Query: 5 QRTNWKDDGEFLIRVASLLEKGFSLDATISYL--SITSPKYCKRYERIITSLANGNSFSY 62
R + D ++A+L+ L+ + + P + + + + G+S +
Sbjct: 63 IRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLAD 122

Query: 63 ALSKNG--FPDFICSQLHYASSHGYFLQTIHETGVHMKRKAEEKNALMKTFQYPLVLFST 120
A+ F C+ + + G+ ++ + +++ + ++ + + YP VL
Sbjct: 123 AMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLTVV 182

Query: 121 VILVFFLLRIFLLPKFELLFTQ------LSTNGTVGTNFTYFLLEKVPILLGIFLLSLFL 174
I V +L ++PK F LST +G + + P +L LL+ F+
Sbjct: 183 AIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGM--SDAVRTFGPWMLLA-LLAGFM 239

Query: 175 IFSFIIRKQKQKNAYDRAYFYCRIPYIRQFSRIHYSQYLSRELGYLLKSGLSITHIMHLF 234
F ++R++K++ ++ R +P I + +R + +R L L S + + M +
Sbjct: 240 AFRVMLRQEKRRVSFHRRLL--HLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRIS 297

Query: 235 AQEESPAFFQEIARQILPTLEQGLSLTKALEKMPIFEKELYYIAIHGEKNGNLA---EEF 291
S + + + +G+SL KALE+ +F + ++ GE++G L E
Sbjct: 298 GDVMSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERA 357

Query: 292 LFYYNLCHQKSLQKTEKLFSFIQPIVFIVIGILIVSIYLSILYPMFSMVNQI 343
+ + LF +P++ + + +++ I L+IL P+ + +
Sbjct: 358 ADNQDREFSSQMTLALGLF---EPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


37  lmo1473  lmo1479  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
lmo1473   -1      15       -0.976141  16S ribosomal RNA methyltransferase RsmE
lmo1474   0       12       -1.411038  ribosomal protein L11 methyltransferase
lmo1475   0       12       -1.326554  molecular chaperone DnaJ
lmo1476   0       12       -2.106714  molecular chaperone DnaK
lmo1477   -2      13       -3.046138  heat shock protein GrpE
lmo1478   -2      14       -2.925721  heat-inducible transcription repressor
lmo1479   -1      14       -2.796414  coproporphyrinogen III oxidase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo1473   SHAPEPROTEIN  160    3e-46    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 160 bits (406), Expect = 3e-46
Identities = 75/353 (21%), Positives = 142/353 (40%), Gaps = 38/353 (10%)

Query: 2 SKIIGIDLGTTNSAVAVLEGGEAKIIPNPEGARTTPSVVGFKNGERQVGEVAKRAAITNP 61
S + IDLGT N+ + V G P+ R G VG AK+ P
Sbjct: 10 SNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDR--AGSPKSVAAVGHDAKQMLGRTP 67

Query: 62 NTISSIKRHMGTNYKETIEGKDYSPQEISAIILQYLKSYAEDYLGETVDKAVITVPAYFN 121
I++I R M K+ + + +++ ++ + S + + ++ VP
Sbjct: 68 GNIAAI-RPM----KDGVIADFFVTEKMLQHFIKQVHS---NSFMRPSPRVLVCVPVGAT 119

Query: 122 DAQRQATKDAGKIAGLEVERIINEPTAAALAYGMDKTETDQTILVFDLGGGTFDVSILEL 181
+R+A +++ + AG +I EP AAA+ G+ +E +V D+GGGT +V+++ L
Sbjct: 120 QVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSE-ATGSMVVDIGGGTTEVAVISL 178

Query: 182 GDGVFEVHSTAGDNELGGDDFDKKIIDYLVAEFKKDNGIDLSQDKMALQRLKDAAEKAKK 241
V + +GGD FD+ II+Y+ + G + AE+ K
Sbjct: 179 NGVV-----YSSSVRIGGDRFDEAIINYVRRNYGSLIG-------------EATAERIKH 220

Query: 242 DLS----GVTSTQISLPFITAGEAGPLHLEVTLTRAKFDELTHDLVERTIAPTRQALKDA 297
++ G +I + E P + + + L + + ++ AL+
Sbjct: 221 EIGSAYPGDEVREIEVRGRNLAEGVPRGFTLN-SNEILEAL-QEPLTGIVSAVMVALEQC 278

Query: 298 --NLSASDIDQ-VILVGGSTRIPAVQETIKKELGKEPHKGVNPDEVVAMGAAI 347
L++ ++ ++L GG + + + +E G +P VA G
Sbjct: 279 PPELASDISERGMVLTGGGALLRNLDRLLMEETGIPVVVAEDPLTCVARGGGK 331


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo1474   CHANLCOLICIN  29     0.012    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 28.9 bits (64), Expect = 0.012
Identities = 16/78 (20%), Positives = 33/78 (42%)

Query: 6 NKKERLADEIEQEELNILDEAEEAVEEEATADTLTEEQAKILELENKLDEVENRYLRMQA 65
+ E+ EIE+E+ + + A EE L+EE + + KL ++ ++M
Sbjct: 151 QEAEQRRKEIEREKAETERQLKLAEAEEKRLAALSEEAKAVEIAQKKLSAAQSEVVKMDG 210

Query: 66 DFENVKKRHIADRDASQK 83
+ + + R + A
Sbjct: 211 EIKTLNSRLSSSIHARDA 228


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo1477   NUCEPIMERASE  54     3e-10    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 53.6 bits (129), Expect = 3e-10
Identities = 68/350 (19%), Positives = 125/350 (35%), Gaps = 55/350 (15%)

Query: 4 NVLVTGGTGFLGMHIIFQLLQQGYQVK-----TTVRSLKSKEKVIEILQNNGITDFTHLS 58
LVTG GF+G H+ +LL+ G+QV + K+ +E+L G
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQ------ 55

Query: 59 FVELDLSKDEGWKEAMLDCE----YVLSVASPVFFGKFKNEEELISPAIEGITRILQAAK 114
F ++DL+ EG + ++ V + +N + G IL+ +
Sbjct: 56 FHKIDLADREGMTDLFASGHFERVFISPHRLAVRYS-LENPHAYADSNLTGFLNILEGCR 114

Query: 115 EAKVKRVVMTSNFGAIGFSNADKNSITTEAYWTDELAKGLSAYEKSKLIAEKEAWKFMEN 174
K++ ++ S+ G + K +T+ D + +S Y +K E A +
Sbjct: 115 HNKIQHLLYASSSSVYGLN--RKMPFSTD----DSVDHPVSLYAATKKANELMAHTYSHL 168

Query: 175 ----ETELEFATINPVAIFGPSQSNHVSGSFDLLKNLLNGSMKRVINIPLNVVDARD--- 227
T L F T ++GP ++ F K +L G + I++ RD
Sbjct: 169 YGLPATGLRFFT-----VYGPWGRPDMA-LFKFTKAMLEG---KSIDVYNYGKMKRDFTY 219

Query: 228 ---VADLHIRAMIT-PEANGERFIASADGEISMAD-----IAHLLQRERPELVDKMPKKT 278
+A+ IR P A+ + + + S+A I + E + + +
Sbjct: 220 IDDIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDAL 279

Query: 279 LPNAAIRAAALFSKHAKEGELMINMNRQISNSKARDVLGWKPISTKEEAV 328
A L E +V+G+ P +T ++ V
Sbjct: 280 GIEAKKNMLPLQPGDVLETSADT--------KALYEVIGFTPETTVKDGV 321


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
lmo1479   TCRTETOQM  167    1e-46    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 167 bits (425), Expect = 1e-46
Identities = 98/438 (22%), Positives = 183/438 (41%), Gaps = 87/438 (19%)

Query: 12 KIRNFSIIAHIDHGKSTLADRILEQTGALTHR----EMKNQLLDSMDLERERGITIKLNA 67
KI N ++AH+D GK+TL + +L +GA + D+ LER+RGITI+
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGA-ITELGSVDKGTTRTDNTLLERQRGITIQTGI 60

Query: 68 VQLKYKAKDGETYIFHLIDTPGHVDFTYEVSRSLAACEGAILVVDAAQGIEAQTLANVYL 127
++ E ++IDTPGH+DF EV RSL+ +GAIL++ A G++AQT +
Sbjct: 61 TSFQW-----ENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHA 115

Query: 128 ALDNDLEILPVINKIDLPAADPERVREEIEDVIG-------------------------- 161
+ + INKID D V ++I++ +
Sbjct: 116 LRKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQW 175

Query: 162 ---LDASDAVLASAKSGIGI--EDI--------------------------LEQIVE--- 187
++ +D +L SG + ++ ++ ++E
Sbjct: 176 DTVIEGNDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVIT 235

Query: 188 -KVPEPSGDVNKPLKALIFDSVFDAYRGVIANIRIMDGVVKAGDRIKMMSNGKEFEVTEV 246
K + L +F + R +A IR+ GV+ D +++ K ++TE+
Sbjct: 236 NKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKI-KITEM 294

Query: 247 GVF-SPKATPRDELLVGDVGYLTAAIKNVGDTRVGDTITLANNPAEEALDGYRKLNPMVY 305
+ + D+ G++ L + +GDT L P E ++ P++
Sbjct: 295 YTSINGELCKIDKAYSGEIVILQNEFLKLNSV-LGDTKLL---PQRERIENPL---PLLQ 347

Query: 306 CGLYPIDSSKYNDLRDALEKLELNDSALQFE--AETSQALGFGFRCGFLGLLHMEIIQER 363
+ P + L DAL ++ +D L++ + T + + FLG + ME+
Sbjct: 348 TTVEPSKPQQREMLLDALLEISDSDPLLRYYVDSATHEII-----LSFLGKVQMEVTCAL 402

Query: 364 IEREFNIDLITTAPSVIY 381
++ ++++++ P+VIY
Sbjct: 403 LQEKYHVEIEIKEPTVIY 420



Score = 45.6 bits (108), Expect = 5e-07
Identities = 19/80 (23%), Positives = 32/80 (40%), Gaps = 2/80 (2%)

Query: 410 EPYVKATVMVPNDYVGAVMELAQNKRGNFITMEYLDDIRVSIVYEIPLSEIVYDFFDQLK 469
EPY+ + P +Y+ A N + + ++ V + EIP I ++ L
Sbjct: 537 EPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNNE-VILSGEIPARCI-QEYRSDLT 594

Query: 470 SSTKGYASFDYELIGYKASK 489
T G + EL GY +
Sbjct: 595 FFTNGRSVCLTELKGYHVTT 614


38  lmo1507  lmo1514  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
lmo1507   -1      10       -0.231957  alanyl-tRNA synthetase
lmo1508   -2      9        0.148855   ABC transporter ATP-binding protein
lmo1509   -2      10       1.139418   transporter
lmo1510   -1      12       0.769799   response regulator
lmo1511   -1      10       1.255119   histidine kinase
lmo1512   -1      11       1.613475   exodeoxyribonuclease V
lmo1513   -1      13       1.475470   hypothetical protein
lmo1514   -2      15       1.249839   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
lmo1507   HTHFIS  76     4e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 75.6 bits (186), Expect = 4e-18
Identities = 23/119 (19%), Positives = 58/119 (48%), Gaps = 2/119 (1%)

Query: 2 KLLMIEDNVSVCEMIEMFFMKEEIDATFVHDGKMGYEAFFKDDYDIAIIDLMLPNMDGMT 61
+L+ +D+ ++ ++ + D + + D D+ + D+++P+ +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 ICRKIREV-SDVPIIILTAKESESDQVLGLEMGADDYVTKPFSPLTLMARI-KAVTRRK 118
+ +I++ D+P+++++A+ + + E GA DY+ KPF L+ I +A+ K
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
lmo1508   PF06580  35     6e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 34.8 bits (80), Expect = 6e-04
Identities = 25/154 (16%), Positives = 53/154 (34%), Gaps = 35/154 (22%)

Query: 330 KEFLELIKEQLDYVASEK---GNTITVAIDKDMAIYADYDRLTQVFINIVKNSV-----Q 381
+ L ++ Y+ + + + AI D + +V+N + Q
Sbjct: 219 ADELTVVD---SYLQLASIQFEDRLQFENQINPAIM-DVQVPPMLVQTLVENGIKHGIAQ 274

Query: 382 FTENGQITLTGTQDYKESVLTITDTGIGMNTEELEQIWERFYKADMSRTNTAFGESGIGL 441
+ G+I L GT+D L + +TG E +G GL
Sbjct: 275 LPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE-------------------STGTGL 315

Query: 442 SIVKQLIEY---HDGSITVTSEPNKGTSFTIRLP 472
V++ ++ + I ++ + K + + +P
Sbjct: 316 QNVRERLQMLYGTEAQIKLSEKQGKVNA-MVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo1510   SYCDCHAPRONE  37     2e-05    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 37.2 bits (86), Expect = 2e-05
Identities = 29/184 (15%), Positives = 58/184 (31%), Gaps = 39/184 (21%)

Query: 33 VLLSMDDFERAELFFKRALELDDTVPAAYYSLGNLYYELERYQEAADSFQNATKQGMENG 92
L+M+ F + E+ YSL Y+ +Y++A FQ +
Sbjct: 11 YQLAMESFLKGGGTIAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDS 70

Query: 93 DLFFMLGMSFVQMEELTLAMPYLLRSVELNPEDGEALFQYGIVLARSGFYEDAINMLERV 152
F LG M G Y+ AI+
Sbjct: 71 RFFLGLGACRQAM----------------------------------GQYDLAIHSYSYG 96

Query: 153 LLVKPEDPDALYNIGAAYLAWQGDIVLAKNYFERAIATGASH----ELAENALNAIQDLE 208
++ ++P ++ L +G++ A++ A A EL+ + ++ ++
Sbjct: 97 AIMDIKEPRFPFHAAECLLQ-KGELAEAESGLFLAQELIADKTEFKELSTRVSSMLEAIK 155

Query: 209 NEAE 212
+ E
Sbjct: 156 LKKE 159



Score = 36.8 bits (85), Expect = 2e-05
Identities = 22/128 (17%), Positives = 38/128 (29%), Gaps = 4/128 (3%)

Query: 21 PSDPVGYINFGNVLLSMDDFERAELFFKRALELDDTVPAAYYSLGNLYYELERYQEAADS 80
+ +E A F+ LD + LG + +Y A S
Sbjct: 33 SDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHS 92

Query: 81 FQNATKQGMENGDLFFMLGMSFVQMEELTLAMPYLLRSVELNPEDGEALFQYGIVLARSG 140
+ ++ F +Q EL A L + EL + E + + R
Sbjct: 93 YSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTE----FKELSTRVS 148

Query: 141 FYEDAINM 148
+AI +
Sbjct: 149 SMLEAIKL 156



Score = 34.5 bits (79), Expect = 1e-04
Identities = 15/83 (18%), Positives = 27/83 (32%)

Query: 2 QEGNLEEAVKLFTEVIEEHPSDPVGYINFGNVLLSMDDFERAELFFKRALELDDTVPAAY 61
Q G E+A K+F + D ++ G +M ++ A + +D P
Sbjct: 48 QSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFP 107

Query: 62 YSLGNLYYELERYQEAADSFQNA 84
+ + EA A
Sbjct: 108 FHAAECLLQKGELAEAESGLFLA 130



Score = 27.2 bits (60), Expect = 0.037
Identities = 10/58 (17%), Positives = 17/58 (29%)

Query: 2 QEGNLEEAVKLFTEVIEEHPSDPVGYINFGNVLLSMDDFERAELFFKRALELDDTVPA 59
G + A+ ++ +P + LL + AE A EL
Sbjct: 82 AMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTE 139


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
lmo1514   PF05272  35     4e-04    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 35.4 bits (81), Expect = 4e-04
Identities = 25/109 (22%), Positives = 46/109 (42%), Gaps = 10/109 (9%)

Query: 1 MAIQPLAYRMRPKALDEIVGQTHLVGK-DKIINRMVKAKQLSSMILYGPPGIGKTSIASA 59
+ P Y+ R ++VG+ L+G +++ K S++L G GIGK+++ +
Sbjct: 558 LGKTPDDYKPRRLRYLQLVGKYILMGHVARVMEPGCKFD--YSVVLEGTGGIGKSTLINT 615

Query: 60 IAGSTKYAFRTLNAVTNNKKDMEVVAAEAKMSGTVILLLDEVHRLDKAK 108
+ G + T + K E +A G V L E+ +A
Sbjct: 616 LVG-LDFFSDTHFDIGTGKDSYEQIA------GIVAYELSEMTAFRRAD 657


39  lmo1742  lmo1750  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
lmo1742   -1      13       -2.397725  amino acid ABC transporter ATP-binding protein
lmo1743   -1      16       -3.946476  amino acid ABC transporter permease
lmo1744   0       17       -3.817335  histidine kinase
lmo1745   -1      16       -3.794588  adenine deaminase
lmo1746   -2      14       -2.643139  hypothetical protein
lmo1747   -2      13       -2.256294  hypothetical protein
lmo1748   -3      13       -2.162575  two-component response regulator
lmo1749   -3      13       -1.438355  ABC transporter permease
lmo1750   -3      16       -1.174024  ABC transporter ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
lmo1742   UREASE  53     2e-09    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 52.8 bits (127), Expect = 2e-09
Identities = 35/132 (26%), Positives = 56/132 (42%), Gaps = 27/132 (20%)

Query: 22 DLVIKNGRIINVFSGEIMDGDIAIKNGYIAGIGSF--PD-----------AEKIIDAAGA 68
D VI N I++ G I+ DI +K+G IA IG PD ++I G
Sbjct: 69 DTVITNALILD-HWG-IVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGK 126

Query: 69 FIAPGFIDAHVHVESAMVTPAEFARVLLPNGVTTIV---TDPHEIANVA----GEKGIEF 121
+ G +D+H+H + P + L +G+T ++ T P G I
Sbjct: 127 IVTAGGMDSHIH----FICP-QQIEEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHIAR 181

Query: 122 MLEDAKGAPLDM 133
M+E A P+++
Sbjct: 182 MIEAADAFPMNL 193


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo1744   NUCEPIMERASE  37     5e-05    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 37.1 bits (86), Expect = 5e-05
Identities = 16/41 (39%), Positives = 20/41 (48%), Gaps = 6/41 (14%)

Query: 1 MKILVFGGTRFFGKKLVERLVSEGHDVTIGTRGKTEDNFGD 41
MK LV G F G + +RL+ GH V +G DN D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQV-VGI-----DNLND 35


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
lmo1745   HTHFIS  73     4e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 72.6 bits (178), Expect = 4e-17
Identities = 30/118 (25%), Positives = 60/118 (50%), Gaps = 1/118 (0%)

Query: 3 KVYIVEDDEVIRDTIRKHLSKWGFEIGVVEDFNNILQEFLAFEPQLVILDVNLPFFDGFY 62
+ + +DD IR + + LS+ G+++ + + + + A + LV+ DV +P + F
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 63 WCNQIREV-SNVPIIFLSSRNSRMDQIMGMNMGADYYIEKPVDLDVLMARINALLRRT 119
+I++ ++P++ +S++N+ M I GA Y+ KP DL L+ I L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
lmo1747   PF05272  31     0.006    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.8 bits (69), Expect = 0.006
Identities = 18/53 (33%), Positives = 24/53 (45%), Gaps = 8/53 (15%)

Query: 41 GPSGAGKSTLLNVLSSIDKPTSGEIEIGGKQISTMN--GK------ELAVFRR 85
G G GKSTL+N L +D + +IG + S G E+ FRR
Sbjct: 603 GTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELSEMTAFRR 655


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
lmo1750   PF07201  29     0.011    Hypersensitivity response secretion protein HrpJ
		>PF07201#Hypersensitivity response secretion protein HrpJ

Length = 293

Score = 29.0 bits (65), Expect = 0.011
Identities = 12/69 (17%), Positives = 23/69 (33%), Gaps = 3/69 (4%)

Query: 49 EVLRRLEEYFSDKSDQGLNLSSFPKYMMETVKKASYVPAKDDDVERLKQLLVEFGSDVRA 108
++ LE + S+Q L + + A + + + + E G +
Sbjct: 120 QLKAYLEGKSEEPSEQFKMLCGLRDALKGRPELAHLSHLVEQALVSMAE---EQGETIVL 176

Query: 109 SDRITVSAA 117
RIT A
Sbjct: 177 GARITPEAY 185


40  lmo1934  lmo1948  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
lmo1934   1       16       -0.072305  ubiquinone/menaquinone biosynthesis
lmo1935   1       16       -0.147244  heptaprenyl diphosphate synthase subunit I
lmo1936   2       16       0.057428   GTP cyclohydrolase I
lmo1937   0       14       -0.553181  DNA-binding protein HU
lmo1938   0       12       -0.776504  protein-tyrosine/serine phosphatase
lmo1939   1       14       -0.859507  NAD(P)H-dependent glycerol-3-phosphate
lmo1940   0       15       -1.068877  GTP-binding protein EngA
lmo1941   0       15       -0.870240  30S ribosomal protein S1
lmo1942   -2      15       -1.631904  cytidylate kinase
lmo1943   -2      12       -1.039712  asparaginase
lmo1944   -1      11       -0.907187  hypothetical protein
lmo1945   -1      10       -0.396449  ATP-dependent DNA helicase
lmo1946   1       10       -0.660612  hypothetical protein
lmo1947   -1      10       -0.375739  ferredoxin
lmo1948   -1      11       0.678354   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo1934   DNABINDINGHU  128    7e-43    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 128 bits (324), Expect = 7e-43
Identities = 67/91 (73%), Positives = 77/91 (84%)

Query: 1 MANKTDLVNSVAELADLSKKDAAKAVEAVFETIQTSLSKGEKVQLIGFGNFEVRERAARK 60
MANK DL+ VAE +L+KKD+A AV+AVF + + L+KGEKVQLIGFGNFEVRERAARK
Sbjct: 1 MANKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARK 60

Query: 61 GRNPRTKEEIDIPASKVPAFKPGKALKEAVK 91
GRNP+T EEI I ASKVPAFK GKALK+AVK
Sbjct: 61 GRNPQTGEEIKIKASKVPAFKAGKALKDAVK 91


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
lmo1937   TCRTETOQM  32     0.006    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 31.7 bits (72), Expect = 0.006
Identities = 22/79 (27%), Positives = 36/79 (45%), Gaps = 10/79 (12%)

Query: 45 SAEWLGKEFNIIDT-GGIDLSDEPFLEQIRAQAEIAIDEADVIIFITNGREGVTDADEQV 103
S +W + NIIDT G +D FL A+ ++ D I + + ++GV +
Sbjct: 62 SFQWENTKVNIIDTPGHMD-----FL----AEVYRSLSVLDGAILLISAKDGVQAQTRIL 112

Query: 104 AKILYRSNKPIVLAINKVD 122
L + P + INK+D
Sbjct: 113 FHALRKMGIPTIFFINKID 131


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
lmo1939   PF05272  33     0.001    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 33.1 bits (75), Expect = 0.001
Identities = 37/244 (15%), Positives = 70/244 (28%), Gaps = 44/244 (18%)

Query: 6 CIAIDGPAAAGKSTVAKIVAKKLRFVYIDTGAMYRAVTYIALKNNIAYE----------D 55
+ ++G GKST+ + F +Y + +AYE D
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELSEMTAFRRAD 657

Query: 56 EKAIAALLQKTVIRFEP--GEV------QQVFVGSENVTEVIRS---------IEVTNHV 98
+A+ A R+ G Q V + N + + + V
Sbjct: 658 AEAVKAFFSSRKDRYRGAYGRYVQDHPRQVVIWCTTNKRQYLFDITGNRRFWPVLVPGRA 717

Query: 99 SIVAAHPSIREALQERQQVFATEGGIVMDGRDIGTAVLPNAELKIFLLASVEERAERRYK 158
++V + R Q+FA + + G + +I+ E R
Sbjct: 718 NLVWLQ-------KFRGQLFAEALHLYLAGE---RYFPSPEDEEIYFRPEQELRLVETGV 767

Query: 159 ENMAKGFTGDLDQLKKEIEERDHLDYTRTHSPLKKAD--DAIEVDT---TSMSIDQVANK 213
+ E + Y+ + + AD A+ D + M QV +
Sbjct: 768 QGRLWALLTREGAPAAEGAAQK--GYSVNTTFVTIADLVQALGADPGKSSPMLEGQVRDW 825

Query: 214 ILSL 217
+
Sbjct: 826 LNEN 829


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
lmo1941   IGASERPTASE  42     1e-06    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 42.0 bits (98), Expect = 1e-06
Identities = 26/111 (23%), Positives = 43/111 (38%), Gaps = 13/111 (11%)

Query: 80 SNEVKTETESTVNVSDNTQSKEEKEKAKKAAEEKA----AAEKAAEEKKAAAEKAEADKK 135
+ +ET TV + +SK ++ + A E A A++A KA + E +
Sbjct: 1029 APATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQS 1088

Query: 136 KQEEDAVKAANAKKEQEAAEEKAAADKAAAEKAAAEKAEQQKANEASQQKA 186
E KE + E K A EKA E + Q+ + + Q +
Sbjct: 1089 GSET---------KETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVS 1130



Score = 40.4 bits (94), Expect = 4e-06
Identities = 23/110 (20%), Positives = 41/110 (37%), Gaps = 1/110 (0%)

Query: 96 NTQSKEEKEKAKKAAEEKAAAEKAAEEKKAAAEKAEADKKKQEEDAVKAANAKKEQEAAE 155
TQ+ E KE A EEKA E ++ + K++Q E A +E +
Sbjct: 1094 ETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTV 1153

Query: 156 EKAAADKAAAEKAAAEKAEQQKANEASQQKAGGSHTVKAGDTLYSIARST 205
+ A + ++ + +Q S TV G+++ +T
Sbjct: 1154 NIKEPQ-SQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENT 1202



Score = 33.1 bits (75), Expect = 9e-04
Identities = 17/112 (15%), Positives = 41/112 (36%), Gaps = 4/112 (3%)

Query: 84 KTETESTVNVSDNTQSKEEKEKAKKAAEEKAAAEKAAEEKKAAAEKAEADKKKQEEDAVK 143
K E ++T + N + +E + KA + ++ E K + E++
Sbjct: 1053 KNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKA 1112

Query: 144 AANAKKEQE----AAEEKAAADKAAAEKAAAEKAEQQKANEASQQKAGGSHT 191
+K QE ++ +++ + AE A + ++ ++T
Sbjct: 1113 KVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNT 1164


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo1945   TYPE3IMSPROT  29     0.018    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 28.6 bits (64), Expect = 0.018
Identities = 27/132 (20%), Positives = 53/132 (40%), Gaps = 9/132 (6%)

Query: 9 FVSVAVLGTLAFILMMLQFPLLPSAPFLKLDFSDIPALIGGL--LFGPLAVILVELIKNV 66
F + V +A ++Q+ L S +K D I I G +F + LVE +K++
Sbjct: 88 FPLLTVAALMAIASHVVQYGFLISGEAIKPDIKKI-NPIEGAKRIFSIKS--LVEFLKSI 144

Query: 67 LLYIVSGSPVGVPVGELANFISGLFYVLPIYYLFHWLRSTKGMVLSTAVGTVLMTGAMAV 126
L ++ + + + + L + + +++ VG V+++ A
Sbjct: 145 LKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLGQILRQLMVICTVGFVVISIADYA 204

Query: 127 FNYFVLLPFYIK 138
F Y+ YIK
Sbjct: 205 FEYY----QYIK 212


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
lmo1947   PF06580  44     1e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 43.7 bits (103), Expect = 1e-06
Identities = 30/170 (17%), Positives = 57/170 (33%), Gaps = 40/170 (23%)

Query: 443 LAPLLRKVISNFDV----LAKE-NFVELGLELET---PD-LEYSYD-PDRMEQVLI---- 488
L+ L+R + + LA E V+ L+L + D L++ + V +
Sbjct: 200 LSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPML 259

Query: 489 --NLIMNAIRHTGKEGYDGKVILKQTIDVARSNLVITVSDNGSGIAEEDIPYLFERFYKV 546
L+ N I+H G + + + V + GS +
Sbjct: 260 VQTLVENGIKH-GIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE--------- 309

Query: 547 DKARKRGKAVGTGIGLAIVKNIVEAHNGK---ISVESELGKGSDFIITLP 593
TG GL V+ ++ G I + + GK + ++ +P
Sbjct: 310 ----------STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVN-AMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
lmo1948   HTHFIS  98     9e-26    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 98.0 bits (244), Expect = 9e-26
Identities = 29/136 (21%), Positives = 63/136 (46%), Gaps = 3/136 (2%)

Query: 6 RVLVVDDEDRIRRLLKMYLERENYRIEEASDGDQALSMALNNNYEVILLDLMMPGKDGIE 65
+LV DD+ IR +L L R Y + S+ + ++++ D++MP ++ +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 66 VCRELRE-FKSTPVVMLTAKGEEANRVQGFEVGADDYIVKPFSPREVVLRVKAVL--RRA 122
+ +++ PV++++A+ ++ E GA DY+ KPF E++ + L +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 123 KQSSEESAGGTPGDII 138
+ S E ++
Sbjct: 125 RPSKLEDDSQDGMPLV 140


41  lmo2171  lmo2180  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
lmo2171   -1      12       0.705251   glyoxalase
lmo2172   -1      13       -0.156383  hypothetical protein
lmo2173   1       10       -1.206488  hypothetical protein
lmo2174   1       10       -1.989502  MFS transporter
lmo2175   2       12       -2.232664  propionate CoA-transferase
lmo2176   2       10       -2.821435  sigma-54-dependent transcriptional regulator
lmo2177   2       11       -2.764406  hypothetical protein
lmo2178   2       10       -2.401288  3-ketoacyl-ACP reductase
lmo2179   1       18       -2.672058  TetR family transcriptional regulator
lmo2180   0       22       -2.321486  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
lmo2171   TCRTETB  31     0.010    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 31.0 bits (70), Expect = 0.010
Identities = 26/125 (20%), Positives = 47/125 (37%), Gaps = 7/125 (5%)

Query: 246 NMFGLTAASAAAYVSIYSLSNCLGRVVWGAVSDRLGRSNTLMIIYTVIALSLLALTTLQS 305
N F AS + + L+ +G V+G +SD+LG L+ + + S
Sbjct: 42 NDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHS 101

Query: 306 VVGFVIGIIGLGLCFGGTMGVFPSIVM----ENYGPKNQGVNYGIVFIGYSTAAFFAPKM 361
+I + G FP++VM +N+G +G++ + P +
Sbjct: 102 FFSLLI-MARFIQGAGAAA--FPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAI 158

Query: 362 AAQIA 366
IA
Sbjct: 159 GGMIA 163



Score = 29.5 bits (66), Expect = 0.025
Identities = 32/148 (21%), Positives = 55/148 (37%), Gaps = 12/148 (8%)

Query: 45 VVMAFTINAAIGPIPTILGGILTDKGKAKWAILIGGILFGLGFALTGFATSTTMLYLSYG 104
V AF + +IG T + G L+D+ K +L G I+ G ++ GF + L
Sbjct: 54 VNTAFMLTFSIG---TAVYGKLSDQLGIKRLLLFGIIINCFG-SVIGFVGHSFFSLLIMA 109

Query: 105 VLAGLGQGFAYSGCLSNTIR---LFPDKRGLASGLITAGMGGATIIAAPIANYLIETYNV 161
G G A L + + + RG A GLI + + + I + +
Sbjct: 110 RFI-QGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHW 168

Query: 162 MTAFKIMGAVYIAVVIGCSFLIRVAPAG 189
I + +I FL+++
Sbjct: 169 SYLLLIP----MITIITVPFLMKLLKKE 192


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
lmo2173   HTHFIS  371    e-127    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 371 bits (955), Expect = e-127
Identities = 126/339 (37%), Positives = 189/339 (55%), Gaps = 30/339 (8%)

Query: 146 DGTVIVAESVAMKQIVRVCNQIAPFDSKVLLYGESGTGKEVLSRYIHEKSKQAAGPFISI 205
DG +V S AM++I RV ++ D +++ GESGTGKE+++R +H+ K+ GPF++I
Sbjct: 135 DGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAI 194

Query: 206 NCAAIPKALFESELFGHEKGSFTGADIEKPGMLELADGGTLFLDEISEMPLELQAKMLRV 265
N AAIP+ L ESELFGHEKG+FTGA G E A+GGTLFLDEI +MP++ Q ++LRV
Sbjct: 195 NMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRV 254

Query: 266 LETGEVRRLGSTTETKRRFRLISATNRNLGEMVEKGTFRRDLYYRINVVPVHIPALRERP 325
L+ GE +G T + R+++ATN++L + + +G FR DLYYR+NVVP+ +P LR+R
Sbjct: 255 LQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRA 314

Query: 326 QDIIGLARQFIQKFNQKYQKDFQLSGDKTKELLSHNWPGNVRELRNKIERLVVMSGNKEV 385
+DI L R F+Q+ ++ + + + + +H WPGNVREL N + RL + +
Sbjct: 315 EDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVI 374

Query: 386 TVAETDDFALDLHFKEQTKK------------------------------DSLYLKDYLQ 415
T ++ +K S L
Sbjct: 375 TREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLA 434

Query: 416 GVEKHFILRVLEESNGNVTKAASTLGIHRSVLYRKLKTL 454
+E IL L + GN KAA LG++R+ L +K++ L
Sbjct: 435 EMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIREL 473


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo2175   DHBDHDRGNASE  130    8e-39    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 130 bits (327), Expect = 8e-39
Identities = 83/255 (32%), Positives = 129/255 (50%), Gaps = 9/255 (3%)

Query: 4 LNGKVAVVTGAASGMGQQIAILFAKEGAKVVVADLNLEAAQKTVELVEKEHGTGLAVVAN 63
+ GK+A +TGAA G+G+ +A A +GA + D N E +K V ++ E A A+
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 64 VTKQEDIENMINQAIEAFGTLDILVNNAGIMDNFVPAGELTDELWDKVFAINTTGVMRAT 123
V I+ + + G +DILVN AG++ L+DE W+ F++N+TGV A+
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVL-RPGLIHSLSDEEWEATFSVNSTGVFNAS 124

Query: 124 REALHIFEEKGQGVIVNIASAGGLFGSRAGAAYTASKHAVVGFTKNVGFQYANKNIRCNA 183
R ++ G IV + S + AAY +SK A V FTK +G + A NIRCN
Sbjct: 125 RSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184

Query: 184 IAPGAVNTNIGTTIYAPDEFGQERAMIGMGINPRAG-------DASEIAKVALFLASDDS 236
++PG+ T++ +++A DE G E+ + G + G S+IA LFL S +
Sbjct: 185 VSPGSTETDMQWSLWA-DENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQA 243

Query: 237 SFVNGTVITADAGWT 251
+ + D G T
Sbjct: 244 GHITMHNLCVDGGAT 258


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
lmo2176   HTHTETR  56     4e-12    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 56.2 bits (135), Expect = 4e-12
Identities = 17/98 (17%), Positives = 36/98 (36%)

Query: 1 MDRRVKKTKKAFNQALFTLLDQKPFQQITITDIVTEADVNRGTFYKHYRDKEELLDSIIE 60
+ ++T++ L Q+ ++ +I A V RG Y H++DK +L I E
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 61 EILMDLKSAYQDPYLHTSHFSIQTLTPSMIKIFDHVYH 98
++ + + L +I + +
Sbjct: 65 LSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVT 102


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo2180   TRNSINTIMINR  29     0.007    Translocated intimin receptor (Tir) signature.
		>TRNSINTIMINR#Translocated intimin receptor (Tir) signature.

Length = 549

Score = 29.3 bits (65), Expect = 0.007
Identities = 16/51 (31%), Positives = 27/51 (52%), Gaps = 5/51 (9%)

Query: 46 IIKSLAADAEMAGIEAKRLLKRKQALENNVQNLKNYLQTEMERMEIRKINS 96
I++ +A A+ AG A R+QA+E+N Q + Y R E +++S
Sbjct: 317 IVEQIAQQAKEAGEVA-----RQQAVESNAQAQQRYEDQHARRQEELQLSS 362


42  lmo2203  lmo2209  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
lmo2203   0       15       -1.042335  MarR family transcriptional regulator
lmo2204   0       17       -0.921774  3-oxoacyl-ACP synthase
lmo2205   0       17       -1.049928  3-oxoacyl-ACP synthase
lmo2206   -1      15       -0.832765  N-acetylmuramoyl-L-alanine amidase
lmo2207   -1      14       -1.297977  hypothetical protein
lmo2208   -1      15       -1.784061  phosphoglyceromutase
lmo2209   -2      13       -1.908534  Clp protease subunit B
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
lmo2203   FLGFLGJ  79     1e-18    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 79.0 bits (194), Expect = 1e-18
Identities = 54/165 (32%), Positives = 84/165 (50%), Gaps = 16/165 (9%)

Query: 53 QQFIQSIANDAQDLQKEEKILTSVTLAQAILESNWGKSGL----STSANNLFGIK--GSY 106
+ F+ ++ AQ ++ + + LAQA LES WG+ + + NLFG+K G++
Sbjct: 150 KAFLAQLSLPAQLASQQSGVPHHLILAQAALESGWGQRQIRRENGEPSYNLFGVKASGNW 209

Query: 107 EGSSVSMGTQEFSSGKAYHTQADFRKYPDKKASLVDHAQLFVNGVSGNANLYSAVIGETN 166
+G + T E+ +G+A +A FR Y +L D+ L Y+AV +
Sbjct: 210 KGPVTEITTTEYENGEAKKVKAKFRVYSSYLEALSDYVGLLTRNPR-----YAAVTTAAS 264

Query: 167 YKEAAYAIQDAGYATDPAYAEKLISTIENYNLDQYDQIYDTVTST 211
++ A A+QDAGYATDP YA KL + I+ Q I D V+ T
Sbjct: 265 AEQGAQALQDAGYATDPHYARKLTNMIQ-----QMKSISDKVSKT 304


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo2204   ACRIFLAVINRP  24     0.046    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 24.0 bits (52), Expect = 0.046
Identities = 12/52 (23%), Positives = 27/52 (51%), Gaps = 1/52 (1%)

Query: 9 WVFLLSLMAEFVLSSMLYVSFDMTRAIILTVGLSFF-IILITFLMPKDSEVY 59
+ +S + F+ + LY S+ + +++L V L ++L L + ++VY
Sbjct: 874 ALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVY 925


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
lmo2206   HTHFIS  35     0.001    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 34.8 bits (80), Expect = 0.001
Identities = 32/134 (23%), Positives = 55/134 (41%), Gaps = 24/134 (17%)

Query: 162 ALTKYGRDLVAEVRSG-KLDPVIGRDAEIRNVIRILSRKTKNN-PVLI-GEPGVGKTAIV 218
AL + R P++GR A ++ + R+L+R + + ++I GE G GK +
Sbjct: 118 ALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVA 177

Query: 219 EGLAQRIVRKD----------VPEGLKDKTIISLDIGSLIAGAKYRGEFEERLKAVLQEV 268
L R++ +P L I S + G + +G F
Sbjct: 178 RALHDYGKRRNGPFVAINMAAIPRDL---------IESELFGHE-KGAFTGAQTRSTGRF 227

Query: 269 KQSDGQILLFIDEI 282
+Q++G LF+DEI
Sbjct: 228 EQAEGG-TLFLDEI 240



Score = 33.3 bits (76), Expect = 0.004
Identities = 46/218 (21%), Positives = 75/218 (34%), Gaps = 31/218 (14%)

Query: 561 EREKLLKLADVLHQKVIGQDDAVQLVSDAVLRARAGIKDPKRPIGSFIFLGPTGVGKTEL 620
R L+ ++G+ A+Q + + R + + G +G GK +
Sbjct: 124 RRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQT----DLTL---MITGESGTGKELV 176

Query: 621 AKALAFNMFDSEDHMIRIDMSEYMEKHSVSRLVGAPPGYIGYEEGGQLTEAVRRNPYSI- 679
A+AL + I+M+ S L G E G T A R+
Sbjct: 177 ARALHDYGKRRNGPFVAINMAAIPRDLIESELFGH--------EKGAFTGAQTRSTGRFE 228

Query: 680 ------VLLDEIEKAHPDVFNILLQVLDDGRITDSQGRLIDFKNTVIIMTSNIGSNLLLE 733
+ LDEI D LL+VL G T GR + I+ +N L +
Sbjct: 229 QAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKD---LKQ 285

Query: 734 RTEEGEISPELES--DVMQILQSEFKPEFLNRVDDIIL 769
+G +L +V+ + P +R +DI
Sbjct: 286 SINQGLFREDLYYRLNVVPL----RLPPLRDRAEDIPD 319


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo2209   SACTRNSFRASE  29     0.016    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 28.8 bits (64), Expect = 0.016
Identities = 18/107 (16%), Positives = 43/107 (40%), Gaps = 4/107 (3%)

Query: 159 ENEELIIRQIEKGVKRIYYIEQDQEVVAVAETSAENSFSAMITGVATSDEYRQRGFASTL 218
E++++ + +E+ K + + + + + + A+I +A + +YR++G + L
Sbjct: 51 EDDDMDVSYVEEEGKAAFLYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTAL 110

Query: 219 L---KKLCCDVLAEGKKPCLFYDNPVAGEIYHRLGFEHTG-DFVMYK 261
L + + G N A Y + F D ++Y
Sbjct: 111 LHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGAVDTMLYS 157


43  lmo2500  lmo2505  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
lmo2500   -1      10       -1.328683  phosphate ABC transporter permease
lmo2501   1       12       -1.571546  phosphate ABC transporter permease
lmo2502   0       11       -1.231246  phosphate ABC transporter substrate-binding
lmo2503   1       10       -1.136242  two-component sensor histidine kinase
lmo2504   2       13       -0.438096  two-component response phosphate regulator
lmo2505   0       12       -0.187785  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
lmo2500   PF06580  37     1e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.2 bits (86), Expect = 1e-04
Identities = 30/192 (15%), Positives = 67/192 (34%), Gaps = 37/192 (19%)

Query: 407 TIIKEESDRLHRLIMDI-------LALSRIEQNPVPENVELVEVDEVIEQSARTIFEMAT 459
+I E+ + ++ + L S Q + + + +V+ + + +
Sbjct: 184 ALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLAS-IQFEDRLQF 242

Query: 460 EKNIQVIIPEKTIPSVTIETDRDKLQQILINLLSNAINYTPVDGKVEVKLIEQEAEVIIE 519
E I I + +P + +Q ++ N + + I P GK+ +K + V +E
Sbjct: 243 ENQINPAIMDVQVPPML-------VQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLE 295

Query: 520 VTDNGIGIPAKDIDRVFERFYRVDKARSRHSGGTGLGLSIVKHLVENCGG---RIEVESQ 576
V + G + TG GL V+ ++ G +I++ +
Sbjct: 296 VENTGSLALKNTKE------------------STGTGLQNVRERLQMLYGTEAQIKLSEK 337

Query: 577 EEVGSTFRVTLP 588
+ V +P
Sbjct: 338 QG-KVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
lmo2501   HTHFIS  100    2e-26    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 99.5 bits (248), Expect = 2e-26
Identities = 35/136 (25%), Positives = 77/136 (56%)

Query: 3 KILVVDDEASIVTLLQFNIEKAGFEVVTAEDGRTGYELALSEKPDLIVLDLMLPEMDGIE 62
ILV DD+A+I T+L + +AG++V + T + + DL+V D+++P+ + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 63 VTKKLRQDKVNVPILMLTAKDEELDKIIGLELGADDYMTKPFSPREVVARIKAILRRTEG 122
+ ++++ + ++P+L+++A++ + I E GA DY+ KPF E++ I L +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 123 KAEIIEELTEDVEATI 138
+ +E+ ++D +
Sbjct: 125 RPSKLEDDSQDGMPLV 140


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
lmo2504   GPOSANCHOR  46     2e-07    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 46.2 bits (109), Expect = 2e-07
Identities = 38/250 (15%), Positives = 86/250 (34%), Gaps = 8/250 (3%)

Query: 24 HADTINDMQKRQNEIEQKKSEIDKNIDSKNSELNHLESAEKDAAKELESLMKSLDDTNKK 83
+ ++++ + E+E +K++++K ++ + + K E +L D K
Sbjct: 104 NDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKA 163

Query: 84 LKEQEDKVSSENEKLKKLQKEMEKLRNDIRDRQKVLDNRARAIQTTGTATSYLDMIFEAD 143
L+ + ++++ K+K L+ E L + +K L+ L+
Sbjct: 164 LEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLE------ 217

Query: 144 DFKELVDRVTVVSAIVKADQNIMQDQKDDQDKLKVAESTSEKKLENLKVLAVELEVSKNN 203
E + + KA + M D K+K E+ L LE + N
Sbjct: 218 --AEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNF 275

Query: 204 MESQKQEKNDLVMALANKKDLTKSEQTLLASEQGALTDEEKRLASNIAGEKAKQEAAIKA 263
+ + L A + + + L ++ +K + K
Sbjct: 276 STADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKL 335

Query: 264 AEEKRMQEAA 273
E+ ++ EA+
Sbjct: 336 EEQNKISEAS 345



Score = 31.6 bits (71), Expect = 0.006
Identities = 33/189 (17%), Positives = 75/189 (39%), Gaps = 5/189 (2%)

Query: 22 GAHADTINDMQKRQNEIEQKKSEIDKNIDSKNSELNHLESAEKDAAKELESLMKSLDDTN 81
++ RQ E+E+ + ++++ LE+ + E L N
Sbjct: 249 KTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLN 308

Query: 82 KKLKEQEDKVSSENEKLKKLQKEMEKLRNDIRDRQKVLDNRARAIQTTGTATSYLDMIFE 141
+ + + E K+L+ E +KL + + + R + + A L+
Sbjct: 309 ANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEA--- 365

Query: 142 ADDFKELVDRVTVVSAIVKADQNIMQDQKDDQDKLKVAESTSEKKLENLKVLAVELEVSK 201
+ ++L ++ + A ++ + + ++ + +++ A + KL L+ L ELE SK
Sbjct: 366 --EHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESK 423

Query: 202 NNMESQKQE 210
E +K E
Sbjct: 424 KLTEKEKAE 432


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
lmo2505   GPOSANCHOR  39     4e-05    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 38.5 bits (89), Expect = 4e-05
Identities = 35/217 (16%), Positives = 80/217 (36%), Gaps = 6/217 (2%)

Query: 27 ADVNTDIQNQDKKINDIKSKKTDLQSDLSGLVADLEKAQEKAKSLQGEFDKTGKELKKLN 86
A++ ++ +K L+++ + L A ++ + ++K L
Sbjct: 193 AELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLE 252

Query: 87 EDIKSINERIKERETVLKERARAMQKTSNSNAYLEVILDAENLSDLVGRVSAVNQLVD-S 145
+ ++ R E E L+ S LE + L + +Q+++ +
Sbjct: 253 AEKAALEARQAELEKALEGAMNFSTADSAKIKTLEA--EKAALEAEKADLEHQSQVLNAN 310

Query: 146 DKSILEDQQNDEKALKTKQTAVKKKQEDQATAIHEYEAQQNKIEAQKAEK---EAIVAQL 202
+S+ D +A K + +K +E + ++ + ++A + K EA +L
Sbjct: 311 RQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKL 370

Query: 203 ASDQASAENAKAGLVSERDKAAKEATARATALREATS 239
+E ++ L + D + + AL EA S
Sbjct: 371 EEQNKISEASRQSLRRDLDASREAKKQVEKALEEANS 407


44  lmo2577  lmo2595  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
lmo2577   0       13       -0.356432  hypothetical protein
lmo2578   -2      15       0.377233   cation transporter
lmo2579   -3      15       1.146248   colossin A
lmo2579a  -3      17       1.886173   hypothetical protein
lmo2580   -2      16       2.141769   hypothetical protein
lmo2581   -1      18       3.668890   hypothetical protein
lmo2582   0       19       4.082490   hypothetical protein
lmo2583   1       21       4.146317   ABC transporter ATP-binding protein
lmo2584   1       26       3.899464   hypothetical protein
lmo2585   0       23       3.252288   histidine kinase
lmo2586   -1      17       2.558690   DNA-binding response regulator
lmo2587   0       13       1.028253   formate dehydrogenase accessory protein
lmo2588   0       15       0.669511   hypothetical protein
lmo2589   0       13       -0.024012  formate dehydrogenase subunit alpha
lmo2590   -1      15       -0.914856  hypothetical protein
lmo2591   0       13       -0.538780  multidrug transporter
lmo2592   1       18       -0.007828  TetR family transcriptional regulator
lmo2593   2       17       -0.089824  ****ATP-binding protein
lmo2594   2       16       -0.167534  N-acetylmuramoyl-L-alanine amidase
lmo2595   0       14       1.144729   aldo/keto reductase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo2577   NUCEPIMERASE  29     0.020    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 29.0 bits (65), Expect = 0.020
Identities = 12/33 (36%), Positives = 17/33 (51%), Gaps = 1/33 (3%)

Query: 48 GRLDLAILPFIHE-LNIKTPVISCNGGLVRDFT 79
GR D+A+ F L K+ + G + RDFT
Sbjct: 186 GRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFT 218


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo2578   YERSSTKINASE  31     0.007    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 30.9 bits (69), Expect = 0.007
Identities = 17/57 (29%), Positives = 31/57 (54%), Gaps = 3/57 (5%)

Query: 55 SLDEMADHLMNKYKSSNEAMSMSINSNGK---IAYQGALTKDAKRPIIKFGFDQNQA 108
+L + + L + +S+ +S+ IN +G +A Q D+ RP++KFG +Q A
Sbjct: 603 TLSQQLNTLQQQQESAKAQLSILINRSGSWADVARQSLQRFDSTRPVVKFGTEQYTA 659


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
lmo2580   PF05272  28     0.030    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.1 bits (62), Expect = 0.030
Identities = 9/30 (30%), Positives = 14/30 (46%)

Query: 37 IVGPSGAGKSTFLSIAGALLSPTEGEIAIG 66
+ G G GKST ++ L ++ IG
Sbjct: 601 LEGTGGIGKSTLINTLVGLDFFSDTHFDIG 630


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
lmo2582   PF06580  35     6e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 34.8 bits (80), Expect = 6e-04
Identities = 22/101 (21%), Positives = 42/101 (41%), Gaps = 20/101 (19%)

Query: 359 NLLTNAIKFTPQGGNIQVRLYEDTTNVFVEVQDSGVGISKVDMTKIFDRFYKANESRTRE 418
N + + I PQGG I ++ +D V +EV+++G K
Sbjct: 266 NGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALK------------------NT 307

Query: 419 EGSSGLGLS-ICQKIITLHHGEVTVQ-SSLEKGTTFTVKLP 457
+ S+G GL + +++ L+ E ++ S + V +P
Sbjct: 308 KESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
lmo2583   HTHFIS  99     1e-26    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 99 bits (249), Expect = 1e-26
Identities = 39/123 (31%), Positives = 65/123 (52%), Gaps = 1/123 (0%)

Query: 4 ILVVDDDRHILKLVGHYLRAEGFHVLEASDGVEAEKIVETEQVHLAVIDVMMPNMDGFEL 63
ILV DDD I ++ L G+ V S+ + + L V DV+MP+ + F+L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 64 CQKMRASYPDIPVIMLTAKDALADKSRGFEVGTDDYVTKPFEPEELIFRI-RALLRRSNQ 122
+++ + PD+PV++++A++ + E G DY+ KPF+ ELI I RAL +
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125

Query: 123 ASE 125
S+
Sbjct: 126 PSK 128


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
lmo2588   TCRTETB  139    3e-38    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 139 bits (351), Expect = 3e-38
Identities = 107/418 (25%), Positives = 198/418 (47%), Gaps = 19/418 (4%)

Query: 16 SYSRSLL-----VVTMIIGAFVAILNQTLLATALPMIMDDLHITAATGQWLTTAFLLTNG 70
SYS+S L ++ + I +F ++LN+ +L +LP I +D + A+ W+ TAF+LT
Sbjct: 4 SYSQSNLRHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFS 63

Query: 71 IMIPITALLIEKISSKTLFITAMTVFTIGTIIASVAGS-FPILLTGRIVQAAGAGIMMPL 129
I + L +++ K L + + + G++I V S F +L+ R +Q AGA L
Sbjct: 64 IGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPAL 123

Query: 130 LQTIFLLIFPREKRGAAMGLMGLVIAFAPAIGPTLSGWIVDSYDWRVLFLILIPIAVIDI 189
+ + P+E RG A GL+G ++A +GP + G I W L LI + I +I +
Sbjct: 124 VMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM-ITIITV 182

Query: 190 ILAFFGMKKVVKLTDTKIDFLSIVMSSIGFGALLYGFSSAGNDGWGDTTVITMLIVGVVV 249
+KK V++ D I++ S+G + +S I+ LIV V+
Sbjct: 183 PFLMKLLKKEVRIKGH-FDIKGIILMSVGIVFFMLFTTSY---------SISFLIVSVLS 232

Query: 250 IALFVWRQLVIDNPMLELHVFKYPVFSLSVILGSIVTMAMIGAEIVLPLYIQTIRGESAL 309
+FV + +P ++ + K F + V+ G I+ + G ++P ++ + S
Sbjct: 233 FLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTA 292

Query: 310 QSG-LLLLPGAIIMGIMSPITGIIFDKIGAKWLTITGVTILTIGTIPFMFLTMDTPLWYI 368
+ G +++ PG + + I I GI+ D+ G ++ GVT L++ + FL T +
Sbjct: 293 EIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMT 352

Query: 369 VVFYAVRFFGISMAMMPVSTAGMNALPNHLINHGSAVNNTIRQIAGSIGTAVLITVLT 426
++ V G+S +ST ++L G ++ N ++ G A++ +L+
Sbjct: 353 IIIVFV-LGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
lmo2589   HTHTETR  68     8e-16    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 68.1 bits (166), Expect = 8e-16
Identities = 20/74 (27%), Positives = 38/74 (51%)

Query: 2 KEKKQRIIKSAKEVFQKQGYLKTSVQDMVDAAGISKGTFYNYFTSKEELAIVIFKQEYSV 61
+E +Q I+ A +F +QG TS+ ++ AAG+++G Y +F K +L I++ S
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 62 LHQRLEYTMAQDGA 75
+ + A+
Sbjct: 70 IGELELEYQAKFPG 83


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
lmo2591   FLGFLGJ  85     3e-20    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 84.8 bits (209), Expect = 3e-20
Identities = 56/174 (32%), Positives = 83/174 (47%), Gaps = 15/174 (8%)

Query: 32 RTAQVNLTTSQQAFIDEILPAAQDGYRDGKLLTSVTLAQAILESNWGESGL----SQNSK 87
R +L +AF+ ++ AQ + + + LAQA LES WG+ + + S
Sbjct: 139 RNYDDSLPGDSKAFLAQLSLPAQLASQQSGVPHHLILAQAALESGWGQRQIRRENGEPSY 198

Query: 88 NLFGIK--GTYKGKSVSMGTMEASGSTT----ANFRVYPSWKESIEDHTALITENARYQD 141
NLFG+K G +KG + T E A FRVY S+ E++ D+ L+T N RY
Sbjct: 199 NLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYLEALSDYVGLLTRNPRYAA 258

Query: 142 AVDETDYRKALQAIKDGGYATDPDYVSKLVAIIERYNLDKYDVIYDKIESNQSL 195
+ QA++D GYATDP Y KL +I+ + I DK+ S+
Sbjct: 259 VTTAASAEQGAQALQDAGYATDPHYARKLTNMIQ-----QMKSISDKVSKTYSM 307


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
lmo2595   IGASERPTASE  33     0.001    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 33.5 bits (76), Expect = 0.001
Identities = 14/71 (19%), Positives = 21/71 (29%)

Query: 129 VKQEAIQKEEAEKAEKERKEAEEKAKQEEEAAAAKAATTDENTPSDDTVYGTLASKDTLT 188
E + + E E E EEKAK E E T + +P + +
Sbjct: 1088 SGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAR 1147

Query: 189 KEGDAFYKDEA 199
+ E
Sbjct: 1148 ENDPTVNIKEP 1158


45  lmo2650  lmo2658  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
lmo2650   3       24       2.133292  creatinine amidohydrolase
lmo2651   3       26       2.085202  phosphotriesterase
lmo2652   2       27       2.173982  PTS system ascorbate transporter subunit IIC
lmo2653   4       37       3.010174  MFS transporter
lmo2654   1       30       3.174997  PTS mannitol transporter subunit IIA
lmo2655   0       22       4.077664  transcriptional antiterminator
lmo2656   -1      21       3.922545  elongation factor Tu
lmo2657   0       22       4.250802  elongation factor G
lmo2658   -1      16       4.444367  30S ribosomal protein S7
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo2650   ARGDEIMINASE  25     0.047    Bacterial arginine deiminase signature.
		>ARGDEIMINASE#Bacterial arginine deiminase signature.

Length = 409

Score = 25.2 bits (55), Expect = 0.047
Identities = 12/52 (23%), Positives = 23/52 (44%), Gaps = 1/52 (1%)

Query: 31 DADVEHID-VSAARSMNVDIIVTSQELAETLGTDTSAKVVIVNNYFDNAEIK 81
A EH S ++ V+I ++E L + + + ++ + AEIK
Sbjct: 50 VARQEHEVFASILKNNLVEIEYIEDLISEVLVSSVALENKFISQFILEAEIK 101


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
lmo2652   PF05043  33     0.006    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 32.6 bits (74), Expect = 0.006
Identities = 36/162 (22%), Positives = 64/162 (39%), Gaps = 16/162 (9%)

Query: 7 RNMTLLESLVVANVYLAPENLQEELGISKRTLQYDVEKINKELDNIGLDGIQSVRGQGYY 66
R + LLE L + L E L ++R ++ D+ + + D I G
Sbjct: 11 RQLELLELLFEHKRWFHRSELAELLNCTERAVKDDLS----HVKSAFPDLIFHSSTNGIR 66

Query: 67 LLEEEKTTIKEILENREASHKVFSASERRIRILFFLLVTDARVIIDTINECNEVSRNTSL 126
++ + + I+ + H F S IL F+ + E +S ++
Sbjct: 67 IINTDDSDIEMVY------HHFFKHSTH-FSILEFIFFNEGCQAESICKE-FYISSSSLY 118

Query: 127 QDIKQLKLALK-QFNLELAYDRKNGNMVLGDERSIRQFFIHY 167
+ I Q+ +K QF E++ ++G+ER IR FF Y
Sbjct: 119 RIISQINKVIKRQFQFEVSLTP---VQIIGNERDIRYFFAQY 157


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
lmo2653   TCRTETOQM  97     5e-24    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 97.2 bits (242), Expect = 5e-24
Identities = 74/311 (23%), Positives = 119/311 (38%), Gaps = 51/311 (16%)

Query: 13 VNIGTIGHVDHGKTTLTAAI---TTVLAKKGYADAQAYDQIDGAPEERERGITISTAHVE 69
+NIG + HVD GKTTLT ++ + + + G D + D ER+RGITI T
Sbjct: 4 INIGVLAHVDAGKTTLTESLLYNSGAITELGSVDK-GTTRTDNTLLERQRGITIQTGITS 62

Query: 70 YQTDSRHYAHVDCPGHADYVKNMITGAAQMDGAILVVSAADGPMPQTREHILLSRQVGVP 129
+Q ++ +D PGH D++ + + +DGAIL++SA DG QTR R++G+P
Sbjct: 63 FQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIP 122

Query: 130 YIVVFMNKCDMV------------------------------------DDEELLELVEME 153
I F+NK D + E + V
Sbjct: 123 TI-FFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEG 181

Query: 154 IRDLLTEY----EFPGDDIP------VIKGSALKALQGEADWEAKIDELMEAVDSYIPTP 203
DLL +Y ++ S G A ID L+E + + +
Sbjct: 182 NDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSS 241

Query: 204 ERDTDKPFMMPVEDVFSITGRGTVATGRVERGQVKVGDEVEVIGIEEESKKVVVTGVEMF 263
V + R +A R+ G + + D V + E+ + T +
Sbjct: 242 THRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKIKITEMYTSINGE 301

Query: 264 RKLLDYAEAGD 274
+D A +G+
Sbjct: 302 LCKIDKAYSGE 312


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
lmo2654   TCRTETOQM  634    0.0      Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 634 bits (1637), Expect = 0.0
Identities = 169/691 (24%), Positives = 308/691 (44%), Gaps = 67/691 (9%)

Query: 9 KTRNIGIMAHIDAGKTTTTERILFYTGRIHKIGETHEGASQMDWMEQEQERGITITSAAT 68
K NIG++AH+DAGKTT TE +L+ +G I ++G +G ++ D E++RGITI + T
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 69 TAQWKGYRVNIIDTPGHVDFTVEVERSLRVLDGAVAVLDAQSGVEPQTETVWRQATTYGV 128
+ QW+ +VNIIDTPGH+DF EV RSL VLDGA+ ++ A+ GV+ QT ++ G+
Sbjct: 62 SFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGI 121

Query: 129 PRVVFVNKMDKIGADFLYSVGTLHERLAANAHPIQLPIGAEDTFEGIIDLIEMNALYYED 188
P + F+NK+D+ G D + E+L+A +I+ Y +
Sbjct: 122 PTIFFINKIDQNGIDLSTVYQDIKEKLSAEI------------------VIKQKVELYPN 163

Query: 189 DLGNDPHIKEIPADLKDLADEYRGKLVEAVAELDEELMMKYLEGEEITKEELKAGIRKGT 248
+ ++++ + V E +++L+ KY+ G+ + EL+
Sbjct: 164 MCVTNF----------TESEQW-----DTVIEGNDDLLEKYMSGKSLEALELEQEESIRF 208

Query: 249 LNVEFYPVVCGTAFKNKGVQPMLDAVLDYLPAPTDVPAINGVLPDGEEAARHADDSEPFS 308
N +PV G+A N G+ +++ + + + T
Sbjct: 209 HNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSSTH------------------RGQSELC 250

Query: 309 SLAFKVMTDPYVGRLTFFRVYSGTLNSGSYVQNSTKGKRERVGRILQMHANHREEISIVY 368
FK+ RL + R+YSG L+ V+ S K K ++ + +I Y
Sbjct: 251 GKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEK-IKITEMYTSINGELCKIDKAY 309

Query: 369 AGDIAAAVG----LKDTTTGDTLCDEKEQIILESMEFPEPVIQVAIEPKSKADQDKMGQA 424
+G+I L GDT + E +E P P++Q +EP ++ + A
Sbjct: 310 SGEIVILQNEFLKLNS-VLGDTKLLPQR----ERIENPLPLLQTTVEPSKPQQREMLLDA 364

Query: 425 LAKLAEEDPTFRAETDQETGQTLISGMGELHLDILVDRMRREFRVEANVGDPQVSYRETF 484
L ++++ DP R D T + ++S +G++ +++ ++ ++ VE + +P V Y E
Sbjct: 365 LLEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIKEPTVIYME-- 422

Query: 485 RKSAQVEGKFVRQSGGRGQYGHVWIEFGPNEEGKGFEFENAIVGGVVPREYIPAVQAGLE 544
R + E + + + + P G G ++E+++ G + + + AV G+
Sbjct: 423 RPLKKAEYTIHIEVPPNPFWASIGLSVSPLPLGSGMQYESSVSLGYLNQSFQNAVMEGIR 482

Query: 545 GALDNGVLAGYPLIDIKAKLYDGSYHDVDSNEMAFKVAASMALRNAAKKCDPVILEPMMA 604
+ G L G+ + D K G Y+ S F++ A + L KK +LEP ++
Sbjct: 483 YGCEQG-LYGWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQVLKKAGTELLEPYLS 541

Query: 605 VEVVIPEEYLGDIMGNITSRRGRVDGMEARGNAQVVRAFVPLANMFGYATHLRSGTQGRG 664
++ P+EYL + + + + N ++ +P + Y + L T GR
Sbjct: 542 FKIYAPQEYLSRAYTDAPKYCANIVDTQLKNNEVILSGEIPARCIQEYRSDLTFFTNGRS 601

Query: 665 VYTMQFDHYEEVPKSIAEEIIKANGGNNKED 695
V + Y + E + + N++ D
Sbjct: 602 VCLTELKGYHV---TTGEPVCQPRRPNSRID 629


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo2658   SACTRNSFRASE  28     0.009    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 28.4 bits (63), Expect = 0.009
Identities = 13/56 (23%), Positives = 19/56 (33%)

Query: 81 LYLEDLYIIPEMRGKGFGTQFFSYLSKLALARDCGRFEWWCLNENKSGMDFYEKIG 136
+ED+ + + R KG GT + A + N S FY K
Sbjct: 90 ALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHH 145


46  lmo2812  lmo2818  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
lmo2812   2       20       1.953480  hypothetical protein
lmo2813   2       17       2.039105  tRNA uridine 5-carboxymethylaminomethyl
lmo2814   1       16       2.689495  tRNA modification GTPase TrmE
lmo2815   0       19       1.803490  D-alanyl-D-alanine carboxypeptidase
lmo2816   -1      12       0.832737  hypothetical protein
lmo2817   -2      12       0.884272  TetR family transcriptional regulator
lmo2818   -3      13       1.004970  3-ketoacyl-ACP reductase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
lmo2812   BLACTAMASEA  42     9e-07    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 42.1 bits (99), Expect = 9e-07
Identities = 35/158 (22%), Positives = 66/158 (41%), Gaps = 17/158 (10%)

Query: 1 MKIHKLTWVLLIGLLLLSACSTEQPNLYLSAN--------AAAVYSVENGEALYEQNADK 52
M+ +L + L+ L L+ ++ QP + + + +G L AD+
Sbjct: 1 MRYIRLCIISLLATLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADE 60

Query: 53 VMPIASLSKLMTAFLVLEAVDNNELSWDEKLDLVRLDDPSAVSLYAITQKR---TWSVRD 109
P+ S K++ VL VD + + K+ + D V +++K +V +
Sbjct: 61 RFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQD---LVDYSPVSEKHLADGMTVGE 117

Query: 110 LYSAMLTMSANDAAETLGDRLDGADFPKEMNNQAKKLG 147
L +A +TMS N AA L + G P + +++G
Sbjct: 118 LCAAAITMSDNSAANLLLATVGG---PAGLTAFLRQIG 152


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
lmo2814   HTHTETR  56     5e-12    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 55.8 bits (134), Expect = 5e-12
Identities = 22/79 (27%), Positives = 44/79 (55%), Gaps = 3/79 (3%)

Query: 2 ARLSQEIILNMAEKIIYEKGMEKTTLYDIASNLNVTHAALYKHYRNKEDLFQKLALRWLE 61
A+ +++ IL++A ++ ++G+ T+L +IA VT A+Y H+++K DLF ++ E
Sbjct: 9 AQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI-WELSE 67

Query: 62 ETSREIFAWTQDAGQTPDD 80
E+ + + P D
Sbjct: 68 SNIGELEL--EYQAKFPGD 84


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
lmo2815   DHBDHDRGNASE  100    9e-28    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 100 bits (251), Expect = 9e-28
Identities = 70/249 (28%), Positives = 115/249 (46%), Gaps = 13/249 (5%)

Query: 5 RVAFILGGSGGIGKAVVQKLVEQNFAVAVHYAGNKAKAETLVENIVKSGGEAISVGGDVA 64
++AFI G + GIG+AV + L Q +A N K E +V ++ A + DV
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAA-VDYNPEKLEKVVSSLKAEARHAEAFPADVR 67

Query: 65 DEAQMIRAFDFIESQFGGIDVVINTAGIMKLSPIATLDMDDFDLIQRTNVRGTFVVSKQA 124
D A + IE + G ID+++N AG+++ I +L ++++ N G F S+
Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 125 A--LRVRNGGAIINFSTSVTRTSFPTYGAYVASKAGVESLTLILARELRGKDITVNAVAP 182
+ + R G+I+ ++ + AY +SKA T L EL +I N V+P
Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 183 GPTATPLFLT------GKDDKTIDNLAK---ATPLERLGQPEDIAETVAFLA-GPARWVN 232
G T T + + G + +L PL++L +P DIA+ V FL G A +
Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHIT 247

Query: 233 GQVIFTNGG 241
+ +GG
Sbjct: 248 MHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
lmo2816   TCRTETB  61     2e-12    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 61.0 bits (148), Expect = 2e-12
Identities = 83/438 (18%), Positives = 159/438 (36%), Gaps = 58/438 (13%)

Query: 8 RHSLIVLLVLFIGYTSVYVDKYTIGISLVTVSQDLGFDPSQKGLILSAFFLGYTLFQIPM 67
RH+ I++ + + + SV +++ + +SL ++ D P+ + +AF L +++
Sbjct: 11 RHNQILIWLCILSFFSV-LNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVY 69

Query: 68 GYLNNRIGARPVLAISIIIVGLFLVIFGFGYSLLFLVVIRFLSGALGHAGYPPSVSNYIS 127
G L++++G + +L III VI G+S L+++ G A +P V ++
Sbjct: 70 GKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVA 129

Query: 128 LHIPLNKRGFAQSAMLASSGFAAFIGPLLIAQLLLSVGWRNTYYWIGFAVILI--GFLIL 185
+IP RG A + + +GP + + + W Y + +I I ++
Sbjct: 130 RYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS---YLLLIPMITIITVPFLM 186

Query: 186 IVVPKAPKID---------------------------------------LNTQKEKIKVP 206
++ K +I K+ P
Sbjct: 187 KLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDP 246

Query: 207 F--SELLKDKQLWILLLSALFINAANYGLTSWLASYLNEVRGISISEVSYISSLAG-LCI 263
F L K+ I +L I G S + + +V +S +E+ + G + +
Sbjct: 247 FVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSV 306

Query: 264 LIAGVVGGYFISRFFKGKEPIIIFVFCVLGAFAVYGVYLFEQLALSVICLCLCNIFLIMA 323
+I G +GG + R I F + F L I + L
Sbjct: 307 IIFGYIGGILVDRRGPLYVLNIGVTFLSVS-FLTASFLLETTSWFMTIIIVFVLGGLSFT 365

Query: 324 FTTLMGLPHKLFQQSHIATKYAAINSGGVLGGFFAPMIIGDLVN---------ATNSYQS 374
T + + +Q + +N L I+G L++ QS
Sbjct: 366 KTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQRLLPMEVDQS 425

Query: 375 AFLFLALTLLVSGLIVLA 392
+L+ L LL SG+IV++
Sbjct: 426 TYLYSNLLLLFSGIIVIS 443


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
lmo2818   TCRTETB  111    1e-28    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 111 bits (278), Expect = 1e-28
Identities = 81/387 (20%), Positives = 158/387 (40%), Gaps = 20/387 (5%)

Query: 34 VPAVQSDLGISSDLLSIAISLTALFSGIFIVVAGGMADKFGRVKLTYIGLILSIIGSLLL 93
+P + +D + + L I V G ++D+ G +L G+I++ GS++
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96

Query: 94 VVTQGS-TLLIIGRIIQGLSAACIMPATLALMKTYFDGADRQRALSYWSIGSWGGSGICS 152
V +LLI+ R IQG AA + ++ Y +R +A G G+
Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156

Query: 153 FAGGAIATYMGWRWIFIISIVFALLGMLLIKGTPESKVVQNTKAKFDSFGLVLFVIAMVC 212
GG IA Y+ W ++ +I ++ + L+K + K FD G++L + +V
Sbjct: 157 AIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEV---RIKGHFDIKGIILMSVGIVF 213

Query: 213 LNLIITRGATFGWTSPITITMLVVFLVSAGLFFRVELRQANGFIDFSLFKNKAYTGATLS 272
L +T+ +I+ L+V ++S +F + + + F+D L KN + L
Sbjct: 214 FML---------FTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLC 264

Query: 273 NFLLNAA-AGTLVVANTYVQIGRGFTAFQSGLLSIGYLVCVLGMIR--IGEKILQRVGAR 329
++ AG + + ++ + + G + I + + +I IG ++ R G
Sbjct: 265 GGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVII-FPGTMSVIIFGYIGGILVDRRGPL 323

Query: 330 KPMILGSGITAVGIALMALTFIPGTLYTVLVFIGFALFGIGLGMYATPSTDTAISNAPED 389
+ +G +V + +F+ T + I + G GL T + S+ +
Sbjct: 324 YVLNIGVTFLSVS--FLTASFLLETTSWFMTIIIVFVLG-GLSFTKTVISTIVSSSLKQQ 380

Query: 390 KVGVASGIYKMASSLGGSFGVAISATI 416
+ G + S L G+AI +
Sbjct: 381 EAGAGMSLLNFTSFLSEGTGIAIVGGL 407



 
Contact Sachin Pundhir for Bugs/Comments.