PredictBias

Identification of genomic and pathogenicity islands in prokaryotic genomes
A) Input parameters
Genome: LF-89LF.gbk
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic):
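
The RPS-BLAST E-value setting above can be read as a filter on the virulence-factor hits listed in section C. The sketch below is a minimal illustration, assuming a hit is reported when its E-value does not exceed the cutoff. The function name `passes_cutoff` and the tuple layout are hypothetical and are not part of PredictBias; the three hits are copied from the hit tables in this report.

```python
E_VALUE_CUTOFF = 0.05  # "E-value (RPSBlast)" from the input parameters

# (locus tag, profile, E-value) copied from the hit tables in section C
hits = [
    ("PSLF89_RS01265", "PF05211", 0.045),
    ("PSLF89_RS02850", "TCRTETA", 2e-21),
    ("PSLF89_RS14785", "PHPHLIPASEA1", 4e-54),
]

def passes_cutoff(e_value, cutoff=E_VALUE_CUTOFF):
    """Assumed rule: keep a hit when its E-value does not exceed the cutoff."""
    return e_value <= cutoff

# Hits that would be kept under the assumed rule
reported = [hit for hit in hits if passes_cutoff(hit[2])]
```

Under this assumed rule, the weakest hits in the report (E-value 0.045) sit just below the cutoff, so they are best treated as borderline.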
 
C) Potential GIs and PAIs in NZ_CP011849
S.No  Start           End             Bias  Virulence  Insertion elements  Prediction
1     PSLF89_RS00320  PSLF89_RS03835  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias    %GCBias  Product
PSLF89_RS00320       0       19   3.049498  F0F1 ATP synthase subunit C
PSLF89_RS00590       0       19   3.193850  F0F1 ATP synthase subunit B
PSLF89_RS00930       1       18   2.361038  F0F1 ATP synthase subunit delta
PSLF89_RS00980       0       18   1.955775  F0F1 ATP synthase subunit alpha
PSLF89_RS01265       0       16   1.796316  F0F1 ATP synthase subunit gamma
PSLF89_RS01280       0       17   1.825647  F0F1 ATP synthase subunit beta
PSLF89_RS01285      -2       15  -1.922422  F0F1 ATP synthase subunit epsilon
PSLF89_RS01300      -2       15  -2.279111  bifunctional UDP-N-acetylglucosamine
PSLF89_RS01315      -2       15  -2.601587  glutamine--fructose-6-phosphate transaminase
PSLF89_RS01335      -1       16  -3.524313  IS6 family transposase
PSLF89_RS01350      -1       15  -4.122918  hypothetical protein
PSLF89_RS01470       0       15  -4.218749  hypothetical protein
PSLF89_RS01505       2       20  -1.110911  2OG-Fe dioxygenase family protein
PSLF89_RS37610       1       17  -1.464497  hypothetical protein
PSLF89_RS37615       1       21  -1.266860  ParB/RepB/Spo0J family partition protein
PSLF89_RS01675       0       22  -1.096718  AAA family ATPase
PSLF89_RS01720      -1       22  -1.670680  hypothetical protein
PSLF89_RS37265       1       21  -1.226896  transposase
PSLF89_RS37270       1       21  -2.592269  transposase
PSLF89_RS37275       1       22  -3.068789  transposase zinc-binding domain-containing
PSLF89_RS01900       0       22  -4.334192  IS4 family transposase
PSLF89_RS02340       2       23  -4.734744  hypothetical protein
PSLF89_RS02395       3       22  -5.749330  IS30 family transposase
PSLF89_RS02560       4       28  -7.275175  hypothetical protein
PSLF89_RS02570       4       26  -6.759015  hypothetical protein
PSLF89_RS02600       2       19  -3.316993  CBS domain-containing protein
PSLF89_RS02665       1       16  -2.128049  hypothetical protein
PSLF89_RS02690       1       14  -1.286603  methyltransferase
PSLF89_RS02695       1       16   0.188987  hypothetical protein
PSLF89_RS02715       1       16   0.258871  hypothetical protein
PSLF89_RS02850       2       15   0.224885  MFS transporter
PSLF89_RS02865       2       15  -1.431329  ubiquinol oxidase subunit II
PSLF89_RS02905       4       14  -1.898205  cytochrome o ubiquinol oxidase subunit I
PSLF89_RS03085       4       17  -3.411903  cytochrome c oxidase subunit 3
PSLF89_RS03200       3       16  -3.683083  prokaryotic cytochrome C oxidase subunit IV
PSLF89_RS03250       2       18  -4.376094  heme o synthase
PSLF89_RS03255       0       18  -4.356703  GNAT family N-acetyltransferase
PSLF89_RS03270      -1       16  -3.310203  FAD-dependent oxidoreductase
PSLF89_RS03335      -2       15  -2.966211  TVP38/TMEM64 family protein
PSLF89_RS03405      -1       15  -1.766422  proline iminopeptidase-family hydrolase
PSLF89_RS03465      -1       16  -2.506297  hypothetical protein
PSLF89_RS03485      -2       19  -2.098766  IS4 family transposase
PSLF89_RS03665      -2       18  -2.025273  transposase
PSLF89_RS37280      -1       20  -2.736520  transposase
PSLF89_RS35755       0       19  -4.004758  hypothetical protein
PSLF89_RS03800       2       20  -5.065139  IS4 family transposase
PSLF89_RS03835      -1       17  -3.507093  transposase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS01265  PF05211       28     0.045    Neuraminyllactose-binding hemagglutinin
		>PF05211#Neuraminyllactose-binding hemagglutinin

Length = 260

Score = 27.7 bits (61), Expect = 0.045
Identities = 14/59 (23%), Positives = 24/59 (40%), Gaps = 1/59 (1%)

Query: 203 WDYIYEPEAKELLDQLMVRYIESQVYQGVVENNACEQAARMVA-MKNATDNATDMIHKL 260
I EP + E LD + E + + ++ + +V+ M TDN+ D I
Sbjct: 165 KVTILEPMSGESLDSFTMDLSELDIQEKFLKTTHSSHSGGLVSTMVKGTDNSNDAIKSA 223
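
Each alignment in this report is preceded by a statistics line of the form "Identities = n/L (p%), Positives = ..., Gaps = ...". The sketch below is a minimal, hypothetical helper (the name `parse_stats` is not part of PredictBias or BLAST) that parses such a line and recomputes the percentages from the raw counts. It uses the line printed for the PSLF89_RS01265 hit above. For that line, the printed percentages equal the fraction rounded down to a whole number; this is an observation about the printed values, not a claim about how BLAST computes them.

```python
import re

# Matches e.g. "Identities = 14/59 (23%)"
STAT = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")

def parse_stats(line):
    """Return {name: (count, length, printed_percent)} for one statistics line."""
    return {name: (int(n), int(d), int(p)) for name, n, d, p in STAT.findall(line)}

line = "Identities = 14/59 (23%), Positives = 24/59 (40%), Gaps = 1/59 (1%)"
stats = parse_stats(line)

for name, (count, length, printed) in stats.items():
    # Integer division reproduces the printed percentage for this line.
    assert 100 * count // length == printed
```

With only 14 identical residues over 59 aligned positions, this hit is weak, which is consistent with its E-value of 0.045 lying close to the 0.05 cutoff.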


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS01335  SALSPVBPROT   29     0.018    Salmonella virulence plasmid 65kDa B protein signature.
		>SALSPVBPROT#Salmonella virulence plasmid 65kDa B protein signature.

Length = 591

Score = 28.9 bits (64), Expect = 0.018
Identities = 19/70 (27%), Positives = 31/70 (44%), Gaps = 3/70 (4%)

Query: 60 VHEYGPEIAKRLR-PHFRQTCASWRLDETLVKIKGHWYYLYRAIDKYGHTLDWMLSRQQN 118
+H G A RL P A W ++E++ H YY Y A + G +D +
Sbjct: 168 LHLLGKTAAARLSDPQAASHTAQWLVEESVTPAGEHIYYSYLAEN--GDNVDLNGNEAGR 225

Query: 119 AKAALRFFKK 128
++A+R+ K
Sbjct: 226 DRSAMRYLSK 235


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS02850  TCRTETA       89     2e-21    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 89.1 bits (221), Expect = 2e-21
Identities = 67/388 (17%), Positives = 141/388 (36%), Gaps = 25/388 (6%)

Query: 30 KSTFSLASLFGLRMLGLFMILPIFALYANQLHGATTLW--MGLTLGVYGATSCLFQLIFG 87
+ + S L +G+ +I+P+ L + + G+ L +Y + G
Sbjct: 5 RPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLG 64

Query: 88 WASDHFGRKKIIALGLLIFAIGSLIAGLSDSIYGVFIGRALQG-AGAIGSATLALIADLT 146
SD FGR+ ++ + L A+ I + ++ ++IGR + G GA G+ A IAD+T
Sbjct: 65 ALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADIT 124

Query: 147 KDKHRTKAMATVGMTIGFSFVIAMVLGPLLVGHIGLSGLFYLTGALALIAIIVLYKVVPS 206
R + + GF V VLG L+ G F+ AL + + ++P
Sbjct: 125 DGDERARHFGFMSACFGFGMVAGPVLGGLM-GGFSPHAPFFAAAALNGLNFLTGCFLLPE 183

Query: 207 PKRSIIHEGNTARWSQFKTVMTSPQLLNLDLGIFTLHAVLTASFMFIPLDML-------- 258
+ E R + G+ + A++ F+ + +
Sbjct: 184 SHKG---ERRPLRREALNPL----ASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIF 236

Query: 259 --HQLSLDAHEQWMVYLPVFIVSVIF-MVPFVIIAEKKRHMKGVLLGMIALMFISQLGVW 315
+ DA + I+ + + +A + + ++LGMIA L +
Sbjct: 237 GEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAF 296

Query: 316 LFDSNLPGIIISLMLFFTAFTVLEALLPSWISKVSPVAAKGTAMGIYSSSQYLGAFIGGS 375
+ +M+ + + L + +S+ +G G ++ L + +G
Sbjct: 297 ATRGWM---AFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPL 353

Query: 376 IAGLLLSWHNSTALMIIILVALAVWWIL 403
+ + + +T + A++ +
Sbjct: 354 LFTAIYAASITTWNGWAWIAGAALYLLC 381


2     PSLF89_RS05705  PSLF89_RS07240  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias    %GCBias  Product
PSLF89_RS05705       0       20  -3.708050  HAD-IA family hydrolase
PSLF89_RS05785       3       19  -3.358913  hydroxyacylglutathione hydrolase
PSLF89_RS05910       1       19  -3.661755  hypothetical protein
PSLF89_RS06050       0       18  -2.808829  SDR family NAD(P)-dependent oxidoreductase
PSLF89_RS06145       0       21  -3.385680  phosphatase PAP2 family protein
PSLF89_RS06185       0       22  -3.110747  low molecular weight phosphotyrosine protein
PSLF89_RS06210       0       22  -3.319422  polysaccharide biosynthesis/export family
PSLF89_RS06215       0       22  -5.234779  polysaccharide biosynthesis tyrosine autokinase
PSLF89_RS06245       2       23  -6.274603  UTP--glucose-1-phosphate uridylyltransferase
PSLF89_RS06630       3       23  -6.830726  glucose-1-phosphate thymidylyltransferase RfbA
PSLF89_RS06650       2       25  -7.909544  dTDP-glucose 4,6-dehydratase
PSLF89_RS06670       3       28  -8.010554  adenylyltransferase/cytidyltransferase family
PSLF89_RS07080       4       28  -7.860157  oligosaccharide flippase family protein
PSLF89_RS07090       3       28  -6.526798  CDP-glycerol glycerophosphotransferase family
PSLF89_RS07130       1       20  -4.252474  glycosyltransferase family 2 protein
PSLF89_RS07240      -1       16  -3.712867  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS06050  DHBDHDRGNASE  99     4e-27    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 99.4 bits (247), Expect = 4e-27
Identities = 68/232 (29%), Positives = 101/232 (43%), Gaps = 16/232 (6%)

Query: 4 ILITGATSGFGRATAELFADKGWSLILTGRRTQYLNNLYS--KLHSKTAIHIITLDVRDT 61
ITGA G G A A A +G + + L + S K ++ A DVRD+
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHA-EAFPADVRDS 69

Query: 62 DQVFQTLSELPPPFKEIDVLINNAGLALGLETADQANLSDWHQMIETNITGLVNVTRAIL 121
+ + + + ID+L+N AG+ L + +W N TG+ N +R++
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGV-LRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 122 PQMKTANRGYIINIGSIAANTPYIGGNVYGATKAFVDQFTKNLRTDLLGTKIRATTIAPG 181
M G I+ +GS A P Y ++KA FTK L +L IR ++PG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 182 LAETEFSIVRFKGDKNRAEQVYKD----------LKPLA-AEDIANTIDWLV 222
ET+ + D+N AEQV K LK LA DIA+ + +LV
Sbjct: 189 STETDMQWSLWA-DENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLV 239


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS06650  NUCEPIMERASE  151    2e-45    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 151 bits (384), Expect = 2e-45
Identities = 75/342 (21%), Positives = 138/342 (40%), Gaps = 45/342 (13%)

Query: 1 MKQLLVTGGAGFIGCNFVRYMLKTYNHVNIINVDKLT--YAGSLNNLKN-LPDESRHIFV 57
MK LVTG AGFIG + + +L+ + V + +D L Y SL + L + F
Sbjct: 1 MK-YLVTGAAGFIGFHVSKRLLEAGHQV--VGIDNLNDYYDVSLKQARLELLAQPGFQFH 57

Query: 58 QGDICDRLFIDQLLREHNIDTIVHFAAESHVDNSIKNPKLFIETNINGTFTLLEAARQFW 117
+ D+ DR + L + + + V S++NP + ++N+ G +LE R
Sbjct: 58 KIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNK 117

Query: 118 LEEKQWLNKGDCRFHHVSTDEVYGTLSKGAPAFTETTAYAPNSPYSASKAGSDHLVRAYF 177
++ + S+ VYG L++ P T+ + P S Y+A+K ++ + Y
Sbjct: 118 IQ----------HLLYASSSSVYG-LNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYS 166

Query: 178 HTYGLPVTISNCSNNYGPYQHREKLIPTVIHLCLAEKKIPIYGNGSNIRDWLYVEDHCSV 237
H YGLP T YGP+ + + L K I +Y G RD+ Y++D
Sbjct: 167 HLYGLPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEA 226

Query: 238 IDKILHNGRL------------------GEVYNIGANNEVDNLTLVKQVCQILDKKQPRK 279
I ++ VYNIG ++ V+ + ++ + L + +
Sbjct: 227 IIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKN 286

Query: 280 NGSSYQELITFVTDRAGHDWRYAIDNRKIKNELNWQPVYSLQ 321
+ + G + D + + + + P +++
Sbjct: 287 ----------MLPLQPGDVLETSADTKALYEVIGFTPETTVK 318


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS06670  LPSBIOSNTHSS  43     5e-08    Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein

signature.
Length = 166

Score = 42.9 bits (101), Expect = 5e-08
Identities = 19/53 (35%), Positives = 29/53 (54%), Gaps = 6/53 (11%)

Query: 8 GTFDLFHYGHLRILERARALGDKLIVGVSSDALNYNKKQCYPITPQEQRLSIV 60
G+FD +GHL I+ER L D++ V V N NK+ P+ ++RL +
Sbjct: 7 GSFDPITFGHLDIIERGCRLFDQVYVAV---LRNPNKQ---PMFSVQERLEQI 53


3     PSLF89_RS10440  PSLF89_RS11570  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias    %GCBias  Product
PSLF89_RS10440       2       13   2.324314  hypothetical protein
PSLF89_RS10495       1       12   2.514302  hypothetical protein
PSLF89_RS10540       0       12   3.052381  DNA helicase II
PSLF89_RS10730       0       19   2.282895  ProQ/FinO family protein
PSLF89_RS10735       2       18   2.747239  DUF4402 domain-containing protein
PSLF89_RS10875      -1       23  -1.197076  diaminopimelate epimerase
PSLF89_RS11185       0       18  -2.039120  lipoprotein
PSLF89_RS11440       0       16  -2.606151  formyltetrahydrofolate deformylase
PSLF89_RS11445       0       16  -3.903197  hypothetical protein
PSLF89_RS11450       0       16  -4.160568  hypothetical protein
PSLF89_RS11475       0       17  -4.428280  hypothetical protein
PSLF89_RS11485       0       17  -3.506305  cadmium carbonic anhydrase
PSLF89_RS11570      -2       17  -3.174276  ion transporter
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS10735  TYPE3OMGPROT  28     0.017    Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein

family signature.
Length = 607

Score = 27.9 bits (62), Expect = 0.017
Identities = 16/72 (22%), Positives = 26/72 (36%), Gaps = 8/72 (11%)

Query: 68 VTGEENASI-SIDY-------PDTVTLAGPASSSLTVDIQTNDENETLNGSGQLTKSFSG 119
VTG+E A + I Y P +T + SL + I+ ++ +G +
Sbjct: 385 VTGKEVAELKGITYGTMLRMTPRVLTQGDKSEISLNLHIEDGNQKPNSSGIEGIPTISRT 444

Query: 120 EVTVNATTAAGD 131
V A G
Sbjct: 445 VVDTVARVGHGQ 456


4     PSLF89_RS12465  PSLF89_RS15735  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias    %GCBias  Product
PSLF89_RS12465       0       25  -5.366559  TMEM165/GDT1 family protein
PSLF89_RS12490       0       26  -5.307804  hypothetical protein
PSLF89_RS12715       0       19  -4.156015  hypothetical protein
PSLF89_RS12870      -1       19  -3.371056  IS4 family transposase
PSLF89_RS12895       2       18  -2.733200  hypothetical protein
PSLF89_RS12965       1       18  -2.897431  hypothetical protein
PSLF89_RS12975       1       16  -0.168549  hypothetical protein
PSLF89_RS13490      -1       17   2.458781  hypothetical protein
PSLF89_RS13650       0       17   4.061367  hypothetical protein
PSLF89_RS13660       0       18   4.605644  YhbY family RNA-binding protein
PSLF89_RS13830       1       20   5.062644  hypothetical protein
PSLF89_RS13835       1       19   4.964444  23S rRNA (uridine(2552)-2'-O)-methyltransferase
PSLF89_RS13870       1       19   4.290774  ATP-dependent zinc metalloprotease FtsH
PSLF89_RS13880       0       18   3.424704  phosphoglucosamine mutase
PSLF89_RS14040       0       17   2.973808  pyruvate dehydrogenase (acetyl-transferring),
PSLF89_RS14710       1       16   3.153513  dihydrolipoyllysine-residue acetyltransferase
PSLF89_RS14720       0       19   2.239234  dihydrolipoyl dehydrogenase
PSLF89_RS14785       0       19   2.039718  phospholipase A
PSLF89_RS14790       2       24   3.545465  hypothetical protein
PSLF89_RS15135       3       20   5.289086  hypothetical protein
PSLF89_RS15435       2       18   5.182937  chaperonin GroEL
PSLF89_RS15445       1       23   5.690608  co-chaperone GroES
PSLF89_RS15505       1       23   5.218053  type II 3-dehydroquinate dehydratase
PSLF89_RS15720       1       24   4.849228  acetyl-CoA carboxylase biotin carboxyl carrier
PSLF89_RS15735       0       23   4.658176  acetyl-CoA carboxylase biotin carboxylase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS12965  FLGPRINGFLGI  28     0.045    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 27.6 bits (61), Expect = 0.045
Identities = 13/80 (16%), Positives = 26/80 (32%), Gaps = 10/80 (12%)

Query: 130 IESADTAGFK---TIYADNNNSDTTNGTHYIQQLEEIIAERTIKPYYKQAEENL---AIA 183
IE + FK + N D + ++ +++ Y E IA
Sbjct: 179 IERELPSKFKDSVNLVLQLRNPDFST----AVRVADVVNAFARARYGDPIAEPRDSQEIA 234

Query: 184 IRKRELEHKIAFLTIIEKIK 203
++K + + IE +
Sbjct: 235 VQKPRVADLTRLMAEIENLT 254


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS13490  SECBCHAPRONE  29     0.019    Bacterial protein-transport SecB chaperone protein ...
		>SECBCHAPRONE#Bacterial protein-transport SecB chaperone protein

signature.
Length = 170

Score = 29.1 bits (65), Expect = 0.019
Identities = 14/79 (17%), Positives = 29/79 (36%), Gaps = 6/79 (7%)

Query: 255 HIAAENWQPSVGVQFNQYQSQVMQAVFEFSMSEKSDLKALISQLAKYEAE------FLSS 308
HI ++W+P + + QV ++E ++ + S + E F S
Sbjct: 39 HIFQQDWEPKLSFDLSTEAKQVGDDLYEVCLNISVETTMESSGDVAFICEVKQAGVFTIS 98

Query: 309 SKIKENIPEFNEAICPRIL 327
+ + + CP +L
Sbjct: 99 GLEEMQMAHCLTSQCPNML 117


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS13660  TONBPROTEIN   26     0.043    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 26.5 bits (58), Expect = 0.043
Identities = 14/61 (22%), Positives = 21/61 (34%), Gaps = 11/61 (18%)

Query: 87 QAPTKPAAEKAKTKSKPNSKVKAREKRQEIKAKAEKEEQARKKAKYFKKVTQPRAPRNNQ 146
+ P + K K KP K K +K QE + K ++P +P N
Sbjct: 80 EPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVK-----------PVESRPASPFENT 128

Query: 147 N 147

Sbjct: 129 A 129


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS13870  HTHFIS        33     0.003    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 33.3 bits (76), Expect = 0.003
Identities = 21/82 (25%), Positives = 32/82 (39%), Gaps = 18/82 (21%)

Query: 189 VLMVGPPGTGKTLLAKAI---AGEAKVPFFS-----ISGSDFVEMFVGV------GASRV 234
+++ G GTGK L+A+A+ PF + I G GA
Sbjct: 163 LMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTR 222

Query: 235 RD-MFDQAKKRAPCIIFIDEID 255
F+QA+ +F+DEI
Sbjct: 223 STGRFEQAEGGT---LFLDEIG 241


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS14710  RTXTOXIND     36     4e-04    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 36.3 bits (84), Expect = 4e-04
Identities = 46/263 (17%), Positives = 98/263 (37%), Gaps = 23/263 (8%)

Query: 45 EVPAPFAGTVKAIKVKEGSKVSEGSLIVQMEGSD-----DVVESATVPAPVAAPTAVVAP 99
E+ VK I VKEG V +G +++++ +S+ + A + +
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILS 157

Query: 100 TGPIEVRVPDIG--------NYSGVDVIEINVAVGDQVSE---EDALITLETDKATMEVP 148
++P++ N S +V+ + + +Q S + L DK E
Sbjct: 158 RSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERL 217

Query: 149 SPVAGIVKEIKVAEGSQVSEGDL-VLIVEGAGGTSAVAHPP---ATAQQEVTTISSAVPM 204
+ +A I + ++ + D L+ + A AV A E+ S +
Sbjct: 218 TVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQ 277

Query: 205 AAVASASVQEVHVPDIGNYSGVDVIEINVAVGDCINE-EDPLITLETDKATMEVPSPVAG 263
S +E + + ++++ D I L E + + +PV+
Sbjct: 278 IESEILSAKEEYQLVTQLFKN-EILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSV 336

Query: 264 VVKEIKV-AEGSQVSEGDLIVLV 285
V+++KV EG V+ + ++++
Sbjct: 337 KVQQLKVHTEGGVVTTAETLMVI 359



Score = 35.6 bits (82), Expect = 6e-04
Identities = 15/49 (30%), Positives = 27/49 (55%)

Query: 256 EVPSPVAGVVKEIKVAEGSQVSEGDLIVLVESPGASSVVVSSVASQGAA 304
E+ +VKEI V EG V +GD+++ + + GA + + + +S A
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQA 146


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS14785  PHPHLIPASEA1  172    4e-54    Bacterial phospholipase A1 protein signature.
		>PHPHLIPASEA1#Bacterial phospholipase A1 protein signature.

Length = 289

Score = 172 bits (438), Expect = 4e-54
Identities = 89/249 (35%), Positives = 134/249 (53%), Gaps = 17/249 (6%)

Query: 82 NNSLGIIFYQPNYVLPYYYTGSPYQAIYNGQTPDNQKVMSSEFKAQLSLMVPLWKDMFGN 141
+N + Y NY++ + +AI + +N + E K QLSL PLW+ + G
Sbjct: 47 DNPFTLYPYDTNYLIYTQTSDLNKEAIASYDWAENAR--KDEVKFQLSLAFPLWRGILG- 103

Query: 142 PDYSLNVGYTQLSYWQF--YAKSQYFRETNYEPELFV---TDHFHRNW---QISYGVVHQ 193
P+ L YTQ S+WQ +S FRETNYEP+LF+ TD+ W + G H
Sbjct: 104 PNSVLGASYTQKSWWQLSNSEESSPFRETNYEPQLFLGFATDYRFAGWTLRDVEMGYNHD 163

Query: 194 SNGRGGSLERSWNRAYLNLEASGEHWLVSIKPWVLIFKPDSSDLHNPDIAHYLGHERIMF 253
SNGR RSWNR Y L A +WLV +KPW ++ ++ D NPDI Y+G+ ++
Sbjct: 164 SNGRSDPTSRSWNRLYTRLMAENGNWLVEVKPWYVV--GNTDD--NPDITKYMGYYQLKI 219

Query: 254 AYVFNNKMQASIALTNIESGMKRGAVELDYSFPLTKHINGFVQYFNGYGQSLIEYDHRTQ 313
Y + + ++ N +G G EL S+P+TKH+ + Q ++GYG+SLI+Y+
Sbjct: 220 GYHLGDAVLSAKGQYNWNTG--YGGAELGLSYPITKHVRLYTQVYSGYGESLIDYNFNQT 277

Query: 314 SVGIGIALS 322
VG+G+ L+
Sbjct: 278 RVGVGVMLN 286


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS15720  RTXTOXIND     30     0.005    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 29.8 bits (67), Expect = 0.005
Identities = 7/30 (23%), Positives = 17/30 (56%)

Query: 125 IEADKAGVVKQILLSDGDIVEFDQPLVIIE 154
I+ + +VK+I++ +G+ V L+ +
Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKLT 128


5     PSLF89_RS18995  PSLF89_RS19320  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias    %GCBias  Product
PSLF89_RS18995       2       18   1.845255  S-(hydroxymethyl)glutathione dehydrogenase/class
PSLF89_RS19000       6       17   0.588143  hypothetical protein
PSLF89_RS19005       3       14  -0.473917  IS3 family transposase
PSLF89_RS19010      -1       14  -1.718603  hypothetical protein
PSLF89_RS19015       0       15  -2.008481  hypothetical protein
PSLF89_RS35785      -1       16  -2.632090  YbfB/YjiJ family MFS transporter
PSLF89_RS19025       0       21  -5.197284  IS4 family transposase
PSLF89_RS19030       0       19  -4.534085  class I SAM-dependent methyltransferase
PSLF89_RS19035       3       20  -4.744518  hypothetical protein
PSLF89_RS19040       4       21  -6.161641  hypothetical protein
PSLF89_RS19050       1       18  -4.937317  IS30 family transposase
PSLF89_RS19055       0       17  -5.130048  hypothetical protein
PSLF89_RS19060       0       21  -6.207765  YbfB/YjiJ family MFS transporter
PSLF89_RS19070      -1       21  -6.219858  IS4 family transposase
PSLF89_RS19075       0       21  -7.362538  hypothetical protein
PSLF89_RS19080       0       21  -7.244641  hypothetical protein
PSLF89_RS19085       0       20  -6.027332  hypothetical protein
PSLF89_RS19090      -1       14  -3.930641  HAD family hydrolase
PSLF89_RS19095       0       14  -2.206377  hypothetical protein
PSLF89_RS19100      -2       15   0.427688  XRE family transcriptional regulator
PSLF89_RS19105      -1       20   4.192628  hypothetical protein
PSLF89_RS19110      -1       21   4.250040  DUF3579 domain-containing protein
PSLF89_RS19115       0       22   3.820629  CDP-diacylglycerol--serine
PSLF89_RS19120       0       22   3.790669  hypothetical protein
PSLF89_RS19125       0       18   2.032736  ribosomal protein S18-alanine
PSLF89_RS19130       0       16   0.473717  peptide chain release factor 3
PSLF89_RS19135      -2       16  -1.733355  TatD family hydrolase
PSLF89_RS19140       0       16  -2.289637  hypothetical protein
PSLF89_RS19145       1       17  -2.738450  tRNA threonylcarbamoyladenosine dehydratase
PSLF89_RS19150       2       19  -2.848638  hypothetical protein
PSLF89_RS19155       1       18  -1.261116  IS982 family transposase
PSLF89_RS19160      -3       12   3.009472  hypothetical protein
PSLF89_RS19165      -2       13   3.671686  pentapeptide repeat-containing protein
PSLF89_RS35795      -3       13   3.921834  zinc ribbon domain-containing protein
PSLF89_RS19170      -2       14   3.791678  heavy metal translocating P-type ATPase
PSLF89_RS19175       1       19   5.668152  hypothetical protein
PSLF89_RS19180       0       18   4.550635  transporter substrate-binding domain-containing
PSLF89_RS19185       0       17   2.908486  protein phosphatase 2C domain-containing
PSLF89_RS37290      -1       13   1.652753  serine/threonine protein kinase
PSLF89_RS19195       0       13   1.280077  hypothetical protein
PSLF89_RS19200      -1       14   0.545642  IS4 family transposase
PSLF89_RS19210      -3       10   0.855808  hypothetical protein
PSLF89_RS19220      -2       12   2.928822  glutamate-1-semialdehyde 2,1-aminomutase
PSLF89_RS37295      -1       14   3.064589  hypothetical protein
PSLF89_RS19235       0       15   4.658164  rubredoxin
PSLF89_RS19240       1       20   4.735988  protoporphyrinogen oxidase HemJ
PSLF89_RS19245       2       16   2.671853  iron-sulfur cluster insertion protein ErpA
PSLF89_RS19255       2       24   0.601881  hypothetical protein
PSLF89_RS19265      -1       20   1.607321  IS4 family transposase
PSLF89_RS19270      -3       15   1.414822  hypothetical protein
PSLF89_RS19280      -2       12   2.669629  hypothetical protein
PSLF89_RS19285       1       14   2.781552  MFS transporter
PSLF89_RS19290       0       22   2.739047  DNA polymerase III subunit delta'
PSLF89_RS19295       0       21   3.843660  pilus assembly protein PilZ
PSLF89_RS19300       1       22   4.784390  TatD family hydrolase
PSLF89_RS19305       2       22   4.379908  hypothetical protein
PSLF89_RS19310       2       26   3.939395  hypothetical protein
PSLF89_RS19315       2       24   4.143049  3-hydroxybutyrate dehydrogenase
PSLF89_RS19320       1       20   3.437970  putative N-acetylmannosamine-6-phosphate
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS19030  TCRTETB       29     0.010    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 29.1 bits (65), Expect = 0.010
Identities = 23/133 (17%), Positives = 55/133 (41%), Gaps = 1/133 (0%)

Query: 17 LCLFVIAMGINRFSYGPIIPFLINEHWVTSSQAGYIGSLNFLGYFIGAYIAHKLTYFIQL 76
LC+ +N +P + N+ + ++ + L + IG + KL+ + +
Sbjct: 19 LCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGI 78

Query: 77 NKIILYMLAFSVLASTLCTFNFGYI-WLGLCRFILGIVSGTIMVLTPTIILHRIAHEKKG 135
+++L+ + + S + + L + RFI G + L ++ I E +G
Sbjct: 79 KRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRG 138

Query: 136 LVSGIMFAGIGLG 148
G++ + + +G
Sbjct: 139 KAFGLIGSIVAMG 151


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS19125  SACTRNSFRASE  35     5e-05    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 34.9 bits (80), Expect = 5e-05
Identities = 11/50 (22%), Positives = 22/50 (44%)

Query: 80 IMPELQGQGYGYYLLDAIIKEVMGQGANDVFLEVRESNLAALKLYNGYGF 129
+ + + +G G LL I+ + LE ++ N++A Y + F
Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS19130  TCRTETOQM     211    3e-63    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 211 bits (539), Expect = 3e-63
Identities = 123/462 (26%), Positives = 206/462 (44%), Gaps = 49/462 (10%)

Query: 9 KRRTFAIISHPDAGKTTLTEKLLLFGGAIQMAGTV-KGRKASRHATSDWMELEKQRGISV 67
K +++H DAGKTTLTE LL GAI G+V KG +D LE+QRGI++
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKG-----TTRTDNTLLERQRGITI 56

Query: 68 TTSVMQFPYHERIINLLDTPGHEDFSEDTYRTLTAVDSALMVVDAAKGVEARTLKLWEVC 127
T + F + +N++DTPGH DF + YR+L+ +D A++++ A GV+A+T L+
Sbjct: 57 QTGITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHAL 116

Query: 128 QLRKTAAMTFVNKLDREARDPVEVLDDIETSLGIFCAPITWPIGMGKNFKGIYHLYEDKV 187
+ + F+NK+D+ D V DI+ L KV
Sbjct: 117 RKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQ------------------KV 158

Query: 188 YLYSAGKNQQVQDAEVIVGLDNPVLDDKL--GMMASELRDELELVRGASHEFDNVAYLAG 245
LY ++E + +D L M+ + + LEL + S F N
Sbjct: 159 ELYPNMCVTNFTESEQWDTVIE--GNDDLLEKYMSGKSLEALELEQEESIRFHN-----C 211

Query: 246 ELTPVFFGSAINNFGIRELLNYFAEYAPAPQVRKTHERTVAPTEDKLSGFVFKIQANMDP 305
L PV+ GSA NN GI L+ + TH + +L G VFKI+
Sbjct: 212 SLFPVYHGSAKNNIGIDNLIEVITNKFYSS----THR-----GQSELCGKVFKIE--YSE 260

Query: 306 AHRDRIAFMRVCSGQYTKGMKLKHVRTGKTVQIANAMTFMAGDRSQAEEAYPGDILGLHN 365
R R+A++R+ SG ++ K ++I T + G+ + ++AY G+I+ L N
Sbjct: 261 K-RQRLAYIRLYSGVLHLRDSVRISEKEK-IKITEMYTSINGELCKIDKAYSGEIVILQN 318

Query: 366 HGTIQIGDAFTQGEDLKFVGIPNFA-PELFRLVRLRDPLKSKALQKGLIQLSEEGAT-QV 423
+++ + L P L V P + + L L+++S+ +
Sbjct: 319 EF-LKLNSVLGDTKLLPQRERIENPLPLLQTTVEPSKPQQREMLLDALLEISDSDPLLRY 377

Query: 424 FRPLNSNDLILGAVGVLQFDVVAHRLKSEYNVECVYSNISIA 465
+ ++++IL +G +Q +V L+ +Y+VE ++
Sbjct: 378 YVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIKEPTVI 419


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS19215  IGASERPTASE   30     0.031    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.4 bits (68), Expect = 0.031
Identities = 24/135 (17%), Positives = 49/135 (36%), Gaps = 12/135 (8%)

Query: 357 GQALNSANAAVTADPYDETAIVPSSTQTQTQTQTQTIAKITTETPKIKQQSVIKKKEPEK 416
+ A + V A+ S +TQT + K K ++ ++ P+
Sbjct: 1066 REVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKV 1125

Query: 417 AQALKINQDKAQ-VKPAAKPSHVKVAAPVEKTSSVEKVVVKAQQIKRQQTNITQKTPKVN 475
+ Q++++ V+P A+P A + T +++ + + T + P
Sbjct: 1126 TSQVSPKQEQSETVQPQAEP-----ARENDPTVNIK------EPQSQTNTTADTEQPAKE 1174

Query: 476 TPSVVHHALKSSKTS 490
T S V + S T
Sbjct: 1175 TSSNVEQPVTESTTV 1189


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS19245  IGASERPTASE   40     1e-05    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 40.4 bits (94), Expect = 1e-05
Identities = 33/197 (16%), Positives = 64/197 (32%), Gaps = 22/197 (11%)

Query: 119 QSEVQAEQTPNEAQQTVQLERRTVGYEQQAQTAEVRRSEAQEQQRQVVSRRKAADAGKVA 178
+E AE + E ++T E +A E Q K A + A
Sbjct: 1036 TTETVAENSKQE-----------------SKTVEKNEQDATETTAQNREVAKEAKSNVKA 1078

Query: 179 ESQALREQQSFIDNIRAQRGQANSLDDQKLQAR-RVEAG----VQEHAAELRVASRQQSA 233
+Q QS + Q + + + + +VE V + +++ Q
Sbjct: 1079 NTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSET 1138

Query: 234 EIDQLATARQRNAQVADQQAQEQGQMVNSEVSRSEQQSSAVSSDHDRKQGQDGSHAIVEQ 293
Q AR+ + V ++ Q Q +++ SS V + +++VE
Sbjct: 1139 VQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVEN 1198

Query: 294 RRQESPDALTRRANATT 310
+P N+ +
Sbjct: 1199 PENTTPATTQPTVNSES 1215


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS19285  TCRTETB       38     6e-05    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 37.9 bits (88), Expect = 6e-05
Identities = 74/419 (17%), Positives = 158/419 (37%), Gaps = 44/419 (10%)

Query: 36 VMIPELMHYFNVGATSVGTFAGFYFYAYTPMQLIVGPLFDRFRAHQLLTLAVIACALGTI 95
V +P++ + FN S + ++ + G L D+ +LL +I G++
Sbjct: 35 VSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSV 94

Query: 96 LTGIAPTDHITIAYAGRFLQGFGSAFAFVGILKLGATILPHNRLALIAGLVTCLGFVGAM 155
+ + + ++ RF+QG G+A ++ + A +P GL+ + +G
Sbjct: 95 IGFVGHS-FFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEG 153

Query: 156 AGQNSLAALVTHFNWQ-----PVLITIGLF-------------------GFILAPIFF-F 190
G + + +W P++ I + G IL + F
Sbjct: 154 VGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVF 213

Query: 191 FVHNPHSTPTDHTSQQMSSKDIFQGFLLTIKQPYL---------WLVGLAGGALFMPNSV 241
F+ S + S IF + + P++ +++G+ G + +V
Sbjct: 214 FMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIF-GTV 272

Query: 242 FASLWGIPFLTQT-HHLSTAHATFATSLIFLGW---AIGSPLQGWLSDRLRTARLQLIFV 297
+ +P++ + H LST A + +IF G I + G L DR R L
Sbjct: 273 AGFVSMVPYMMKDVHQLST--AEIGSVIIFPGTMSVIIFGYIGGILVDR-RGPLYVLNIG 329

Query: 298 NILIAATIIYLVIAIPGLNYTLLCILLLAFGIFASAEIAVFPLAIEHMPTQYSGTAIAFV 357
++ + + + ++ + I++ G + + + + + Q +G ++ +
Sbjct: 330 VTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLL 389

Query: 358 NFLTMLGGLTMQRGIGEILDLEW-DGTLSHGIRVYSSTVYSYALYTLPLILLIAAVCVI 415
NF + L T +G +L + D L S+ +YS L I++I+ + +
Sbjct: 390 NFTSFLSEGTGIAIVGGLLSIPLLDQRLLPMEVDQSTYLYSNLLLLFSGIIVISWLVTL 448


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS19305  BONTOXILYSIN  26     0.016    Bontoxilysin signature.
		>BONTOXILYSIN#Bontoxilysin signature.

Length = 1196

Score = 26.0 bits (57), Expect = 0.016
Identities = 8/56 (14%), Positives = 15/56 (26%), Gaps = 5/56 (8%)

Query: 14 EKHSDSYRIHLEEDLFGQWWLTRVKTINGKKEIKKDACENYQAGIKRIGHIKYHYE 69
Y QWW + K + ++ +K+I K+
Sbjct: 659 VYFKKIY-----FSFLDQWWTEYYSQYFELICMAKQSILAQESLVKQIVQNKFTDL 709


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS19315  DHBDHDRGNASE  106    1e-29    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 106 bits (265), Expect = 1e-29
Identities = 57/191 (29%), Positives = 103/191 (53%), Gaps = 2/191 (1%)

Query: 5 LDGKVAIITGAASGLGLSIAEKYARSGANVVIADLNPDQAREVAARIAKKNKVTAIGIAM 64
++GK+A ITGAA G+G ++A A GA++ D NP++ +V + + + + A
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEAR-HAEAFPA 64

Query: 65 DVTSEEQVNEGVQKIADDLGTVDILVSNAGIQTIAPIVEFDYEDWKRLLDIHINGTFLTT 124
DV ++E +I ++G +DILV+ AG+ I E+W+ ++ G F +
Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNAS 124

Query: 125 KACMQQMIRSGRGGSIIIMGSIHSVEASMNKSAYVTAKHGLLGFTRALAKEGAIHNIRAN 184
++ + M+ R GSI+ +GS + + +AY ++K + FT+ L E A +NIR N
Sbjct: 125 RSVSKYMMDR-RSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183

Query: 185 LIGPGFVKTPL 195
++ PG +T +
Sbjct: 184 IVSPGSTETDM 194


6     PSLF89_RS19435  PSLF89_RS19505  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias    %GCBias  Product
PSLF89_RS19435       1       21  -4.255669  DUF4286 family protein
PSLF89_RS19440       0       20  -4.118118  IS30 family transposase
PSLF89_RS19445      -1       22  -3.074033  hypothetical protein
PSLF89_RS35820       0       23  -2.163280  hypothetical protein
PSLF89_RS19455      -2       14  -1.321715  IS982 family transposase
PSLF89_RS35825       0       15  -0.822032  hypothetical protein
PSLF89_RS19460      -1       16  -1.581865  transposase
PSLF89_RS35830       0       16  -1.839620  transposase
PSLF89_RS35835       1       18  -2.794261  hypothetical protein
PSLF89_RS19480       2       18  -4.041311  cysteine desulfurase-like protein
PSLF89_RS19485       4       23  -5.876374  hypothetical protein
PSLF89_RS19490       2       23  -4.755384  MFS transporter
PSLF89_RS19495       2       20  -3.316076  VUT family protein
PSLF89_RS36925      -1       23  -3.340433  VUT family protein
PSLF89_RS19505       0       22  -3.323686  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS19490  TCRTETA       32     0.002    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 31.7 bits (72), Expect = 0.002
Identities = 28/126 (22%), Positives = 51/126 (40%), Gaps = 32/126 (25%)

Query: 65 ILGRIGDHHGRKKVLLLSVSIMTVSTFCIALLPTYSQTGIIAPILFILF--RLIQGLAIS 122
+LG + D GR+ VLL+S++ V +A AP L++L+ R++ G+
Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMA----------TAPFLWVLYIGRIVAGIT-G 110

Query: 123 AEFTCSTSY-----QIERRSNKKSYLGALVQSTTLIG--------------SLFAALIVS 163
A + +Y + R+ ++ A + G FAA ++
Sbjct: 111 ATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALN 170

Query: 164 LLSFLL 169
L+FL
Sbjct: 171 GLNFLT 176


7     PSLF89_RS19750  PSLF89_RS19880  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias    %GCBias  Product
PSLF89_RS19750      -1       16   3.112866  hypothetical protein
PSLF89_RS19755      -1       16   3.279162  MFS transporter
PSLF89_RS19760       1       15   3.913217  citrate synthase
PSLF89_RS19765       1       20   4.235298  succinate dehydrogenase, cytochrome b556
PSLF89_RS19770       1       19   4.034757  succinate dehydrogenase, hydrophobic membrane
PSLF89_RS19775       1       21   4.444683  succinate dehydrogenase flavoprotein subunit
PSLF89_RS19780       0       20   3.927762  succinate dehydrogenase iron-sulfur subunit
PSLF89_RS19785      -1       18   3.368809  2-oxoglutarate dehydrogenase E1 component
PSLF89_RS19790      -1       16   1.824958  dihydrolipoyllysine-residue succinyltransferase
PSLF89_RS19795      -1       15   0.436774  ADP-forming succinate--CoA ligase subunit beta
PSLF89_RS19800      -1       12   0.294101  succinate--CoA ligase subunit alpha
PSLF89_RS19805       0       11  -0.407432  hypothetical protein
PSLF89_RS19810      -2       14   1.054687  methyl-accepting chemotaxis protein
PSLF89_RS19815      -2       22   1.508995  IS30 family transposase
PSLF89_RS19820      -3       23   2.504604  hypothetical protein
PSLF89_RS19825      -1       29   3.852195  GNAT family N-acetyltransferase
PSLF89_RS19830      -1       25   3.823167  hypothetical protein
PSLF89_RS19835      -1       23   3.713178  Fe(2+) transporter permease subunit FeoB
PSLF89_RS19840       3       19   3.710667  FeoA domain-containing protein
PSLF89_RS19845       3       21   3.992090  50S ribosomal protein L13
PSLF89_RS19850      -1       22   2.922586  30S ribosomal protein S9
PSLF89_RS19855      -1       25   2.665490  stringent starvation protein A
PSLF89_RS19860       0       29   3.108352  stringent starvation B family protein
PSLF89_RS19865      -1       31   3.219690  phosphoheptose isomerase
PSLF89_RS19870      -2       30   3.450935  YraN family protein
PSLF89_RS19875      -2       23   3.633377  penicillin-binding protein activator
PSLF89_RS19880      -1       16   3.323842  adenine phosphoribosyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS19755  TCRTETA       43     2e-06    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 42.5 bits (100), Expect = 2e-06
Identities = 58/345 (16%), Positives = 119/345 (34%), Gaps = 25/345 (7%)

Query: 53 ATGVGLLSSFYYYSYAAMQIPAGLAFDRMNARILITVSLTICAIGTLLFSLTDSFTLASL 112
G+L + Y A G DR R ++ VSL A+ + + + +
Sbjct: 42 TAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYI 101

Query: 113 GRFFTGFGSAFAFIAMLFIA---AQWFPTRYFGLIAGIGQFLASIGALAGQGPLAAIVSD 169
GR G A +A +IA R+FG ++ G +AG L ++
Sbjct: 102 GRIVAGITGATGAVAGAYIADITDGDERARHFGFMSAC----FGFGMVAGPV-LGGLMGG 156

Query: 170 LGWREALQGLGFIGITLAVIILLILKDKRHHHTDNQSITKKSKTTPNNHLSIKQQLTILF 229
+ + +L + + K + P ++ + +
Sbjct: 157 FSPHAPFFAAAALNGLNFLTGCFLLPE-----------SHKGERRPLRREALNPLASFRW 205

Query: 230 KHPETFKIALY--SFAAWAPITIFASLWGVPFLRTHYQLTINDAA-NLSSTIWLGIALGS 286
T AL F + A+LW V F + +L++ L +
Sbjct: 206 ARGMTVVAALMAVFFIMQLVGQVPAALW-VIFGEDRFHWDATTIGISLAAFGILHSLAQA 264

Query: 287 PLIGYWSDKIRQRKPLLLLAATLGIIASLIVLYSPSLPVTLLYVLMFFFGVGA-AGQSLS 345
+ G + ++ +R+ L+L G L+ + + VL+ G+G A Q++
Sbjct: 265 MITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAML 324

Query: 346 FAYIKDYQQDNILGTAIGFNNMAVVISGALFQPLVGFIMSQLWDG 390
+ + +Q + G+ ++ ++ LF + ++ W+G
Sbjct: 325 SRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASITT-WNG 368


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS19825  SACTRNSFRASE  38     4e-06    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 38.0 bits (88), Expect = 4e-06
Identities = 17/87 (19%), Positives = 40/87 (45%), Gaps = 7/87 (8%)

Query: 45 ILAAHTHQRIVGFMGLQQHAPITTEVALIAILKPYQQQGIGLSLIDAAEKYSRNIRHQYL 104
+ +G + ++ + + IA+ K Y+++G+G +L+ A ++++ L
Sbjct: 67 AFLYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGL 126

Query: 105 VVKTPYDENNTPASQRIAERFYSKVGF 131
+++T + N A FY+K F
Sbjct: 127 MLET--QDINISAC-----HFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS19835  TCRTETOQM     41     2e-05    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 41.0 bits (96), Expect = 2e-05
Identities = 36/143 (25%), Positives = 63/143 (44%), Gaps = 32/143 (22%)

Query: 1 MKHIHIALLGNPNSGKTTLFNQL---TGSKQKVG---------NWA------GVTVEKKT 42
MK I+I +L + ++GKTTL L +G+ ++G + G+T++
Sbjct: 1 MKIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGI 60

Query: 43 GSFTYQHHDIQLTDLPG--TYSLNVASAQSSLDERIACEYLLQEKVNLVINIVDAANLER 100
SF +++ + + D PG + V + S LD I L+I+ D +
Sbjct: 61 TSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAI-----------LLISAKDGVQAQT 109

Query: 101 NLYLTSQLLEMRIPCIIALNMLD 123
+ L L +M IP I +N +D
Sbjct: 110 RI-LFHALRKMGIPTIFFINKID 131


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS19845  PF05043       29     0.007    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 28.8 bits (64), Expect = 0.007
Identities = 18/121 (14%), Positives = 36/121 (29%), Gaps = 17/121 (14%)

Query: 33 AKRLRGKHKPVYTPHVDTGDYIIVVNADKVAVTGNKAKDK-LYHRHTGFPGGIKSLPFDE 91
R+ + V V+ V + GN+ + + ++ PF+
Sbjct: 117 LYRIISQINKVIKRQFQFE-----VSLTPVQIIGNERDIRYFFAQYFSEKYYFLEWPFEN 171

Query: 92 AKAKNPQRVIELAVKGM-LPRGPLGRAMF---------RKLKVYAGAEHDHAAQQPQLLE 141
++ +++EL K P M R + E D + Q L+
Sbjct: 172 FSSEPLSQLLELVYKETSFPMNLSTHRMLKLLLVTNLYRIKFGHF-MEVDKDSFNDQSLD 230

Query: 142 I 142

Sbjct: 231 F 231


8  PSLF89_RS19970  PSLF89_RS20045  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
PSLF89_RS19970  0       22       -4.873281  UDP-4-amino-4,
PSLF89_RS19975  0       22       -4.388909  pseudaminic acid cytidylyltransferase
PSLF89_RS19980  1       19       -4.102385  UDP-2,4-diacetamido-2,4,
PSLF89_RS19985  1       14       -2.162961  pseudaminic acid synthase
PSLF89_RS19990  1       13       -1.381473  capsule biosynthesis protein
PSLF89_RS19995  0       13       -0.984700  **threonine--tRNA ligase
PSLF89_RS20000  -1      14       1.016454   translation initiation factor IF-3
PSLF89_RS20005  0       14       1.838014   50S ribosomal protein L35
PSLF89_RS20020  1       19       4.726857   50S ribosomal protein L20
PSLF89_RS20025  1       19       4.939782   phenylalanine--tRNA ligase subunit alpha
PSLF89_RS20030  0       17       4.518394   phenylalanine--tRNA ligase subunit beta
PSLF89_RS20035  0       15       4.199990   integration host factor subunit alpha
PSLF89_RS20040  -1      16       3.634772   *YbhB/YbcL family Raf kinase inhibitor-like
PSLF89_RS20045  0       15       3.104275   IS3 family transposase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS20050  DNABINDINGHU  102    1e-32    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 102 bits (257), Expect = 1e-32
Identities = 35/88 (39%), Positives = 54/88 (61%)

Query: 4 TKAVLMDKLFADLGVNKQDAKMIVDLFFEEIQSALEKGQIVKLSGFGNFMLRDKKERPGR 63
K L+ K+ + K+D+ VD F + S L KG+ V+L GFGNF +R++ R GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 64 NPKTGDEVAVSARRVVTFRAGQKLRARV 91
NP+TG+E+ + A +V F+AG+ L+ V
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90


9  PSLF89_RS35860  PSLF89_RS20225  Y  N  Y  Genomic Island
LocusTag        DNBias  CDNBias  %GCBias    Product
PSLF89_RS35860  2       18       3.448977   30S ribosomal protein S20
PSLF89_RS20185  0       14       2.715543   murein biosynthesis integral membrane protein
PSLF89_RS20190  -1      15       3.974495   bifunctional riboflavin kinase/FAD synthetase
PSLF89_RS20205  0       18       4.261674   IS30 family transposase
PSLF89_RS20210  0       19       3.955127   isoleucine--tRNA ligase
PSLF89_RS20215  -1      14       3.557659   signal peptidase II
PSLF89_RS20225  -1      16       4.839071   FKBP-type peptidyl-prolyl cis-trans isomerase
10  PSLF89_RS20290  PSLF89_RS20315  Y  N  N  Genomic Island
LocusTag        DNBias  CDNBias  %GCBias    Product
PSLF89_RS20290  2       14       -2.873469  helix-turn-helix domain-containing protein
PSLF89_RS20295  2       16       -3.907900  hypothetical protein
PSLF89_RS20300  2       17       -3.324923  RNA pyrophosphohydrolase
PSLF89_RS20305  2       18       -4.775636  DUF2282 domain-containing protein
PSLF89_RS20310  2       16       -4.556265  DoxX family protein
PSLF89_RS20315  2       17       -4.184888  hypothetical protein
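
Each numbered summary row in this listing pairs three Y/N flags (Bias, Virulence, Insertion elements) with a Prediction label. In the rows shown here, the label appears to track the Virulence flag: rows 9 and 10 (Virulence = N) read "Genomic Island", while rows 14 and 15 (Virulence = Y) read "Pathogenicity Island (biased-composition)". The Python sketch below encodes only that observed pattern. The function name `label_from_flags` is hypothetical, and the rule is an inference from these rows, not PredictBias's documented decision logic.

```python
def label_from_flags(bias: str, virulence: str, insertion: str) -> str:
    """Guess the Prediction label from the three Y/N flags of a summary row.

    Assumption: inferred from the rows in this listing only; the real tool
    may use additional criteria.
    """
    if bias != "Y":
        # No row with Bias = N appears in the listing, so nothing is inferred.
        raise ValueError("only rows with Bias = Y are covered by this sketch")
    if virulence == "Y":
        return "Pathogenicity Island (biased-composition)"
    # The Insertion elements flag does not change the label in the rows shown.
    return "Genomic Island"


# Flags copied from summary rows 9, 10, 14 and 15 of this listing.
rows = {
    9: ("Y", "N", "Y"),
    10: ("Y", "N", "N"),
    14: ("Y", "Y", "N"),
    15: ("Y", "Y", "Y"),
}
labels = {number: label_from_flags(*flags) for number, flags in rows.items()}
```
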
11  PSLF89_RS37300  PSLF89_RS20440  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
PSLF89_RS37300  1       31       3.288070   transposase zinc-binding domain-containing
PSLF89_RS37305  1       30       4.545580   transcriptional repressor
PSLF89_RS37310  1       21       5.718673   inorganic diphosphatase
PSLF89_RS20370  2       20       5.431427   DSD1 family PLP-dependent enzyme
PSLF89_RS20375  2       19       5.379646   methionine adenosyltransferase
PSLF89_RS20380  2       19       5.724755   transketolase
PSLF89_RS20390  2       20       5.713078   type I glyceraldehyde-3-phosphate dehydrogenase
PSLF89_RS20395  2       23       4.959108   phosphoglycerate kinase
PSLF89_RS20400  3       20       4.014423   pyruvate kinase
PSLF89_RS20405  1       16       3.233792   fructose-bisphosphate aldolase class II
PSLF89_RS20410  0       15       2.692122   hypothetical protein
PSLF89_RS20415  -1      14       1.193340   IS30 family transposase
PSLF89_RS20420  4       17       -0.777045  hypothetical protein
PSLF89_RS20425  3       17       -1.352218  hypothetical protein
PSLF89_RS20430  4       15       -1.989602  hypothetical protein
PSLF89_RS20435  4       17       -1.707334  transporter substrate-binding domain-containing
PSLF89_RS20440  4       18       -2.047087  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS20380  ALARACEMASE   46     1e-07    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 45.9 bits (109), Expect = 1e-07
Identities = 36/164 (21%), Positives = 65/164 (39%), Gaps = 24/164 (14%)

Query: 17 VIDQQKLLANLHFMQKFADQHGKQLRPHA------KTHKCSH-LAKLQQQIGAI-GICVT 68
+D Q L NL + +Q HA K + H + ++ IGA G +
Sbjct: 8 SLDLQALKQNLSIV--------RQAATHARVWSVVKANAYGHGIERIWSAIGATDGFALL 59

Query: 69 KVAEAEVLVKHGITG-ILITSPVVTPQKIQRLMILAKQDSSIMVVIDHTGNAEVLNQAAL 127
+ EA L + G G IL+ Q ++ I + + V + + L A L
Sbjct: 60 NLEEAITLRERGWKGPILMLEGFFHAQDLE---IYDQHRLTTCVHSNW--QLKALQNARL 114

Query: 128 QADITLKVLVDIDPGVQRTGISYQQALTLGKQLHELQGLELQGI 171
+A L + + ++ G+ R G + LT+ +QL + + +
Sbjct: 115 KA--PLDIYLKVNSGMNRLGFQPDRVLTVWQQLRAMANVGEMTL 156


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS20420  PF05704       28     0.007    Capsular polysaccharide synthesis protein
		>PF05704#Capsular polysaccharide synthesis protein

Length = 307

Score = 27.5 bits (61), Expect = 0.007
Identities = 6/37 (16%), Positives = 15/37 (40%)

Query: 7 TAFTREEAEAFVQRHKEAVNFADPHALLSIFKIQFDD 43
+ + + + VN +PH L + + +D+
Sbjct: 228 SVMAVSKEYSKYWKEIPYVNNVNPHMLQYLGNLPYDN 264


12  PSLF89_RS35890  PSLF89_RS21120  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
PSLF89_RS35890  0       15       3.355210   6-phosphofructokinase
PSLF89_RS20680  -1      14       1.403963   hypothetical protein
PSLF89_RS20685  -2      17       0.502874   UDP-N-acetylmuramate:L-alanyl-gamma-D-glutamyl-
PSLF89_RS20690  -1      15       2.061216   UbiX family flavin prenyltransferase
PSLF89_RS20695  1       15       0.443040   hypothetical protein
PSLF89_RS20700  1       14       -0.409033  methyl-accepting chemotaxis protein
PSLF89_RS20705  1       14       -0.568098  IS30 family transposase
PSLF89_RS20710  2       18       0.406911   hypothetical protein
PSLF89_RS20715  1       19       1.420340   ABC transporter substrate-binding protein
PSLF89_RS20720  -1      18       -1.706192  HAD-IA family hydrolase
PSLF89_RS20725  0       24       1.549429   DUF1315 family protein
PSLF89_RS20730  4       19       3.331284   BolA family transcriptional regulator
PSLF89_RS20735  4       21       4.241243   YciI family protein
PSLF89_RS20740  2       23       5.435757   septation protein A
PSLF89_RS20745  1       16       3.634645   threonylcarbamoyl-AMP synthase
PSLF89_RS20750  1       17       4.011591   segregation/condensation protein A
PSLF89_RS20755  1       15       3.758607   SMC-Scp complex subunit ScpB
PSLF89_RS20765  2       14       2.080227   pseudouridine synthase
PSLF89_RS20770  0       21       1.043140   hypothetical protein
PSLF89_RS20775  -2      17       1.935569   hypothetical protein
PSLF89_RS20780  -1      16       2.216958   hypothetical protein
PSLF89_RS20785  0       15       2.403882   hypothetical protein
PSLF89_RS20790  -1      20       3.267715   anhydro-N-acetylmuramic acid kinase
PSLF89_RS20795  -2      16       2.767315   peptidoglycan DD-metalloendopeptidase family
PSLF89_RS20800  0       15       2.601448   tyrosine--tRNA ligase
PSLF89_RS20805  0       14       3.526437   outer membrane beta-barrel protein
PSLF89_RS20815  0       14       3.301473   Bax inhibitor-1 family protein
PSLF89_RS20820  -1      11       2.315786   hypothetical protein
PSLF89_RS20825  0       17       1.187395   serine/threonine protein kinase
PSLF89_RS20830  -2      16       1.280017   hypothetical protein
PSLF89_RS20835  -2      16       1.075304   alanine racemase
PSLF89_RS20840  -1      16       -0.649162  GNAT family N-acetyltransferase
PSLF89_RS20845  0       18       -0.632126  6-carboxytetrahydropterin synthase
PSLF89_RS20850  2       18       -0.216446  spermidine N1-acetyltransferase
PSLF89_RS20860  2       21       0.730402   transposase
PSLF89_RS20865  2       21       1.051495   IS3 family transposase
PSLF89_RS37315  2       21       0.324408   transposase
PSLF89_RS20870  3       21       -0.484127  transposase
PSLF89_RS20875  3       21       -0.302261  IS30 family transposase
PSLF89_RS37320  2       23       -0.116227  transposase zinc-binding domain-containing
PSLF89_RS36940  2       23       -2.335398  hypothetical protein
PSLF89_RS20885  2       23       -2.597987  IS3 family transposase
PSLF89_RS20890  1       19       -3.710081  IS3 family transposase
PSLF89_RS20895  -1      17       -2.465500  IS3 family transposase
PSLF89_RS20900  1       18       -3.344558  transposase
PSLF89_RS36945  1       18       -3.379296  hypothetical protein
PSLF89_RS20910  0       18       -2.381323  hypothetical protein
PSLF89_RS20915  -1      17       -1.992520  IS982-like element ISPsa1 family transposase
PSLF89_RS20920  0       17       -0.728879  hypothetical protein
PSLF89_RS20925  0       19       -0.340791  hypothetical protein
PSLF89_RS37640  0       22       3.725428   hypothetical protein
PSLF89_RS20930  -1      22       3.505982   IS3 family transposase
PSLF89_RS20935  0       21       2.711781   transposase
PSLF89_RS37325  0       23       3.197737   16S rRNA (guanine(966)-N(2))-methyltransferase
PSLF89_RS20945  2       25       3.547526   L-serine ammonia-lyase
PSLF89_RS20950  3       26       3.245288   hypothetical protein
PSLF89_RS20955  3       20       1.072036   hypothetical protein
PSLF89_RS20960  2       26       3.554113   SEL1-like repeat protein
PSLF89_RS20965  1       32       4.506113   hypothetical protein
PSLF89_RS20970  1       32       4.196336   hypothetical protein
PSLF89_RS20975  -1      30       2.227184   tRNA (guanosine(46)-N7)-methyltransferase TrmB
PSLF89_RS20980  -1      30       2.583222   SLC13 family permease
PSLF89_RS20985  -2      24       0.829126   hypothetical protein
PSLF89_RS35910  -1      22       -3.116016  DUF2282 domain-containing protein
PSLF89_RS20990  -1      22       -3.589192  amidohydrolase family protein
PSLF89_RS20995  2       22       -1.751599  methyltransferase domain-containing protein
PSLF89_RS35915  1       12       -0.022369  IS4 family transposase
PSLF89_RS21005  1       10       0.103941   hypothetical protein
PSLF89_RS21010  1       9        0.256237   hypothetical protein
PSLF89_RS21015  0       10       -1.005789  **glucose-6-phosphate isomerase
PSLF89_RS21030  1       9        -0.901858  UDP-glucose/GDP-mannose dehydrogenase family
PSLF89_RS21035  0       17       -2.763797  UTP--glucose-1-phosphate uridylyltransferase
PSLF89_RS21040  1       14       -3.955539  ergothioneine biosynthesis protein EgtB
PSLF89_RS21045  3       18       -5.177793  L-histidine N(alpha)-methyltransferase
PSLF89_RS21050  0       19       -4.201402  hypothetical protein
PSLF89_RS35920  -1      22       -2.780454  IS3 family transposase
PSLF89_RS21060  0       21       -3.082977  transposase
PSLF89_RS21065  -1      22       -2.468580  IS3 family transposase
PSLF89_RS21070  -1      23       -2.287055  hypothetical protein
PSLF89_RS21075  -1      21       -2.477032  IS4 family transposase
PSLF89_RS21085  0       20       -3.241045  MFS transporter
PSLF89_RS21090  0       18       -1.796376  aminotransferase class I/II-fold pyridoxal
PSLF89_RS21095  1       15       -1.270677  hypothetical protein
PSLF89_RS21105  2       17       -0.263563  inosine/xanthosine triphosphatase
PSLF89_RS35925  1       20       0.324408   fumarylacetoacetate hydrolase family protein
PSLF89_RS21110  2       19       0.134872   response regulator
PSLF89_RS21115  2       22       -0.093207  chemotaxis protein CheW
PSLF89_RS21120  2       24       0.289751   transporter substrate-binding domain-containing
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS20805  OMPADOMAIN    49     2e-09    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 49.2 bits (117), Expect = 2e-09
Identities = 36/151 (23%), Positives = 57/151 (37%), Gaps = 19/151 (12%)

Query: 42 SAGY-LFGGRQLR--YGAELGLARYASSCYQSANTSLTYQGASADLLGVLSYQLGARWNV 98
G FGG Q+ G E+G Y+ + + Y+ L L Y + ++
Sbjct: 55 QLGAGAFGGYQVNPYVGFEMGYDWLGRMPYKGSVENGAYKAQGVQLTAKLGYPITDDLDI 114

Query: 99 FGKLGLAYIDQQTEGNLFPNQLDSSNALQPKVALGLGYSLTAAIGVNLSYSHT--FGDQP 156
+ +LG T+ N++ D + P A G+ Y++T I L Y T GD
Sbjct: 115 YTRLGGMVWRADTKSNVYGKNHD--TGVSPVFAGGVEYAITPEIATRLEYQWTNNIGDAH 172

Query: 157 EGLAKNAEPTPVMLNKVASTDLLSFGLSYRF 187
+ +LS G+SYRF
Sbjct: 173 ------------TIGTRPDNGMLSLGVSYRF 191


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS20835  ALARACEMASE   344    e-120    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 344 bits (885), Expect = e-120
Identities = 143/356 (40%), Positives = 206/356 (57%), Gaps = 2/356 (0%)

Query: 1 MARKTQAHLSRDALLHNLNHIRAHAPGCQVVGVVKANAYGHGLEDASRVLASYVDYLGVA 60
M R QA L AL NL+ +R A +V VVKANAYGHG+E + + D +
Sbjct: 1 MTRPIQASLDLQALKQNLSIVRQAATHARVWSVVKANAYGHGIERIWSAIGA-TDGFALL 59

Query: 61 TIEEAMTLVMMPVKTIVLLMEGIFQDSELELVAEHGLEMVLHEEGQILALEQAQLSAPIT 120
+EEA+TL K +L++EG F +LE+ +H L +H Q+ AL+ A+L AP+
Sbjct: 60 NLEEAITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARLKAPLD 119

Query: 121 VWLKLDTGLGRLGFPAKLVSMLYQRLRCCANVKKIKLMSHFSASDTNFSYTQKQLKCFMD 180
++LK+++G+ RLGF V ++Q+LR ANV ++ LMSHF+ ++ + +
Sbjct: 120 IYLKVNSGMNRLGFQPDRVLTVWQQLRAMANVGEMTLMSHFAEAE-HPDGISGAMARIEQ 178

Query: 181 MTQGLVAEKSIANGAAIFNCPESCVDIVRPGGLLYGVGLWQGKKSGVDEGLRPVMSLRSH 240
+GL +S++N AA PE+ D VRPG +LYG + + GLRPVM+L S
Sbjct: 179 AAEGLECRRSLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWRDIANTGLRPVMTLSSE 238

Query: 241 LISVKDYQAGDYIGYGRCWQCSGPMRVGVVAIGYGDGYPVTAPDGTPTLVCGVEAPLIGR 300
+I V+ +AG+ +GYG + R+G+VA GY DGYP AP GTP LV GV +G
Sbjct: 239 IIGVQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPTGTPVLVDGVRTMTVGT 298

Query: 301 VSMDMITIDLELCPDAKVGDEVVLWGDGLPVERVAHHVGVVPYALLCAVAPRVKLV 356
VSMDM+ +DL CP A +G V LWG + ++ VA G V Y L+CA+A RV +V
Sbjct: 299 VSMDMLAVDLTPCPQAGIGTPVELWGKEIKIDDVAAAAGTVGYELMCALALRVPVV 354
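
The percentages printed in the alignment summary lines appear to be truncated rather than rounded: for the alanine racemase hit above, 206/356 is about 57.9% but is reported as 57%. The Python sketch below parses one summary line and recomputes each percentage with integer (floor) division to check that reading. The helper name `parse_summary` is hypothetical, and truncation is an assumption inferred from this output, not a documented guarantee of the RPS-BLAST report format.

```python
import re

# Summary line copied from the ALARACEMASE hit for PSLF89_RS20835.
SUMMARY = "Identities = 143/356 (40%), Positives = 206/356 (57%), Gaps = 2/356 (0%)"


def parse_summary(line: str) -> dict:
    """Return {field: (count, total, reported_percent)} for one summary line."""
    pattern = r"(\w+) = (\d+)/(\d+) \((\d+)%\)"
    return {
        name: (int(count), int(total), int(percent))
        for name, count, total, percent in re.findall(pattern, line)
    }


fields = parse_summary(SUMMARY)

# Floor division reproduces the reported values if the tool truncates.
recomputed = {name: count * 100 // total for name, (count, total, _) in fields.items()}
reported = {name: percent for name, (_, _, percent) in fields.items()}
```

Comparing `recomputed` with `reported` field by field gives a quick sanity check that a summary line was copied without transcription errors.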


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS20850  SACTRNSFRASE  31     0.001    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 31.5 bits (71), Expect = 0.001
Identities = 18/119 (15%), Positives = 48/119 (40%), Gaps = 3/119 (2%)

Query: 23 NNNFSVMRYWFEEPYESFVELEELYNKHIHDQSERRFIIENSDNNIVGLVELLEIDYIHR 82
N ++ F +PY E +++ ++ ++ + + +NN +G +++ ++
Sbjct: 32 NGVWTYTEERFSKPYFKQYEDDDMDVSYV-EEEGKAAFLYYLENNCIGRIKIRS-NWNGY 89

Query: 83 NAEYTVLIDPNYQGRSYSLQATEQVLGYAFNVLNLHKVYLLVDERNEKAIHVYKKAGFI 141
+ + +Y+ + + + +A + + L + N A H Y K FI
Sbjct: 90 ALIEDIAVAKDYRKKGVGTALLHKAIEWAKEN-HFCGLMLETQDINISACHFYAKHHFI 147


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS21035  NUCEPIMERASE  29     0.028    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 29.4 bits (66), Expect = 0.028
Identities = 15/75 (20%), Positives = 28/75 (37%), Gaps = 13/75 (17%)

Query: 1 MRVTVFG-AGYVGLVTAACFADLGNQVICVDVDEKKLAQLAEGKSPIYEPGLDELLLRGQ 59
M+ V G AG++G + + G+QV+ +D + Y+ L + L
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDN-----------LNDYYDVSLKQARLELL 49

Query: 60 ESGNLEF-TADIQSA 73
+F D+
Sbjct: 50 AQPGFQFHKIDLADR 64


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS21100  TCRTETB       66     4e-14    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 66.4 bits (162), Expect = 4e-14
Identities = 40/152 (26%), Positives = 76/152 (50%), Gaps = 5/152 (3%)

Query: 177 LSNISHSFGATFAATGQSITAFTLCYAVAAPIAAALFSGKPARKVLFVALAIFSIANIVS 236
L +I++ F A+T TAF L +++ + L +++L + I +++
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96

Query: 237 ALATS-LSMLLVSRALAGLGAGLFSPMAAAVAAMLVPAEKKGRALGLILGGMSTGTVIGV 295
+ S S+L+++R + G GA F + V A +P E +G+A GLI ++ G +G
Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156

Query: 296 PIGLLVNNYLGWRYVFIMVTCIGFIGIIGILF 327
IG ++ +Y+ W Y+ ++ I II + F
Sbjct: 157 AIGGMIAHYIHWSYLLLIPM----ITIITVPF 184


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS21120  HTHFIS        46     3e-08    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 46.0 bits (109), Expect = 3e-08
Identities = 23/112 (20%), Positives = 54/112 (48%), Gaps = 12/112 (10%)

Query: 68 TVLVADDSSVARRHVKQVLDQIGVNVIMTNDGQHALDILEHDIPRTAGDVSRKYLMLISD 127
T+LVADD + R + Q L + G +V +T++ + GD+ +++D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAG----DGDL------VVTD 54

Query: 128 VEMPEMDGYSLIKNCREHPGLKNLFIMLNTSITSVFNELDSKEVGCNEFVGK 179
V MP+ + + L+ ++ +L +++ ++ + + + E G +++ K
Sbjct: 55 VVMPDENAFDLLPRIKKA--RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPK 104


13  PSLF89_RS21400  PSLF89_RS21600  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
PSLF89_RS21400  3       16       2.864193   UMP kinase
PSLF89_RS21405  1       19       2.826434   ribosome recycling factor
PSLF89_RS21410  0       17       2.287211   isoprenyl transferase
PSLF89_RS21415  -2      10       1.866154   phosphatidate cytidylyltransferase
PSLF89_RS21420  -3      14       2.462495   lysophospholipid acyltransferase family protein
PSLF89_RS21425  -3      15       2.551358   hypothetical protein
PSLF89_RS21430  -2      18       2.499351   hypothetical protein
PSLF89_RS21435  -2      22       2.513200   isopentenyl-diphosphate Delta-isomerase
PSLF89_RS21440  -1      27       3.592776   diphosphomevalonate decarboxylase
PSLF89_RS21450  0       29       4.411171   hypothetical protein
PSLF89_RS21455  -1      21       2.984711   hydroxymethylglutaryl-CoA synthase
PSLF89_RS21460  -1      20       2.956028   hydroxymethylglutaryl-CoA reductase
PSLF89_RS21465  -1      18       3.505396   outer membrane protein assembly factor BamA
PSLF89_RS21470  -2      18       3.455387   OmpH family outer membrane protein
PSLF89_RS21475  0       16       2.921691   UDP-3-O-(3-hydroxymyristoyl)glucosamine
PSLF89_RS21480  -1      20       1.794229   3-hydroxyacyl-ACP dehydratase FabZ
PSLF89_RS21485  0       22       3.569479   acyl-ACP--UDP-N-acetylglucosamine
PSLF89_RS21495  0       23       3.521459   lipid-A-disaccharide synthase
PSLF89_RS21500  0       21       3.275751   ribonuclease HII
PSLF89_RS21505  0       21       3.365304   DNA polymerase III subunit alpha
PSLF89_RS21510  -1      16       2.810604   acetyl-CoA carboxylase carboxyltransferase
PSLF89_RS21515  -1      14       1.925601   tRNA lysidine(34) synthetase TilS
PSLF89_RS21520  -2      13       1.738208   IS4 family transposase
PSLF89_RS21530  0       19       -0.403340  hypothetical protein
PSLF89_RS21535  2       22       -1.333280  IS4 family transposase
PSLF89_RS21540  3       22       -0.541452  IS4 family transposase
PSLF89_RS21545  4       22       -0.363817  IS30 family transposase
PSLF89_RS21550  1       17       -3.280186  hypothetical protein
PSLF89_RS21555  1       15       -4.103741  hypothetical protein
PSLF89_RS21560  2       17       -4.052518  DUF4135 domain-containing protein
PSLF89_RS21565  2       13       -3.089570  MFS transporter
PSLF89_RS21570  1       12       -2.997227  hypothetical protein
PSLF89_RS21575  1       13       -3.381498  hypothetical protein
PSLF89_RS21580  3       14       -2.038697  MFS transporter
PSLF89_RS35935  0       13       -0.854149  hypothetical protein
PSLF89_RS21590  0       13       -0.812697  hypothetical protein
PSLF89_RS21600  1       18       -3.271064  IS4 family transposase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS21415  CARBMTKINASE  31     0.003    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 30.9 bits (70), Expect = 0.003
Identities = 18/85 (21%), Positives = 28/85 (32%), Gaps = 25/85 (29%)

Query: 102 VPARIMSAIPMSGLVDHYDRRKAMHHLSEGRAVIFAAGTGNPLVTT-------------D 148
P + A + LV+ G VI + G G P++ D
Sbjct: 169 DPKGHVEAETIKKLVER------------GVIVIASGGGGVPVILEDGEIKGVEAVIDKD 216

Query: 149 SAASLRGIEVDVDLLLKATRVDGVY 173
A EV+ D+ + T V+G
Sbjct: 217 LAGEKLAEEVNADIFMILTDVNGAA 241


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS21545  IGASERPTASE   32     0.007    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 32.3 bits (73), Expect = 0.007
Identities = 38/219 (17%), Positives = 67/219 (30%), Gaps = 24/219 (10%)

Query: 278 VEQSANIPETESSHQSSTSLQAAGPSTSALEAEIFLQSSVSDEDEPTSADASLSLEQQAQ 337
VE+ +T + + ++QA PS + EI P A S + E A+
Sbjct: 985 VEKRNQTVDTTNI-TTPNNIQADVPSVPSNNEEIARVDEAPVPP-PAPATPSETTETVAE 1042

Query: 338 SVGRRAEVTGWGFDLCKQQLENFKKHEAACEQRLDSCAESLTTLEIRVQHLQQMLSARRQ 397
+ KQ+ + +K+E + E + V+ Q +
Sbjct: 1043 NS--------------KQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQS 1088

Query: 398 KLEATSAQTTEPSPSTSGVGASYQPSANDQVAEPSTSTSGAGASYQPSASDQVIQPSPST 457
E QTTE + + ++ E TS S + Q
Sbjct: 1089 GSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTS--------QVSPKQEQSETVQ 1140

Query: 458 PEAGPSHQPSASEQVAEASPSTAGAGASSQPNMNIPGGP 496
P+A P+ + + + E T + QP
Sbjct: 1141 PQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNV 1179


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS21565  TYPE4SSCAGX   32     0.003    Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein

signature.
Length = 522

Score = 32.1 bits (72), Expect = 0.003
Identities = 22/93 (23%), Positives = 48/93 (51%)

Query: 89 KQKHQGHQEHQGSQKQSGSKLRAERTKIHSLRLKLLEAISSHKHVKNKENISLLEAELEQ 148
+++ + ++ Q +QK K + ER K + L A+S+ +++ N +N+S L + +
Sbjct: 149 EKEKEAKEQAQKAQKDKREKRKEERAKNRANLENLTNAMSNPQNLSNNKNLSELIKQQRE 208

Query: 149 KDQLLEEKQQKVEELQQQNQLLQQQLLEQKSGK 181
+ E+ + ++E Q N L Q + L +K +
Sbjct: 209 NELDQMERLEDMQEQAQANALKQIEELNKKQAE 241


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS21580  TCRTETA       39     2e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 39.0 bits (91), Expect = 2e-05
Identities = 49/310 (15%), Positives = 96/310 (30%), Gaps = 26/310 (8%)

Query: 61 IVISPFAGRLVDAKGSIICIQYSSIFCFFLTIGLIFTNSYLLLFIIVFIRSSLKTVFFPA 120
+P G L D G + S+ + ++ T +L + I I + +
Sbjct: 57 FACAPVLGALSDRFGRRPVL-LVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAV 115

Query: 121 LSRIIKLTVDKKQLLSVNSLIQFNANGLLIIAPIIGMIVFSTLGKKWCFLITSILFFLTF 180
I D + + ++ P++G ++ F + L L F
Sbjct: 116 AGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLM-GGFSPHAPFFAAAALNGLNF 174

Query: 181 SLSFFLKEVSDKRSQVDMGLLS---QSDLDIKKLLVPFLGMMIAAFAIYL---------- 227
FL S K + + + + + + +M F + L
Sbjct: 175 LTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWV 234

Query: 228 --GDSLFPLLLKSIGLDFKDFALIGSFFGIGGMAASIFCQCYKNSNEVTLIKLGAILVII 285
G+ F +IG+ F ++ S A I E + LG I
Sbjct: 235 IFGEDRFHWDATTIGISLAAFGILHSLA-----QAMITGPVAARLGERRALMLGMIADGT 289

Query: 286 SMFTYGMFSSMLSQCVFFVFAMLNAGGITLISISSATLLQKNTPAHMMGKVSSINNMIFG 345
+ F + +L +GGI + ++ +L + G++ +
Sbjct: 290 GYILLAFATR--GWMAFPIMVLLASGGIGMPALQ--AMLSRQVDEERQGQLQGSLAALTS 345

Query: 346 LASIIIPMLG 355
L SI+ P+L
Sbjct: 346 LTSIVGPLLF 355


14  PSLF89_RS21650  PSLF89_RS21710  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
PSLF89_RS21650  2       16       -1.277397  protein kinase family protein
PSLF89_RS21655  -1      15       1.091809   DUF2272 domain-containing protein
PSLF89_RS21660  -2      16       2.534612   hypothetical protein
PSLF89_RS21665  1       18       4.243886   hypothetical protein
PSLF89_RS35940  2       22       5.407589   IMP dehydrogenase
PSLF89_RS21670  2       22       5.293669   glutamine-hydrolyzing GMP synthase
PSLF89_RS21675  3       23       5.473925   phosphoribosylformylglycinamidine synthase
PSLF89_RS21680  2       19       2.662130   late competence development ComFB family
PSLF89_RS21685  -1      17       -0.669810  hypothetical protein
PSLF89_RS21690  -1      16       -0.467730  hypothetical protein
PSLF89_RS21695  -1      16       -6.068541  hypothetical protein
PSLF89_RS21700  -1      15       -6.027643  RNA polymerase sigma factor RpoS
PSLF89_RS21705  -2      12       -4.970331  cold shock domain-containing protein
PSLF89_RS21710  -2      16       -3.575742  ATP-binding cassette domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS21690  ANTHRAXTOXNA  37     7e-04    Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 36.6 bits (84), Expect = 7e-04
Identities = 24/63 (38%), Positives = 34/63 (53%), Gaps = 16/63 (25%)

Query: 16 QKLLNKIQAVVPEVAELYSEY---VYFVDVN-------RELAADEKNSLNSLLHYGEMAP 65
Q LL KI +V E+YSE +YF D++ ++L+ +EKNS+NS GE P
Sbjct: 83 QDLLKKIPK---DVLEIYSELGGEIYFTDIDLVEHKELQDLSEEEKNSMNS---RGEKVP 136

Query: 66 VLS 68
S
Sbjct: 137 FAS 139


15  PSLF89_RS21830  PSLF89_RS21965  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
PSLF89_RS21830  -1      19       -4.008454  transposase
PSLF89_RS37335  -1      18       -2.570966  IS3 family transposase
PSLF89_RS37340  0       19       -2.806869  transposase
PSLF89_RS21840  0       19       -2.238575  transposase
PSLF89_RS21845  1       21       -1.034328  transposase zinc-binding domain-containing
PSLF89_RS21850  -2      22       -0.033524  hypothetical protein
PSLF89_RS37345  1       27       1.364735   transposase
PSLF89_RS21860  1       24       -0.443853  hypothetical protein
PSLF89_RS37350  3       22       -0.917557  hypothetical protein
PSLF89_RS37645  4       18       -0.695863  transposase
PSLF89_RS21875  4       19       -0.200501  IS3 family transposase
PSLF89_RS21880  2       16       -1.189043  IS3 family transposase
PSLF89_RS21885  2       17       -0.702474  IS3 family transposase
PSLF89_RS21890  -1      18       -0.736093  hypothetical protein
PSLF89_RS21895  -1      19       -1.441511  IS4 family transposase
PSLF89_RS21900  0       20       -2.193465  hypothetical protein
PSLF89_RS21905  0       22       -4.281060  hypothetical protein
PSLF89_RS35960  0       20       -3.589994  hypothetical protein
PSLF89_RS21915  -1      23       -1.983504  aldo/keto reductase
PSLF89_RS21920  -2      18       -0.706201  VOC family protein
PSLF89_RS21925  -2      14       -0.157502  MFS transporter
PSLF89_RS21930  -1      11       2.161421   class II aldolase/adducin family protein
PSLF89_RS21935  -2      12       1.965130   SDR family oxidoreductase
PSLF89_RS21940  -2      13       3.608977   aminotransferase class I/II-fold pyridoxal
PSLF89_RS21945  -2      16       3.304645   M48 family metallopeptidase
PSLF89_RS21950  -3      16       3.010548   **trigger factor
PSLF89_RS21955  -2      15       3.066089   ATP-dependent Clp endopeptidase proteolytic
PSLF89_RS21960  0       15       3.085649   ATP-dependent Clp protease ATP-binding subunit
PSLF89_RS21965  1       17       3.601312   endopeptidase La
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS21940  TCRTETA       34     0.001    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 33.6 bits (77), Expect = 0.001
Identities = 37/172 (21%), Positives = 64/172 (37%), Gaps = 32/172 (18%)

Query: 52 LFAIYAIGA-LFRPVGSIMWGHFADRYGRKKTLITTSFIMIFSTLCISILPNGHQAPIFS 110
L A+YA+ PV G +DR+GR+ L+ + + + + +I+ +
Sbjct: 48 LLALYALMQFACAPVL----GALSDRFGRRPVLLVS---LAGAAVDYAIMATAPFLWV-- 98

Query: 111 PIALLTLRCLQGVSLGGDASSAAVLIAETVSNKKRGFYVSFVFAMNSLGSLLAAAMAYLL 170
L R + G++ G + A IA+ +R + F+ A G + + L+
Sbjct: 99 ---LYIGRIVAGIT-GATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLM 154

Query: 171 LKITPANVMLEWGWRIPFVFGAFL-----LIICLIFRSGVLESTIFEKNPQR 217
+P PF A L L C + ES E+ P R
Sbjct: 155 GGFSP---------HAPFFAAAALNGLNFLTGCFLLP----ESHKGERRPLR 193


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS21950  DHBDHDRGNASE  117    5e-34    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 117 bits (293), Expect = 5e-34
Identities = 69/259 (26%), Positives = 121/259 (46%), Gaps = 22/259 (8%)

Query: 2 AMQDHVVVVTGGSMGIGLAVVKKFLQKKAIVYNLD--------LQAGES--GRY---LSC 48
++ + +TG + GIG AV + + A + +D + + R+
Sbjct: 5 GIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPA 64

Query: 49 DVSDSQQVSRAIHEVIRQEGRVDILVSNAGVHFSATIENSAEADYQRVMDINVKGTFFSV 108
DV DS + + R+ G +DILV+ AGV I + ++ +++ +N G F +
Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNAS 124

Query: 109 QAVIAQMRQQKSGNIVLLSSEQAFVGKPNSSLYGMSKAAIASLARTTALDYAKFNVRVNA 168
++V M ++SG+IV + S A V + + + Y SKAA + L+ A++N+R N
Sbjct: 125 RSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184

Query: 169 VCAGTIETPLYHQAIDNYCHRTGANLTQVHQEEAAL----QPLGRIGRPEEVAELVYFLA 224
V G+ ET + + GA QV + PL ++ +P ++A+ V FL
Sbjct: 185 VSPGSTETDMQWSL---WADENGA--EQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLV 239

Query: 225 SEKAAYITGSLQVIDGGYT 243
S +A +IT +DGG T
Sbjct: 240 SGQAGHITMHNLCVDGGAT 258


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS21990  HTHFIS        29     0.027    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.4 bits (66), Expect = 0.027
Identities = 7/29 (24%), Positives = 15/29 (51%)

Query: 109 KADDVELSKSNILLVGPTGCGKTLLAQTL 137
+ + +++ G +G GK L+A+ L
Sbjct: 152 VLARLMQTDLTLMITGESGTGKELVARAL 180


16  PSLF89_RS22145  PSLF89_RS22225  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
PSLF89_RS22145  0       24       -4.188289  IS982 family transposase
PSLF89_RS22150  0       20       -4.883116  hypothetical protein
PSLF89_RS35970  0       21       -4.197779  hypothetical protein
PSLF89_RS22160  2       21       -5.313900  hypothetical protein
PSLF89_RS35975  2       19       -5.569591  hypothetical protein
PSLF89_RS22165  0       19       -5.592168  DNA/RNA non-specific endonuclease
PSLF89_RS22170  -1      18       -4.134351  HU family DNA-binding protein
PSLF89_RS22175  0       18       -3.328875  IS4 family transposase
PSLF89_RS22180  1       17       -3.095501  hypothetical protein
PSLF89_RS22185  -2      17       -3.013203  malate dehydrogenase
PSLF89_RS22190  -2      16       -2.530416  TolC family outer membrane protein
PSLF89_RS22195  -2      14       -0.281466  protein-L-isoaspartate O-methyltransferase
PSLF89_RS22200  2       22       1.407767   hypothetical protein
PSLF89_RS22205  1       23       1.582755   thiol reductant ABC exporter subunit CydD
PSLF89_RS22215  2       17       3.122642   thiol reductant ABC exporter subunit CydC
PSLF89_RS22220  2       11       2.810201   hypothetical protein
PSLF89_RS22225  2       10       1.026432   IS30 family transposase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS22200  DNABINDINGHU  104    3e-33    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 104 bits (262), Expect = 3e-33
Identities = 40/87 (45%), Positives = 60/87 (68%)

Query: 3 KTELVEVISKKADISKKAAGRLVDIMLESIEGGLKEGDSVDLKGFGKFEMKQRAARVGRN 62
K +L+ +++ +++KK + VD + ++ L +G+ V L GFG FE+++RAAR GRN
Sbjct: 4 KQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGRN 63

Query: 63 PRTGEEIEIPAATVPVFKPSKALKAAV 89
P+TGEEI+I A+ VP FK KALK AV
Sbjct: 64 PQTGEEIKIKASKVPAFKAGKALKDAV 90


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
PSLF89_RS22255  HTHFIS  28     0.044    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 28.3 bits (63), Expect = 0.044
Identities = 10/43 (23%), Positives = 19/43 (44%), Gaps = 1/43 (2%)

Query: 4 RHLNEKDRFYIEQRLSE-GDSLRSIARALGFSPSTISREIKRH 45
R L E + I L+ + A LG + +T+ ++I+
Sbjct: 431 RVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIREL 473


17  PSLF89_RS37060  PSLF89_RS22695  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
PSLF89_RS37060  -2      26       4.487573    LbtU family siderophore porin
PSLF89_RS22365  -2      24       4.377407    methyltransferase regulatory domain-containing
PSLF89_RS22370  -2      21       4.046687    IS4 family transposase
PSLF89_RS22375  -2      18       3.092242    ATP-grasp domain-containing protein
PSLF89_RS22380  -2      15       2.282292    IS30 family transposase
PSLF89_RS22385  -2      10       -0.738094   hypothetical protein
PSLF89_RS22390  -1      13       -2.596017   hypothetical protein
PSLF89_RS22395  -2      12       -2.865080   hypothetical protein
PSLF89_RS22400  -1      13       -3.451259   hypothetical protein
PSLF89_RS35990  1       19       -3.392229   hypothetical protein
PSLF89_RS22420  2       19       -2.856430   Do family serine endopeptidase
PSLF89_RS22425  1       20       -2.596775   DUF1043 family protein
PSLF89_RS22430  1       23       -2.804839   alpha/beta fold hydrolase
PSLF89_RS22435  -1      28       -2.816509   cell division protein ZapE
PSLF89_RS22445  4       24       -0.287580   IS30 family transposase
PSLF89_RS22450  0       22       0.931855    hypothetical protein
PSLF89_RS22455  -1      21       1.694585    hypothetical protein
PSLF89_RS35995  -2      15       3.749314    IS4 family transposase
PSLF89_RS37365  -2      15       3.709421    hypothetical protein
PSLF89_RS22470  -1      15       3.225387    hypothetical protein
PSLF89_RS22475  -1      16       2.579166    hypothetical protein
PSLF89_RS22480  -2      19       2.236318    hypothetical protein
PSLF89_RS22485  -1      15       1.664087    IS4 family transposase
PSLF89_RS22490  0       12       -0.927573   hypothetical protein
PSLF89_RS22495  -1      13       -1.655722   hypothetical protein
PSLF89_RS22500  -1      13       -1.861485   transposase
PSLF89_RS22510  0       14       -2.213151   IS481 family transposase
PSLF89_RS22515  1       16       -3.686271   hypothetical protein
PSLF89_RS22525  1       17       -3.996326   IS3 family transposase
PSLF89_RS22530  1       18       -3.486975   hypothetical protein
PSLF89_RS22545  1       15       -4.161371   IS982 family transposase
PSLF89_RS22550  0       15       -3.639255   *RNA polymerase sigma factor RpoD
PSLF89_RS22555  0       18       -4.311618   DNA primase
PSLF89_RS22565  2       19       -2.727925   GatB/YqeY domain-containing protein
PSLF89_RS22570  2       19       -1.573645   30S ribosomal protein S21
PSLF89_RS36010  3       21       -1.623350   tRNA
PSLF89_RS22580  2       21       -1.767344   glycerol-3-phosphate 1-O-acyltransferase PlsY
PSLF89_RS22585  2       22       -1.744823   SAM-dependent methyltransferase
PSLF89_RS22590  2       21       -0.354274   hypothetical protein
PSLF89_RS22595  0       20       0.912364    IS4 family transposase
PSLF89_RS22605  0       20       1.203472    hypothetical protein
PSLF89_RS22610  0       22       2.299354    transposase
PSLF89_RS22615  -1      17       3.199653    IS30 family transposase
PSLF89_RS22625  -2      20       4.768526    hypothetical protein
PSLF89_RS22630  0       17       4.085707    alpha/beta hydrolase
PSLF89_RS22635  0       15       3.828701    MFS transporter
PSLF89_RS22640  0       11       2.511382    glycoside hydrolase family 32 protein
PSLF89_RS22650  1       17       -0.291296   hypothetical protein
PSLF89_RS22655  0       19       -1.597144   hypothetical protein
PSLF89_RS37075  0       25       -2.145550   dihydroorotase
PSLF89_RS36025  1       23       -2.632050   pyridoxamine 5'-phosphate oxidase
PSLF89_RS22670  1       21       -4.714211   universal stress protein
PSLF89_RS22675  1       23       -4.481995   formimidoylglutamase
PSLF89_RS36030  1       24       -4.903623   imidazolonepropionase
PSLF89_RS22685  0       25       -4.124459   urocanate hydratase
PSLF89_RS22690  -1      24       -3.271233   histidine ammonia-lyase
PSLF89_RS22695  -2      20       -3.052745   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS22395  INFPOTNTIATR  30     0.018    Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 29.6 bits (66), Expect = 0.018
Identities = 18/38 (47%), Positives = 25/38 (65%), Gaps = 2/38 (5%)

Query: 1 MKLKLITCSVALSVAASTTFAAT-ADSLKLELDKLKAS 37
MK+KL+T ++ + +A ST AAT A SL + DKL S
Sbjct: 1 MKMKLVTAAI-MGLAMSTAMAATDATSLTTDKDKLSYS 37


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits        Score  E-value  Comments
PSLF89_RS22470  V8PROTEASE  66     7e-14    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 65.8 bits (160), Expect = 7e-14
Identities = 32/167 (19%), Positives = 58/167 (34%), Gaps = 35/167 (20%)

Query: 78 PKSTGSGVIINADKGYILTNYHVIAEAKKIRVTLK------------DGRQLTAKVIGND 125
SGV++ K +LTN HV+ LK +G ++
Sbjct: 100 GTFIASGVVVG--KDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYS 157

Query: 126 KGTDIAIIKISA--------KNLEQIRLPKPNYIPDVGDFVVAVGSPYGL---SQTVTSG 174
D+AI+K S + ++ + V + G P + + G
Sbjct: 158 GEGDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQ-VNQNITVTGYPGDKPVATMWESKG 216

Query: 175 IISALDRNNLGIEGFENFIQTDAPINPGNSGGALVNLQGQLVGINTA 221
I+ ++G +Q D GNSG + N + +++GI+
Sbjct: 217 KIT-------YLKG--EAMQYDLSTTGGNSGSPVFNEKNEVIGIHWG 254


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
PSLF89_RS22480  PF06057  27     0.044    Type IV secretory pathway VirJ component
		>PF06057#Type IV secretory pathway VirJ component

Length = 243

Score = 27.1 bits (60), Expect = 0.044
Identities = 17/56 (30%), Positives = 27/56 (48%), Gaps = 6/56 (10%)

Query: 93 TADLAAVIDWVKAQQPDHEIWLAGFSFGGYV---AYR---GASRFNVNQLLLVAPA 142
T D A+ID +A+ ++ L G+SFG V R NV +L++P+
Sbjct: 100 TQDTLAIIDKYQAEFGTQKVILIGYSFGAEVIPFVLNEMPARYRKNVLGAVLLSPS 155


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
PSLF89_RS22690  TCRTETA  48     5e-08    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 47.5 bits (113), Expect = 5e-08
Identities = 54/346 (15%), Positives = 120/346 (34%), Gaps = 23/346 (6%)

Query: 25 IGLVSPQISTYYNVDISNIVYIDVLNIVGLLIGNF---------FSGRLIEKINTHNTLC 75
IGL+ P + + + DV G+L+ + G L ++ L
Sbjct: 21 IGLIMPVLPGLLRDLVHSN---DVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLL 77

Query: 76 SAIALGIIAESLLALGLPLSFYTACSMLNGISIGFLVPAVTQSISDLHTISREKDSKLSL 135
++A + +++A L ++ GI+ G I+D+ T E+
Sbjct: 78 VSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADI-TDGDERARHFGF 135

Query: 136 LNFFFSLGSVFVPIVGGYITHYLSWRGVFAMLAILYVFLLICALTFKIKPTCDNTPKSQQ 195
++ F G V P++GG + + S F A L + F + + + +
Sbjct: 136 MSACFGFGMVAGPVLGGLMGGF-SPHAPFFAAAALNGLNFLTGC-FLLPESHKGERRPLR 193

Query: 196 SQTKNNQSIFNLSLILIGIALVCYVY-----IEYVVSYWFSPYLQMDKHISVIETGKLLG 250
+ N + F + + +A + V+ + V + + + + H G L
Sbjct: 194 REALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLA 253

Query: 251 IFGASIAAVRLIAGLYLLKKIRATNYITLSCITVFIGFLFFLNSSSYFSFMASIILIGCG 310
FG + + + + ++ + L I G++ ++ + ++L+ G
Sbjct: 254 AFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASG 313

Query: 311 CASLFPTLLGYGIAQA-NYQSPRATSFLITCGSIGGFVGLIMSGFL 355
+ P L Q + + L S+ VG ++ +
Sbjct: 314 GIGM-PALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAI 358


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
PSLF89_RS22710  UREASE  39     4e-05    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 39.0 bits (91), Expect = 4e-05
Identities = 24/96 (25%), Positives = 41/96 (42%), Gaps = 19/96 (19%)

Query: 5 LITNATIINEGQKTEADLFIKNGRIEHI----------DSDLSHKPVKQVIDAKNKWLIP 54
+ITNA I++ +AD+ +K+GRI I + P +VI + K +
Sbjct: 71 VITNALILDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGKIVTA 130

Query: 55 GMIDDQVHFREPGLTHKGEMRTESRAAAAGGITSVM 90
G +D +HF P + A G+T ++
Sbjct: 131 GGMDSHIHFICP---------QQIEEALMSGLTCML 157


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
PSLF89_RS22730  UREASE  47     8e-08    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 47.0 bits (112), Expect = 8e-08
Identities = 21/62 (33%), Positives = 35/62 (56%), Gaps = 5/62 (8%)

Query: 19 DFGLIHQGAIAVKEGNIAWLGRAGDLDSR-----YIGIDTQVHNGQGRYLTPGLIDCHTH 73
D I + I +K+G IA +G+AG+ D + +G T+V G+G+ +T G +D H H
Sbjct: 79 DHWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGKIVTAGGMDSHIH 138

Query: 74 MV 75
+
Sbjct: 139 FI 140



Score = 31.2 bits (71), Expect = 0.008
Identities = 12/29 (41%), Positives = 19/29 (65%)

Query: 349 TVHAAKALGMADRVGQLKVGMQADFSLWE 377
T++ A A G++ +G L+VG +AD LW
Sbjct: 410 TINPAIAHGLSHEIGSLEVGKRADLVLWN 438


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits  Score  E-value  Comments
PSLF89_RS22740  SECA  32     0.008    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 31.8 bits (72), Expect = 0.008
Identities = 17/49 (34%), Positives = 25/49 (51%), Gaps = 9/49 (18%)

Query: 167 GKVLPA---SEGLKQLGLEPIALGAKEGLALNNGTQVSTAICLKNYFAL 212
G+ + S+GL Q A+ AKEG+ + N Q +I +NYF L
Sbjct: 341 GRTMQGRRWSDGLHQ------AVEAKEGVQIQNENQTLASITFQNYFRL 383


18  PSLF89_RS22865  PSLF89_RS22920  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
PSLF89_RS22865  -2      10       3.182205    DNA translocase FtsK
PSLF89_RS36045  -2      12       4.419102    outer-membrane lipoprotein carrier protein LolA
PSLF89_RS22870  -2      13       4.489411    inositol monophosphatase
PSLF89_RS22880  -2      18       4.482181    tRNA
PSLF89_RS22885  -1      17       3.831530    IscS subfamily cysteine desulfurase
PSLF89_RS22890  0       20       4.349103    hypothetical protein
PSLF89_RS22895  0       29       3.454682    nucleoside-diphosphate kinase
PSLF89_RS22900  0       25       3.189061    23S rRNA (adenine(2503)-C(2))-methyltransferase
PSLF89_RS22905  1       29       1.953905    type IV pilus biogenesis/stability protein PilW
PSLF89_RS22910  1       27       2.737223    helix-turn-helix domain-containing protein
PSLF89_RS22915  0       25       2.329724    hypothetical protein
PSLF89_RS22920  1       21       3.052473    histidine--tRNA ligase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS22920  SYCDCHAPRONE  32     9e-04    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 32.2 bits (73), Expect = 9e-04
Identities = 13/93 (13%), Positives = 34/93 (36%)

Query: 55 LGLAYLKAEDFRRAQYKLSKAIKLDPHRAEVHYAFAYYLETVGEFEKAQQEYLTALNIAP 114
L ++ + A LD + + + +G+++ A Y +
Sbjct: 42 LAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDI 101

Query: 115 DDPKVLNNYGAFLCRQGQVDKSLRYLLAAAEHV 147
+P+ + L ++G++ ++ L A E +
Sbjct: 102 KEPRFPFHAAECLLQKGELAEAESGLFLAQELI 134



Score = 27.6 bits (61), Expect = 0.029
Identities = 26/124 (20%), Positives = 45/124 (36%), Gaps = 2/124 (1%)

Query: 60 LKAEDFRRAQYKLSKAIKLDPHRAEVHYAFAYYLETVGEFEKAQQEYLTALNIAPDDPKV 119
L E F + ++ ++ E Y+ A+ G++E A + + + D +
Sbjct: 13 LAMESFLKGGGTIAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRF 72

Query: 120 LNNYGAFLCRQGQVDKSLRYLLAAAEHVEYLDRAGSYENAGLCALKIDELKYAQHYLTQA 179
GA GQ D ++ A R +A C L+ EL A+ L A
Sbjct: 73 FLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRF--PFHAAECLLQKGELAEAESGLFLA 130

Query: 180 LQLA 183
+L
Sbjct: 131 QELI 134


19  PSLF89_RS22970  PSLF89_RS23045  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
PSLF89_RS22970  2       16       0.376231    chemotaxis protein CheX
PSLF89_RS22975  2       12       0.612518    response regulator
PSLF89_RS22980  2       13       0.767574    23S rRNA pseudouridine(955/2504/2580) synthase
PSLF89_RS22985  2       15       1.219780    Rne/Rng family ribonuclease
PSLF89_RS22990  2       12       3.354579    phosphatase PAP2 family protein
PSLF89_RS22995  1       12       3.850740    glutamate--cysteine ligase
PSLF89_RS23000  -1      13       3.273717    glutamate--tRNA ligase
PSLF89_RS23005  -2      12       3.241798    crossover junction endodeoxyribonuclease RuvC
PSLF89_RS23010  -2      14       3.374449    Holliday junction branch migration protein RuvA
PSLF89_RS23015  -1      19       3.215032    Holliday junction branch migration DNA helicase
PSLF89_RS23020  -1      20       2.249747    protein TolQ
PSLF89_RS23025  0       20       2.055838    protein TolR
PSLF89_RS23030  1       23       2.546356    cell envelope integrity protein TolA
PSLF89_RS23035  2       25       2.680848    Tol-Pal system beta propeller repeat protein
PSLF89_RS23040  1       25       2.486108    OmpA family protein
PSLF89_RS23045  0       24       3.278626    tol-pal system protein YbgF
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
PSLF89_RS23005  HTHFIS  63     9e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 63.3 bits (154), Expect = 9e-15
Identities = 27/114 (23%), Positives = 49/114 (42%), Gaps = 1/114 (0%)

Query: 22 TVLTVDDSKTIHGIAKNLLSGSEFDIIDVAHNGNDGVEKYKKLKPNFVLMDIVMPELDGM 81
T+L DD I + LS + +D+ + N + V+ D+VMP+ +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDV-RITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 82 SALKKIIDFDPHAQVVMATSMGQEDTVEQAITIGAKGYLLKPYDKESVLVVLRT 135
L +I P V++ ++ T +A GA YL KP+D ++ ++
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
PSLF89_RS23060  PF03544  47     4e-08    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 46.5 bits (110), Expect = 4e-08
Identities = 24/190 (12%), Positives = 55/190 (28%), Gaps = 17/190 (8%)

Query: 186 QAEQVRQQALEKKREQEQLHQRQALEKKQREAAAKVKREAEVAAEK-------QRQQALA 238
V LE + + + + + E + +EA V EK + +
Sbjct: 51 SVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKK 110

Query: 239 RLRANAQSNIANQLAENARVAAARTARQQYVQSEFEKYSGLIVTEISRHWNQANID-PSL 297
+ + A + V R ++ P+
Sbjct: 111 VEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQPQYPAR 170

Query: 298 NA--------LIQVNVDVTGDILSVKIVKSSGNAIFDRQAKLAVLSAGRLPMPTDKEVAQ 349
++ +V G + +V+I+ + +F+R+ K A+ P +
Sbjct: 171 AQALRIEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVV 230

Query: 350 RFLSFQFHFT 359
+ F+ + T
Sbjct: 231 N-ILFKINGT 239


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits        Score  E-value  Comments
PSLF89_RS23070  OMPADOMAIN  82     1e-20    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 81.9 bits (202), Expect = 1e-20
Identities = 33/110 (30%), Positives = 49/110 (44%), Gaps = 10/110 (9%)

Query: 100 VYFGFDQYSVGKTDQDIVQSNVNYL--LKHPKQKVLLEGYTDPRGSSQYNLNLGQKRANS 157
V F F++ ++ Q + + L L V++ GYTD GS YN L ++RA S
Sbjct: 221 VLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQS 280

Query: 158 LKDALLSAGVGPQQVSTLSYGKE-------CLAVPGGTAEAD-YQKDRRV 199
+ D L+S G+ ++S G+ C V A D DRRV
Sbjct: 281 VVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRV 330


20  PSLF89_RS23160  PSLF89_RS23210  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
PSLF89_RS23160  2       20       0.310047    hypothetical protein
PSLF89_RS36065  3       22       0.756214    hypothetical protein
PSLF89_RS36070  6       25       0.528150    CHASE domain-containing protein
PSLF89_RS36075  4       25       -0.650143   IS4 family transposase
PSLF89_RS36080  4       23       -1.173575   response regulator
PSLF89_RS36085  4       24       -1.245661   sulfate transporter CysZ
PSLF89_RS23180  1       25       -1.507744   chromosome segregation protein SMC
PSLF89_RS23185  1       13       0.864347    hypothetical protein
PSLF89_RS23190  0       13       1.177723    protease SohB
PSLF89_RS23200  0       17       3.407024    7-cyano-7-deazaguanine synthase
PSLF89_RS23205  0       19       3.561933    hypothetical protein
PSLF89_RS23210  1       19       3.479820    transposase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
PSLF89_RS23190  PF06580  34     0.001    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 34.5 bits (79), Expect = 0.001
Identities = 33/195 (16%), Positives = 68/195 (34%), Gaps = 46/195 (23%)

Query: 467 EKIIQQSLEGAEKVKNIVLSL-----KSFAHSDTDN---KEEFDLNHCIEQALTITQNEL 518
I LE K + ++ SL S +S+ +E + ++ L + +
Sbjct: 180 NNIRALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTV---VDSYLQLASIQF 236

Query: 519 KYKCKIIKNLSPLNPLLGYSSQIGQVIMNLLI-NA-AHAIKES---GTITITTQQIAGFN 573
+ + + ++P ++ Q+ +++ L+ N H I + G I + + G
Sbjct: 237 EDRLQFENQINP--AIMDV--QVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTV 292

Query: 574 KLTIEDDGYGIHKDHLAKLFDPFFTTKSVGEGTGLGLS-------ISYGIIKKHQGSINV 626
L +E+ G K+ E TG GL + YG + I +
Sbjct: 293 TLEVENTGSLA--------------LKNTKESTGTGLQNVRERLQMLYG----TEAQIKL 334

Query: 627 ESTVGQGTVFTIQLP 641
G+ + +P
Sbjct: 335 SEKQGKVNA-MVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
PSLF89_RS23200  HTHFIS  95     8e-25    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 94.5 bits (235), Expect = 8e-25
Identities = 22/119 (18%), Positives = 45/119 (37%), Gaps = 1/119 (0%)

Query: 2 PSLLLVDDEPHIIDALKRLFRREKYTLHCAYSAKEGLDILAQQHIDIILSDQRMPSMLGS 61
++L+ DD+ I L + R Y + +A +A D++++D MP
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 EFLAKAQAQSPQTLRIILSGYADTKEIINGILNNHIHQFLEKPWRANELREHLRHLINL 120
+ L + + P +++S I + +L KP+ EL + +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKG-AYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits        Score  E-value  Comments
PSLF89_RS23210  GPOSANCHOR  50     4e-08    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 50.1 bits (119), Expect = 4e-08
Identities = 51/300 (17%), Positives = 115/300 (38%), Gaps = 1/300 (0%)

Query: 716 QALQRELAEVKARVSGRLAHIEQVRARLAVVEHELSEALELFNEEQEEIKIARQRLEIAV 775
++ L +V+ R ++ + + + + +E EE+ A+++L
Sbjct: 46 RSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKND 105

Query: 776 EKMAELELQRQALESGRVEASRTTQEARQQLESLKSQYQQKQMQLQRVQSEQAGIKAALQ 835
+ ++E + Q LE+ + + + + A + ++ + + + + + +A ++ AL+
Sbjct: 106 KSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALE 165

Query: 836 RLEELLGRDKVRLVELEQSRIGLEEPLEEQRMLLDEQLERQLSFEDRLKTVKDQAQAHEN 895
D ++ LE + LE E L+ + + ++KT++ + A
Sbjct: 166 GAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAA 225

Query: 896 QVRTFEKQLHEQMNQVAHAREALEQGRMQAQELFIRRQSVEEQLVEAGFQLRGLL-EIYQ 954
+ EK L MN ++ + L R+ +E+ L A +I
Sbjct: 226 RKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKT 285

Query: 955 EGCDVKELETALEDIGRRIQRLGVINLAAIDEYAGQSERKVYLDAQHDDLTEALDMLEAA 1014
+ LE D+ + Q L + + E K L+A+H L E + EA+
Sbjct: 286 LEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEAS 345



Score = 43.1 bits (101), Expect = 7e-06
Identities = 34/219 (15%), Positives = 68/219 (31%)

Query: 168 AGISKYKERRKETERRIRHTRENLERLGDIREELGKQLSRLHQQAQAAEKYQNFKKEERE 227
A + K E + LE L + + A + K + E
Sbjct: 123 ADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLE 182

Query: 228 VKGQLYIQRWKSLQTQHGQEQKKIQEHEVIVEKQRAGQQHIDASLEKERLALSEASEKLH 287
+ R L+ ++ A + + A AL A
Sbjct: 183 AEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFST 242

Query: 288 ACQADYYQAGNEVSRLEQQIEHATTRIRETGQEMARLNTSLEKARSELAADEQQKVLLSA 347
A A E + LE + + + ++ +E AA E +K L
Sbjct: 243 ADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEH 302

Query: 348 QEEALEPETELLQNAVDEAALSAEEAEVSYRQLEKERES 386
Q + L + L+ +D + + ++ E +++LE++ +
Sbjct: 303 QSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKI 341



Score = 42.7 bits (100), Expect = 8e-06
Identities = 29/191 (15%), Positives = 73/191 (38%)

Query: 670 EKELMTLRQQELALEESICLHEEQLSQSQEVLMQVEAQVKSVQQQAQALQRELAEVKARV 729
E E L ++ LE+++ + + +EA+ +++ + L++ L
Sbjct: 147 EAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFS 206

Query: 730 SGRLAHIEQVRARLAVVEHELSEALELFNEEQEEIKIARQRLEIAVEKMAELELQRQALE 789
+ A I+ + A A + ++ + +++ + A LE ++ LE
Sbjct: 207 TADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELE 266

Query: 790 SGRVEASRTTQEARQQLESLKSQYQQKQMQLQRVQSEQAGIKAALQRLEELLGRDKVRLV 849
A + ++++L+++ + + ++ + + A Q L L +
Sbjct: 267 KALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKK 326

Query: 850 ELEQSRIGLEE 860
+LE LEE
Sbjct: 327 QLEAEHQKLEE 337



Score = 39.3 bits (91), Expect = 8e-05
Identities = 35/232 (15%), Positives = 76/232 (32%), Gaps = 2/232 (0%)

Query: 273 EKERLALSEASEKLHACQADYYQAGNEVSRLEQQIEHATTRIRETGQEMARLNTSLEKAR 332
E +A ++ L Q + E + L+ + + + L L A+
Sbjct: 39 EVSAVATRSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAK 98

Query: 333 SELAADEQQKVLLSAQEEALEPETELLQNAVDEAALSAEEAEVSYRQLEKERESLLQQVA 392
+L +++ +++ + LE L+ A++ A + + LE E+ +L + A
Sbjct: 99 EKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKA 158

Query: 393 LCRQEAEVEQTRIRHMEEQGQRLQQRLERLRAETH--NSDLISLEVGLEDVQGQQRELEE 450
+ E + + L+ L A L + + LE
Sbjct: 159 DLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEA 218

Query: 451 KEQELQLGAEEQQQRLIQQRQLIEQQRKGLEQQRGELHPLKGRLASLEALQQ 502
++ L + ++ L ++ E L+ R A LE +
Sbjct: 219 EKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALE 270



Score = 37.0 bits (85), Expect = 5e-04
Identities = 36/207 (17%), Positives = 84/207 (40%)

Query: 670 EKELMTLRQQELALEESICLHEEQLSQSQEVLMQVEAQVKSVQQQAQALQRELAEVKARV 729
++ TL ++ ALE E+ L + A++K+++ + AL+ E A+++ +
Sbjct: 245 SAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQS 304

Query: 730 SGRLAHIEQVRARLAVVEHELSEALELFNEEQEEIKIARQRLEIAVEKMAELELQRQALE 789
A+ + +R L + + +E+ KI+ + + ++ LE
Sbjct: 305 QVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLE 364

Query: 790 SGRVEASRTTQEARQQLESLKSQYQQKQMQLQRVQSEQAGIKAALQRLEELLGRDKVRLV 849
+ + + + +SL+ + ++V+ + L LE+L +
Sbjct: 365 AEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKK 424

Query: 850 ELEQSRIGLEEPLEEQRMLLDEQLERQ 876
E+ + L+ LE + L E+L +Q
Sbjct: 425 LTEKEKAELQAKLEAEAKALKEKLAKQ 451


21  PSLF89_RS23415  PSLF89_RS23505  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
PSLF89_RS23415  1       25       3.893240    multidrug efflux MFS transporter
PSLF89_RS23420  0       20       2.344048    aminotransferase class I/II-fold pyridoxal
PSLF89_RS37095  0       17       0.542011    phytanoyl-CoA dioxygenase family protein
PSLF89_RS23425  2       16       -1.099747   class I SAM-dependent methyltransferase
PSLF89_RS23430  2       15       -3.267586   WbqC family protein
PSLF89_RS23435  3       14       -6.431265   hypothetical protein
PSLF89_RS23440  3       21       -9.594569   hypothetical protein
PSLF89_RS23445  7       23       -11.662777  GNAT family N-acetyltransferase
PSLF89_RS23450  7       25       -12.511116  hypothetical protein
PSLF89_RS23455  5       21       -11.407658  hypothetical protein
PSLF89_RS23465  0       13       -4.077884   flagellin
PSLF89_RS23470  0       12       -3.349812   B-type flagellin
PSLF89_RS23475  0       12       -2.630978   flagellar protein FlaG
PSLF89_RS23480  -1      11       -1.088937   flagellar filament capping protein FliD
PSLF89_RS23485  0       19       1.398544    flagellar export chaperone FliS
PSLF89_RS23490  1       22       -0.186141   flagellar protein FliT
PSLF89_RS23495  2       19       -2.641399   hypothetical protein
PSLF89_RS23500  1       19       -2.954444   hypothetical protein
PSLF89_RS23505  2       23       -4.419964   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
PSLF89_RS23435  TCRTETB  68     1e-14    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 68.0 bits (166), Expect = 1e-14
Identities = 56/240 (23%), Positives = 105/240 (43%), Gaps = 20/240 (8%)

Query: 2 TNKSNNPTALILFFILLIVPIGQVAIDIYLPSLPYISQELAISTSVTQWSLTIYLLSSGL 61
+N +N + L + + ++ +++ SLP I+ + + T W T ++L+ +
Sbjct: 8 SNLRHNQILIWLCILSFFSVLNEMVLNV---SLPDIANDFNKPPASTNWVNTAFMLTFSI 64

Query: 62 SQFFYGPISDSLGRKPILCYGLIIFFIGSLVCAQAQGELSLL-AGRLLQGLGIGA----G 116
YG +SD LG K +L +G+II GS++ SLL R +QG G A
Sbjct: 65 GTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALV 124

Query: 117 AVISNAMVGDHFHGIHLAKVTSLSSFAYGISPIIAPFIGGLIQTHLGWRFNFYFLLIITA 176
V+ + G + S+ + G+ P IGG+I ++ W + +IT
Sbjct: 125 MVVVARYIPKENRGKAFGLIGSIVAMGEGVGPA----IGGMIAHYIHWSYLLLI-PMITI 179

Query: 177 ASLLLAIALLPETLNKQNKQHLNIKTLKQNYLSILKQKIF-----WGYVLCMTLSFAISI 231
++ + LL + + K H +IK + + I+ +F +++ LSF I +
Sbjct: 180 ITVPFLMKLLKK--EVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFV 237


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
PSLF89_RS23485  FLAGELLIN  183    9e-54    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 183 bits (466), Expect = 9e-54
Identities = 131/527 (24%), Positives = 222/527 (42%), Gaps = 24/527 (4%)

Query: 5 INTNFTAILGQNRLESVNTEINRVMQRLTTGKRVNTAADDAAGYAIITRMTTRLKGYDTA 64
INTN ++L QN L + ++ ++RL++G R+N+A DDAAG AI R T+ +KG A
Sbjct: 4 INTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQA 63

Query: 65 VRNASDAVSLVQIAGGAVGQQVNMLQRVRTLALQSANDTNNTTDRANLNLEVQEIIEEFG 124
RNA+D +S+ Q GA+ + N LQRVR L++Q+ N TN+ +D ++ E+Q+ +EE
Sbjct: 64 SRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEID 123

Query: 125 FVAERTKFNGVRLLDGSLANQMFQIGVEVDDTLSLTFGNTKTEVIGMAEYGGGITGSAIG 184
V+ +T+FNGV++L Q+G +T+++ + +G+ G
Sbjct: 124 RVSNQTQFNGVKVLSQD-NQMKIQVGANDGETITIDLQKIDVKSLGL---DGFNVNGPKE 179

Query: 185 AAQGDNMGALRAGVLGTAGAAADLNAFVNQIVAQSITIAGHAGPAKTVAYAVPGAAGQAS 244
A GD + + A + ++G T A
Sbjct: 180 ATVGDL----------KSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYV 229

Query: 245 SAKDVAKALNTEKASTGVRVEARTRMTLSNLQNPGQI-SFVLYGDGALLTANPSTGGFAV 303
+A + + + +T V + T+ T + + +G T
Sbjct: 230 NAANGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDT 289

Query: 304 NVNIADQNDLTSLAAGINDLSGSTGITASLSPDLNEIMLEHADGENIAIENFLNSGTGTM 363
+++ G ITA + + + + T
Sbjct: 290 KTGNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTK 349

Query: 364 AVQGMDGVAATNLTSGGTDSIAAVGTLQFKSDKQFDIASTVAGTNTVGGIFVGAAGDTKF 423
S + A G + + A+ T+ G +
Sbjct: 350 NESA--------KLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTAS- 400

Query: 424 SLLSSVKDMNILDRFNALLTVEIVDAALDALTTIGAELGAKQNRLDVTIASIENQELNLT 483
+ + + + + + + +D+AL + + + LGA QNR D I ++ N NL
Sbjct: 401 GVSTLINEDAAAAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLN 460

Query: 484 SARGRIEDADFAAESTNLSKFQVLLQAGTAMLAQANQLPATALQLLQ 530
SAR RIEDAD+A E +N+SK Q+L QAGT++LAQANQ+P L LL+
Sbjct: 461 SARSRIEDADYATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
PSLF89_RS23490  FLAGELLIN  159    4e-45    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 159 bits (403), Expect = 4e-45
Identities = 125/502 (24%), Positives = 198/502 (39%), Gaps = 18/502 (3%)

Query: 2 SLQTTLQRLATGKRINSPADDAAGYAIAARQTSDILSFGQAARNANDGISVVQTASSAIN 61
SL + ++RL++G RINS DDAAG AIA R TS+I QA+RNANDGIS+ QT A+N
Sbjct: 23 SLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNANDGISIAQTTEGALN 82

Query: 62 TNISSLQRMRVLALQSLNDTNSSSDRVNLSLEFKQLSASITETAKSTKFNGQSLLDGSFA 121
++LQR+R L++Q+ N TNS SD ++ E +Q I + T+FNG +L
Sbjct: 83 EINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSNQTQFNGVKVLSQDN- 141

Query: 122 GKQFQVGITTTETISMSFADSRATAVGDYKTTAVNAGGAVFDFEMVATALGTATLGTGQD 181
+ QVG ETI++ ++G A L ++
Sbjct: 142 QMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEATV------GDLKSSFKNVTGY 195

Query: 182 VDNGILAASGLTVVGHLGTKALADADFGAAGSSFGLATATTTSAGMSAAVIAKAVSDSSG 241
+ A V A A T+ +
Sbjct: 196 DTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAVDLFKTTKS 255

Query: 242 DHGVTATGRTEVTLSGLTAAGDVSFSLGSGTGSVAADYSFATISSTIADTSDLSALAQAI 301
G + G + + T ST + ++ I
Sbjct: 256 TAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTINGEKVTLTVADI 315

Query: 302 NDTSGTHGVTAELSGTDKGSMTLVSENGQNISLAVFDSSAAGTMTLTEQDGTATSVLQDA 361
+ S + + + NGQ + +A L +
Sbjct: 316 TAGAANVDAATLQSSKNVYTSVV---NGQFTFDDKTKNESAKLSDLEANNAVKGESKITV 372

Query: 362 GGNDSFIATGIVEYHSSKAFTLQSAVADIGTVVADASGFKSVADSDIKTTAEAKSAIFAL 421
G + + A + + + + + + ++
Sbjct: 373 NGAEYTANAA--------GDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLASI 424

Query: 422 DQALNRLTDSQANNGAIENRLNVVISNLENQQLNTTNSRGRIEDADFASETANLSKLQIL 481
D AL+++ +++ GAI+NR + I+NL N N ++R RIEDAD+A+E +N+SK QIL
Sbjct: 425 DSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQIL 484

Query: 482 SQVGTAMLAQANQIPAAVLSLI 503
Q GT++LAQANQ+P VLSL+
Sbjct: 485 QQAGTSVLAQANQVPQNVLSLL 506


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
PSLF89_RS23495  PF07299  28     0.014    Fibronectin-binding protein (FBP)
		>PF07299#Fibronectin-binding protein (FBP)

Length = 219

Score = 27.9 bits (62), Expect = 0.014
Identities = 18/77 (23%), Positives = 29/77 (37%), Gaps = 11/77 (14%)

Query: 2 IMKIEHTSISTQSTQQTAATPAKIKVNEVERLARLNQEIKEQEHVLQEFEQAEPVTEVQS 61
I+ H + + + Q + A K+ V L E KE + V VQ+
Sbjct: 25 ILANGHATANDRGVIQALKSLAIEKIIHV--FENLTDEQKEL---------IDTVLTVQN 73

Query: 62 RARIEQAIADINQFIQP 78
R E + IN ++ P
Sbjct: 74 REDAESFLLKINPYVIP 90


22  PSLF89_RS23685  PSLF89_RS23715  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
PSLF89_RS23685  -2      17       3.759581    tRNA preQ1(34) S-adenosylmethionine
PSLF89_RS23690  0       17       4.302381    tRNA guanosine(34) transglycosylase Tgt
PSLF89_RS23695  0       18       3.963372    preprotein translocase subunit YajC
PSLF89_RS23700  1       22       3.528331    protein translocase subunit SecD
PSLF89_RS23705  1       21       2.814797    protein translocase subunit SecF
PSLF89_RS23710  1       22       3.112866    hypothetical protein
PSLF89_RS23715  1       20       3.160221    transglycosylase SLT domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS23725  SECFTRNLCASE  87     1e-20    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 86.8 bits (215), Expect = 1e-20
Identities = 42/262 (16%), Positives = 103/262 (39%), Gaps = 26/262 (9%)

Query: 348 VRLTGNEVSNFSKVTRDNVGKGMAVVLVQTTLSSKKINGKDIFQRKTSERVISIATIQQA 407
+R + + V++ + + + + Q
Sbjct: 55 IRTESTTAIDVGVYRAALEPLELGDVIISEVRDPSFREDQHVAMIRIQMQEDGQGAEGQG 114

Query: 408 LGNSFQITGIGEVKDAQSLAIQIRAGALPAPVQIVEDQVIGPTLGAQNIHIGLVSLAAAM 467
+ +V+ A A+ ++I + +GP + + + + SL AA
Sbjct: 115 AQGQ---ELVNKVETA--------LTAVDPALKITSFESVGPKVSGELVWTAVWSLLAAT 163

Query: 468 MVTLLFMLVYYR-AFGIYANIALILNMIFLFAIMSVMGATMSLPGIAAAVLHIGMAVDAN 526
+V + ++ V + F + A +AL+ +++ + +V+ L +AA + G +++
Sbjct: 164 VVIMFYIWVRFEWQFALGAVVALVHDVLLTVGLFAVLQLKFDLTTVAALLTITGYSINDT 223

Query: 527 VLIFERIREELRA--GISPH----KAISQGFDRALATIVDSNLTTLIVAVVLFAIGTGSV 580
V++F+R+RE L + ++++ R + T + TTL+ V + G +
Sbjct: 224 VVVFDRLRENLIKYKTMPLRDVMNLSVNETLSRTVMTGM----TTLLALVPMLIWGGDVI 279

Query: 581 KGFAVVLIIGIV----TSLFTA 598
+GF ++ G+ +S++ A
Sbjct: 280 RGFVFAMVWGVFTGTYSSVYVA 301


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS23730  SECFTRNLCASE  302    e-104    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 302 bits (774), Expect = e-104
Identities = 101/308 (32%), Positives = 179/308 (58%), Gaps = 8/308 (2%)

Query: 1 MEFFKQQTNIDFLGLRRWAGIFSVVICLGSIAIMAIKGLNWGLDFTGGYSVQVSYVKAPN 60
++ ++TN DF + ++V+ + S+ + + GLN+G+DF GG +++ A +
Sbjct: 5 LKLVPEKTNFDFFRWQWATFGAAIVMMIASVILPLVIGLNFGIDFKGGTTIRTESTTAID 64

Query: 61 LTKVRNALDAANFREARVTTYGSTR------DLQIRFAPQEGQSAGLSETQQGA-LKAKL 113
+ R AL+ + ++ IR QE + QG L K+
Sbjct: 65 VGVYRAALEPLELGDVIISEVRDPSFREDQHVAMIRIQMQEDGQGAEGQGAQGQELVNKV 124

Query: 114 KTTLTLGQP-VEINSVNYIGSEVGSEMVQQGILAIIVSVLAIMVYVALRFDYRFAISAAV 172
+T LT P ++I S +G +V E+V + +++ + + IM Y+ +RF+++FA+ A V
Sbjct: 125 ETALTAVDPALKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVV 184

Query: 173 ALAHDPLLILGIFSLFHIEFTLISLAALLAVIGFSLNDTVVIYDRIRENFRKMRKATPVD 232
AL HD LL +G+F++ ++F L ++AALL + G+S+NDTVV++DR+REN K + D
Sbjct: 185 ALVHDVLLTVGLFAVLQLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRD 244

Query: 233 VVNRSINDTLSRTLMTSGLTLLVVVILYVFGGPALQPFALVLIIGILIGTYSSIYIAGAL 292
V+N S+N+TLSRT+MT TLL +V + ++GG ++ F ++ G+ GTYSS+Y+A +
Sbjct: 245 VMNLSVNETLSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNI 304

Query: 293 SIKLGINR 300
+ +G++R
Sbjct: 305 VLFIGLDR 312


23  PSLF89_RS23785  PSLF89_RS23830  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
PSLF89_RS23785  0       18       -4.934689   hypothetical protein
PSLF89_RS23790  -2      19       -3.913000   MFS transporter
PSLF89_RS36135  -2      18       -3.319509   hypothetical protein
PSLF89_RS23800  -1      19       -1.475978   patatin-like phospholipase family protein
PSLF89_RS23805  2       15       2.514864    cadherin-like domain-containing protein
PSLF89_RS23815  2       15       3.221996    hypothetical protein
PSLF89_RS23820  3       16       3.746036    IS6 family transposase
PSLF89_RS23825  2       14       2.649129    IS6 family transposase
PSLF89_RS23830  2       14       2.281949    IS982 family transposase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PSLF89_RS23815TCRTETB415e-06 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 41.4 bits (97), Expect = 5e-06
Identities = 37/159 (23%), Positives = 66/159 (41%), Gaps = 22/159 (13%)

Query: 50 INALLLTFGIFAAGYLARPLGGLIFGHIGDRFGRRHAFSHSIIIMAIGTMCIGLLPGYHH 109
A +LTF I G ++G + D+ G + +++ I C G + G+
Sbjct: 55 NTAFMLTFSI----------GTAVYGKLSDQLGIKR-----LLLFGIIINCFGSVIGF-- 97

Query: 110 IGITAPLLLMLLRIIQGVSLGGEIPGSSIFTAEHLFNQNRRGMAIGMIFMFITLGNTLGG 169
+G + LL++ R IQG P + + RG A G+I + +G +G
Sbjct: 98 VGHSFFSLLIMARFIQGAG-AAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156

Query: 170 FIGAVLTHYFTPEQMLSFGWRIPFIIGFSIGIIAYFMRK 208
IG ++ HY S+ IP I ++ + ++K
Sbjct: 157 AIGGMIAHYIH----WSYLLLIPMITIITVPFLMKLLKK 191
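
Each alignment's statistics line gives raw counts alongside whole-number percentages. As a rough sketch for checking those figures, the snippet below parses one such line (copied from the TetB hit above) and recomputes the percentages. `STAT_RE` and `parse_alignment_stats` are hypothetical names introduced here for illustration and are not part of BLAST or PredictBias.

```python
import re

# Hypothetical helper for illustration; not part of BLAST or PredictBias.
# Matches fragments such as "Identities = 37/159 (23%)".
STAT_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_alignment_stats(line):
    """Return {name: (count, alignment_length, reported_percent)} for one statistics line."""
    return {
        name: (int(count), int(length), int(reported))
        for name, count, length, reported in STAT_RE.findall(line)
    }


line = "Identities = 37/159 (23%), Positives = 66/159 (41%), Gaps = 22/159 (13%)"
stats = parse_alignment_stats(line)

for name, (count, length, reported) in stats.items():
    exact = 100.0 * count / length
    print(f"{name}: {count}/{length} = {exact:.2f}% (reported {reported}%)")
```

In the lines shown in this report the printed percentages match the exact ratios rounded down (for example, 66/159 is about 41.5% and is reported as 41%). Lines without a Gaps field, such as the one for the HELNAPAPROT hit, are handled as well because the pattern collects only the fields that are present.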


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS23830  RTXTOXINA     121    5e-30    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family

signature.
Length = 1024

Score = 121 bits (306), Expect = 5e-30
Identities = 81/262 (30%), Positives = 105/262 (40%), Gaps = 15/262 (5%)

Query: 695 HAGYGDDTVRGGTGEDAIFGGAGDDDLRGGAGNDPLRGGQGEDSLRGGAGNDDLRGGAGN 754
H G GDD V G I+ G G D + + G + G G
Sbjct: 615 HLGDGDDKVFLSAGSANIYAGKGHDVVYYDKTDTGYLTIDGTKATEAGNYTVTRVLGGDV 674

Query: 755 DALRGGQGEDSLRGGAGDDDLRGGAGEDVLRGGQG---EDSLR------GGAGNDDLRGG 805
L+ E + G + + + E G+ D+L G D G
Sbjct: 675 KVLQEVVKEQEVSVGKRTEKTQYRSYEFTHINGKNLTETDNLYSVEELIGTTRADKFFGS 734

Query: 806 AGEDVLRGGQGEDSLRGGAGNDDLRGGAGEDVLRGGQGEDSLRGGAGNDDLRGGAGEDVL 865
D+ G G+D + G GND L G G D L GG G+D L GG GND L G AG + L
Sbjct: 735 KFTDIFHGADGDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGDGNDKLIGVAGNNYL 794

Query: 866 RGGQGEDSLR---GGAGDDDLRGGAGEDVLRGGQGDDVLDGGEGVDTVYAGQGNDAATFV 922
GG G+D + + L GG G D L G +G D+LDGGEG D + G GND +
Sbjct: 795 NGGDGDDEFQVQGNSLAKNVLFGGKGNDKLYGSEGADLLDGGEGDDLLKGGYGND--IYR 852

Query: 923 VGQSEGIDQYY-GGSGEDTLRI 943
G G ED L +
Sbjct: 853 YLSGYGHHIIDDDGGKEDKLSL 874



Score = 119 bits (299), Expect = 3e-29
Identities = 83/262 (31%), Positives = 107/262 (40%), Gaps = 26/262 (9%)

Query: 674 VNAGSGDDVIQLGNGYANSTIHAGYGDDTV---RGGTGEDAIFG---------------G 715
+ G GDD + L G AN I+AG G D V + TG I G G
Sbjct: 614 SHLGDGDDKVFLSAGSAN--IYAGKGHDVVYYDKTDTGYLTIDGTKATEAGNYTVTRVLG 671

Query: 716 AGDDDLRGGAGNDPLRGGQGEDSLRGGAGNDDLRGGAGNDALRGGQGEDSLRGGAGDDDL 775
L+ + G+ + + + G + L G D
Sbjct: 672 GDVKVLQEVVKEQEVSVGKRTEKTQYRSYEFTHINGKNLTETDNLYSVEELIGTTRADKF 731

Query: 776 RGGAGEDVLRGGQGEDSLRGGAGNDDLRGGAGEDVLRGGQGEDSLRGGAGNDDLRGGAGE 835
G D+ G G+D + G GND L G G D L GG G+D L GG GND L G AG
Sbjct: 732 FGSKFTDIFHGADGDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGDGNDKLIGVAGN 791

Query: 836 DVLRGGQGEDSLR---GGAGNDDLRGGAGEDVLRGGQGEDSLRGGAGDDDLRGGAGEDVL 892
+ L GG G+D + + L GG G D L G +G D L GG GDD L+GG G D+
Sbjct: 792 NYLNGGDGDDEFQVQGNSLAKNVLFGGKGNDKLYGSEGADLLDGGEGDDLLKGGYGNDIY 851

Query: 893 RGGQG---DDVLDGGEGVDTVY 911
R G + D G D +
Sbjct: 852 RYLSGYGHHIIDDDGGKEDKLS 873



Score = 100 bits (251), Expect = 2e-23
Identities = 75/263 (28%), Positives = 100/263 (38%), Gaps = 20/263 (7%)

Query: 714 GGAGDDDLRGGAGNDPLRGGQGEDSLRGGAGNDDLRGGAGNDALRGGQGEDSLRGGAGDD 773
G GDD + AG+ + G+G D + + G A G + G
Sbjct: 616 LGDGDDKVFLSAGSANIYAGKGHDVVYYDKTDTGYLTIDGTKATEAGNYTVTRVLGGDVK 675

Query: 774 DLRGGAGEDVLRGGQGEDSLRGGAGNDDLRGGAGEDVLRGGQGEDSLRGGAGNDDLRGGA 833
L+ E + G+ + + + G + L G D G
Sbjct: 676 VLQEVVKEQEVSVGKRTEKTQYRSYEFTHINGKNLTETDNLYSVEELIGTTRADKFFGSK 735

Query: 834 GEDVLRGGQGEDSLRGGAGNDDLRGGAGEDVLRGGQGEDSLRGGAGDDDLRGGAGEDVLR 893
D+ G G+D + G GND L G G D L GG G+D L GG G+D L G AG + L
Sbjct: 736 FTDIFHGADGDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGDGNDKLIGVAGNNYLN 795

Query: 894 GGQGDD------------VLDGGEGVDTVYAGQGNDAATFVVGQSEGIDQYYGGSGEDTL 941
GG GDD VL GG+G D +Y +G D ++ EG D GG G D
Sbjct: 796 GGDGDDEFQVQGNSLAKNVLFGGKGNDKLYGSEGAD----LLDGGEGDDLLKGGYGNDIY 851

Query: 942 RIELSAEQLENSDIITDLHGLND 964
R II D G D
Sbjct: 852 RY----LSGYGHHIIDDDGGKED 870



Score = 68.8 bits (168), Expect = 1e-13
Identities = 47/151 (31%), Positives = 73/151 (48%), Gaps = 13/151 (8%)

Query: 1099 EEVASVEEFNTGAGDDIVDLASYRYEYGDTVMNLGEGSDVGWGNIGEDQIFGGAGNDWLA 1158
+ + SVEE D + + + + +G D+ GN G D+++G GND L+
Sbjct: 714 DNLYSVEELIGTTRADKFFGSKF-----TDIFHGADGDDLIEGNDGNDRLYGDKGNDTLS 768

Query: 1159 GNSGNDLLKGGLGDDRLEGNAGDDEIHVGQGDDIAMGHSGSDLFVFNLDEGNLGQNWVSG 1218
G +G+D L GG G+D+L G AG++ ++ G GDD S N+ G G + + G
Sbjct: 769 GGNGDDQLYGGDGNDKLIGVAGNNYLNGGDGDDEFQVQGNS--LAKNVLFGGKGNDKLYG 826

Query: 1219 GEGEDSLQLSGSGGQNWVLHVENGGDGEVIH 1249
EG D L G G + + GG G I+
Sbjct: 827 SEGAD--LLDGGEGDDLL----KGGYGNDIY 851


24  PSLF89_RS23920  PSLF89_RS23990  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
PSLF89_RS23920  1       15       3.585303   pentapeptide repeat-containing protein
PSLF89_RS23925  1       16       3.691101   hypothetical protein
PSLF89_RS23930  1       16       4.105341   DNA-binding protein
PSLF89_RS23945  1       16       3.445031   hypothetical protein
PSLF89_RS23950  -1      14       2.435015   hypothetical protein
PSLF89_RS23955  0       9        1.422801   hypothetical protein
PSLF89_RS23960  1       11       0.479601   hypothetical protein
PSLF89_RS23965  2       14       -0.154173  hypothetical protein
PSLF89_RS23970  3       15       -0.453007  TSUP family transporter
PSLF89_RS23975  3       22       -0.910447  ribonuclease HI
PSLF89_RS23980  4       27       -1.917481  TetR/AcrR family transcriptional regulator
PSLF89_RS23985  2       24       -4.316791  ATP-binding cassette domain-containing protein
PSLF89_RS23990  1       21       -3.598870  insulinase family protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS23980  GPOSANCHOR    40     5e-05    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 40.0 bits (93), Expect = 5e-05
Identities = 46/311 (14%), Positives = 88/311 (28%), Gaps = 25/311 (8%)

Query: 637 KVRAQSIRDHFREHTKGFRESTQRYAEQAKQREDGKIAGADTLEEMARLRTLALTQHVAA 696
+++ T + AE+A M + +
Sbjct: 123 ADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADS-AKIKTL 181

Query: 697 TAAKNGFREIGKELGKANSTIDQLKETNKQSQETIKETEERVRNAKKETSAIHTKLEQAQ 756
A K EL KA + +T++ + + K +
Sbjct: 182 EAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFS 241

Query: 757 EVIQRQNAEKRQDFQDLKSEITEYCERLKDNNQSLIKRLGALARSIPDQKEFEELRDDFT 816
+ + L++ E + L+ + ++ E + D
Sbjct: 242 TADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLE 301

Query: 817 SQHKKQVAFLDGLIVKLSAKVEIYQENSKEVTKL-----ISDAN---------------K 856
Q + A L L A E ++ E KL IS+A+ K
Sbjct: 302 HQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKK 361

Query: 857 QVACANTALEGVEQTLDKIRKELKLTFSQLEEAHKKIKTAQAINTNQADQI----SELTA 912
Q+ + LE + + R+ L+ EA K+++ A ++ + EL
Sbjct: 362 QLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEE 421

Query: 913 QLTLLTQPKAE 923
L + KAE
Sbjct: 422 SKKLTEKEKAE 432



Score = 33.5 bits (76), Expect = 0.006
Identities = 28/259 (10%), Positives = 67/259 (25%), Gaps = 10/259 (3%)

Query: 730 TIKETEERVRNAKKETSAIHTKLEQAQEVIQRQNAEKRQDFQDLKSEITEYCERLKDNNQ 789
E ++ +T + E+ K D + ++ + L +
Sbjct: 36 NTNEVSAVATRSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELS 95

Query: 790 SLIKRLGALARSIPDQKEFEELRDDFTSQHKKQVAFLDGLIVKLSAKVEIYQENSKEVTK 849
+ ++L +S+ ++ + + + +K + SAK + +
Sbjct: 96 NAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAK----IKTLEAEKA 151

Query: 850 LISDANKQVACANTALEGVEQTLDKIRKELKLTFSQLEEAHKKIKTAQAINTNQADQISE 909
++ + A K L+ + LE +++ A N + S
Sbjct: 152 ALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSA 211

Query: 910 LTAQLTLLTQPKAEEFQLKVVDTTLSRGDNMKRTGDLFEIVVKALKENQKLNGTNKGKIL 969
L A D + M + + E L + L
Sbjct: 212 KIKTLEAEKAALAARKA----DLEKALEGAMNFSTADSAKIKTLEAEKAAL--EARQAEL 265

Query: 970 ELLRDDLQKHKENLMSDSD 988
E + +
Sbjct: 266 EKALEGAMNFSTADSAKIK 284



Score = 32.7 bits (74), Expect = 0.010
Identities = 46/261 (17%), Positives = 92/261 (35%), Gaps = 11/261 (4%)

Query: 656 ESTQRYAEQAKQREDGKIAGADTLEEMARLRTLALTQHVA-ATAAKNGFREIGKELGKAN 714
E ++ + A L AL + +TA + + E
Sbjct: 200 EGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALE 259

Query: 715 STIDQLKETNKQSQETIKETEERVRNAKKETSAIHTKLEQAQEVIQRQNAEKRQDFQDLK 774
+ +L++ + + +++ + E +A+ + + Q NA ++ +DL
Sbjct: 260 ARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLD 319

Query: 775 SEITEYCERLKDNNQSLIKRLGALARSIPDQKEFEELRDDFTSQHKKQVAFLDGLIVKLS 834
+ ++L+ +Q L ++ S ++ D K+ A L +
Sbjct: 320 ASREAK-KQLEAEHQKLEEQNKISEAS---RQSLRRDLDASREAKKQLEAEHQKLEEQNK 375

Query: 835 AKVEIYQENSKEVTKLISDANKQVACANTALEGVEQTLDKIRKEL----KLTFSQLEEAH 890
Q +++ +A KQV A L+K+ KEL KLT + E
Sbjct: 376 ISEASRQSLRRDLDASR-EAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQ 434

Query: 891 KKIKT-AQAINTNQADQISEL 910
K++ A+A+ A Q EL
Sbjct: 435 AKLEAEAKALKEKLAKQAEEL 455


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS23985  RTXTOXIND     38     4e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 37.9 bits (88), Expect = 4e-05
Identities = 23/182 (12%), Positives = 60/182 (32%), Gaps = 18/182 (9%)

Query: 94 NAQAEISNLQQLLNNKELELDTWQQNHQRLITQHESLTTIHQELTAQHQTLITEQQLKTE 153
S +++ + + + + N + + + A+ +++
Sbjct: 183 EVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAER-------LTVLARINRYENLSRVEKS 235

Query: 154 QLTALEQL-------HHSSHAQQQQYI---NELSSQLAEEKEIFKKAIEHIQENYQLHCE 203
+L L H+ Q+ +Y+ NEL ++ ++I + I +E YQL +
Sbjct: 236 RLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQI-ESEILSAKEEYQLVTQ 294

Query: 204 QLETHHQQKISELKTNLSSYQANVQELTSDLQHLQQQLNQTKTINNLSEYVFNQLNNQTE 263
+ K+ + N+ + + Q + + + L + + E
Sbjct: 295 LFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE 354

Query: 264 HL 265
L
Sbjct: 355 TL 356



Score = 28.6 bits (64), Expect = 0.033
Identities = 23/173 (13%), Positives = 57/173 (32%), Gaps = 15/173 (8%)

Query: 136 ELTAQHQTLITEQQLKTEQLT-----ALEQLHHSSHAQQQQYINELSSQLAEEKEIFKKA 190
L A+ TL T+ L +L L + + + + +E Q E+E +
Sbjct: 129 ALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEE-VLRL 187

Query: 191 IEHIQENYQLHCEQLETHHQQKISELKTNLSSYQANVQELTSDLQHLQQQLNQTKTINNL 250
I+E + Q + + + + + A + + + + +L+ ++ +
Sbjct: 188 TSLIKEQFSTWQNQK-YQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHK 246

Query: 251 SEYVFNQLNNQTEHLTQLIKDHQHQQNADLIALQNQQATVNKELTHLALEQQQ 303
+ + Q + +L ++Q + E+ E Q
Sbjct: 247 QAIAKHAVLEQENKYVEA--------VNELRVYKSQLEQIESEILSAKEEYQL 291


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS24025  HTHTETR       70     5e-17    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 70.0 bits (171), Expect = 5e-17
Identities = 35/215 (16%), Positives = 73/215 (33%), Gaps = 17/215 (7%)

Query: 6 KKQRPSADITRSNILQAAQKLFASHGFAGTSISMIAKKANINQSLIYHHFTNKHDLWCKA 65
+K + A TR +IL A +LF+ G + TS+ IAK A + + IY HF +K DL
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDL-FSE 61

Query: 66 KKEFITPDLSKESPPSTQLQSLALYDFIKTIITIRFNIYKHNPDMGRML--LWQFLEFND 123
E ++ + ++ I+ ++ ++ EF
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 124 DKPLLEDS-----SPMMTTILDCITQFKNEGEIHPRYINYTASQLLFYIFTNASSLFTSL 178
+ +++ + I + + + A+ ++ + L
Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMR-------GYISGL 174

Query: 179 YGAWFNKDEMTEEFHKQYADFIITAVYQSLASAPT 213
W + + K+ A + + + PT
Sbjct: 175 MENWLFAPQSFDL--KKEARDYVAILLEMYLLCPT 207


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS24030  RTXTOXIND     37     3e-04    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 36.7 bits (85), Expect = 3e-04
Identities = 23/129 (17%), Positives = 42/129 (32%), Gaps = 13/129 (10%)

Query: 517 YDDWLRQRSVLSQGQSQAQAQ---AQAQIQNSIKNPAGSQVENCAV---NKNNTMDKSGV 570
+ W Q+ + +A+ A+I + + K V
Sbjct: 195 FSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAV 254

Query: 571 KEQKKKLSGKLSFKEKQELEALPAKIEQLEHE--QEELNLAMANGDFYQQESDVIRQATT 628
EQ+ K + EL +++EQ+E E + + F + D +RQ T
Sbjct: 255 LEQENKYV-----EAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTD 309

Query: 629 RMATLEEAL 637
+ L L
Sbjct: 310 NIGLLTLEL 318


25  PSLF89_RS24040  PSLF89_RS24160  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
PSLF89_RS24040  0       16       -3.791884  hypothetical protein
PSLF89_RS24045  -1      17       -3.760348  IS4 family transposase
PSLF89_RS36155  -1      17       -3.760348  IS3 family transposase
PSLF89_RS24055  0       17       -2.692670  transposase
PSLF89_RS24060  -1      17       -3.860786  transposase
PSLF89_RS36160  -1      18       -4.842082  hypothetical protein
PSLF89_RS24065  1       21       -4.756162  IS30 family transposase
PSLF89_RS24070  0       16       -4.374322  IS3 family transposase
PSLF89_RS37665  -1      19       -4.545798  hypothetical protein
PSLF89_RS24085  -2      21       -5.035157  hypothetical protein
PSLF89_RS24090  -2      22       -3.786776  hypothetical protein
PSLF89_RS36170  -1      21       -1.548345  hypothetical protein
PSLF89_RS24105  0       17       -0.755524  hypothetical protein
PSLF89_RS36175  0       22       -1.314250  peptide-methionine (R)-S-oxide reductase MsrB
PSLF89_RS24120  5       21       -1.115794  pentapeptide repeat-containing protein
PSLF89_RS24125  2       13       -1.342658  helix-turn-helix transcriptional regulator
PSLF89_RS24130  2       15       -2.578553  DUF2975 domain-containing protein
PSLF89_RS24135  1       17       -3.024329  AAA family ATPase
PSLF89_RS36180  1       16       -3.326505  peroxiredoxin
PSLF89_RS24150  1       15       -3.208066  4-hydroxy-tetrahydrodipicolinate synthase
PSLF89_RS24155  -1      16       -3.524155  hypothetical protein
PSLF89_RS24160  0       20       -3.402659  phosphoribosylaminoimidazolesuccinocarboxamide
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS24210  VACCYTOTOXIN  33     7e-04    Helicobacter pylori vacuolating cytotoxin signature.
		>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature.

Length = 1291

Score = 33.5 bits (76), Expect = 7e-04
Identities = 42/165 (25%), Positives = 67/165 (40%), Gaps = 35/165 (21%)

Query: 31 DATAFNGVKHELLENKG---------KVSNIFNAFIMQKLESAGVKTHFIEKISDHESLV 81
+ F+GV +++ NK K NI + S G THF E I +
Sbjct: 520 NTLDFSGVTNKVNINKLITASTNVAVKNFNINELVVKTNGVSVGEYTHFSEDIGSQSRIN 579

Query: 82 KPLEMLRVECVVRNIAAGSLSKRYGIEEGSELKAPIFEFFLKDDDLGDPM-INDEHI--- 137
+R+E R+I +G G++ K I +F+ + D I + I
Sbjct: 580 ----TVRLETGTRSIYSG------GVKFKGGEKLVINDFYYAPWNYFDARNIKNVEITNK 629

Query: 138 IAFG-----WGTEEDIKMMRELTIKVNHVLN-----DLFLQGDIL 172
+AFG WGT + M LT+ N V++ +L +QGD +
Sbjct: 630 LAFGPQGSPWGTAK--LMFNNLTLGQNAVMDYSQFSNLTIQGDFV 672


26  PSLF89_RS24305  PSLF89_RS36960  Y  N  N  Genomic Island
LocusTag        DNBias  CDNBias  %GCBias    Product
PSLF89_RS24305  3       17       -3.498770  hypothetical protein
PSLF89_RS24310  1       16       -5.871675  phosphopentomutase
PSLF89_RS24315  6       17       -5.143994  deoxyribose-phosphate aldolase
PSLF89_RS24320  9       21       -4.138827  cytochrome d ubiquinol oxidase subunit II
PSLF89_RS24325  7       21       -2.842676  cytochrome ubiquinol oxidase subunit I
PSLF89_RS24330  7       21       -0.787159  hypothetical protein
PSLF89_RS24340  5       16       -0.158257  FAD-dependent
PSLF89_RS24345  2       13       1.188954   hypothetical protein
PSLF89_RS24355  1       13       2.833194   hypothetical protein
PSLF89_RS24365  0       18       2.072536   hypothetical protein
PSLF89_RS24375  2       18       -0.969936  DNA recombination protein RmuC
PSLF89_RS36960  2       15       -1.004819  PAS domain-containing methyl-accepting

27  PSLF89_RS24445  PSLF89_RS36205  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
PSLF89_RS24445  -1      27       3.389647   oligoribonuclease
PSLF89_RS36190  0       25       4.338908   tRNA
PSLF89_RS24455  -1      23       4.952739   N-acetylmuramoyl-L-alanine amidase
PSLF89_RS24460  -1      17       4.220032   DNA starvation/stationary phase protection
PSLF89_RS24470  -2      16       2.489659   catalase
PSLF89_RS24475  -2      20       0.578877   CBS domain-containing protein
PSLF89_RS24480  -2      20       0.551099   hypothetical protein
PSLF89_RS24485  1       20       0.318766   hypothetical protein
PSLF89_RS24490  0       19       0.797526   hypothetical protein
PSLF89_RS24500  1       15       1.275193   IS3 family transposase
PSLF89_RS24505  2       26       -0.204838  hypothetical protein
PSLF89_RS36195  3       26       -0.244766  transposase
PSLF89_RS36205  2       20       -0.594287  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS24490  HELNAPAPROT   143    1e-46    Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family

signature.
Length = 153

Score = 143 bits (361), Expect = 1e-46
Identities = 47/144 (32%), Positives = 79/144 (54%)

Query: 10 KTAEISKELNKLLATYQVFYMNVRGFHWNIKGQQFFELHTKFEEIYNDLLTKVDEIAERI 69
+ LN L+ + + Y + FHW +KG FF LH KFEE+Y+ VD IAER+
Sbjct: 9 NQTLVENSLNTQLSNWFLLYSKLHRFHWYVKGPHFFTLHEKFEELYDHAAETVDTIAERL 68

Query: 70 LTLGEQPLHAYSQYAKHSEISEAINVVDAENSVKSLLNSFSALIKLQRHILKVSGDAEDE 129
L +G QP+ +Y +H+ I++ N A V++L+N + + + ++ ++ + +D
Sbjct: 69 LAIGGQPVATVKEYTEHASITDGGNETSASEMVQALVNDYKQISSESKFVIGLAEENQDN 128

Query: 130 GTSSLMGDYIKEQEKLIWMFLAYL 153
T+ L I+E EK +WM +YL
Sbjct: 129 ATADLFVGLIEEVEKQVWMLSSYL 152


28  PSLF89_RS24570  PSLF89_RS24720  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
PSLF89_RS24570  -1      12       3.030016   formate-dependent phosphoribosylglycinamide
PSLF89_RS24575  0       21       4.715222   hypothetical protein
PSLF89_RS24580  0       21       4.877499   tRNA pseudouridine(38-40) synthase TruA
PSLF89_RS24585  0       21       3.972311   acetyl-CoA carboxylase, carboxyltransferase
PSLF89_RS24590  1       22       3.433550   bifunctional tetrahydrofolate
PSLF89_RS24595  0       21       2.735320   CvpA family protein
PSLF89_RS24600  -2      18       1.123277   amidophosphoribosyltransferase
PSLF89_RS24605  -2      14       0.444687   tRNA-(ms[2]io[6]A)-hydroxylase
PSLF89_RS24610  -1      15       1.734077   gamma-glutamyltransferase
PSLF89_RS24615  -1      15       1.971165   IS982 family transposase
PSLF89_RS24620  -1      15       2.245908   hypothetical protein
PSLF89_RS24625  -2      17       2.579584   DNA mismatch repair protein MutS
PSLF89_RS24630  -2      15       4.201035   CinA family protein
PSLF89_RS24635  -2      16       4.114731   recombinase RecA
PSLF89_RS24640  -2      17       2.336256   recombination regulator RecX
PSLF89_RS24645  -1      16       1.333214   alanine--tRNA ligase
PSLF89_RS24650  -3      10       0.367181   carbon storage regulator CsrA
PSLF89_RS24655  -2      10       0.052703   **IS982 family transposase
PSLF89_RS24660  1       22       -1.980000  hypothetical protein
PSLF89_RS24675  1       25       -1.314299  SH3 domain-containing protein
PSLF89_RS36215  3       20       0.150692   sphingomyelin phosphodiesterase
PSLF89_RS24685  3       20       0.305441   APC family permease
PSLF89_RS24690  0       17       1.162485   glycerophosphodiester phosphodiesterase
PSLF89_RS24695  0       14       1.627749   glycerol-3-phosphate transporter
PSLF89_RS24700  -1      15       2.166806   RNA chaperone Hfq
PSLF89_RS24705  -1      20       2.741838   GTPase HflX
PSLF89_RS24710  -1      19       2.515040   FtsH protease activity modulator HflK
PSLF89_RS24715  -1      19       2.665632   protease modulator HflC
PSLF89_RS24720  -1      19       3.109827   adenylosuccinate synthase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS24585  BCTERIALGSPF  29     0.026    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F

signature.
Length = 408

Score = 29.4 bits (66), Expect = 0.026
Identities = 12/51 (23%), Positives = 22/51 (43%), Gaps = 4/51 (7%)

Query: 384 PLWLQAL----DSMTYWGTAVVVLLFLGLFWLAWRLRREHVRRLEDQAVLR 430
PL + L D++ +G +++ L G LR+E R + +L
Sbjct: 210 PLSTRVLMGMSDAVRTFGPWMLLALLAGFMAFRVMLRQEKRRVSFHRRLLH 260


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS24700  SALVRPPROT    31     0.005    Salmonella virulence-associated 28kDa protein signature.
		>SALVRPPROT#Salmonella virulence-associated 28kDa protein signature.

Length = 241

Score = 31.3 bits (70), Expect = 0.005
Identities = 26/88 (29%), Positives = 38/88 (43%), Gaps = 10/88 (11%)

Query: 122 LFSEQYPVQHTKIPTLKYVVQYVKAIAGEKAGFQIEIKTDPAHPHQSAT----PKQFATA 177
LFSE PV K+ ++ VVQ + G A F + IK D + SA+ +QF
Sbjct: 125 LFSEDSPVDKWKVTDMEKVVQQARVSLG--AQFTLYIKPDQENSQYSASFLHKTRQFIEC 182

Query: 178 LAKLLKAEGITD----RTEVQAFDWPCL 201
L L G+ ++V +W L
Sbjct: 183 LESRLSENGVISGQCPESDVHPENWKYL 210


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS24705  TCRTETA       34     9e-04    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 34.4 bits (79), Expect = 9e-04
Identities = 32/146 (21%), Positives = 61/146 (41%), Gaps = 11/146 (7%)

Query: 59 GFDKGDLGLVLAAVSIAYGLSK-FVMGTISDRSNPRTFLTVGLLLSALINLFFGAASISM 117
+D +G+ LAA I + L++ + G ++ R R L +G++ + A+
Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301

Query: 118 SSIPLMFCLMFLNGWFQGMGWPACGRTMVHWFSVGERGTKMSIWNVAHNVGGGLIGPLAI 177
+ P+M L G+G PA + M+ ER ++ A ++GPL
Sbjct: 302 MAFPIMVLLASG-----GIGMPAL-QAMLSRQVDEERQGQLQGSLAALTSLTSIVGPL-- 353

Query: 178 MGLALFSAWQSLFYFPALIAIVVAIF 203
+ A+++A S+ + I A
Sbjct: 354 LFTAIYAA--SITTWNGWAWIAGAAL 377



Score = 32.9 bits (75), Expect = 0.002
Identities = 28/156 (17%), Positives = 54/156 (34%), Gaps = 7/156 (4%)

Query: 63 GDLGLVLAAVSIAYGLSKFVMGTISDRSNPRTFLTVGLLLSALINLFFGAASISMSSIPL 122
G++LA ++ V+G +SDR R L V L +A+ A
Sbjct: 43 AHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLW----- 97

Query: 123 MFCLMFLNGWFQGMGWPACGRTMVHWFSVGERGTKMSIWNVAHNVGGGLIGPLAIMGLAL 182
+ + + G G + ER + + A G G++ + GL
Sbjct: 98 VLYIGRIVAGITGATGAVAGAYIADITDGDER-ARHFGFMSA-CFGFGMVAGPVLGGLMG 155

Query: 183 FSAWQSLFYFPALIAIVVAIFVFFSLRDTPQSVGLP 218
+ + F+ A + + + F L ++ + P
Sbjct: 156 GFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRP 191


29  PSLF89_RS24760  PSLF89_RS24950  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
PSLF89_RS24760  -1      17       -3.366613  MFS transporter
PSLF89_RS24765  0       18       -4.474031  IucA/IucC family siderophore biosynthesis
PSLF89_RS24770  0       20       -4.971359  ATP-grasp domain-containing protein
PSLF89_RS24775  0       19       -3.924470  TonB-dependent siderophore receptor
PSLF89_RS24780  0       18       -3.874724  MotA/TolQ/ExbB proton channel family protein
PSLF89_RS24785  0       18       -3.565383  biopolymer transporter ExbD
PSLF89_RS24790  -1      18       -3.628375  biopolymer transporter ExbD
PSLF89_RS24795  -1      16       -2.759200  energy transducer TonB
PSLF89_RS24800  -1      13       -2.430120  iron-siderophore ABC transporter
PSLF89_RS24805  3       23       -2.616535  iron ABC transporter permease
PSLF89_RS24810  2       19       -4.197150  hypothetical protein
PSLF89_RS24815  1       17       -3.926434  IS4 family transposase
PSLF89_RS24820  2       15       -3.802621  hypothetical protein
PSLF89_RS24825  1       16       -3.522291  DUF2158 domain-containing protein
PSLF89_RS24830  -1      13       -2.152390  NADPH-dependent 2,4-dienoyl-CoA reductase
PSLF89_RS24835  -2      12       -1.753009  MFS transporter
PSLF89_RS24840  -2      13       0.224771   ferredoxin family protein
PSLF89_RS24845  -2      15       1.636933   30S ribosomal protein S16
PSLF89_RS24850  -2      14       2.486183   ribosome maturation factor RimM
PSLF89_RS24855  -2      14       3.313630   tRNA (guanosine(37)-N1)-methyltransferase TrmD
PSLF89_RS24860  -1      17       4.311631   50S ribosomal protein L19
PSLF89_RS24865  2       20       3.561832   hypothetical protein
PSLF89_RS24870  1       21       2.281653   hypothetical protein
PSLF89_RS24875  0       20       0.695461   hypothetical protein
PSLF89_RS24880  0       17       -0.274369  IS4 family transposase
PSLF89_RS24885  0       21       -3.019294  IS3 family transposase
PSLF89_RS24890  -1      19       -2.409913  IS3 family transposase
PSLF89_RS24895  0       20       -1.504468  transposase
PSLF89_RS24900  -1      17       -1.614990  IS3 family transposase
PSLF89_RS24910  0       17       -0.817582  IS30 family transposase
PSLF89_RS24920  2       15       -1.941036  IS3 family transposase
PSLF89_RS24925  3       13       -4.148643  hypothetical protein
PSLF89_RS24930  4       16       -3.163714  transposase
PSLF89_RS36220  3       18       -3.343160  hypothetical protein
PSLF89_RS24940  3       18       -2.993757  IS3 family transposase
PSLF89_RS24945  2       19       -2.494805  IS3 family transposase
PSLF89_RS24950  3       19       -1.878987  DUF2807 domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS24785  TCRTETA       60     5e-12    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 59.8 bits (145), Expect = 5e-12
Identities = 61/330 (18%), Positives = 134/330 (40%), Gaps = 24/330 (7%)

Query: 44 GFLYVLPTLLTAIASPFWGKISDKINKKSALLRAQLGLSISFLIVAFSSGYLSLFILSLC 103
G L L L+ +P G +SD+ ++ LL + G ++ + I+A + L+I
Sbjct: 46 GILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYI-GRI 104

Query: 104 LQGLLGGTLAAANAYLATTSHRQQLSQLLNLTQFSARAAFLIAPIIIGFLINLFSPLSVY 163
+ G+ G T A A AY+A + + ++ + P++ G + FSP + +
Sbjct: 105 VAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGG-FSPHAPF 163

Query: 164 FLLALITFISAIIIYFYVPKDKDKDKNKNYDHDKITPNSKPQSIDAAINILPYYCLLAAS 223
F A + ++ + F +P+ K + + + + P + + + L+A
Sbjct: 164 FAAAALNGLNFLTGCFLLPESH-KGERRPLRREALNPLASFRWARG---MTVVAALMAVF 219

Query: 224 FVFNFSTVISFPYFITLLQAHFNVHSGLILGLL--FGLPHAVYLISIFSLQKYRQQPSQQ 281
F+ + ++ + F+ + I L FG+ H++ I + ++
Sbjct: 220 FIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITG--PVAARLGER 277

Query: 282 PWIFTG------ALILLAFSLYWQCVTTGFFTLIILRIVMGAAITLGFISLNRMIATLKL 335
+ G ILLAF T G+ I+ ++ I + +L M++
Sbjct: 278 RALMLGMIADGTGYILLAF------ATRGWMAFPIMVLLASGGIGMP--ALQAMLSRQVD 329

Query: 336 QQQEGKVFGWLDSISKWAGVCAGLIAGFSY 365
++++G++ G L +++ + L+ Y
Sbjct: 330 EERQGQLQGSLAALTSLTSIVGPLLFTAIY 359


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS24790  PF04183       121    3e-31    IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 121 bits (305), Expect = 3e-31
Identities = 61/310 (19%), Positives = 104/310 (33%), Gaps = 36/310 (11%)

Query: 168 LEYDHIAAFLD-HPLYPTARAKLGFNPNDLYNYTTEFRAEFKLNWIAIPKSLSTLSGTLP 226
L D + L HP + + + G+ L Y E+ F+L+W+A+ +
Sbjct: 124 LNADRLQCLLSGHPKFVFNKGRRGWGKEALERYAPEYANTFRLHWLAVKREHMIWRCDNE 183

Query: 227 I-------------FWPSFSSVGLNPTLQQTHTLLPVHPFLI-HRLQDLLDEQGIKLKII 272
+ + FS V L LPVHP+ ++ + +++
Sbjct: 184 MDIHQLLTAAMDPQEFARFSQVWQENGLDHNWLPLPVHPWQWQQKIATDFIADFAEGRMV 243

Query: 273 KAPVSFLTVNPTLSIRSL-SIKNYSHFHLKLPLDIRTLSAKNIRTIKASTINDGHQVQSL 331
S+R+L + +KLPL I S R I I G
Sbjct: 244 SLGEFGDQWLAQQSLRTLTNASRRGGLDIKLPLTIYNTSC--YRGIPGRYIAAGPLASRW 301

Query: 332 LESIRLQDPELKENIFLTTEHTGMHINSHP--------------MLAFILRQYPSQL--N 375
L+ + D L ++ + SH ML I R+ P +
Sbjct: 302 LQQVFATDATLVQSGAVILGEPAAGYVSHEGYAALARAPYRYQEMLGVIWRENPCRWLKP 361

Query: 376 NHWIIPIAALCAK-NNGQLIIQHLINDHFNKDTIQFIKNYFDLTIHTHLNLWLIYGITLE 434
+ + +A L N Q + I D D ++ F + + +L YG+ L
Sbjct: 362 DESPVLMATLMECDENNQPLAGAYI-DRSGLDAETWLTQLFRVVVVPLYHLLCRYGVALI 420

Query: 435 ANQQNSLLII 444
A+ QN L +
Sbjct: 421 AHGQNITLAM 430


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS24820  PF03544       58     3e-12    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 58.4 bits (141), Expect = 3e-12
Identities = 21/98 (21%), Positives = 42/98 (42%), Gaps = 5/98 (5%)

Query: 181 GGESLLPPSYLQKLLIHL-QKYKYYPPFALRRQITGEAKVNIRLTCQGQVESYQLVKKTG 239
+ P + + L + YP A +I G+ KV +T G+V++ Q++
Sbjct: 143 TAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKP 202

Query: 240 SRLLDNAVRQMLKQANPFPPAKVCQVAFNVVVPIEFKI 277
+ + + V+ +++ P + +VV I FKI
Sbjct: 203 ANMFEREVKNAMRRWRYEPG----KPGSGIVVNILFKI 236


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS24825  FERRIBNDNGPP  129    2e-37    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 129 bits (326), Expect = 2e-37
Identities = 71/274 (25%), Positives = 123/274 (44%), Gaps = 20/274 (7%)

Query: 41 KRVITLEHRYTEMVLSLGVIPIGVADIKSYQEYDGVDEKKL-KGVESVGRRAAPNLELIA 99
R++ LE E++L+LG++P GVAD +Y+ + V E L V VG R PNLEL+
Sbjct: 36 NRIVALEWLPVELLLALGIVPYGVADTINYRLW--VSEPPLPDSVIDVGLRTEPNLELLT 93

Query: 100 SLKPDLIIGAKLRNASVYPVLSSISPSLLFNYIQMPNGKEQPLAGLFAEFNTIAKLLGKT 159
+KP ++ + +L+ I+P FN + +QPLA +A LL
Sbjct: 94 EMKPSFMVWS-AGYGPSPEMLARIAPGRGFN----FSDGKQPLAMARKSLTEMADLLNLQ 148

Query: 160 QQAKKIIVNYNKTVTEAKAIIDQLKQQGLLKSDRVAIAQFLPGSSRLRLLTTDSVAIEVL 219
A+ + Y + K + + LL + + + + +S+ E+L
Sbjct: 149 SAAETHLAQYEDFIRSMKPRFVKRGARPLL------LTTLI-DPRHMLVFGPNSLFQEIL 201

Query: 220 KSVGLKAAWPVKGGPSTLGYRTVGIQRLSTLGQTNVFYFNERADDSYLKNTLSNPLWLNL 279
G+ AW +G + G V I RL+ +V F+ + + ++ PLW +
Sbjct: 202 DEYGIPNAW--QGETNFWGSTAVSIDRLAAYKDVDVLCFDH-DNSKDMDALMATPLWQAM 258

Query: 280 PFVKSALTYRFSQQIWPWGGPVALEKFINEVVDN 313
PFV++ R +W +G ++ F+ V+DN
Sbjct: 259 PFVRAGRFQRVP-AVWFYGATLSAMHFV-RVLDN 290


30  PSLF89_RS25465  PSLF89_RS36255  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
PSLF89_RS25465  5       20       0.789264   IS30 family transposase
PSLF89_RS25470  6       22       0.590166   IS3 family transposase
PSLF89_RS25475  5       21       0.199834   hypothetical protein
PSLF89_RS25480  3       16       0.920966   methyl-accepting chemotaxis protein
PSLF89_RS25485  3       18       0.670411   uridine kinase
PSLF89_RS25490  2       18       1.112822   hypothetical protein
PSLF89_RS25495  0       15       -0.082538  hypothetical protein
PSLF89_RS25500  -1      18       -0.111886  dGTPase
PSLF89_RS25505  -2      17       -0.145023  IS30 family transposase
PSLF89_RS25510  1       16       -1.459772  IS982 family transposase
PSLF89_RS25515  0       16       -1.767038  IS3 family transposase
PSLF89_RS36255  2       18       -2.838837  transposase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS25490  IGASERPTASE   59     4e-12    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 58.5 bits (141), Expect = 4e-12
Identities = 39/187 (20%), Positives = 57/187 (30%), Gaps = 10/187 (5%)

Query: 40 AQNSETASHIKLDEPAEELETPAKETATVEAAAPAPEELETPAEETATAEAAAPAPEELE 99
A + K + E T + A E T T E A E E
Sbjct: 1035 ETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKE 1094

Query: 100 TP---AEETATAEAAAPAPEELETPTEETATAEAAAPAPEELET------PAEETATAEA 150
T +ETAT E A E E E +P E+ ET PA E
Sbjct: 1095 TQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVN 1154

Query: 151 AAPAPEELETPAEETATAEAAAPAPEELETPAEETATAEAAAPAPE-ELETPAEETATAE 209
+ T A+ A+ + E+ T + T + PE + T +E
Sbjct: 1155 IKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSE 1214

Query: 210 AAAPVEE 216
++ +
Sbjct: 1215 SSNKPKN 1221



Score = 57.8 bits (139), Expect = 6e-12
Identities = 37/188 (19%), Positives = 58/188 (30%), Gaps = 18/188 (9%)

Query: 35 EIKIDAQNSETASHIKLDEPAEELETPAKETATVEAAAPAPEELETPAEETATAEAAAPA 94
+K + Q +E A E E T KETATVE A E E E +P
Sbjct: 1075 NVKANTQTNEVAQ--SGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPK 1132

Query: 95 PEELET------PAEETATAEAAAPAPEELETPTEETATAEAAAPAPEELETPAEETATA 148
E+ ET PA E + T + A+ + E+ T + T
Sbjct: 1133 QEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTG 1192

Query: 149 EAAAPAPE-ELETPAEETATAEAAAPAPEELETPAEETATAEAAAPAPEELETPAEETAT 207
+ PE + T +E++ + + P +E +
Sbjct: 1193 NSVVENPENTTPATTQPTVNSESSN---------KPKNRHRRSVRSVPHNVEPATTSSND 1243

Query: 208 AEAAAPVE 215
A +
Sbjct: 1244 RSTVALCD 1251



Score = 45.8 bits (108), Expect = 8e-08
Identities = 26/133 (19%), Positives = 42/133 (31%), Gaps = 5/133 (3%)

Query: 87 TAEAAAPAPEELETPAEETATAEAAAP--APEELETPTEETATAEAAAPAPEELETPAEE 144
T P + + P+ + E A AP P + T E A ++ E+ E
Sbjct: 994 TTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQ-ESKTVE 1052

Query: 145 TATAEAAAPAPEELETPAEETATAEAAAPAPEELETPAEETATAEAAAPAPEELETPAEE 204
+A + E E + +A E ++ +E T +E EE
Sbjct: 1053 KNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVE--KEE 1110

Query: 205 TATAEAAAPVEEA 217
A E E
Sbjct: 1111 KAKVETEKTQEVP 1123



Score = 42.4 bits (99), Expect = 1e-06
Identities = 27/170 (15%), Positives = 48/170 (28%), Gaps = 13/170 (7%)

Query: 29 TRKPHTEIKIDAQNSETASHIKLDEPAEELETPAKETATVEAAAPAPEELETPAEETATA 88
T K K+ +Q S + +P E PA+E + T A+ A
Sbjct: 1116 TEKTQEVPKVTSQVSPKQEQSETVQPQAE---PARENDPTVNIKEPQSQTNTTADTEQPA 1172

Query: 89 EAAAPAPEELETPAEETATAEAAAPAPEELETPTEETATAEAAAPAPEELETPAEETATA 148
+ + E+ T + T +E P T P + +
Sbjct: 1173 KETSSNVEQPVTESTTVNTG------NSVVENPENT--TPATTQPTVNSESSNKPKNRHR 1224

Query: 149 EAAAPAPEELETPAEETATAEAAAPAPEELETPAEETATAEAAAPAPEEL 198
+ P +E + A +L + ++A A A
Sbjct: 1225 RSVRSVPHNVEPATTSSNDRSTVALC--DLTSTNTNAVLSDARAKAQFVA 1272


31  PSLF89_RS25575  PSLF89_RS25755  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias     %GCBias  Product
PSLF89_RS25575       2       20   -2.012644  hypothetical protein
PSLF89_RS25580       2       19   -0.954584  hypothetical protein
PSLF89_RS25590       2       27   -1.162824  acyl-CoA thioesterase
PSLF89_RS37415       3       26   -1.820294  thioredoxin fold domain-containing protein
PSLF89_RS37420       2       23   -2.148461  homogentisate 1,2-dioxygenase
PSLF89_RS37425       0       16   -1.320121  ankyrin repeat domain-containing protein
PSLF89_RS36265       1       20   -2.015959  ankyrin repeat domain-containing protein
PSLF89_RS25620       3       21   -2.245209  ankyrin repeat domain-containing protein
PSLF89_RS25625       3       20   -2.238561  hypothetical protein
PSLF89_RS25630       4       21   -2.514169  IS30 family transposase
PSLF89_RS25635       5       20   -2.124104  transposase
PSLF89_RS25640       1       18   -0.690846  transposase
PSLF89_RS25645       0       14   -0.535807  transposase
PSLF89_RS25650      -2       14   -1.141868  threonine/serine dehydratase
PSLF89_RS25655      -2       15   -1.122205  MFS transporter
PSLF89_RS25660      -1       15   -2.081496  DNA polymerase III subunit epsilon
PSLF89_RS25665      -1       15   -1.923707  hypothetical protein
PSLF89_RS25670      -1       17   -2.807871  flagellar motor switch protein FliM
PSLF89_RS25675       0       17   -3.321626  RasGEF domain-containing protein
PSLF89_RS25680      -1       17   -4.545832  hypothetical protein
PSLF89_RS25685      -1       18   -4.517423  chorismate mutase
PSLF89_RS25690      -2       17   -4.060397  sel1 repeat family protein
PSLF89_RS25695      -2       19   -5.379900  IS4 family transposase
PSLF89_RS25700      -3       15   -3.328382  hypothetical protein
PSLF89_RS25705      -3       14   -2.499631  aminopeptidase PepB
PSLF89_RS25710      -2       14   -1.640759  carbohydrate kinase family protein
PSLF89_RS25715      -1       14   -2.173738  hypothetical protein
PSLF89_RS25720       1       14   -1.011483  hypothetical protein
PSLF89_RS25725       0       15   -0.117923  IS3 family transposase
PSLF89_RS25730       2       18   -0.878091  transposase
PSLF89_RS25735       1       23   -1.282454  transposase zinc-binding domain-containing
PSLF89_RS25740       1       20   -1.118193  hypothetical protein
PSLF89_RS25745       0       23   -0.411900  type II toxin-antitoxin system RelE/ParE family
PSLF89_RS25755       2       22   -0.991647  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PSLF89_RS25675  TCRTETB  119  1e-31  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 119 bits (301), Expect = 1e-31
Identities = 103/455 (22%), Positives = 200/455 (43%), Gaps = 19/455 (4%)

Query: 5 STSRRSHKTMLPWLVSFGLFMENLDSTVINTAIPQMAHTLAVNPLSLKLAVTSYLLSLVL 64
S S H +L WL F L+ V+N ++P +A+ P S T+++L+ +
Sbjct: 6 SQSNLRHNQILIWLC-ILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSI 64

Query: 65 FMPISGYLADRLGTKRIFISAITVFTFGSLLCGLSTS-LTMLIIARIIQGIGGAMMVPTG 123
+ G L+D+LG KR+ + I + FGS++ + S ++LI+AR IQG G A
Sbjct: 65 GTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALV 124

Query: 124 RLILVKTFEKSAMINALSNMAVIGQIGPAFGPLLGGALTSYLSWHWVFLINLP-IGVLGI 182
+++ + K A + I +G GP +GG + Y+ HW +L+ +P I ++ +
Sbjct: 125 MVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYI--HWSYLLLIPMITIITV 182

Query: 183 FFAYRWIGENHTAQTPSFDIKGFILFGLGLVTINLFLSLADNRLISLKLLEVSLAIGIIS 242
F + + + + FDIKG IL +G+V LF + L + ++S
Sbjct: 183 PFLMKLLKKEVRIKGH-FDIKGIILMSVGIVFFMLFTTSY---------SISFLIVSVLS 232

Query: 243 LVTYYFYAKNKTYPMISFSPFKTHTFKVAVLGSLWIRITVNSLPFILPLLLQINFGYSAF 302
+ + + + T P + K F + VL I TV ++P +++ S
Sbjct: 233 FLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTA 292

Query: 303 ISG-LLILPYGLGLIGAKFIIKSLLRYLGYRRILLINPCIIALIVLSFAYLNPMSSIFLI 361
G ++I P + +I +I L+ G +L I +++ L+ ++L ++ + +
Sbjct: 293 EIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFL-LETTSWFM 351

Query: 362 AFLCFFAGLVCSVQFSSMQTLNYIDIQDQEKSQATSLASVFQQLAMNLGVCLTA--LSLE 419
+ F S + + T+ ++ QE SL + L+ G+ + LS+
Sbjct: 352 TIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIP 411

Query: 420 FFNVPLQPQKALISLHAFHSSFIFLALVAASSTIV 454
+ L P + S + + + + + + S +V
Sbjct: 412 LLDQRLLPMEVDQSTYLYSNLLLLFSGIIVISWLV 446


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PSLF89_RS25680  SSPAKPROTEIN  29  0.008  Invasion protein B family signature.
		>SSPAKPROTEIN#Invasion protein B family signature.

Length = 133

Score = 29.1 bits (65), Expect = 0.008
Identities = 13/56 (23%), Positives = 28/56 (50%), Gaps = 1/56 (1%)

Query: 30 ELINRRLTGNNYHVYINPERVVDEEAIAVHGITNE-FLQDKPVFAQIANEFYHYIQ 84
++N L +Y + E +E + + + + ++ D VFA+I +EFY ++
Sbjct: 72 NILNLMLMNFSYSINELVELHRSDEYLQLRVVIKDDYVHDGIVFAEILHEFYQRME 127


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PSLF89_RS25690  FLGMOTORFLIM  231  1e-75  Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 231 bits (590), Expect = 1e-75
Identities = 87/344 (25%), Positives = 168/344 (48%), Gaps = 12/344 (3%)

Query: 3 TDLLSQDEIDALLHGVDGSESAEEVEVDPDAPVSI---DFNSQERIVRGRMPTLEMVNER 59
T++LSQDEID LL + +++ E I DF ++ + +M TL +++E
Sbjct: 2 TEVLSQDEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHET 61

Query: 60 FARTFRTTLFNLLRSIPDLSVDGIQMHKFSDYMHTLFVPTSLNMVKMRPLRGNCLFVFDA 119
FAR T+L LRS+ + V + + +++ ++ P++L ++ M PL+GN + D
Sbjct: 62 FARLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEVDP 121

Query: 120 RLIFILVDNFFGSDGRFHAKIEGREFTPTELRIVMLLLETIFIDYKEAWAPVLDVNFEYQ 179
+ F ++D FG G+ R+ T E ++ ++ I + +E+W V+D+
Sbjct: 122 SITFSIIDRLFGGTGQAAKVQ--RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRLG 179

Query: 180 SSEVNPAMANIVGPTEAIFVSTFQIELNGGGGKLQIGFPYPMIEPIRDILDAG--IQSDS 237
E NP A IV P+E + + T + ++ G + PY IEPI L + S
Sbjct: 180 QIETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSSVR 239

Query: 238 TDIDTRWIQSLHHEIAGAPLRVTADLAKVELSAREIMQLKVGQIIPFE---MPEEVQVFV 294
T+++ L +++ + V A++ + LS R+I+ L+VG II + + + +
Sbjct: 240 RSSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVLSI 299

Query: 295 QDTPAYMAKLGQANGNLAIELLKEIDKKTGAAIPFQVRSHEEEQ 338
+ ++ + G +A ++L+ I+ + F+ S +EE+
Sbjct: 300 GNRKKFLCQPGVVGKKIAAQILERIESTSQED--FEELSADEEE 341


32  PSLF89_RS26070  PSLF89_RS26150  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias     %GCBias  Product
PSLF89_RS26070       0       15    3.406263  IS91 family transposase
PSLF89_RS26075       0       22    2.754928  IS30 family transposase
PSLF89_RS26080       4       25    2.294382  chemotaxis protein
PSLF89_RS37445       2       27    2.698960  hypothetical protein
PSLF89_RS26095       3       21    1.147232  hypothetical protein
PSLF89_RS26100       1       15   -0.563018  thioredoxin domain-containing protein
PSLF89_RS26105       1       15   -0.794333  hypothetical protein
PSLF89_RS37450       2       13   -0.637978  amino acid permease
PSLF89_RS26115       0       11   -0.251552  hypothetical protein
PSLF89_RS26120       0       12   -0.799460  protein kinase
PSLF89_RS26125       1       12   -0.548303  hypothetical protein
PSLF89_RS26130       2       20    0.474294  SulP family inorganic anion transporter
PSLF89_RS26135       1       20   -2.329827  rhodanese-related sulfurtransferase
PSLF89_RS26140       2       22   -2.587541  hypothetical protein
PSLF89_RS36315       2       18   -3.316843  transglycosylase SLT domain-containing protein
PSLF89_RS26145       1       16   -3.056232  IS3 family transposase
PSLF89_RS26150       1       19   -3.352160  IS30 family transposase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PSLF89_RS26125  HTHFIS  37  9e-05  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 36.7 bits (85), Expect = 9e-05
Identities = 15/78 (19%), Positives = 27/78 (34%), Gaps = 10/78 (12%)

Query: 186 TALVVDDSLIARKQVKKALDTIGVKSILMRNGREALDYLINVLPGAGGDITQKYLMVIAD 245
T LV DD R + +AL G + N ++ V+ D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDL----------VVTD 54

Query: 246 VEMPEIDGYAFIKACREH 263
V MP+ + + + ++
Sbjct: 55 VVMPDENAFDLLPRIKKA 72


33  PSLF89_RS26220  PSLF89_RS26355  Y  N  Y  Genomic Island
LocusTag        DNBias  CDNBias     %GCBias  Product
PSLF89_RS26220       2       24    1.542382  nucleoside deaminase
PSLF89_RS36320      -1       23    1.496133  hypothetical protein
PSLF89_RS37455      -1       23    0.698365  transposase
PSLF89_RS26240      -2       19    0.853742  hypothetical protein
PSLF89_RS26245      -1       23   -0.138870  IS3 family transposase
PSLF89_RS26250       0       20   -1.272003  hypothetical protein
PSLF89_RS26255       0       19   -2.174638  IS3 family transposase
PSLF89_RS26260       2       20   -2.921128  hypothetical protein
PSLF89_RS26265       2       20   -2.327190  peptide-methionine (R)-S-oxide reductase
PSLF89_RS26270       2       21   -2.189105  IS4 family transposase
PSLF89_RS26275      -1       14    0.073829  peptide-methionine (R)-S-oxide reductase
PSLF89_RS26280       0       16    2.393486  hypothetical protein
PSLF89_RS36325      -2       17    0.333486  hypothetical protein
PSLF89_RS26310      -1       16   -0.904197  IS30 family transposase
PSLF89_RS26315       0       19   -3.236382  transposase
PSLF89_RS26320       0       18   -3.686949  cytochrome d ubiquinol oxidase subunit II
PSLF89_RS26325       1       18   -3.682122  cytochrome ubiquinol oxidase subunit I
PSLF89_RS26335       1       13   -2.590091  hypothetical protein
PSLF89_RS26340       0       13   -2.120599  fatty acid desaturase
PSLF89_RS26350       2       13   -1.778683  WbuC family cupin fold metalloprotein
PSLF89_RS26355       3       17   -1.755576  hypothetical protein
34  PSLF89_RS26680  PSLF89_RS36370  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias     %GCBias  Product
PSLF89_RS26680      -1       15   -4.043053  hypothetical protein
PSLF89_RS26685       0       12   -5.360632  hypothetical protein
PSLF89_RS26690      -1       15   -5.334902  hypothetical protein
PSLF89_RS26700       0       16   -5.150186  type VI secretion system baseplate subunit TssK
PSLF89_RS26705      -1       16   -3.685881  hypothetical protein
PSLF89_RS26710       0       16   -4.155932  type VI secretion system contractile sheath
PSLF89_RS26715       0       16   -4.100653  type VI secretion system contractile sheath
PSLF89_RS26720       2       19   -3.657460  SEC-C domain-containing protein
PSLF89_RS26725       2       21   -3.369198  hypothetical protein
PSLF89_RS26730       2       22   -4.478988  hypothetical protein
PSLF89_RS26735       3       25   -5.879834  hypothetical protein
PSLF89_RS26740       2       25   -5.591616  hypothetical protein
PSLF89_RS26745       2       24   -5.770756  hypothetical protein
PSLF89_RS26750       2       24   -6.053048  hypothetical protein
PSLF89_RS26755       1       21   -6.676553  hypothetical protein
PSLF89_RS26760       1       21   -4.461610  type VI secretion system baseplate subunit TssK
PSLF89_RS26765       2       20   -4.826048  hypothetical protein
PSLF89_RS26770       4       20   -3.884692  IS3 family transposase
PSLF89_RS26775       1       13   -3.105835  IS30 family transposase
PSLF89_RS26780       1       15   -2.017288  type VI secretion system contractile sheath
PSLF89_RS26785       1       13   -3.127015  IS4 family transposase
PSLF89_RS37140       0       13   -4.261789  hypothetical protein
PSLF89_RS26790       0       14   -4.454682  type VI secretion system contractile sheath
PSLF89_RS26795       0       13   -3.496653  hypothetical protein
PSLF89_RS26800       0       13   -2.994798  type VI secretion system contractile sheath
PSLF89_RS26805       0       12   -3.318131  IS982 family transposase
PSLF89_RS26810       0       14   -3.127759  hypothetical protein
PSLF89_RS26815       0       17   -1.167516  efflux RND transporter permease subunit
PSLF89_RS26825       0       17   -1.031465  efflux RND transporter periplasmic adaptor
PSLF89_RS26830       0       16   -2.051449  outer membrane beta-barrel protein
PSLF89_RS26835       1       19   -2.673311  outer membrane beta-barrel protein
PSLF89_RS26840       2       17   -3.819160  outer membrane beta-barrel protein
PSLF89_RS26845       1       17   -4.525847  phosphatase
PSLF89_RS36360       2       17   -6.830034  helix-turn-helix domain-containing protein
PSLF89_RS36365      -2       19   -3.305342  hypothetical protein
PSLF89_RS26850      -1       19   -2.842983  membrane protein insertion efficiency factor
PSLF89_RS26855       0       17   -2.729979  ribosome-associated translation inhibitor RaiA
PSLF89_RS26860       1       14   -2.305448  superoxide dismutase
PSLF89_RS26865       1       14   -1.462001  7-cyano-7-deazaguanine synthase QueC
PSLF89_RS26870       1       13   -1.187781  NAD(P)/FAD-dependent oxidoreductase
PSLF89_RS26875       3       17   -1.199707  PLP-dependent aminotransferase family protein
PSLF89_RS26880       1       21   -0.864939  DMT family transporter
PSLF89_RS26885      -1       21    0.043903  enoyl-CoA hydratase/isomerase family protein
PSLF89_RS26890       2       20    1.091240  hypothetical protein
PSLF89_RS26895       3       22    1.334877  type VI secretion system tip protein VgrG
PSLF89_RS36370       3       20    2.878468  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PSLF89_RS26775  SECA  35  7e-07  SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 35.2 bits (81), Expect = 7e-07
Identities = 9/13 (69%), Positives = 11/13 (84%)

Query: 4 CTCGSGKKHKKCC 16
C CGSGKK+K+C
Sbjct: 885 CPCGSGKKYKQCH 897


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PSLF89_RS26870  ACRIFLAVINRP  742  0.0  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 742 bits (1918), Expect = 0.0
Identities = 306/1041 (29%), Positives = 536/1041 (51%), Gaps = 41/1041 (3%)

Query: 5 DLFIKRPVLACVLSLVIFLTGLIAYNKLAVRQYPAVSANVVTISTSYSGASASLVEAFVT 64
+ FI+RP+ A VL++++ + G +A +L V QYP ++ V++S +Y GA A V+ VT
Sbjct: 3 NFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVT 62

Query: 65 TPLEQALQGISGVDYVSSVS-SAGNSRITVSLNLNADLYQALIEINNDLTPVLKKLPSGV 123
+EQ + GI + Y+SS S SAG+ IT++ D A +++ N L LP V
Sbjct: 63 QVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEV 122

Query: 124 DTPVIKEGDSNSTPMMIISFSSSK--LTPEAINDYLQRVVQPQLANLSGVAQANILGPRV 181
I S+S+ +M+ F S T + I+DY+ V+ L+ L+GV + G +
Sbjct: 123 QQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQ- 181

Query: 182 YAMRLWLNPAKMAALGVTTEDVSTALAANDLFAQAGSIST------NSQVININIESSLN 235
YAMR+WL+ + +T DV L + AG + +I ++
Sbjct: 182 YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241

Query: 236 SAAQFNNLVIKSQQD-QYVRLSDIGYAELGAQTKASSLYVNGKPAVGVGIIAKSDANPLM 294
+ +F + ++ D VRL D+ ELG + +NGKPA G+GI + AN L
Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301

Query: 295 VANTVKNEVAEIQKQLPQGLSVRIARDSSSYIQDSLSEVSHTVMVAIVIVIAVVLLFLGS 354
A +K ++AE+Q PQG+ V D++ ++Q S+ EV T+ AI++V V+ LFL +
Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361

Query: 355 FRALMIPLVTIPVSLVGTFALMYLLGYSINVLTLLAFVLAIGLVVDDAIVVLENVHRYI- 413
RA +IP + +PV L+GTFA++ GYSIN LT+ VLAIGL+VDDAIVV+ENV R +
Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421

Query: 414 EQGFTPFKAALKGAREIRFAIIAMTLTLAAVYAPIGFSTGITGSLFREFAFSLAASVILS 473
E P +A K +I+ A++ + + L+AV+ P+ F G TG+++R+F+ ++ +++ LS
Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481

Query: 474 GIVALTLSPMMCARMMRA-----HHQPAGWQLKIEVCLTRLRDYYSLLLNKVFNNKVNVL 528
+VAL L+P +CA +++ H G+ ++Y+ + K+ + L
Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541

Query: 529 IIAGTVIVCGGIYIIPLVKNSTLAPKEDQNTVIGIVQGSMAASVVNTEAYTSKLRE--LA 586
+I ++ G+ ++ L S+ P+EDQ + ++Q A+ T+ ++ + L
Sbjct: 542 LIY--ALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 587 SKVSGVENVTVING---AGGDQSNAMLMVQLASKSQRS---LSAERIAGQLNKAAARIPG 640
++ + VE+V +NG +G Q+ M V L +R+ SAE + + +I
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659

Query: 641 AKAMFVLPPSLPTSHD-----NYDIEFVIKTNGDYAELETHVNKILQAIHKN-AGFGRVM 694
FV+P ++P + +D E + + + L N++L ++ A V
Sbjct: 660 G---FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVR 716

Query: 695 TDLQFNKPEYNVTIQRDMAARLGVSVSGIASVLTNALAEPQSSEFVNNGLSYYVIPQVIA 754
+ + ++ + + ++ A LGVS+S I ++ AL ++F++ G + Q A
Sbjct: 717 PNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADA 776

Query: 755 SGQGSITGLNQLYVTAESGAKIPLRDLIKVKMTVNPSSLNHFQSQRSVTIQATLSHRYST 814
+ +++LYV + +G +P L + S+ IQ + S+
Sbjct: 777 KFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSS 836

Query: 815 EQALNFLEMIAKKDLTAQMSYATSGNTRQYLEESSSVYFIFIAALLFIYLSLSAQFESFI 874
A+ +E +A K L A + Y +G + Q + + + + ++L L+A +ES+
Sbjct: 837 GDAMALMENLASK-LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWS 895

Query: 875 DPLIILMSVPFSIAGALGTLFLIGGSLNIYTEIGLVTLIGLIAKHGILIVEFANQSQ-KS 933
P+ +++ VP I G L L ++Y +GL+T IGL AK+ ILIVEFA K
Sbjct: 896 IPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKE 955

Query: 934 GESLLVAIKQSARVRFRPILMTTAAMVLGAVPLVFASGAGSEARYQLGWVIVGGMMIGTM 993
G+ ++ A + R+R RPILMT+ A +LG +PL ++GAGS A+ +G ++GGM+ T+
Sbjct: 956 GKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATL 1015

Query: 994 MTLLVLPLMYYLVNTAKTVFK 1014
+ + +P+ + ++ + FK
Sbjct: 1016 LAIFFVPVFFVVI---RRCFK 1033


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PSLF89_RS26875  RTXTOXIND  51  3e-09  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 51.0 bits (122), Expect = 3e-09
Identities = 15/88 (17%), Positives = 39/88 (44%), Gaps = 2/88 (2%)

Query: 69 TLKAQVAGTVTRVAFQSGDKVKQGQLLVSLDSTTAKGQLDKAEADYHLSLLTYQRDQSLF 128
+K V + + G+ V++G +L+ L + A+ K ++ + L R Q L
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILS 157

Query: 129 KNHVLSEQELDQVKFTVKANWALLEQAQ 156
++ + +L ++K + + + + +
Sbjct: 158 RS--IELNKLPELKLPDEPYFQNVSEEE 183



Score = 40.2 bits (94), Expect = 9e-06
Identities = 37/190 (19%), Positives = 71/190 (37%), Gaps = 17/190 (8%)

Query: 99 DSTTAKGQLDKAEADYHLSLLTYQRDQSLFKNHVLSEQELDQVKFTVKANWALLEQAQSA 158
+ K QL++ E++ + YQ LFKN +L + L Q + L + +
Sbjct: 267 ELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDK--LRQTTDNIGLLTLELAKNEER 324

Query: 159 YNKTQVKAPFNGNI-GISDITVGSYLDSGDTIVSLQNLDH-LWVDFNVSSQDSLQVKIDE 216
+ ++AP + + + T G + + +T++ + D L V V ++D + + +
Sbjct: 325 QQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQ 384

Query: 217 IVDITTQAEPMQIA---SGKVVAIEPQINSDTGT----LTLRAQINNT------HYQLLP 263
I +A P GKV I D + + N + L
Sbjct: 385 NAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSS 444

Query: 264 GQLVSVNLYT 273
G V+ + T
Sbjct: 445 GMAVTAEIKT 454


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PSLF89_RS26880  OMPADOMAIN  48  5e-09  OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 48.4 bits (115), Expect = 5e-09
Identities = 42/222 (18%), Positives = 72/222 (32%), Gaps = 48/222 (21%)

Query: 7 LTVMTTLLISASALAAKPGA---YIGLNLGYGGMDTAQLTKNSFRNEASSSASLRGFAGR 63
+ + L A+ A P Y G LG+ N+ + F G
Sbjct: 6 IAIAVALAGFATVAQAAPKDNTWYTGAKLGWSQYHDTGF-INNNGPTHENQLGAGAFGG- 63

Query: 64 INAGYLWSQSSLNYGIELGYATYANNQYSALGKNGEKYNFTYKGYNIDLLGIAQYNFNPN 123
Q + G E+GY Y +NG YK + L Y +
Sbjct: 64 -------YQVNPYVGFEMGYDWLGRMPYKGSVENG-----AYKAQGVQLTAKLGYPITDD 111

Query: 124 WNIFAKVGIAYASQTTSGS-------SEFSHMFAN--KGRLLPKVALGLGYEFTNGIGLN 174
+I+ ++G T + + S +FA + + P++A L Y++TN IG
Sbjct: 112 LDIYTRLGGMVWRADTKSNVYGKNHDTGVSPVFAGGVEYAITPEIATRLEYQWTNNIG-- 169

Query: 175 LTASHIFGNQSTFDGNNNQTIKNNLNKVSPVDMVTVGISYNF 216
+ + M+++G+SY F
Sbjct: 170 --------------------DAHTIGTRPDNGMLSLGVSYRF 191


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PSLF89_RS26885  OMPADOMAIN  49  4e-09  OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 48.8 bits (116), Expect = 4e-09
Identities = 45/216 (20%), Positives = 70/216 (32%), Gaps = 31/216 (14%)

Query: 5 IKLTAIT---ALLISASTLATKPGA---YIGLNLGYGGMDTPNLDLTKINNIANDSHSTR 58
+K TAI AL A+ P Y G LG+ D INN +
Sbjct: 1 MKKTAIAIAVALAGFATVAQAAPKDNTWYTGAKLGWSQYH----DTGFINNNGPTHENQ- 55

Query: 59 GLAGSINAGYLWNKGALNYGFELGYSTYANNQYTAVSVGKKYNFTYSGSSLDLLGVVQYN 118
L GY N GFE+GY + + + G L Y
Sbjct: 56 -LGAGAFGGYQVNPY---VGFEMGYDWLG--RMPYKGSVENGAYKAQGVQLTAKL--GYP 107

Query: 119 INPNWNIFGKAGLSYVSQKTTGDGILSLAADSKSKMRPKFALGAGYGFDNGIGLNVMASH 178
I + +I+ + G T + + + + P FA G Y +
Sbjct: 108 ITDDLDIYTRLGGMVWRADTKSNVYGK---NHDTGVSPVFAGGVEY--------AITPEI 156

Query: 179 TFGTKPQVSNNIISIKDDVNKVAPIDMITVGITYNF 214
+ Q +NNI + M+++G++Y F
Sbjct: 157 ATRLEYQWTNNIGD-AHTIGTRPDNGMLSLGVSYRF 191


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PSLF89_RS26890  OUTRMMBRANEA  40  1e-06  Outer membrane protein A signature.
		>OUTRMMBRANEA#Outer membrane protein A signature.

Length = 346

Score = 40.3 bits (94), Expect = 1e-06
Identities = 40/176 (22%), Positives = 54/176 (30%), Gaps = 27/176 (15%)

Query: 5 IKLTAIT---ALLISASALAAKPGA---YIGLNLGYGGMDTPSVNFKNKYPGVHSYSHSS 58
+K TAI AL A+ A P Y G LG+ F N H +
Sbjct: 1 MKKTAIAIAVALAGFATVAQAAPKDNTWYTGAKLGWS--QYHDTGFINNNGPTHENQLGA 58

Query: 59 RGFAGRINAGYLWNQGSLNYGVELGYATYANSKYSVTNKDDTRTLKYSGTNIDLLGVIQY 118
F G Q + G E+GY Y K Y + L + Y
Sbjct: 59 GAFGGY--------QVNPYVGFEMGYDWLGRMPY----KGSVENGAYKAQGVQLTAKLGY 106

Query: 119 NFTPNWNIFAKAGLAYVTQKTSGSNAFKLEFESNNKVLSEVALGAGY----EFALR 170
T + +I+ + G T + K + V A G Y E A R
Sbjct: 107 PITDDLDIYTRLGGMVWRADTKSNVYGK---NHDTGVSPVFAGGVEYAITPEIATR 159


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PSLF89_RS26900  STREPKINASE  27  0.008  Streptococcus streptokinase protein signature.
		>STREPKINASE#Streptococcus streptokinase protein signature.

Length = 440

Score = 27.4 bits (60), Expect = 0.008
Identities = 18/50 (36%), Positives = 28/50 (56%), Gaps = 2/50 (4%)

Query: 41 HQRLSTHTSPDVISQELIR-EHNIQVSESTI-YRYIYDDRERGGELYKNL 88
H +L T DV + EL++ E + SE + +R +YD R++ LY NL
Sbjct: 317 HLKLFTIKYVDVDTNELLKSEQLLTASERNLDFRDLYDPRDKAKLLYNNL 366


35  PSLF89_RS27635  PSLF89_RS27720  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias     %GCBias  Product
PSLF89_RS27635      -1       19    4.789923  IS982-like element ISPsa1 family transposase
PSLF89_RS27640       1       19    5.254895  HU family DNA-binding protein
PSLF89_RS27645       1       19    5.433809  glycine zipper 2TM domain-containing protein
PSLF89_RS27650       1       19    5.164830  DNA repair protein RadA
PSLF89_RS27655       1       24    5.369365  glucosaminidase domain-containing protein
PSLF89_RS27660       1       25    4.672060  alanine racemase
PSLF89_RS27665       1       20    1.911542  replicative DNA helicase
PSLF89_RS27675       0       19   -1.623818  50S ribosomal protein L9
PSLF89_RS27680       0       19   -2.395121  30S ribosomal protein S18
PSLF89_RS27685       2       21   -1.180792  30S ribosomal protein S6
PSLF89_RS27690       5       20   -1.191234  transporter substrate-binding domain-containing
PSLF89_RS27695       4       17   -2.040896  GAF domain-containing protein
PSLF89_RS36415       4       15   -2.013671  3-dehydroquinate synthase
PSLF89_RS27700       2       16   -1.738752  chorismate synthase
PSLF89_RS27710       1       17   -1.600192  hypothetical protein
PSLF89_RS27715       1       18   -2.284173  3-phosphoshikimate 1-carboxyvinyltransferase
PSLF89_RS27720      -1       12   -3.374541  chorismate mutase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PSLF89_RS27775  DNABINDINGHU  31  5e-04  Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 30.8 bits (70), Expect = 5e-04
Identities = 27/97 (27%), Positives = 43/97 (44%), Gaps = 9/97 (9%)

Query: 30 TKIQVVNSIAEETGLPKKDVLQVFESLRILISRHMKKRGSGEFTIPEVGVKIRRSKKAAT 89
K ++ +AE T L KKD +++ +S ++ K + + G ++AA
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAK--GEKVQLIGFG-NFEVRERAAR 59

Query: 90 KARTIISPFNGEEIHVPAKPARTTVKVTALKALKETV 126
K R +P GEEI + A A KALK+ V
Sbjct: 60 KGR---NPQTGEEIKI---KASKVPAFKAGKALKDAV 90


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PSLF89_RS27795  ALARACEMASE  376  e-132  Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 376 bits (968), Expect = e-132
Identities = 135/363 (37%), Positives = 209/363 (57%), Gaps = 10/363 (2%)

Query: 1 MGRATCVLLSRSALLHNLKKVREKAPNSKILAMVKAYGYGHSFEVAKYLDKKVDGFGVAA 60
M R L AL NL VR+ A ++++ ++VKA YGH E DGF +
Sbjct: 1 MTRPIQASLDLQALKQNLSIVRQAATHARVWSVVKANAYGHGIERIWSAIGATDGFALLN 60

Query: 61 IEEAIQLREQGICSPIVLMEGVFSPEELRLVENYNFSIVIHCQEQVDWLHHHALVAKSVD 120
+EEAI LRE+G PI+++EG F ++L + + + + +H Q+ L + A + +D
Sbjct: 61 LEEAITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQN-ARLKAPLD 119

Query: 121 VWLKLDTGMGRLGFDCTSGKCLDYLSAIYLSLKKSNKVGQLGLMSHFACADEPYHQLNSR 180
++LK+++GM RLGF D + ++ L+ VG++ LMSHFA A+ P
Sbjct: 120 IYLKVNSGMNRLGFQ------PDRVLTVWQQLRAMANVGEMTLMSHFAEAEHPDGISG-- 171

Query: 181 QISAFQQVRKKFPGPYSCVNSAAIFNFSYERYDWVRPGIMLYGISPFAD-KNGVDLELQP 239
++ +Q + S NSAA +DWVRPGI+LYG SP ++ + L+P
Sbjct: 172 AMARIEQAAEGLECRRSLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWRDIANTGLRP 231

Query: 240 VMHVVSRLISVKQLRQGESVGYGATWQCPEDMQVGILSLGYGDGYPRLAASGTPFLVRGQ 299
VM + S +I V+ L+ GE VGYG + ++ ++GI++ GY DGYPR A +GTP LV G
Sbjct: 232 VMTLSSEIIGVQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPTGTPVLVDGV 291

Query: 300 RCALIGRVSMDMIAIDLRRCPDAGVGEAVTVWGQDLPVEEIARHVGTIAYELVCNMPLRA 359
R +G VSMDM+A+DL CP AG+G V +WG+++ ++++A GT+ YEL+C + LR
Sbjct: 292 RTMTVGTVSMDMLAVDLTPCPQAGIGTPVELWGKEIKIDDVAAAAGTVGYELMCALALRV 351

Query: 360 PYI 362
P +
Sbjct: 352 PVV 354


36  PSLF89_RS27880  PSLF89_RS28175  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias     %GCBias  Product
PSLF89_RS27880       6       16   -0.804471  NADP-dependent isocitrate dehydrogenase
PSLF89_RS27890       3       18   -1.337082  PilZ domain-containing protein
PSLF89_RS36435       2       18   -1.843499  OmpA family protein
PSLF89_RS27900       0       18   -3.167521  flagellar motor protein PomA
PSLF89_RS27905      -1       15   -1.906230  polyprenyl synthetase family protein
PSLF89_RS27910       0       15    0.096274  GTP cyclohydrolase FolE2
PSLF89_RS27915      -1       10    0.425173  hypothetical protein
PSLF89_RS27920      -1       11    0.661454  hypothetical protein
PSLF89_RS27930       0       12    0.922667  transposase
PSLF89_RS27935      -1       10    0.675557  IS982 family transposase
PSLF89_RS27940      -1       10    0.833129  transposase
PSLF89_RS27955      -1       13   -4.024284  IS3 family transposase
PSLF89_RS27965      -1       15   -3.818809  IS3 family transposase
PSLF89_RS27975       0       21   -3.613990  hypothetical protein
PSLF89_RS27980       1       21   -0.658866  IS4 family transposase
PSLF89_RS27995       0       14    0.844289  hypothetical protein
PSLF89_RS28000       0       14    1.258877  transposase
PSLF89_RS36440      -1       18    1.921781  IS3 family transposase
PSLF89_RS28010      -2       15    2.234450  IS982 family transposase
PSLF89_RS28015      -1       14    2.661276  transposase
PSLF89_RS28020      -2       18    3.250088  transposase
PSLF89_RS28025      -2       18    2.845145  hypothetical protein
PSLF89_RS28030      -1       20    1.803107  transposase
PSLF89_RS28035      -2       20   -0.619633  SpoIIE family protein phosphatase
PSLF89_RS28040      -1       15   -2.078291  hypothetical protein
PSLF89_RS28045       1       16   -3.057248  hypothetical protein
PSLF89_RS28050       0       13   -3.452146  hypothetical protein
PSLF89_RS28055      -1       13   -2.504664  hypothetical protein
PSLF89_RS28070       0       15   -2.049217  TauD/TfdA family dioxygenase
PSLF89_RS28075       1       15   -2.232565  ATP-binding protein
PSLF89_RS28080       1       16   -2.137905  STAS domain-containing protein
PSLF89_RS28085       0       19   -1.922971  IS6-like element ISPsa2 family transposase
PSLF89_RS28095       2       18   -3.414595  VWA domain-containing protein
PSLF89_RS28105       2       17   -2.513381  VWA domain-containing protein
PSLF89_RS28110       2       17   -1.718143  IS30 family transposase
PSLF89_RS28115       4       15   -1.535061  VWA domain-containing protein
PSLF89_RS28120       6       16    0.317594  DUF4381 domain-containing protein
PSLF89_RS28125       5       15    1.255917  DUF58 domain-containing protein
PSLF89_RS28130       1       20   -6.308891  MoxR family ATPase
PSLF89_RS37155       2       23   -8.154078  NADP-dependent oxidoreductase
PSLF89_RS28135       3       24   -8.855768  hypothetical protein
PSLF89_RS28140       3       28  -11.638852  transposase
PSLF89_RS36445       5       28  -12.378568  hypothetical protein
PSLF89_RS28150       5       29  -12.078957  IS6-like element ISPsa2 family transposase
PSLF89_RS28155       5       27  -11.545716  BatD family protein
PSLF89_RS28160       4       26  -10.898972  hypothetical protein
PSLF89_RS28165       2       24   -9.344881  hypothetical protein
PSLF89_RS28170       1       18   -5.985040  metal/formaldehyde-sensitive transcriptional
PSLF89_RS28175      -1       14   -3.472244  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PSLF89_RS28025  OMPADOMAIN  49  8e-09  OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 49.2 bits (117), Expect = 8e-09
Identities = 30/122 (24%), Positives = 49/122 (40%), Gaps = 17/122 (13%)

Query: 160 FQSGSEWVRPTFLPVIAKIANVLKKTK---GNIIVAGHTDNLRISNARFRSNWDLSAARA 216
F ++P + ++ + L G+++V G+TD S+A N LS RA
Sbjct: 223 FNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDR-IGSDA---YNQGLSERRA 278

Query: 217 VSVALALFRDKGLEQKRFMVVGYADTQALEDNKTAANRSK---------NRRVEITVVFG 267
SV L KG+ + G ++ + N + + +RRVEI V
Sbjct: 279 QSVVDYL-ISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEVKGI 337

Query: 268 KD 269
KD
Sbjct: 338 KD 339


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PSLF89_RS28125  HTHFIS  27  0.031  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 26.7 bits (59), Expect = 0.031
Identities = 10/43 (23%), Positives = 19/43 (44%), Gaps = 1/43 (2%)

Query: 4 RHLNEKDRFYIEQRLSE-GDSLRSIARALGFSPSTISREIKRH 45
R L E + I L+ + A LG + +T+ ++I+
Sbjct: 431 RVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIREL 473


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PSLF89_RS28225  HTHFIS  38  4e-05  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 37.9 bits (88), Expect = 4e-05
Identities = 40/163 (24%), Positives = 63/163 (38%), Gaps = 28/163 (17%)

Query: 25 NHHIIGQ----KELLLMLQIALLADGHLLVEGAPGLAKTT---AIKALSHYVEGDFQRIQ 77
++G+ +E+ +L + D L++ G G K A+ G F I
Sbjct: 136 GMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAIN 195

Query: 78 ---FTPDLLPSDITG------TDVFRPQTGEF-YFQHGPLFHPIILADEINRASAKVQSA 127
DL+ S++ G T TG F + G LF DEI Q+
Sbjct: 196 MAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLF-----LDEIGDMPMDAQTR 250

Query: 128 LLEAMGERQIT-VGNTTYPLPQLFLVMATQN-PLEQ---EGTF 165
LL + + + T VG T P+ ++A N L+Q +G F
Sbjct: 251 LLRVLQQGEYTTVGGRT-PIRSDVRIVAATNKDLKQSINQGLF 292


37  PSLF89_RS28255  PSLF89_RS28360  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias     %GCBias  Product
PSLF89_RS28255       2       21   -1.216787  DUF4325 domain-containing protein
PSLF89_RS28260       2       20    0.244152  flavohemoglobin expression-modulating QEGLA
PSLF89_RS28265       2       18    0.238998  glutathione synthase
PSLF89_RS28270       3       21   -0.450746  ABC transporter ATP-binding protein
PSLF89_RS28275       4       22   -0.790260  hypothetical protein
PSLF89_RS28285       2       19   -1.917413  hypothetical protein
PSLF89_RS36450       0       18   -0.414983  transposase
PSLF89_RS36455       0       18   -0.317187  hypothetical protein
PSLF89_RS36460       1       17   -0.070342  hypothetical protein
PSLF89_RS36465      -1       20   -0.500929  hypothetical protein
PSLF89_RS28310       0       19   -1.713623  hypothetical protein
PSLF89_RS28315      -1       20   -2.309507  hypothetical protein
PSLF89_RS28320      -1       17   -3.491086  chemotaxis protein
PSLF89_RS28325       0       16   -3.776281  hypothetical protein
PSLF89_RS28330      -1       16   -3.052541  hypothetical protein
PSLF89_RS28335       2       17   -2.156823  IS30 family transposase
PSLF89_RS28345       1       14   -1.261781  transposase
PSLF89_RS28350       2       14   -0.170664  IS4 family transposase
PSLF89_RS28360       2       16   -0.600162  pentapeptide repeat-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PSLF89_RS28460  HTHFIS  52  9e-10  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 52.1 bits (125), Expect = 9e-10
Identities = 29/138 (21%), Positives = 59/138 (42%), Gaps = 20/138 (14%)

Query: 174 LSSVSLLIVDDSSFARNHLIKILSKLDINVVACNSGADAFAYLKKVANEESDADISKKIP 233
++ ++L+ DD + R L + LS+ +V ++ A + +A + D
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATL---WRWIAAGDGD-------- 49

Query: 234 VVITDAEMPEMDGYTLTVKCRE-DPKLKDLFIVLHTSLSGEFNKAMVE--HVGCNDFIAK 290
+V+TD MP+ + + L + ++ P L L + + ++ G D++ K
Sbjct: 50 LVVTDVVMPDENAFDLLPRIKKARPDLPVLVMSAQNTFM-----TAIKASEKGAYDYLPK 104

Query: 291 -FDPTKTLHIIQERLKAL 307
FD T+ + II L
Sbjct: 105 PFDLTELIGIIGRALAEP 122


38  PSLF89_RS28435  PSLF89_RS28515  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias     %GCBias  Product
PSLF89_RS28435       1       19   -3.629332  hypothetical protein
PSLF89_RS28440       2       17   -5.234674  IS982 family transposase
PSLF89_RS37000       0       18   -5.692470  HAMP domain-containing protein
PSLF89_RS28445       0       20   -5.921830  response regulator transcription factor
PSLF89_RS36475      -1       19   -4.375619  Spy/CpxP family protein refolding chaperone
PSLF89_RS28455      -2       18   -3.482236  hypothetical protein
PSLF89_RS28460      -1       16   -2.304304  Rossmann-like and DUF2520 domain-containing
PSLF89_RS28465      -1       18   -1.443719  Maf family nucleotide pyrophosphatase
PSLF89_RS28475      -2       17   -2.809031  signal peptide peptidase SppA
PSLF89_RS28480      -3       18   -2.473307  FUSC family protein
PSLF89_RS36480      -2       18   -2.749466  Na+/H+ antiporter NhaA
PSLF89_RS28495      -1       22   -3.079274  NAD-glutamate dehydrogenase
PSLF89_RS28500      -1       15   -3.218734  DUF2835 family protein
PSLF89_RS28505      -2       13   -1.155610  PrkA family serine protein kinase
PSLF89_RS28510      -1       12   -0.482639  YeaH/YhbH family protein
PSLF89_RS28515       2       15   -2.561643  SpoVR family protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PSLF89_RS28560  PF06580  34  0.001  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 33.7 bits (77), Expect = 0.001
Identities = 10/42 (23%), Positives = 22/42 (52%)

Query: 348 VENILRNALFHTESGTEVRVSSRYDESSVIISVEDSGSGVFE 389
VEN +++ + G ++ + D +V + VE++GS +
Sbjct: 264 VENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALK 305


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PSLF89_RS28565  HTHFIS  101  7e-27  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 101 bits (253), Expect = 7e-27
Identities = 37/127 (29%), Positives = 68/127 (53%), Gaps = 1/127 (0%)

Query: 1 MSKNANVLLIDDDLELCELLIRYLTVEEFNVKAVHHGDEALSQLQAQHYDVAVLDVMLPG 60
M+ A +L+ DDD + +L + L+ ++V+ + + A D+ V DV++P
Sbjct: 1 MTG-ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPD 59

Query: 61 QSGFDVLKEMRKQQIETPVLMLTARGEEVDRIVGLELGADDYLPKPCNPRELVARLRAVL 120
++ FD+L ++K + + PVL+++A+ + I E GA DYLPKP + EL+ + L
Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119

Query: 121 RRTTAKP 127
+P
Sbjct: 120 AEPKRRP 126


39  PSLF89_RS28715  PSLF89_RS28795  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
PSLF89_RS28715  1       17       -4.293740  DNA-directed RNA polymerase subunit omega
PSLF89_RS28725  1       16       -4.978427  bifunctional (p)ppGpp
PSLF89_RS28730  1       18       -5.375085  GGDEF domain-containing protein
PSLF89_RS28735  2       19       -5.849883  MFS transporter
PSLF89_RS28745  0       14       -3.955606  IS30 family transposase
PSLF89_RS28750  -2      11       -0.825662  hypothetical protein
PSLF89_RS28755  -2      12       0.460710   IS4 family transposase
PSLF89_RS28760  -3      13       1.143544   hypothetical protein
PSLF89_RS28765  -3      13       1.715046   IS4 family transposase
PSLF89_RS28770  -2      19       2.766766   zf-TFIIB domain-containing protein
PSLF89_RS28775  -2      15       3.758284   MFS transporter
PSLF89_RS28780  -1      14       3.409068   IS982 family transposase
PSLF89_RS28785  0       16       4.002686   hypothetical protein
PSLF89_RS28790  1       26       4.922376   thiamine-phosphate kinase
PSLF89_RS28795  -1      24       3.799263   transcription antitermination factor NusB
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
PSLF89_RS28880  TCRTETB  141    2e-39    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 141 bits (358), Expect = 2e-39
Identities = 91/361 (25%), Positives = 155/361 (42%), Gaps = 15/361 (4%)

Query: 14 ILIGTAISGFMIGIDYTIVNMAIASIQTELTVNTNQLQWLMSGFGITFCAFLASMGKLAD 73
ILI I F ++ ++N+++ I + W+ + F +TF A GKL+D
Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74

Query: 74 IVGRRRLLFIGISGFGLASLGAGFSNSIISLVIF-RLLQGVFGAIILPAGMALTASAFPA 132
+G +RLL GI S+ +S SL+I R +QG A M + A P
Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134

Query: 133 KEQGRAMGIYNGILGLGLAFGPVFGGIILSFMSWHWIFFINIPIIIISLCICYFTIQGRD 192
+ +G+A G+ I+ +G GP GG+I ++ HW + + IP+I I + ++
Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYI--HWSYLLLIPMITIITVPFLMKLLKKE 192

Query: 193 CKTDQEMDWLGIALIAATLMSFVYVVNQATITGWASSLVIYPFIASFVFLGIFITVEAKS 252
+ D GI L++ ++ F+ +I+ I S + IF+ K
Sbjct: 193 VRIKGHFDIKGIILMSVGIVFFMLFTTSYSISF---------LIVSVLSFLIFVKHIRKV 243

Query: 253 NSPFLPMSLFSNRGFFLGATAYMIAAGFAWPIIFLVPLYLQQVLGYSVYSA-SIALIPMT 311
PF+ L N F +G I G + +VP ++ V S S+ + P T
Sbjct: 244 TDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGT 303

Query: 312 LMTAILPPLTGKIYDHKGAFTCFILLATCLILSFLL--FLTFTTQTHLTVLLLTFLLFGA 369
+ I + G + D +G + T L +SFL FL TT +T++++ L +
Sbjct: 304 MSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLS 363

Query: 370 A 370

Sbjct: 364 F 364



Score = 32.5 bits (74), Expect = 0.004
Identities = 23/97 (23%), Positives = 36/97 (37%), Gaps = 4/97 (4%)

Query: 69 GKLADIVGRRRLLFIGISGFGLASLGAGF---SNSIISLVIFRLLQGVFGAIILPAGMAL 125
G L D G +L IG++ ++ L A F + S +I + G +
Sbjct: 314 GILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIV 373

Query: 126 TASAFPAKEQGRAMGIYNGILGLGLAFGPVFGGIILS 162
++S E G M + N L G G +LS
Sbjct: 374 SSSLKQQ-EAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


40  PSLF89_RS28845  PSLF89_RS29080  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
PSLF89_RS28845  0       22       -3.699986  methyltransferase domain-containing protein
PSLF89_RS36490  -1      23       -4.507346  transposase
PSLF89_RS28850  0       22       -4.080814  DDE-type integrase/transposase/recombinase
PSLF89_RS28855  -1      19       -2.731187  integrase core domain-containing protein
PSLF89_RS28860  -1      16       -1.393825  hypothetical protein
PSLF89_RS28865  -1      15       -1.865721  transposase
PSLF89_RS28870  0       18       -1.244190  IS30 family transposase
PSLF89_RS28875  -1      18       -0.821076  SGNH/GDSL hydrolase family protein
PSLF89_RS28880  -1      17       -1.262602  VOC family protein
PSLF89_RS28885  -1      12       -2.245126  helix-hairpin-helix domain-containing protein
PSLF89_RS28890  -1      13       -3.115205  LapA family protein
PSLF89_RS28895  0       15       -0.802971  30S ribosomal protein S1
PSLF89_RS28900  -2      15       0.292115   (d)CMP kinase
PSLF89_RS28915  -1      21       2.893024   3-phosphoserine/phosphohydroxythreonine
PSLF89_RS28920  1       23       4.672526   DNA gyrase subunit A
PSLF89_RS28925  1       22       5.008323   hypothetical protein
PSLF89_RS28930  0       17       5.208149   TRZ/ATZ family hydrolase
PSLF89_RS28935  -1      15       3.389427   bifunctional 2-polyprenyl-6-hydroxyphenol
PSLF89_RS28940  -1      15       2.867317   HAD-IA family hydrolase
PSLF89_RS28950  -1      12       0.782108   VOC family protein
PSLF89_RS28955  1       14       -0.442099  VOC family protein
PSLF89_RS28970  1       16       -1.846861  metalloregulator ArsR/SmtB family transcription
PSLF89_RS28975  3       15       -0.167483  IS4 family transposase
PSLF89_RS28990  0       16       -0.749671  WD40 repeat domain-containing protein
PSLF89_RS37170  0       17       -0.426517  hypothetical protein
PSLF89_RS28995  -1      18       -1.751740  PAS domain-containing protein
PSLF89_RS29000  2       17       -0.022624  hypothetical protein
PSLF89_RS29005  1       18       2.148044   glutamine-hydrolyzing carbamoyl-phosphate
PSLF89_RS29010  2       18       2.833857   carbamoyl-phosphate synthase
PSLF89_RS29015  2       19       3.592468   aspartate carbamoyltransferase
PSLF89_RS29020  2       18       3.929252   orotate phosphoribosyltransferase
PSLF89_RS29025  2       18       4.151645   orotidine-5'-phosphate decarboxylase
PSLF89_RS29030  0       15       3.706617   ATP-dependent chaperone ClpB
PSLF89_RS29035  0       13       2.898536   FAM83 family protein
PSLF89_RS29040  0       15       2.491942   hypothetical protein
PSLF89_RS36500  0       23       0.942890   transposase
PSLF89_RS29050  0       18       -0.010693  transposase
PSLF89_RS29055  0       15       -0.646457  hypothetical protein
PSLF89_RS29060  -1      12       -2.973618  hypothetical protein
PSLF89_RS29065  -1      15       -4.204780  MFS transporter
PSLF89_RS36505  1       17       -4.500401  methionine--tRNA ligase
PSLF89_RS29070  0       16       -4.228661  RnfABCDGE type electron transport complex
PSLF89_RS29075  -1      16       -4.550235  endonuclease III
PSLF89_RS29080  -1      16       -3.702172  DUF1841 family protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS29025  ACRIFLAVINRP  29     0.047    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 29.4 bits (66), Expect = 0.047
Identities = 16/105 (15%), Positives = 33/105 (31%), Gaps = 3/105 (2%)

Query: 318 PSKLVAVGDEIDVLVLEIDEERRRISLGMKQCVPNPWKKFSDKYNKGDKVAGKIKSITDF 377
L ++ ++ + +I+ G P + + N + K+ +F
Sbjct: 190 ADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQ---QLNASIIAQTRFKNPEEF 246

Query: 378 GLFIGLDGGIDGLVHLSDISWNENGEEAVRNYKKGEEVEAVVLSV 422
G +V L D++ E G E + A L +
Sbjct: 247 GKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGI 291


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
PSLF89_RS29055  UREASE  32     0.004    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 32.4 bits (74), Expect = 0.004
Identities = 14/27 (51%), Positives = 17/27 (62%)

Query: 347 TLNGAKALGIDHITGSLETNKAADLAI 373
T+N A A G+ H GSLE K ADL +
Sbjct: 410 TINPAIAHGLSHEIGSLEVGKRADLVL 436


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS29060  DHBDHDRGNASE  29     0.013    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 29.2 bits (65), Expect = 0.013
Identities = 24/97 (24%), Positives = 40/97 (41%), Gaps = 10/97 (10%)

Query: 54 KDILDLGCGGGI---LSESLAKEGAQVTAIDMSKDVLNAAKLHKLESQLDIDYQHISAEE 110
K G GI ++ +LA +GA + A+D N KL K+ S L + +H A
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVD-----YNPEKLEKVVSSLKAEARHAEAFP 63

Query: 111 LAKQSPGKFEIITCMEMLEHVPDPLSILHACKTLLKP 147
+ + I + + P+ IL +L+P
Sbjct: 64 ADVRDSAAIDEI-TARIEREM-GPIDILVNVAGVLRP 98


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
PSLF89_RS29135  HTHFIS  39     5e-05    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 39.4 bits (92), Expect = 5e-05
Identities = 32/178 (17%), Positives = 59/178 (33%), Gaps = 28/178 (15%)

Query: 552 MRSEREKLLKMEEVLHDQVIGQSEAVTAVANAIRRSRAGLSDPSRPIGSFLFLGPTGVGK 611
R L+ + ++G+S A+ + + R L + + G +G GK
Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLAR----LMQTDLTL---MITGESGTGK 173

Query: 612 TELSKALARFLFDTDEATIRIDMSEYMEKHAVARLVGAPPGYVGYEEGGQLTEQVRRRPY 671
+++AL + + + I+M+ + L G E G T R
Sbjct: 174 ELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGH--------EKGAFTGAQTRSTG 225

Query: 672 SV-------ILFDEVEKAHPDVFNLLLQVLDDG---RLTDSQGRTVDFRNTVVIMTSN 719
+ DE+ D LL+VL G + D R ++ +N
Sbjct: 226 RFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVR---IVAATN 280



Score = 33.7 bits (77), Expect = 0.004
Identities = 17/91 (18%), Positives = 34/91 (37%), Gaps = 10/91 (10%)

Query: 135 NVEAAIERVRGG-------QNINDQAAEGNRQALDKYTIDLTERAEAG-KIDPVIGRDEE 186
AI+ G + +AL + ++ + P++GR
Sbjct: 86 TFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAA 145

Query: 187 IRRTVQVLQRRTKNN-PVLI-GEPGVGKTAI 215
++ +VL R + + ++I GE G GK +
Sbjct: 146 MQEIYRVLARLMQTDLTLMITGESGTGKELV 176


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
PSLF89_RS29170  TCRTETB  51     5e-09    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 51.0 bits (122), Expect = 5e-09
Identities = 68/374 (18%), Positives = 150/374 (40%), Gaps = 45/374 (12%)

Query: 35 FYEFIQMNMFNSLAGSLAATFHLSAFQIGLVSAFYFLADSILLYPAGVLVDRLSSRRVII 94
F+ + + N +A F+ V+ + L SI G L D+L +R+++
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 95 YGMVMCIIGTLLIASASSSWF-LVVARFLEGISAAFCLLSILRLASQWLPENRMATASGL 153
+G+++ G+++ S + L++ARF++G AA ++ + ++++P+ A GL
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 154 IVTIGMLGGAVSQTPLTMMIEQWGWREALYVVALIGVILLVVVVIVVKDAPTAKQHAKLQ 213
I +I +G V M+ W L ++ +I +I + ++ ++K K H ++
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHW-SYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIK 202

Query: 214 TTVDKVGFWQTLVILVKNPQNWL--------IGLYIALM--------------NLPIMLL 251
+ ++ + + +++ + N+P M+
Sbjct: 203 GIILMSVGIVFFMLFTTS-YSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIG 261

Query: 252 ----GGLFGS---------HFMQQGHGFSATEAATINMMIFIGTIVGSTVVGFVSDFLKQ 298
G +FG+ + M+ H S E ++ +IF GT + + G++ L
Sbjct: 262 VLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSV--IIFPGT-MSVIIFGYIGGILVD 318

Query: 299 RRLP---MIIASVVSLVIFLII-LYAHALTYAGYISLFFLLGFFTAAQILGYPAAQASNP 354
RR P + I V FL ++ I + F+LG + + + +S
Sbjct: 319 RRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSLK 378

Query: 355 AKIVGSALGFVSVI 368
+ G+ + ++
Sbjct: 379 QQEAGAGMSLLNFT 392


41  PSLF89_RS29485  PSLF89_RS29545  Y  N  Y  Genomic Island
LocusTag        DNBias  CDNBias  %GCBias    Product
PSLF89_RS29485  0       23       -3.625430  hypothetical protein
PSLF89_RS29490  0       21       -4.063144  hypothetical protein
PSLF89_RS37175  -2      22       -2.440569  IS30 family transposase
PSLF89_RS29495  -1      16       -2.010276  hypothetical protein
PSLF89_RS29500  -2      14       0.033915   transposase
PSLF89_RS37515  -2      14       2.094197   helix-turn-helix domain-containing protein
PSLF89_RS29510  -2      14       3.087082   queuosine precursor transporter
PSLF89_RS29520  -1      15       3.931618   hypothetical protein
PSLF89_RS29525  -1      17       5.017930   oligopeptide:H+ symporter
PSLF89_RS29530  1       19       5.612172   integrase core domain-containing protein
PSLF89_RS29535  0       20       5.898792   hypothetical protein
PSLF89_RS29540  -2      30       4.827273   aminomethyl-transferring glycine dehydrogenase
PSLF89_RS29545  -2      29       3.714085   aminomethyl-transferring glycine dehydrogenase
42  PSLF89_RS29595  PSLF89_RS29955  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
PSLF89_RS29595  1       21       -5.050393  zinc-finger domain-containing protein
PSLF89_RS29600  2       20       -5.187185  bifunctional [glutamate--ammonia
PSLF89_RS36540  2       19       -6.007165  4-hydroxybenzoate octaprenyltransferase
PSLF89_RS29615  -1      17       -5.038337  chorismate lyase
PSLF89_RS29620  -2      15       -4.783380  alpha/beta hydrolase
PSLF89_RS29625  -2      15       -3.804250  hypothetical protein
PSLF89_RS29630  -1      13       -0.022760  protease inhibitor I42 family protein
PSLF89_RS29635  -1      16       2.054095   FAD-binding oxidoreductase
PSLF89_RS29640  -2      18       3.753771   GIY-YIG nuclease family protein
PSLF89_RS29645  -1      19       4.258382   hypothetical protein
PSLF89_RS29650  -1      20       4.589358   hypothetical protein
PSLF89_RS29655  0       21       4.354094   hypothetical protein
PSLF89_RS29660  0       21       3.678093   IS30 family transposase
PSLF89_RS29670  -1      21       3.552904   winged helix-turn-helix domain-containing
PSLF89_RS29675  -1      21       3.502133   phosphate regulon sensor histidine kinase PhoR
PSLF89_RS29685  -2      21       3.701782   hypothetical protein
PSLF89_RS29690  -1      22       3.771524   IS3 family transposase
PSLF89_RS29695  -1      20       3.857925   transposase
PSLF89_RS29700  -1      21       3.589683   hypothetical protein
PSLF89_RS29705  0       18       3.037969   IS30 family transposase
PSLF89_RS29710  0       16       2.781106   IS3 family transposase
PSLF89_RS29715  0       12       1.028985   dTMP kinase
PSLF89_RS29720  0       15       0.596672   endolytic transglycosylase MltG
PSLF89_RS29725  1       17       -0.231148  aminodeoxychorismate lyase
PSLF89_RS29730  1       20       -1.161462  beta-ketoacyl-ACP synthase II
PSLF89_RS29735  0       19       -1.707006  acyl carrier protein
PSLF89_RS29740  0       19       -2.310145  3-oxoacyl-ACP reductase FabG
PSLF89_RS29745  2       18       -3.560485  ACP S-malonyltransferase
PSLF89_RS29750  3       18       -2.344754  ketoacyl-ACP synthase III
PSLF89_RS37180  2       18       -1.326027  phosphate acyltransferase PlsX
PSLF89_RS29765  2       19       -1.583369  50S ribosomal protein L32
PSLF89_RS29770  0       18       -1.416515  YceD family protein
PSLF89_RS29775  1       13       -1.020370  BON domain-containing protein
PSLF89_RS29785  -1      17       -1.749935  class I poly(R)-hydroxyalkanoic acid synthase
PSLF89_RS29795  1       19       -1.875653  hypothetical protein
PSLF89_RS29800  1       18       -1.740515  hypothetical protein
PSLF89_RS29805  2       25       -1.517008  DUF4286 family protein
PSLF89_RS29810  2       20       -0.444794  uracil-DNA glycosylase
PSLF89_RS36565  -1      12       -0.495745  OmpA family protein
PSLF89_RS29815  -1      12       -0.311479  Nif3-like dinuclear metal center hexameric
PSLF89_RS29820  -2      13       1.438363   UDP-N-acetylglucosamine
PSLF89_RS29830  -1      20       3.339377   BolA/IbaG family iron-sulfur metabolism protein
PSLF89_RS29835  -2      19       3.631848   STAS domain-containing protein
PSLF89_RS29845  -1      19       4.820020   ABC transporter substrate-binding protein
PSLF89_RS29850  -1      19       4.265751   outer membrane lipid asymmetry maintenance
PSLF89_RS29855  -1      20       4.163799   lipid asymmetry maintenance ABC transporter
PSLF89_RS29865  0       17       3.143031   ABC transporter ATP-binding protein
PSLF89_RS29870  0       14       2.586203   KpsF/GutQ family sugar-phosphate isomerase
PSLF89_RS29880  2       13       2.567641   LPS export ABC transporter periplasmic protein
PSLF89_RS29890  1       12       1.938267   lipopolysaccharide transport periplasmic protein
PSLF89_RS29895  1       14       2.182982   LPS export ABC transporter ATP-binding protein
PSLF89_RS29900  -1      16       3.217279   RNA polymerase factor sigma-54
PSLF89_RS29905  -1      17       2.671426   ribosome-associated translation inhibitor RaiA
PSLF89_RS29910  0       21       3.218654   RNase adapter RapZ
PSLF89_RS29915  0       21       3.509245   signal peptidase I
PSLF89_RS29920  1       19       3.446045   carbon-nitrogen hydrolase family protein
PSLF89_RS29925  2       19       4.591989   2-polyprenyl-3-methyl-6-methoxy-1,4-benzoquinone
PSLF89_RS29930  2       21       3.624777   hypothetical protein
PSLF89_RS29935  2       17       4.174176   chemotaxis protein
PSLF89_RS29940  3       16       4.342357   hypothetical protein
PSLF89_RS29945  4       16       4.233761   type VI secretion system baseplate subunit TssG
PSLF89_RS29950  3       17       4.633607   type VI secretion system baseplate subunit TssF
PSLF89_RS29955  0       22       3.601295   type VI secretion system baseplate subunit TssE
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
PSLF89_RS29705  UREASE  27     0.003    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 27.4 bits (61), Expect = 0.003
Identities = 5/28 (17%), Positives = 11/28 (39%), Gaps = 3/28 (10%)

Query: 6 PVRACAVPRYDVTKKDLPLSCPMPQMEI 33
V+ R + K + + P +E+
Sbjct: 515 AVQNT---RGGIGKASMIHNSLTPHIEV 539


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
PSLF89_RS29780  HTHFIS  83     2e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 83.0 bits (205), Expect = 2e-20
Identities = 37/131 (28%), Positives = 66/131 (50%), Gaps = 1/131 (0%)

Query: 3 ARIMIIDDEAPIRDMVRFALELSHFEVIEAETAKEGQRKIIEKTPDLLLLDWMLPDQAGI 62
A I++ DD+A IR ++ AL + ++V A R I DL++ D ++PD+
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 63 ELAQTLKVQYPALPIIMLTARAEEESRVLGLEQ-ADDYVIKPFSPRELIARIKAVLRRSQ 121
+L +K P LP+++++A+ + + E+ A DY+ KPF ELI I L +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 122 TNAGENTEPAQ 132
+ + +Q
Sbjct: 124 RRPSKLEDDSQ 134


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
PSLF89_RS29785  PF06580  34     4e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 34.5 bits (79), Expect = 4e-04
Identities = 23/92 (25%), Positives = 40/92 (43%), Gaps = 23/92 (25%)

Query: 239 NAVRY----TPKGGKISIIAYKNDHGIHVLVKDTGIGVPKKHIQRLTERFYRVDKGRSRD 294
N +++ P+GGKI + K++ + + V++TG K +
Sbjct: 266 NGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE---------------- 309

Query: 295 VGGTGLGLAIVKHVL-LRHEGELTIKSTEQQG 325
TG GL V+ L + + E IK +E+QG
Sbjct: 310 --STGTGLQNVRERLQMLYGTEAQIKLSEKQG 339


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS29850  ACRIFLAVINRP  26     0.017    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 26.0 bits (57), Expect = 0.017
Identities = 11/42 (26%), Positives = 19/42 (45%), Gaps = 2/42 (4%)

Query: 34 GADSLDTVELVMALEEEFETEIPDEDAEKIATVQDAMNYVKQ 75
GA++LDT + + A E + P K+ D +V+
Sbjct: 296 GANALDTAKAIKAKLAELQPFFP--QGMKVLYPYDTTPFVQL 335


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS29855  DHBDHDRGNASE  135    3e-41    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 135 bits (342), Expect = 3e-41
Identities = 81/248 (32%), Positives = 119/248 (47%), Gaps = 10/248 (4%)

Query: 6 KLALITGASRGIGRAVLNELGRQGLTVVGTATTEAGAENITGFINEQGYKGCGLALNVTE 65
K+A ITGA++GIG AV L QG + E + + + +V +
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 66 PEQITAAVDKITKEFGPVQVLVNNAGITRDNLMLRMKDEDWNAVIETNLSAVFRVTKACL 125
I +I +E GP+ +LVN AG+ R L+ + DE+W A N + VF +++
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 126 KGMMKVRWGRIINIGSVVGNMGNPGQANYCAAKAGLVGVTKSMAHEFASRNITVNVIAPG 185
K MM R G I+ +GS + A Y ++KA V TK + E A NI N+++PG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 186 FIQTDM-----TDALAEEQRVA-LLEH----IPAKRLGQPEDIAHMAAFLSSEAASYLTG 235
+TDM D EQ + LE IP K+L +P DIA FL S A ++T
Sbjct: 189 STETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITM 248

Query: 236 QTLHINGG 243
L ++GG
Sbjct: 249 HNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
PSLF89_RS29900  IGASERPTASE  58     1e-10    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 58.2 bits (140), Expect = 1e-10
Identities = 58/316 (18%), Positives = 95/316 (30%), Gaps = 44/316 (13%)

Query: 410 NP--AGHDRDILTTQIDNIRHATWGVKAAVADNATISLGDAAQELKTAGTAGASELVDTE 467
NP ++ + TT I + V + ++N I+ E A A+ TE
Sbjct: 982 NPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIA---RVDEAPVPPPAPATPSETTE 1038

Query: 468 LVAPEAE---------SPEAESPEAESPEAESPEAESPEAESPEAESPEAESPEAESPEA 518
VA ++ +A A++ E + +A + E ++ S E+
Sbjct: 1039 TVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTT 1098

Query: 519 ESPEAESPEAE--------------------SPEAESPEAESPEAESPEAESPEAESPEA 558
E+ E + E E SP+ E E P+AE P E
Sbjct: 1099 ETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEP 1158

Query: 559 ESPEAESPEAESPEAETS-EIVEEVEEEGAVNVNVSVRGGGEHQDVQSVSLEEGDEATLT 617
+S + + E P ETS + + V E VN V T
Sbjct: 1159 QSQTNTTADTEQPAKETSSNVEQPVTESTTVNTG---------NSVVENPENTTPATTQP 1209

Query: 618 MGDSETEVALDDIRLEENSEEESVGADVDILAGDNEISAQLDAARAYILAEDVDSARKVL 677
+SE+ + + D A D A D+ K
Sbjct: 1210 TVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQ 1269

Query: 678 RDVLKQGDSTQQDEAR 693
L G + Q ++
Sbjct: 1270 FVALNVGKAVSQHISQ 1285



Score = 32.3 bits (73), Expect = 0.009
Identities = 26/163 (15%), Positives = 54/163 (33%), Gaps = 6/163 (3%)

Query: 83 IAQAKRVKAQPKVQSPVKAKPEVTTVIVPSKPQKVLIEKKVSYIPTDKPRQIDNELKKQL 142
IA+ P + E + + V ++ + T + R++ E K +
Sbjct: 1017 IARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNV 1076

Query: 143 GYITQQMSA--LTRSVEEVQEEMLSVARDVQQTSNGVLQLEEYQQQLQHAEQERIARQEA 200
TQ +E Q V++ ++ E+ Q+ + Q +QE
Sbjct: 1077 KANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVS-PKQEQ 1135

Query: 201 AESVQSKQNEIAEIPVELGVHNVVTPAEPSLDQSPAASESRIT 243
+E+VQ + E N+ P + + ++ T
Sbjct: 1136 SETVQPQAEPARE---NDPTVNIKEPQSQTNTTADTEQPAKET 1175


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
PSLF89_RS29915  NAFLGMOTY  98     9e-26    Sodium-type flagellar protein MotY precursor signature.
		>NAFLGMOTY#Sodium-type flagellar protein MotY precursor signature.

Length = 293

Score = 97.5 bits (242), Expect = 9e-26
Identities = 69/278 (24%), Positives = 128/278 (46%), Gaps = 6/278 (2%)

Query: 17 LFFSTGHALTRIHYQAPLATVKWTFSEELGF-CRLTHEIPHYGQAMFEMRTGRLQSFVLD 75
L + +A+ Y A +W C+L H IP +G A+F R + + +
Sbjct: 14 LLSANSYAVMGKRYVATPQQSQWEMVVNTPLECQLVHPIPSFGDAVFSSRASKKINLDFE 73

Query: 76 TQVGPSRKET--VTLTTGSPGWRLPEALILNDHAKMSQGYAPFRFGTMTVRRMLNSLAQG 133
++ ET V+L + P WR E + K + + + G T +L+ L +G
Sbjct: 74 LKMRRPMGETRNVSLISMPPPWRPGEHADRITNLKFFKQFDGY-VGGQTAWGILSELEKG 132

Query: 134 YAPTLTYQSWLGNRQQVQVSISPANFSRSYTQYLACSKKILPFTFVDIEHTVIYFGVNDR 193
PT +YQ W Q+++V++S F Y + C +L ++F DI T++++
Sbjct: 133 RYPTFSYQDWQSRDQRIEVALSSVLFQSKYNAFSDCIANLLKYSFEDIAFTILHYERQGD 192

Query: 194 RILKDQRYKLERIQEYFKVMKPKIRRIVIKGYADYAGNYLYNKYLSIDRAKALRKFIVED 253
++ K + +L +I +Y + I +++ Y D ++ LS RA++LR + E
Sbjct: 193 QLTKASKKRLAQIADYVR-HNQDIDLVLVATYTDSTDGKSESQSLSERRAESLRTYF-ES 250

Query: 254 MKFDAKKLVVRAYGVGQAVANNSTRSGRALNRRATIDI 291
+ ++ V+ YG + +A+N T G+ NRR I +
Sbjct: 251 LGLPEDRIQVQGYGKRRPIADNGTPIGKDKNRRVVISL 288


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
PSLF89_RS30015  HTHFIS  38     3e-05    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 38.3 bits (89), Expect = 3e-05
Identities = 16/111 (14%), Positives = 42/111 (37%), Gaps = 11/111 (9%)

Query: 180 TVLLVDDSRVAKRQIANILDQLKVQYITASDGIEAFEMLTKMVEGTDNINSKLLMMLSDI 239
T+L+ DD + + L + S+ + + ++++D+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGD---------LVVTDV 55

Query: 240 EMPNMDGYTLTSKCREHPKLKDLYIVLNTSISGEFNQQMAKRVKADQFLAK 290
MP+ + + L + ++ DL +++ ++ + A A +L K
Sbjct: 56 VMPDENAFDLLPRIKKA--RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPK 104


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
PSLF89_RS30035  OMADHESIN  28     0.012    Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 27.9 bits (61), Expect = 0.012
Identities = 14/58 (24%), Positives = 25/58 (43%)

Query: 55 DIIHGDQIELNHFCRDIERLICEYEPRIRHVTVDLAEEHVMNSRFELIISGEVYYENK 112
D+++ + N R E+ + T++ AEEH E + S VY ++K
Sbjct: 276 DVLNMAKAHSNSVARTTLETAEEHANSVARTTLETAEEHANKKSAEALASANVYADSK 333


43  PSLF89_RS30005  PSLF89_RS30090  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
PSLF89_RS30005  -1      17       -4.707347  type IVB secretion system protein IcmH/DotU
PSLF89_RS30010  0       19       -6.198581  hypothetical protein
PSLF89_RS30015  1       20       -6.532120  hypothetical protein
PSLF89_RS30020  2       23       -7.003307  hypothetical protein
PSLF89_RS30025  3       23       -8.545688  hypothetical protein
PSLF89_RS30035  3       24       -9.417493  adenosylmethionine decarboxylase
PSLF89_RS30040  2       25       -9.678575  aminodeoxychorismate/anthranilate synthase
PSLF89_RS30045  4       24       -9.445464  ribulose-phosphate 3-epimerase
PSLF89_RS30050  3       22       -9.259311  DUF3530 family protein
PSLF89_RS30055  1       18       -8.024127  response regulator
PSLF89_RS30060  1       17       -8.310907  DnaJ domain-containing protein
PSLF89_RS30065  1       15       -7.907625  nucleotidyltransferase family protein
PSLF89_RS30075  1       20       -7.649417  phosphotransferase
PSLF89_RS30080  1       20       -7.012153  LPS assembly protein LptD
PSLF89_RS30085  1       17       -6.530843  peptidyl-prolyl cis-trans isomerase SurA
PSLF89_RS30090  0       15       -4.998125  4-hydroxythreonine-4-phosphate dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
PSLF89_RS30125  HTHFIS  79     1e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.5 bits (196), Expect = 1e-20
Identities = 29/118 (24%), Positives = 52/118 (44%), Gaps = 5/118 (4%)

Query: 2 KILIADDSMTMRRIIINALVDHGAASDNILEAEDGERALVLWQAEGDEVGLALLDWNMPK 61
IL+ADD +R ++ AL G ++ + A + L + D MP
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGY---DVRITSNAATLWRWIAAG--DGDLVVTDVVMPD 59

Query: 62 MNGLDTLHLIRDIDKKTPIIMVTTHSEKTDVVSAISEGATNYIIKPFEVDTLIAKISQ 119
N D L I+ P+++++ + + A +GA +Y+ KPF++ LI I +
Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


44  PSLF89_RS30225  PSLF89_RS30330  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
PSLF89_RS30225  2       16       0.563893   diaminopimelate decarboxylase
PSLF89_RS30230  1       12       0.008855   RidA family protein
PSLF89_RS30235  2       13       0.055591   hypothetical protein
PSLF89_RS36575  0       15       0.949262   bifunctional UDP-4-keto-pentose/UDP-xylose
PSLF89_RS30245  0       15       1.449438   glycosyltransferase
PSLF89_RS30265  1       17       0.988328   DegT/DnrJ/EryC1/StrS family aminotransferase
PSLF89_RS30270  3       24       0.888664   small Multidrug Resistance family protein
PSLF89_RS30275  2       21       0.564357   glycosyltransferase family 39 protein
PSLF89_RS30280  1       22       0.983429   tRNA (guanosine(18)-2'-O)-methyltransferase
PSLF89_RS30285  2       22       0.721135   RidA family protein
PSLF89_RS30290  2       21       0.516979   response regulator
PSLF89_RS30295  -1      19       0.829832   MFS transporter
PSLF89_RS30300  0       18       0.816256   IS3 family transposase
PSLF89_RS30305  -1      19       1.060484   integrase core domain-containing protein
PSLF89_RS30310  -1      18       1.572136   hypothetical protein
PSLF89_RS30315  -1      20       1.957224   DUF475 domain-containing protein
PSLF89_RS30320  0       20       1.607613   IS4 family transposase
PSLF89_RS30325  3       22       2.251520   transposase
PSLF89_RS30330  2       23       3.207147   ABC transporter permease subunit
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
PSLF89_RS30315  TACYTOLYSIN  28     0.013    Bacterial thiol-activated pore-forming cytolysin sig...
		>TACYTOLYSIN#Bacterial thiol-activated pore-forming cytolysin signature.

Length = 574

Score = 28.0 bits (62), Expect = 0.013
Identities = 9/29 (31%), Positives = 15/29 (51%)

Query: 9 DNAPAAIGTYSQAIQVKDTVYISGQIPLD 37
+N A + S+ ++ T Y SG+I L
Sbjct: 443 NNKIAGVNNRSEYVETTSTEYTSGKINLS 471


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS30325  NUCEPIMERASE  120    2e-33    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 120 bits (302), Expect = 2e-33
Identities = 74/366 (20%), Positives = 140/366 (38%), Gaps = 68/366 (18%)

Query: 4 KVLILGANGFIGSSLSEYILEHTDWEIYGLD---------LSHHKLDQCIGHPRFHFTEG 54
K L+ GA GFIG +S+ +LE ++ G+D L +L+ + P F F +
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGH-QVVGIDNLNDYYDVSLKQARLEL-LAQPGFQFHKI 59

Query: 55 DMLIHKEWVE--YHVKKCDVVLPLVAIATPATYVKDPLRIFELDFEANLEVVRWCAKYN- 111
D L +E + + + V +++P + + L ++ C
Sbjct: 60 D-LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKI 118

Query: 112 KHVIFPSTSEVYGMCEDDAFDEESSNFINGPINKPRWIYSNSKQLMDRVIHALGEKDGLN 171
+H+++ S+S VYG+ F + ++ P +Y+ +K+ + + H GL
Sbjct: 119 QHLLYASSSSVYGLNRKMPFSTDD------SVDHPVSLYAATKKANELMAHTYSHLYGLP 172

Query: 172 YTLFRPFNWIG--GRQDEVFNIKEGGARVLTQFISNIIHGRDIQLVDGGEQRRCFTYIDD 229
T R F G GR D F ++ G+ I + + G+ +R FTYIDD
Sbjct: 173 ATGLRFFTVYGPWGRPDMALFK----------FTKAMLEGKSIDVYNYGKMKRDFTYIDD 222

Query: 230 GIEALARIIEC---------------RDSSANRQIINIGN--PENNHSIKELAEVLLAEI 272
EA+ R+ + S A ++ NIGN P + + + L +
Sbjct: 223 IAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVE---LMDYIQALEDAL 279

Query: 273 KKYDQYQAQADKVRVIITQSDRYYGEGYQDVKARIPAIENAKKYLNWQPKTDFVTAIQKT 332
+A K + + D E D + + + + P+T ++
Sbjct: 280 GI------EAKKNMLPLQPGDVL--ETSAD-------TKALYEVIGFTPETTVKDGVKNF 324

Query: 333 LAYHLA 338
+ ++
Sbjct: 325 VNWYRD 330


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
PSLF89_RS30360  HTHFIS  69     3e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 69.1 bits (169), Expect = 3e-15
Identities = 21/120 (17%), Positives = 45/120 (37%), Gaps = 4/120 (3%)

Query: 6 RMNILIVDDRKNVLHSLKRLILLKFPESKLTLAESGEEAMSLISEGPSFGLIMTDYKMAN 65
IL+ DD + L + L + + + I+ G L++TD M +
Sbjct: 3 GATILVADDDAAIRTVLNQA--LSRAGYDVRITSNAATLWRWIAAGD-GDLVVTDVVMPD 59

Query: 66 INGLEVLAHAHGIHADTVRVLLTGYPNDEGIIAAIANDDINYILAKPWTQEQIVEILNQC 125
N ++L D ++++ I A +Y+ KP+ +++ I+ +
Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYL-PKPFDLTELIGIIGRA 118


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
PSLF89_RS30365  TCRTETA  35     2e-04    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 35.2 bits (81), Expect = 2e-04
Identities = 33/227 (14%), Positives = 73/227 (32%), Gaps = 8/227 (3%)

Query: 2 VFNLGFGAGLLLAGFLFTHYTHWLFWLDALTAFVSLLLIIFYLTEGHTVEANSHLEQAVA 61
F G AG +L G + H F+ A ++ L F L E H E +A+
Sbjct: 139 CFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALN 198

Query: 62 GSVWLVLKRRPQLVSYTLICTVLSLTMAQLIFAFPLYLAALFNAQGAQY----YGQIMTA 117
R +V+ + + QL+ P L +F + G + A
Sbjct: 199 PLASFRWARGMTVVAALMAVFFI----MQLVGQVPAALWVIFGEDRFHWDATTIGISLAA 254

Query: 118 NAVIVVIFTPILTVLTRRYSALIGTMIAAIFLGLCYSTLLLNQSLIVIFLAVFFMTVAEV 177
++ + ++T ++ + LL + + + + +
Sbjct: 255 FGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGG 314

Query: 178 LIVTKSSVFIANHSPSSHRGRITGILPAVINSANYFSPVIMGGYIDH 224
+ + ++ +G++ G L A+ + + P++
Sbjct: 315 IGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAA 361



Score = 34.8 bits (80), Expect = 2e-04
Identities = 28/142 (19%), Positives = 56/142 (39%), Gaps = 6/142 (4%)

Query: 104 NAQGAQYYGQIMTANAVIVVIFTPILTVLTRRYSALIGTMIAAIFLGLCYSTLLLNQSLI 163
+ +YG ++ A++ P+L L+ R+ +++ + Y+ + L
Sbjct: 38 SNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLW 97

Query: 164 VIFLA--VFFMTVAEVLIVTKSSVFIANHSPSSHRGRITGILPAVINSANYFSPVIMGGY 221
V+++ V +T A + +IA+ + R R G + A PV+ GG
Sbjct: 98 VLYIGRIVAGITGATGAVAGA---YIADITDGDERARHFGFMSACFGFGMVAGPVL-GGL 153

Query: 222 IDHFGFHALWYLMIILAVLGVC 243
+ F HA ++ L L
Sbjct: 154 MGGFSPHAPFFAAAALNGLNFL 175


45  PSLF89_RS36600  PSLF89_RS30560  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
PSLF89_RS36600  -1      15       3.515383   hypothetical protein
PSLF89_RS30515  0       15       3.508957   hypothetical protein
PSLF89_RS30525  1       19       4.304688   copper-transporting ATPase
PSLF89_RS30530  0       18       4.360223   hypothetical protein
PSLF89_RS30540  2       15       1.390196   IS982 family transposase
PSLF89_RS30550  0       16       0.046749   heavy metal-responsive transcriptional
PSLF89_RS30555  1       25       -1.579470  acyl-CoA desaturase
PSLF89_RS30560  2       22       -1.756146  peptide deformylase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
PSLF89_RS30610  PF04647  29     0.022    Accessory gene regulator B
		>PF04647#Accessory gene regulator B

Length = 212

Score = 29.4 bits (66), Expect = 0.022
Identities = 15/63 (23%), Positives = 28/63 (44%), Gaps = 2/63 (3%)

Query: 16 FIVLHLICLLAFVTGVTTSSVVLAISLYFIRMWAVTAGYHRYFSHKSFKTSRVFQFILAF 75
+ +I L+AFV G+ +S R ++ G H ++ TS + +LA+
Sbjct: 36 VFQIIIILLVAFVIGLAKEVAFCLLSAAVYRRFS--GGAHCEKYYRCTLTSLLVFNVLAY 93

Query: 76 LAQ 78
+A
Sbjct: 94 IAH 96


46  PSLF89_RS30690  PSLF89_RS30880  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
PSLF89_RS30690  1       16       -3.026724  IS30 family transposase
PSLF89_RS30695  2       17       -3.927600  hypothetical protein
PSLF89_RS30700  2       17       -2.745622  hypothetical protein
PSLF89_RS30705  1       16       -0.409953  phosphatase PAP2 family protein
PSLF89_RS30710  1       15       -0.157846  hypothetical protein
PSLF89_RS30715  2       17       0.711949   hypothetical protein
PSLF89_RS30720  3       16       2.687864   type IVB secretion system apparatus protein
PSLF89_RS30725  4       18       2.190182   hypothetical protein
PSLF89_RS30730  1       20       -0.835432  DotA/TraY family protein
PSLF89_RS30735  -1      20       -5.794023  hypothetical protein
PSLF89_RS30740  1       22       -7.774556  ATPase AAA
PSLF89_RS36615  3       25       -9.808935  type IVB secretion system protein IcmW
PSLF89_RS30750  4       25       -9.487815  type IVB secretion system protein IcmJDotN
PSLF89_RS30755  1       19       -6.425743  type IVB secretion system coupling complex
PSLF89_RS30760  2       19       -6.307052  hypothetical protein
PSLF89_RS30765  1       21       -5.417808  TraM recognition domain-containing protein
PSLF89_RS30770  1       21       -5.354496  IcmT/TraK family protein
PSLF89_RS30775  1       21       -5.620751  Flp pilus assembly complex ATPase component
PSLF89_RS30780  0       22       -6.070108  type IV secretory system conjugative DNA
PSLF89_RS30785  0       27       -7.910689  DotD/TraH family lipoprotein
PSLF89_RS30790  0       27       -7.432183  hypothetical protein
PSLF89_RS30795  2       28       -9.314177  hypothetical protein
PSLF89_RS30800  2       28       -8.541371  hypothetical protein
PSLF89_RS30810  2       25       -7.593099  DotG/IcmE/VirB10 family protein
PSLF89_RS30815  0       24       -6.865206  hypothetical protein
PSLF89_RS30820  2       25       -6.121658  hypothetical protein
PSLF89_RS30825  3       25       -6.786846  aconitate hydratase AcnA
PSLF89_RS30830  1       21       -5.732786  hypothetical protein
PSLF89_RS30835  2       20       -5.319461  hypothetical protein
PSLF89_RS30840  1       20       -4.801171  hypothetical protein
PSLF89_RS30850  -2      16       -0.878743  IS30 family transposase
PSLF89_RS30855  -3      15       0.871548   hypothetical protein
PSLF89_RS30860  -1      15       1.346749   IS4 family transposase
PSLF89_RS30870  1       15       1.485276   hypothetical protein
PSLF89_RS30875  3       17       -0.211374  Hsp20 family protein
PSLF89_RS30880  4       18       -0.447050  adenylosuccinate lyase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PSLF89_RS30740STREPKINASE270.049 Streptococcus streptokinase protein signature.
		>STREPKINASE#Streptococcus streptokinase protein signature.

Length = 440

Score = 27.0 bits (59), Expect = 0.049
Identities = 18/50 (36%), Positives = 28/50 (56%), Gaps = 2/50 (4%)

Query: 88 HQRLSTHTSPDVISQELIR-EHNIQVSESTI-YRYIYDDRERGGELYKNL 135
H +L T DV + EL++ E + SE + +R +YD R++ LY NL
Sbjct: 317 HLKLFTIKYVDVDTNELLKSEQLLTASERNLDFRDLYDPRDKAKLLYNNL 366


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS30750  CHANLCOLICIN  28     0.003    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 28.1 bits (62), Expect = 0.003
Identities = 8/37 (21%), Positives = 19/37 (51%), Gaps = 2/37 (5%)

Query: 23 AGGGLTKFVLSIDSII--HTLEWIGLGIIALFVGAFV 57
A G++ V + S++ TL G+ I+ + +++
Sbjct: 472 ADAGVSYVVALLFSLLAGTTLGIWGIAIVTGILCSYI 508


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS30815  BONTOXILYSIN  34     0.003    Bontoxilysin signature.
		>BONTOXILYSIN#Bontoxilysin signature.

Length = 1196

Score = 34.1 bits (78), Expect = 0.003
Identities = 37/181 (20%), Positives = 58/181 (32%), Gaps = 27/181 (14%)

Query: 586 KLKINQFLKVKPPSREE--VNFLNSVYSVSDLNVDIEDNFDHISEDDQLINFLIKKIHSN 643
L +N F + + N L Y +D DN++ + F+ +I++
Sbjct: 333 NLNLNYFCQSFNSIIPDRFSNALKHFYRKQYYTMDYTDNYN-------INGFVNGQINTK 385

Query: 644 IPLDINKACSIIDEIKDFNEDEPVEEISADFIPKNDIFLNIPLEDRVDSVYYDYGTSSLE 703
+PL NK +II E + + +N+I L + S Y G
Sbjct: 386 LPLS-NKNTNIIS----------KPEKVVNLVNENNISL-------MKSNIYGDGLKGTT 427

Query: 704 DCLIDYKQIKSAVNKMAILNDMQKSKANNISKNILDRISYITCYEFYDKKSDVNSIIDKI 763
+ +I ND NNIS +D I I Y SD
Sbjct: 428 EDFYSTYKIPYNEEYEYRFNDSDNFPLNNISIEEVDSIPEIIDINPYKDNSDNLVFTQIT 487

Query: 764 I 764

Sbjct: 488 S 488


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS30825  PF05272       30     0.013    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.4 bits (68), Expect = 0.013
Identities = 14/36 (38%), Positives = 19/36 (52%), Gaps = 1/36 (2%)

Query: 147 GIVVISGATGSGKSTLLASLVANSLEQVDSHLKVLT 182
VV+ G G GKSTL+ +LV D+H + T
Sbjct: 597 YSVVLEGTGGIGKSTLINTLVGLDF-FSDTHFDIGT 631


47  PSLF89_RS30935  PSLF89_RS30985  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
PSLF89_RS30935  2       14       -2.425058   Spy/CpxP family protein refolding chaperone
PSLF89_RS30940  1       14       -2.462589   LysE family translocator
PSLF89_RS30945  3       15       -3.009733   glutamate 5-kinase
PSLF89_RS30950  3       18       -2.905021   hypothetical protein
PSLF89_RS30955  2       18       -2.959795   IS4 family transposase
PSLF89_RS30960  2       17       -2.666773   protein kinase family protein
PSLF89_RS30965  -1      16       -2.131080   YchJ family protein
PSLF89_RS30970  -2      17       -1.367070   M48 family metallopeptidase
PSLF89_RS30975  -3      17       -1.080319   M15 family metallopeptidase
PSLF89_RS30980  -1      15       -0.881115   S-methyl-5-thioribose kinase
PSLF89_RS37195  1       24       -2.429571   hypothetical protein
PSLF89_RS30985  2       22       -2.956189   acireductone synthase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS30980  CARBMTKINASE  43     1e-06    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 42.5 bits (100), Expect = 1e-06
Identities = 32/125 (25%), Positives = 53/125 (42%), Gaps = 9/125 (7%)

Query: 133 VPIINENNAISIEATAIGDNDTLAALIASQAQADLLVLLTCVDG-LIDYRANQVVETVTN 191
VP+I E+ I A+ D D +A + AD+ ++LT V+G + Y + + +
Sbjct: 197 VPVILEDGEIK-GVEAVIDKDLAGEKLAEEVNADIFMILTDVNGAALYYGTEK-EQWLRE 254

Query: 192 IEQQAAELVRQEKTELGTGGMATKLQAA-RIVNESGIAMLIANGQQPYVMTELLQGANIG 250
++ + +E G M K+ AA R + G +IA E L+G G
Sbjct: 255 VKVEELRKYYEEG-HFKAGSMGPKVLAAIRFIEWGGERAIIA---HLEKAVEALEG-KTG 309

Query: 251 TLFCP 255
T P
Sbjct: 310 TQVLP 314



Score = 32.1 bits (73), Expect = 0.003
Identities = 21/81 (25%), Positives = 36/81 (44%), Gaps = 9/81 (11%)

Query: 10 KRIIIKVGTSLLVKDSKLQTYF-----ITHLAQQIVQLRARGKECIVVTSG---AVGLGA 61
KR++I +G + L + + +Y + A+QI ++ ARG E +V+T G VG
Sbjct: 3 KRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYE-VVITHGNGPQVGSLL 61

Query: 62 ELNHKGKTPNRTEQQALAAIG 82
G+ Q + G
Sbjct: 62 LHMDAGQATYGIPAQPMDVAG 82


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS31015  SALSPVAPROT   26     0.038    Salmonella virulence plasmid 28.1kDa A protein signa...
		>SALSPVAPROT#Salmonella virulence plasmid 28.1kDa A protein signature.

Length = 255

Score = 26.3 bits (57), Expect = 0.038
Identities = 17/46 (36%), Positives = 22/46 (47%), Gaps = 1/46 (2%)

Query: 24 QTLPATPEQLMRSRYTAYTQANIDYIIATMQG-EALNRFNRNSATT 68
QTLP P+ + T++ Q N+ A EALN F R A T
Sbjct: 192 QTLPTEPDNSTATDLTSFYQTNLGLKTADYTPFEALNTFARQLAIT 237


48  PSLF89_RS31160  PSLF89_RS31395  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
PSLF89_RS31160  0       22       -3.327698   hypothetical protein
PSLF89_RS31165  -1      21       -4.229078   hypothetical protein
PSLF89_RS31170  -3      17       -3.958307   hypothetical protein
PSLF89_RS36625  -2      15       -4.421237   patatin-like phospholipase family protein
PSLF89_RS37535  -2      15       -5.005103   ATPase AAA
PSLF89_RS31180  -1      18       -6.672991   type IVB secretion system protein IcmW
PSLF89_RS31185  -2      19       -6.962037   hypothetical protein
PSLF89_RS31190  -1      22       -7.860715   type IVB secretion system coupling complex
PSLF89_RS31195  1       25       -9.440052   hypothetical protein
PSLF89_RS31200  0       27       -11.557378  TraM recognition domain-containing protein
PSLF89_RS31205  3       21       -5.674618   IcmT/TraK family protein
PSLF89_RS31210  4       21       -5.496969   Flp pilus assembly complex ATPase component
PSLF89_RS31215  1       19       -5.834413   type IV secretory system conjugative DNA
PSLF89_RS31220  1       19       -6.079503   DotD/TraH family lipoprotein
PSLF89_RS37600  0       18       -6.304270   hypothetical protein
PSLF89_RS31230  -1      16       -5.123587   hypothetical protein
PSLF89_RS31235  -1      16       -6.289653   hypothetical protein
PSLF89_RS31240  -1      16       -6.487211   DotG/IcmE/VirB10 family protein
PSLF89_RS31245  -1      17       -6.514068   hypothetical protein
PSLF89_RS31250  0       16       -6.483463   hypothetical protein
PSLF89_RS31255  0       20       -6.028231   hypothetical protein
PSLF89_RS31260  3       22       -8.655962   hypothetical protein
PSLF89_RS31270  2       25       -7.860368   DedA family protein
PSLF89_RS31280  2       24       -7.912091   ion transporter
PSLF89_RS31285  1       24       -7.220749   hypothetical protein
PSLF89_RS31290  2       27       -7.771217   hypothetical protein
PSLF89_RS31295  2       27       -8.872311   IS4 family transposase
PSLF89_RS31300  3       28       -8.871200   hypothetical protein
PSLF89_RS31305  3       28       -8.479919   IS30 family transposase
PSLF89_RS31310  2       28       -8.642686   L-threonine 3-dehydrogenase
PSLF89_RS31315  3       28       -9.406187   glycine C-acetyltransferase
PSLF89_RS31320  4       28       -9.724669   hypothetical protein
PSLF89_RS31325  3       24       -8.370546   hypothetical protein
PSLF89_RS31330  4       20       -8.862028   hypothetical protein
PSLF89_RS31335  3       18       -9.094759   hypothetical protein
PSLF89_RS31340  5       18       -8.573190   hypothetical protein
PSLF89_RS31345  2       18       -6.432087   hypothetical protein
PSLF89_RS31350  2       16       -5.857965   methylated-DNA--[protein]-cysteine
PSLF89_RS31355  2       15       -4.992258   dethiobiotin synthase
PSLF89_RS31360  1       13       -2.974445   malonyl-ACP O-methyltransferase BioC
PSLF89_RS31365  -1      14       -1.676707   alpha/beta fold hydrolase
PSLF89_RS31370  -1      12       0.177470    8-amino-7-oxononanoate synthase
PSLF89_RS31375  -1      13       0.913468    biotin synthase BioB
PSLF89_RS31380  -1      12       0.656246    hypothetical protein
PSLF89_RS31385  1       14       1.552723    hypothetical protein
PSLF89_RS31390  -1      13       0.907246    hypothetical protein
PSLF89_RS31395  2       17       -0.562262   bifunctional
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS31245  PF01540       30     0.027    Adhesin lipoprotein
		>PF01540#Adhesin lipoprotein

Length = 475

Score = 30.1 bits (67), Expect = 0.027
Identities = 44/213 (20%), Positives = 85/213 (39%), Gaps = 14/213 (6%)

Query: 66 KTEPKFKVEKGVVDFITYLQDMIKDLHKAGRKSSSLAQSIEKQILGSVYSSLNEISNEQA 125
K E KF++++ F L I+ L+K + + A + L+E+ + +
Sbjct: 272 KLERKFQIDE---KFKKQLISTIELLNKKSVEVKTFATVNTIK----KDFLLSELESFKE 324

Query: 126 LETAKLSNQVINLEKEVETGKLNEKELSTKVDK-LSSELDNLKSRNEQLLEIANNSSSTT 184
T+ L V E+ + E+ + DK L+ E +K+ E+L +I N + +
Sbjct: 325 FNTSWLEKIVSEWEEVKKAWSKELAEIKAEDDKKLAEENQKIKNGVEELKKINNEAFELS 384

Query: 185 NTVIKTATFHTALERTAKISEGLTKVGEVMVGLQNIALPVSQSMASANEVIKTLNDTVKQ 244
TV KT LE+ KI E + + L S+ + V T
Sbjct: 385 KTVNKTI---AELEKKFKIDV---SFKEQLKNFADDLLDKSRQIDEFTTVTSTQEGFTLA 438

Query: 245 SQKTIKDTDERKFKDVQSKITTVKDEIIKEVKD 277
++ K+ F ++S+ V++ ++K+
Sbjct: 439 ELESFKEITTTWFNGMKSEWARVQEAWKDQLKE 471


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS31265  FLGMOTORFLIM  28     0.034    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 27.9 bits (62), Expect = 0.034
Identities = 21/101 (20%), Positives = 41/101 (40%), Gaps = 11/101 (10%)

Query: 100 LSQAEINRLLTELVKKSLENNDDLFSIK-----TLYSYFNEKSDILTHLCSSEVINPGFY 154
LSQ EI++LL + + +D I TLY + + + +++ F
Sbjct: 5 LSQDEIDQLL-TAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHETFA 63

Query: 155 RLISSS-----SYSNHKIISGIRILFSYDFIKNNSITTTDN 190
RL ++S H ++ + L +FI++ +T
Sbjct: 64 RLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLA 104


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS31320  IGASERPTASE   30     0.016    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.0 bits (67), Expect = 0.016
Identities = 40/183 (21%), Positives = 78/183 (42%), Gaps = 19/183 (10%)

Query: 96 NNQLDSRKKYNIADKKKDKNINNTNEVKYNN-VSDNLSDKLPINVNNENIKNNIS-ENND 153
N ++ + +I K D ++ Y +D LSDK + N N++ N++ +
Sbjct: 766 NITASNKAQVHIGYKTGDTVCVRSDYTGYVTCTTDKLSDKALNSFNPTNLRGNVNLTESA 825

Query: 154 KFLSKINLILNSINEIKIDNKSSKNNILKLNQSDVDQIT-DSIKKELSASQNEIDSYIKR 212
F+ + +I +S N+ ++L ++ +T +S +L + + +I
Sbjct: 826 NFVLGKANLFGTI-------QSRGNSQVRLTENSHWHLTGNSDVHQLDLA----NGHIHL 874

Query: 213 DNQINDNNKMIINKLNQVLAEVAKENKNYAYLIAKSNNNFDKLTLRAIITGRAWL--VNK 270
++ N NN + K N + N ++ YL SN DK+ + TG L +K
Sbjct: 875 NSADNSNN---VTKYNTLTVNSLSGNGSFYYLTDLSNKQGDKVVVTKSATGNFTLQVADK 931

Query: 271 TGK 273
TG+
Sbjct: 932 TGE 934


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS31485  LPSBIOSNTHSS  31     0.004    Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature.

Length = 166

Score = 31.3 bits (71), Expect = 0.004
Identities = 12/56 (21%), Positives = 30/56 (53%), Gaps = 9/56 (16%)

Query: 345 GCFDILHAGHIDYLQKARAKGDRLIIAINDDTSIQRLKGPS-RPIVPLAQRMQLLN 399
G FD + GH+D +++ D++ +A+ L+ P+ +P+ + +R++ +
Sbjct: 7 GSFDPITFGHLDIIERGCRLFDQVYVAV--------LRNPNKQPMFSVQERLEQIA 54


49  PSLF89_RS31535  PSLF89_RS31690  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
PSLF89_RS31535  -2      17       -3.451919   TIGR02449 family protein
PSLF89_RS31540  -1      14       -1.650872   cell division protein ZapA
PSLF89_RS31545  -1      13       -0.475631   5-formyltetrahydrofolate cyclo-ligase
PSLF89_RS31550  -1      15       2.222130    EVE domain-containing protein
PSLF89_RS31555  -2      17       2.810818    HAMP domain-containing methyl-accepting
PSLF89_RS31565  -1      19       4.179640    3'(2'),5'-bisphosphate nucleotidase CysQ
PSLF89_RS31570  -1      20       3.616978    ADP compounds hydrolase NudE
PSLF89_RS31575  -1      17       2.619790    ribosome-associated heat shock protein Hsp15
PSLF89_RS31580  -1      12       1.972232    hypothetical protein
PSLF89_RS36640  0       12       1.136974    hypothetical protein
PSLF89_RS31590  -1      11       2.057853    hypothetical protein
PSLF89_RS31595  -1      12       3.025350    IS30 family transposase
PSLF89_RS31600  -1      13       3.154115    hypothetical protein
PSLF89_RS31605  0       14       3.637312    hypothetical protein
PSLF89_RS31610  1       15       3.545731    polysaccharide deacetylase family protein
PSLF89_RS31615  1       17       4.199793    hypothetical protein
PSLF89_RS31620  -1      21       3.395668    hypothetical protein
PSLF89_RS31625  -1      26       1.739984    IS4 family transposase
PSLF89_RS31635  -1      19       1.736200    lipoyl synthase
PSLF89_RS31650  0       19       0.915342    lipoyl(octanoyl) transferase LipB
PSLF89_RS31655  1       16       1.169861    D-amino acid aminotransferase
PSLF89_RS31660  0       17       0.162846    D-alanyl-D-alanine carboxypeptidase
PSLF89_RS31665  3       21       0.389460    SPOR domain-containing protein
PSLF89_RS31670  3       20       -0.289595   lytic murein transglycosylase B
PSLF89_RS31675  5       19       -2.069003   rod shape-determining protein RodA
PSLF89_RS37545  5       20       -0.700078   23S rRNA
PSLF89_RS31685  4       16       -1.569543   ribosome silencing factor
PSLF89_RS36645  4       15       -1.256075   nicotinate-nucleotide adenylyltransferase
PSLF89_RS31690  3       14       -0.903474   DNA polymerase III subunit delta
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS31635  RTXTOXINA     26     0.036    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 26.1 bits (57), Expect = 0.036
Identities = 11/69 (15%), Positives = 32/69 (46%), Gaps = 2/69 (2%)

Query: 37 AALLLDDKMRHIRDSGKVIGVERIAIMAALNLAHDYLKNMNHRDDYIDDVNQQLEQLEHK 96
+++ +D+ ++ + G V E +A A++ L + + + ++ ++ +QQL L
Sbjct: 158 SSMKIDELIKKQKSGGNVSSSE-LA-KASIELINQLVDTVASLNNNVNSFSQQLNTLGSV 215

Query: 97 VKQALRFSS 105
+ +
Sbjct: 216 LSNTKHLNG 224


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS31655  CHANLCOLICIN  32     0.007    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 32.0 bits (72), Expect = 0.007
Identities = 36/201 (17%), Positives = 72/201 (35%), Gaps = 18/201 (8%)

Query: 275 QQVNNSSATMSNMMREAQVQLTEQASEAEHIQSSMTAMNDTMKTVVSKAEEVAKSARDAD 334
++ + +EA+ + E E + + K + + +EE AK+ A
Sbjct: 137 EKARKEAEAAEKAFQEAEQRRKEIEREKAETERQLKLAEAEEKRLAALSEE-AKAVEIAQ 195

Query: 335 QKSHEGQKVVNATRETIDSL-----------AKEVETTAQVITSLNEASNNVGSILDVIK 383
+K Q V I +L E++T A L +AS + +++K
Sbjct: 196 KKLSAAQSEVVKMDGEIKTLNSRLSSSIHARDAEMKTLAGKRNELAQASAKYKELDELVK 255

Query: 384 GISEQTNMLALNAAIEAARAGEQGRGFAVVADEVRGLAQRTGDSAEQIYDLIEQLRSHAH 443
+S + N N R + V A ++R Q+ ++E + I +
Sbjct: 256 KLSPRANDPLQN------RPFFEATRRRVGAGKIREEKQKQVTASETRINRINADITQIQ 309

Query: 444 NAVEAMDKGKERADASVQQSE 464
A+ + + A V ++E
Sbjct: 310 KAISQVSNNRNAGIARVHEAE 330


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS31740  BLACTAMASEA   36     2e-04    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 35.9 bits (83), Expect = 2e-04
Identities = 24/98 (24%), Positives = 37/98 (37%)

Query: 80 ILVDYNSGQVLTSGNPDERLSPASLTKVMSYYVVAEALRNGKIKESDKVRISRKAWKTGG 139
I +D SG+ LT+ DER S KV+ V + G + K+ ++
Sbjct: 43 IEMDLASGRTLTAWRADERFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYS 102

Query: 140 SRMFVKAGDSVSVKDLLQGMVVQSGNDATVALAEYVAG 177
D ++V +L + S N A L V G
Sbjct: 103 PVSEKHLADGMTVGELCAAAITMSDNSAANLLLATVGG 140


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS31750  BINARYTOXINA  29     0.029    Clostridial binary toxin A signature.
		>BINARYTOXINA#Clostridial binary toxin A signature.

Length = 454

Score = 29.3 bits (65), Expect = 0.029
Identities = 20/98 (20%), Positives = 42/98 (42%), Gaps = 5/98 (5%)

Query: 4 SYWLGLLIFLNGFFITTHAASNTLAVSASTNSKSAEYIQRADVKSYINDLVKQYGFSKAQ 63
S +L L + L F + A + AS +I+R + ++ D + K +
Sbjct: 9 SVFLILYLILTSSFPSYTYAQD--LQIASNYITDRAFIERPE--DFLKDKENAIQWEKKE 64

Query: 64 LERWFHHAKANQR-ALEILQRPAEKVWTWQQYRSWLVS 100
ER + ++ ALE+ ++ +E++ + Q R +
Sbjct: 65 AERVEKNLDTLEKEALELYKKDSEQISNYSQTRQYFYD 102


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS31775  LPSBIOSNTHSS  31     0.003    Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature.

Length = 166

Score = 30.6 bits (69), Expect = 0.003
Identities = 10/17 (58%), Positives = 13/17 (76%)

Query: 9 ALFGGTFDPIHSGHLRI 25
A++ G+FDPI GHL I
Sbjct: 3 AIYPGSFDPITFGHLDI 19



Score = 28.6 bits (64), Expect = 0.012
Identities = 7/26 (26%), Positives = 15/26 (57%)

Query: 180 ISSTMVRERLKKNESIRYLVPEPVEQ 205
+SS++V+E + ++ + VP V
Sbjct: 125 LSSSLVKEVARFGGNVEHFVPSHVAA 150


50  PSLF89_RS32215  PSLF89_RS32265  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
PSLF89_RS32215  0       13       4.092292    type II toxin-antitoxin system RatA family
PSLF89_RS32220  1       17       4.646861    SsrA-binding protein SmpB
PSLF89_RS32225  1       17       4.633658    dihydropteroate synthase
PSLF89_RS32235  1       15       4.805983    hypothetical protein
PSLF89_RS32240  1       16       3.794244    glutaminase A
PSLF89_RS32250  2       24       2.366687    prolyl aminopeptidase
PSLF89_RS32255  3       26       1.235462    D-tyrosyl-tRNA(Tyr) deacylase
PSLF89_RS32260  4       23       1.002655    DUF1853 family protein
PSLF89_RS32265  2       16       0.158982    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS32295  PF06580       28     0.020    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 27.5 bits (61), Expect = 0.020
Identities = 11/58 (18%), Positives = 25/58 (43%), Gaps = 6/58 (10%)

Query: 41 KSLRAGKLQLAESYVLIKRGEVFLIGSHITP------LNTASTHIKADPTRTRKLLLH 92
K+ + ++ + + + ++ + + I P LN I DPT+ R++L
Sbjct: 142 KNYKQAEIDQWKMASMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKAREMLTS 199


51  PSLF89_RS32730  PSLF89_RS32805  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
PSLF89_RS32730  -2      13       -3.226407   alpha/beta fold hydrolase
PSLF89_RS32735  -1      14       -3.259119   MATE family efflux transporter
PSLF89_RS32740  -1      20       -4.221785   hypothetical protein
PSLF89_RS32745  0       20       -4.947832   translational GTPase TypA
PSLF89_RS32750  -1      13       -1.733102   hypothetical protein
PSLF89_RS32755  0       12       -0.711121   transposase
PSLF89_RS32760  0       15       2.988190    IS982 family transposase
PSLF89_RS32765  0       16       3.062283    transposase
PSLF89_RS32770  0       21       4.544596    transposase
PSLF89_RS32775  -1      19       5.487307    ABC transporter permease subunit
PSLF89_RS32780  -1      22       5.073388    ABC transporter ATP-binding protein
PSLF89_RS32785  0       21       4.398404    dihydrofolate reductase
PSLF89_RS32790  -1      26       3.889246    LysR family transcriptional regulator
PSLF89_RS32795  0       22       3.322898    DMT family protein
PSLF89_RS32800  0       22       3.801628    ***hypothetical protein
PSLF89_RS32805  1       19       3.222766    IS982 family transposase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS32815  TCRTETOQM     172    3e-48    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 172 bits (437), Expect = 3e-48
Identities = 101/448 (22%), Positives = 172/448 (38%), Gaps = 88/448 (19%)

Query: 4 KLRNIAIIAHVDHGKTTLVDKLLEQSGT---LGRNESGERMMDSNDLEKERGITILAKNT 60
K+ NI ++AHVD GKTTL + LL SG LG + G D+ LE++RGITI T
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 61 AIQWQDYRINIVDTPGHADFGGEVERVLSMVDSVLLLVDAVDGPMPQTRFVTKKAFEQGL 120
+ QW++ ++NI+DTPGH DF EV R LS++D +LL+ A DG QTR + + G+
Sbjct: 62 SFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGI 121

Query: 121 NPIVVINKVDRPGARPDWVMDQV-------------FELFDQLGATDEQLD--------- 158
I INK+D+ G V + EL+ + T+
Sbjct: 122 PTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEG 181

Query: 159 --------------------------------FPVVYASALQGYASLEEGELGGDMTPLF 186
FPV + SA +G + L
Sbjct: 182 NDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNN--------IG--IDNLI 231

Query: 187 KTIIEKVAAPDVDPEGPFQMQVSSLDYSSYVGAIAIGRISRGKISTNSAIRIIDHQGNER 246
+ I K + + +V ++YS +A R+ G + ++RI +
Sbjct: 232 EVITNKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRI----SEKE 287

Query: 247 SGRILKIMTHHGLQRVETEQAFAGDIVCVTGIERPF---ISETFCSPDKVEPLPALTVDE 303
+I ++ T + + ++A++G+IV + + +T P + +
Sbjct: 288 KIKITEMYTSINGELCKIDKAYSGEIVILQNEFLKLNSVLGDTKLLPQR----ERIENPL 343

Query: 304 PTVSMMFCVNNSPFAGKEGKYVTSRQIRDRLEQELIYNVALRVENTEDPDKFRVSGRGEL 363
P + + + D L LR + +S G++
Sbjct: 344 PLLQTTVEPSKPQQREMLLDALLEISDSDPL---------LRYYVDSATHEIILSFLGKV 394

Query: 364 HLSILIETMRRE-GYELGVSRPEVILKE 390
+ + ++ + E+ + P VI E
Sbjct: 395 QMEVTCALLQEKYHVEIEIKEPTVIYME 422



Score = 30.2 bits (68), Expect = 0.027
Identities = 13/84 (15%), Positives = 29/84 (34%), Gaps = 1/84 (1%)

Query: 388 LKEIDGKLQEPFEELMLDIEEQHQGTVMERLGLRRGQLTNMIPDGKGRIRLDYQIPTRGL 447
LK+ +L EP+ + +++ + + + L +IP R +
Sbjct: 528 LKKAGTELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKN-NEVILSGEIPARCI 586

Query: 448 IGFHNDFLTMTSGSGIMTHVFDHY 471
+ +D T+G + Y
Sbjct: 587 QEYRSDLTFFTNGRSVCLTELKGY 610


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS32825  HTHFIS        27     0.031    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 27.1 bits (60), Expect = 0.031
Identities = 10/43 (23%), Positives = 19/43 (44%), Gaps = 1/43 (2%)

Query: 4 RHLNEKDRFYIEQRLSE-GDSLRSIARALGFSPSTISREIKRH 45
R L E + I L+ + A LG + +T+ ++I+
Sbjct: 431 RVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIREL 473


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS32850  PF05272       30     0.011    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.0 bits (67), Expect = 0.011
Identities = 12/59 (20%), Positives = 20/59 (33%), Gaps = 9/59 (15%)

Query: 47 EIVAIL-GKSGAGKSTFLRTIAGLLPASSGKVYHQDTLIEKP-IEEIAMVFQSFALLPW 103
+ +L G G GKST + T+ GL + DT + ++
Sbjct: 596 DYSVVLEGTGGIGKSTLINTLVGLD-------FFSDTHFDIGTGKDSYEQIAGIVAYEL 647


52  PSLF89_RS32915  PSLF89_RS33000  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
PSLF89_RS32915  -2      15       3.552453    pantoate--beta-alanine ligase
PSLF89_RS32920  -2      17       3.754568    transposase
PSLF89_RS32925  -1      18       3.700381    succinylglutamate desuccinylase
PSLF89_RS32935  1       16       2.977750    hypothetical protein
PSLF89_RS32940  0       16       2.512674    hypothetical protein
PSLF89_RS32945  2       20       0.858363    hypothetical protein
PSLF89_RS32950  1       23       0.121891    hypothetical protein
PSLF89_RS32955  -1      27       -1.768827   IS30 family transposase
PSLF89_RS32965  3       33       2.601448    hypothetical protein
PSLF89_RS36690  1       24       3.080686    MFS transporter
PSLF89_RS32970  1       15       1.438201    multidrug efflux MFS transporter
PSLF89_RS37220  1       18       0.332334    LysR family transcriptional regulator
PSLF89_RS32975  2       12       0.720696    ATP-binding protein
PSLF89_RS32980  2       12       0.881359    shikimate kinase AroK
PSLF89_RS32985  3       14       0.058647    type IV pilus secretin PilQ
PSLF89_RS32990  4       16       -0.174225   secretin and TonB N-terminal domain-containing
PSLF89_RS32995  5       18       -0.269321   hypothetical protein
PSLF89_RS33000  5       24       0.920067    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS33025  TCRTETA       45     3e-07    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 45.2 bits (107), Expect = 3e-07
Identities = 54/281 (19%), Positives = 87/281 (30%), Gaps = 16/281 (5%)

Query: 60 AGLGNLAATFFYSYLIIQIFSGPLLDRFGARYIGSLALLISALGTWLFAQADQLLWAEIG 119
A G L A + G L DRFG R + ++L +A+ + A A L IG
Sbjct: 43 AHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIG 102

Query: 120 RALMGV-GVAFATVTYLKVAATWFD--ARRFALLSGLVPTAVMIGAVFGQVPLAHVVASE 176
R + G+ G A T D AR F +S ++ G V G ++
Sbjct: 103 RIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGG-----LMGGF 157

Query: 177 GWRRSLELCAILGVIFAVLFLLFVRDKKNHSSVIDDTQQVNWQDIIS--VLKRPANWLLT 234
A L + + + + + +N L+
Sbjct: 158 SPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMA 217

Query: 235 LYSGLAFAPLAVFAGLWGNPFLVASYQLTTADAA-SLTSLVFIGLGVGGPIFGALADYFG 293
++ + V A LW F + SL + + I G +A G
Sbjct: 218 VFFIMQ-LVGQVPAALWV-IFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLG 275

Query: 294 KRTLWMFLGGFVTLASVLCLLYCLGLHSTLLSILMFLFGFG 334
+R M G + + LL + +M L G
Sbjct: 276 ERRALML--GMIADGTGYILLAFAT-RGWMAFPIMVLLASG 313


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS33030  TCRTETB       69     8e-15    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 68.8 bits (168), Expect = 8e-15
Identities = 88/402 (21%), Positives = 149/402 (37%), Gaps = 48/402 (11%)

Query: 12 ILVIFGVIAAQAAVSLYLPSLPAIDHEWHLVSGQAQLTLSAFFLTFGVSQLFYGALSDHF 71
IL F V+ + SLP I ++++ +AF LTF + YG LSD
Sbjct: 21 ILSFFSVLNE----MVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQL 76

Query: 72 GRKPLLLTGLVILVLSSVWAIYATSFHSLL-AARLVQGAGGGALSVLARAIIRDLFHGDE 130
G K LLL G++I SV SF SLL AR +QGAG A L ++ +
Sbjct: 77 GIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKEN 136

Query: 131 LRKAISILAIAASFTPALAPSLGGWLEDHFDWRSSFVILTVYSI---------------- 174
KA ++ + + P++GG + + W +I + I
Sbjct: 137 RGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIK 196

Query: 175 -----------ILLVTIFSLFTETNQYQRQSNESIDFSKVVASYYFVT---------KNK 214
+ + F LFT + + F V VT KN
Sbjct: 197 GHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNI 256

Query: 215 LFWCYGFAILIGYLSLVICLANAPFLLEKKFGLS-AEITGYLMFVQPGFFLVGNLLQHKL 273
F I + ++ ++ P++++ LS AEI ++F ++ + L
Sbjct: 257 PFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGIL 316

Query: 274 TDKISGDLFLKFGMVILGVIGLSFLLQGLFHSAT--LISVLLTLALAGFATSLILVNALA 331
D+ L G+ L V SFL T +++++ L G + + +++ +
Sbjct: 317 VDRRGPLYVLNIGVTFLSV---SFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIV 373

Query: 332 GVLLPFTENAGAAAALSGVLQMVGASFITALISNLHWTSIID 373
L E AGA +L + A++ L ++D
Sbjct: 374 SSSLKQQE-AGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLD 414


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS33040  PF05272       31     0.008    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.8 bits (69), Expect = 0.008
Identities = 11/24 (45%), Positives = 13/24 (54%)

Query: 48 CYSDYVVAVYGPIGAGKSTFLELL 71
C DY V + G G GKST + L
Sbjct: 593 CKFDYSVVLEGTGGIGKSTLINTL 616


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS33050  BCTERIALGSPD  206    1e-63    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 206 bits (526), Expect = 1e-63
Identities = 79/308 (25%), Positives = 140/308 (45%), Gaps = 17/308 (5%)

Query: 16 IVSFDQRTNILLIHDYVDKIKIIKKMIKALDRPVPQVMIEARIVIANRSFEKDLGVKFGV 75
I+ +TN L++ D + ++++I LD PQV++EA I + +LG+++
Sbjct: 311 IIKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWAN 370

Query: 76 SGGGSTVATAGSISGTNAIRQGESPGIAERLNVSLPFMTDSAATGLGRFALAVAKLPGNL 135
G T T + + AI ++ SL SA + A + GN
Sbjct: 371 KNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLA----SALSSFNGIAAGFYQ--GNW 424

Query: 136 LLDLELQALESEGEAEVISTPKLLTAHDQEAFIEQGEEIPYLESTSSG-----AASVSFK 190
+ L AL S + ++++TP ++T + EA G+E+P L + + +V K
Sbjct: 425 --AMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGSQTTSGDNIFNTVERK 482

Query: 191 KAVLGLTVTPHITPDQHIILTIRLSKDSRSALSAGDGGSSANVLPPAIDTRVIKTQALVK 250
+ L V P I ++L I S A S+++ L +TR + LV
Sbjct: 483 TVGIKLKVKPQINEGDSVLLEIEQEVSS----VADAASSTSSDLGATFNTRTVNNAVLVG 538

Query: 251 DGETIVLGGIYEQEKQRVVRRVPFLADLPGIGWLFQSRSQSTLNKELLIFVTPKIMSAAA 310
GET+V+GG+ ++ +VP L D+P IG LF+S S+ + L++F+ P ++
Sbjct: 539 SGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRD 598

Query: 311 SHGVLSTG 318
+ S+G
Sbjct: 599 EYRQASSG 606


53  PSLF89_RS33045  PSLF89_RS33150  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
PSLF89_RS33045  0       23       -5.192904   P-type conjugative transfer ATPase TrbB
PSLF89_RS33050  2       23       -7.464026   S26 family signal peptidase
PSLF89_RS33055  2       16       -3.588767   type IV secretion system protein
PSLF89_RS33060  1       18       -2.864679   hypothetical protein
PSLF89_RS33065  1       31       3.952799    hypothetical protein
PSLF89_RS33070  0       32       4.719667    TrbG/VirB9 family P-type conjugative transfer
PSLF89_RS33080  0       31       5.499977    hypothetical protein
PSLF89_RS33090  1       29       7.359529    conjugal transfer protein TrbE
PSLF89_RS33095  2       21       1.880807    VirB3 family type IV secretion system protein
PSLF89_RS33100  2       22       2.425764    TrbC/VirB2 family protein
PSLF89_RS33110  1       24       3.149735    biotin/lipoyl-binding protein
PSLF89_RS33115  1       23       2.330122    HlyD family efflux transporter periplasmic
PSLF89_RS33120  0       24       2.839068    YifB family Mg chelatase-like AAA ATPase
PSLF89_RS33125  -1      25       3.152066    accessory factor UbiK family protein
PSLF89_RS33130  -1      25       2.917454    P-II family nitrogen regulator
PSLF89_RS33135  -1      22       1.445495    biosynthetic arginine decarboxylase
PSLF89_RS33140  0       20       1.513591    hypothetical protein
PSLF89_RS33150  2       23       0.580199    glycosyltransferase family 2 protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS33140  PF04335       63     3e-14    VirB8 type IV secretion protein
		>PF04335#VirB8 type IV secretion protein

Length = 227

Score = 63.3 bits (154), Expect = 3e-14
Identities = 31/209 (14%), Positives = 71/209 (33%), Gaps = 11/209 (5%)

Query: 15 EAVTPYQKAAQEWDR-RIGSSRAQANSWRLIAIACIVACILLLIGMMMLIQQKKNVVYVA 73
+ + Y + A W+R ++ ++ ++A ++ + L K YV
Sbjct: 8 DELKAYFEEAASWERDKLAAAERSKKLAWVVAGVAGALATAGVVAVAALTPLKTVEPYVI 67

Query: 74 EVGSSG---QVINVVKTNQPYRPTDAQYQYFIAKFIRHAMSLPLDPVILKNNLLEAYQLT 130
V + + + + +A +YF+A ++R+ + ++
Sbjct: 68 TVDRNTGEASIAAKLHGDATITYDEAVRKYFLATYVRYREG--WIAAAREEYFDAVMVMS 125

Query: 131 ASKGRLQFNELMK---KLQPTRHIGQLTQT-VEVQMVEQITPNSYSATWRQTSYDQNGKV 186
A + +++ K P + T VE++ V + N + + S +
Sbjct: 126 ARPEQDRWSRFYKTDNPQSPQNILANRTDVFVEIKRVSFLGGNVAQVYFTKESVTGSNST 185

Query: 187 TQVKRYHGVFTVSQTMPTTEHEILVNPLG 215
+ V P+ E + NPLG
Sbjct: 186 KTDAVATIKYKVD-GTPSKEVDRFKNPLG 213


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS33170  RTXTOXIND     38     4e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 38.3 bits (89), Expect = 4e-06
Identities = 22/110 (20%), Positives = 42/110 (38%), Gaps = 4/110 (3%)

Query: 9 IISPKTVMLKAQQAGLVTHIYFQSGEQVNKGQRLLQIDNHKQQASLAKAKADLFSLKADY 68
S ++ +K + +V I + GE V KG LL++ +A K ++ L + +
Sbjct: 91 THSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQ 150

Query: 69 QRNLQMAQKNHVSISANTLDQKLGTVRAAQAAVASAKESLAETTVRAPFA 118
R Q SI N L + V+ + + ++ F+
Sbjct: 151 TR----YQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFS 196


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS33175  RTXTOXIND     29     0.014    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 28.6 bits (64), Expect = 0.014
Identities = 17/108 (15%), Positives = 36/108 (33%), Gaps = 13/108 (12%)

Query: 10 TLGSYLAEGDSIVMLT-DSKALLVQYQLPQEYSAQMAINQHVHITTAQQAWAEKTDKPPV 68
T G + ++++++ + L V + + + + Q+ I +A+
Sbjct: 345 TEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKV--EAFPYTR--YGY 400

Query: 69 TTSTVSYISPILITNSHAYLAH-ARINTLNNTMI-------LKPGMTV 108
V I+ I + L I+ N + L GM V
Sbjct: 401 LVGKVKNINLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAV 448


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
PSLF89_RS33205  HTHFIS  38     1e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 37.5 bits (87), Expect = 1e-04
Identities = 33/192 (17%), Positives = 60/192 (31%), Gaps = 49/192 (25%)

Query: 173 LEQAYFQVDTRQTHYLDMADVKGQA----HAKRALEIAAAGRHHLLFVGPPGTGKTMLAS 228
L + + + D + G++ R L L+ G GTGK ++A
Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVAR 178

Query: 229 RLPGILPALSNQEALESAAVHSL----TSAEIDLSCWFIPKFASPHHTASSIAMVG---- 280
A+H + ++ IP+ + G
Sbjct: 179 ------------------ALHDYGKRRNGPFVAINMAAIPR------DLIESELFGHEKG 214

Query: 281 ---GGSVPKPGEISRAHHGVLFLDEL----PEFDRKVLEVLREPLESGQIDIIRASHRAS 333
G G +A G LFLDE+ + ++L VL++ + R
Sbjct: 215 AFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQG------EYTTVGGRTP 268

Query: 334 FPASFQLIAAMN 345
+ +++AA N
Sbjct: 269 IRSDVRIVAATN 280


54  PSLF89_RS33285  PSLF89_RS33320  Y  N  N  Genomic Island
LocusTag        DNBias  CDNBias  %GCBias    Product
PSLF89_RS33285       0       28  -3.146426  glycine--tRNA ligase subunit beta
PSLF89_RS33290       1       27  -5.148935  acyl-CoA thioesterase
PSLF89_RS33295       2       27  -5.728551  D-glycero-beta-D-manno-heptose 1,7-bisphosphate
PSLF89_RS33300       1       20  -4.664247  1-acyl-sn-glycerol-3-phosphate acyltransferase
PSLF89_RS33305       3       20  -5.756193  hypothetical protein
PSLF89_RS33310       1       19  -5.054502  sterol desaturase family protein
PSLF89_RS33315       1       22  -4.194452  DMT family transporter
PSLF89_RS33320       1       24  -3.341833  4-oxalomesaconate tautomerase
55  PSLF89_RS33385  PSLF89_RS33540  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
PSLF89_RS33385       3       24  -2.493953  Sua5/YciO/YrdC/YwlC family protein
PSLF89_RS33390       0       22  -2.187143  oxygen-dependent coproporphyrinogen oxidase
PSLF89_RS33395       0       22  -2.932702  DUF4823 domain-containing protein
PSLF89_RS33400      -1       22  -3.587683  IS30 family transposase
PSLF89_RS33405       0       14  -2.335504  hypothetical protein
PSLF89_RS33410       0       15  -1.395273  IS30 family transposase
PSLF89_RS33415       0       17   1.126382  hypothetical protein
PSLF89_RS33420       1       23   1.417902  IS4 family transposase
PSLF89_RS33430       1       16   3.072106  hypothetical protein
PSLF89_RS33435       0       13   4.017795  hypothetical protein
PSLF89_RS33440      -1       14   4.496815  hypothetical protein
PSLF89_RS33445      -1       18   4.547526  hypothetical protein
PSLF89_RS33450      -1       24   4.605893  16S rRNA (uracil(1498)-N(3))-methyltransferase
PSLF89_RS33455      -1       27   4.204116  glycoside hydrolase family 3 protein
PSLF89_RS33460      -1       20   2.587254  S41 family peptidase
PSLF89_RS33465      -1       18   1.709807  peptidoglycan DD-metalloendopeptidase family
PSLF89_RS33470      -2       18   0.959426  2,3-bisphosphoglycerate-independent
PSLF89_RS33475       0       18  -0.226083  hypothetical protein
PSLF89_RS33480       2       19  -1.120661  transposase
PSLF89_RS33485       4       19  -1.221343  DUF3750 domain-containing protein
PSLF89_RS33490       1       19  -1.279800  hypothetical protein
PSLF89_RS36710      -1       17  -1.696886  DegT/DnrJ/EryC1/StrS family aminotransferase
PSLF89_RS33505      -2       19  -2.175356  hypothetical protein
PSLF89_RS33510      -2       20  -2.207142  DNA polymerase I
PSLF89_RS33515       0       20  -6.131045  enoyl-ACP reductase FabI
PSLF89_RS33520      -1       18  -5.541451  hypothetical protein
PSLF89_RS33525       1       21  -6.594263  M4 family metallopeptidase
PSLF89_RS33530      -1       17  -5.722760  hypothetical protein
PSLF89_RS33535       0       18  -6.589245  GrpB family protein
PSLF89_RS33540       0       17  -4.623786  chloride channel protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS33605  DHBDHDRGNASE  55     3e-11    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 55.1 bits (132), Expect = 3e-11
Identities = 59/260 (22%), Positives = 102/260 (39%), Gaps = 18/260 (6%)

Query: 2 LTGKKGLIIGLANENSIAFGCAKILKQQGAELI-LTHRSEKSYKQARFLANE-LSADLYQ 59
+ GK I G A I A+ L QGA + + + EK K L E A+ +
Sbjct: 6 IEGKIAFITGAAQ--GIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP 63

Query: 60 CDVTNQENIKQLFPYIKKKWQTLDFVIHSLAFAKPRELQGRLVDTSSQAFLQAMDISCHS 119
DV + I ++ I+++ +D +++ +P G + S + + ++
Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRP----GLIHSLSDEEWEATFSVNSTG 119

Query: 120 FLRIAQQAESLMPN--GGSLISMSYLGAQKVMKNYNMMGPIKAALEASIKYLAVELAEKN 177
++ M + GS++++ A + KAA K L +ELAE N
Sbjct: 120 VFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYN 179

Query: 178 IRVYGISPGLMPTRAATGIKNLDQLLAATTRKS--------PMQRIINQEEVGALASFLV 229
IR +SPG T + + + S P++++ ++ FLV
Sbjct: 180 IRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLV 239

Query: 230 SDFASGMTGQTLFVDGGYNL 249
S A +T L VDGG L
Sbjct: 240 SGQAGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
PSLF89_RS33615  THERMOLYSIN  253    4e-79    Thermolysin metalloprotease (M4) family signature.
		>THERMOLYSIN#Thermolysin metalloprotease (M4) family signature.

Length = 544

Score = 253 bits (648), Expect = 4e-79
Identities = 130/534 (24%), Positives = 190/534 (35%), Gaps = 105/534 (19%)

Query: 79 GDTYVRYQQKYEGIPVIGKQVVV-KQPKAVTGFAATSRSASRATATRISLAKDLDVDLVA 137
G T +R++Q +G +V ++ + T L +LD +
Sbjct: 87 GHTVMRFEQAIAASLCMGAVLVAHVNDGELSSLSGT-------------LIPNLDKRTLK 133

Query: 138 T---VSAGDAMAFAKQQFEQSYSGTQVADGSNSVKATKEIRIVDNKARLYYRVTFNASNT 194
T +S A AKQ + + A I + RL Y V
Sbjct: 134 TEAAISIQQAEMIAKQDVADRVTKERPAA-EEGKPTRLVIYPDEETPRLAYEVNVR---F 189

Query: 195 AGGKPYSMVYIIAANGGAKPVVLKHWDNIQNYE--DTGPGGNEKTVKHGPTGVEFFYGEN 252
P + +Y+I A G VL W+ + + P TV G
Sbjct: 190 LTPVPGNWIYMIDAADGK---VLNKWNQMDEAKPGGAQPVAGTSTVGVG----------- 235

Query: 253 NLPALNVSENNGS-CTMDNGDVRLVDVQNQED----HSWDSDYNTTAYQYSCGHNQGDPI 307
V + T + +Q+ ++D T
Sbjct: 236 ----RGVLGDQKYINTTYSSYYGYYYLQDNTRGSGIFTYDGRNRTVLPGSLWADGDNQFF 291

Query: 308 NGAYSPTDDAYYFGSMIIDMYKNWYGVDALQENGEPMQLIMRVHYGTDYDNAFWDGQTMS 367
+ DA+Y+ ++ D YKN +G + +G + VHYG Y+NAFW+G M
Sbjct: 292 ASYDAAAVDAHYYAGVVYDYYKNVHGRLSY--DGSNAAIRSTVHYGRGYNNAFWNGSQMV 349

Query: 368 FGDG--SSFYPLV-SLDVAGHEVSHGFTEQHSGLEYSDQSGSLNEAFSDMAGQAVRAYLL 424
+GDG +F P +DV GHE++H T+ +GL Y ++SG++NEA SD+ G V Y
Sbjct: 350 YGDGDGQTFLPFSGGIDVVGHELTHAVTDYTAGLVYQNESGAINEAMSDIFGTLVEFYAN 409

Query: 425 STNSDLYKQLYFNQDEVTWGIGETIMKGDNTDTALRYMDQPSKDQDENGVSADCLDKDLA 484
W IGE I ALR M P+K D + S
Sbjct: 410 RNPD--------------WEIGEDIYTPGVAGDALRSMSDPAKYGDPDHYSKRYTGTQDN 455

Query: 485 GSGCIISYDDVVTAAKKLPLRYQQSYIVHHGSGVFNKAFYLLSQQ----------VGIKE 534
G VH SG+ NKA YLLSQ +G +
Sbjct: 456 GG-------------------------VHTNSGIINKAAYLLSQGGVHYGVSVTGIGRDK 490

Query: 535 AFKVMKDANATRWTSGSDFADAACGVLQAAHADGVGSDSM----IKEVFNQVGV 584
K+ A T S+F+ +QAA AD GS S +K+ FN VGV
Sbjct: 491 MGKIFYRALVYYLTPTSNFSQLRAACVQAA-ADLYGSTSQEVNSVKQAFNAVGV 543


56  PSLF89_RS33670  PSLF89_RS33695  Y  N  N  Genomic Island
LocusTag        DNBias  CDNBias  %GCBias    Product
PSLF89_RS33670       2       20   3.031309  YcgL domain-containing protein
PSLF89_RS33675       2       19   3.499517  adenosylmethionine--8-amino-7-oxononanoate
PSLF89_RS33680       3       22   2.870541  DNA topoisomerase (ATP-hydrolyzing) subunit B
PSLF89_RS33685       3       22   1.763447  DNA replication/repair protein RecF
PSLF89_RS33690       3       23   2.465031  DNA polymerase III subunit beta
PSLF89_RS33695       3       24   2.072310  chromosomal replication initiator protein DnaA
57  PSLF89_RS12965  PSLF89_RS13870  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
PSLF89_RS12965       1       18  -2.897431  hypothetical protein
PSLF89_RS12975       1       16  -0.168549  hypothetical protein
PSLF89_RS13490      -1       17   2.458781  hypothetical protein
PSLF89_RS13650       0       17   4.061367  hypothetical protein
PSLF89_RS13660       0       18   4.605644  YhbY family RNA-binding protein
PSLF89_RS13830       1       20   5.062644  hypothetical protein
PSLF89_RS13835       1       19   4.964444  23S rRNA (uridine(2552)-2'-O)-methyltransferase
PSLF89_RS13870       1       19   4.290774  ATP-dependent zinc metalloprotease FtsH
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS12965  FLGPRINGFLGI  28     0.045    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 27.6 bits (61), Expect = 0.045
Identities = 13/80 (16%), Positives = 26/80 (32%), Gaps = 10/80 (12%)

Query: 130 IESADTAGFK---TIYADNNNSDTTNGTHYIQQLEEIIAERTIKPYYKQAEENL---AIA 183
IE + FK + N D + ++ +++ Y E IA
Sbjct: 179 IERELPSKFKDSVNLVLQLRNPDFST----AVRVADVVNAFARARYGDPIAEPRDSQEIA 234

Query: 184 IRKRELEHKIAFLTIIEKIK 203
++K + + IE +
Sbjct: 235 VQKPRVADLTRLMAEIENLT 254


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS13490  SECBCHAPRONE  29     0.019    Bacterial protein-transport SecB chaperone protein ...
		>SECBCHAPRONE#Bacterial protein-transport SecB chaperone protein

signature.
Length = 170

Score = 29.1 bits (65), Expect = 0.019
Identities = 14/79 (17%), Positives = 29/79 (36%), Gaps = 6/79 (7%)

Query: 255 HIAAENWQPSVGVQFNQYQSQVMQAVFEFSMSEKSDLKALISQLAKYEAE------FLSS 308
HI ++W+P + + QV ++E ++ + S + E F S
Sbjct: 39 HIFQQDWEPKLSFDLSTEAKQVGDDLYEVCLNISVETTMESSGDVAFICEVKQAGVFTIS 98

Query: 309 SKIKENIPEFNEAICPRIL 327
+ + + CP +L
Sbjct: 99 GLEEMQMAHCLTSQCPNML 117


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
PSLF89_RS13660  TONBPROTEIN  26     0.043    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 26.5 bits (58), Expect = 0.043
Identities = 14/61 (22%), Positives = 21/61 (34%), Gaps = 11/61 (18%)

Query: 87 QAPTKPAAEKAKTKSKPNSKVKAREKRQEIKAKAEKEEQARKKAKYFKKVTQPRAPRNNQ 146
+ P + K K KP K K +K QE + K ++P +P N
Sbjct: 80 EPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVK-----------PVESRPASPFENT 128

Query: 147 N 147

Sbjct: 129 A 129


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
PSLF89_RS13870  HTHFIS  33     0.003    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 33.3 bits (76), Expect = 0.003
Identities = 21/82 (25%), Positives = 32/82 (39%), Gaps = 18/82 (21%)

Query: 189 VLMVGPPGTGKTLLAKAI---AGEAKVPFFS-----ISGSDFVEMFVGV------GASRV 234
+++ G GTGK L+A+A+ PF + I G GA
Sbjct: 163 LMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTR 222

Query: 235 RD-MFDQAKKRAPCIIFIDEID 255
F+QA+ +F+DEI
Sbjct: 223 STGRFEQAEGGT---LFLDEIG 241


58  PSLF89_RS19695  PSLF89_RS19755  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
PSLF89_RS19695      -1       17  -0.196195  sigma-54 dependent transcriptional regulator
PSLF89_RS19700      -1       17  -1.709792  PAS domain-containing protein
PSLF89_RS19705      -1       17  -0.005012  sigma-54 dependent transcriptional regulator
PSLF89_RS19710       0       17   1.215205  flagellar hook-basal body complex protein FliE
PSLF89_RS19715       0       17   0.858341  flagellar M-ring protein FliF
PSLF89_RS19720       0       16  -0.315989  flagellar motor switch protein FliG
PSLF89_RS19725       1       17  -1.017870  flagellar assembly protein FliH
PSLF89_RS19730       0       13   0.588560  flagellar protein export ATPase FliI
PSLF89_RS19735       0       16  -0.008624  flagellar export protein FliJ
PSLF89_RS19740       1       16   0.533969  flagellar hook-length control protein FliK
PSLF89_RS19745       1       21   0.850603  hypothetical protein
PSLF89_RS19750      -1       16   3.112866  hypothetical protein
PSLF89_RS19755      -1       16   3.279162  MFS transporter
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
PSLF89_RS19695  HTHFIS  422    e-146    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 422 bits (1086), Expect = e-146
Identities = 164/488 (33%), Positives = 248/488 (50%), Gaps = 14/488 (2%)

Query: 1 MLKQRVVIVSQCQVSANELKLLFEFMGENVAVCLN-NDDWTMLLHDNDPLLLCVAHDALS 59
M +++ L G +V + N W + + L++
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 60 HVFALYHELKHQNKIACECRFVVVAEPERIKRHYSAKELKNVCGYLVKPYRYAQLEQVLD 119
+ F L L K + +V++ A E YL KP+ L +++
Sbjct: 61 NAFDL---LPRIKKARPDLPVLVMSAQNTFMTAIKASEK-GAYDYLPKPF---DLTELIG 113

Query: 120 NVQTAQTANEERLAAGRSEVQDELNQLLVGKSRAIRRVRQLIRQVAKSEVNVLILGCSGT 179
+ A + R + + QD + LVG+S A++ + +++ ++ ++++ ++I G SGT
Sbjct: 114 IIGRALAEPKRRPSKLEDDSQDGMP--LVGRSAAMQEIYRVLARLMQTDLTLMITGESGT 171

Query: 180 GKEVVSQAIHRASVRAQQAFVPVNCGAIPADLLESELFGHEKGAFTGAIASRQGRFELAQ 239
GKE+V++A+H R FV +N AIP DL+ESELFGHEKGAFTGA GRFE A+
Sbjct: 172 GKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAE 231

Query: 240 KGTLFLDEIGDMPLNMQVKLLRVLQERTFERVGSNKALECDVRVIAATHRNLEELIEEGL 299
GTLFLDEIGDMP++ Q +LLRVLQ+ + VG + DVR++AAT+++L++ I +GL
Sbjct: 232 GGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGL 291

Query: 300 FREDLFYRLNVFPIEMPSLAERSEDIPLLIKELVSRIQREGRGRIRFTAEALARLKNYHW 359
FREDL+YRLNV P+ +P L +R+EDIP L++ V + ++EG RF EAL +K + W
Sbjct: 292 FREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPW 351

Query: 360 PGNVRELANLVERLTVSYANRWVDVPQLPPKFLTAEDIEACNALDESDGPLYEETAPIDP 419
PGNVREL NLV RLT Y + + + + G L A +
Sbjct: 352 PGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEEN 411

Query: 420 IDIEANMVGPSTVHLPGEGVDLKSYLTTIEANLIQAALDQSNGVVAHAAKRLSIRRTTLV 479
+ G + L +E LI AAL + G AA L + R TL
Sbjct: 412 MRQYFASFGDA----LPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLR 467

Query: 480 EKIRKLNL 487
+KIR+L +
Sbjct: 468 KKIRELGV 475


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
PSLF89_RS19705  HTHFIS  455    e-160    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 455 bits (1173), Expect = e-160
Identities = 161/484 (33%), Positives = 268/484 (55%), Gaps = 20/484 (4%)

Query: 1 MSQGVVLIVEDEAALAEAIKETLSLANLPSIIANHAEEALEKIKRHNILIVISDINMPGI 60
M+ +L+ +D+AA+ + + LS A I ++A I + +V++D+ MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 SGHELLKQIKRYQSDIPVLLMTAFSNIEGAVQAMRDGAVDYIAKPFEPEYLVECVQHFID 120
+ +LL +IK+ + D+PVL+M+A + A++A GA DY+ KPF+ L+ + +
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 121 K----------KIYDEKNPIAEDLNTKKLFSLAKKVAATDASVLITGESGTGKEVLSRFI 170
+ D + ++++ + ++ TD +++ITGESGTGKE+++R +
Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARAL 180

Query: 171 HHHSSRYKNSFIAINCAAIPENMLEAVLFGYEKGAFTGAYQACPGKFEQANGGTLLLDEI 230
H + R F+AIN AAIP +++E+ LFG+EKGAFTGA G+FEQA GGTL LDEI
Sbjct: 181 HDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEI 240

Query: 231 SEMDLNLQAKLLRVLQEKEVERLGGRKLIQLDVRIIATSNRKIQDYIKDGRFREDLYYRI 290
+M ++ Q +LLRVLQ+ E +GGR I+ DVRI+A +N+ ++ I G FREDLYYR+
Sbjct: 241 GDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRL 300

Query: 291 NVFPLQWQPLRSRINDIVPLAKRLVYQYANKE--KVPELTQAAEKKLTEYFWPGNIRELD 348
NV PL+ PLR R DI L + V Q A KE V Q A + + + WPGN+REL+
Sbjct: 301 NVVPLRLPPLRDRAEDIPDLVRHFV-QQAEKEGLDVKRFDQEALELMKAHPWPGNVRELE 359

Query: 349 NVMQRAIILHVGDKIEIDDIQLDSDWQSDEYDESEINNINNNFKNKGEKNIDDINGDYSG 408
N+++R L+ D I + I+ + +S+ D + + +++ Y
Sbjct: 360 NLVRRLTALYPQDVITREIIENEL--RSEIPDSPIEKAAARSGSLSISQAVEENMRQYFA 417

Query: 409 DNKNGKSDNLSYEMKHHEFD--IILKSLEKHKGVRKKVSEELDISSRTLRYKLAKMREAG 466
+ + Y+ E + +IL +L +G + K ++ L ++ TLR K+ ++ G
Sbjct: 418 SFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIREL---G 474

Query: 467 ITIP 470
+++
Sbjct: 475 VSVY 478


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
PSLF89_RS19710  FLGHOOKFLIE  58     1e-14    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE

signature.
Length = 103

Score = 58.1 bits (140), Expect = 1e-14
Identities = 35/106 (33%), Positives = 55/106 (51%), Gaps = 5/106 (4%)

Query: 8 SAEQAVLNVMQQLAAKAANEKTAAGSGVGDNHANTFSNLLKVSLNTVNKHQINSANLQKS 67
SA Q + V+ QL A A +A +F+ L +L+ ++ Q + +
Sbjct: 1 SAIQGIEGVISQLQATA---MSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEK 57

Query: 68 FEVGEATLP--EVIVAMQKASVSFTAIKEVRNKLIDAYRQVMNMPV 111
F +GE + +V+ MQKASVS +VRNKL+ AY++VM+M V
Sbjct: 58 FTLGEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS19715  FLGMRINGFLIF  396    e-134    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 396 bits (1018), Expect = e-134
Identities = 198/566 (34%), Positives = 302/566 (53%), Gaps = 40/566 (7%)

Query: 12 IEGFNRLNWLKQVALMIGLSVSIASGVAVIMWTKTSNYEPVFSSVDSLSLPHIVQSLKQS 71
+E NRL ++ L++ S ++A VA+++W KT +Y +FS++ IV L Q
Sbjct: 13 LEWLNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQM 72

Query: 72 NIEFKLDERRNLILVAKDQVNKARIALAENGVSGRISTGFESLGKDSSFGTSQFMETVRY 131
NI ++ I V D+V++ R+ LA+ G+ + GFE L + FG SQF E V Y
Sbjct: 73 NIPYRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQ-EKFGISQFSEQVNY 131

Query: 132 RHALEGELSRTISSIQGVRSSRVHLAIPKQSSFLKSQKEARASVFINLQGGY-LEKSQVA 190
+ ALEGEL+RTI ++ V+S+RVHLA+PK S F++ QK ASV + L+ G L++ Q++
Sbjct: 132 QRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQIS 191

Query: 191 AIVNLVASSVPNLKRSQVSVVDQHGNLLTHAMEGGGFAATERQFAYQRQVESAYVQRILN 250
A+V+LV+S+V L V++VDQ G+LLT G + Q + VES +RI
Sbjct: 192 AVVHLVSSAVAGLPPGNVTLVDQSGHLLT-QSNTSGRDLNDAQLKFANDVESRIQRRIEA 250

Query: 251 ILEPIVGSGNVRAQVTANVDFTKSEKTQETFNPDMKAVRS----EFLLNEEKSGEAGLGG 306
IL PIVG+GNV AQVTA +DF E+T+E ++P+ A ++ L E+ G GG
Sbjct: 251 ILSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPGG 310

Query: 307 IPGALSNQPPGIGTAPEKA--VGEEGAEKTKQ----------TPTSKRNESTRNYEVDRL 354
+PGALSNQP AP ++ A+ T Q P S + T NYEVDR
Sbjct: 311 VPGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSNYEVDRT 370

Query: 355 ISHTRGQLGRVMRLTVAVVLNNKTTRDDKGKITAAAIKQDEINRIAQLVRDAVGFDVARG 414
I HT+ +G + RL+VAVV+N KT D K + D++ +I L R+A+GF RG
Sbjct: 371 IRHTKMNVGDIERLSVAVVVNYKTLADGKPL----PLTADQMKQIEDLTREAMGFSDKRG 426

Query: 415 DSLNVVNLPFVKEVTAKPPVIPLWEQGWFISLLKQVLGGLFILILVL----FILRPTLRS 470
D+LNVVN PF +P W+Q FI L L +L++ +RP L
Sbjct: 427 DTLNVVNSPFSAVDNTGGE-LPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVRPQLTR 485

Query: 471 LAGKSKAELFDQKMQLAREVGIELDANGNPIVPEEEPVVDEFERPLDLPHDSDDQERNIN 530
++KA +++ E +E+ + +E + L + Q
Sbjct: 486 RVEEAKAAQEQAQVRQETEEAVEVRLSK------DEQLQQRRANQR-LGAEVMSQR---- 534

Query: 531 FVKQLVEKDAKLVAQVIKEWVSEDEQ 556
++++ + D ++VA VI++W+S D +
Sbjct: 535 -IREMSDNDPRVVALVIRQWMSNDHE 559


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS19720  FLGMOTORFLIG  265    4e-89    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 265 bits (678), Expect = 4e-89
Identities = 102/330 (30%), Positives = 192/330 (58%)

Query: 9 NLDGIQKSSIFLMTVGKDVAATILQHLNPREVQRVGEAMVKTTKVEKSEVKYVFDIFYDA 68
L G QK++I L+++G ++++ + ++L+ E++ + + K + V F +
Sbjct: 14 ALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFKEL 73

Query: 69 VARQTGLGIGSDEYIREMLVGAMGEEQAGGVIERILIGGSTKGLDSLKWMDARAVADVIR 128
+ Q + G +Y RE+L ++G ++A +I + ++ + ++ D + + I+
Sbjct: 74 MMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFVRRADPANILNFIQ 133

Query: 129 YEHPQIMSIVLSYIDGDQAAEVLAHLPMNQRSDLMMRVASLEAVQPAALRELNEILEKQF 188
EHPQ ++++LSY+D +A+ +L+ LP ++++ R+A ++ P +RE+ +LEK+
Sbjct: 134 QEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLEKKL 193

Query: 189 AGKQSAQAAAIGGVKTAADIMNFLDSTIEGEIMEEVKAADEELGHQIEDLMFVFDDLINI 248
A S + GGV +I+N D E I+E ++ D EL +I+ MFVF+D++ +
Sbjct: 194 ASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDIVLL 253

Query: 249 ADRDMQRLLTDVEQDKLMLALKGADNSMKEKIFNNMSSRAAAMLREDLEVSAPARLSDVE 308
DR +QR+L +++ +L ALK D ++EKIF NMS RAA+ML+ED+E P R DVE
Sbjct: 254 DDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRKDVE 313

Query: 309 TAQKEILATARNLADQAEISLGGAGGEEMV 338
+Q++I++ R L +Q EI + G E+++
Sbjct: 314 ESQQKIVSLIRKLEEQGEIVISRGGEEDVL 343



Score = 30.9 bits (70), Expect = 0.006
Identities = 19/111 (17%), Positives = 46/111 (41%), Gaps = 4/111 (3%)

Query: 121 RAVADVIRYEHPQIMSIVLSYIDGDQAAEVLAHLPMNQRSDLMMRVASLEAVQPAALREL 180
+ + DV Q +I+L I + +++V +L + L +A LE +
Sbjct: 7 KEILDVSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITS---ELK 63

Query: 181 NEILEKQFAGKQSAQAAAIGGVKTAADIMN-FLDSTIEGEIMEEVKAADEE 230
+ +L + + + GG+ A +++ L + +I+ + +A +
Sbjct: 64 DNVLLEFKELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQS 114


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
PSLF89_RS19725  FLGFLIH  45     1e-07    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 45.2 bits (106), Expect = 1e-07
Identities = 42/190 (22%), Positives = 85/190 (44%), Gaps = 20/190 (10%)

Query: 132 DLEELHKKAHEDGFAIGKAAGFSAGQAAGE----AQGYQEAYAQAQTE---INQKKQELE 184
L +L +AHE G+ G A G G G AQG ++ A+A+++ I+ + Q+L
Sbjct: 43 QLAQLQMQAHEQGYQAGIAEGRQQGHKQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLV 102

Query: 185 QEQLKLIEMMNSLTHPFEEVSDKLKDELLHFITQLSEEIAKEQCLISADGLKDIINQILA 244
E ++ ++S+ + L+ + + ++ + + L I Q+L
Sbjct: 103 SEFQTTLDALDSV----------IASRLMQMALEAARQVIGQTPTVDNSALIKQIQQLLQ 152

Query: 245 K--LFSEEKIRISLNPVDIERIKEQENEELLSENIDFIEDDAITVGGCVVDAGASRVDMT 302
+ LFS K ++ ++P D++R+ + L D + GGC V A +D +
Sbjct: 153 QEPLFS-GKPQLRVHPDDLQRVDDMLGATLSLHGWRLRGDPTLHPGGCKVSADEGDLDAS 211

Query: 303 MENRIRDMTQ 312
+ R +++ +
Sbjct: 212 VATRWQELCR 221


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
PSLF89_RS19735  FLGFLIJ  40     5e-07    Flagellar FliJ protein signature.
		>FLGFLIJ#Flagellar FliJ protein signature.

Length = 147

Score = 40.2 bits (93), Expect = 5e-07
Identities = 34/142 (23%), Positives = 69/142 (48%), Gaps = 2/142 (1%)

Query: 1 MKRSQRLVNIIKIAEYQERKLAKQLAASRNTLKQYQEQLAMLDLYLNDYLKKLSAIKKNN 60
M L + +AE + A+ L R +Q +EQL ML Y N+Y L++
Sbjct: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60

Query: 61 QEVTISKLTIYHDFIQTIEQGIQRQQHFIADASIVIQRHEQEWRKARAKVESFKHLQQKF 120
+T ++ Y FIQT+E+ I + + + + + WR+ + ++++++ LQ++
Sbjct: 61 --ITSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQ 118

Query: 121 KNAEDRELDRQEQRMIDDYVNR 142
A +R +Q+ +D++ R
Sbjct: 119 STAALLAENRLDQKKMDEFAQR 140


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
PSLF89_RS19740  FLGFLGJ  30     0.019    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 30.1 bits (67), Expect = 0.019
Identities = 18/68 (26%), Positives = 29/68 (42%), Gaps = 5/68 (7%)

Query: 353 LMIKLHVDASQKTHLTFTTHSDVVREMIEQQLPRLKDMFDSQGLALGDANVAGQGTFSQG 412
+M+K DA K L + H+ + M +QQ+ + M +GL L + V +
Sbjct: 47 MMLKSMRDALPKDGLFSSEHTRLYTSMYDQQIA--QQMTAGKGLGLAEMMVK---QMTPE 101

Query: 413 QHFNEEKE 420
Q EE
Sbjct: 102 QPLPEEST 109


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
PSLF89_RS19755  TCRTETA  43     2e-06    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 42.5 bits (100), Expect = 2e-06
Identities = 58/345 (16%), Positives = 119/345 (34%), Gaps = 25/345 (7%)

Query: 53 ATGVGLLSSFYYYSYAAMQIPAGLAFDRMNARILITVSLTICAIGTLLFSLTDSFTLASL 112
G+L + Y A G DR R ++ VSL A+ + + + +
Sbjct: 42 TAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYI 101

Query: 113 GRFFTGFGSAFAFIAMLFIA---AQWFPTRYFGLIAGIGQFLASIGALAGQGPLAAIVSD 169
GR G A +A +IA R+FG ++ G +AG L ++
Sbjct: 102 GRIVAGITGATGAVAGAYIADITDGDERARHFGFMSAC----FGFGMVAGPV-LGGLMGG 156

Query: 170 LGWREALQGLGFIGITLAVIILLILKDKRHHHTDNQSITKKSKTTPNNHLSIKQQLTILF 229
+ + +L + + K + P ++ + +
Sbjct: 157 FSPHAPFFAAAALNGLNFLTGCFLLPE-----------SHKGERRPLRREALNPLASFRW 205

Query: 230 KHPETFKIALY--SFAAWAPITIFASLWGVPFLRTHYQLTINDAA-NLSSTIWLGIALGS 286
T AL F + A+LW V F + +L++ L +
Sbjct: 206 ARGMTVVAALMAVFFIMQLVGQVPAALW-VIFGEDRFHWDATTIGISLAAFGILHSLAQA 264

Query: 287 PLIGYWSDKIRQRKPLLLLAATLGIIASLIVLYSPSLPVTLLYVLMFFFGVGA-AGQSLS 345
+ G + ++ +R+ L+L G L+ + + VL+ G+G A Q++
Sbjct: 265 MITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAML 324

Query: 346 FAYIKDYQQDNILGTAIGFNNMAVVISGALFQPLVGFIMSQLWDG 390
+ + +Q + G+ ++ ++ LF + ++ W+G
Sbjct: 325 SRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASITT-WNG 368


59  PSLF89_RS37085  PSLF89_RS23180  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
PSLF89_RS37085      -1       22  -0.391065  hypothetical protein
PSLF89_RS23160       2       20   0.310047  hypothetical protein
PSLF89_RS36065       3       22   0.756214  hypothetical protein
PSLF89_RS36070       6       25   0.528150  CHASE domain-containing protein
PSLF89_RS36075       4       25  -0.650143  IS4 family transposase
PSLF89_RS36080       4       23  -1.173575  response regulator
PSLF89_RS36085       4       24  -1.245661  sulfate transporter CysZ
PSLF89_RS23180       1       25  -1.507744  chromosome segregation protein SMC
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
PSLF89_RS36085  TONBPROTEIN  33     0.001    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 33.4 bits (76), Expect = 0.001
Identities = 17/92 (18%), Positives = 26/92 (28%), Gaps = 11/92 (11%)

Query: 319 TNAMEFADDNAEDLPPPPPPSNPDLQQPLPDFNDSDFPPPPPPPPPSDEEQMPLADFDDI 378
+ DL PP P P+ P P P P ++ P+
Sbjct: 42 AQPISVTMVTPADLEPPQAVQPPPEPVVEPE--------PEPEPIPEPPKEAPVVIEKPK 93

Query: 379 ELPPPPPMDFETGPETQVSPIEEVNSGATPTD 410
P P P + + Q P +V +
Sbjct: 94 PKPKPKP---KPVKKVQEQPKRDVKPVESRPA 122



Score = 28.8 bits (64), Expect = 0.042
Identities = 22/93 (23%), Positives = 33/93 (35%), Gaps = 7/93 (7%)

Query: 293 PPPPPPAELKAGAKARREEEAAVTVLTNAMEFADDNAEDLPPPPPPSNPDLQQPLPDFND 352
P P P + A E AV + + E +P PP + +++P P
Sbjct: 39 PAPAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKP---- 94

Query: 353 SDFPPPPPPPPPSDEEQMPLADFDDIELPPPPP 385
P P P P ++ P D +E P P
Sbjct: 95 ---KPKPKPKPVKKVQEQPKRDVKPVESRPASP 124


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
PSLF89_RS23190  PF06580  34     0.001    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 34.5 bits (79), Expect = 0.001
Identities = 33/195 (16%), Positives = 68/195 (34%), Gaps = 46/195 (23%)

Query: 467 EKIIQQSLEGAEKVKNIVLSL-----KSFAHSDTDN---KEEFDLNHCIEQALTITQNEL 518
I LE K + ++ SL S +S+ +E + ++ L + +
Sbjct: 180 NNIRALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTV---VDSYLQLASIQF 236

Query: 519 KYKCKIIKNLSPLNPLLGYSSQIGQVIMNLLI-NA-AHAIKES---GTITITTQQIAGFN 573
+ + + ++P ++ Q+ +++ L+ N H I + G I + + G
Sbjct: 237 EDRLQFENQINP--AIMDV--QVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTV 292

Query: 574 KLTIEDDGYGIHKDHLAKLFDPFFTTKSVGEGTGLGLS-------ISYGIIKKHQGSINV 626
L +E+ G K+ E TG GL + YG + I +
Sbjct: 293 TLEVENTGSLA--------------LKNTKESTGTGLQNVRERLQMLYG----TEAQIKL 334

Query: 627 ESTVGQGTVFTIQLP 641
G+ + +P
Sbjct: 335 SEKQGKVNA-MVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
PSLF89_RS23200  HTHFIS  95     8e-25    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 94.5 bits (235), Expect = 8e-25
Identities = 22/119 (18%), Positives = 45/119 (37%), Gaps = 1/119 (0%)

Query: 2 PSLLLVDDEPHIIDALKRLFRREKYTLHCAYSAKEGLDILAQQHIDIILSDQRMPSMLGS 61
++L+ DD+ I L + R Y + +A +A D++++D MP
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 EFLAKAQAQSPQTLRIILSGYADTKEIINGILNNHIHQFLEKPWRANELREHLRHLINL 120
+ L + + P +++S I + +L KP+ EL + +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKG-AYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits        Score  E-value  Comments
PSLF89_RS23210  GPOSANCHOR  50     4e-08    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 50.1 bits (119), Expect = 4e-08
Identities = 51/300 (17%), Positives = 115/300 (38%), Gaps = 1/300 (0%)

Query: 716 QALQRELAEVKARVSGRLAHIEQVRARLAVVEHELSEALELFNEEQEEIKIARQRLEIAV 775
++ L +V+ R ++ + + + + +E EE+ A+++L
Sbjct: 46 RSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKND 105

Query: 776 EKMAELELQRQALESGRVEASRTTQEARQQLESLKSQYQQKQMQLQRVQSEQAGIKAALQ 835
+ ++E + Q LE+ + + + + A + ++ + + + + + +A ++ AL+
Sbjct: 106 KSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALE 165

Query: 836 RLEELLGRDKVRLVELEQSRIGLEEPLEEQRMLLDEQLERQLSFEDRLKTVKDQAQAHEN 895
D ++ LE + LE E L+ + + ++KT++ + A
Sbjct: 166 GAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAA 225

Query: 896 QVRTFEKQLHEQMNQVAHAREALEQGRMQAQELFIRRQSVEEQLVEAGFQLRGLL-EIYQ 954
+ EK L MN ++ + L R+ +E+ L A +I
Sbjct: 226 RKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKT 285

Query: 955 EGCDVKELETALEDIGRRIQRLGVINLAAIDEYAGQSERKVYLDAQHDDLTEALDMLEAA 1014
+ LE D+ + Q L + + E K L+A+H L E + EA+
Sbjct: 286 LEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEAS 345



Score = 43.1 bits (101), Expect = 7e-06
Identities = 34/219 (15%), Positives = 68/219 (31%)

Query: 168 AGISKYKERRKETERRIRHTRENLERLGDIREELGKQLSRLHQQAQAAEKYQNFKKEERE 227
A + K E + LE L + + A + K + E
Sbjct: 123 ADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLE 182

Query: 228 VKGQLYIQRWKSLQTQHGQEQKKIQEHEVIVEKQRAGQQHIDASLEKERLALSEASEKLH 287
+ R L+ ++ A + + A AL A
Sbjct: 183 AEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFST 242

Query: 288 ACQADYYQAGNEVSRLEQQIEHATTRIRETGQEMARLNTSLEKARSELAADEQQKVLLSA 347
A A E + LE + + + ++ +E AA E +K L
Sbjct: 243 ADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEH 302

Query: 348 QEEALEPETELLQNAVDEAALSAEEAEVSYRQLEKERES 386
Q + L + L+ +D + + ++ E +++LE++ +
Sbjct: 303 QSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKI 341



Score = 42.7 bits (100), Expect = 8e-06
Identities = 29/191 (15%), Positives = 73/191 (38%)

Query: 670 EKELMTLRQQELALEESICLHEEQLSQSQEVLMQVEAQVKSVQQQAQALQRELAEVKARV 729
E E L ++ LE+++ + + +EA+ +++ + L++ L
Sbjct: 147 EAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFS 206

Query: 730 SGRLAHIEQVRARLAVVEHELSEALELFNEEQEEIKIARQRLEIAVEKMAELELQRQALE 789
+ A I+ + A A + ++ + +++ + A LE ++ LE
Sbjct: 207 TADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELE 266

Query: 790 SGRVEASRTTQEARQQLESLKSQYQQKQMQLQRVQSEQAGIKAALQRLEELLGRDKVRLV 849
A + ++++L+++ + + ++ + + A Q L L +
Sbjct: 267 KALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKK 326

Query: 850 ELEQSRIGLEE 860
+LE LEE
Sbjct: 327 QLEAEHQKLEE 337



Score = 39.3 bits (91), Expect = 8e-05
Identities = 35/232 (15%), Positives = 76/232 (32%), Gaps = 2/232 (0%)

Query: 273 EKERLALSEASEKLHACQADYYQAGNEVSRLEQQIEHATTRIRETGQEMARLNTSLEKAR 332
E +A ++ L Q + E + L+ + + + L L A+
Sbjct: 39 EVSAVATRSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAK 98

Query: 333 SELAADEQQKVLLSAQEEALEPETELLQNAVDEAALSAEEAEVSYRQLEKERESLLQQVA 392
+L +++ +++ + LE L+ A++ A + + LE E+ +L + A
Sbjct: 99 EKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKA 158

Query: 393 LCRQEAEVEQTRIRHMEEQGQRLQQRLERLRAETH--NSDLISLEVGLEDVQGQQRELEE 450
+ E + + L+ L A L + + LE
Sbjct: 159 DLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEA 218

Query: 451 KEQELQLGAEEQQQRLIQQRQLIEQQRKGLEQQRGELHPLKGRLASLEALQQ 502
++ L + ++ L ++ E L+ R A LE +
Sbjct: 219 EKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALE 270



Score = 37.0 bits (85), Expect = 5e-04
Identities = 36/207 (17%), Positives = 84/207 (40%)

Query: 670 EKELMTLRQQELALEESICLHEEQLSQSQEVLMQVEAQVKSVQQQAQALQRELAEVKARV 729
++ TL ++ ALE E+ L + A++K+++ + AL+ E A+++ +
Sbjct: 245 SAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQS 304

Query: 730 SGRLAHIEQVRARLAVVEHELSEALELFNEEQEEIKIARQRLEIAVEKMAELELQRQALE 789
A+ + +R L + + +E+ KI+ + + ++ LE
Sbjct: 305 QVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLE 364

Query: 790 SGRVEASRTTQEARQQLESLKSQYQQKQMQLQRVQSEQAGIKAALQRLEELLGRDKVRLV 849
+ + + + +SL+ + ++V+ + L LE+L +
Sbjct: 365 AEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKK 424

Query: 850 ELEQSRIGLEEPLEEQRMLLDEQLERQ 876
E+ + L+ LE + L E+L +Q
Sbjct: 425 LTEKEKAELQAKLEAEAKALKEKLAKQ 451


60  PSLF89_RS24750  PSLF89_RS24765  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
PSLF89_RS24750       0       15  -1.088075  ABC transporter ATP-binding protein
PSLF89_RS24755      -1       18  -1.006877  type III PLP-dependent enzyme
PSLF89_RS37670      -2       17  -2.552136  IucA/IucC family siderophore biosynthesis
PSLF89_RS24760      -1       17  -3.366613  MFS transporter
PSLF89_RS24765       0       18  -4.474031  IucA/IucC family siderophore biosynthesis
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
PSLF89_RS24770  PF05272  36     2e-04    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 35.8 bits (82), Expect = 2e-04
Identities = 22/92 (23%), Positives = 32/92 (34%), Gaps = 19/92 (20%)

Query: 47 LIGPNGSGKSTLLKLLTGLI----TPD------------QGQIYLNQSELHSLKRKEIAK 90
L G G GKSTL+ L GL T G + SE+ + +R +
Sbjct: 601 LEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELSEMTAFRRADAEA 660

Query: 91 HIAFLPQRSAIPDQFTVIDLILAGRYPHQGLF 122
AF R D++ +P Q +
Sbjct: 661 VKAFFSSRK---DRYRGAYGRYVQDHPRQVVI 689


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
PSLF89_RS24780  PF04183  211    1e-62    IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 211 bits (538), Expect = 1e-62
Identities = 73/461 (15%), Positives = 151/461 (32%), Gaps = 72/461 (15%)

Query: 73 YQLILTLPESQIDQKKIVIAIQKPSLTYHFS---------YISSPIMTSKQQPLGK---L 120
Y+ + D+ I P + F +I + + +P+ L
Sbjct: 23 YEQVFHAESQGDDR----YCINLPGAQWRFIAERGIWGWLWIDAQTLRCADEPVLAQTLL 78

Query: 121 LDFSDLALIITNVLAKHHRSSVNLEFIQQAMQSCEIIEYFLKQSPHSNQQALNFIQSEQS 180
+ + + +A+H ++ + + + + S+ LN Q
Sbjct: 79 MQLKQVLSMSDATVAEH------MQDLYATLLGDLQLLKARRGLSASDLINLNA-DRLQC 131

Query: 181 LIFGHEFHPTPKARQGFTEKDIKRYSPELSEKFQLYYFKINKNQLKQYSKNNKLPPVIIE 240
L+ GH K R+G+ ++ ++RY+PE + F+L++ + + + N ++
Sbjct: 132 LLSGHPKFVFNKGRRGWGKEALERYAPEYANTFRLHWLAVKREHMIWRCDNEMDIHQLLT 191

Query: 241 E--------------------QDHVLYPTHPWQAHYLLSQQETKQALIDNNIQPIGLQGD 280
+ + P HPWQ ++ + + +G GD
Sbjct: 192 AAMDPQEFARFSQVWQENGLDHNWLPLPVHPWQWQQKIATDF-IADFAEGRMVSLGEFGD 250

Query: 281 SFSATSSVRTLFQENHPYFY--KFSLNVRLTNCIRKNSVAELKTAVELTHILNQ-YTQEV 337
+ A S+RTL + K L + T+C R + + L Q + +
Sbjct: 251 QWLAQQSLRTLTNASRRGGLDIKLPLTIYNTSCYRGIPGRYIAAGPLASRWLQQVFATDA 310

Query: 338 SKNHPNVTLLNESYAFSLKLANLAYNPTLNKKITEGFGFILRDNPLFSNHSNNLNNLLSD 397
+ +L E A + A + E G I R+NP
Sbjct: 311 TLVQSGAVILGEPAAGYVSHEGYAALARAPYRYQEMLGVIWRENPC-------------- 356

Query: 398 HNHFLEPLSEPLLAGGLFSSQPDQHSWIENILIQLARHEKFPYETIAVRWFNRYISLLVP 457
+L+P P+L L + + + A W + ++V
Sbjct: 357 --RWLKPDESPVLMATLMECDENNQPLAGAYIDR--------SGLDAETWLTQLFRVVVV 406

Query: 458 AILDYYLYHGITFEPHLQNVLIQLDHQYYPSHIYLRDLEGT 498
+ +G+ H QN+ + + + P + L+D +G
Sbjct: 407 PLYHLLCRYGVALIAHGQNITLAMK-EGVPQRVLLKDFQGD 446


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
PSLF89_RS24785  TCRTETA  60     5e-12    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 59.8 bits (145), Expect = 5e-12
Identities = 61/330 (18%), Positives = 134/330 (40%), Gaps = 24/330 (7%)

Query: 44 GFLYVLPTLLTAIASPFWGKISDKINKKSALLRAQLGLSISFLIVAFSSGYLSLFILSLC 103
G L L L+ +P G +SD+ ++ LL + G ++ + I+A + L+I
Sbjct: 46 GILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYI-GRI 104

Query: 104 LQGLLGGTLAAANAYLATTSHRQQLSQLLNLTQFSARAAFLIAPIIIGFLINLFSPLSVY 163
+ G+ G T A A AY+A + + ++ + P++ G + FSP + +
Sbjct: 105 VAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGG-FSPHAPF 163

Query: 164 FLLALITFISAIIIYFYVPKDKDKDKNKNYDHDKITPNSKPQSIDAAINILPYYCLLAAS 223
F A + ++ + F +P+ K + + + + P + + + L+A
Sbjct: 164 FAAAALNGLNFLTGCFLLPESH-KGERRPLRREALNPLASFRWARG---MTVVAALMAVF 219

Query: 224 FVFNFSTVISFPYFITLLQAHFNVHSGLILGLL--FGLPHAVYLISIFSLQKYRQQPSQQ 281
F+ + ++ + F+ + I L FG+ H++ I + ++
Sbjct: 220 FIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITG--PVAARLGER 277

Query: 282 PWIFTG------ALILLAFSLYWQCVTTGFFTLIILRIVMGAAITLGFISLNRMIATLKL 335
+ G ILLAF T G+ I+ ++ I + +L M++
Sbjct: 278 RALMLGMIADGTGYILLAF------ATRGWMAFPIMVLLASGGIGMP--ALQAMLSRQVD 329

Query: 336 QQQEGKVFGWLDSISKWAGVCAGLIAGFSY 365
++++G++ G L +++ + L+ Y
Sbjct: 330 EERQGQLQGSLAALTSLTSIVGPLLFTAIY 359


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
PSLF89_RS24790  PF04183  121    3e-31    IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 121 bits (305), Expect = 3e-31
Identities = 61/310 (19%), Positives = 104/310 (33%), Gaps = 36/310 (11%)

Query: 168 LEYDHIAAFLD-HPLYPTARAKLGFNPNDLYNYTTEFRAEFKLNWIAIPKSLSTLSGTLP 226
L D + L HP + + + G+ L Y E+ F+L+W+A+ +
Sbjct: 124 LNADRLQCLLSGHPKFVFNKGRRGWGKEALERYAPEYANTFRLHWLAVKREHMIWRCDNE 183

Query: 227 I-------------FWPSFSSVGLNPTLQQTHTLLPVHPFLI-HRLQDLLDEQGIKLKII 272
+ + FS V L LPVHP+ ++ + +++
Sbjct: 184 MDIHQLLTAAMDPQEFARFSQVWQENGLDHNWLPLPVHPWQWQQKIATDFIADFAEGRMV 243

Query: 273 KAPVSFLTVNPTLSIRSL-SIKNYSHFHLKLPLDIRTLSAKNIRTIKASTINDGHQVQSL 331
S+R+L + +KLPL I S R I I G
Sbjct: 244 SLGEFGDQWLAQQSLRTLTNASRRGGLDIKLPLTIYNTSC--YRGIPGRYIAAGPLASRW 301

Query: 332 LESIRLQDPELKENIFLTTEHTGMHINSHP--------------MLAFILRQYPSQL--N 375
L+ + D L ++ + SH ML I R+ P +
Sbjct: 302 LQQVFATDATLVQSGAVILGEPAAGYVSHEGYAALARAPYRYQEMLGVIWRENPCRWLKP 361

Query: 376 NHWIIPIAALCAK-NNGQLIIQHLINDHFNKDTIQFIKNYFDLTIHTHLNLWLIYGITLE 434
+ + +A L N Q + I D D ++ F + + +L YG+ L
Sbjct: 362 DESPVLMATLMECDENNQPLAGAYI-DRSGLDAETWLTQLFRVVVVPLYHLLCRYGVALI 420

Query: 435 ANQQNSLLII 444
A+ QN L +
Sbjct: 421 AHGQNITLAM 430


61  PSLF89_RS25330  PSLF89_RS25385  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
PSLF89_RS25330  -1      12       -0.858876  flagellar motor switch protein FliN
PSLF89_RS25335  -1      12       -1.223307  flagellar biosynthetic protein FliO
PSLF89_RS25340  -1      12       -1.797068  flagellar type III secretion system pore protein
PSLF89_RS25345  -1      23       -3.061147  flagellar biosynthesis protein FliQ
PSLF89_RS25350  -1      23       -3.410058  flagellar biosynthetic protein FliR
PSLF89_RS25355  0       25       -3.092885  flagellar biosynthesis protein FlhB
PSLF89_RS25360  0       23       -3.633816  flagellar biosynthesis protein FlhA
PSLF89_RS25365  0       21       -2.079858  flagellar biosynthesis protein FlhF
PSLF89_RS25370  -1      21       -1.218779  MinD/ParA family protein
PSLF89_RS25375  -1      20       -0.619967  RNA polymerase sigma factor FliA
PSLF89_RS25385  -2      20       0.806475   chemotaxis response regulator CheY
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS25360  FLGMOTORFLIN  101    2e-30    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 101 bits (252), Expect = 2e-30
Identities = 50/118 (42%), Positives = 77/118 (65%), Gaps = 1/118 (0%)

Query: 33 WGAALEESGDAEGKDELETLNTGFDPVALASEEYPDLEKILDLPVTISMQVGGANISIRN 92
W AL E K + + + S D++ I+D+PV +++++G ++I+
Sbjct: 19 WADALNEQKATTTKSAADAVFQQLGGGDV-SGAMQDIDLIMDIPVKLTVELGRTRMTIKE 77

Query: 93 LLQLNQGSVVELDRYAGEPLDVRVNGTLIAHGEVVVVNEKYGIRLTDVISAAERLQKL 150
LL+L QGSVV LD AGEPLD+ +NG LIA GEVVVV +KYG+R+TD+I+ +ER+++L
Sbjct: 78 LLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGVRITDIITPSERMRRL 135


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS25370  FLGBIOSNFLIP  242    4e-83    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP

signature.
Length = 245

Score = 242 bits (620), Expect = 4e-83
Identities = 134/243 (55%), Positives = 178/243 (73%), Gaps = 1/243 (0%)

Query: 1 MAIFLIFIVFLGCCITTSAAPTIPIVTATTEPNGSETYSVGLQILLLMTALTLLPAFLLM 60
M L L IT A +P +T+ P G +++S+ +Q L+ +T+LT +PA LLM
Sbjct: 1 MRRLLSVAPVLLWLITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLM 60

Query: 61 MTSFTRILIVLGILRQALGMPTVPTNQILIGLSLFLTIFIMSPVWMKINQQAIQPYFADE 120
MTSFTRI+IV G+LR ALG P+ P NQ+L+GL+LFLT FIMSPV KI A QP+ ++
Sbjct: 61 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEK 120

Query: 121 INVQTALEKAQKPIRNFMIEQTREADLKLFVEMSGTQANQ-LSEIPLTIIMPAFITSELK 179
I++Q ALEK +P+R FM+ QTREADL LF ++ T Q +P+ I++PA++TSELK
Sbjct: 121 ISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELK 180

Query: 180 TAFQIGFMIFLPFLVIDLVVASVLMGMGMMMLSPLIISLPFKIMLFVLVDGWMLILGTLA 239
TAFQIGF IF+PFL+IDLV+ASVLM +GMMM+ P I+LPFK+MLFVLVDGW L++G+LA
Sbjct: 181 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA 240

Query: 240 SSF 242
SF
Sbjct: 241 QSF 243


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS25375  TYPE3IMQPROT  52     1e-12    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein

family signature.
Length = 86

Score = 51.7 bits (124), Expect = 1e-12
Identities = 22/76 (28%), Positives = 44/76 (57%)

Query: 7 VDLISRAVYVLIIMSSILIVPGLVVGLIIAVFQAATQINEQTLSFVPRLLATFLALVFAG 66
V ++A+Y+++I+S + ++GL++ +FQ TQ+ EQTL F +LL L L
Sbjct: 5 VFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLLS 64

Query: 67 PLLLKIIISFTEELIK 82
++++S+ ++I
Sbjct: 65 GWYGEVLLSYGRQVIF 80


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS25380  TYPE3IMRPROT  116    1e-33    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein

family signature.
Length = 261

Score = 116 bits (292), Expect = 1e-33
Identities = 92/249 (36%), Positives = 149/249 (59%), Gaps = 2/249 (0%)

Query: 1 MLELTTADIHAWAAGYFWPFIRIAAMLMTIAVIGSQYVAKHVRLVLAVLITIVIVPVIPE 60
ML++T+ +W YFWP +R+ A++ T ++ + V K V+L LA++IT I P +P
Sbjct: 1 MLQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLP- 59

Query: 61 VPKLDILSIDSVLITVQQVLIGIFIGFMTQLLFQIFVIGGQIIAMQMGLGFAALVDPQNG 120
+ + S ++ + VQQ+LIGI +GF Q F G+II +QMGL FA VDP +
Sbjct: 60 ANDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASH 119

Query: 121 FVVTGVSQVYFIMVALLFFTMNGHLVFIQMVVESFTILPITADSVLSMQFIWLMLEKFSW 180
+ ++++ ++ LLF T NGHL I ++V++F LPI + + S F+ L + S
Sbjct: 120 LNMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLAL-TKAGSL 178

Query: 181 VFAKAVLIALPAILSLLLINFAFGVMSRAAPQLNVFSIGFPTTLLMGAVVIAFVVFIINE 240
+F +++ALP I LL +N A G+++R APQL++F IGFP TL +G ++A ++ +I
Sbjct: 179 IFLNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAP 238

Query: 241 HFQSYFVEI 249
+ F EI
Sbjct: 239 FCEHLFSEI 247


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS25385  TYPE3IMSPROT  308    e-105    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein

family signature.
Length = 354

Score = 308 bits (790), Expect = e-105
Identities = 108/348 (31%), Positives = 185/348 (53%), Gaps = 6/348 (1%)

Query: 8 AQEKTEDPSQKRIDDARKRGQVPRSKELNTFAIVVFGVILLIAFGQYMGEYFFKIIRICF 67
+ EKTE P+ K+I DARK+GQV +SKE+ + A++V +L+ + +Y+F+
Sbjct: 2 SGEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMG----LSDYYFEHFSKLM 57

Query: 68 TLTPTELLQDD--LIMTKVKDVFYLASYLLLPFLSLILLVALIAPILMGGLNFSSESLTP 125
+ + + V +V YL P L++ L+A+ + ++ G S E++ P
Sbjct: 58 LIPAEQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKP 117

Query: 126 KIDRMDPIKGLKRMFSIKSIIELIKAIFKFLLVMAMAIFLMWFFSEKFLHLAYEGDKAAL 185
I +++PI+G KR+FSIKS++E +K+I K +L+ + ++ L L G +
Sbjct: 118 DIKKINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECIT 177

Query: 186 LHSLTLIGWCALGLGMTLLVVVMIDVPFQFWDYKKQLKMSHKEIKDERKETEGQPEVKQK 245
++ + + +V+ + D F+++ Y K+LKMS EIK E KE EG PE+K K
Sbjct: 178 PLLGQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSK 237

Query: 246 IRRLQMEMSQKRMMEGVKTADVVITNPTHFAVALSYEENAAGAPLLVAKGGDFIAEQIRK 305
R+ E+ + M E VK + VV+ NPTH A+ + Y+ PL+ K D + +RK
Sbjct: 238 RRQFHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRK 297

Query: 306 VAEHSDVSIVTLPALARSIYYTTDIGNEIPEGLYLAVAQVLAYVFQLE 353
+AE V I+ LAR++Y+ + + IP A A+VL ++ +
Sbjct: 298 IAEEEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQN 345


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
PSLF89_RS25410  HTHFIS  92     4e-25    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 92.2 bits (229), Expect = 4e-25
Identities = 31/105 (29%), Positives = 51/105 (48%), Gaps = 3/105 (2%)

Query: 2 KILVVDDFSTMRRIVKNLLRDLGFTNIAEADDGATAWPLLQKSDFDFLVTDWNMPGMTGI 61
ILV DD + +R ++ L G+ + AT W + D D +VTD MP
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 DLLKNVRAHDKLKSMPVLMVTAEQKREQIVEAAQAGVNGYIVKPF 106
DLL ++ +PVL+++A+ ++A++ G Y+ KPF
Sbjct: 64 DLLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF 106


62  PSLF89_RS26815  PSLF89_RS36360  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
PSLF89_RS26815  0       17       -1.167516  efflux RND transporter permease subunit
PSLF89_RS26825  0       17       -1.031465  efflux RND transporter periplasmic adaptor
PSLF89_RS26830  0       16       -2.051449  outer membrane beta-barrel protein
PSLF89_RS26835  1       19       -2.673311  outer membrane beta-barrel protein
PSLF89_RS26840  2       17       -3.819160  outer membrane beta-barrel protein
PSLF89_RS26845  1       17       -4.525847  phosphatase
PSLF89_RS36360  2       17       -6.830034  helix-turn-helix domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS26870  ACRIFLAVINRP  742    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 742 bits (1918), Expect = 0.0
Identities = 306/1041 (29%), Positives = 536/1041 (51%), Gaps = 41/1041 (3%)

Query: 5 DLFIKRPVLACVLSLVIFLTGLIAYNKLAVRQYPAVSANVVTISTSYSGASASLVEAFVT 64
+ FI+RP+ A VL++++ + G +A +L V QYP ++ V++S +Y GA A V+ VT
Sbjct: 3 NFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVT 62

Query: 65 TPLEQALQGISGVDYVSSVS-SAGNSRITVSLNLNADLYQALIEINNDLTPVLKKLPSGV 123
+EQ + GI + Y+SS S SAG+ IT++ D A +++ N L LP V
Sbjct: 63 QVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEV 122

Query: 124 DTPVIKEGDSNSTPMMIISFSSSK--LTPEAINDYLQRVVQPQLANLSGVAQANILGPRV 181
I S+S+ +M+ F S T + I+DY+ V+ L+ L+GV + G +
Sbjct: 123 QQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQ- 181

Query: 182 YAMRLWLNPAKMAALGVTTEDVSTALAANDLFAQAGSIST------NSQVININIESSLN 235
YAMR+WL+ + +T DV L + AG + +I ++
Sbjct: 182 YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241

Query: 236 SAAQFNNLVIKSQQD-QYVRLSDIGYAELGAQTKASSLYVNGKPAVGVGIIAKSDANPLM 294
+ +F + ++ D VRL D+ ELG + +NGKPA G+GI + AN L
Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301

Query: 295 VANTVKNEVAEIQKQLPQGLSVRIARDSSSYIQDSLSEVSHTVMVAIVIVIAVVLLFLGS 354
A +K ++AE+Q PQG+ V D++ ++Q S+ EV T+ AI++V V+ LFL +
Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361

Query: 355 FRALMIPLVTIPVSLVGTFALMYLLGYSINVLTLLAFVLAIGLVVDDAIVVLENVHRYI- 413
RA +IP + +PV L+GTFA++ GYSIN LT+ VLAIGL+VDDAIVV+ENV R +
Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421

Query: 414 EQGFTPFKAALKGAREIRFAIIAMTLTLAAVYAPIGFSTGITGSLFREFAFSLAASVILS 473
E P +A K +I+ A++ + + L+AV+ P+ F G TG+++R+F+ ++ +++ LS
Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481

Query: 474 GIVALTLSPMMCARMMRA-----HHQPAGWQLKIEVCLTRLRDYYSLLLNKVFNNKVNVL 528
+VAL L+P +CA +++ H G+ ++Y+ + K+ + L
Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541

Query: 529 IIAGTVIVCGGIYIIPLVKNSTLAPKEDQNTVIGIVQGSMAASVVNTEAYTSKLRE--LA 586
+I ++ G+ ++ L S+ P+EDQ + ++Q A+ T+ ++ + L
Sbjct: 542 LIY--ALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 587 SKVSGVENVTVING---AGGDQSNAMLMVQLASKSQRS---LSAERIAGQLNKAAARIPG 640
++ + VE+V +NG +G Q+ M V L +R+ SAE + + +I
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659

Query: 641 AKAMFVLPPSLPTSHD-----NYDIEFVIKTNGDYAELETHVNKILQAIHKN-AGFGRVM 694
FV+P ++P + +D E + + + L N++L ++ A V
Sbjct: 660 G---FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVR 716

Query: 695 TDLQFNKPEYNVTIQRDMAARLGVSVSGIASVLTNALAEPQSSEFVNNGLSYYVIPQVIA 754
+ + ++ + + ++ A LGVS+S I ++ AL ++F++ G + Q A
Sbjct: 717 PNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADA 776

Query: 755 SGQGSITGLNQLYVTAESGAKIPLRDLIKVKMTVNPSSLNHFQSQRSVTIQATLSHRYST 814
+ +++LYV + +G +P L + S+ IQ + S+
Sbjct: 777 KFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSS 836

Query: 815 EQALNFLEMIAKKDLTAQMSYATSGNTRQYLEESSSVYFIFIAALLFIYLSLSAQFESFI 874
A+ +E +A K L A + Y +G + Q + + + + ++L L+A +ES+
Sbjct: 837 GDAMALMENLASK-LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWS 895

Query: 875 DPLIILMSVPFSIAGALGTLFLIGGSLNIYTEIGLVTLIGLIAKHGILIVEFANQSQ-KS 933
P+ +++ VP I G L L ++Y +GL+T IGL AK+ ILIVEFA K
Sbjct: 896 IPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKE 955

Query: 934 GESLLVAIKQSARVRFRPILMTTAAMVLGAVPLVFASGAGSEARYQLGWVIVGGMMIGTM 993
G+ ++ A + R+R RPILMT+ A +LG +PL ++GAGS A+ +G ++GGM+ T+
Sbjct: 956 GKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATL 1015

Query: 994 MTLLVLPLMYYLVNTAKTVFK 1014
+ + +P+ + ++ + FK
Sbjct: 1016 LAIFFVPVFFVVI---RRCFK 1033


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
PSLF89_RS26875  RTXTOXIND  51     3e-09    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 51.0 bits (122), Expect = 3e-09
Identities = 15/88 (17%), Positives = 39/88 (44%), Gaps = 2/88 (2%)

Query: 69 TLKAQVAGTVTRVAFQSGDKVKQGQLLVSLDSTTAKGQLDKAEADYHLSLLTYQRDQSLF 128
+K V + + G+ V++G +L+ L + A+ K ++ + L R Q L
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILS 157

Query: 129 KNHVLSEQELDQVKFTVKANWALLEQAQ 156
++ + +L ++K + + + + +
Sbjct: 158 RS--IELNKLPELKLPDEPYFQNVSEEE 183



Score = 40.2 bits (94), Expect = 9e-06
Identities = 37/190 (19%), Positives = 71/190 (37%), Gaps = 17/190 (8%)

Query: 99 DSTTAKGQLDKAEADYHLSLLTYQRDQSLFKNHVLSEQELDQVKFTVKANWALLEQAQSA 158
+ K QL++ E++ + YQ LFKN +L + L Q + L + +
Sbjct: 267 ELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDK--LRQTTDNIGLLTLELAKNEER 324

Query: 159 YNKTQVKAPFNGNI-GISDITVGSYLDSGDTIVSLQNLDH-LWVDFNVSSQDSLQVKIDE 216
+ ++AP + + + T G + + +T++ + D L V V ++D + + +
Sbjct: 325 QQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQ 384

Query: 217 IVDITTQAEPMQIA---SGKVVAIEPQINSDTGT----LTLRAQINNT------HYQLLP 263
I +A P GKV I D + + N + L
Sbjct: 385 NAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSS 444

Query: 264 GQLVSVNLYT 273
G V+ + T
Sbjct: 445 GMAVTAEIKT 454


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits        Score  E-value  Comments
PSLF89_RS26880  OMPADOMAIN  48     5e-09    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 48.4 bits (115), Expect = 5e-09
Identities = 42/222 (18%), Positives = 72/222 (32%), Gaps = 48/222 (21%)

Query: 7 LTVMTTLLISASALAAKPGA---YIGLNLGYGGMDTAQLTKNSFRNEASSSASLRGFAGR 63
+ + L A+ A P Y G LG+ N+ + F G
Sbjct: 6 IAIAVALAGFATVAQAAPKDNTWYTGAKLGWSQYHDTGF-INNNGPTHENQLGAGAFGG- 63

Query: 64 INAGYLWSQSSLNYGIELGYATYANNQYSALGKNGEKYNFTYKGYNIDLLGIAQYNFNPN 123
Q + G E+GY Y +NG YK + L Y +
Sbjct: 64 -------YQVNPYVGFEMGYDWLGRMPYKGSVENG-----AYKAQGVQLTAKLGYPITDD 111

Query: 124 WNIFAKVGIAYASQTTSGS-------SEFSHMFAN--KGRLLPKVALGLGYEFTNGIGLN 174
+I+ ++G T + + S +FA + + P++A L Y++TN IG
Sbjct: 112 LDIYTRLGGMVWRADTKSNVYGKNHDTGVSPVFAGGVEYAITPEIATRLEYQWTNNIG-- 169

Query: 175 LTASHIFGNQSTFDGNNNQTIKNNLNKVSPVDMVTVGISYNF 216
+ + M+++G+SY F
Sbjct: 170 --------------------DAHTIGTRPDNGMLSLGVSYRF 191


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits        Score  E-value  Comments
PSLF89_RS26885  OMPADOMAIN  49     4e-09    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 48.8 bits (116), Expect = 4e-09
Identities = 45/216 (20%), Positives = 70/216 (32%), Gaps = 31/216 (14%)

Query: 5 IKLTAIT---ALLISASTLATKPGA---YIGLNLGYGGMDTPNLDLTKINNIANDSHSTR 58
+K TAI AL A+ P Y G LG+ D INN +
Sbjct: 1 MKKTAIAIAVALAGFATVAQAAPKDNTWYTGAKLGWSQYH----DTGFINNNGPTHENQ- 55

Query: 59 GLAGSINAGYLWNKGALNYGFELGYSTYANNQYTAVSVGKKYNFTYSGSSLDLLGVVQYN 118
L GY N GFE+GY + + + G L Y
Sbjct: 56 -LGAGAFGGYQVNPY---VGFEMGYDWLG--RMPYKGSVENGAYKAQGVQLTAKL--GYP 107

Query: 119 INPNWNIFGKAGLSYVSQKTTGDGILSLAADSKSKMRPKFALGAGYGFDNGIGLNVMASH 178
I + +I+ + G T + + + + P FA G Y +
Sbjct: 108 ITDDLDIYTRLGGMVWRADTKSNVYGK---NHDTGVSPVFAGGVEY--------AITPEI 156

Query: 179 TFGTKPQVSNNIISIKDDVNKVAPIDMITVGITYNF 214
+ Q +NNI + M+++G++Y F
Sbjct: 157 ATRLEYQWTNNIGD-AHTIGTRPDNGMLSLGVSYRF 191


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS26890  OUTRMMBRANEA  40     1e-06    Outer membrane protein A signature.
		>OUTRMMBRANEA#Outer membrane protein A signature.

Length = 346

Score = 40.3 bits (94), Expect = 1e-06
Identities = 40/176 (22%), Positives = 54/176 (30%), Gaps = 27/176 (15%)

Query: 5 IKLTAIT---ALLISASALAAKPGA---YIGLNLGYGGMDTPSVNFKNKYPGVHSYSHSS 58
+K TAI AL A+ A P Y G LG+ F N H +
Sbjct: 1 MKKTAIAIAVALAGFATVAQAAPKDNTWYTGAKLGWS--QYHDTGFINNNGPTHENQLGA 58

Query: 59 RGFAGRINAGYLWNQGSLNYGVELGYATYANSKYSVTNKDDTRTLKYSGTNIDLLGVIQY 118
F G Q + G E+GY Y K Y + L + Y
Sbjct: 59 GAFGGY--------QVNPYVGFEMGYDWLGRMPY----KGSVENGAYKAQGVQLTAKLGY 106

Query: 119 NFTPNWNIFAKAGLAYVTQKTSGSNAFKLEFESNNKVLSEVALGAGY----EFALR 170
T + +I+ + G T + K + V A G Y E A R
Sbjct: 107 PITDDLDIYTRLGGMVWRADTKSNVYGK---NHDTGVSPVFAGGVEYAITPEIATR 159


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
PSLF89_RS26900  STREPKINASE  27     0.008    Streptococcus streptokinase protein signature.
		>STREPKINASE#Streptococcus streptokinase protein signature.

Length = 440

Score = 27.4 bits (60), Expect = 0.008
Identities = 18/50 (36%), Positives = 28/50 (56%), Gaps = 2/50 (4%)

Query: 41 HQRLSTHTSPDVISQELIR-EHNIQVSESTI-YRYIYDDRERGGELYKNL 88
H +L T DV + EL++ E + SE + +R +YD R++ LY NL
Sbjct: 317 HLKLFTIKYVDVDTNELLKSEQLLTASERNLDFRDLYDPRDKAKLLYNNL 366


63  PSLF89_RS36395  PSLF89_RS36990  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
PSLF89_RS36395  -1      24       -0.974066  flagellar hook-associated protein FlgL
PSLF89_RS27330  0       21       -2.057004  flagellar hook-associated protein FlgK
PSLF89_RS27335  -1      20       -2.601648  rod-binding protein
PSLF89_RS27340  -1      18       -2.789502  flagellar basal body P-ring protein FlgI
PSLF89_RS27345  0       18       -1.668846  flagellar basal body L-ring protein FlgH
PSLF89_RS36400  1       19       -2.919257  flagellar basal-body rod protein FlgG
PSLF89_RS27350  -1      13       -0.867126  flagellar basal body rod protein FlgF
PSLF89_RS27360  0       13       1.429813   flagellar hook-basal body complex protein
PSLF89_RS27365  0       13       1.816180   flagellar hook assembly protein FlgD
PSLF89_RS36990  0       15       0.633143   flagellar basal body rod protein FlgC
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
PSLF89_RS27420  FLAGELLIN  42     3e-06    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 42.3 bits (99), Expect = 3e-06
Identities = 34/139 (24%), Positives = 62/139 (44%)

Query: 3 TRVSTSSIFNTTVENMAKRQEELAKVQDQIASNKKILTAADDPIDALRTLALKNNIAQKK 62
++T+S+ T N+ K Q L+ ++++S +I +A DD +NI
Sbjct: 2 QVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLT 61

Query: 63 QFSENMDFSRSRLELEEATLTSLAGLFREVKVRAIEAGNGGYATSDVREVGRSIASLLES 122
Q S N + S + E L + + V+ +++A NG + SD++ + I LE
Sbjct: 62 QASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEE 121

Query: 123 IVQQANSRDRNGEYLFSGS 141
I + +N NG + S
Sbjct: 122 IDRVSNQTQFNGVKVLSQD 140



Score = 32.7 bits (74), Expect = 0.003
Identities = 19/78 (24%), Positives = 32/78 (41%)

Query: 369 NITLNENIQRMIASIDEAAGTLLSVTTEVGLRQSNIHLQQEVSSHIQLSQNKALGDLSDL 428
++ +ASID A + +V + +G Q+ + + N A + D
Sbjct: 410 AAAAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDA 469

Query: 429 DFAKAVSELSILQTTLQA 446
D+A VS +S Q QA
Sbjct: 470 DYATEVSNMSKAQILQQA 487


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits        Score  E-value  Comments
PSLF89_RS27425  FLGHOOKAP1  210    8e-61    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 210 bits (537), Expect = 8e-61
Identities = 135/528 (25%), Positives = 252/528 (47%), Gaps = 36/528 (6%)

Query: 4 LGISVAGLNAARTQLDTTSHNIANASTPGYTRQRVLQSSVLGDTASGQYVGAGVQIDAIQ 63
+ +++GLNAA+ L+T S+NI++ + GYTRQ + + +G +VG GV + +Q
Sbjct: 4 INNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGVQ 63

Query: 64 RMADRFAVEQLRDSTTAFAESDIFHSISSRVDNLASNDATSLSTSLSGYFETLNEGVNEP 123
R D F QLR + T + + S++DN+ S +SL+T + +F +L V+
Sbjct: 64 REYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLVSNA 123

Query: 124 TSIALRQSILGEANNLTTRFHTIERELSQLRVEINRDLDDAALNLTQLGKRVAIINDQIS 183
A RQ+++G++ L +F T ++ L ++N + + + K++A +NDQIS
Sbjct: 124 EDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLNDQIS 183

Query: 184 RAVGSAGGAIPNDLLDDRDRALKEIAEFANISVFEHTDGSVDVSIGSGQSLVAGTNSLTI 243
R G GA PN+LLD RD+ + E+ + + V G+ ++++ +G SLV G+ + +
Sbjct: 184 RLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTARQL 243

Query: 244 VAEPNAEDASKSNLFVKDLNKNIRFDITNEIQSGRVKGLIDVRDNVIDQSLRQLGLVAVG 303
A P++ D S++ + D + +G + G++ R +DQ+ LG +A+
Sbjct: 244 AAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQLALA 303

Query: 304 LIQTTNEQHKLGMDFNSALGGDFFNDLNKSVLIAQRYLPNSANAGNAALTVELGEFAADT 363
+ N QHK G D N G DFF + K L N+ N G+ A+ +
Sbjct: 304 FAEAFNTQHKAGFDANGDAGEDFF-AIGKP-----AVLQNTKNKGDVAIGATV------- 350

Query: 364 IALPNKPATGIKDLEAEEYNLIITGTSYELIRQSDQASMASGAIADFPIQINGMRISLSS 423
T + A +Y + +++ R + + A+ + +G+ ++ +
Sbjct: 351 --------TDASAVLATDYKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELT-FT 401

Query: 424 GGFADQDSYVIRPLQGLARGIDVQITDPKKLALAW--PVAASENEANLGSGKLTVSDMVS 481
G A DS+ ++P+ +DV ITD K+A+A S+N N + S+ +
Sbjct: 402 GTPAVNDSFTLKPVSDAIVNMDVLITDEAKIAMASEEDAGDSDN-RNGQALLDLQSNSKT 460

Query: 482 TNQPINFSDL-----------STAANTVTSFQPNLVSQLTDLKTPLAG 518
+F+D + T ++ Q N+V+QL++ + ++G
Sbjct: 461 VGGAKSFNDAYASLVSDIGNKTATLKTSSATQGNVVTQLSNQQQSISG 508



Score = 70.8 bits (173), Expect = 1e-14
Identities = 34/113 (30%), Positives = 53/113 (46%), Gaps = 5/113 (4%)

Query: 880 YNTNGFGDNSNAIKLAKIEQQAVLTADTSGNPTSSISQGYESLVASVASETETSIIDLNA 939
G DN N L ++ + + S + Y SLV+ + ++T T
Sbjct: 437 EEDAGDSDNRNGQALLDLQ-----SNSKTVGGAKSFNDAYASLVSDIGNKTATLKTSSAT 491

Query: 940 SETLKRQAQQKRDSIMGVNLDEEAANLIQFQQAYQASARVITVAQTLFSSLLQ 992
+ Q ++ SI GVNLDEE NL +FQQ Y A+A+V+ A +F +L+
Sbjct: 492 QGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
PSLF89_RS27430  FLGFLGJ  89     1e-22    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 88.6 bits (219), Expect = 1e-22
Identities = 53/201 (26%), Positives = 87/201 (43%), Gaps = 31/201 (15%)

Query: 10 SNAFDFSAFQKLKANVNKAGQEDK-TLRAVAEQFESIFIKMALDSMRKASKELESDLFKS 68
S A+D + +LKA KAG++ +R VA Q E +F++M L SMR A + LF S
Sbjct: 10 SAAWDAQSLNELKA---KAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPK--DGLFSS 64

Query: 69 SYQDFYQDLYDDQLSLNLANNGGIGLTDALVRYLS-QQAGSEQ----------------- 110
+ Y +YD Q++ + G+GL + +V+ ++ +Q E+
Sbjct: 65 EHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLETVVRY 124

Query: 111 ----VLNVNNTLKKEQSAQDGQTAFKQLIATLEPYLDDLSEKLGVSRKAILSHAIVETGW 166
+ + K +A L S++ GV IL+ A +E+GW
Sbjct: 125 QNQALSQLVQKAVPRNYDDSLPGDSKAFLAQLSLPAQLASQQSGVPHHLILAQAALESGW 184

Query: 167 GSTNMMKRGHLSNSNQVNLFG 187
G ++R + S NLFG
Sbjct: 185 GQ-RQIRRENGEPSY--NLFG 202


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS27435  FLGPRINGFLGI  373    e-130    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 373 bits (960), Expect = e-130
Identities = 159/367 (43%), Positives = 228/367 (62%), Gaps = 12/367 (3%)

Query: 35 LVFVFSGIFISPSITYAEQRIKDISNIASVRSNQLIGYGLVVGLNGTGD---NANFTIVS 91
LVF +P RIKDI+++ + R NQLIGYGLVVGL GTGD ++ FT S
Sbjct: 11 LVFSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQS 70

Query: 92 FKRMLSNLGIKLPPGVDPKMKNVAAVALSAELPAFAKPGQRIDVTASSLGDSKSLVGGTL 151
+ ML NLGI G KN+AAV ++A LP FA PG R+DVT SSLGD+ SL GG L
Sbjct: 71 MRAMLQNLGITTQGG-QSNAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNL 129

Query: 152 LMSPLKGADGRVYALAQGGVIVGGLGVTGKDGSKLIVNVPSVGRIPGGAIVEKQVPTPFS 211
+M+ L GADG++YA+AQG +IV G G D + L V + R+P GAI+E+++P+ F
Sbjct: 130 IMTSLSGADGQIYAVAQGALIVNGFSAQG-DAATLTQGVTTSARVPNGAIIERELPSKFK 188

Query: 212 HDDHIVFNLKSPDFTTAKWMADVINQF----LGPGSARPLDSTSVWVSAPKDPAQKVMFV 267
++V L++PDF+TA +ADV+N F G A P DS + V P+ A +
Sbjct: 189 DSVNLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPR-VADLTRLM 247

Query: 268 AVLENLKVKSAEAPARVIVNSRTGTVVISKNVRVSPAAVTHGNLIVTIEETTKVSQPGAL 327
A +ENL V + PA+V++N RTGT+VI +VR+S AV++G L V + E+ +V QP
Sbjct: 248 AEIENLTV-ETDTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQPAPF 306

Query: 328 SGGETVTVPESEINAEQQNNPMFVFSPGPTLKDIVRAVNEVGVGPGDLIEILEALQAAGA 387
S G+T P+++I A Q+ + + + GP L+ +V +N +G+ +I IL+ +++AGA
Sbjct: 307 SRGQTAVQPQTDIMAMQEGSKVAI-VEGPDLRTLVAGLNSIGLKADGIIAILQGIKSAGA 365

Query: 388 LHAELVV 394
L AELV+
Sbjct: 366 LQAELVL 372


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS27440  FLGLRINGFLGH  150    3e-47    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 150 bits (379), Expect = 3e-47
Identities = 72/225 (32%), Positives = 108/225 (48%), Gaps = 12/225 (5%)

Query: 12 LLAIMAGLLNGCSY------VVGPEPGDPRYAPIPPAVAHIPQYQGGAIYQTRYGASLYN 65
+ +++ L GC++ V G P P P A I Q Y + L+
Sbjct: 11 ISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINYGYQ---PLFE 67

Query: 66 TTLPFQVGDVLTVEFNESNKASKKADNKIEKKDELTMDGSALPAAAKSIPFLGHLVDENW 125
P +GD LT+ E+ ASK + + + +P + + L +
Sbjct: 68 DRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVP---RYLQGLFGNARADV 124

Query: 126 QVSQERKFQGKGDAKQENSLRGSITVTVSRILANGNLVIRGEKWMKLNSGREYIRLSGIV 185
+ S F GKG A N+ G++TVTV ++L NGNL + GEK + +N G E+IR SG+V
Sbjct: 125 EASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRFSGVV 184

Query: 186 RADDIDASNTIQSTKIADARIAYSGTGSFADSSRQGWLSRFFGSV 230
I SNT+ ST++ADARI Y G G ++ GWL RFF ++
Sbjct: 185 NPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNL 229


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits        Score  E-value  Comments
PSLF89_RS27445  FLGHOOKAP1  42     1e-06    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 42.3 bits (99), Expect = 1e-06
Identities = 13/63 (20%), Positives = 23/63 (36%)

Query: 197 ASGAATLGNPASDAYGSTRQGELEASNVNVVEELIGLIETQRAYEMNSKSISTADGMMQF 256
+ T + + S VN+ EE L Q+ Y N++ + TA+ +
Sbjct: 482 TATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDA 541

Query: 257 LNQ 259
L
Sbjct: 542 LIN 544



Score = 35.7 bits (82), Expect = 2e-04
Identities = 14/78 (17%), Positives = 29/78 (37%), Gaps = 14/78 (17%)

Query: 5 LWISKTGLDAQNLKLQVVSNNLANVSTTGFKKDRAVFQSLFYQNVRQAGAENAEGVRLPS 64
+ + +GL+A L SNN+++ + G+ + + N
Sbjct: 4 INNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQ---ANSTLGAGGW-------- 52

Query: 65 GLMLGRGVAVGATLKQHD 82
+G GV V +++D
Sbjct: 53 ---VGNGVYVSGVQREYD 67


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits        Score  E-value  Comments
PSLF89_RS27450  FLGHOOKAP1  28     0.034    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 28.0 bits (62), Expect = 0.034
Identities = 11/31 (35%), Positives = 16/31 (51%)

Query: 4 GIYIAMSGAKQAFTKLAMNNNNLSNASTTGF 34
I AMSG A L +NN+S+ + G+
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGY 33


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits        Score  E-value  Comments
PSLF89_RS27455  FLGHOOKAP1  41     2e-05    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 40.7 bits (95), Expect = 2e-05
Identities = 19/55 (34%), Positives = 29/55 (52%)

Query: 2 SFNIALSGLQASSQDLSVISNNIANASTIGFKKSRAEFGDVYQTSGSGSAVGSGV 56
N A+SGL A+ L+ SNNI++ + G+ + T G+G VG+GV
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGV 57



Score = 39.6 bits (92), Expect = 4e-05
Identities = 14/43 (32%), Positives = 26/43 (60%)

Query: 689 LEDSNVDLTQELVSMIIAQRNFQANAQTIRTSDQVTQTIINIR 731
S V+L +E ++ Q+ + ANAQ ++T++ + +INIR
Sbjct: 504 QSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits        Score  E-value  Comments
PSLF89_RS27465  FLGHOOKAP1  28     0.012    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 28.0 bits (62), Expect = 0.012
Identities = 18/71 (25%), Positives = 27/71 (38%), Gaps = 10/71 (14%)

Query: 5 SVFEIAGSAMMAQSIRLNTTASNLANINSVSSSIDTTYRSRQPVFAPIAASMRDEFFPNR 64
S+ A S + A LNT ++N+++ N +RQ I A
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYN-------VAGYTRQTT---IMAQANSTLGAGG 51

Query: 65 APGRGVQVLGI 75
G GV V G+
Sbjct: 52 WVGNGVYVSGV 62



Score = 27.6 bits (61), Expect = 0.018
Identities = 8/40 (20%), Positives = 15/40 (37%)

Query: 101 LPNVNPVEAMVNMISASQSYRVNVEAFNTSKQLMQQTLRL 140
+ VN E N+ Q Y N + T+ + + +
Sbjct: 506 ISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


64  PSLF89_RS32010  PSLF89_RS32090  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias   Product
PSLF89_RS32010  8       28       3.526550  50S ribosomal protein L7/L12
PSLF89_RS32015  8       26       3.305319  50S ribosomal protein L10
PSLF89_RS32025  8       26       5.816312  50S ribosomal protein L1
PSLF89_RS32030  8       27       6.087570  50S ribosomal protein L11
PSLF89_RS32035  7       29       6.239114  transcription termination/antitermination
PSLF89_RS32045  7       30       6.373915  preprotein translocase subunit SecE
PSLF89_RS32055  8       31       5.950137  ****prepilin-type N-terminal cleavage/methylation
PSLF89_RS32065  7       33       5.007727  GspE/PulE family protein
PSLF89_RS32075  8       30       4.862622  type II secretion system F family protein
PSLF89_RS32080  9       32       4.868857  IS982 family transposase
PSLF89_RS32085  8       28       5.203845  type II secretion system F family protein
PSLF89_RS32090  7       27       5.508573  A24 family peptidase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS32120  IGASERPTASE   26     0.043    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 26.2 bits (57), Expect = 0.043
Identities = 23/87 (26%), Positives = 33/87 (37%), Gaps = 1/87 (1%)

Query: 34 TAAVAAAAPATDAGAAVEEQTEFDVVMKEFGGNKVGAIKAVRAITGLGLKEAKAMVESCP 93
A V APAT + K N+ A + A KEAK+ V++
Sbjct: 1022 EAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATET-TAQNREVAKEAKSNVKANT 1080

Query: 94 ATVKEGVSKEEAEEVKKQLEEAGATVE 120
T + S E +E + + ATVE
Sbjct: 1081 QTNEVAQSGSETKETQTTETKETATVE 1107


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS32135  ECOLNEIPORIN  26     0.043    E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 26.3 bits (58), Expect = 0.043
Identities = 9/57 (15%), Positives = 16/57 (28%), Gaps = 10/57 (17%)

Query: 43 NAKTQNLEVGAPVPVVITVYSDRSFTFETKTPPASYLLKKAAKLQKGSGTPNLNKVG 99
+ EV A ++ F TP SY + + ++V
Sbjct: 243 YSHNSQTEVAA----------TLAYRFGNVTPRVSYAHGFKGSFDATNYNNDYDQVV 289


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS32145  SECETRNLCASE  94     3e-28    Bacterial translocase SecE signature.
		>SECETRNLCASE#Bacterial translocase SecE signature.

Length = 127

Score = 94.2 bits (234), Expect = 3e-28
Identities = 46/103 (44%), Positives = 66/103 (64%)

Query: 9 SRFDKLKWALVCLLVVAAAGGNFYFSSYALSLRAGAVLVVVVAALAIASLTGKGRQAVDF 68
+ +KW +V L++ A GN+ + L LRA AV++++ AA +A LT KG+ V F
Sbjct: 12 RGLEAMKWVVVVALLLVAIVGNYLYRDIMLPLRALAVVILIAAAGGVALLTTKGKATVAF 71

Query: 69 LREARIELRKVVWPARKEVQQTTMIVGAFVVVAALVLWGVDSI 111
REAR E+RKV+WP R+E TT+IV A V +L+LWG+D I
Sbjct: 72 AREARTEVRKVIWPTRQETLHTTLIVAAVTAVMSLILWGLDGI 114


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS32170  BCTERIALGSPG  43     3e-08    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 43.3 bits (102), Expect = 3e-08
Identities = 17/49 (34%), Positives = 26/49 (53%)

Query: 5 KNRNQLGFSLIEIMVAVAIIGILIAIAVPSYQEYVSRAKNAALDATIAA 53
Q GF+L+EIMV + IIG+L ++ VP+ +A + I A
Sbjct: 3 ATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVA 51


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS32180  BCTERIALGSPF  120    1e-34    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 120 bits (303), Expect = 1e-34
Identities = 57/217 (26%), Positives = 107/217 (49%), Gaps = 13/217 (5%)

Query: 6 VKLYRWQAVTDSQQTHQGINSALNQQ------------ALECDLFSRDKIKRYYPSLYQR 53
+ Y +QA+ + +G A + + L D D+ K L R
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 54 YRSRINSHFISQWTRQLATLLSAQLPLAEALAISAEASPFCLQQQFILSINQDIKQGLSL 113
+ R+++ ++ TRQLATL++A +PL EAL A+ S Q + ++ + +G SL
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 114 SQSLKNTQH-FDATYIAMIQAGEVSGQLIDTLITLANDQERQTALKKRLNIALIYPVIIS 172
+ ++K F+ Y AM+ AGE SG L L LA+ E++ ++ R+ A+IYP +++
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 173 IFSCIITFAMLQFIVPQFKQFYAALNTPLPRLTELIL 209
+ + + +L +VP+ + + + LP T +++
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLM 217



Score = 73.7 bits (181), Expect = 1e-17
Identities = 31/128 (24%), Positives = 67/128 (52%)

Query: 63 ISQWTRQLATLLSAQLPLAEALAISAEASPFCLQQQFILSINQDIKQGLSLSQSLKNTQH 122
+++ R L+ L ++ +PL +A+ IS + + + +++G+SL ++L+ T
Sbjct: 273 TARYARTLSILNASAVPLLQAMRISGDVMSNDYARHRLSLATDAVREGVSLHKALEQTAL 332

Query: 123 FDATYIAMIQAGEVSGQLIDTLITLANDQERQTALKKRLNIALIYPVIISIFSCIITFAM 182
F MI +GE SG+L L A++Q+R+ + + L + L P+++ + ++ F +
Sbjct: 333 FPPMMRHMIASGERSGELDSMLERAADNQDREFSSQMTLALGLFEPLLVVSMAAVVLFIV 392

Query: 183 LQFIVPQF 190
L + P
Sbjct: 393 LAILQPIL 400


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS32190  BCTERIALGSPF  112    8e-32    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 112 bits (282), Expect = 8e-32
Identities = 44/163 (26%), Positives = 81/163 (49%), Gaps = 4/163 (2%)

Query: 40 KKYRDKVHALVEPFLLRLPIIGAIKQKNRLNTFCRTLHILFNSDHHLPTSLTIAIKATNS 99
+K R H LL LP+IG I + + RTL IL S L ++ I+ ++
Sbjct: 248 EKRRVSFHRR----LLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDVMSN 303

Query: 100 PQLIRQAATMCTELEQGTSLHQTLKNHSTFPNIALQMIHAGEHSHQLAIILKQLSALYES 159
+ + + +G SLH+ L+ + FP + MI +GE S +L +L++ + +
Sbjct: 304 DYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADNQDR 363

Query: 160 ELTQTINTLIKLLEPLAIIIIGGLVGLIIIALYLPIFQLSHIL 202
E + + + L EPL ++ + +V I++A+ PI QL+ ++
Sbjct: 364 EFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406



Score = 41.0 bits (96), Expect = 2e-06
Identities = 27/127 (21%), Positives = 52/127 (40%), Gaps = 1/127 (0%)

Query: 72 FCRTLHILFNSDHHLPTSLTIAIKATNSPQLIRQAATMCTELEQGTSLHQTLKNHST-FP 130
R L L + L +L K + P L + A + +++ +G SL +K F
Sbjct: 73 LTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLADAMKCFPGSFE 132

Query: 131 NIALQMIHAGEHSHQLAIILKQLSALYESELTQTINTLIKLLEPLAIIIIGGLVGLIIIA 190
+ M+ AGE S L +L +L+ E ++ P + ++ V I+++
Sbjct: 133 RLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLTVVAIAVVSILLS 192

Query: 191 LYLPIFQ 197
+ +P
Sbjct: 193 VVVPKVV 199


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS32195  PREPILNPTASE  117    7e-34    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 117 bits (294), Expect = 7e-34
Identities = 58/141 (41%), Positives = 84/141 (59%), Gaps = 3/141 (2%)

Query: 88 WHYFSLIILSYFLISLSFIDFYYRYLPDTLTLPLLWLGLLFNLCPTIHHCSINQAILGAV 147
W + ++L++ L++L+FID LPD LTLPLLW GLLFNL S+ A++GA+
Sbjct: 132 WGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGF--VSLGDAVIGAM 189

Query: 148 IGYCNLRLINLLFTRARNKQGLGGGDIKLFAALGAWFGLNSLPNILFIACLLGLSFSLAQ 207
GY L + F K+G+G GD KL AALGAW G +LP +L ++ L+G +
Sbjct: 190 AGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGIGL 249

Query: 208 SLYQ-HRKITHIAFGPFLSLA 227
L + H + I FGP+L++A
Sbjct: 250 ILLRNHHQSKPIPFGPYLAIA 270


65  PSLF89_RS36690  PSLF89_RS32985  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias   Product
PSLF89_RS36690  1       24       3.080686  MFS transporter
PSLF89_RS32970  1       15       1.438201  multidrug efflux MFS transporter
PSLF89_RS37220  1       18       0.332334  LysR family transcriptional regulator
PSLF89_RS32975  2       12       0.720696  ATP-binding protein
PSLF89_RS32980  2       12       0.881359  shikimate kinase AroK
PSLF89_RS32985  3       14       0.058647  type IV pilus secretin PilQ
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS33025  TCRTETA       45     3e-07    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 45.2 bits (107), Expect = 3e-07
Identities = 54/281 (19%), Positives = 87/281 (30%), Gaps = 16/281 (5%)

Query: 60 AGLGNLAATFFYSYLIIQIFSGPLLDRFGARYIGSLALLISALGTWLFAQADQLLWAEIG 119
A G L A + G L DRFG R + ++L +A+ + A A L IG
Sbjct: 43 AHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIG 102

Query: 120 RALMGV-GVAFATVTYLKVAATWFD--ARRFALLSGLVPTAVMIGAVFGQVPLAHVVASE 176
R + G+ G A T D AR F +S ++ G V G ++
Sbjct: 103 RIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGG-----LMGGF 157

Query: 177 GWRRSLELCAILGVIFAVLFLLFVRDKKNHSSVIDDTQQVNWQDIIS--VLKRPANWLLT 234
A L + + + + + +N L+
Sbjct: 158 SPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMA 217

Query: 235 LYSGLAFAPLAVFAGLWGNPFLVASYQLTTADAA-SLTSLVFIGLGVGGPIFGALADYFG 293
++ + V A LW F + SL + + I G +A G
Sbjct: 218 VFFIMQ-LVGQVPAALWV-IFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLG 275

Query: 294 KRTLWMFLGGFVTLASVLCLLYCLGLHSTLLSILMFLFGFG 334
+R M G + + LL + +M L G
Sbjct: 276 ERRALML--GMIADGTGYILLAFAT-RGWMAFPIMVLLASG 313


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS33030  TCRTETB       69     8e-15    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 68.8 bits (168), Expect = 8e-15
Identities = 88/402 (21%), Positives = 149/402 (37%), Gaps = 48/402 (11%)

Query: 12 ILVIFGVIAAQAAVSLYLPSLPAIDHEWHLVSGQAQLTLSAFFLTFGVSQLFYGALSDHF 71
IL F V+ + SLP I ++++ +AF LTF + YG LSD
Sbjct: 21 ILSFFSVLNE----MVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQL 76

Query: 72 GRKPLLLTGLVILVLSSVWAIYATSFHSLL-AARLVQGAGGGALSVLARAIIRDLFHGDE 130
G K LLL G++I SV SF SLL AR +QGAG A L ++ +
Sbjct: 77 GIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKEN 136

Query: 131 LRKAISILAIAASFTPALAPSLGGWLEDHFDWRSSFVILTVYSI---------------- 174
KA ++ + + P++GG + + W +I + I
Sbjct: 137 RGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIK 196

Query: 175 -----------ILLVTIFSLFTETNQYQRQSNESIDFSKVVASYYFVT---------KNK 214
+ + F LFT + + F V VT KN
Sbjct: 197 GHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNI 256

Query: 215 LFWCYGFAILIGYLSLVICLANAPFLLEKKFGLS-AEITGYLMFVQPGFFLVGNLLQHKL 273
F I + ++ ++ P++++ LS AEI ++F ++ + L
Sbjct: 257 PFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGIL 316

Query: 274 TDKISGDLFLKFGMVILGVIGLSFLLQGLFHSAT--LISVLLTLALAGFATSLILVNALA 331
D+ L G+ L V SFL T +++++ L G + + +++ +
Sbjct: 317 VDRRGPLYVLNIGVTFLSV---SFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIV 373

Query: 332 GVLLPFTENAGAAAALSGVLQMVGASFITALISNLHWTSIID 373
L E AGA +L + A++ L ++D
Sbjct: 374 SSSLKQQE-AGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLD 414


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS33040  PF05272       31     0.008    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.8 bits (69), Expect = 0.008
Identities = 11/24 (45%), Positives = 13/24 (54%)

Query: 48 CYSDYVVAVYGPIGAGKSTFLELL 71
C DY V + G G GKST + L
Sbjct: 593 CKFDYSVVLEGTGGIGKSTLINTL 616


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS33050  BCTERIALGSPD  206    1e-63    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 206 bits (526), Expect = 1e-63
Identities = 79/308 (25%), Positives = 140/308 (45%), Gaps = 17/308 (5%)

Query: 16 IVSFDQRTNILLIHDYVDKIKIIKKMIKALDRPVPQVMIEARIVIANRSFEKDLGVKFGV 75
I+ +TN L++ D + ++++I LD PQV++EA I + +LG+++
Sbjct: 311 IIKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWAN 370

Query: 76 SGGGSTVATAGSISGTNAIRQGESPGIAERLNVSLPFMTDSAATGLGRFALAVAKLPGNL 135
G T T + + AI ++ SL SA + A + GN
Sbjct: 371 KNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLA----SALSSFNGIAAGFYQ--GNW 424

Query: 136 LLDLELQALESEGEAEVISTPKLLTAHDQEAFIEQGEEIPYLESTSSG-----AASVSFK 190
+ L AL S + ++++TP ++T + EA G+E+P L + + +V K
Sbjct: 425 --AMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGSQTTSGDNIFNTVERK 482

Query: 191 KAVLGLTVTPHITPDQHIILTIRLSKDSRSALSAGDGGSSANVLPPAIDTRVIKTQALVK 250
+ L V P I ++L I S A S+++ L +TR + LV
Sbjct: 483 TVGIKLKVKPQINEGDSVLLEIEQEVSS----VADAASSTSSDLGATFNTRTVNNAVLVG 538

Query: 251 DGETIVLGGIYEQEKQRVVRRVPFLADLPGIGWLFQSRSQSTLNKELLIFVTPKIMSAAA 310
GET+V+GG+ ++ +VP L D+P IG LF+S S+ + L++F+ P ++
Sbjct: 539 SGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRD 598

Query: 311 SHGVLSTG 318
+ S+G
Sbjct: 599 EYRQASSG 606


66  PSLF89_RS33080  PSLF89_RS33120  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias   Product
PSLF89_RS33080  0       31       5.499977  hypothetical protein
PSLF89_RS33090  1       29       7.359529  conjugal transfer protein TrbE
PSLF89_RS33095  2       21       1.880807  VirB3 family type IV secretion system protein
PSLF89_RS33100  2       22       2.425764  TrbC/VirB2 family protein
PSLF89_RS33110  1       24       3.149735  biotin/lipoyl-binding protein
PSLF89_RS33115  1       23       2.330122  HlyD family efflux transporter periplasmic
PSLF89_RS33120  0       24       2.839068  YifB family Mg chelatase-like AAA ATPase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS33140  PF04335       63     3e-14    VirB8 type IV secretion protein
		>PF04335#VirB8 type IV secretion protein

Length = 227

Score = 63.3 bits (154), Expect = 3e-14
Identities = 31/209 (14%), Positives = 71/209 (33%), Gaps = 11/209 (5%)

Query: 15 EAVTPYQKAAQEWDR-RIGSSRAQANSWRLIAIACIVACILLLIGMMMLIQQKKNVVYVA 73
+ + Y + A W+R ++ ++ ++A ++ + L K YV
Sbjct: 8 DELKAYFEEAASWERDKLAAAERSKKLAWVVAGVAGALATAGVVAVAALTPLKTVEPYVI 67

Query: 74 EVGSSG---QVINVVKTNQPYRPTDAQYQYFIAKFIRHAMSLPLDPVILKNNLLEAYQLT 130
V + + + + +A +YF+A ++R+ + ++
Sbjct: 68 TVDRNTGEASIAAKLHGDATITYDEAVRKYFLATYVRYREG--WIAAAREEYFDAVMVMS 125

Query: 131 ASKGRLQFNELMK---KLQPTRHIGQLTQT-VEVQMVEQITPNSYSATWRQTSYDQNGKV 186
A + +++ K P + T VE++ V + N + + S +
Sbjct: 126 ARPEQDRWSRFYKTDNPQSPQNILANRTDVFVEIKRVSFLGGNVAQVYFTKESVTGSNST 185

Query: 187 TQVKRYHGVFTVSQTMPTTEHEILVNPLG 215
+ V P+ E + NPLG
Sbjct: 186 KTDAVATIKYKVD-GTPSKEVDRFKNPLG 213


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS33170  RTXTOXIND     38     4e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 38.3 bits (89), Expect = 4e-06
Identities = 22/110 (20%), Positives = 42/110 (38%), Gaps = 4/110 (3%)

Query: 9 IISPKTVMLKAQQAGLVTHIYFQSGEQVNKGQRLLQIDNHKQQASLAKAKADLFSLKADY 68
S ++ +K + +V I + GE V KG LL++ +A K ++ L + +
Sbjct: 91 THSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQ 150

Query: 69 QRNLQMAQKNHVSISANTLDQKLGTVRAAQAAVASAKESLAETTVRAPFA 118
R Q SI N L + V+ + + ++ F+
Sbjct: 151 TR----YQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFS 196


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS33175  RTXTOXIND     29     0.014    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 28.6 bits (64), Expect = 0.014
Identities = 17/108 (15%), Positives = 36/108 (33%), Gaps = 13/108 (12%)

Query: 10 TLGSYLAEGDSIVMLT-DSKALLVQYQLPQEYSAQMAINQHVHITTAQQAWAEKTDKPPV 68
T G + ++++++ + L V + + + + Q+ I +A+
Sbjct: 345 TEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKV--EAFPYTR--YGY 400

Query: 69 TTSTVSYISPILITNSHAYLAH-ARINTLNNTMI-------LKPGMTV 108
V I+ I + L I+ N + L GM V
Sbjct: 401 LVGKVKNINLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAV 448


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
PSLF89_RS33205  HTHFIS        38     1e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 37.5 bits (87), Expect = 1e-04
Identities = 33/192 (17%), Positives = 60/192 (31%), Gaps = 49/192 (25%)

Query: 173 LEQAYFQVDTRQTHYLDMADVKGQA----HAKRALEIAAAGRHHLLFVGPPGTGKTMLAS 228
L + + + D + G++ R L L+ G GTGK ++A
Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVAR 178

Query: 229 RLPGILPALSNQEALESAAVHSL----TSAEIDLSCWFIPKFASPHHTASSIAMVG---- 280
A+H + ++ IP+ + G
Sbjct: 179 ------------------ALHDYGKRRNGPFVAINMAAIPR------DLIESELFGHEKG 214

Query: 281 ---GGSVPKPGEISRAHHGVLFLDEL----PEFDRKVLEVLREPLESGQIDIIRASHRAS 333
G G +A G LFLDE+ + ++L VL++ + R
Sbjct: 215 AFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQG------EYTTVGGRTP 268

Query: 334 FPASFQLIAAMN 345
+ +++AA N
Sbjct: 269 IRSDVRIVAATN 280



 
Contact Sachin Pundhir for Bugs/Comments.