PredictBias

identification of genomic and pathogenicity islands in prokaryotic genome
Home | Help | Analyzed genomes
 
A) Input parameters
Genomepaero.gbkThreshold dinucleotide bias2
Threshold codon bias4Threshold %GC bias3
E-value (RPSBlast)0.05Genome (non-pathogenic)
 
B) Compare a potential GI or PAI in related non-pathogenic sp. (phylogenetic tree)
Potential GI or PAI start    end  
Select Organism     
 
C) Potential GIs and PAIs in NC_002516 (download)
S.NoStartEndBiasVirulenceInsertion elementsPrediction
1PA5570PA5555Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA5570015-3.74638550S ribosomal protein L34
PA5569-114-3.581464ribonuclease P
PA5568-112-3.393931inner membrane protein translocase subunit YidC
PA5567-115-2.572335tRNA modification GTPase TrmE
PA5566-216-3.187894hypothetical protein
PA5565-114-3.736112tRNA uridine 5-carboxymethylaminomethyl
PA5564-117-4.15776416S rRNA methyltransferase GidB
PA5563021-4.854734chromosome partitioning protein Soj
PA5562228-5.710240chromosome partitioning protein
PA5561026-6.446658ATP synthase subunit I
PA5560123-6.521586ATP synthase subunit A
PA5559226-6.113878ATP synthase subunit C
PA5558224-5.491877ATP synthase subunit B
PA5557118-4.187250ATP synthase subunit delta
PA5556216-3.644820ATP synthase subunit alpha
PA5555113-3.180513ATP synthase subunit gamma
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA556860KDINNERMP6770.0 60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 677 bits (1748), Expect = 0.0
Identities = 245/581 (42%), Positives = 343/581 (59%), Gaps = 54/581 (9%)

Query: 1 MDIQRSILIVALAVVSYLLVLQWNKDYGQPELPAASASMNTTQGLPDTPSASGTSSDVPT 60
MD QR++L++AL VS+++ W +D T
Sbjct: 1 MDSQRNLLVIALLFVSFMIWQAWEQDKNPQPQA-----------------------QQTT 37

Query: 61 AQSSAAGSEAADK--PVAVSDKLIQVKTDVLDLAIDPRGGDIVQLGLLQYPRRLDRPDVP 118
++ A AAD+ P + KLI VKTDVLDL I+ RGGD+ Q L YP+ L+ P
Sbjct: 38 QTTTTAAGSAADQGVPASGQGKLISVKTDVLDLTINTRGGDVEQALLPAYPKELNSTQ-P 96

Query: 119 FPLFDNGRERTYLAQSGLTGADGPDASSAG-RPLFRSAQSSYQLADGQNELVVDLSFS-H 176
F L + + Y AQSGLTG DGPD + G RPL+ + +Y LA+GQNEL V ++++
Sbjct: 97 FQLLETSPQFIYQAQSGLTGRDGPDNPANGPRPLYNVEKDAYVLAEGQNELQVPMTYTDA 156

Query: 177 DGVNYIKRFTFHRGLKADCSDKEKAQKKIECINENAYQVGVSYLIDNQSGKTWSGNLFAQ 236
G + K F RG Y V V+Y + N K + F Q
Sbjct: 157 AGNTFTKTFVLKRG---------------------DYAVNVNYNVQNAGEKPLEISSFGQ 195

Query: 237 LKRDGSADPSSTTATG---VSTYLGAAVWTPDSPYKKISTKDM-DKEQFKESVQGGWVAW 292
LK+ + P T + + T+ GAA TPD Y+K + D E S +GGWVA
Sbjct: 196 LKQSITLPPHLDTGSSNFALHTFRGAAYSTPDEKYEKYKFDTIADNENLNISSKGGWVAM 255

Query: 293 LQHYFVTAWVPTKGEQHQVMTRKDGQGNYIVGFTGPTLSVPAGSKVETDLTLYAGPKLQK 352
LQ YF TAW+P + T G G +G+ + V G + TL+ GP++Q
Sbjct: 256 LQQYFATAWIPHNDGTNNFYTANLGNGIAAIGYKSQPVLVQPGQTGAMNSTLWVGPEIQD 315

Query: 353 HLKELSPGLELTVDYGFLWFIAQPIFWLLQHIHSLIGNWGWSIIALTVLIKLAFFPLSAA 412
+ ++P L+LTVDYG+LWFI+QP+F LL+ IHS +GNWG+SII +T +++ +PL+ A
Sbjct: 316 KMAAVAPHLDLTVDYGWLWFISQPLFKLLKWIHSFVGNWGFSIIIITFIVRGIMYPLTKA 375

Query: 413 SYRSMARMRAVSPKMQAIKEQHGDDRQKMSQAMMELYKKEKINPLGGCLPILVQMPVFLS 472
Y SMA+MR + PK+QA++E+ GDD+Q++SQ MM LYK EK+NPLGGC P+L+QMP+FL+
Sbjct: 376 QYTSMAKMRMLQPKIQAMRERLGDDKQRISQEMMALYKAEKVNPLGGCFPLLIQMPIFLA 435

Query: 473 LYWVLLESVEMRQAPWLGWITDLSVKDPFFILPIVMGGTMLIQQMLNPTP-PDPMQAKVM 531
LY++L+ SVE+RQAP+ WI DLS +DP++ILPI+MG TM Q ++PT DPMQ K+M
Sbjct: 436 LYYMLMGSVELRQAPFALWIHDLSAQDPYYILPILMGVTMFFIQKMSPTTVTDPMQQKIM 495

Query: 532 KLMPIIFTFFFLWFPAGLVLYWVVNNCLSIAQQWYITRKIE 572
MP+IFT FFLWFP+GLVLY++V+N ++I QQ I R +E
Sbjct: 496 TFMPVIFTVFFLWFPSGLVLYYIVSNLVTIIQQQLIYRGLE 536


2PA5473PA5466Y        NNGenomic Island
LocusTagDNBiasCDNBias%GCBiasProduct
PA5473380.310999hypothetical protein
PA5472290.533248hypothetical protein
PA5471391.011096hypothetical protein
PA5470391.051901peptide chain release factor-like protein
PA54692101.265385hypothetical protein
PA54681102.026014citrate transporter
PA54672122.499185hypothetical protein
PA54662112.202946hypothetical protein
3PA5409PA5384Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA5409419-0.389051hypothetical protein
PA5408525-1.399748hypothetical protein
PA5407624-2.989001hypothetical protein
PA5406521-1.519521hypothetical protein
PA54050171.234633hypothetical protein
PA5404-1131.665920hypothetical protein
PA5403-2121.126982transcriptional regulator
PA5402-2121.109652hypothetical protein
PA54010140.903756hypothetical protein
PA54000140.506024electron transfer flavoprotein subunit alpha
PA5399012-0.167974dimethylglycine catabolism protein DgcB
PA5398212-0.357940dimethylglycine catabolism protein DgcA
PA5397211-0.355133hypothetical protein
PA53962100.497412hypothetical protein
PA53952102.033521hypothetical protein
PA53940102.171097cardiolipin synthetase
PA5393-192.138788hypothetical protein
PA5392-1102.046376hypothetical protein
PA5391-1102.849660hypothetical protein
PA5390-1112.399152acetylornithine deacetylase
PA53892122.694626CdhR family transcriptional regulator
PA53882102.810586hypothetical protein
PA53873113.368060carnitine dehydrogenase
PA53863123.5678793-hydroxybutyryl-CoA dehydrogenase
PA53854132.681730carnitine dehydrogenase
PA53842112.450658lipolytic protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5399TCRTETA320.008 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 31.7 bits (72), Expect = 0.008
Identities = 35/155 (22%), Positives = 52/155 (33%), Gaps = 16/155 (10%)

Query: 5 LLPVLLFAALALAVLGAAKRFLMWRRGRPAKVDWIGGL----LQMPRRYLVDLHHVVERD 60
LL L AA+ A++ A + GR + G+ + Y+ D+ ER
Sbjct: 76 LLVSLAGAAVDYAIMATAPFLWVLYIGR-----IVAGITGATGAVAGAYIADITDGDERA 130

Query: 61 RYMSRTHVATAGGFVLAALLAILVHGFGLHGRILGFALLAATALMFVGALF--VARRRLD 118
R+ G V +L L+ GF H A L + L +
Sbjct: 131 RHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERR 190

Query: 119 PPSRLSKGP-----WMRLPKSLLAFAASFFLATLP 148
P R + P W R + A A FF+ L
Sbjct: 191 PLRREALNPLASFRWARGMTVVAALMAVFFIMQLV 225


4PA5269PA5253Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA5269017-3.738322hypothetical protein
PA5268-113-3.137433magnesium/cobalt transporter
PA5267015-2.884533secreted protein Hcp
PA5266017-2.848234hypothetical protein
PA5265020-3.452229hypothetical protein
PA5264-114-2.119159hypothetical protein
PA52630100.188909argininosuccinate lyase
PA52623150.257774alginate biosynthesis protein AlgZ/FimS
PA52612140.679267alginate biosynthesis regulatory protein AlgR
PA52604140.442984porphobilinogen deaminase
PA5259418-0.256492uroporphyrinogen-III synthase
PA52581023-0.548596hypothetical protein
PA52571126-0.764483hypothetical protein
PA5256925-1.164409disulfide bond formation protein
PA5255825-1.426318anti-RNA polymerase sigma 70 factor
PA5254722-1.452500FkbP-type peptidyl-prolyl cis-trans isomerase
PA5253620-1.118981alginate regulatory protein AlgP
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5262PF065801821e-56 Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 182 bits (463), Expect = 1e-56
Identities = 76/308 (24%), Positives = 137/308 (44%), Gaps = 23/308 (7%)

Query: 64 LFVQWIVLLSAALFCRLRPLLARLPVALAGSACCLLVVALT------LGCTAVAEHYQLG 117
+F I L+ L R + R +L V + A ++L
Sbjct: 43 IFNIAISLMGLVLTHAYRSFIKRQGWLKLNMGQIILRVLPACVVIGMVWFVANTSIWRLL 102

Query: 118 GELTRAGE-------VNLYLRHALIALIMSALVLRYFYLQS-------QWRRQQQAELQA 163
+ +++ ++ + S L + + ++ QW+ A+ +A
Sbjct: 103 AFINTKPVAFTLPLALSIIFNVVVVTFMWSLLYFGWHFFKNYKQAEIDQWKMASMAQ-EA 161

Query: 164 RLESLQARIRPHFLFNSLNSIASLIELDPLKAEHAVLDLSDLFRASLAK-PGTLVSWEEE 222
+L +L+A+I PHF+FN+LN+I +LI DP KA + LS+L R SL VS +E
Sbjct: 162 QLMALKAQINPHFMFNALNNIRALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADE 221

Query: 223 LALARRYLSIEQYRLGDRLQLDWQVHGVPANLPIPQLTLQPLLENALIYGIQPRVEGGLV 282
L + YL + + DRLQ + Q++ ++ +P + +Q L+EN + +GI +GG +
Sbjct: 222 LTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKI 281

Query: 283 QVEAVYREGVFQLCVSNPYDEALESPPSKGTRQALHNIDARLGALFGPKASLSVERRDGR 342
++ G L V N AL++ + T L N+ RL L+G +A + + + G+
Sbjct: 282 LLKGTKDNGTVTLEVENTGSLALKNTK-ESTGTGLQNVRERLQMLYGTEAQIKLSEKQGK 340

Query: 343 HYTCLRYP 350
+ P
Sbjct: 341 VNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5261HTHFIS794e-19 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.5 bits (196), Expect = 4e-19
Identities = 31/136 (22%), Positives = 57/136 (41%), Gaps = 5/136 (3%)

Query: 3 VLIVDDEPLARERLARLVGQLDGYRVLEPSASNGEEALTLIDSLKPDIVLLDIRMPGLDG 62
+L+ DD+ R L + + + GY V SN I + D+V+ D+ MP +
Sbjct: 6 ILVADDDAAIRTVLNQALSRA-GYDVR--ITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 63 LQVAARLCEREAPPAVIFCTAHDEF--ALEAFQVSAVGYLVKPVRSEDLAEALKKASRPN 120
+ R+ + V+ +A + F A++A + A YL KP +L + +A
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 121 RVQLAALTKPPASGGS 136
+ + + L G
Sbjct: 123 KRRPSKLEDDSQDGMP 138


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5254INFPOTNTIATR962e-26 Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 95.8 bits (238), Expect = 2e-26
Identities = 61/202 (30%), Positives = 97/202 (48%), Gaps = 16/202 (7%)

Query: 22 KDELAYAVGARLGMRLQQEMPGLELSELLLGLRQAYRGEALEIPPERIEQLLLQHE---- 77
KD+L+Y++GA LG + + + L G++ G L + E+++ +L + +
Sbjct: 31 KDKLSYSIGADLGKNFKNQGIDINPDVLAKGMQDGMSGAQLILTEEQMKDVLSKFQKDLM 90

Query: 78 -------NATTETPRTTPAEARFLANEKARFGVRELTGGVLVSELRRGQGNGIGAATQVH 130
N E + +A FL+ K++ G+ L G+ + G G G + V
Sbjct: 91 AKRSAEFNKKAEENKAK-GDA-FLSANKSKPGIVVLPSGLQYKIIDAGTGAKPGKSDTVT 148

Query: 131 VRYRGLLADGQVFDQSESA---EWFALDSVIEGWRTALRAMPVGARWRVVIPSAQAYGHE 187
V Y G L DG VFD +E A F + VI GW AL+ MP G+ W V +P+ AYG
Sbjct: 149 VEYTGTLIDGTVFDSTEKAGKPATFQVSQVIPGWTEALQLMPAGSTWEVFVPADLAYGPR 208

Query: 188 GAGDLIPPDAPLVFEIDLLGFR 209
G I P+ L+F+I L+ +
Sbjct: 209 SVGGPIGPNETLIFKIHLISVK 230


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5253IGASERPTASE621e-12 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 62.4 bits (151), Expect = 1e-12
Identities = 44/220 (20%), Positives = 67/220 (30%), Gaps = 7/220 (3%)

Query: 134 KAKPATKPAAKAAAKPAVKTVAAKPAAKPAAKPAAKPA-AKPAAKTAAAKPAAKPTAKPA 192
T P A P+V + + A+ P PA A P+ T +K +K
Sbjct: 993 DTTNITTPNNIQADVPSVPSNNEE-IARVDEAPVPPPAPATPSETTETVAENSKQESKTV 1051

Query: 193 AKPAAKPAAKTAAAKPAAKPAAKPVAKPAAKPAAKTAAAKPAAKPAAKPVAKPTAKPAAK 252
K TA + AK A V A + A + K K TA +
Sbjct: 1052 EKNEQDATETTAQNREVAKEAKSNV--KANTQTNEVAQSGSETKETQTTETKETATVEKE 1109

Query: 253 TAAAKPAAKPAAKPAAKPAAKPVAKSAAAKPAAKPAAKPAAKPAAKPAAKPVAAKPAATK 312
A K P P +P A+PA + K ++ T
Sbjct: 1110 EKAKVETEKTQEVPKVTSQVSP---KQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTA 1166

Query: 313 PATAPAAKPAATPSAPAAASSAASATPAAGSNGAAPTSAS 352
PA + ++ P S+ + + N T A+
Sbjct: 1167 DTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPAT 1206



Score = 45.1 bits (106), Expect = 3e-07
Identities = 40/246 (16%), Positives = 80/246 (32%), Gaps = 19/246 (7%)

Query: 45 EKQRGKAQEKLHKARTKLQDAAKAGKTKAQAK--ARETISDLEEALDTLKARQADTRTYI 102
E A+ +++T ++ A +T AQ + A+E S+++ T + Q
Sbjct: 1035 ETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQ------- 1087

Query: 103 VGLKRDVQESLKLAQGVGKVKEAAGKA-LESRKAKPATKPAAKAAAKPAVKTVAAKPAAK 161
+ +E+ E KA +E+ K + K ++ + K ++ +P A+
Sbjct: 1088 --SGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQE-QSETVQPQAE 1144

Query: 162 PAAKPA----AKPAAKPAAKTAAAKPAAKPTAKPAAKPAAKPAAKTAAAKPAAKPAAKPV 217
PA + K TA + AK T+ +P + P +
Sbjct: 1145 PARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENP--ENT 1202

Query: 218 AKPAAKPAAKTAAAKPAAKPAAKPVAKPTAKPAAKTAAAKPAAKPAAKPAAKPAAKPVAK 277
+P + ++ + V T ++ + A V
Sbjct: 1203 TPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLS 1262

Query: 278 SAAAKP 283
A AK
Sbjct: 1263 DARAKA 1268



Score = 33.9 bits (77), Expect = 0.001
Identities = 17/119 (14%), Positives = 29/119 (24%), Gaps = 1/119 (0%)

Query: 233 PAAKPAAKPVAKPTAKPAAKTAAAKPAAKPAAKPAAKPAAKPVAKSAAAKPAAKPAAKPA 292
P + + V A P+ + A+ PV A A P+ A
Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSET-TETVA 1041

Query: 293 AKPAAKPAAKPVAAKPAATKPATAPAAKPAATPSAPAAASSAASATPAAGSNGAAPTSA 351
+ + A A A + A + A + + T
Sbjct: 1042 ENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTET 1100


5PA5102PA5087Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA5102291.191779hypothetical protein
PA51010110.955273hypothetical protein
PA51002110.640727urocanate hydratase
PA50993110.222337transporter
PA50982110.929737histidine ammonia-lyase
PA50972101.011440amino acid permease
PA5096-191.318908ABC transporter
PA5095-161.142792ABC transporter permease
PA5094-380.635094ABC transporter ATP-binding protein
PA5093-113-0.876341histidine/phenylalanine ammonia-lyase
PA5092221-3.188153imidazolonepropionase
PA5091223-3.370420N-formylglutamate amidohydrolase
PA5090323-2.670777hypothetical protein
PA5089527-3.619250hypothetical protein
PA5088629-4.461367hypothetical protein
PA5087415-1.383716hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5092UREASE362e-04 Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 36.3 bits (84), Expect = 2e-04
Identities = 17/33 (51%), Positives = 21/33 (63%)

Query: 341 LAGVTLHAARALGLEASHGSLEVGKLADFVAWD 373
+A T++ A A GL GSLEVGK AD V W+
Sbjct: 406 IAKYTINPAIAHGLSHEIGSLEVGKRADLVLWN 438


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA508856KDTSANTIGN300.019 Rickettsia 56kDa type-specific antigen protein sign...
		>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein

signature.
Length = 533

Score = 29.5 bits (66), Expect = 0.019
Identities = 26/91 (28%), Positives = 41/91 (45%), Gaps = 4/91 (4%)

Query: 41 PQADAWYREAVALAKPDTLRPWDRIVDLYSKAVE----RGHWKAMHNLASLYRTGWPGGV 96
QA A +EAVA A L D+I LY V+ G KAM LA+
Sbjct: 350 QQAQATAQEAVAAAAVRLLNGSDQIAQLYKDLVKLQRHAGIRKAMEKLAAQQEEDAKNQG 409

Query: 97 EKDTQKALDLYQKMIDLKVPQGFYDMAAMIG 127
+ D ++ +K + KV + +D++ ++G
Sbjct: 410 KGDCKQQQGASEKSKEGKVKETEFDLSMVVG 440


6PA5065PA5060Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA5065615-0.856728ubiquinone biosynthetic protein UbiB
PA5064716-1.313404hypothetical protein
PA5063414-1.809555ubiquinone/menaquinone biosynthesis
PA5062314-1.346383hypothetical protein
PA5061311-1.814575hypothetical protein
PA5060311-1.172929polyhydroxyalkanoate synthesis protein PhaF
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5060IGASERPTASE476e-08 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 47.0 bits (111), Expect = 6e-08
Identities = 34/200 (17%), Positives = 64/200 (32%), Gaps = 6/200 (3%)

Query: 110 VPSRNEVKELHSKVDTLTKQIEKLTGVSVKPAAKAAAKPAAKPAA---KPAAKTAAAKPA 166
+ + N ++ V + ++I ++ V P A A + A K +KT
Sbjct: 997 ITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQ 1056

Query: 167 AKPAAKAAAKPAAKPAAKKTAAKTAAAKPA--AKPAAKPTAKAAAKPATKPAA-KAAAKP 223
A + AK A A T + A + + AT KA +
Sbjct: 1057 DATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVET 1116

Query: 224 AAKPAAAKPAAKPAAKPAAATAAKPAAKPAAKPAAKKPAAKKPAAKPAAAKPAAPAASSS 283
K ++ + K + +P A+PA + + + A PA +S
Sbjct: 1117 EKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETS 1176

Query: 284 APAAPAATPAASAPAANAPA 303
+ T + + N+
Sbjct: 1177 SNVEQPVTESTTVNTGNSVV 1196



Score = 42.7 bits (100), Expect = 1e-06
Identities = 27/175 (15%), Positives = 42/175 (24%), Gaps = 6/175 (3%)

Query: 140 PAAKAAAKPAAKPAAKPAAKTAAAKPAAKPAAKAAAK----PAAKPAAKKTAAKTAAAKP 195
P + + A P+ + A+ P PA + T
Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAE 1042

Query: 196 AAKPAAKPTAKAAAKPA-TKPAAKAAAKPAAKPAAAKPAAKPAAKPAAATA-AKPAAKPA 253
+K +K K T + AK A A A+ + T +
Sbjct: 1043 NSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKE 1102

Query: 254 AKPAAKKPAAKKPAAKPAAAKPAAPAASSSAPAAPAATPAASAPAANAPATPSSQ 308
K+ AK K S + P A N P +
Sbjct: 1103 TATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKE 1157


7PA5002PA4988Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA5002-115-3.548164hypothetical protein
PA5001-212-3.343673hypothetical protein
PA5000-210-2.139847alpha-1,3-rhamnosyltransferase WapR
PA4999-29-1.292373O-antigen ligase WaaL
PA4998-39-0.201645hypothetical protein
PA4997-3100.166956transporter MsbA
PA4996-291.779070bifunctional heptose 7-phosphate kinase/heptose
PA4995-191.618278acyl-CoA dehydrogenase
PA4994-181.853138acyl-CoA dehydrogenase
PA4993093.358575hypothetical protein
PA4992-192.952493hypothetical protein
PA4991083.305821hypothetical protein
PA4990193.276280SMR multidrug efflux transporter
PA49890103.371464transcriptional regulator
PA4988093.0665543-deoxy-D-manno-octulosonic acid transferase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5002FLGMRINGFLIF290.039 Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 29.2 bits (65), Expect = 0.039
Identities = 12/30 (40%), Positives = 17/30 (56%)

Query: 9 LKRHRRNKRIGLLVALLALLAVGLLVSPWL 38
L R R N RI L+VA A +A+ + + W
Sbjct: 16 LNRLRANPRIPLIVAGSAAVAIVVAMVLWA 45


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4997ACRIFLAVINRP310.013 Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 31.3 bits (71), Expect = 0.013
Identities = 12/50 (24%), Positives = 23/50 (46%)

Query: 144 ITFNVTMVTGAATDAIKVVIREGLTVVFLFLYLLWMNWKLTLVMLAILPV 193
++ T + + + E + +VFL +YL N + TL+ +PV
Sbjct: 325 YPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPV 374


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4996LPSBIOSNTHSS280.043 Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein

signature.
Length = 166

Score = 28.2 bits (63), Expect = 0.043
Identities = 14/57 (24%), Positives = 26/57 (45%), Gaps = 7/57 (12%)

Query: 346 GCFDILHAGHVTYLEQARAQGDRLIVGVNDDASVTRLKGVGRPINSVDRRMAVLAGL 402
G FD + GH+ +E+ D++ V V + + +P+ SV R+ +A
Sbjct: 7 GSFDPITFGHLDIIERGCRLFDQVYVAVLRNPN-------KQPMFSVQERLEQIAKA 56


8PA4912PA4867Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA4912214-0.123905branched-chain amino acid ABC transporter
PA49111120.837510branched-chain amino acid ABC transporter
PA49100101.509787ABC transporter ATP-binding protein
PA49090102.139503ABC transporter ATP-binding protein
PA49080103.247530ornithine cyclodeaminase
PA4907-192.693930short-chain dehydrogenase
PA4906093.179041transcriptional regulator
PA4905-193.360371vanillate O-demethylase
PA4904-1103.219039vanillate O-demethylase oxygenase
PA4903-1102.745397major facilitator superfamily transporter
PA4902081.496501transcriptional regulator
PA4901071.421688benzoylformate decarboxylase
PA4900-171.686682major facilitator superfamily transporter
PA4899-181.797954aldehyde dehydrogenase
PA4898081.432972vanillate porin OpdK
PA4897-182.249653hypothetical protein
PA48961124.480418RNA polymerase sigma factor
PA48951124.863252transmembrane sensor
PA48942124.810144hypothetical protein
PA48931103.372919urease accessory protein UreG
PA48922112.958628urease accessory protein UreF
PA48912113.093630urease accessory protein UreE
PA48901102.474318transcriptional regulator
PA48892102.493202oxidoreductase
PA48882101.528039acyl-CoA desaturase
PA4887191.342297major facilitator superfamily transporter
PA4886191.519575two-component sensor
PA4885011-0.433026two-component response regulator
PA4884110-0.044826hypothetical protein
PA4883-18-0.683478hypothetical protein
PA488219-0.732162hypothetical protein
PA4881111-0.976590hypothetical protein
PA4880211-1.164824bacterioferritin
PA4879211-0.338597hypothetical protein
PA4878-111-0.489001transcriptional regulator
PA4877-1130.966937hypothetical protein
PA4876-1110.703368OsmE family transcriptional regulator
PA4874-190.393288hypothetical protein
PA48731100.826123heat-shock protein
PA4872290.878933hypothetical protein
PA48711121.218738hypothetical protein
PA48703130.415032hypothetical protein
PA48692151.699138hypothetical protein
PA48683192.638271urease subunit alpha
PA48670173.193355urease subunit beta
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4907DHBDHDRGNASE791e-19 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 79.3 bits (195), Expect = 1e-19
Identities = 61/244 (25%), Positives = 99/244 (40%), Gaps = 14/244 (5%)

Query: 6 FITGATSGFGEACARRFAEAGWSLVLTGRREERLQALAGELSAKTRVL-PLTLDVRDRAA 64
FITGA G GEA AR A G + E+L+ + L A+ R DVRD AA
Sbjct: 12 FITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSAA 71

Query: 65 MSAAVDNLPEEFATLRGLINNAGLALGTDPAQSCDLDDWDTMVDTNIKGLLYSTRLLLPR 124
+ + E + L+N AG+ L S ++W+ N G+ ++R +
Sbjct: 72 IDEITARIEREMGPIDILVNVAGV-LRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSKY 130

Query: 125 LIAHGAGASIVNLGSVAGKWPYPGSHVYGGTKAFVEQFSLNLRCDLQGTGVRVTNLEPGL 184
++ +G SIV +GS P Y +KA F+ L +L +R + PG
Sbjct: 131 MMDRRSG-SIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGS 189

Query: 185 CESEFSLV----------RFGGDQARYDKTYAGAHPIQPEDIAETI-FWIMNQPAHLNIN 233
E++ G + +P DIA+ + F + Q H+ ++
Sbjct: 190 TETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITMH 249

Query: 234 SLEI 237
+L +
Sbjct: 250 NLCV 253


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4903TCRTETA509e-09 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 50.2 bits (120), Expect = 9e-09
Identities = 40/147 (27%), Positives = 57/147 (38%), Gaps = 8/147 (5%)

Query: 55 AEIGLLLSAGLFGMAAGSLFIAPWADRWGRRPLILACLALSGLGMLASALSQAAWQLALL 114
A G+LL+ A + + +DR+GRRP++L LA + + A + W L +
Sbjct: 43 AHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIG 102

Query: 115 R---GLTGLGIGGILASSNVIASEYASRRWRGLAVSLQSTGYALGATLGGLLAVWLIGAW 171
R G+TG A I R G + G G LGGL+ G +
Sbjct: 103 RIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLM-----GGF 157

Query: 172 GWRSVFVFGAGLTLAVIPLVCLCLPES 198
+ F A L C LPES
Sbjct: 158 SPHAPFFAAAALNGLNFLTGCFLLPES 184



Score = 36.7 bits (85), Expect = 1e-04
Identities = 34/146 (23%), Positives = 59/146 (40%), Gaps = 7/146 (4%)

Query: 51 NLGGAEIGLLLSA-GLFGMAAGSLFIAPWADRWGRRPLILACLALSGLGMLASALSQAAW 109
+ IG+ L+A G+ A ++ P A R G R ++ + G G + A + W
Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301

Query: 110 QLALLRGLTGLGIGGILASSNVIASEYASRRW---RGLAVSLQSTGYALGATLGGLLAVW 166
+ L G G+ A +++ + R +G +L S +G L +
Sbjct: 302 MAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAA 361

Query: 167 LIGAW-GWRSVFVFGAGLTLAVIPLV 191
I W GW ++ GA L L +P +
Sbjct: 362 SITTWNGW--AWIAGAALYLLCLPAL 385



Score = 33.3 bits (76), Expect = 0.002
Identities = 41/189 (21%), Positives = 66/189 (34%), Gaps = 5/189 (2%)

Query: 253 RTTLLLWALFFLVMFGFYFIMSWTPKLLVAAGLSTAQGITGGTLLSIGGI---FGAALLG 309
R +++ + L G IM P LL S G LL++ + A +LG
Sbjct: 5 RPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLG 64

Query: 310 GLAARFRLERVLALFMLLTAALLALFSLSAGLPGAALPLGLLIGLCANACVAGLYALAPS 369
L+ RF VL + + A A+ + + L L +G ++ A A A
Sbjct: 65 ALSDRFGRRPVLLVSLAGAAVDYAIMATAPFL--WVLYIGRIVAGITGATGAVAGAYIAD 122

Query: 370 LYDASVRATGVGWGIGVGRGGAILSPLVAGLLLDDGWQPLSLYGAFAAVFVVAAAVLPLL 429
+ D RA G+ G + P++ GL+ A L
Sbjct: 123 ITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLP 182

Query: 430 GARRRERSP 438
+ + ER P
Sbjct: 183 ESHKGERRP 191


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4900TCRTETB461e-07 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 46.4 bits (110), Expect = 1e-07
Identities = 33/187 (17%), Positives = 75/187 (40%), Gaps = 5/187 (2%)

Query: 16 NRTHWLILGWGCFIMLFDGYDMVIYGSVVPRLMQEWQLSPVQAGTLGSCALFGMLFGGTL 75
N H IL W C + F + ++ +P + ++ P + + + G +
Sbjct: 9 NLRHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAV 68

Query: 76 LAPLADRFGRRRLV---IATTLLASLAAFLTGHARDPLELGAGRFFTGLALGALVPSAIN 132
L+D+ G +RL+ I S+ F+ GH+ L L RF G A +
Sbjct: 69 YGKLSDQLGIKRLLLFGIIINCFGSVIGFV-GHSFFSL-LIMARFIQGAGAAAFPALVMV 126

Query: 133 LISEFAPAGRRSTLVTVMSAFYSVGAVLSALLAIAMIPAWGWQSVFYVAVLPVLAVPLML 192
+++ + P R ++ + ++G + + + W + + ++ ++ VP ++
Sbjct: 127 VVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLM 186

Query: 193 RWLPESA 199
+ L +
Sbjct: 187 KLLKKEV 193



Score = 32.2 bits (73), Expect = 0.005
Identities = 28/152 (18%), Positives = 54/152 (35%), Gaps = 13/152 (8%)

Query: 258 VAFAMCMLMSYG------LNTWLPKLMAGGGYALGSSLAFLVTLNVGATLGALFGGWLAD 311
+ +C+L + LN LP + S+ + ++G G L+D
Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74

Query: 312 RLGAGRTLVLFFAL--AAASLAALGLGPGPWLLNGLLVVA--GATTIGTLAVIHAYAAQF 367
+LG R L+ + + + +G L+ + A + V+ A++
Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVV---VARY 131

Query: 368 YPAWVRSTGVGWAAGVGRLGAIAGPMLGGSLL 399
P R G + +G GP +GG +
Sbjct: 132 IPKENRGKAFGLIGSIVAMGEGVGPAIGGMIA 163


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4890HTHTETR646e-15 TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 64.3 bits (156), Expect = 6e-15
Identities = 32/179 (17%), Positives = 57/179 (31%), Gaps = 10/179 (5%)

Query: 1 MSSPRAEQKQQTRHALMSAARHLMESGRGFGSLSLREVTRAAGIVPAGFYRHFSDMDQLG 60
M+ ++ Q+TR ++ A L +G S SL E+ +AAG+ Y HF D L
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQ-QGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLF 59

Query: 61 LALVAEVDETFRATLR--AVRRNEFELGGLIDASVRIF-LDAVGANRSQF---LFLAREQ 114
+ + + L L + + + R +F E
Sbjct: 60 SEIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEF 119

Query: 115 YGGSLPIRQAIASLRQRITDDLAADLALLNKMPHLDGAALDVFADLVVKTVFATLPELI 173
G ++QA +L D + L D+ + + L+
Sbjct: 120 VGEMAVVQQAQRNLCLESYDRIE---QTLKHCIEAKMLPADLMTRRAAIIMRGYISGLM 175


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4887TCRTETA330.002 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 32.9 bits (75), Expect = 0.002
Identities = 70/330 (21%), Positives = 115/330 (34%), Gaps = 37/330 (11%)

Query: 44 GGLMASYYFGLVCGGKFGHKLIASFGHIRSYVACAGI--ATVTVLLHALVDQLEVWLLLR 101
G L+A Y L FG R V + A V + A L V + R
Sbjct: 46 GILLALYALMQFACAPVLGALSDRFG--RRPVLLVSLAGAAVDYAIMATAPFLWVLYIGR 103

Query: 102 F---ITGAVMMNQYMVIESWLNEQAESHQRGKVFAGYMVA-VDLGLVLGQ---GLLA-LS 153
ITGA V +++ + + +R + F G+M A G+V G GL+ S
Sbjct: 104 IVAGITGATGA----VAGAYIADITDGDERARHF-GFMSACFGFGMVAGPVLGGLMGGFS 158

Query: 154 PTLDY---KPLLLVAICFASCLIPLAMTRRVHPAKLVAAPLEVRFFWQR----VPQALGT 206
P + L + L+P + P + A F W R V +
Sbjct: 159 PHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAV 218

Query: 207 IFIAGLMVGAFYGLAPVY-ANRNGLDASQSSF-FVGMCIVAGFCAQWPLG----WLSDRL 260
FI L+ L ++ +R DA+ I+ G L +R
Sbjct: 219 FFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERR 278

Query: 261 DRSWLIRGNAVLLCLASIPMWGLVTLPYWLLLANGFVTGMLLFTLYPLAVALANDHVEQP 320
+ + L + G + P +LLA+G G+ + P A+ + V++
Sbjct: 279 ALMLGMIADGTGYILLAFATRGWMAFPIMVLLASG---GIGM----PALQAMLSRQVDEE 331

Query: 321 RRVALSAMLLTTYGVGACIGPLVAGALMRH 350
R+ L L + + +GPL+ A+
Sbjct: 332 RQGQLQGSLAALTSLTSIVGPLLFTAIYAA 361


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4885HTHFIS764e-18 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 76.0 bits (187), Expect = 4e-18
Identities = 42/156 (26%), Positives = 74/156 (47%), Gaps = 6/156 (3%)

Query: 2 RILVIEDDTKTGEYLKKGLGESGYAVDWSQHGADGLYLALENRYDLVVLDVMLPGLDGWQ 61
ILV +DD L + L +GY V + + A DLVV DV++P + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 IMEVLRK-KHDVPVLFLTARDQLQDRIRGLELGADDYLVKPFSFTELLLRIRTLLRRGVV 120
++ ++K + D+PVL ++A++ I+ E GA DYL KPF TEL+ I L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 121 REAEQVQLADLQLDVLR-----RKVSRQGQVIALTN 151
R ++ + + ++ +++ R + T+
Sbjct: 125 RPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTD 160


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4880HELNAPAPROT392e-06 Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family

signature.
Length = 153

Score = 39.1 bits (91), Expect = 2e-06
Identities = 27/143 (18%), Positives = 56/143 (39%), Gaps = 2/143 (1%)

Query: 26 TEGYSADRQTVLRLLNEALATELVCFLRYKRHYFMATGLKASIAAAEFLEHANQEMQHAD 85
TE ++ V LN L+ + + + R ++ G +F E + + D
Sbjct: 3 TENAKTNQTLVENSLNTQLSNWFLLYSKLHRFHWYVKGPHFFTLHEKFEELYDHAAETVD 62

Query: 86 QLAERIMQLGGEPDFNPRG-LEERSHAEYVEGKTLKDMVTENLIAERIAIDSYREII-TY 143
+AER++ +GG+P + E S + + +MV + + + +I
Sbjct: 63 TIAERLLAIGGQPVATVKEYTEHASITDGGNETSASEMVQALVNDYKQISSESKFVIGLA 122

Query: 144 LGNDDPTTRRIFEEILAQEEEHA 166
N D T +F ++ + E+
Sbjct: 123 EENQDNATADLFVGLIEEVEKQV 145


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4873SHAPEPROTEIN491e-08 Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein

signature.
Length = 347

Score = 49.4 bits (118), Expect = 1e-08
Identities = 52/226 (23%), Positives = 89/226 (39%), Gaps = 45/226 (19%)

Query: 7 ARALGIDFGTSNSTVGWWRPEVEPLIELEDGKITL--PSVVFFNVEERRPVYGRQALGEY 64
+ L ID GT+N+ LI ++ I L PSVV + A+G
Sbjct: 10 SNDLSIDLGTANT-----------LIYVKGQGIVLNEPSVVAIRQDRAGSPKSVAAVGHD 58

Query: 65 LEGYEGRL---MRSLKSLLGSKLLKSETTVLGSALPFKDLLGLFIGQLKARGEAAAGQAF 121
+ GR + +++ + + T + +L FI Q+ + + +
Sbjct: 59 AKQMLGRTPGNIAAIRPMKDGVIADFFVT--------EKMLQHFIKQVHSN---SFMRPS 107

Query: 122 DAVVLGRPVFFVDDDPEADREAQDTLVQVANKLGFKEVSFQYEPIAAAFDYERCIQREEL 181
V++ PV + A RE+ A G +EV EP+AAA +
Sbjct: 108 PRVLVCVPVGATQVERRAIRES-------AQGAGAREVFLIEEPMAAAIGAGLPVSEATG 160

Query: 182 VLIVDIGGGTSDFSLVRLAPERRNLAERQDDILATGGVHIGGTDFD 227
++VDIGGGT++ +++ L ++ + V IGG FD
Sbjct: 161 SMVVDIGGGTTEVAVISLN-----------GVVYSSSVRIGGDRFD 195


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4868UREASE10960.0 Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 1096 bits (2837), Expect = 0.0
Identities = 423/567 (74%), Positives = 479/567 (84%), Gaps = 2/567 (0%)

Query: 2 KISRQAYADMFGPTVGDRVRLADTDLWIEVERDFTVYGEEVKFGGGKVIRDGMGQSQL-G 60
++SR AYA+MFGPTVGD+VRLADT+L+IEVE+DFT +GEEVKFGGGKVIRDGMGQSQ+
Sbjct: 4 RMSRAAYANMFGPTVGDKVRLADTELFIEVEKDFTTHGEEVKFGGGKVIRDGMGQSQVTR 63

Query: 61 AAQVVDTVITNALILDHWGVVKADVGLKDGRIQAIGKAGNPDIQPGVNIAIGAGTEVIAG 120
VDTVITNALILDHWG+VKAD+GLKDGRI AIGKAGNPD+QPGV I +G GTEVIAG
Sbjct: 64 EGGAVDTVITNALILDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAG 123

Query: 121 EGMILTAGGIDTHIHFICPQQIEEALMSGVTTMIGGGTGPAAGTNATTCTSGPWHMARML 180
EG I+TAGG+D+HIHFICPQQIEEALMSG+T M+GGGTGPA GT ATTCT GPWH+ARM+
Sbjct: 124 EGKIVTAGGMDSHIHFICPQQIEEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHIARMI 183

Query: 181 QAADAFPMNIGFTGKGNASLPLPLEEQVLAGAIGLKLHEDWGSTPAAIDNCLEVAERHDI 240
+AADAFPMN+ F GKGNASLP L E VL GA LKLHEDWG+TPAAID CL VA+ +D+
Sbjct: 184 EAADAFPMNLAFAGKGNASLPGALVEMVLGGATSLKLHEDWGTTPAAIDCCLSVADEYDV 243

Query: 241 QVAIHTDTLNESGFVETTLGAFKGRTIHTYHTEGAGGGHAPDIIKACGFANVLPSSTNPT 300
QV IHTDTLNESGFVE T+ A KGRTIH YHTEGAGGGHAPDII+ CG NV+PSSTNPT
Sbjct: 244 QVMIHTDTLNESGFVEDTIAAIKGRTIHAYHTEGAGGGHAPDIIRICGQPNVIPSSTNPT 303

Query: 301 RPFTRNTIDEHLDMLMVCHHLDPAIAEDVAFAESRIRRETIAAEDILHDLGAFSMISSDS 360
RP+T NT+ EHLDMLMVCHHL P I ED+AFAESRIR+ETIAAEDILHD+GAFS+ISSDS
Sbjct: 304 RPYTVNTLAEHLDMLMVCHHLSPTIPEDIAFAESRIRKETIAAEDILHDIGAFSIISSDS 363

Query: 361 QAMGRVGEVITRTWQTADKMKRQRGRLDGDGARNDNFRARRYIAKYTINPAITHGISHEV 420
QAMGRVGEV RTWQTADKMKRQRGRL + NDNFR +RYIAKYTINPAI HG+SHE+
Sbjct: 364 QAMGRVGEVAIRTWQTADKMKRQRGRLKEETGDNDNFRVKRYIAKYTINPAIAHGLSHEI 423

Query: 421 GSVEAGKWADLVLWRPAFFGVKPSLILKGGAIAASLMGDINGSIPTPQPVHYRPMFASYA 480
GS+E GK ADLVLW PAFFGVKP ++L GG IAA+ MGD N SIPTPQPVHYRPMF +Y
Sbjct: 424 GSLEVGKRADLVLWNPAFFGVKPDMVLLGGTIAAAPMGDPNASIPTPQPVHYRPMFGAYG 483

Query: 481 GSRHATSLTFVSQAAFAAGVPQQLGLRKAIGVVSGCR-GVQKTDLIHNGYLPTIEVDAQN 539
SR +S+TFVSQA+ AG+ +LG+ K + V R G+ K +IHN P IEVD +
Sbjct: 484 RSRTNSSVTFVSQASLDAGLAGRLGVAKELVAVQNTRGGIGKASMIHNSLTPHIEVDPET 543

Query: 540 YQVRADGQLLWCEPADVLPMAQRYFLF 566
Y+VRADG+LL CEPA VLPMAQRYFLF
Sbjct: 544 YEVRADGELLTCEPATVLPMAQRYFLF 570


9PA4826PA4818Y        NNGenomic Island
LocusTagDNBiasCDNBias%GCBiasProduct
PA4826281.597930hypothetical protein
PA4825271.555556Mg(2+) transport ATPase
PA4824373.077759hypothetical protein
PA4823573.135271hypothetical protein
PA4822483.050681hypothetical protein
PA4821283.873644transporter
PA4820083.465801hypothetical protein
PA4819083.728501glycosyl transferase family protein
PA4818-173.590465hypothetical protein
10PA4699PA4693Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA4699114-3.365799hypothetical protein
PA4698214-4.606527hypothetical protein
PA4697314-4.875443hypothetical protein
PA4696215-5.072114acetolactate synthase 3 catalytic subunit
PA4695016-5.214087acetolactate synthase small subunit
PA4694116-4.961311ketol-acid reductoisomerase
PA4693-112-3.633034phosphatidylserine synthase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4697IGASERPTASE373e-05 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 36.6 bits (84), Expect = 3e-05
Identities = 23/120 (19%), Positives = 47/120 (39%), Gaps = 18/120 (15%)

Query: 37 AQPPQGAQATTVNTQT----APPPDNFPLP---PSTPAPTIQQKPADPEQKAIDDKVKQQ 89
P QA + + D P+P P+TP+ T + + +Q++ + +Q
Sbjct: 997 ITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQ 1056

Query: 90 VAKE-EAERKQFCEETRNNLAQLKNNPRVRVDEGKGELRRLGEEERQ-ERIAKAEKAIQE 147
A E A+ ++ +E + V+ + E+ + G E ++ + E A E
Sbjct: 1057 DATETTAQNREVAKEAK---------SNVKANTQTNEVAQSGSETKETQTTETKETATVE 1107


11PA4657PA4648Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA46572101.151419hypothetical protein
PA4656291.207968hypothetical protein
PA46553111.113836ferrochelatase
PA46544121.111364major facilitator superfamily transporter
PA46537150.801096hypothetical protein
PA46524120.581486hypothetical protein
PA4651510-0.258480pili assembly chaperone
PA4650512-1.438600hypothetical protein
PA4649410-1.132884hypothetical protein
PA464838-1.341132hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4656NUCEPIMERASE392e-05 Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 39.0 bits (91), Expect = 2e-05
Identities = 53/303 (17%), Positives = 97/303 (32%), Gaps = 73/303 (24%)

Query: 1 MHILLTGGTGLMGRRLCARWRQAGHRLTVF---------SRRPQQVESLCGVGVRGI--- 48
M L+TG G +G + R +AGH++ S + ++E L G +
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 49 ----DRFDE-YGDQPLDAVVNLAGEPIADKPWSHKRKALLWESRVRLTERLVEWLDAREQ 103
+ + + + V +S + +S + ++E R
Sbjct: 61 LADREGMTDLFASGHFERVFISPHR--LAVRYSLENPHAYADSNLTGFLNILE--GCRHN 116

Query: 104 RPDLLLSGSAVGWYGDSGERPVTEEGA--------AAGEDFASELCLAWEQIAREAEKLG 155
+ LL S+ YG + + P + + + AA + + + + G
Sbjct: 117 KIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHL------YG 170

Query: 156 TRVVLLRTGLVLAPEG-------GFL-----GRLLPLYRLGLGGPLGDGRQWMPWIHIED 203
LR V P G F G+ + +Y G+ + +I+D
Sbjct: 171 LPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVY--------NYGKMKRDFTYIDD 222

Query: 204 QIELIDFLLRRP---------DASGPYNACAP---------NPVRNRDFAKALGRALHKP 245
E I L + P + AP +PV D+ +AL AL
Sbjct: 223 IAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIE 282

Query: 246 ARI 248
A+
Sbjct: 283 AKK 285


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4654TCRTETA441e-06 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 43.7 bits (103), Expect = 1e-06
Identities = 60/303 (19%), Positives = 92/303 (30%), Gaps = 33/303 (10%)

Query: 61 FALFYTLCGIPLGRMADNRSRRGLILFGVLVWSAMTAACGLARSYWQFLTFRVGVGVGEA 120
+AL C LG ++D RR ++L + + A A W R+ G+
Sbjct: 52 YALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGI-TG 110

Query: 121 ALSPAAYSLIADSFPRERRATAISVYSMGIYLGSGLAFLLGGLVIKFASAQGDVHLPLFG 180
A A + IAD + RA S G +LGGL+ H P
Sbjct: 111 ATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLM-----GGFSPHAP--- 162

Query: 181 EVRPWQLIFLILGA-AGVLFCLLLLAIREP--ARRGVGAGVAVPLGEVGAYLRANRKTVL 237
F A G+ F + E R A+ + R
Sbjct: 163 --------FFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAA 214

Query: 238 CHNFGFACLSFAGYGSGAWVPTFFVRTHGWDAGHVGVVYGSIVAVFGCLGIVFGGRLADY 297
F + WV F WDA +G+ +A FG L + +
Sbjct: 215 LMAVFFIMQLVGQVPAALWV-IFGEDRFHWDATTIGIS----LAAFGILHSLAQAMITGP 269

Query: 298 WAKRGRSDANMRVGLLAAWAVIPFTLVYPLLDNANWAAALMAPTVFFLSMPFGVAPAAIQ 357
A R + +G++A W MA + L G+ A+Q
Sbjct: 270 VAARLGERRALMLGMIADGTGYILLAFA----TRGW----MAFPIMVLLASGGIGMPALQ 321

Query: 358 EIM 360
++
Sbjct: 322 AML 324


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4652PF00577403e-130 Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 403 bits (1037), Expect = e-130
Identities = 129/800 (16%), Positives = 249/800 (31%), Gaps = 85/800 (10%)

Query: 49 YYLELVING--RDSGQVVPVNAADGHYL---LDAAALREAGVRLPGNPAGQVAVD----- 98
Y +++ +N + V + L A L G+ + D
Sbjct: 78 YRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVP 137

Query: 99 ---ALPEVRADYDSASQQLHLQVPPDWLPEQRFDDPGLVARA-PARSSLGALFNYDLYYS 154
+ + A D Q+L+L +P ++ + G + L NY+ +
Sbjct: 138 LTSMIHDATAQLDVGQQRLNLTIPQAFMSNR---ARGYIPPELWDPGINAGLLNYNFSGN 194

Query: 155 DPAD-GATPWLSALLEQRLFDGFGV--ISNTGVYTRYFGDADNLDSRYLRYDTYWLYNDE 211
+ A L + G + + ++ D+ + ++ WL D
Sbjct: 195 SVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDI 254

Query: 212 RNMHS-YQLGDYVNGALNWTTPVRMGGFRFARNFGVRPDLVTYPLLRFDGQAAVPSTVDL 270
+ S LGD + + G + A + + PD G A + V +
Sbjct: 255 IPLRSRLTLGDGYTQGDIFDG-INFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTI 313

Query: 271 FINGYKASSADLQPGPFAISNVPYINGAGEATVVTTDAQGRQVVTSLPFYVSNTLLARGL 330
NGY ++ + PGPF I+++ +G+ V +A G + ++P+ L G
Sbjct: 314 KQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGH 373

Query: 331 SDFDLSVGRLRDDYGLRNFSYADNAASGIYRYGVSDRLTLSTHAEAASDLRLLGIGGDIA 390
+ + ++ G R + +G+ T+ + A R G
Sbjct: 374 TRYSITAGEYRSGNAQQ---EKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKN 430

Query: 391 VATFGTLSLAASGSDGQGDSGQQY---------------------LLGYSYYSRR-LGLS 428
+ G LS+ + ++ Q+ L+GY Y + +
Sbjct: 431 MGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFA 490

Query: 429 LQHIERSAGYGDLGTLDGEYQLSRRTD------------QATASLTFDEQGTIGTGYFDI 476
R GY + TD Q T + T+
Sbjct: 491 DTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQ 550

Query: 477 RARDGS-RTRLANLSYSRPIGSRS-SFYLALNKDLDGDGYSALMQLVIPFDI-------- 526
S + + + +L K+ G ++ L +
Sbjct: 551 TYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDS 610

Query: 527 -----NGLLNIGVTRDSDRRYSERVIWSRSTPSQGGLGWNL------GYGGGASRYQQAD 575
+ + ++ D + R + + L +++ G G + A
Sbjct: 611 KSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYAT 670

Query: 576 LTWRMQNVQLQGGLYGETGNYTRWADLSGSLVWMDNAVFASNRINDAFVLVSTKGYPQVP 635
L +R G + +SG ++ N V +ND VLV G
Sbjct: 671 LNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAK 730

Query: 636 IRYENQLMGSTDDNGHLLVPWVAAYYPAKFQIEPLDLPANVSAPEVEQRVAVRQGSGLLL 695
+ ENQ TD G+ ++P+ Y + ++ L NV V +G+ +
Sbjct: 731 V--ENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRA 788

Query: 696 DFPIRAVVAASISLVDERGEPLPLGSQAEETGSGQRASVGWDGQVYFEGLQSDNQLRVV- 754
+F R V + + +PLP G+ S V +GQVY G+ +++V
Sbjct: 789 EFKAR-VGIKLLMTLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKW 847

Query: 755 -RPDGRACQARFRLDTRKPT 773
+ C A ++L
Sbjct: 848 GEEENAHCVANYQLPPESQQ 867


12PA4559PA4554Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA4559111-4.095239lipoprotein signal peptidase
PA4558111-3.985954FkbP-type peptidyl-prolyl cis-trans isomerase
PA4557212-4.1305854-hydroxy-3-methylbut-2-enyl diphosphate
PA4556112-4.276515type 4 fimbrial biogenesis protein PilE
PA4555013-4.354654type 4 fimbrial biogenesis protein PilY2
PA4554013-4.227030type 4 fimbrial biogenesis protein PilY1
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4558INFPOTNTIATR341e-04 Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 33.8 bits (77), Expect = 1e-04
Identities = 21/54 (38%), Positives = 33/54 (61%), Gaps = 5/54 (9%)

Query: 8 GQESRVTLHFALKLEDGNVVDSTFDK--QPASFKVGDGNLLPGFEQALFGLKAG 59
G+ VT+ + L DG V DST +K +PA+F+V ++PG+ +AL + AG
Sbjct: 142 GKSDTVTVEYTGTLIDGTVFDST-EKAGKPATFQV--SQVIPGWTEALQLMPAG 192


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4557PF06704280.033 DspF/AvrF protein
		>PF06704#DspF/AvrF protein

Length = 129

Score = 27.5 bits (61), Expect = 0.033
Identities = 10/28 (35%), Positives = 18/28 (64%), Gaps = 2/28 (7%)

Query: 194 KNDIC--YATQNRQDAVKELADQCDMVL 219
+N +C Y +Q+ + AV E+ D +MV+
Sbjct: 26 QNGVCALYDSQDNEAAVIEMPDHSEMVI 53


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4556BCTERIALGSPG421e-07 Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 41.8 bits (98), Expect = 1e-07
Identities = 18/49 (36%), Positives = 31/49 (63%), Gaps = 3/49 (6%)

Query: 4 RQKGFTLLEMVVVVAVIGILLGIAIPSYQNYVIRSNRTEGQALLSDAAA 52
+Q+GFTLLE++VV+ +IG+L + +P N + + + Q +SD A
Sbjct: 6 KQRGFTLLEIMVVIVIIGVLASLVVP---NLMGNKEKADKQKAVSDIVA 51


13PA4533PA4519Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA4533214-1.077814hypothetical protein
PA4532010-3.311488hypothetical protein
PA4531-117-5.381514hypothetical protein
PA4530-115-4.564972zinc-binding protein
PA4529-113-3.663605dephospho-CoA kinase
PA4528-114-4.058296type 4 prepilin peptidase PilD
PA4526-115-3.537049type 4 fimbrial biogenesis protein PilB
PA4525-314-2.079031type 4 fimbrial protein PilA
PA4524-18-1.736565*nicotinate-nucleotide pyrophosphorylase
PA4523-110-1.932896hypothetical protein
PA4522-29-2.512758N-acetyl-anhydromuranmyl-L-alanine amidase
PA4521-28-1.861384hypothetical protein
PA4520-27-2.311918chemotaxis transducer
PA4519110-3.425243ornithine decarboxylase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4529DHBDHDRGNASE300.005 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 30.0 bits (67), Expect = 0.005
Identities = 23/88 (26%), Positives = 32/88 (36%), Gaps = 11/88 (12%)

Query: 5 WILGLTGGIGSGKSAAAEHFISLGVHLVDADHAARW--VVEPGRPALAKIVERFGDGILL 62
+I G GIG A A S G H+ D+ V A A+ E F
Sbjct: 12 FITGAAQGIGE---AVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF------ 62

Query: 63 PDGQLDRAALRERIFQAPEERRWLEQLL 90
P D AA+ E + E ++ L+
Sbjct: 63 PADVRDSAAIDEITARIEREMGPIDILV 90


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4528PREPILNPTASE353e-125 Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family

signature.
Length = 290

Score = 353 bits (908), Expect = e-125
Identities = 165/283 (58%), Positives = 195/283 (68%), Gaps = 1/283 (0%)

Query: 3 LLDYLASHPLAFVLCTILLGLLVGSFLNVVVHRLPKMMERNWKAEAREALGLEPE-PKQA 61
LL+ P + L L++GSFLNVV+HRLP M+ER W+AE R + E +
Sbjct: 4 LLELAHGLPWLYFSLVFLFSLMIGSFLNVVIHRLPIMLEREWQAEYRSYFNPDDEGVDEP 63

Query: 62 TYNLVLPNSACPRCGHEIRPWENIPLVSYLALGGKCSSCKAAIGKRYPLVELATALLSGY 121
YNL++P S CP C H I ENIPL+S+L L G+C C+A I RYPLVEL TALLS
Sbjct: 64 PYNLMVPRSCCPHCNHPITALENIPLLSWLWLRGRCRGCQAPISARYPLVELLTALLSVA 123

Query: 122 VAWHFGFTWQAGAMLLLTWGLLAMSLIDADHQLLPDVLVLPLLWLGLIANHFGLFASLDD 181
VA W A LLLTW L+A++ ID D LLPD L LPLLW GL+ N G F SL D
Sbjct: 124 VAMTLAPGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGD 183

Query: 182 ALFGAVFGYLSLWSVFWLFKLVTGKEGMGYGDFKLLAMLGAWGGWQILPLTILLSSLVGA 241
A+ GA+ GYL LWS++W FKL+TGKEGMGYGDFKLLA LGAW GWQ LP+ +LLSSLVGA
Sbjct: 184 AVIGAMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGA 243

Query: 242 ILGVIMLRLRNAESGTPIPFGPYLAIAGWIALLWGDQITRTYL 284
+G+ ++ LRN PIPFGPYLAIAGWIALLWGD ITR YL
Sbjct: 244 FMGIGLILLRNHHQSKPIPFGPYLAIAGWIALLWGDSITRWYL 286


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4525BCTERIALGSPG501e-10 Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 49.9 bits (119), Expect = 1e-10
Identities = 17/54 (31%), Positives = 34/54 (62%)

Query: 1 MKAQKGFTLIELMIVVAIIGILAAIAIPQYQNYVARSEGASALATINPLKTTVE 54
Q+GFTL+E+M+V+ IIG+LA++ +P +++ A++ I L+ ++
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALD 57


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4524RTXTOXIND290.020 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 29.0 bits (65), Expect = 0.020
Identities = 23/141 (16%), Positives = 45/141 (31%), Gaps = 4/141 (2%)

Query: 75 QVEDGQRVEPNQMLFQLKGP-ARALLTGERSALNFLQLLSGTATRSQHYADLVAGTAVKL 133
V++G+ V +L +L A A +S+L +L TR Q + + +
Sbjct: 111 IVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARL---EQTRYQILSRSIELNKLPE 167

Query: 134 LDTRKTLPGLRLAQKYAVTCGGCHNHRIGLYDAFLIKENHIAACGGIDRAIAEARRIAPG 193
L ++++ + + + ++ +R AR
Sbjct: 168 LKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYE 227

Query: 194 KPVEVEVENLDELRQALEAGA 214
VE LD+ L A
Sbjct: 228 NLSRVEKSRLDDFSSLLHKQA 248


14PA4471PA4448Y        NNGenomic Island
LocusTagDNBiasCDNBias%GCBiasProduct
PA4471213-1.058167hypothetical protein
PA4470112-1.554248fumarate hydratase
PA4469-114-1.824561hypothetical protein
PA4468-215-2.718946superoxide dismutase
PA4467017-3.603188hypothetical protein
PA4466119-5.043265phosphoryl carrier protein
PA4465-118-5.074901hypothetical protein
PA4464019-5.690467nitrogen regulatory IIA protein
PA4463120-5.261184hypothetical protein
PA4462017-4.103210RNA polymerase factor sigma-54
PA4461315-3.399054ABC transporter ATP-binding protein
PA4460213-3.254959hypothetical protein
PA4459212-3.218886hypothetical protein
PA4458113-2.888437hypothetical protein
PA4457011-2.901937arabinose-5-phosphate isomerase KdsD
PA4456013-3.599706ABC transporter ATP-binding protein
PA4455-114-3.300246ABC transporter permease
PA4454-115-3.035881hypothetical protein
PA4453214-1.823239hypothetical protein
PA4452413-1.462086hypothetical protein
PA4451412-1.191866hypothetical protein
PA4450413-0.923784UDP-N-acetylglucosamine
PA4449311-0.935430ATP phosphoribosyltransferase
PA4448211-1.594847histidinol dehydrogenase
15PA4437PA4424Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA4437011-3.035726hypothetical protein
PA4436-111-3.089001transcriptional regulator
PA4435012-4.271212acyl-CoA dehydrogenase
PA4434115-4.759624oxidoreductase
PA4433219-5.94941350S ribosomal protein L13
PA4432220-5.62872230S ribosomal protein S9
PA4431215-5.186188iron-sulfur protein
PA4430114-5.376936cytochrome b
PA4429117-4.566382cytochrome C1
PA4428015-2.082808stringent starvation protein A
PA4427111-1.289347ClpXP protease specificity-enhancing factor
PA4426112-1.495210hypothetical protein
PA4425213-1.482247phosphoheptose isomerase
PA4424215-1.439572hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4434HELNAPAPROT290.016 Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family

signature.
Length = 153

Score = 29.1 bits (65), Expect = 0.016
Identities = 25/106 (23%), Positives = 45/106 (42%), Gaps = 17/106 (16%)

Query: 102 NLKFNRQHIVAALDASLERLQTDWLDLYQLHWPERRTNFFGQLGYQHQ--EESFTPLEET 159
N K N+ + +L+ L + L++ HW + +FF L H+ EE + ET
Sbjct: 5 NAKTNQTLVENSLNTQLSNWFLLYSKLHRFHWYVKGPHFF-TL---HEKFEELYDHAAET 60

Query: 160 LEVLDEQVRAGKIRHIGLSNETPWGTMT-FLRLA--EERGWPRAVS 202
++ + E++ A IG P T+ + A + G + S
Sbjct: 61 VDTIAERLLA-----IGGQ---PVATVKEYTEHASITDGGNETSAS 98


16PA4168PA4161Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA41682101.728601second ferric pyoverdine receptor FpvB
PA41672123.0779012,5-diketo-D-gluconate reductase B
PA41660114.096075acetyltransferase
PA41650105.193538transcriptional regulator
PA4164094.882827hypothetical protein
PA4163184.894122amidase
PA41622104.905111short-chain dehydrogenase
PA4161-2113.295108ferric enterobactin transporter FepG
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4166SACTRNSFRASE391e-06 Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 39.2 bits (91), Expect = 1e-06
Identities = 18/61 (29%), Positives = 26/61 (42%), Gaps = 2/61 (3%)

Query: 76 RSTWAAQDVCYLEDLYVSPDVRGQQIGKQLIEYVRRQAEERRCARLYWHTQESNHRAQRL 135
RS W +ED+ V+ D R + +G L+ A+E L TQ+ N A
Sbjct: 83 RSNW--NGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHF 140

Query: 136 Y 136
Y
Sbjct: 141 Y 141


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4162DHBDHDRGNASE1196e-35 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 119 bits (299), Expect = 6e-35
Identities = 75/258 (29%), Positives = 117/258 (45%), Gaps = 32/258 (12%)

Query: 5 RTALVTGATRGIGLALARRLAASGWSVVGI-----------------ARHASDDFPGRLL 47
+ A +TGA +GIG A+AR LA+ G + + ARHA + FP
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHA-EAFP---- 63

Query: 48 CCDLADPAQTAETLRGLLSESA-VDALVNNAGIALPQSLENLDLAALQQVFDLNVRVAVQ 106
D+ D A E + E +D LVN AG+ P + +L + F +N
Sbjct: 64 -ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 107 LAQACLPGLKRSPAGRIVNLCSRAIHGAR-ERTAYAAAKSALVGVTRTWALELAPLGITV 165
+++ + +G IV + S R AYA++K+A V T+ LELA I
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 166 NAVAPGPIETELFRQTRPVGGEEERRILST-------IPMQRLGRPDEVAALIEFLLSEG 218
N V+PG ET++ E+ I + IP+++L +P ++A + FL+S
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 219 ASFVTGQVIGVDGGGSLG 236
A +T + VDGG +LG
Sbjct: 243 AGHITMHNLCVDGGATLG 260


17PA3892PA3884Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA38922130.004826hypothetical protein
PA38912130.090188ABC transporter ATP-binding protein
PA38901100.860356ABC transporter permease
PA3889191.199831ABC transporter
PA3888291.707751ABC transporter permease
PA3887182.560347Na+/H+ antiporter NhaP
PA3886a1133.488723hypothetical protein
PA38860143.621274hypothetical protein
PA38851133.555749protein tyrosine phosphatase TpbA
PA38840133.007703hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3892RTXTOXIND506e-09 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 49.8 bits (119), Expect = 6e-09
Identities = 24/160 (15%), Positives = 60/160 (37%), Gaps = 13/160 (8%)

Query: 82 RIAVKQAESLVASRKATL-----EMRQLNAR-RRAEMDEMVVSRESRDDAHNTAAAAMAD 135
+ AV + E+ L ++ Q+ + A+ + +V++ +++ + +
Sbjct: 251 KHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDN 310

Query: 136 YEQAKAQLDAARLNLERTRVVAQVDGYVTNLNVHR-GDYARVGEAKMAVI-DKNSYWVYG 193
+L + + + A V V L VH G E M ++ + ++ V
Sbjct: 311 IGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTA 370

Query: 194 YFEETKLPYIREGDPVDMQLMS-----GEHLKGHVESIAR 228
+ + +I G +++ + +L G V++I
Sbjct: 371 LVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINL 410



Score = 49.4 bits (118), Expect = 7e-09
Identities = 21/179 (11%), Positives = 63/179 (35%), Gaps = 16/179 (8%)

Query: 13 LLILLVAVFIGRTLW--VNYMDTPWTRDGRVRAD--VINVAADVSGIVVDVPVRDNQLVK 68
+ ++ + + + ++ T +G++ + + IV ++ V++ + V+
Sbjct: 60 VAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVR 119

Query: 69 KGDLLMQIDPDHYRIAVKQAESLVASRKATLEMRQLNARRRAEMDEMVVSRESRDDAHNT 128
KGD+L+++ + +S + + R R E++++ + +
Sbjct: 120 KGDVLLKLTALGAEADTLKTQSSLLQARLEQT-RYQILSRSIELNKLPELKLPDEPYFQN 178

Query: 129 AAAAM---------ADYEQAKAQLDAARLNLERTRVVAQVDGYVTNLNVHRGDYARVGE 178
+ + + Q LNL++ R A+ + +N +
Sbjct: 179 VSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKR--AERLTVLARINRYENLSRVEKS 235


18PA3871PA3866Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA3871036-6.497232PpiC-type peptidyl-prolyl cis-trans isomerase
PA3870042-8.402670molybdenum cofactor biosynthesis protein A
PA3869143-9.296200hypothetical protein
PA3868038-8.200990hypothetical protein
PA3867030-6.214534DNA invertase
PA3866025-4.750031pyocin protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3866PYOCINKILLER1994e-57 Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 199 bits (507), Expect = 4e-57
Identities = 169/484 (34%), Positives = 242/484 (50%), Gaps = 54/484 (11%)

Query: 197 QELESKA-------RSMEAQAQQVTQQLGADFNAITTPTATKVQQRVKAVDASLSQVSTQ 249
QE ES A R + Q ++ +++ F VQ + DA+L
Sbjct: 51 QEFESYADVGVDPRRYVPLQVKEKRREIELQFRDAEKKLEASVQAELDKADAALGPAKNL 110

Query: 250 VSGAVASATQAVQVKTAQAQQQANSQISSSQNLISAAFRNQIALAAQASGEAQVTAFNTQ 309
V + +++ + QQ+ + + + + S +N + A+ GE V N
Sbjct: 111 APLDVIN--RSLTIVGNALQQKNQKLLLNQKKITSLGAKNFLTRTAEEIGEQAVREGN-- 166

Query: 310 VKQIVNEATAFSTARKQALAQFAANAAAEVETEVNRLTAQVKASPTKAASQAAQATLRQF 369
+N A+ + + A ++ TE ++ Q TL
Sbjct: 167 ----INGPEAYMRFLDREMEGLTAAYNVKLFTE------------AISSLQIRMNTLTAA 210

Query: 370 TESTYAKVNTYAQQTEAELQAKAKEVSAAVAEAQRAATNDLQKTADVVPGQIIQAANTLA 429
S A A++ A + E A A RAA NT A
Sbjct: 211 KASIEAAAANKAREQAAAEAKRKAEEQARQQAAIRAA-------------------NTYA 251

Query: 430 MPATSTPLVSVAGFG--SAAVETARLAASLTNAVNRLVQIAGSGPGAYVATFAVLSLYSD 487
MPA + + + AG G A A LA ++++A+ L ++ S P FA L+ S
Sbjct: 252 MPANGSVVATAAGRGLIQVAQGAASLAQAISDAIAVLGRVLASAPSVMAVGFASLTYSSR 311

Query: 488 QAGKDSDKVPAGVRNALALEASALGLPGTADLQSVAKAGGTVDMPVRLTSAAQESPSGKS 547
A + D+ P VR AL ++A+ LGLP + +L +VAKA GTVD+P+RLT+ A+ + +
Sbjct: 312 TAEQWQDQTPDSVRYALGMDAAKLGLPPSVNLNAVAKASGTVDLPMRLTNEARGN---TT 368

Query: 548 QIAAMLTNGATVPKGVPVRAATLNAATGRYEVTVPAKSTVPNTPPLILTWTPATPPGSQN 607
++ + T+G +VPK VPVR A NA TG YEVTVP ST PPLILTWTPA+PPG+QN
Sbjct: 369 TLSVVSTDGVSVPKAVPVRMAAYNATTGLYEVTVP--STTAEAPPLILTWTPASPPGNQN 426

Query: 608 PSSTTPVVPQPVPVYEGATITPVQAEPESYPGVPLDLDDLIVIFPVGSGVKPIYIMFNHN 667
PSSTTPVVP+PVPVYEGAT+TPV+A PE+YPGV +DLI+ FP SG+KPIY+MF +
Sbjct: 427 PSSTTPVVPKPVPVYEGATLTPVKATPETYPGVITLPEDLIIGFPADSGIKPIYVMFR-D 485

Query: 668 PHDV 671
P DV
Sbjct: 486 PRDV 489


19PA3821PA3808Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA3821211-2.032861preprotein translocase subunit SecD
PA3820213-2.129103preprotein translocase subunit SecF
PA3819114-2.003170hypothetical protein
PA3818114-2.484070type III secretion system regulator SuhB
PA3817116-3.134504methyltransferase
PA3816118-3.017923O-acetylserine synthase
PA3815114-1.947335HTH-type transcriptional regulator
PA3814114-2.038388cysteine desulfurase
PA3813214-1.975376scaffold protein
PA3812317-2.093134iron-binding protein IscA
PA3811217-2.602931co-chaperone HscB
PA3810116-3.004326chaperone protein HscA
PA3809117-3.375336(2Fe-2S) ferredoxin
PA3808216-3.003920hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3821SECFTRNLCASE811e-18 Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 81.0 bits (200), Expect = 1e-18
Identities = 41/180 (22%), Positives = 85/180 (47%), Gaps = 15/180 (8%)

Query: 446 TIGPSLGADNIAKGIDASLWGMLFVSLFIIVIY---RF---FGVIATVALAFNMVMLVAL 499
++GP + + + + + L + +I+ Y RF F + A VAL ++++ V L
Sbjct: 142 SVGPKVSGELVWTAVWS-----LLAATVVIMFYIWVRFEWQFALGAVVALVHDVLLTVGL 196

Query: 500 MSILGATLTLPGIAGIVLTMGMAVDANVLIFSRIREEL--ANGMSVQRAIHEGFNRAFTA 557
++L L +A ++ G +++ V++F R+RE L M ++ ++ N +
Sbjct: 197 FAVLQLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVNETLSR 256

Query: 558 ILDANLTSLLVGGILYAMGTGPVKGFAVTMSLGIITSMFTAIMVTRAMVNLIFGGRDFKK 617
+ +T+LL + G ++GF M G+ T ++++ V A ++F G D K
Sbjct: 257 TVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYV--AKNIVLFIGLDRNK 314


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3820SECFTRNLCASE303e-105 Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 303 bits (778), Expect = e-105
Identities = 101/299 (33%), Positives = 163/299 (54%), Gaps = 20/299 (6%)

Query: 8 INFMGIRNVAFAVTLILTVIALGSWFTKGINFGLDFTGGTLIELTYEQPADLGKVRGQLV 67
+F + F +++ + ++ G+NFG+DF GGT I D+G R L
Sbjct: 14 FDFFRWQWATFGAAIVMMIASVILPLVIGLNFGIDFKGGTTIRTESTTAIDVGVYRAALE 73

Query: 68 GAGYEDAVVQSFGDAR------DVLVRMPSED------------PELGKKVATALQQADA 109
D ++ D ++R+ ++ EL KV TAL D
Sbjct: 74 PLELGDVIISEVRDPSFREDQHVAMIRIQMQEDGQGAEGQGAQGQELVNKVETALTAVDP 133

Query: 110 GNPANLKRVEYVGPQVGEELRDQGGLGMLLALGGILLYVGFRFQWKFALGAILSLVHDAI 169
+ E VGP+V EL +L A I+ Y+ RF+W+FALGA+++LVHD +
Sbjct: 134 A--LKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVL 191

Query: 170 IVMGVLSFFQVTFDLTVLAAVLAVVGYSLNDTIVIFDRVRENFRVLRKADLVENLNISTS 229
+ +G+ + Q+ FDLT +AA+L + GYS+NDT+V+FDR+REN + L + +N+S +
Sbjct: 192 LTVGLFAVLQLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVN 251

Query: 230 QTLLRTIATSVSTLLAIAALLFFGGDNLFGFSIALFVGVMAGTYSSIYIANVVLIWLNL 288
+TL RT+ T ++TLLA+ +L +GGD + GF A+ GV GTYSS+Y+A +++++ L
Sbjct: 252 ETLSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNIVLFIGL 310


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3811CHANLCOLICIN270.045 Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 27.0 bits (59), Expect = 0.045
Identities = 20/76 (26%), Positives = 30/76 (39%), Gaps = 5/76 (6%)

Query: 103 ELEELQDSADLAGVATFKRRLKAAQAELEREFAACWDDA-----QRREEAERLVRRMQFL 157
EL + + A A +R A E R+ A + A QRR+E ER +
Sbjct: 111 SATELAHANNAAMQAEDERLRLAKAEEKARKEAEAAEKAFQEAEQRRKEIEREKAETERQ 170

Query: 158 DKLAQEVRQLEERLDD 173
KLA+ + L +
Sbjct: 171 LKLAEAEEKRLAALSE 186


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3810SHAPEPROTEIN1072e-27 Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein

signature.
Length = 347

Score = 107 bits (269), Expect = 2e-27
Identities = 79/363 (21%), Positives = 139/363 (38%), Gaps = 56/363 (15%)

Query: 22 VGIDLGTTNSLVAAVRSGVAEPLPDAQGRLILPSAVRYHAERAEVGESARAAAAEDPFNT 81
+ IDLGT N+L+ G+ L PS V +RA +S A +
Sbjct: 13 LSIDLGTANTLIYVKGQGIV---------LNEPSVVAIRQDRAGSPKSVAAVGHD----- 58

Query: 82 VISVKRLMGRGLEDVKQLGEQLPYRFRQGESHMPFIETVQGLKSPV----EVSADILRE- 136
K+++GR + I ++ +K V V+ +L+
Sbjct: 59 ---AKQMLGRTPGN---------------------IAAIRPMKDGVIADFFVTEKMLQHF 94

Query: 137 LRQRAETTLGGELVGAVITVPAYFDDAQRQATKDAARLAGLNVLRLLNEPTAAAVAYGLD 196
++Q + ++ VP +R+A +++A+ AG + L+ EP AAA+ GL
Sbjct: 95 IKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLP 154

Query: 197 KGAEGLVAIYDLGGGTFDISILRLTRGVFEVLATGGDTALGGDDFDHAIAGWVIEQAGLS 256
+ D+GGGT +++++ L V +GGD FD AI +V G
Sbjct: 155 VSEATGSMVVDIGGGTTEVAVISLNGVV-----YSSSVRIGGDRFDEAIINYVRRNYGSL 209

Query: 257 ADLDPGSQRQLLQIACAAKERLTDEASVR---VAYG-DWSGELSRATLDELIEPFVARSL 312
+ ++R +I A E VR +A G L+ + E ++ + +
Sbjct: 210 IG-EATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEILEALQEPLTGIV 268

Query: 313 KSCRRAVRDSGVDLEEI---RSVVMVGGSTRVPRVRTAVGELFGCEPLTDIDPDQVVAIG 369
+ A+ +L R +V+ GG + + + E G + DP VA G
Sbjct: 269 SAVMVALEQCPPELASDISERGMVLTGGGALLRNLDRLLMEETGIPVVVAEDPLTCVARG 328

Query: 370 AAI 372

Sbjct: 329 GGK 331


20PA3795PA3778Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA379529-1.610406oxidoreductase
PA379408-0.730766hypothetical protein
PA3793-19-0.306019hypothetical protein
PA3792-19-0.1355942-isopropylmalate synthase
PA3791-281.094112hypothetical protein
PA3790-191.177665copper transport outer membrane porin OprC
PA3789391.851891hypothetical protein
PA37882131.471783hypothetical protein
PA37873111.783139hypothetical protein
PA37862131.229930hypothetical protein
PA37852111.052949hypothetical protein
PA37842111.219885hypothetical protein
PA37831111.347236hypothetical protein
PA37821102.073838transcriptional regulator
PA37810111.960519transporter
PA37801102.577377hypothetical protein
PA3779192.692523hypothetical protein
PA3778183.346789transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3783ISCHRISMTASE343e-04 Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 33.8 bits (77), Expect = 3e-04
Identities = 26/124 (20%), Positives = 41/124 (33%), Gaps = 11/124 (8%)

Query: 5 QPKRALLVIDVQNEYVSGNLRIEFPAIQSSLERIGAAMDAAHAAGIPIVVVQHLA---PA 61
P RA+L+I Y + I + GIP+V P
Sbjct: 27 DPNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCVQLGIPVVYTAQPGSQNPD 86

Query: 62 D--------SPLFARGSRQAELHEVVASRPYQHKVEKQLASSFVGTGLADWLRERGIDTL 113
D P G + ++ +A + K S+F T L + +R+ G D L
Sbjct: 87 DRALLTDFWGPGLNSGPYEEKIITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEGRDQL 146

Query: 114 AVVG 117
+ G
Sbjct: 147 IITG 150


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3781RTXTOXINA330.004 Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family

signature.
Length = 1024

Score = 32.6 bits (74), Expect = 0.004
Identities = 25/94 (26%), Positives = 38/94 (40%), Gaps = 12/94 (12%)

Query: 161 SSLTNTSVGELFLAGVIPGLL--LAAAFMLLNAVYAYRNGLQARHAAPAWGEILAALSGA 218
+L N G + G+L ++A+F+L N + AA A E+ + G
Sbjct: 233 PNLDNIGAG----LDTVSGILSAISASFILSN-----ADADTRTKAA-AGVELTTKVLGN 282

Query: 219 LTALIAPVIIVAGIVLGLVTPTESGALIALYVAL 252
+ I+ II GL T + LIA V L
Sbjct: 283 VGKGISQYIIAQRAAQGLSTSAAAAGLIASAVTL 316


21PA3671PA3643Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA3671280.875803ABC transporter permease
PA3670281.619876hypothetical protein
PA36694111.936816hypothetical protein
PA36683141.666387hypothetical protein
PA36671121.246300cysteine desulfurase
PA36661130.0748462,3,4,5-tetrahydropyridine-2,6-dicarboxylate
PA36652100.208798hypothetical protein
PA3664-290.160500hypothetical protein
PA366309-0.804033hypothetical protein
PA3662010-1.362487hypothetical protein
PA3661010-1.787875hypothetical protein
PA3660-110-2.214638sodium/hydrogen antiporter
PA3659012-2.887185succinyldiaminopimelate transaminase
PA3658013-3.897382bifunctional
PA3657215-4.152746methionine aminopeptidase
PA3656114-2.96723330S ribosomal protein S2
PA3655110-1.503406elongation factor Ts
PA3654012-0.969415uridylate kinase
PA3653-19-1.875532ribosome recycling factor
PA3652-18-2.113627ditrans,polycis-undecaprenyl-diphosphate
PA3651-18-2.250574phosphatidate cytidylyltransferase
PA365009-3.1084601-deoxy-D-xylulose 5-phosphate reductoisomerase
PA3649110-3.856005zinc metalloprotease
PA3648010-3.432971outer membrane protein Opr86
PA3647213-2.564057hypothetical protein
PA3646311-2.033403UDP-3-O-acylglucosamine N-acyltransferase
PA3645110-2.2740373-hydroxyacyl-[acyl-carrier-protein] dehydratase
PA3644210-2.046856acyl-[acyl-carrier-protein]--UDP-N-
PA3643211-0.957699lipid-A-disaccharide synthase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3666FERRIBNDNGPP290.030 Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 28.8 bits (64), Expect = 0.030
Identities = 27/106 (25%), Positives = 42/106 (39%), Gaps = 23/106 (21%)

Query: 12 VGTQNRQEAWLEVFYAL--------PLLKPSSEIVAAVAPILGY--AAGNQALTFTSQQA 61
VG R E LE+ + PS E++A +AP G+ + G Q L +
Sbjct: 81 VGL--RTEPNLELLTEMKPSFMVWSAGYGPSPEMLARIAPGRGFNFSDGKQPLAMARKSL 138

Query: 62 YQLADALKGIDAAQSALL----------SRLA-ESQKPLVATLLAE 96
++AD L AA++ L R +PL+ T L +
Sbjct: 139 TEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLID 184


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3658YERSSTKINASE320.014 Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 32.0 bits (72), Expect = 0.014
Identities = 22/89 (24%), Positives = 43/89 (48%), Gaps = 1/89 (1%)

Query: 63 ILQQAWQRFDWGDDADIALVAVGGYGRGELHPYSDVDLLILLDSEDQESFREPIEGFLTL 122
I++ + QR D + +G R H + +++L+ L + Q E GFL
Sbjct: 538 IVEPSLQRIQKHLDQTHSFSDIGSLVRAHKHLETLLEVLVTLSQQGQPVSSETY-GFLNR 596

Query: 123 LWDIGLEVGQSVRSVQQCAEEARADLTVI 151
L + + + Q + ++QQ E A+A L+++
Sbjct: 597 LTEAKITLSQQLNTLQQQQESAKAQLSIL 625


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3654CARBMTKINASE373e-05 Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 37.5 bits (87), Expect = 3e-05
Identities = 17/79 (21%), Positives = 28/79 (35%), Gaps = 15/79 (18%)

Query: 132 GEVVIFSAGTGNPFFTT-------------DSAACLRAIEIDADVVLKATKVDGVYTADP 178
G +VI S G G P D A A E++AD+ + T V+G
Sbjct: 186 GVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNGAALY-- 243

Query: 179 FKDPNAEKFERLTYDEVLD 197
+ + + +E+
Sbjct: 244 YGTEKEQWLREVKVEELRK 262


22PA3569PA3547Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA3569-1103.1131783-hydroxyisobutyrate dehydrogenase
PA3568-1102.951524propionyl-CoA synthetase
PA35670104.076602oxidoreductase
PA35660104.496817hypothetical protein
PA3565-1114.754169transcriptional regulator
PA3564-1103.956987hypothetical protein
PA35630103.906078FruR family transcriptional regulator
PA35620104.089966PTS system fructose-specific transporter subunit
PA35612113.1515521-phosphofructokinase
PA35602113.385166PTS system fructose-specific transporter subunit
PA35592132.458336nucleotide sugar dehydrogenase
PA35583143.0773324-amino-4-deoxy-L-arabinose-phosphoundecaprenol
PA35571142.8329754-amino-4-deoxy-L-arabinose-phosphoundecaprenol
PA35560152.1393464-amino-4-deoxy-L-arabinose lipid A transferase
PA3555-1171.9609504-deoxy-4-formamido-L-arabinose-
PA3554-1161.394998bifunctional UDP-glucuronic acid
PA35530180.610214undecaprenyl-phosphate
PA35520180.343274UDP-4-amino-4-deoxy-L-arabinose--oxoglutarate
PA3551218-0.108825bifunctional mannose-1-phosphate
PA3550217-0.144676alginate o-acetyltransferase AlgF
PA3549117-0.142650alginate o-acetylase AlgJ
PA35482190.229839alginate o-acetylase AlgI
PA35472190.737918alginate lyase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3562PHPHTRNFRASE6090.0 Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase

signature.
Length = 572

Score = 609 bits (1573), Expect = 0.0
Identities = 219/565 (38%), Positives = 340/565 (60%), Gaps = 13/565 (2%)

Query: 401 ERLQAIAASPGIASGPAHVQVAQRFEFQPR-GESPAHERERLLRAKRAVDEEIVGLVERS 459
++ IAAS G+A A + + + + + E E+L A EE+ + +++
Sbjct: 3 HKITGIAASSGVAIAKAFIHLEPNVDIEKTSITDVSTEIEKLTAALEKSKEELRAIKDQT 62

Query: 460 TVKA---IREIFVTHREMLDDPELAEQVQLRL-NRGESAEAAWSRVVEDSAAQQEALHDA 515
EIF H +LDDPEL + ++ ++ N +AE A V + + E++ +
Sbjct: 63 EASMGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDNE 122

Query: 516 LLAERAADLRDLGRRVLARLCGVEAPREPE--QPYILVMDEVGPSDVARLDAQRVAGILT 573
+ ERAAD+RD+ +RVL L GVE + +++ +++ PSD A+L+ Q V G T
Sbjct: 123 YMKERAADIRDVSKRVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFAT 182

Query: 574 ARGGATSHSAIIARALGIPALVGAGAAVLGLEPGTALLLDGEHGWLQVAPSTEQLQQAAA 633
GG TSHSAI++R+L IPA+VG ++ G +++DG G + V P+ E+++
Sbjct: 183 DIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYEE 242

Query: 634 ERDARQQRQARADAQRLEPARTRDGHAVEVCANLGDTAGAARAVELGAEGVGLLRTEFVF 693
+R A ++++ EP+ T+DG VE+ AN+G + G EG+GL RTEF++
Sbjct: 243 KRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFLY 302

Query: 694 MNNARAPDLATQEAEYRRVLDALDGRPLVARTLDVGGDKPLPYWPIPHEENPYLGLRGIR 753
M+ + P Q Y+ V+ +DG+P+V RTLD+GGDK L Y +P E NP+LG R IR
Sbjct: 303 MDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTLDIGGDKELSYLQLPKELNPFLGFRAIR 362

Query: 754 LTLQRPQILETQLRALFRAAGERPLRVMFPMVGSLDEWRQARDLALRLREEI------PL 807
L L++ I TQLRAL RA+ L+VMFPM+ +L+E RQA+ + ++++
Sbjct: 363 LCLEKQDIFRTQLRALLRASTYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGVDVS 422

Query: 808 ADLQLGIMVEVPSAALLAPVLAREVDFFSVGTNDLTQYTLAIDRGHPSLSAQADGLHPAV 867
+++GIMVE+PS A+ A + A+EVDFFS+GTNDL QYT+A DR + +S HPA+
Sbjct: 423 DSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPYHPAI 482

Query: 868 LQLIDMTVRAAHAEGKWVGVCGELAADPLALPLLVGLGVDELSVSARSIALVKAGVRELQ 927
L+L+DM ++AAH+EGKWVG+CGE+A D +A+PLL+GLG+DE S+SA SI ++ + +L
Sbjct: 483 LRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQLLKLS 542

Query: 928 LVAARGLARKALGLASAAEVRALVE 952
+ A+KAL L +A EV LV+
Sbjct: 543 KEELKPFAQKALMLDTAEEVEQLVK 567


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3554NUCEPIMERASE1094e-28 Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 109 bits (273), Expect = 4e-28
Identities = 78/362 (21%), Positives = 138/362 (38%), Gaps = 61/362 (16%)

Query: 319 RVLILGVNGFIGNHLSERLLRDGRYEVHGMDIGSDAIE-RLK-------ADPHFHFVEGD 370
+ L+ G GFIG H+S+RLL G ++V G+D +D + LK A P F F + D
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAG-HQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 371 IGIHSEWLE--YHVKKCDVILPLVAIATPIEYT-RNPLRVFELDFEENLRIVRYCVKYG- 426
+ E + + + + + Y+ NP + + L I+ C
Sbjct: 61 LADR-EGMTDLFASGHFERVFISPHRLA-VRYSLENPHAYADSNLTGFLNILEGCRHNKI 118

Query: 427 KRVVFPSTSEVYGMCQDPDFDEDRSNLVVGPINKQRWIYSVSKQLLDRVIWAYGQ-QGLR 485
+ +++ S+S VYG+ + F D S V P++ +Y+ +K+ + + Y GL
Sbjct: 119 QHLLYASSSSVYGLNRKMPFSTDDS--VDHPVS----LYAATKKANELMAHTYSHLYGLP 172

Query: 486 FTLFRPFNWMGPRLDRLDSARIGSSRAITQLILHLVEGTPIRLVDGGAQKRCFTDVDDGI 545
T R F GP R D A ++A+ +EG I + + G KR FT +DD
Sbjct: 173 ATGLRFFTVYGPW-GRPDMALFKFTKAM-------LEGKSIDVYNYGKMKRDFTYIDDIA 224

Query: 546 EALARIIDN---------------RDGRCDGQIVNIGNPDNEASIRQLGEELLRQFEAHP 590
EA+ R+ D ++ NIGN + + L
Sbjct: 225 EAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVE-LMDYIQALEDALGIEA 283

Query: 591 LRAQFPPFAGFREVESRSFYGDGYQDVAHRKPSIDNARRLLDWQPTIELRETIGKTLDFF 650
+ P G DV ++ + P +++ + ++++
Sbjct: 284 KKNMLPLQPG---------------DVLETSADTKALYEVIGFTPETTVKDGVKNFVNWY 328

Query: 651 LH 652

Sbjct: 329 RD 330


23PA3421PA3399Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA3421-2123.398605hypothetical protein
PA34200114.288069transcriptional regulator
PA3419-1114.176601hypothetical protein
PA3418-2113.675210leucine dehydrogenase
PA3417-2123.340295pyruvate dehydrogenase E1 component subunit
PA3416-2113.070026pyruvate dehydrogenase E1 component subunit
PA3415-2122.406622branched-chain alpha-keto acid dehydrogenase
PA34140143.062335hypothetical protein
PA34130111.517078hypothetical protein
PA34121111.894654hypothetical protein
PA34111112.942587hypothetical protein
PA34101103.129254ECF subfamily sigma-70 factor
PA34092113.455367transmembrane sensor
PA34082102.818319heme uptake outer membrane receptor HasR
PA34072133.352576heme acquisition protein HasA
PA34062123.310902transporter HasD
PA34051121.936538metalloprotease secretion protein
PA34042130.620261hypothetical protein
PA3403a113-0.397989hypothetical protein
PA34031140.186406hypothetical protein
PA3402115-0.094635hypothetical protein
PA34012150.063146hypothetical protein
PA34002160.897137hypothetical protein
PA33992142.608548hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3418DHBDHDRGNASE280.032 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 28.5 bits (63), Expect = 0.032
Identities = 17/62 (27%), Positives = 29/62 (46%), Gaps = 1/62 (1%)

Query: 160 LGSDDLEGLRVAVQGLGH-VGYALAEQLAAVGAELLVCDLDPGRVQLAVEQLGAHPLAPE 218
+ + +EG + G +G A+A LA+ GA + D +P +++ V L A E
Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE 60

Query: 219 AL 220
A
Sbjct: 61 AF 62


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3407PF064382761e-97 Heme acquisition protein HasAp
		>PF06438#Heme acquisition protein HasAp

Length = 205

Score = 276 bits (707), Expect = 1e-97
Identities = 205/205 (100%), Positives = 205/205 (100%)

Query: 1 MSISISYSTTYSGWTVADYLADWSAYFGDVNHRPGQVVDGSNTGGFNPGPFDGSQYALKS 60
MSISISYSTTYSGWTVADYLADWSAYFGDVNHRPGQVVDGSNTGGFNPGPFDGSQYALKS
Sbjct: 1 MSISISYSTTYSGWTVADYLADWSAYFGDVNHRPGQVVDGSNTGGFNPGPFDGSQYALKS 60

Query: 61 TASDAAFIAGGDLHYTLFSNPSHTLWGKLDSIALGDTLTGGASSGGYALDSQEVSFSNLG 120
TASDAAFIAGGDLHYTLFSNPSHTLWGKLDSIALGDTLTGGASSGGYALDSQEVSFSNLG
Sbjct: 61 TASDAAFIAGGDLHYTLFSNPSHTLWGKLDSIALGDTLTGGASSGGYALDSQEVSFSNLG 120

Query: 121 LDSPIAQGRDGTVHKVVYGLMSGDSSALQGQIDALLKAVDPSLSINSTFDQLAAAGVAHA 180
LDSPIAQGRDGTVHKVVYGLMSGDSSALQGQIDALLKAVDPSLSINSTFDQLAAAGVAHA
Sbjct: 121 LDSPIAQGRDGTVHKVVYGLMSGDSSALQGQIDALLKAVDPSLSINSTFDQLAAAGVAHA 180

Query: 181 TPAAAAAEVGVVGVQELPHDLALAA 205
TPAAAAAEVGVVGVQELPHDLALAA
Sbjct: 181 TPAAAAAEVGVVGVQELPHDLALAA 205


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3405RTXTOXIND417e-145 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 417 bits (1073), Expect = e-145
Identities = 96/435 (22%), Positives = 170/435 (39%), Gaps = 8/435 (1%)

Query: 15 AALELDEK---RFSRLGWGLVLLGFVGFLLWAGLAPLDKGVGVSGTVMVAGSRKAVQHPT 71
A LEL E R RL ++ V + + L ++ +G + +G K ++
Sbjct: 44 AHLELIETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIE 103

Query: 72 GGLVRHIRVHEGERVEAGQVLLEMDATQARAQADGLFAQYLAALASLARLSAERDEKARI 131
+V+ I V EGE V G VLL++ A A A + L A R
Sbjct: 104 NSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELN 163

Query: 132 EFPAELLALDDPRLPTLLEQQ----RQLHDSRRRALRLELDGLAETVAGSQAQLDGLQAA 187
+ P EL D+P + E++ L + + + + +A+ + A
Sbjct: 164 KLP-ELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLAR 222

Query: 188 LRSKEQQRAALEEQLRGLRQLASEGYVPRNRLLDSERLLAQVNGEIAGDLGSLGSTRRQI 247
+ E + +L L + + ++ +L+ E + E+ L +I
Sbjct: 223 INRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEI 282

Query: 248 LELRLRMAQRREKFQEEVRASLADAQVRAEELRNRLASARFDLANSEVRAPVAGLVVGQE 307
L + + F+ E+ L L LA S +RAPV+ V +
Sbjct: 283 LSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLK 342

Query: 308 VFTEGGVIAPGQQLMEILPERQPLLVDARLPVEMVDKVRVGLPVELMFSAFNQSTTPRVE 367
V TEGGV+ + LM I+PE L V A + + + + VG + AF + +
Sbjct: 343 VHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLV 402

Query: 368 GEVTLVSADRLLDERSEAPYYRVRIRVGEEGVRRLAGLEIRPGMPVEAFVRSGERSLLNY 427
G+V ++ D + D+R + + + + GM V A +++G RS+++Y
Sbjct: 403 GKVKNINLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISY 462

Query: 428 LFKPLADRTHLALGE 442
L PL + +L E
Sbjct: 463 LLSPLEESVTESLRE 477


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3404RTXTOXIND320.007 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 31.7 bits (72), Expect = 0.007
Identities = 20/171 (11%), Positives = 49/171 (28%), Gaps = 11/171 (6%)

Query: 60 LPSLRYDYNKARNDSTVSQGDARVERDYRSYASTLSLEQPLFDYEAYARYRQ-GEAQAL- 117
L +L + + + S++ Q R S + P ++ E + L
Sbjct: 127 LTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLR 186

Query: 118 ---FADEQFRGRSQELA---VRLFAAYSETLFAREQVVLAEAQRRALETQLAFNQRAFEE 171
EQF + + L +E L ++ E R +++L +
Sbjct: 187 LTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHK 246

Query: 172 GEGTRTDLLE---TRARLSLTRAEEIAASDRAAAARRTLEAMLGQALEDRE 219
+ +LE + ++ + + + + +
Sbjct: 247 QAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFK 297


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3402RTXTOXIND566e-11 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 56.0 bits (135), Expect = 6e-11
Identities = 25/161 (15%), Positives = 59/161 (36%), Gaps = 17/161 (10%)

Query: 41 IVSSKAKGRVQVLHVRRGDEVKQGDLLISLDSPELEAQLDALHAARNQAQAQLDESLHGT 100
+ V+ + V+ G+ V++GD+L+ L + EA ++ QA+ + +
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILS 157

Query: 101 REESIRALKASLAQAEAELRNAESDFQRNQQMVERGFLSRTQFDLSRRERDVARDRVAEA 160
R + L E +N ++++ L + QF + ++ + +
Sbjct: 158 RSIELNKLPELKLPDEPYFQNV-----SEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKK 212

Query: 161 RANLDEGLKGDREERRQALQAAVRRADAQIAELQAQIDDLQ 201
RA + A + R + ++++DD
Sbjct: 213 RAER------------LTVLARINRYENLSRVEKSRLDDFS 241



Score = 52.9 bits (127), Expect = 7e-10
Identities = 29/205 (14%), Positives = 77/205 (37%), Gaps = 24/205 (11%)

Query: 75 LEAQLDALHAARNQAQAQLDESLHGTREESIRALKASLAQAEAELRNAESDFQRNQQMVE 134
++ Q + Q + LD+ + + A + + E R +S ++
Sbjct: 191 IKEQFSTWQNQKYQKELNLDK-----KRAERLTVLARINRYENLSRVEKSRLDDFSSLLH 245

Query: 135 RGFLSRTQFDLSRRERDVARDRVAEARANLDE------GLKGDREERRQALQAAV----R 184
+ +++ + A + + ++ L++ K + + Q + + R
Sbjct: 246 KQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLR 305

Query: 185 RADAQIAELQAQI----DDLQ---VRAPVNGEVGPIPA-EQGELINAYSPLLTLVRLDDS 236
+ I L ++ + Q +RAPV+ +V + +G ++ L+ +V DD+
Sbjct: 306 QTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDT 365

Query: 237 YFV-FNLREDILAKVRKGDRIVMQV 260
V ++ + + G +++V
Sbjct: 366 LEVTALVQNKDIGFINVGQNAIIKV 390


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3400ABC2TRNSPORT280.039 ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein

signature.
Length = 262

Score = 28.4 bits (63), Expect = 0.039
Identities = 27/122 (22%), Positives = 50/122 (40%), Gaps = 1/122 (0%)

Query: 246 LGYRQSASFFMLLGIVLPFLIAVIALSEFIAELLPTEESVYLTMTFITLPLFYMAGYSWP 305
LGY Q S L ++ +A +L + L P+ + T + P+ +++G +P
Sbjct: 139 LGYTQWLSLLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFP 198

Query: 306 EQAMPDWVRWLADAIPSTWAIRAIAEMNQMDLPLREVSDHALVLLGMAATYALLGTLLYQ 365
+P + A +P + +I I + + P+ +V H L L T L +
Sbjct: 199 VDQLPIVFQTAARFLPLSHSIDLIRPI-MLGHPVVDVCQHVGALCIYIVIPFFLSTALLR 257

Query: 366 YR 367
R
Sbjct: 258 RR 259


24PA3296PA3278Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA3296119-3.360048alkaline phosphatase
PA3295431-5.554693HIT family protein
PA3294531-5.258671hypothetical protein
PA3293742-6.993487hypothetical protein
PA3292743-7.403121hypothetical protein
PA3291326-5.040047hypothetical protein
PA3290218-3.011618hypothetical protein
PA3289-2100.155404hypothetical protein
PA3288-212-0.281479hypothetical protein
PA3287-114-0.317685hypothetical protein
PA3286012-0.8735243-oxoacyl-ACP synthase
PA3285113-1.867532ECF subfamily sigma-70 factor
PA3284313-2.436042hypothetical protein
PA3283212-2.344108hypothetical protein
PA3282112-2.536649hypothetical protein
PA3281211-2.754577hypothetical protein
PA3280212-2.708265pyrophosphate-specific outer membrane porin
PA3279010-2.157106phosphate-specific outer membrane porin OprP
PA3278215-0.961815hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3288ISCHRISMTASE280.013 Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 28.4 bits (63), Expect = 0.013
Identities = 18/93 (19%), Positives = 35/93 (37%), Gaps = 3/93 (3%)

Query: 75 AADRVFVKHGY--LPTAELVDHLRALRAERVLVCGIQADTCVLAAGFALFDAGLQPTLIG 132
D V K Y L++ +R +++++ GI A L F ++ +G
Sbjct: 116 DDDLVLTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVG 175

Query: 133 DLVLGSSLDRSGELGVRLWKHHFGQVVSLAEVL 165
D V SL++ ++ + V +L
Sbjct: 176 DAVADFSLEKH-QMALEYAAGRCAFTVMTDSLL 207


25PA3211PA3201Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA32112101.416828ABC transporter permease
PA32102111.361655potassium uptake protein TrkH
PA32092132.973531hypothetical protein
PA32083132.929802hypothetical protein
PA32073152.399888hypothetical protein
PA32062160.589747two-component sensor
PA32051130.155566hypothetical protein
PA32040130.178246two-component response regulator
PA3203114-0.744538hypothetical protein
PA3202311-0.128251hypothetical protein
PA3201311-0.648491intracellular septation protein A
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3206PF06580290.026 Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.4 bits (66), Expect = 0.026
Identities = 16/100 (16%), Positives = 35/100 (35%), Gaps = 17/100 (17%)

Query: 341 VDNLLRNAVRFNPVGQPLEVRASSAGDYLRLSVRDHGPGIAAELQEQLGEPFFRAPNQSS 400
V+N +++ + P G + ++ + + L V + G +E
Sbjct: 264 VENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE-------------- 309

Query: 401 PGHGLGLA-IARRAIERHGGHLRLG-NHPDGGFIATLSLP 438
G GL + R +G ++ + G A + +P
Sbjct: 310 -STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3204HTHFIS1039e-28 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 103 bits (258), Expect = 9e-28
Identities = 42/117 (35%), Positives = 63/117 (53%)

Query: 4 LLLIDDDRELCELLGTWLVQEGFSVRASHDGAQARRALAEQTPDAVVLDVMLPDGSGLEL 63
+L+ DDD + +L L + G+ VR + + A R +A D VV DV++PD + +L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 64 LKQLRGDHPDLPVLMLSARGEPLDRILGLELGADDYLAKPCDPRELTARLRAVLRRT 120
L +++ PDLPVL++SA+ + I E GA DYL KP D EL + L
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3203IGASERPTASE280.010 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 28.1 bits (62), Expect = 0.010
Identities = 16/76 (21%), Positives = 31/76 (40%), Gaps = 8/76 (10%)

Query: 22 EEPAPAPIPAAQPSITQATAELERRLVETERQRDELVSRMRQENRQLREQ--------LQ 73
E P P P PA T+ AE ++ +T + ++ + +NR++ ++ Q
Sbjct: 1022 EAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQ 1081

Query: 74 AAQAQRQPPLLTEEQT 89
+ + E QT
Sbjct: 1082 TNEVAQSGSETKETQT 1097


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3202adhesinmafb309e-04 Neisseria meningitidis: adhesin MafB signature.
		>adhesinmafb#Neisseria meningitidis: adhesin MafB signature.

Length = 467

Score = 30.4 bits (68), Expect = 9e-04
Identities = 13/45 (28%), Positives = 18/45 (40%)

Query: 53 AAGFTGSLIVAEFDSLAAAQSWAEADPYRAAGVYAEVVVKPFKKV 97
G GS+ E ++ A W + +P A V A V KV
Sbjct: 278 VIGGLGSVAGFEKNTREAVDRWIQENPNAAETVEAVFNVAAAAKV 322


26PA3173PA3142Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA3173211-0.174148short-chain dehydrogenase
PA3172210-0.475929phosphoglycolate phosphatase
PA3171210-1.071622ubiquinone biosynthesis O-methyltransferase
PA317019-0.984401N-ethylammeline chlorohydrolase
PA3169112-1.331590methylthioribose-1-phosphate isomerase
PA3168113-2.942771DNA gyrase subunit A
PA3167315-3.7164353-phosphoserine/phosphohydroxythreonine
PA3166327-6.546531bifunctional chorismate mutase/prephenate
PA3165235-8.360960histidinol-phosphate aminotransferase
PA3163340-10.750571cytidylate kinase
PA3162148-12.67493130S ribosomal protein S1
PA3161259-13.987351integration host factor subunit beta
PA3160262-14.610081O-antigen chain length regulator
PA3159465-15.478475UDP-N-acetyl-d-glucosamine 6-dehydrogenase
PA3158572-16.559923UDP-N-acetyl-2-amino-2-deoxy-D-glucuronate
PA3157677-17.066223acetyltransferase
PA3156479-18.124452UDP-2-acetamido-3-amino-2,
PA3155483-19.455081UDP-2-acetamido-2-deoxy-3-oxo-D-glucuronate
PA3154387-20.524635B-band O-antigen polymerase
PA3153182-19.384645O-antigen translocase
PA3152177-18.134076imidazole glycerol phosphate synthase subunit
PA3151175-16.884536imidazole glycerol phosphate synthase subunit
PA3150375-16.114958LPS biosynthesis protein WbpG
PA3149372-13.815604glycosyltransferase WbpH
PA3148365-11.801660UDP-2,3-diacetamido-2,3-dideoxy-D-glucuronate
PA3147249-8.723325glycosyl transferase WbpJ
PA3146246-7.501433NAD-dependent epimerase/dehydratase
PA3145031-5.735572glycosyltransferase WbpL
PA3143018-3.636019hypothetical protein
PA3142011-3.375712hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3173DHBDHDRGNASE892e-23 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 89.3 bits (221), Expect = 2e-23
Identities = 56/195 (28%), Positives = 91/195 (46%), Gaps = 5/195 (2%)

Query: 11 LKDRVILVTGAGRGIGAAAAKTFAAHGATVLLLGKTEEYLNEVYDAIEAAGHPQAAVIPF 70
++ ++ +TGA +GIG A A+T A+ GA + + E L +V +++A A F
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEA---F 62

Query: 71 NLETAQPHQFEELAATLENEFGHIDGLLHNASILGPRSPMQQISGENFMRVMQVNVNAMF 130
+ +E+ A +E E G ID L++ A +L P + +S E + VN +F
Sbjct: 63 PADVRDSAAIDEITARIEREMGPIDILVNVAGVLRP-GLIHSLSDEEWEATFSVNSTGVF 121

Query: 131 MLTTAMLPLMKLSSDASIIFTSSSVGRKGRAYWGAYSVSKFATEGLMQTLADELDGTSAI 190
+ ++ M SI+ S+ R AY+ SK A + L EL I
Sbjct: 122 NASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAE-YNI 180

Query: 191 RANSVNPGATRTSMR 205
R N V+PG+T T M+
Sbjct: 181 RCNIVSPGSTETDMQ 195


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3170UREASE372e-04 Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 36.6 bits (85), Expect = 2e-04
Identities = 20/41 (48%), Positives = 23/41 (56%), Gaps = 3/41 (7%)

Query: 341 DAHRALRMA---TLNGARALGLERLIGSLEAGKAADLVAFD 378
D R R T+N A A GL IGSLE GK ADLV ++
Sbjct: 398 DNFRVKRYIAKYTINPAIAHGLSHEIGSLEVGKRADLVLWN 438


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3161DNABINDINGHU1181e-38 Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 118 bits (297), Expect = 1e-38
Identities = 35/89 (39%), Positives = 54/89 (60%), Gaps = 1/89 (1%)

Query: 2 TKSELIERIVTHQGQLSAKDVELAIKTMLEQMSQALATGDRIEIRGFGSFSLHYRAPRVG 61
K +LI + V +L+ KD A+ + +S LA G+++++ GFG+F + RA R G
Sbjct: 3 NKQDLIAK-VAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKG 61

Query: 62 RNPKTGESVRLDGKFVPHFKPGKELRDRV 90
RNP+TGE +++ VP FK GK L+D V
Sbjct: 62 RNPQTGEEIKIKASKVPAFKAGKALKDAV 90


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3147ANTHRAXTOXNA340.002 Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 33.6 bits (76), Expect = 0.002
Identities = 10/36 (27%), Positives = 16/36 (44%), Gaps = 2/36 (5%)

Query: 363 DITAAIFRLLLLSEDERRTMGQRGRDAVL-EHYTYE 397
D+ L LSE+E+ +M RG + +E
Sbjct: 110 DLVEHK-ELQDLSEEEKNSMNSRGEKVPFASRFVFE 144


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3146NUCEPIMERASE791e-18 Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 78.7 bits (194), Expect = 1e-18
Identities = 57/342 (16%), Positives = 109/342 (31%), Gaps = 64/342 (18%)

Query: 1 MVTGASGFVGSALCCELARTGYAVIAV-------------VRRVVERIPSVTYIEADLTD 47
+VTGA+GF+G + L G+ V+ + R + P + + DL D
Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDLAD 63

Query: 48 PATFAGEFPT--VDCIIHLAGRAHILTDKVADPLAAFREVNRDATVRLATRALEAGVKRF 105
F + + + R + + +P A+ + N + + ++
Sbjct: 64 REGMTDLFASGHFERVFISPHRLAV-RYSLENP-HAYADSNLTGFLNILEGCRHNKIQHL 121

Query: 106 VFVSSIGVNGNSTRQQAFNEDSPAG-PHAPYAISKYEAEQELGTLLRGKGMELVVVRPPL 164
++ SS V G R+ F+ D P + YA +K E T G+ +R
Sbjct: 122 LYASSSSVYG-LNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFT 180

Query: 165 IYANDAPGNFGR-------LLKLVASGLPLPLDG------------------VRNARSLV 199
+Y G +GR K + G + + +R +
Sbjct: 181 VY-----GPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIP 235

Query: 200 SRRNIVGFLSLCAEHPDAAGELFLVADGEDVSIAQMIEALSRGMGRRPALFTFPAVLLKL 259
+ A ++ + + V + I+AL +G K
Sbjct: 236 HADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIE----------AKK 285

Query: 260 VMCLLGKASMHEQLCGSLQVDASKARRLLGWVPVETIGAGLQ 301
M L + E D ++G+ P T+ G++
Sbjct: 286 NMLPLQPGDVLETSA-----DTKALYEVIGFTPETTVKDGVK 322


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3145RTXTOXIND290.031 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 29.0 bits (65), Expect = 0.031
Identities = 4/24 (16%), Positives = 8/24 (33%)

Query: 44 PTPRGGGVAIVLVFLAALVWMLSA 67
PR I+ + A + +
Sbjct: 55 RRPRLVAYFIMGFLVIAFILSVLG 78


27PA3085PA3074Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA3085211-1.077669hypothetical protein
PA3084012-1.569609hypothetical protein
PA3083012-1.306313aminopeptidase
PA3082112-1.149518glycine betaine transmethylase
PA3081011-0.185050hypothetical protein
PA30801100.589430hypothetical protein
PA30791111.097761hypothetical protein
PA30782132.954193two-component sensor
PA30773133.713310two-component response regulator
PA30763113.767853hypothetical protein
PA30752113.287898hypothetical protein
PA30742112.810831hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3079ACRIFLAVINRP711e-14 Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 71.4 bits (175), Expect = 1e-14
Identities = 36/175 (20%), Positives = 77/175 (44%), Gaps = 11/175 (6%)

Query: 613 IEAATNEVIKQSELII-LVLVYICVAAMCMITFRSFAATLCIVLPLILTSVLGNALMAAL 671
++ + +EV+K L ++LV++ + + ++ ATL + + + + A++AA
Sbjct: 333 VQLSIHEVVK--TLFEAIMLVFLVMY----LFLQNMRATLIPTIAVPVVLLGTFAILAAF 386

Query: 672 GIGVKVATLPVIALGVGIGVDYGIYIYTRLESFLRM-GLPLQEAYYETLRSTGKAVLFTG 730
G + T+ + L +G+ VD I + +E + LP +EA +++ A++
Sbjct: 387 GYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIA 446

Query: 731 LCLAIGVATWIF---SAIKFQADMGLMLTFMLLWNMFGALWLLPALARFLINPAK 782
+ L+ F S + + + ++ AL L PAL L+ P
Sbjct: 447 MVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVS 501



Score = 42.9 bits (101), Expect = 6e-06
Identities = 35/221 (15%), Positives = 80/221 (36%), Gaps = 15/221 (6%)

Query: 251 LITLVLLYWFTKCIRSTIAVLITTLVAVLWQLGLLNLVGFGLDPYSMLVPFLIFAIGISH 310
++ +++Y F + +R+T+ I V +L +L G+ ++ +M L + +
Sbjct: 349 MLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDD 408

Query: 311 GVQKINGIA-LQSSGADNALMAARLTFRQLFLPGMIAILADAVGFITLLVID--IGVI-R 366
+ + + + A + Q+ + + + FI + G I R
Sbjct: 409 AIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYR 468

Query: 367 ELAIGASIGVAVIVFTNLILLPVAISYI--GISKKAVQRSKDDAVREHPFWRLLSNFSSP 424
+ +I +A+ V LIL P + + +S + + + + N +
Sbjct: 469 QFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTN 528

Query: 425 KVAPV------SIAIALLMLGGGLWYGKHLKIG---DLDQG 456
V + + I L++ G + L + DQG
Sbjct: 529 SVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQG 569



Score = 33.3 bits (76), Expect = 0.005
Identities = 23/113 (20%), Positives = 46/113 (40%), Gaps = 5/113 (4%)

Query: 626 LIILVLVYICVAAMCMITFRSFAATLCIVLPLILTSVLGNALMAALGIGVKVATLPVIAL 685
I V+V++C+AA+ + S++ + ++L + L V V + +
Sbjct: 877 AISFVVVFLCLAAL----YESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLT 932

Query: 686 GVGIGVDYGIYIYTRLESFLRM-GLPLQEAYYETLRSTGKAVLFTGLCLAIGV 737
+G+ I I + + G + EA +R + +L T L +GV
Sbjct: 933 TIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGV 985


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3078PF07675320.005 Cleaved Adhesin
		>PF07675#Cleaved Adhesin

Length = 1358

Score = 32.4 bits (73), Expect = 0.005
Identities = 23/88 (26%), Positives = 36/88 (40%), Gaps = 6/88 (6%)

Query: 64 PAPDSYYFKGSVGTAGLPPKLREMLDTPPYKSIGAMQLLGNWDDDDEEEDDDAPSDDAYV 123
PA + G G P + + K M+ G D D E +DD+P+ Y
Sbjct: 480 PASGKMWIAGDGGNQ--PARYDDFAFEAGKKYTFTMRRAGMGDGTDMEVEDDSPASYTYT 537

Query: 124 VVR--QPLADGKTLYLYDND--AAGSID 147
V R + +G T ++ D AAG+ +
Sbjct: 538 VYRDGTKIKEGLTATTFEEDGVAAGNHE 565


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3077HTHFIS845e-21 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 84.1 bits (208), Expect = 5e-21
Identities = 30/129 (23%), Positives = 59/129 (45%)

Query: 3 IHVLVVEDNFDLAGTVIDYLEAAGVVCDHARDGQAGLNLARANRYDVILLDIMLPRINGR 62
+LV +D+ + + L AG + A D+++ D+++P N
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 63 QVCRQLREAGLQTPVLMLTALDTLQDKLDGFDAGADDYLLKPFELPELLVRLQALSRRRS 122
+ ++++A PVL+++A +T + + GA DYL KPF+L EL+ +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 123 GQAQRLQVD 131
+ +L+ D
Sbjct: 124 RRPSKLEDD 132


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3074TYPE4SSCAGX372e-04 Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein

signature.
Length = 522

Score = 37.1 bits (85), Expect = 2e-04
Identities = 39/158 (24%), Positives = 73/158 (46%), Gaps = 13/158 (8%)

Query: 340 LMLSLPQPAMAFQFEDLWLRPDQQGQRLLQRGQADEAAKRFEDFRWKGLSLYQARDYAAA 399
L++ P P + + L +++ + Q+ Q D+ KR E+ R K + +
Sbjct: 131 LIVDAPDPK-ELEEQKKALEKEKEAKEQAQKAQKDKREKRKEE-RAKNRA-----NLENL 183

Query: 400 AQAFAQGDQADDHYNRGNALARQGELEAAVDAYEQALERQPQLVAAQRNK-ALVEELLRQ 458
A + ++ N + +Q E E +D E+ + Q Q AQ N +EEL ++
Sbjct: 184 TNAMSNPQNLSNNKNLSELIKQQRENE--LDQMERLEDMQEQ---AQANALKQIEELNKK 238

Query: 459 RQEQAAQQQAGENKEQRQEASQQSPPSGSSQRPPRDAA 496
+ E+A +Q+A + + + SQ+SP S + P D+A
Sbjct: 239 QAEEAVRQRAKDKISIKTDKSQKSPEDNSIELSPSDSA 276


28PA3023PA2976Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA302328-0.082497lipid kinase
PA302218-0.602435hypothetical protein
PA302129-0.713178hypothetical protein
PA302028-1.002563lytic transglycosylase
PA301918-2.256443ABC transporter ATP-binding protein
PA301809-2.821107hypothetical protein
PA3017011-2.956076hypothetical protein
PA3016110-3.121276hypothetical protein
PA3015110-3.510979hypothetical protein
PA3014211-3.690588fatty acid oxidation complex subunit alpha
PA3013012-3.1386213-ketoacyl-CoA thiolase
PA3012211-3.094460hypothetical protein
PA301129-2.755126DNA topoisomerase I
PA3010416-1.445401hypothetical protein
PA3009416-0.753972hypothetical protein
PA3008315-0.273174cell division inhibitor SulA
PA300718-0.096350LexA repressor
PA3006-18-0.539621transcriptional regulator PsrA
PA3005-18-1.023264beta-hexosaminidase
PA3004-18-1.338258S-methyl-5'-thioinosine phosphorylase
PA3003-18-1.592684hypothetical protein
PA3002-18-1.647786transcription-repair coupling factor
PA3001210-2.868904glyceraldehyde-3-phosphate dehydrogenase
PA3000210-2.877061aromatic amino acid transporter AroP
PA299919-2.525278Na(+)-translocating NADH-quinone reductase
PA2998012-2.331820Na(+)-translocating NADH-quinone reductase
PA2997014-2.510535Na(+)-translocating NADH-quinone reductase
PA2996113-3.240939Na(+)-translocating NADH-quinone reductase
PA2995112-2.428325Na(+)-translocating NADH-quinone reductase
PA2994110-2.085462Na(+)-translocating NADH-quinone reductase
PA2993111-2.096349hypothetical protein
PA2992010-2.148497hypothetical protein
PA2991111-2.117885soluble pyridine nucleotide transhydrogenase
PA2990212-1.647227phosphodiesterase
PA29892150.117335hypothetical protein
PA29883140.053740hypothetical protein
PA29873160.507049lipoprotein-releasing system ABC transporter
PA29863170.667727hypothetical protein
PA29853181.342938hypothetical protein
PA29842171.700224hypothetical protein
PA2983213-0.345817translocation protein TolQ
PA2982017-0.102004hypothetical protein
PA29814170.343197tetraacyldisaccharide 4'-kinase
PA2980317-0.123538hypothetical protein
PA2979417-0.1214583-deoxy-manno-octulosonate cytidylyltransferase
PA2978316-0.735915phosphotyrosine protein phosphatase
PA2977314-0.651076UDP-N-acetylenolpyruvoylglucosamine reductase
PA2976415-1.095850ribonuclease E
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3019RTXTOXIND330.003 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 33.3 bits (76), Expect = 0.003
Identities = 16/62 (25%), Positives = 26/62 (41%), Gaps = 2/62 (3%)

Query: 575 KLQRELEALPGQIDAVEAELAGVQETIAQ--QDFYLRPQDEQRETLARLDALQQELDALL 632
+ EL Q++ +E+E+ +E Q F D+ R+T + L EL
Sbjct: 263 EAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNE 322

Query: 633 ER 634
ER
Sbjct: 323 ER 324


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3017SHAPEPROTEIN270.019 Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein

signature.
Length = 347

Score = 27.4 bits (61), Expect = 0.019
Identities = 12/39 (30%), Positives = 18/39 (46%), Gaps = 2/39 (5%)

Query: 17 DPVMKRAAALATSNQARLSVVHVV-EPMAMAFGGDVPMD 54
V +RA + A V ++ EPMA A G +P+
Sbjct: 119 TQVERRAIRESAQG-AGAREVFLIEEPMAAAIGAGLPVS 156


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3006HTHTETR692e-16 TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 68.9 bits (168), Expect = 2e-16
Identities = 25/93 (26%), Positives = 40/93 (43%), Gaps = 1/93 (1%)

Query: 4 SETVERILDAAEQLFAEKGFAETSLRLITSKAGVNLAAVNYHFGSKKALIQAVFSRFLGP 63
ET + ILD A +LF+++G + TSL I AGV A+ +HF K L ++
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 64 FCASLEKELDRRQAKPEAQ-HATLEDLLHLLVS 95
+ + P + L +L V+
Sbjct: 70 IGELELEYQAKFPGDPLSVLREILIHVLESTVT 102


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2981ENTSNTHTASED290.022 Enterobactin synthetase component D signature.
		>ENTSNTHTASED#Enterobactin synthetase component D signature.

Length = 234

Score = 28.8 bits (64), Expect = 0.022
Identities = 29/128 (22%), Positives = 45/128 (35%), Gaps = 22/128 (17%)

Query: 15 HPALALLRPLEALYRRVANGRRADFLSGRKPAYRAPLPVLVVGNITVGGTGKTPM----I 70
L P R R+A+ L+GR A A L + V + G + P+ +
Sbjct: 26 REHDLLWLPHHDRLRSAGRKRKAEHLAGRIAAVHA-LREVGVRTVPGMGDKRQPLWPDGL 84

Query: 71 LWMIEHCRARGLRVGVISRGYGARPPTTPWRVRAEQDAAEAGDEPLMIVRRSGVPLMIDP 130
I HC L V + + G + E+ ++ L P +ID
Sbjct: 85 FGSISHCATTALAV-ISRQRIG---------IDIEKIMSQHTATEL-------APSIIDS 127

Query: 131 DRPRALQA 138
D + LQA
Sbjct: 128 DERQILQA 135


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2976IGASERPTASE569e-10 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 55.8 bits (134), Expect = 9e-10
Identities = 51/349 (14%), Positives = 103/349 (29%), Gaps = 36/349 (10%)

Query: 508 EAQPVSSTRTLVRQEAAVKTVAPQQPAPQHTEAPVEPAKPMPEPSLFQGLVKSLVGLFAG 567
+ +++ + +V + + EAPV P P PS V A
Sbjct: 993 DTTNITTPNNIQADVPSVPSNNEEIARVD--EAPVPPPAP-ATPSETTETV-------AE 1042

Query: 568 KDQPAAKPAETSKPAAERQTRQDERRNGRQQNRRRDGRDGNRRDEERKPREERAERQPRE 627
+ +K E ++ A T Q+ ++ + N + +E + +E
Sbjct: 1043 NSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKE 1102

Query: 628 ERAERPNREERSERRREERAERPAREERQPREGREERAERTPREERQPREGREGREERSE 687
+ + E + + + + P++ + E + R+ +E +S+
Sbjct: 1103 TATVEKEEKAKVETEKTQEVPKVTSQV-SPKQEQSETVQPQAEPARENDPTVNIKEPQSQ 1161

Query: 688 RRREERAERPAREERQPREGREERAERPAREERQPREDRQARDAAALEAEALPNDESLEQ 747
E+PA+E E + +E
Sbjct: 1162 TNTTADTEQPAKETS--------SNVEQPVTESTTVN---------------TGNSVVEN 1198

Query: 748 DEQDDTDGERPRRRSRGQRRRSNRRERQ-REVSGELEGSEATDNAAAPLNTVAAAAAAGI 806
E +P S + NR R R V +E + + N + + +
Sbjct: 1199 PENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTN 1258

Query: 807 AVASEAVEANVEQAPATTSEAASETTASDETDASTSEAVETQGADSEAN 855
AV S+A A + +A S+ + E + V N
Sbjct: 1259 AVLSDAR-AKAQFVALNVGKAVSQHISQLEMNNEGQYNVWVSNTSMNKN 1306



Score = 55.5 bits (133), Expect = 1e-09
Identities = 52/310 (16%), Positives = 104/310 (33%), Gaps = 29/310 (9%)

Query: 760 RRSRGQRRRSNRRERQREVSGELEGSEATDNA-----AAPLNTVAAAAAAGIAVA--SEA 812
R G+ N +R + + +N + P N A V + A
Sbjct: 972 RNVNGRYDLYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPA 1031

Query: 813 VEANVEQAPATTSEAASETTASDETDASTSEAVETQGA-----DSEANT---------GE 858
+ + A S+ S+T +E DA+ + A + A + +ANT E
Sbjct: 1032 TPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSE 1091

Query: 859 TADIEAPVTVSVVRDEADQSTLLVAQATEEAPFASESVESREDAESAVQPATEAAEEVAA 918
T + + T E ++ + + T+E P + V +++ VQP E A E
Sbjct: 1092 TKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDP 1151

Query: 919 PVPVEVAAPSE--PAATEEPTPAIAA-----VPANATGRALNDPREKRRLQREAERLARE 971
V ++ A TE+P ++ V + T N E A
Sbjct: 1152 TVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTV 1211

Query: 972 AAAAAE-AAAQAAPAVEEIPAVASEEASAQEEPAAPQAEEITQADVPSQADEAQEAVQAE 1030
+ ++ + +V +P ++ + + ++T + + +A+ Q
Sbjct: 1212 NSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFV 1271

Query: 1031 PEASGEGAAD 1040
G+ +
Sbjct: 1272 ALNVGKAVSQ 1281



Score = 52.8 bits (126), Expect = 7e-09
Identities = 32/178 (17%), Positives = 57/178 (32%), Gaps = 15/178 (8%)

Query: 893 SESVESREDAESAVQPATEAAEEVAAP--VPVEVAAPSEPAATEEPTPAIAAVPANATGR 950
+ ++ + + ++ V EE+A PV AP+ P+ T E + + +
Sbjct: 994 TTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEK 1053

Query: 951 ALNDPREKRRLQRE----------AERLAREAAAAAEAAAQAAPAVEEIPAVASEEASAQ 1000
D E RE A E A + + + A +E A+
Sbjct: 1054 NEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAK 1113

Query: 1001 EEPAAPQAEEITQADV-PSQADEAQEAVQAEPEASGEGA--ADTEHAKKTEESETSRP 1055
E Q + V P Q QAEP + ++ ++T +P
Sbjct: 1114 VETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQP 1171



Score = 48.9 bits (116), Expect = 1e-07
Identities = 44/226 (19%), Positives = 77/226 (34%), Gaps = 20/226 (8%)

Query: 835 DETDASTSEAVETQGADSEANTGETADI-EAPVTVSVVRDEADQSTLLVAQATEEAPFAS 893
D T+ +T ++ +N E A + EAPV ++ + E + S
Sbjct: 993 DTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTET----VAENSKQES 1048

Query: 894 ESVESRE-DAESAVQPATEAAEEVAAPVPV-----EVAAPSEPAATEEPTPAIAAVPANA 947
++VE E DA E A+E + V EVA + T
Sbjct: 1049 KTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEK 1108

Query: 948 TGRALNDPREKRRLQREAERLAREAAAAAEAAAQAAPAVEEIPAVASEEASAQEEPAAPQ 1007
+A + + + + + +++ + + QA PA E P V +E PQ
Sbjct: 1109 EEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKE---------PQ 1159

Query: 1008 AEEITQADVPSQADEAQEAVQAEPEASGEGAADTEHAKKTEESETS 1053
++ T AD A E V+ S + E + +
Sbjct: 1160 SQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPA 1205



Score = 48.9 bits (116), Expect = 1e-07
Identities = 44/329 (13%), Positives = 92/329 (27%), Gaps = 33/329 (10%)

Query: 661 REERAERTPREERQPREGREGREERSERRREERAERPAREERQPREGREERAERPAREER 720
E+ +T + S E R P E
Sbjct: 984 EVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAEN 1043

Query: 721 QPREDRQARDAAALEAEALPNDESLEQDEQDDTDGERPRRRSRGQRRRSNRRERQREVSG 780
+E + E ++ E ++ ++ + + + + S +E Q +
Sbjct: 1044 SKQESKTVEKNEQDATETTA--QNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETK 1101

Query: 781 ELEGSEATDNAAAPLNTVAAAAAAGIAVASEAVEANVEQAPATTSEAASETTASDETDAS 840
E E + A + + +Q + T + +E ++ +
Sbjct: 1102 ETATVEKEEKAKVETEKTQEVP-------KVTSQVSPKQEQSETVQPQAEPARENDPTVN 1154

Query: 841 TSEAVETQGADSEANTGETADIEAPVTVS---VVRDEADQSTLLVAQATEEAPFASES-- 895
E ++ T TAD E P + V + + +T+ + E P +
Sbjct: 1155 IKEP--------QSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPAT 1206

Query: 896 VESREDAESAVQPATEAAEEVAAPVPVEVAAPSEPAATEEPTPAIAAVPANATGRALNDP 955
+ ++ES+ +P V VP V + + N T L+D
Sbjct: 1207 TQPTVNSESSNKPKNRHRRSV-RSVPHNVEPATTSSNDRSTVALCDLTSTN-TNAVLSDA 1264

Query: 956 REKRR---------LQREAERLAREAAAA 975
R K + + + +L
Sbjct: 1265 RAKAQFVALNVGKAVSQHISQLEMNNEGQ 1293



Score = 44.3 bits (104), Expect = 3e-06
Identities = 47/237 (19%), Positives = 74/237 (31%), Gaps = 25/237 (10%)

Query: 427 EALKDRTAEVRARVPFQVAAFLLNEKRNAITKIELRTRARIFILPDDHLETPHFEVQRLR 486
A T E A Q + + +++A + T EV +
Sbjct: 1030 PATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSG 1089

Query: 487 DDSPELVAGQTSYEMATVEHEEAQPVSSTRTLVRQEAAVKT--VAPQQPAPQHTEAPVEP 544
++ E +T E ATVE EE V + +T QE T V+P+Q + + EP
Sbjct: 1090 SETKETQTTETK-ETATVEKEEKAKVETEKT---QEVPKVTSQVSPKQEQSETVQPQAEP 1145

Query: 545 AKPMPEPSLFQGLVKSLVGLFAGKDQPAAKPAETSKPAAERQTRQDERRNGRQQNRRRDG 604
A +P+ K+ ++T+ A Q ++ N Q
Sbjct: 1146 A-RENDPT------------VNIKE----PQSQTNTTADTEQPAKETSSNVEQPVTESTT 1188

Query: 605 RDGNRRDEERKPREERAERQP--REERAERPNREERSERRREERAERPAREERQPRE 659
+ E A QP E + +P R R PA R
Sbjct: 1189 VNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRS 1245


29PA2956PA2945Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA2956214-2.015859hypothetical protein
PA2955316-2.127774hypothetical protein
PA2954315-1.910254hypothetical protein
PA2953216-1.595835electron transfer flavoprotein-ubiquinone
PA2952215-0.528945electron transfer flavoprotein subunit beta
PA29511150.942739electron transfer flavoprotein subunit alpha
PA29500101.120462reductase
PA2949-262.832654lipase
PA2948-263.228849precorrin-4 C(11)-methyltransferase
PA2947-253.526766hypothetical protein
PA2946-163.578673hypothetical protein
PA2945-153.124770cobalamin biosynthesis protein CobW
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2952ALARACEMASE280.045 Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 27.8 bits (62), Expect = 0.045
Identities = 22/85 (25%), Positives = 39/85 (45%), Gaps = 7/85 (8%)

Query: 19 VKADNSGVDLANVKM---SMNPFCEIAVEEAVRLKEKGVATEIVAVSVGPTAAQEQLRTA 75
VKA+ G + + + + F + +EEA+ L+E+G I+ + G AQ+
Sbjct: 34 VKANAYGHGIERIWSAIGATDGFALLNLEEAITLRERGWKGPIL-MLEGFFHAQD---LE 89

Query: 76 LALGADRAILVESNDELNSLAVAKL 100
+ V SN +L +L A+L
Sbjct: 90 IYDQHRLTTCVHSNWQLKALQNARL 114


30PA2917PA2906Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA29172113.667815transcriptional regulator
PA29161132.888109hypothetical protein
PA29150132.524181hypothetical protein
PA29140133.248567ABC transporter permease
PA2913-2133.144189hypothetical protein
PA2912-1133.858164ABC transporter ATP-binding protein
PA2911-1104.190327TonB-dependent receptor
PA2910195.438474manganese efflux pump MntP
PA2909095.390370cobalt-precorrin-6x reductase
PA2908195.420751cobalt-precorrin-5B C(1)-methyltransferase
PA2907-294.829826precorrin-6y-dependent methyltransferase CobL
PA2906-2124.013616oxidoreductase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2917PF05272280.034 Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.5 bits (63), Expect = 0.034
Identities = 6/16 (37%), Positives = 8/16 (50%)

Query: 258 FRRAYGMTPAAYRRQC 273
+R AYG + RQ
Sbjct: 672 YRGAYGRYVQDHPRQV 687


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2915PF05932270.049 Tir chaperone protein (CesT)
		>PF05932#Tir chaperone protein (CesT)

Length = 127

Score = 27.1 bits (60), Expect = 0.049
Identities = 9/53 (16%), Positives = 18/53 (33%), Gaps = 8/53 (15%)

Query: 74 ADHLSAAIFLQRELGGCLAIGARITQVQAKFSGLFNLGEAFPVDGRQFEHLFE 126
L+ A+ G L + + SGL++ ++ P + L
Sbjct: 66 QCLLAGALNPLLNAGPGLGLDEK--------SGLYHAYQSIPREKLSVPTLKR 110


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2913FERRIBNDNGPP383e-05 Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 38.0 bits (88), Expect = 3e-05
Identities = 49/266 (18%), Positives = 96/266 (36%), Gaps = 43/266 (16%)

Query: 43 PSRAVSHDINLTEMMVALGLQTRMVGYTGISGW--WKNADPGLIAALKPLPELV-----A 95
P+R V+ + E+++ALG+ G + W + +P PLP+ V
Sbjct: 35 PNRIVALEWLPVELLLALGIVP--YGVADTINYRLWVS-EP-------PLPDSVIDVGLR 84

Query: 96 RYPTAETLLDVDADFFFAGWGYGMRVGGDLTPASLEPLG-VKVYELSESCAQIGEPRRAS 154
P E L ++ F GYG +P L + + + S+ + R++
Sbjct: 85 TEPNLELLTEMKPSFMVWSAGYGP------SPEMLARIAPGRGFNFSDGKQPLAMARKS- 137

Query: 155 LDELYRDLRNLGRIFDVEPRAERLVASLQARIERARAGIPANAEAPRVF--LYDSGEDRP 212
L + + +++ AE +A + I + P + L D
Sbjct: 138 -------LTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLV 190

Query: 213 FTSGRLGMPQALIEAAGGRSVTDDVAASW--TQVNWESVVA-RDPQVIVIVDYGETSAAQ 269
F + Q +++ G + W T V+ + + A +D V+ D+ +
Sbjct: 191 FGPN--SLFQEILDEYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCF-DHDNSKDMD 247

Query: 270 KQRFLEENPALRSLTAIRERRFIVLP 295
L P +++ +R RF +P
Sbjct: 248 A---LMATPLWQAMPFVRAGRFQRVP 270


31PA2886PA2873Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA28862111.469588hypothetical protein
PA28853120.443743atu genes repressor
PA28841111.136457hypothetical protein
PA28831111.277783hypothetical protein
PA28820101.542783two-component sensor
PA28810101.794705two-component response regulator
PA2880-181.703981hypothetical protein
PA28790102.832327transcriptional regulator
PA28781122.569553hypothetical protein
PA28772143.147071transcriptional regulator
PA28762143.187589orotidine 5'-phosphate decarboxylase
PA28752122.941115hypothetical protein
PA28742122.739739hypothetical protein
PA28732122.159674protein-glutamine gamma-glutamyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2885HTHTETR703e-17 TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 70.0 bits (171), Expect = 3e-17
Identities = 32/190 (16%), Positives = 65/190 (34%), Gaps = 8/190 (4%)

Query: 14 ESARGKLLQTAAHLFRSKGYERTTVRDLASAVGIQSGSIFHHFKSKDEILRSVMEETILY 73
+ R +L A LF +G T++ ++A A G+ G+I+ HFK K ++ + E +
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 74 NTALMRAALAD-AEDLRERVLGLIRCELQSIMGGTGEAMAVLVYEWRSLSAEGQAYILGL 132
L A D + ++ L+S + + + + + A +
Sbjct: 70 IGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQA 129

Query: 133 RDIYEQMWLD----VLGEARLAGYCQG--DPFILRRFLTGALSWT-TTWFRPEGPMSLDQ 185
+ D L A + G +S W L +
Sbjct: 130 QRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKK 189

Query: 186 LAEEALALVI 195
A + +A+++
Sbjct: 190 EARDYVAILL 199


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2882PF06580452e-07 Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 45.2 bits (107), Expect = 2e-07
Identities = 35/172 (20%), Positives = 72/172 (41%), Gaps = 24/172 (13%)

Query: 198 QIGELVSGLKDFAR--LDRAFSEEVDLND---CVRNAVLIARTAIKDKAEISSQLGELPL 252
+ E+++ L + R L + + +V L D V + + +A +D+ + +Q+ +
Sbjct: 192 KAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIM 251

Query: 253 IACAPSQINQVLL-NLLTNAAQAMERFGRILLKSWADERQVFLSVQDNGKGMPAEVLGRI 311
P + Q L+ N + + + + G+ILLK D V L V++ G
Sbjct: 252 DVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLA-------- 303

Query: 312 FDPFFTTKPVGQGTGLGLSISYKIIQQHGG---TIRVASEPGRGTRFLISLP 360
K + TG GL + +Q G I+++ + G+ ++ +P
Sbjct: 304 ------LKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVN-AMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2881HTHFIS985e-25 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 97.6 bits (243), Expect = 5e-25
Identities = 29/136 (21%), Positives = 57/136 (41%), Gaps = 2/136 (1%)

Query: 7 RILFVDDEERILRSLAMQF-RRHYEVLTESDPRRALERLKTERIQVLVSDQRMPQMSGAE 65
IL DD+ I L R Y+V S+ + ++V+D MP + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 66 LLAQARERYPETLRILLTGYSDLDAAVDALNDGGIFRYLTKPWNPQEMAFTLRQAAEIAS 125
LL + ++ P+ ++++ + A+ A + G + YL KP++ E+ + +A
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKAS-EKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 126 RQGLPAPPAATLAAPL 141
R+ + PL
Sbjct: 124 RRPSKLEDDSQDGMPL 139



Score = 54.8 bits (132), Expect = 1e-10
Identities = 27/139 (19%), Positives = 55/139 (39%), Gaps = 5/139 (3%)

Query: 142 SVLLLDDDPETLDCVGAFCHAGGHRLLRARNLAEALVWLNTEPVEVLVSDLKLAGEHTAP 201
++L+ DDD + G+ + N A W+ +++V+D+ + E+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 202 LLKSLAQAHPRLLSLVVTPFRDTQALLELINQAQIFRYLPKPIRRGLFEKGLKAAAEQAL 261
LL + +A P L LV++ ++ + + YLPKP L +AL
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKAS-EKGAYDYLPKP----FDLTELIGIIGRAL 119

Query: 262 LWRGRSLPEVDRLAEVPRD 280
R +++ ++
Sbjct: 120 AEPKRRPSKLEDDSQDGMP 138


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2875HTHFIS320.003 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 31.7 bits (72), Expect = 0.003
Identities = 12/43 (27%), Positives = 21/43 (48%)

Query: 103 DEINRATPKSQSALLEAMEEGQVTIEGATRPLPEPFFVIATQN 145
DEI +Q+ LL +++G+ T G P+ ++A N
Sbjct: 238 DEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATN 280


32PA2776PA2756Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA2776220-2.370248hypothetical protein
PA2775126-1.983912*hypothetical protein
PA2774021-1.206156hypothetical protein
PA2773017-1.214296hypothetical protein
PA2772a-116-0.305515hypothetical protein
PA2772-217-0.120270hypothetical protein
PA2771-1140.616621hypothetical protein
PA27700130.941302isomerase
PA27692130.624584hypothetical protein
PA27682150.217990hypothetical protein
PA2767114-0.009588enoyl-CoA hydratase
PA2766-214-0.625894transcriptional regulator
PA2765-117-1.831040hypothetical protein
PA2764114-3.678794hypothetical protein
PA2763a315-3.756975hypothetical protein
PA2763012-3.066470hypothetical protein
PA2762011-2.902888hypothetical protein
PA2761013-2.739465hypothetical protein
PA2760012-2.733846hypothetical protein
PA2759117-1.782105hypothetical protein
PA2758217-2.101300transcriptional regulator
PA2757217-2.290803hypothetical protein
PA2756218-1.665472hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2766HTHTETR453e-08 TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 45.4 bits (107), Expect = 3e-08
Identities = 25/73 (34%), Positives = 36/73 (49%)

Query: 1 MRYKPEQKQATRALLLSKAAPLVKRQGFASTGLDTLMKAAGLTTGAFYSQFSSKAELLEA 60
R ++ Q TR +L A L +QG +ST L + KAAG+T GA Y F K++L
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 61 ILQRELSAQHDAF 73
I + S +
Sbjct: 62 IWELSESNIGELE 74


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2765PF06057280.045 Type IV secretory pathway VirJ component
		>PF06057#Type IV secretory pathway VirJ component

Length = 243

Score = 27.9 bits (62), Expect = 0.045
Identities = 14/48 (29%), Positives = 22/48 (45%), Gaps = 8/48 (16%)

Query: 18 IPVVG----RYLFFSISQPEQVAATLASLAA---ATDGYQLVVGIGHS 58
PVVG +Y ++ P+ V ++ A G Q V+ IG+S
Sbjct: 79 WPVVGWSSLKY-YWKQKDPKDVTQDTLAIIDKYQAEFGTQKVILIGYS 125


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2762MICOLLPTASE280.012 Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 27.8 bits (61), Expect = 0.012
Identities = 12/24 (50%), Positives = 15/24 (62%)

Query: 33 NLNAYASADGSKLMGTWICTPGKW 56
N YA+ADG+KL T PGK+
Sbjct: 1059 NYVDYANADGNKLSNTCKLNPGKY 1082


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2758TCRTETOQM290.028 Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 29.1 bits (65), Expect = 0.028
Identities = 12/35 (34%), Positives = 16/35 (45%)

Query: 104 DSGRLFGALRTLSERYPLLDVEVLSAAQDDALALL 138
L AL +S+ PLL V SA + L+ L
Sbjct: 357 QREMLLDALLEISDSDPLLRYYVDSATHEIILSFL 391


33PA2746aPA2725Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA2746a-113-3.287416hypothetical protein
PA2746015-3.926650hypothetical protein
PA2745015-4.278732hydrolase
PA2744013-3.898623threonine--tRNA ligase
PA2743117-4.153378translation initiation factor IF-3
PA2742115-3.26677950S ribosomal protein L35
PA2741117-3.53074750S ribosomal protein L20
PA2740116-4.952103phenylalanine--tRNA ligase subunit alpha
PA2739121-5.879494phenylalanine--tRNA ligase subunit beta
PA2738229-8.520437integration host factor subunit alpha
PA2737025-8.280743hypothetical protein
PA2736030-9.123943*hypothetical protein
PA2735026-7.294389restriction-modification system protein
PA2734025-6.190756hypothetical protein
PA2733017-4.047437hypothetical protein
PA2732016-3.054851hypothetical protein
PA27303190.042348hypothetical protein
PA27293222.854165hypothetical protein
PA27260142.049551radical activating enzyme
PA27252121.575056chaperone
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2738DNABINDINGHU1131e-36 Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 113 bits (284), Expect = 1e-36
Identities = 34/89 (38%), Positives = 54/89 (60%)

Query: 5 TKAEIAERLYEELGLNKREAKELVELFFEEIRQALEHNEQVKLSGFGNFDLRDKRQRPGR 64
K ++ ++ E L K+++ V+ F + L E+V+L GFGNF++R++ R GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 65 NPKTGEEIPITARRVVTFRPGQKLKARVE 93
NP+TGEEI I A +V F+ G+ LK V+
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAVK 91


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2729RTXTOXIND300.024 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 29.8 bits (67), Expect = 0.024
Identities = 20/122 (16%), Positives = 43/122 (35%), Gaps = 4/122 (3%)

Query: 12 REEAIATCERDLQRLDKALARWENQASRLAQLSDAERAAAHARRASLHALLEQERWLDVQ 71
+E + + + + R+EN + D + H + + HA+LEQE V+
Sbjct: 205 KELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKY-VE 263

Query: 72 LQVKIESEFLKRDLAEREERAIRQAAETRQQHRR---LQENASALLQALDARPDAASAAL 128
++ + + E E + ++ + Q + L + + A
Sbjct: 264 AVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEE 323

Query: 129 RQ 130
RQ
Sbjct: 324 RQ 325



Score = 29.0 bits (65), Expect = 0.041
Identities = 30/232 (12%), Positives = 69/232 (29%), Gaps = 35/232 (15%)

Query: 98 ETRQQHRRLQENASALLQALDARPDAASAALRQTLHTLADGALRDDAEALLAQGLAALAS 157
+ L+ +S L L+ + + + +++ +
Sbjct: 129 ALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLT 188

Query: 158 APAEERLSEAQRELAQRLKTGETPMSLEQWRARQQQDAPHEQRLARIDRHIAELQLLQGE 217
+ +E+ S Q + Q+ +D+ AE +
Sbjct: 189 SLIKEQFSTWQNQKY--------------------------QKELNLDKKRAERLTVLAR 222

Query: 218 ASAQAFLERLARAEAEQRPERRNLLLDSLVLDLAQAAREHQQQRQRLEHLQDLASEVAAL 277
+ E L+R E + + +LL + +Q+ + +E + +L + L
Sbjct: 223 INR---YENLSRVEKSRLDDFSSLLHKQAI----AKHAVLEQENKYVEAVNELRVYKSQL 275

Query: 278 GAAEHAELLQRAAACQPDSDPQQ--LAELTERCNAILTAHLQQQAALARRQA 327
E L + + L +L + + I L+ R+QA
Sbjct: 276 EQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQA 327


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2725HTHFIS320.010 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 31.7 bits (72), Expect = 0.010
Identities = 41/257 (15%), Positives = 77/257 (29%), Gaps = 39/257 (15%)

Query: 225 DQPARRALAPALLRGLGGAGVAEEALQQAAATFVENTEGLLLLDL-----NAIVQLARVE 279
D A R + L G L++ D+ NA L R++
Sbjct: 11 DDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIK 70

Query: 280 GLAMER----------IADAVRRYKVGVTE---DPWLKID-RQRIRQADEIVRRRVKGQQ 325
+ A++ + G + P+ + I +A +RR +
Sbjct: 71 KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRPSKLE 130

Query: 326 HAVTHMLDIVKR--AMTGV--GASRKGNRPRGVAFLAGPTGVGKTELAKTVTSLLFGDES 381
+ +V R AM + +R + + G +G GK +A+ +
Sbjct: 131 DDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTL-MITGESGTGKELVARALHDYGKRRNG 189

Query: 382 AYIRFDMSEFSAEHADQRLIGAPPGYVGYDVGGELTNAIREKP--FS-----VVLFDEIE 434
++ +M+ + + L G G T A F + DEI
Sbjct: 190 PFVAINMAAIPRDLIESELFGH--------EKGAFTGAQTRSTGRFEQAEGGTLFLDEIG 241

Query: 435 KAHPRILDKFLQILDDG 451
+ L++L G
Sbjct: 242 DMPMDAQTRLLRVLQQG 258


34PA2626PA2621Y        NNGenomic Island
LocusTagDNBiasCDNBias%GCBiasProduct
PA2626014-3.214368tRNA-specific 2-thiouridylase MnmA
PA2625010-3.282541hypothetical protein
PA2624010-3.962797isocitrate dehydrogenase
PA2623012-4.094723isocitrate dehydrogenase
PA2622-212-3.449019cold-shock protein CspD
PA2621-28-3.332421ATP-dependent Clp protease adapter protein Clp
35PA2586PA2570Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA2586023-3.639127response regulator GacA
PA2585024-3.661708excinuclease ABC subunit C
PA2584026-4.216371CDP-diacylglycerol--glycerol-3-phosphate
PA2583027-3.912096*sensor/response regulator hybrid protein
PA2582020-3.589200hypothetical protein
PA2581-117-3.081089*hypothetical protein
PA2580-114-2.307183hypothetical protein
PA2579-115-2.547434tryptophan 2,3-dioxygenase
PA2578013-1.991188acetyltransferase
PA2577-110-2.408895transcriptional regulator
PA2576-113-2.455654hypothetical protein
PA2575-116-3.387552hypothetical protein
PA2574018-3.412696alkane-1 monooxygenase
PA2573120-3.260701chemotaxis transducer
PA2572120-3.343611two-component response regulator
PA2571223-3.333640two-component sensor
PA2570022-3.290869*PA-I galactophilic lectin
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2586HTHFIS734e-17 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 72.6 bits (178), Expect = 4e-17
Identities = 26/116 (22%), Positives = 50/116 (43%), Gaps = 2/116 (1%)

Query: 2 IKVLVVDDHDLVRTGITRMLADIEGLQVVGQADCGEDCLKLARELKPDVVLMDVKMPGIG 61
+LV DD +RT + + L+ G V + D+V+ DV MP
Sbjct: 4 ATILVADDDAAIRTVLNQALSR-AGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 62 GLEATRKLLRSQPDIKVVVVTVCEEDPFPTRLMQAGAAGYMTKGAGLEEMVQAIRQ 117
+ ++ +++PD+ V+V++ + + GA Y+ K L E++ I +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2583HTHFIS655e-13 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 65.2 bits (159), Expect = 5e-13
Identities = 30/120 (25%), Positives = 49/120 (40%), Gaps = 9/120 (7%)

Query: 863 SILLAEDHPFNRLTLTMQLESLGHRVTSTEDGEEAFERWQGEDFDVVITDGMMPRMDGYE 922
+IL+A+D R L L G+ V T + + D D+V+TD +MP + ++
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 923 LARRIRSQEALGGRRRCLVIALTASAEKDALERCLAAGMDRVLFKP----TTLDELARAL 978
L RI+ R V+ ++A + G L KP + + RAL
Sbjct: 65 LLPRIKKA-----RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2578SACTRNSFRASE364e-05 Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 35.7 bits (82), Expect = 4e-05
Identities = 19/82 (23%), Positives = 34/82 (41%), Gaps = 4/82 (4%)

Query: 74 DDQVIGHCQLLFDRRNGVVRLARIVLAPSARGQGLGLPMLEALLAEAFA-DADIERVELN 132
++ IG ++ NG + I +A R +G+G +L A +A + + L
Sbjct: 73 ENNCIGRIKIR-SNWNGYALIEDIAVAKDYRKKGVGTALLH--KAIEWAKENHFCGLMLE 129

Query: 133 VYDWNAAARHLYRRAGFREEGL 154
D N +A H Y + F +
Sbjct: 130 TQDINISACHFYAKHHFIIGAV 151


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2572HTHFIS1066e-27 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 106 bits (265), Expect = 6e-27
Identities = 32/154 (20%), Positives = 69/154 (44%), Gaps = 5/154 (3%)

Query: 14 RFSVLLVDDEPLILSSLRRLLRNQPYDLLLAESGEQALQLLESRPVDLVVSDARMPNMDG 73
++L+ DD+ I + L + L YD+ + + + + + DLVV+D MP+ +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 74 AALLAEIHRRSPETIRILLTGHADLPTIAKAINEGRIHHYLSKPWNDDELLLTLRQSLEY 133
LL I + P+ ++++ T KA +G + YL KP++ EL+ + ++L
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKG-AYDYLPKPFDLTELIGIIGRALAE 121

Query: 134 LHSERERRRLERLTQE----QNDRLQQLNATLEK 163
+ + ++ +Q++ L +
Sbjct: 122 PKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLAR 155


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2571PF06580355e-04 Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 34.8 bits (80), Expect = 5e-04
Identities = 29/188 (15%), Positives = 63/188 (33%), Gaps = 44/188 (23%)

Query: 288 REGIGRVRKIVQDLKNFSR-VDAEDDWQWTDLHQGIESTLNIVASE-------LKYRADV 339
E + R+++ L R + + L + + + L++ +
Sbjct: 187 LEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQI 246

Query: 340 VREYGDLPEVKCLPSQINQVVMNLVMNAAQ-AMGPER--GRIVIRTGHTVEHAWIEVEDS 396
D+ +P + Q LV N + + G+I+++ +EVE++
Sbjct: 247 NPAIMDVQ----VPPMLVQT---LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENT 299

Query: 397 GQGISPEILPRIFDPFFTTKPVGKGTGLGLS-------LSYGIVQKHGGTIEVRSQPGVG 449
G K + TG GL + YG + I++ + G
Sbjct: 300 GSLA--------------LKNTKESTGTGLQNVRERLQMLYG--TEAQ--IKLSEKQGKV 341

Query: 450 SAFRIVLP 457
+A +++P
Sbjct: 342 NA-MVLIP 348


36PA2528PA2515Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA25282140.878520resistance-nodulation-cell division (RND) efflux
PA25272141.300406resistance-nodulation-cell division (RND) efflux
PA25261121.840520resistance-nodulation-cell division (RND) efflux
PA25250121.818934hypothetical protein
PA2524-1111.712362two-component sensor
PA2523-1121.387825two-component response regulator
PA25220121.393460outer membrane protein CzcC
PA2521-1121.146264resistance-nodulation-cell division (RND)
PA25200130.982726resistance-nodulation-cell division (RND)
PA25193141.684524transcriptional regulator XylS
PA25184141.395226toluate 1,2-dioxygenase subunit alpha
PA25173141.250129toluate 1,2-dioxygenase subunit beta
PA25162131.691027toluate 1,2-dioxygenase electron transfer
PA25152121.8365801,6-dihydroxycyclohexa-2,4-diene-1-carboxylate
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2528RTXTOXIND455e-07 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 44.8 bits (106), Expect = 5e-07
Identities = 31/172 (18%), Positives = 65/172 (37%), Gaps = 16/172 (9%)

Query: 124 TYKAALAQAEGTLMQNQAQLKNAEIDLQRYKGLYAEDSIAKQTLDTQEAQVRQLQGTIRT 183
L + L Q ++++ +A+ + Q L+ + + ++RQ I
Sbjct: 263 EAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILD---------KLRQTTDNIGL 313

Query: 184 NQGQVDDARLNLTFTEVRAPISGR-LGLRQVDIGNLVTSGDTTPLVVITQVKPISVVFSL 242
++ + +RAP+S + L+ G +VT+ +T +V++ + + V +
Sbjct: 314 LTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETL-MVIVPEDDTLEVTALV 372

Query: 243 PQQQIGTVVEQMNGPGKLTVTALDRNQDKVLAEGTLT--TLDNQIDTTTGTV 292
+ IG + + V A + L G + LD D G V
Sbjct: 373 QNKDIGFINVGQ--NAIIKVEAFPYTRYGYL-VGKVKNINLDAIEDQRLGLV 421



Score = 41.4 bits (97), Expect = 6e-06
Identities = 26/125 (20%), Positives = 49/125 (39%), Gaps = 8/125 (6%)

Query: 80 ALGTVTAF-NTVNVKPRVNGELVKVLFQEGQEVKAGDLLAVVDPRTYKAALAQAEGTLMQ 138
A G +T + +KP N + +++ +EG+ V+ GD+L + +A + + +L+Q
Sbjct: 86 ANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQ 145

Query: 139 NQAQL--KNAEIDLQRYKGLYAEDSIAKQTLDT-QEAQVRQLQGTIR----TNQGQVDDA 191
+ + L + E +V +L I+ T Q Q
Sbjct: 146 ARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQK 205

Query: 192 RLNLT 196
LNL
Sbjct: 206 ELNLD 210


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2527ACRIFLAVINRP8400.0 Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 840 bits (2171), Expect = 0.0
Identities = 301/1036 (29%), Positives = 514/1036 (49%), Gaps = 29/1036 (2%)

Query: 4 SRPFILRPVATTLLMVAILLSGLIAYRFLPISALPEVDYPTIQVVTLYPGASPEIMTSSI 63
+ FI RP+ +L + ++++G +A LP++ P + P + V YPGA + + ++
Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61

Query: 64 TAPLENQLGQIPGLNEMSSSS-SGGASVITLQFSLQSNLDVAEQEVQAAINAAQSLLPND 122
T +E + I L MSS+S S G+ ITL F ++ D+A+ +VQ + A LLP +
Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121

Query: 123 LPNQPVFSKVNPADAPILTLAVMSDG--MPLPQIQDLVDTRLAQKISQISGVGLVSISGG 180
+ Q + S + + ++ +SD I D V + + +S+++GVG V + G
Sbjct: 122 VQQQGI-SVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 QRPAVRVRANPTALAAAGLSLEDLRSTVTSNNLNGPKGSFDGPTRAS------TLDANDQ 234
Q A+R+ + L L+ D+ + + N G G ++ A +
Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 235 LRSADAYRDLII-AYKNGSPLRIRDVASVEDDAENVRLAAWANNLPAVVLNIQRQPGANV 293
++ + + + + +GS +R++DVA VE EN + A N PA L I+ GAN
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 294 IEVVDRIKALLPQLQSTLPGNLDVQVLTDRTTTIRASVKDVQFELALAVALVVMVTFLFL 353
++ IKA L +LQ P + V D T ++ S+ +V L A+ LV +V +LFL
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 354 RNVYATLIPSFAVPLSLIGTFGVMYLSGFSINNLTLMALTIATGFVVDDAIVMVENIARY 413
+N+ ATLIP+ AVP+ L+GTF ++ G+SIN LT+ + +A G +VDDAIV+VEN+ R
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 414 L-EQGDSPLEAALKGSKQIGFTIISLTFSLIAVLIPLLFMGDVAGRLFREFAITLAVAIL 472
+ E P EA K QI ++ + L AV IP+ F G G ++R+F+IT+ A+
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 473 ISGFVSLTLTPMLSAKLLRHIDEDQQ---GRFARAAGRVIDGLIAQYAKALRVVLRHQPL 529
+S V+L LTP L A LL+ + + G F D + Y ++ +L
Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 530 TLLVAIATLALTALLYLAMPKGFFPVQDTGVIQGVAEAPQSISFQAMSERQRALAEVVLK 589
LL+ +A +L+L +P F P +D GV + + P + + + + + LK
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 590 DPA--VASLSSYIGVDGSNPTLNTGRLLINLKPHSERDV---TASEVIQRLQPELDHLPG 644
+ V S+ + G S N G ++LKP ER+ +A VI R + EL +
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659

Query: 645 IKLYMQPVQDLTIEDRVARTQYQFTLQD---ADPDVLAEWVPKLVARLQELP-QLADVAS 700
+ P I + T + F L D D L + +L+ + P L V
Sbjct: 660 GFVI--PFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRP 717

Query: 701 DWQDKGLQAYLNIDRDTASRLGVKLSDIDSVLYNAFGQRLISTIFTQATQYRVVLEVAPQ 760
+ + Q L +D++ A LGV LSDI+ + A G ++ + ++ ++ +
Sbjct: 718 NGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAK 777

Query: 761 FQLGPQALEQLYVPSSDGTQVRLSSLAKVEERHTLLAINHIAQFPSATLSFNLAKGYSLG 820
F++ P+ +++LYV S++G V S+ + + PS + A G S G
Sbjct: 778 FRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSG 837

Query: 821 EAVEAIRGVEASLELPLSMQGSFRGAALAFEASLSNTLLLILASVVTMYIVLGILYESFI 880
+A+ + + + +LP + + G + S + L+ S V +++ L LYES+
Sbjct: 838 DAMALMENLAS--KLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWS 895

Query: 881 HPVTILSTLPSAGVGALLALMLAGQEIGIVAIIGIILLIGIVKKNAIMMIDFALDAERNE 940
PV+++ +P VG LLA L Q+ + ++G++ IG+ KNAI++++FA D E
Sbjct: 896 IPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKE 955

Query: 941 GKPPHEAIYQACLLRFRPILMTTMAALLGALPLMLAGGAGAELRQPLGITMVGGLLLSQV 1000
GK EA A +R RPILMT++A +LG LPL ++ GAG+ + +GI ++GG++ + +
Sbjct: 956 GKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATL 1015

Query: 1001 LTLFTTPVIYLYFDRL 1016
L +F PV ++ R
Sbjct: 1016 LAIFFVPVFFVVIRRC 1031


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2526ACRIFLAVINRP8160.0 Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 816 bits (2109), Expect = 0.0
Identities = 290/1034 (28%), Positives = 512/1034 (49%), Gaps = 31/1034 (2%)

Query: 7 FIRRPVATTLLTLALLLAGTLSFGLLPVAPLPNVDFPAIVVSASLPGASPETMASSVATP 66
FIRRP+ +L + L++AG L+ LPVA P + PA+ VSA+ PGA +T+ +V
Sbjct: 5 FIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQV 64

Query: 67 LERSLGRIAGISEMTSSS-SLGSTTVVLVFDLEKDIDGAAREVQAAINGAMSLLPSGMPN 125
+E+++ I + M+S+S S GS T+ L F D D A +VQ + A LLP +
Sbjct: 65 IEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQ 124

Query: 126 NPSYRKANPSDMPIMVLTLTSET--QSRGEMYDLASTVLAPKLSQVQGVGQVSIGGSSLP 183
S +MV S+ ++ ++ D ++ + LS++ GVG V + G+
Sbjct: 125 -QGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY- 182

Query: 184 AVRVDLNPDAMSQYGLSLDSVRTAIAAANSNGPKG------AVEKDDKHWQVDANDQLRK 237
A+R+ L+ D +++Y L+ V + N G A+ + + A + +
Sbjct: 183 AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKN 242

Query: 238 AREYEPLVIHYNADNGAAVRLGDVAKVSDSVEDVRNAGFSDDLPAVLLIVTRQPGANIIE 297
E+ + + N+D G+ VRL DVA+V E+ + PA L + GAN ++
Sbjct: 243 PEEFGKVTLRVNSD-GSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301

Query: 298 ATDAIHAQLPVLQELLGPQVKLNVMDDRSPSIRASLEEAELTLLISVALVILVVFLFLRN 357
AI A+L LQ +K+ D +P ++ S+ E TL ++ LV LV++LFL+N
Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361

Query: 358 GRATLIPSLAVPVSLIGTFAVMYLCDFSLNNLSLMALIIATGFVVDDAIVVVENIARRI- 416
RATLIP++AVPV L+GTFA++ +S+N L++ +++A G +VDDAIVVVEN+ R +
Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421

Query: 417 EEGDPPIQAAITGARQVGFTVLSMTLSLVAVFIPLLLMGGLTGRLFREFAVTLSAAILVS 476
E+ PP +A Q+ ++ + + L AVFIP+ GG TG ++R+F++T+ +A+ +S
Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481

Query: 477 LVVSLTLTPMLCARLLRPLKRPEG---ASLARRSDRFFAAFMLRYRASLGWALEHSRLMV 533
++V+L LTP LCA LL+P+ + F + Y S+G L + +
Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541

Query: 534 VIMLACIAMNLWLFVVVPKGFLPQQDSGRLRGYAVADQSISFQSLSAKMGEYRKILSSDP 593
+I +A + LF+ +P FLP++D G + + + + +
Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601

Query: 594 AVE-----NVVGFIGGGRWQSSNTGSFFVTLKPIGERDP----VEKVLTRLRERIAKVPG 644
V GF G Q+ N G FV+LKP ER+ E V+ R + + K+
Sbjct: 602 KANVESVFTVNGFSFSG--QAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659

Query: 645 AALYLNAGQDVRLGGRDSNAQYEFTLRS-DDLTLLREWAPKVEAAMRKLP-QLVDVNSDS 702
+ + G + +E ++ L + ++ + P LV V +
Sbjct: 660 GFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNG 719

Query: 703 QDKGVQTRLVIDRDRAATLGINVEMVDAVLNDSFGQRQVSTIFNPLNQYRVVMEVDQQYQ 762
+ Q +L +D+++A LG+++ ++ ++ + G V+ + ++ ++ D +++
Sbjct: 720 LEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFR 779

Query: 763 QSPEILRQVQVIGNDGQRVPLSAFSHYEPSRAPLEVNHQGQFAATTLSFNLAPGAQIGPT 822
PE + ++ V +G+ VP SAF+ + + + APG G
Sbjct: 780 MLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDA 839

Query: 823 REAIMQALEPLHIPVDVQTSFEGNAGAVQDTQNQMPWLILLALLAVYIVLGILYESYVHP 882
+ L P + + G + + + NQ P L+ ++ + V++ L LYES+ P
Sbjct: 840 MALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIP 897

Query: 883 LTILSTLPSAGVGALLALILCRSELSLIALIGIILLIGIVKKNAIMMIDFALEAERNHGL 942
++++ +P VG LLA L + + ++G++ IG+ KNAI++++FA + G
Sbjct: 898 VSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGK 957

Query: 943 SPREAILEACMMRFRPIMMTTLAALLGALPLIFGIGGDAALRRPLGITIVGGLIGSQLLT 1002
EA L A MR RPI+MT+LA +LG LPL G + + +GI ++GG++ + LL
Sbjct: 958 GVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLA 1017

Query: 1003 LYTTPVVYLYLDRL 1016
++ PV ++ + R
Sbjct: 1018 IFFVPVFFVVIRRC 1031



Score = 80.7 bits (199), Expect = 2e-17
Identities = 72/366 (19%), Positives = 135/366 (36%), Gaps = 15/366 (4%)

Query: 665 QYEFTLRSDDLTL--LREWAPK-VEAAMRKLPQLVDVNSDSQDKGVQTRLVIDRDRAATL 721
F + T + ++ V+ + +L + DV R+ +D D
Sbjct: 139 VAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY--AMRIWLDADLLNKY 196

Query: 722 GINVEMVDAVL---NDSFGQRQVSTIFNPLNQYRVVMEVDQQYQQSPEILRQVQVIGN-D 777
+ V L ND Q+ Q + Q ++PE +V + N D
Sbjct: 197 KLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNPEEFGKVTLRVNSD 256

Query: 778 GQRVPLSAFSHYEPSRAPLE--VNHQGQFAATTLSFNLAPGAQIGPTREAIMQALEPLH- 834
G V L + E G+ A L LA GA T +AI L L
Sbjct: 257 GSVVRLKDVARVELGGENYNVIARINGK-PAAGLGIKLATGANALDTAKAIKAKLAELQP 315

Query: 835 -IPVDVQ-TSFEGNAGAVQDTQNQMPWLILLALLAVYIVLGILYESYVHPLTILSTLPSA 892
P ++ VQ + +++ + A++ V++V+ + ++ L +P
Sbjct: 316 FFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVV 375

Query: 893 GVGALLALILCRSELSLIALIGIILLIGIVKKNAIMMIDFALEAERNHGLSPREAILEAC 952
+G L ++ + + G++L IG++ +AI++++ L P+EA ++
Sbjct: 376 LLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSM 435

Query: 953 MMRFRPIMMTTLAALLGALPLIFGIGGDAALRRPLGITIVGGLIGSQLLTLYTTPVVYLY 1012
++ + +P+ F G A+ R ITIV + S L+ L TP +
Sbjct: 436 SQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCAT 495

Query: 1013 LDRLRH 1018
L +
Sbjct: 496 LLKPVS 501


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2525RTXTOXIND386e-05 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 38.3 bits (89), Expect = 6e-05
Identities = 25/216 (11%), Positives = 62/216 (28%), Gaps = 30/216 (13%)

Query: 229 RADVAQARTQLKSTQAQAIDLKYQ--RAQLEHAIAVLVGLPPAQFNLPPVASVPKLPDLP 286
+ A TQ+ + + + R Q+ L LP + P ++
Sbjct: 126 KLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVL 185

Query: 287 AVVP----------SQLLERRPDIASAERKVISANAQIGVAKAAY------FPDLTLSAA 330
+ +Q ++ ++ + ++ A+I + D +
Sbjct: 186 RLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLH 245

Query: 331 GGYRSGSLSNWISTPNRFWSIGPQFAMTLFDGGLIGSQVDQAEATYDQTVATYRQTVLDG 390
+ + N++ + + I S++ A+ Y ++ +LD
Sbjct: 246 KQA--IAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDK 303

Query: 391 FREVEDYLVQLSVLDEESGVQREALESAREALRLAE 426
R+ D + L L E + +
Sbjct: 304 LRQTTDNIGLL----------TLELAKNEERQQASV 329



Score = 31.3 bits (71), Expect = 0.009
Identities = 18/150 (12%), Positives = 43/150 (28%), Gaps = 18/150 (12%)

Query: 171 ASAADLAAVRLSQQSQLAQNYLQLRVMDEQIRLLNDTVTAYERSLKVAENK-------YR 223
+ + + Q+Q Q L L + + + YE +V +++
Sbjct: 186 RLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLH 245

Query: 224 AGIVTRADVAQARTQLKSTQAQAIDLKYQRAQLEHAIAVLVGLPPAQFNLPPVASVPKLP 283
+ + V + + + K Q Q+E I A+ V
Sbjct: 246 KQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILS------AKEEYQLVTQ----- 294

Query: 284 DLPAVVPSQLLERRPDIASAERKVISANAQ 313
+ +L + +I ++ +
Sbjct: 295 LFKNEILDKLRQTTDNIGLLTLELAKNEER 324


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2524PF06580363e-04 Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 35.6 bits (82), Expect = 3e-04
Identities = 25/105 (23%), Positives = 40/105 (38%), Gaps = 24/105 (22%)

Query: 370 LVSNAVRH----TPQGGRIDVRIGERAGHTEVRVSNDGPGIPPEYLPHLFERFYRRAGRQ 425
LV N ++H PQGG+I ++ + G + V N G
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN---------------- 306

Query: 426 TGAQAGTGLGLAIV-QSIMAYHGGRAEAE-SVPQQKTHLRLLFPS 468
+ TG GL V + + +G A+ + S Q K + +L P
Sbjct: 307 --TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIPG 349


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2523HTHFIS817e-20 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 80.6 bits (199), Expect = 7e-20
Identities = 34/145 (23%), Positives = 63/145 (43%), Gaps = 8/145 (5%)

Query: 2 RILIIEDEVKTADYLHQGLTESGYIVDRANDGIDGLHMALQHPYELVILDVNLPGIDGWD 61
IL+ +D+ L+Q L+ +GY V ++ +LV+ DV +P + +D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 LLRRLRER-SSARVMMLTGHGRLTDKVRGLDLGADDFMVKPFQFPELLARVRSLLRRHDQ 120
LL R+++ V++++ ++ + GA D++ KPF EL+ + L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE--- 121

Query: 121 APMQDVLRVADLELDASRHRAFRGR 145
R + LE D+ GR
Sbjct: 122 ----PKRRPSKLEDDSQDGMPLVGR 142


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2521RTXTOXIND462e-07 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 46.0 bits (109), Expect = 2e-07
Identities = 40/212 (18%), Positives = 82/212 (38%), Gaps = 22/212 (10%)

Query: 216 ISSPQLSDQRSEFAAAQRRLSLAQSTYKREQQLWKEGISAEQEFLLARQGLQ-EAEIALN 274
I+ + +Q +++ A L + +S + +Q+ E +SA++E+ L Q + E L
Sbjct: 249 IAKHAVLEQENKYVEAVNELRVYKS---QLEQIESEILSAKEEYQLVTQLFKNEILDKLR 305

Query: 275 NARAKIAALGG--NPSLQGGNRYELRAPFAGVLVE-KHLTQGEPVDGTANVFTLS-DLSS 330
I L + + +RAP + + + K T+G V + + + +
Sbjct: 306 QTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDT 365

Query: 331 VWATFNVPAQLLGQVRVGSKVKVLAQALDS----EVEGTVSYIG-DLLGEQTRAATARVT 385
+ T V + +G + VG + +A + G V I D + +Q V
Sbjct: 366 LEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVI 425

Query: 386 LSNPEST---------WRPGLFVSVQVAEATR 408
+S E+ G+ V+ ++ R
Sbjct: 426 ISIEENCLSTGNKNIPLSSGMAVTAEIKTGMR 457



Score = 31.3 bits (71), Expect = 0.010
Identities = 19/119 (15%), Positives = 42/119 (35%), Gaps = 13/119 (10%)

Query: 168 LAQVVSLPGEIRFNEDRTAHIVPRLPGIVDSVPANLGQAVKQGELLAVISSPQLSDQRSE 227
+ V + G++ + I P IV + G++V++G++L +++ ++
Sbjct: 80 VEIVATANGKLTHSGRSKE-IKPIENSIVKEIIVKEGESVRKGDVLLKLTALGA---EAD 135

Query: 228 FAAAQRRLSLAQSTYKREQQLWKE---------GISAEQEFLLARQGLQEAEIALNNAR 277
Q L A+ R Q L + + E F + +L +
Sbjct: 136 TLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQ 194


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2520ACRIFLAVINRP8110.0 Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 811 bits (2097), Expect = 0.0
Identities = 237/1055 (22%), Positives = 435/1055 (41%), Gaps = 56/1055 (5%)

Query: 5 IIQFAIEQRWLVLLAVLGMAGVGIGSYQKLSIDAVPDITNVQVQINTAAPGYSPLEVEQR 64
+ F I + + + + G + +L + P I V ++ PG V+
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 65 ITYPVETVMAGLPGLQETRSLS-RPGISQVTVIFEEGTDIYFARQQVNERLSTAREQLPE 123
+T +E M G+ L S S G +T+ F+ GTD A+ QV +L A LP+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 124 DISPTLGPISTGLGEIYLWTVEAEEGATKEDGSAYTPTDLRTIQDWIIRPQLRNVKGVAE 183
++ + YL D T D+ ++ L + GV +
Sbjct: 121 EVQQQGISVEKSSS-SYLMVA-----GFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGD 174

Query: 184 INTIGGYAKQFLIAPDPKKLAAYKLTLGDLQNAVLRNNENVGAGYI------ERRGEQLL 237
+ G I D L YKLT D+ N + N+ + AG + +
Sbjct: 175 VQLFGA-QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNAS 233

Query: 238 IRAPGQVKDMDDIRGIIV-SNVDGVPIRIRDVAEVGLGKELRTGAATENGREVVLGTVFM 296
I A + K+ ++ + + N DG +R++DVA V LG E A NG+ + +
Sbjct: 234 IIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKL 293

Query: 297 LIGENSREVAQAVGQRLEEINRTLPKGVKAITVYDRTTLVDKAVATVKKNLVEGAALVIA 356
G N+ + A+A+ +L E+ P+G+K + YD T V ++ V K L E LV
Sbjct: 294 ATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFL 353

Query: 357 VLFLFLGNIRAALITATIIPLSMLFTFTGMVGNRVSANLMSLG--ALDFGIIVDGAVVIV 414
V++LFL N+RA LI +P+ +L TF + S N +++ L G++VD A+V+V
Sbjct: 354 VMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVV 413

Query: 415 ENAIRRLAHAQAHHGRQLTRAERFHEVFAASREARRALVFGQIIIMVVYLPIFALTGVEG 474
EN + + + + ALV +++ V++P+ G G
Sbjct: 414 EN---------VERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTG 464

Query: 475 KMFHPMAFTVVTALLGAMILSVTFVPAAIALFITGKVKEEE----------NFVMRRARL 524
++ + T+V+A+ ++++++ PA A + E N +
Sbjct: 465 AIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVN 524

Query: 525 AYEPALRWVLGHRALVVGGALGAILLTGLVASRMGSEFIPSLSEGDFAMQGLRVPGTSL- 583
Y ++ +LG + + ++ R+ S F+P +G F G +
Sbjct: 525 HYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQE 584

Query: 584 -TQSVEMQQTLERKLMGKFPEIERVFARTGTAEIASDLMPPNASDSYVMLKPQSQWPDPK 642
TQ V + Q + L + +E VF G + NA ++V LKP + +
Sbjct: 585 RTQKV-LDQVTDYYLKNEKANVESVFTVNG---FSFSGQAQNAGMAFVSLKPWEERNGDE 640

Query: 643 KSREALLEELQAAALEVP-GSVYEFSQPIQLRFNELISGVRSDVA-VKVFGDDMQVLNDT 700
S EA++ + ++ G V F+ P EL + D + G L
Sbjct: 641 NSAEAVIHRAKMELGKIRDGFVIPFNMP---AIVELGTATGFDFELIDQAGLGHDALTQA 697

Query: 701 AEKI-SKVLQGIDGASEVKVEQTTGLPVLTVDIDRDKAARFGLNVGDIQDTVATALGGRN 759
++ Q V+ +++D++KA G+++ DI T++TALGG
Sbjct: 698 RNQLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTY 757

Query: 760 AGTLFEGDRRFDIVIRLPETLRADLPALSNLLIPLPPNNLARIDFIPLSDVARLDLSPGP 819
+ R + ++ R + L + + +P S G
Sbjct: 758 VNDFIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANG-----EMVPFSAFTTSHWVYGS 812

Query: 820 NQISRENGKRRIVVSANVRGRDIGSFVLEAQQKLQDGVKIPAGYWTTWGGQFEQLQSAAK 879
++ R NG + + + + L K+PAG W G Q + +
Sbjct: 813 PRLERYNGLPSMEIQGEAAPGTSSGDAMALMENLAS--KLPAGIGYDWTGMSYQERLSGN 870

Query: 880 RLQVVVPVALLLVFTLLFAMFNNVKDGLLVFTGIPFALTGGVLALWLRGIPLSISAAVGF 939
+ +V ++ ++VF L A++ + + V +P + G +LA L + VG
Sbjct: 871 QAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGL 930

Query: 940 IALSGVAVLNGLVMISFIRNLL-QEGRSLDQAVWEGAITRLRPVLMTALVASLGFVPMAL 998
+ G++ N ++++ F ++L+ +EG+ + +A RLRP+LMT+L LG +P+A+
Sbjct: 931 LTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAI 990

Query: 999 ATGTGAEVQRPLATVVIGGILSSTMLTLLVLPVLY 1033
+ G G+ Q + V+GG++S+T+L + +PV +
Sbjct: 991 SNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFF 1025



Score = 70.6 bits (173), Expect = 2e-14
Identities = 70/527 (13%), Positives = 160/527 (30%), Gaps = 46/527 (8%)

Query: 2 FERIIQFAIEQRWLVLLAVLGMAGVGIGSYQKLSIDAVPDITNVQVQINTAAPGYSPLEV 61
+ + + LL + + + +L +P+ P + E
Sbjct: 526 YTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQER 585

Query: 62 EQRI---------TYPVETVMA--GLPGLQETRSLSRPGISQVTV-IFEEGTDIYFARQQ 109
Q++ V + + G + G++ V++ +EE + +
Sbjct: 586 TQKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEA 645

Query: 110 VNERLSTAREQLPED-ISPTLGPISTGLGEIYLWTVEAEEGATKEDGSAYTPTDLRTIQD 168
V R ++ + + P P LG D + L ++
Sbjct: 646 VIHRAKMELGKIRDGFVIPFNMPAIVELG------TATGFDFELIDQAGLGHDALTQARN 699

Query: 169 WIIRPQLRNVKGVAEINTIGGY-AKQFLIAPDPKKLAAYKLTLGDLQNAVLRNNENVGAG 227
++ ++ + + G QF + D +K A ++L D+ +
Sbjct: 700 QLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVN 759

Query: 228 YIERRGEQ--LLIRAPGQ-VKDMDDIRGIIVSNVDGVPIRIRDVAEVGLGKELRTGAAT- 283
RG L ++A + +D+ + V + +G + G+
Sbjct: 760 DFIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTS----HWVYGSPRL 815

Query: 284 --ENGREVVLGTVFMLIGENSREVAQAVGQRLEEINRTLPKGVKAITVYDRTTLVDKAVA 341
NG + G +S + + E + LP G+ + +
Sbjct: 816 ERYNGLPSMEIQGEAAPGTSSGDAMALM----ENLASKLPAGI-GYDWTGMSYQERLSGN 870

Query: 342 TVKKNLVEGAALVIAVLFLFLGNIRAALITATIIPLSMLFTFTGMVGNRVSANLMSLGAL 401
+ +V L + + ++PL ++ ++ + L
Sbjct: 871 QAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGL 930

Query: 402 --DFGIIVDGAVVIVENAIRRLAHAQAHHGRQLTRAERFHEVFAASREARRALVFGQIII 459
G+ A++IVE A G+ + A A R R ++ +
Sbjct: 931 LTTIGLSAKNAILIVEFAK----DLMEKEGKGVVEA-----TLMAVRMRLRPILMTSLAF 981

Query: 460 MVVYLPIFALTGVEGKMFHPMAFTVVTALLGAMILSVTFVPAAIALF 506
++ LP+ G + + V+ ++ A +L++ FVP +
Sbjct: 982 ILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVI 1028


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2515DHBDHDRGNASE1015e-28 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 101 bits (253), Expect = 5e-28
Identities = 75/253 (29%), Positives = 109/253 (43%), Gaps = 15/253 (5%)

Query: 8 KVALVSGAAQGIGLGVARRLLEEGARVVAVDRSELVHELAGDACLCLT-------ADLER 60
K+A ++GAAQGIG VAR L +GA + AVD + E + AD+
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 61 YAECQRVLERTLERFGRLDILVNNVGGTLWAKPYQHYAEDEIEAELRRSLLPTLWCCRAA 120
A + R G +DILVN V G L +++E EA + R+
Sbjct: 69 SAAIDEITARIEREMGPIDILVN-VAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 121 LPAMLGQGSGAIVNVSSVATRGVNRV---PYGAAKGGVNALTACLAFETADQGIRVNAVA 177
M+ + SG+IV V S GV R Y ++K T CL E A+ IR N V+
Sbjct: 128 SKYMMDRRSGSIVTVGSNPA-GVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186

Query: 178 PGGTDAPPRRIPRNAAAQSEEERRWYRQIVEQTLDSSLMKRYGSIDEQVAAILFLASDEA 237
PG T+ + + A + + +E +K+ + A+LFL S +A
Sbjct: 187 PGSTETD---MQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQA 243

Query: 238 SYITGVTLPVAGG 250
+IT L V GG
Sbjct: 244 GHITMHNLCVDGG 256


37PA2489PA2458Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA24892142.569103transcriptional regulator
PA24882131.268749transcriptional regulator
PA24871140.538169hypothetical protein
PA24860130.560433hypothetical protein
PA2485191.932561hypothetical protein
PA2484182.539537hypothetical protein
PA2483293.130046hypothetical protein
PA2482383.520779cytochrome C
PA24813103.333775hypothetical protein
PA2480293.549145two-component sensor
PA2479192.780840two-component response regulator
PA2478192.655954thiol:disulfide interchange protein DsbD
PA2477072.572398thiol:disulfide interchange protein
PA2476092.593850thiol:disulfide interchange protein DsbG
PA2475-1112.827665cytochrome P450
PA24740123.059386hypothetical protein
PA24730123.347513glutathione S-transferase
PA24721123.797488major facilitator superfamily transporter
PA24711121.933873hypothetical protein
PA24701101.723825gentisate 1,2-dioxygenase
PA24691101.584802transcriptional regulator
PA24682121.472584ECF sigma factor FoxI
PA2467115-1.041604anti-sigma factor FoxR
PA2466118-1.872682ferrioxamine receptor FoxA
PA2465221-2.153053hypothetical protein
PA2464225-3.017064hypothetical protein
PA2463224-2.964460hypothetical protein
PA2462227-3.709424hypothetical protein
PA2461742-10.254821hypothetical protein
PA2460627-5.824159hypothetical protein
PA2459626-5.290671hypothetical protein
PA2458420-0.943252hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2484HTHTETR522e-10 TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 51.9 bits (124), Expect = 2e-10
Identities = 37/201 (18%), Positives = 66/201 (32%), Gaps = 11/201 (5%)

Query: 1 MAKRGRPCGFD-REQALRRALDVFWEAGYEGVTMAALKEAMGGICAPSMYAAYGSKEALF 59
MA++ + + R+ L AL +F + G ++ + +A G+ ++Y + K LF
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAA-GVTRGAIYWHFKDKSDLF 59

Query: 60 RSAVELYLSQECQLSKGAFA------LPTARESIAALLESAAVSYTTEGKPRGCLVDLST 113
EL S +L A L RE + +LES T E + +
Sbjct: 60 SEIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLEST---VTEERRRLLMEIIFHK 116

Query: 114 TNFSPANKGVEDYLRDHRRRAARLLRERFARGVADGDVPAGADLDALTSFYSSVLQGLSI 173
F V+ R+ + + + + +PA + GL
Sbjct: 117 CEFVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLME 176

Query: 174 QARDGASRQQLLAIGRCAMAA 194
L R +A
Sbjct: 177 NWLFAPQSFDLKKEARDYVAI 197


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2479HTHFIS832e-20 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 82.6 bits (204), Expect = 2e-20
Identities = 38/122 (31%), Positives = 62/122 (50%), Gaps = 1/122 (0%)

Query: 2 HVLLTEDDDLIASGIVAGLNAQGLTVDRVASAADTQALLQVARFDVLVLDLGLPDEDGLR 61
+L+ +DD I + + L+ G V ++AA + D++V D+ +PDE+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 LLQRLRQQGVDLPVLVLTARDAVTDRVAGLQAGADDYLLKPFDLRELGARLHT-LQRRSA 120
LL R+++ DLPVLV++A++ + + GA DYL KPFDL EL + L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 121 GR 122

Sbjct: 125 RP 126


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2472TCRTETB493e-08 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 48.7 bits (116), Expect = 3e-08
Identities = 39/178 (21%), Positives = 72/178 (40%), Gaps = 3/178 (1%)

Query: 31 LCFLIVAMDGFDTAAIGFIAPALAHDWQLSPAQLSPILGAALAGLALGAFAAGPLADRFG 90
LC L + + P +A+D+ PA + + A + ++G G L+D+ G
Sbjct: 19 LCIL-SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLG 77

Query: 91 RKSVLLLSVLFFGGWSLASAYAGS-VETLALLRFFTGLGLGGAMPNAITLTSEYCPRRHR 149
K +LL ++ S+ S L + RF G G + + + Y P+ +R
Sbjct: 78 IKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENR 137

Query: 150 ALMVTAMFCGFTLGSALGGLLAARMVPALGWESVLLLGGGLPLASLPLLWACLPESVR 207
+ +G +G + + + W S LLL + + ++P L L + VR
Sbjct: 138 GKAFGLIGSIVAMGEGVGPAIGGMIAHYIHW-SYLLLIPMITIITVPFLMKLLKKEVR 194



Score = 30.6 bits (69), Expect = 0.011
Identities = 36/196 (18%), Positives = 71/196 (36%), Gaps = 13/196 (6%)

Query: 256 AELRGGTLLLWATF--FMGLLIIYLLTNWLPTLIGGTGFSLGEAATISAMFQLGGTLGAL 313
+ LR +L+W F +L +L LP + ++ F L ++G
Sbjct: 8 SNLRHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTA 67

Query: 314 LLGSAMDRFDAHRVLSLAYVGGALFILG--IASLYHSFA---LLALCVAGVGFCISGSQV 368
+ G D+ R+L G + G I + HSF ++A + G G + V
Sbjct: 68 VYGKLSDQLGIKRLLLF---GIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALV 124

Query: 369 GANALAADFYPTRSRATGVSWALGLGRIGSIVGSLSGGALLG-LGLGFSGILALLVIPAL 427
+ A + P +R + +G VG GG + + + ++ ++ I +
Sbjct: 125 M--VVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITV 182

Query: 428 LAAVAVHRLGRRRARP 443
+ + + R
Sbjct: 183 PFLMKLLKKEVRIKGH 198


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2469MPTASEINHBTR280.030 Metalloprotease inhibitor signature.
		>MPTASEINHBTR#Metalloprotease inhibitor signature.

Length = 122

Score = 27.7 bits (61), Expect = 0.030
Identities = 13/43 (30%), Positives = 20/43 (46%)

Query: 59 RGLRPTPYGMTLFNHAQRVLTEMERARQNLEAMRSGSGSRVLL 101
PTP G+ L N +T + R ++ R+ SG+ V L
Sbjct: 76 VSWSPTPDGIWLMNAEGTGITHLNRQKEGEYTGRTPSGADVTL 118


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2466TYPE3OMGPROT340.002 Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein

family signature.
Length = 607

Score = 34.1 bits (78), Expect = 0.002
Identities = 19/81 (23%), Positives = 26/81 (32%), Gaps = 10/81 (12%)

Query: 15 LDFPRASRLSRSVRAALLSLAMAAGAAPLCASAAEAAAEQARPYAIPAGQ--LGDVLNRF 72
+ FP S R + LL L+ + A L PY A L D+L F
Sbjct: 1 MAFPLHSFFKRVLTGTLLLLSSYSWAQEL--------DWLPIPYVYVAKGESLRDLLTDF 52

Query: 73 AREAGITLSATPAQTGGYSSQ 93
T+ + S Q
Sbjct: 53 GANYDATVVVSDKINDKVSGQ 73


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2462PF05860825e-20 haemagglutination activity domain.
		>PF05860#haemagglutination activity domain.

Length = 117

Score = 82.2 bits (203), Expect = 5e-20
Identities = 23/130 (17%), Positives = 47/130 (36%), Gaps = 23/130 (17%)

Query: 34 DKAAGGNTGLGQAGNGVPIVNIATPNGAGLSNNHFRDYNVGANGLILNNATGKTQGTQLG 93
D N+ + I+ T G+ L ++ F++++V +G N
Sbjct: 6 DTTLPINSNI-TTEGNTRIIERGTQAGSNLFHS-FQEFSVPTSGTAFFNNPT-------- 55

Query: 94 GIILGNPNLKGQAAQVILNQVTGGNRSTLAGYTEVAGQSARVIVANPHGITCQGCGFINT 153
Q I+++VTGG+ S + G A + + NP+GI ++
Sbjct: 56 ------------NIQNIISRVTGGSVSNIDGLIRANAT-ANLFLINPNGIIFGQNARLDI 102

Query: 154 PRATLTTGKP 163
+ + +
Sbjct: 103 GGSFVGSTAN 112


38PA2444PA2435Y        NNGenomic Island
LocusTagDNBiasCDNBias%GCBiasProduct
PA2444-273.236644serine hydroxymethyltransferase
PA2443-173.869451L-serine dehydratase
PA2442174.661091glycine cleavage system protein T2
PA2441174.462118hypothetical protein
PA24402104.865266hypothetical protein
PA24393115.068923hypothetical protein
PA24384124.337545hypothetical protein
PA24373103.593417hypothetical protein
PA24363132.814302hypothetical protein
PA24353132.903778cation-transporting P-type ATPase
39PA2366PA2361Y        NNGenomic Island
LocusTagDNBiasCDNBias%GCBiasProduct
PA2366291.210345uricase
PA23654112.013800hypothetical protein
PA23644112.124463hypothetical protein
PA23634112.339853hypothetical protein
PA23624121.896581hypothetical protein
PA23612101.844332hypothetical protein
40PA2318PA2288Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA23183101.531752hypothetical protein
PA2317391.612943oxidoreductase
PA23166122.531353transcriptional regulator
PA23156112.118932hypothetical protein
PA23145121.234355major facilitator superfamily transporter
PA2312a3110.035343hypothetical protein
PA2312-2131.273819transcriptional regulator
PA2311-2150.737157hypothetical protein
PA2310-1122.975664hypothetical protein
PA2309-1123.220125hypothetical protein
PA2308-1113.331333ABC transporter ATP-binding protein
PA2307-1104.154194ABC transporter permease
PA2306-1104.356090protein AmbA
PA2305-194.247804protein AmbB
PA2304-1103.536290protein AmbC
PA2303-193.042394protein AmbD
PA2302-293.036380protein AmbE
PA2301-2100.441915hypothetical protein
PA2300-190.436054chitinase
PA2299-1101.015929transcriptional regulator
PA2298092.162324oxidoreductase
PA22970113.178105ferredoxin
PA2296-2122.281424hypothetical protein
PA22950122.643493ABC transporter permease
PA22941112.033791ABC transporter ATP-binding protein
PA22932121.903472hypothetical protein
PA22922121.226892hypothetical protein
PA22912101.780197glucose-sensitive porin
PA2290192.116183glucose dehydrogenase
PA22890102.340711hypothetical protein
PA22881123.197810hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2314TCRTETB453e-07 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 45.3 bits (107), Expect = 3e-07
Identities = 36/170 (21%), Positives = 72/170 (42%), Gaps = 7/170 (4%)

Query: 35 FVAILSETLPAGLLPQIGAGLAVSEALAGQLVSVYALGSLLAALPAASLTQGWRRRRVLL 94
F ++L+E + LP I A + + + L + L+ +R+LL
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 95 LALLIFFVCNSFTAVS-SDYRLTLLARFGSGVAAGLAWGLLAGYARRLVPPEQQGRALAV 153
++I + V S + L ++ARF G A L+ R +P E +G+A
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAF-- 141

Query: 154 AMLGAPLALSLGVPLGTWLGGLLG--WRWAFGLLSLTALLLVGWVLRSVP 201
++G+ +++G +G +GG++ W++ LL ++ L +
Sbjct: 142 GLIGS--IVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLL 189


41PA2246PA2222Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA2246282.603927Bkd operon transcriptional regulator BkdR
PA2244183.044203hypothetical protein
PA2243283.065382FAD-binding dehydrogenase
PA2242281.959060hypothetical protein
PA2241282.884712biofilm formation protein PslL
PA2240-191.461096biofilm formation protein PslJ
PA2239-191.316005biofilm formation protein PslI
PA2238-191.092507biofilm formation protein PslH
PA2237-1110.777979biofilm formation protein PslG
PA2236-2100.116357biofilm formation protein PslF
PA2235-211-0.613174biofilm formation protein PslE
PA2234-1130.180007biofilm formation protein PslD
PA2233117-3.432927biofilm formation protein PslC
PA2232228-7.087830biofilm formation protein PslB
PA2231343-10.760740biofilm formation protein PslA
PA2230560-13.044557hypothetical protein
PA2229472-16.266925hypothetical protein
PA2228480-18.874836hypothetical protein
PA2227377-18.332002HTH-type transcriptional regulator VqsM
PA2226266-14.247019hypothetical protein
PA2225155-11.640590hypothetical protein
PA2224149-9.768611hypothetical protein
PA2223141-7.545321hypothetical protein
PA2222128-3.831327hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2236TCRTETOQM290.035 Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 29.1 bits (65), Expect = 0.035
Identities = 8/28 (28%), Positives = 15/28 (53%)

Query: 193 LYFGFIYRGKGIEDLLEALADLFASAPE 220
+Y G GI++L+E + + F S+
Sbjct: 216 VYHGSAKNNIGIDNLIEVITNKFYSSTH 243


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2235YERSSTKINASE340.002 Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 33.9 bits (77), Expect = 0.002
Identities = 31/132 (23%), Positives = 61/132 (46%), Gaps = 19/132 (14%)

Query: 293 LNPAQQDLRRLLNQKRLERADMMRTYTDDAPP--------VKALDASIRALEKEVQDEGA 344
++P D+RR+ +K E +D++RT+ A + LD + AL+K ++ G
Sbjct: 430 MSPLSTDVRRITPKKLRELSDLLRTHLSSAATKQLDMGGVLSDLDTMLVALDKAEREGG- 488

Query: 345 TVQSSEDRAPNTLTTHLERVLLDETSNNAALRTQLAEQEKQLAELEAQRREALD---IEP 401
V + ++ N+L RV+ D ++ + + + E+ R +EP
Sbjct: 489 -VDKDQLKSFNSLILKTYRVIED------YVKGREGDTKNSSTEVSPYHRSNFMLSIVEP 541

Query: 402 TLARLQRELNAT 413
+L R+Q+ L+ T
Sbjct: 542 SLQRIQKHLDQT 553


42PA2133PA2091Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA21333102.094186hypothetical protein
PA21323101.463637chaperone CupA5
PA21313101.064968fimbrial subunit CupA4
PA2130271.364387usher CupA3
PA2129172.183426chaperone CupA2
PA2128072.277121fimbrial subunit CupA1
PA2127082.388492hypothetical protein
PA2126-192.929419hypothetical protein
PA2125-193.069224aldehyde dehydrogenase
PA21240112.033785dehydrogenase
PA2123-1100.921822transcriptional regulator
PA2122091.513860hypothetical protein
PA21211101.263162transcriptional regulator
PA21203101.196028hypothetical protein
PA21191111.140801alcohol dehydrogenase
PA2118a0121.576726hypothetical protein
PA21181111.618096O6-methylguanine-DNA methyltransferase
PA21170121.518023hypothetical protein
PA2116-1112.032007hypothetical protein
PA2115-2122.806764transcriptional regulator
PA2114-1143.117617major facilitator superfamily transporter
PA2113-193.388550pyroglutatmate porin OpdO
PA2112294.066356hypothetical protein
PA21111102.268935hypothetical protein
PA21102170.287582hypothetical protein
PA2109226-2.929978hypothetical protein
PA2108230-4.383738thiamine pyrophosphate protein
PA2107244-7.442946hypothetical protein
PA2106145-8.976086hypothetical protein
PA2105144-8.683371acetyltransferase
PA2104038-6.999557cysteine synthase
PA2103030-3.923237molybdopterin biosynthesis protein MoeB
PA2102022-2.069896hypothetical protein
PA2101017-0.596330hypothetical protein
PA2100-1131.116680transcriptional regulator
PA2099093.794332short-chain dehydrogenase
PA2098083.317047esterase
PA2097083.017147flavin-binding monooxygenase
PA20963113.990916transcriptional regulator
PA20951124.160973hypothetical protein
PA2094283.379697transmembrane sensor
PA2093083.012122RNA polymerase sigma factor
PA2092093.386063major facilitator superfamily transporter
PA2091193.012682hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2130PF005777720.0 Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 772 bits (1994), Expect = 0.0
Identities = 272/881 (30%), Positives = 412/881 (46%), Gaps = 50/881 (5%)

Query: 7 RRCRTGTALMAGGMALAASAFGHAQPGYEFDDRLLLGSSLGGGDLSRFNQDGRIDPGRYH 66
R + A AA A + F+ R L DLSRF + PG Y
Sbjct: 21 HRLAGFFVRLFVACAFAAQA-PLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYR 79

Query: 67 VDVYLNERFASRSEVSFRANPASGAVEPCLDEDFLRQRLGAKPGDDPRKSGDGRHCAFLG 126
VD+YLN + + +V+F + + PCL L C L
Sbjct: 80 VDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLT 139

Query: 127 ARLPGSRFSLDVARLRLDLSVPQALLDLKPRGYVSPEEWDAGDSMGFVNYDTNLYRSEYR 186
+ + + LDV + RL+L++PQA + + RGY+ PE WD G + G +NY+ + + R
Sbjct: 140 SMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNR 199

Query: 187 GGESGRSDYAYVGLNSGINLGLWRLRHQSNYTYSRYNGQA--RRKWNSIRTYAQRALPAW 244
G G S YAY+ L SG+N+G WRLR + ++Y+ + + + KW I T+ +R +
Sbjct: 200 IG--GNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPL 257

Query: 245 RSELTAGESYTAGNLLGSIGYRGLSLATDDRMLPESLRRYAPQVRGTAATAARVVISQNG 304
RS LT G+ YT G++ I +RG LA+DD MLP+S R +AP + G A A+V I QNG
Sbjct: 258 RSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNG 317

Query: 305 RKIREVNVAPGPFVIDDLYDSAYAGDLDVQVFEADGSVSSFSVPFASVPESMRPGLSRYS 364
I V PGPF I+D+Y + +GDL V + EADGS F+VP++SVP R G +RYS
Sbjct: 318 YDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYS 377

Query: 365 FTLGQARQYGDGDD--LFADFTYQRGMSNALTANLGLRVADDYLA-MLGGGVLATRFGAF 421
T G+ R + F T G+ T G ++AD Y A G G GA
Sbjct: 378 ITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGAL 437

Query: 422 GLNSTYSSARVEDGARKQGWRIGLDYSRTFQPTGTTLTLAGYRYSTEGYRELGDVLGSRD 481
++ T +++ + D ++ G + Y+++ +GT + L GYRYST GY D SR
Sbjct: 438 SVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRM 497

Query: 482 ALRHGDTWD-------------SGSYKQRNQFNLLVSQALGGYGNLYLSGSSSDFYDGKS 528
+ +T D + +Y +R + L V+Q LG LYLSGS ++ +
Sbjct: 498 NGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSN 557

Query: 529 RDTQLQFGYSNTWGQLSYNLAWSRQTTTYYQEQGDQDPGVELLRRDRRSGQRNDTLTLSV 588
D Q Q G + + +++ L++S ++ R+ L L+V
Sbjct: 558 VDEQFQAGLNTAFEDINWTLSYSLT-------------------KNAWQKGRDQMLALNV 598

Query: 589 SMPLGSSSRAPTLSA-----MATRRSGDSRGG-SLQTGLNGTLGDERTWSYALSA---NR 639
++P R+ + S + S D G + G+ GTL ++ SY++
Sbjct: 599 NIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGG 658

Query: 640 DSEVADTTWNGTLQKQAALATVNAGYAQGDRYRQYSGGIRGALVAHRDGLTLGPSVGDTF 699
+ +T TL + N GY+ D +Q G+ G ++AH +G+TLG + DT
Sbjct: 659 GDGNSGSTGYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTV 718

Query: 700 ALVEAKGASGAAIRGGQGARIDGNGYALAPSLSPYRYNPISLDPVGIDPDAELLETERKV 759
LV+A GA A + G R D GYA+ P + YR N ++LD + + +L V
Sbjct: 719 VLVKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANV 778

Query: 760 APYAGASVRVTFRTLTGHPLLIQARREDGSVLPLGAVVVDDGGAAIGMVGQGGQVYARAE 819
P GA VR F+ G LL+ + LP GA+V + + G+V GQVY
Sbjct: 779 VPTRGAIVRAEFKARVGIKLLMTLTH-NNKPLPFGAMVTSESSQSSGIVADNGQVYLSGM 837

Query: 820 NQRGRLLVQWGTARKERCELPYDLAGVSRDQALIRLRGTCR 860
G++ V+WG C Y L S+ Q L +L CR
Sbjct: 838 PLAGKVQVKWGEEENAHCVANYQLPPESQQQLLTQLSAECR 878


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2114TCRTETB393e-05 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 39.1 bits (91), Expect = 3e-05
Identities = 37/180 (20%), Positives = 82/180 (45%), Gaps = 6/180 (3%)

Query: 29 FWSCKIGYGLDGMDTQMLSFVIPTLIALWGIGTGEAGFIHTMTLLASAAGGWIAGILSDR 88
W C + + ++ +L+ +P + + +++T +L + G + G LSD+
Sbjct: 17 IWLCILSF-FSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75

Query: 89 IGRVLTLQLTVLWFAFFTFLCGLAQNYEQLLV-ARTLMGFGFGGEWTAGAVLIGEVIKAR 147
+G L ++ F + + + ++ LL+ AR + G G V++ I
Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKE 135

Query: 148 DRGKAVGLVQSGWAIGWGLTAILYSLMFSLLPPEEAWRALFMLGLLPALFVLVVRRLVKE 207
+RGKA GL+ S A+G G+ + ++ + W L ++ ++ + V + +L+K+
Sbjct: 136 NRGKAFGLIGSIVAMGEGVGPAIGGMIAHYI----HWSYLLLIPMITIITVPFLMKLLKK 191


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2099DHBDHDRGNASE856e-22 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 85.5 bits (211), Expect = 6e-22
Identities = 55/188 (29%), Positives = 85/188 (45%), Gaps = 9/188 (4%)

Query: 2 QNILISGAASGIGAASARLFHRRGWRVGLLDIDAEALRGLAAQLPGAWHRA----VDVSE 57
+ I+GAA GIG A AR +G + +D + E L + + L A DV +
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 58 PDAVGEALAQFCAD-GRLRLLFNCAGVLRFGRFEEVALEDHARLLAINLHGVLNCCHAAF 116
A+ E A+ + G + +L N AGVLR G ++ E+ ++N GV N +
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 117 PFLRATPQAQVLNMGSASGLYGVPE--MAVYSASKFAVRGLTEALELEWRRHGIRVADLM 174
++ ++ +GS GVP MA Y++SK A T+ L LE + IR +
Sbjct: 129 KYMMDRRSGSIVTVGSNPA--GVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186

Query: 175 PPFVRTPM 182
P T M
Sbjct: 187 PGSTETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2092TCRTETA552e-10 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 54.8 bits (132), Expect = 2e-10
Identities = 84/365 (23%), Positives = 139/365 (38%), Gaps = 20/365 (5%)

Query: 23 VGTVELVVAGVLDELAASFAVSQGRAGLLMSLYALVYALLGPLLVYLSAGIERRRLLAGA 82
+G + V+ G+L +L S V G+L++LYAL+ P+L LS RR +L +
Sbjct: 21 IGLIMPVLPGLLRDLVHSNDV-TAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVS 79

Query: 83 LLVFVGANLASAAAPSFALLLASRLLVAASASVIVVVAITLAVAIVAPERRGRAIGLVFA 142
L A AP +L R++ + + V +A I + R R G + A
Sbjct: 80 LAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIA-DITDGDERARHFGFMSA 138

Query: 143 GIVASLVLGVPLGTLIGEFWGWRSLFLLLAGVALLGLPLLLRLL---------PAIPGAP 193
+V G LG L+G F + F A + L LL P A
Sbjct: 139 CFGFGMVAGPVLGGLMGGF-SPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREAL 197

Query: 194 GIAPAEQLRALARGRVPFAHLASLLQMTGQFTVYTYIVPFLVGSMALDKPTISLVLLVYG 253
+ + + ++Q+ GQ +++ F D TI + L +G
Sbjct: 198 NPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVI-FGEDRFHWDATTIGISLAAFG 256

Query: 254 GGGILG-ALLGGRAADRWPGPATFVAFLLLHALALVLLPFATGGLPLLLGAVVFWCVFNM 312
L A++ G A R + ++ +LL FAT G V+
Sbjct: 257 ILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLL--ASGG 314

Query: 313 APGPAIQKYLVELSPDTAAIQISLNTSAIQLGVALGAFIGAILVDQVAVRALPWW-GAAL 371
PA+Q L + Q+ + +A+ +L + +G +L + ++ W G A
Sbjct: 315 IGMPALQAMLSRQVDEERQGQLQGSLAALT---SLTSIVGPLLFTAIYAASITTWNGWAW 371

Query: 372 ILGAA 376
I GAA
Sbjct: 372 IAGAA 376


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2091TCRTETA290.035 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 29.0 bits (65), Expect = 0.035
Identities = 42/175 (24%), Positives = 63/175 (36%), Gaps = 2/175 (1%)

Query: 71 VTLLGLAVALSSPLKGWLVDRWGARALVLPLTAALALCLASLALARSGWQLYLLFALLGL 130
+ L L +P+ G L DR+G R ++L A A+ A +A A W LY+ + G+
Sbjct: 49 LALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGI 108

Query: 131 L-TPGNIPYARILGGWFERRRGAAYGILGLGFGVGGPLALYLGSACIDAFGWRATFLVYG 189
G + A I R +G + FG G LG F A F
Sbjct: 109 TGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFAAA 167

Query: 190 LLEGLLALPLLYALFRERPGDLPQARRTPADALPGATPGQAWRSADFWLIVGNLI 244
L GL L + L G+ RR + L + + V ++
Sbjct: 168 ALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIM 222


43PA2042PA2031Y        NNGenomic Island
LocusTagDNBiasCDNBias%GCBiasProduct
PA2042222-6.523087serine/threonine transporter SstT
PA2041331-8.637149amino acid permease
PA2040329-6.331772glutamine synthetase
PA2039535-5.794140hypothetical protein
PA2038634-5.343721hypothetical protein
PA2037528-4.022305hypothetical protein
PA20361120.539465hypothetical protein
PA20351102.251782thiamine pyrophosphate protein
PA20341131.761129hypothetical protein
PA2033312-0.047134hypothetical protein
PA2032213-0.617826transcriptional regulator
PA2031213-1.309285hypothetical protein
44PA1913PA1899Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA19131113.081395hypothetical protein
PA19120103.420620ECF sigma factor FemI
PA19110103.355207sigma factor regulator FemR
PA1910-192.929802ferric-mycobactin receptor FemA
PA1909-1103.227607hypothetical protein
PA1908-2103.786779major facilitator superfamily transporter
PA1907-2123.189495hypothetical protein
PA19060143.547185hypothetical protein
PA19050142.774157pyridoxamine 5'-phosphate oxidase
PA1904-1121.712767trans-2,3-dihydro-3-hydroxyanthranilate
PA1903010-0.249060phenazine biosynthesis protein PhzE
PA1902010-2.581052phenazine biosynthesis protein PhzD
PA1901111-2.964249phenazine biosynthesis protein PhzC
PA1900-114-4.811122phenazine biosynthesis protein PhzB
PA1899013-4.247075phenazine biosynthesis protein PhzA
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1908TCRTETA997e-25 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 98.7 bits (246), Expect = 7e-25
Identities = 85/335 (25%), Positives = 123/335 (36%), Gaps = 37/335 (11%)

Query: 49 GAAVTVGGIAWMLAARPWGIASDRHGRRRILLGGLAGFALSYGSLCLFIVLALHWTLPTL 108
G + + + A G SDR GRR +LL LAG A+ Y +
Sbjct: 46 GILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDY----------------AI 89

Query: 109 FAFAG---IVLLRGLAGGFYAAVPACTAALVADHVEAQRRAAALAGLGAASAIGMVIGPG 165
A A ++ + + G A A A +AD + RA + A GMV GP
Sbjct: 90 MATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPV 149

Query: 166 LAGLLATHGLVLPLLVTGALPLVALLALWRWLP----------REERRQPNRGAALAIGD 215
L GL+ P AL + L LP R E P A G
Sbjct: 150 LGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGM 209

Query: 216 RRLRRPLAVGFVAMFSVTVAQITVGFFALDRLRLDSADAARVAGIALTAVGVALILAQLL 275
+ +AV F+ V F DR D+ GI+L A G+ LAQ +
Sbjct: 210 TVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATT----IGISLAAFGILHSLAQAM 265

Query: 276 VRRL---DWPPPRLIRFGGLVAAIGFAAVCFADSPPLLWLAFFVAAAGMGWVFPAVSALN 332
+ R + G + G+ + FA + + + A+G G PA+ A+
Sbjct: 266 ITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASG-GIGMPALQAML 324

Query: 333 ANAVRAEEQGAAAGTLVAVHGFGLISGPLLGALLH 367
+ V E QG G+L A+ I GPLL ++
Sbjct: 325 SRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIY 359



Score = 37.1 bits (86), Expect = 9e-05
Identities = 36/132 (27%), Positives = 55/132 (41%), Gaps = 8/132 (6%)

Query: 257 VAGIALTAVGVALILAQLLVRRLDWPPPRLIRFGGLVAAIGFAAVCFADSPPLLWLAFF- 315
+A AL A +L L R P L+ G AA+ +A + A P LW+ +
Sbjct: 49 LALYALMQFACAPVLGAL-SDRFGRRPVLLVSLAG--AAVDYAIMATA---PFLWVLYIG 102

Query: 316 -VAAAGMGWVFPAVSALNANAVRAEEQGAAAGTLVAVHGFGLISGPLLGALLHQLDSRAP 374
+ A G A A+ +E+ G + A GFG+++GP+LG L+ AP
Sbjct: 103 RIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAP 162

Query: 375 YALVGLLLALAA 386
+ L L
Sbjct: 163 FFAAAALNGLNF 174


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1902ISCHRISMTASE351e-125 Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 351 bits (901), Expect = e-125
Identities = 102/207 (49%), Positives = 136/207 (65%), Gaps = 2/207 (0%)

Query: 3 GIPEITAYPLPTAQQLPANLARWSLEPRRAVLLVHDMQRYFLRPLPESLRA--GLVANAA 60
IP I Y +PTA +P N W +P RAVLL+HDMQ YF+ L AN
Sbjct: 2 AIPAIQPYQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANIR 61

Query: 61 RLRRWCVEQGVQIAYTAQPGSMTEEQRGLLKDFWGPGMRASPADREVVEELAPGPDDWLL 120
+L+ CV+ G+ + YTAQPGS + R LL DFWGPG+ + P + +++ ELAP DD +L
Sbjct: 62 KLKNQCVQLGIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLVL 121

Query: 121 TKWRYSAFFHSDLLQRMRAAGRDQLVLCGVYAHVGVLISTVDAYSNDIQPFLVADAIADF 180
TKWRYSAF ++LL+ MR GRDQL++ G+YAH+G L++ +A+ DI+ F V DA+ADF
Sbjct: 122 TKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVADF 181

Query: 181 SEAHHRMALEYAASRCAMVVTTDEVLE 207
S H+MALEYAA RCA V TD +L+
Sbjct: 182 SLEKHQMALEYAAGRCAFTVMTDSLLD 208


45PA1879PA1845Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA18794111.802523hypothetical protein
PA18784111.511438hypothetical protein
PA18774111.487911secretion protein
PA18764101.470938ABC transporter ATP-binding protein/permease
PA18754111.300377hypothetical protein
PA18745131.242717hypothetical protein
PA18730100.565538hypothetical protein
PA18721131.251221hypothetical protein
PA18711141.494172protease LasA
PA1870-1112.139076hypothetical protein
PA1869091.795612acyl carrier protein
PA1868182.155261secretion protein XqhA
PA1867491.592283type VI pilus biosynthesis protein
PA1866a281.315822hypothetical protein
PA1866171.388538hypothetical protein
PA1865091.093277Fanconi-associated nuclease
PA18640111.226685transcriptional regulator
PA18630101.068263molybdenum ABC transporter substrate-binding
PA1862091.151907molybdenum ABC transporter permease ModB
PA1861280.261374molybdenum ABC transporter ATP-binding protein
PA1860390.153898hypothetical protein
PA18593100.334722transcriptional regulator
PA1858280.411900streptomycin 3''-phosphotransferase
PA1857310-0.079752hypothetical protein
PA185649-0.063387cbb3-type cytochrome C oxidase subunit I
PA1855181.540168hypothetical protein
PA1854281.468069hypothetical protein
PA1853291.861964transcriptional regulator
PA18521101.094053hypothetical protein
PA1851-180.311245hypothetical protein
PA1850-190.147722transcriptional regulator
PA1849-19-0.545917hypothetical protein
PA1848-19-1.130247major facilitator superfamily transporter
PA1847110-1.864222Fe/S biogenesis protein NfuA
PA1846110-1.675377cis/trans isomerase
PA1845212-2.011071hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1877RTXTOXIND2969e-99 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 296 bits (759), Expect = 9e-99
Identities = 83/416 (19%), Positives = 179/416 (43%), Gaps = 53/416 (12%)

Query: 24 PVYRPLLWTLLGCVLLFIGWAAWAQLDEVTRGDGRVVPFSRIQKIQSLEGGILDRLLVKE 83
R + + ++G +++ + Q++ V +G++ R ++I+ +E I+ ++VKE
Sbjct: 55 RRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKE 114

Query: 84 GDLVEVGQPLVRLDETRFLTNFQESANQASVLRAAIARLDAEVLGKKSIEFPPDVDPEGP 143
G+ V G L++L + ++ + R R + + P P+ P
Sbjct: 115 GESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEP 174

Query: 144 LARSERELFKSRRDKLVE-----------------------------GTQAIQRQIHLAQ 174
++ E R L++ + + +
Sbjct: 175 YFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEK 234

Query: 175 SQLDLVRPLVAKRAVSQMEALK-------LSQDIATLSGKLTELKS-------------- 213
S+LD L+ K+A+++ L+ ++ +L +++S
Sbjct: 235 SRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQ 294

Query: 214 TYFQDAYTERAQRKADLSALEPIVQQRQDQLRRTEILSPVRGRVNTVLINTRGGVIQPGE 273
+ + + Q ++ L + + +++ + + I +PV +V + ++T GGV+ E
Sbjct: 295 LFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE 354

Query: 274 PIMEVIPVEERLLVEAKIKPRDVAFLVPGMPAKVKITAYDYTIYGDLKGTLEQISADTIE 333
+M ++P ++ L V A ++ +D+ F+ G A +K+ A+ YT YG L G ++ I+ D IE
Sbjct: 355 TLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIE 414

Query: 334 EDTPHGKESYYQVLIKTDGSQLKRGEEVLPIIPGMVAEVDILSGKRSVLNYLLRPL 389
+ + V+I + + L G + +P+ GM +I +G RSV++YLL PL
Sbjct: 415 DQRLG---LVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISYLLSPL 467


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1875RTXTOXIND310.009 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 31.0 bits (70), Expect = 0.009
Identities = 24/162 (14%), Positives = 49/162 (30%), Gaps = 18/162 (11%)

Query: 250 DANVAEAEVREAKASLLPQLNLEASALRREIGGHPESDSVVSLRFRMDTFQGLSNFRRPT 309
A AEA+ + ++SLL R +I S+ L +
Sbjct: 128 TALGAEADTLKTQSSLL---QARLEQTRYQI-------LSRSIELNKLPELKLPDEPYFQ 177

Query: 310 AAQQRLESAKWSADAMQRD-IRRQLQNLFDNGDTLRWREQSLTQQVTESEQVGELYREQ- 367
+ S Q + Q N D R ++ ++ E + + + +
Sbjct: 178 NVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRL 237

Query: 368 ------FEVGRRDVIDLLNVQRERFEAERQLINLRIERKRIE 403
+L + + EA +L + + ++IE
Sbjct: 238 DDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIE 279


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1874CABNDNGRPT492e-07 NodO calcium binding signature.
		>CABNDNGRPT#NodO calcium binding signature.

Length = 479

Score = 48.8 bits (116), Expect = 2e-07
Identities = 35/163 (21%), Positives = 65/163 (39%), Gaps = 8/163 (4%)

Query: 2273 TDSNGNDSAAYGITLTPNGLSLNIGQI-DVNGTSGDDVLSGANGSSEHINGGDGSDLIFN 2331
D+ G D+ + ++LN G DV G G+ ++ E+ GG G+D++
Sbjct: 296 WDAGGTDTFDFSGYSNNQRINLNEGSFSDVGGLKGNVSIAHGVTI-ENAIGGSGNDILVG 354

Query: 2332 VGTGDHVVAGNGNDTIQITATDFVSIDGGAGFDTLVLANGIDLDYNAVGVGT--LSNLER 2389
+ + G GND + A ++ GGAG DT V +G D A +++
Sbjct: 355 NSADNILQGGAGNDVLYGGAGA-DTLYGGAGRDTFVYGSGQDSTVAAYDWIADFQKGIDK 413

Query: 2390 IDLGKGDSGSVLTLTAAEVDAITDANNTLQITGENNDTLNVVG 2432
IDL + + D T + + + +++ +
Sbjct: 414 IDL---SAFRNEGQLSFVQDQFTGKGQEVMLQWDAANSITNLW 453


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1868BCTERIALGSPD5910.0 Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D

signature.
Length = 660

Score = 591 bits (1524), Expect = 0.0
Identities = 196/591 (33%), Positives = 324/591 (54%), Gaps = 26/591 (4%)

Query: 44 EQWTINMKDAEIGDFIEQVSSISGQTFVVDPRVKGRVTVVSQARLSLAEVYQLFLSVLAT 103
E+++ + K +I +FI VS +T ++DP V+G +TV S L+ + YQ FLSVL
Sbjct: 28 EEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDMLNEEQYYQFFLSVLDV 87

Query: 104 HGYAVLPQGDQA-RIVPNMEARQDAAQKTVRDGPG---SLETRVVQAQQTSVAELIPMIR 159
+G+AV+ + ++V + +A+ A PG + TRVV + +L P++R
Sbjct: 88 YGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVVTRVVPLTNVAARDLAPLLR 147

Query: 160 PLVPAHGHLAAV--PSANALIVSDRRSNIERIEAIVRSLDRAGEHDYSIYDMRHAWVAEI 217
L G + V +N L+++ R + I+R+ IV +D AG+ + A A++
Sbjct: 148 QLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGDRSVVTVPLSWASAADV 207

Query: 218 AEV---LDRSVTPAAGKSAATVQVLADSRSNRLVLLGPPQARARLLRLAQSLDVPSSRSA 274
++ L++ + +A + V+AD R+N +++ G P +R R++ + + LD +
Sbjct: 208 VKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSRQRIIAMIKQLDRQQATQG 267

Query: 275 NSRVIRLRHGDAKTLAATLGEIGESLHGER-GQDGRGSGKRGLLVRADESLNALVILADP 333
N++VI L++ A L L I ++ E+ + + ++++A NAL++ A P
Sbjct: 268 NTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPVAALDKNIIIKAHGQTNALIVTAAP 327

Query: 334 EDVGLLEDIVRQLDVPRAQLLVEAAIVELSGEIGDALGVQWALRSGHVAGGAGFADSGLS 393
+ + LE ++ QLD+ R Q+LVEA I E+ G LG+QWA AG F +SGL
Sbjct: 328 DVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWA---NKNAGMTQFTNSGLP 384

Query: 394 IGTLLGAL----QAGKPPAELP------DGAIVGLGSRDFGALVTALSRNSRSNLLSTPS 443
I T + + G + L +G G ++ L+TALS ++++++L+TPS
Sbjct: 385 ISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPS 444

Query: 444 LLTLDNQKAEILVGQNVPFQTGSYTTSASGSSNPFTTVERKDIGVTLKVTPHIGEDRMLR 503
++TLDN +A VGQ VP TGS TTS N F TVERK +G+ LKV P I E +
Sbjct: 445 IVTLDNMEATFNVGQEVPVLTGSQTTS---GDNIFNTVERKTVGIKLKVKPQINEGDSVL 501

Query: 504 LEIEQEISSIAPTATLAAKAVDLVTNKRSIKSTVLADDGQVIVLGGLIQDDLQRSDSRVP 563
LEIEQE+SS+A A+ + + N R++ + VL G+ +V+GGL+ + + +VP
Sbjct: 502 LEIEQEVSSVADAASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVP 561

Query: 564 LLGDIPGVGRLFRSSRETRVKRNLMVFLRPSIVRDAAGLERISHGRYRSIQ 614
LLGDIP +G LFRS+ + KRNLM+F+RP+++RD + S G+Y +
Sbjct: 562 LLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYRQASSGQYTAFN 612


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1864HTHTETR653e-15 TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 65.4 bits (159), Expect = 3e-15
Identities = 25/152 (16%), Positives = 56/152 (36%), Gaps = 8/152 (5%)

Query: 5 RQRNLQLILDAACEVFADCGFSAARLSDVAERAGVAKANVLYYYRSKAQLYEAVLDSIVE 64
Q Q ILD A +F+ G S+ L ++A+ AGV + + ++++ K+ L+ + +
Sbjct: 9 AQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSES 68

Query: 65 PLLEASRPFAGDQP--PAEALRAYVDNKMRIGAERPYAARVFSCEIMRGAPRMPAPLLER 122
+ E + P P LR + + + + + ++++
Sbjct: 69 NIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQ 128

Query: 123 LDAQAERN-----AERIRQWIDEG-LLAPLDP 148
+ ++ I+ L A L
Sbjct: 129 AQRNLCLESYDRIEQTLKHCIEAKMLPADLMT 160


46PA1808PA1797aY        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA180819-3.276195ABC transporter permease
PA1807111-4.196757ABC transporter ATP-binding protein
PA1806113-5.077674NADH-dependent enoyl-ACP reductase
PA1805215-5.920147peptidyl-prolyl cis-trans isomerase D
PA1804117-6.221004*DNA-binding protein HU
PA1803114-5.340824Lon protease
PA1802017-4.071556ATP-dependent protease ATP-binding subunit ClpX
PA1801017-3.254285ATP-dependent Clp protease proteolytic subunit
PA1800017-2.371241trigger factor
PA1799-110-1.172844two-component response regulator ParR
PA1798-19-1.385429two-component sensor ParS
PA1797010-1.866284hypothetical protein
PA1797a29-2.114562hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1806DHBDHDRGNASE639e-14 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 62.8 bits (152), Expect = 9e-14
Identities = 61/262 (23%), Positives = 99/262 (37%), Gaps = 23/262 (8%)

Query: 4 LTGKRALIVGVASKLSIASGIAAAMHREGAELAFTYQNDKLRGRVEEFASGWGSRPELCF 63
+ GK A I G A I +A + +GA +A N + +V E F
Sbjct: 6 IEGKIAFITGAAQ--GIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE-AF 62

Query: 64 PCDVADDSQIEAVFAALGKHWDGLDIIVHSVGF---APGDQL-DGDFTAVTTREGFRIAH 119
P DV D + I+ + A + + +DI+V+ G L D ++ A + +
Sbjct: 63 PADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGV-- 120

Query: 120 DISAYSFIALAKAGREMMKGRNGSLLTLSYLGAERTMPNYNVMGMAKASLEAGVRYLAGS 179
F A + MM R+GS++T+ A + +KA+ + L
Sbjct: 121 ------FNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLE 174

Query: 180 LGAEGTRVNAVSAGPIRTLAASGI--------KSFRKMLAANERQTPLRRNVTIEEVGNA 231
L R N VS G T + + + L + PL++ ++ +A
Sbjct: 175 LAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADA 234

Query: 232 GAFLCSDLASGISGEILYVDGG 253
FL S A I+ L VDGG
Sbjct: 235 VLFLVSGQAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1804DNABINDINGHU1171e-38 Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 117 bits (296), Expect = 1e-38
Identities = 49/88 (55%), Positives = 64/88 (72%)

Query: 2 NKSELIDAIAASADIPKAVAGRALDAVIESVTGALKAGDSVVLVGFGTFAVKERAARTGR 61
NK +LI +A + ++ K + A+DAV +V+ L G+ V L+GFG F V+ERAAR GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 62 NPQTGKPIKIAAAKIPGFKAGKALKDAV 89
NPQTG+ IKI A+K+P FKAGKALKDAV
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1799HTHFIS772e-18 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 77.2 bits (190), Expect = 2e-18
Identities = 31/132 (23%), Positives = 63/132 (47%), Gaps = 5/132 (3%)

Query: 7 SKVLLVEDDQKLARLIASFLSQHGFEVRQVHRGDAAFAAFLDFKPQVVVLDLMLPGQNGL 66
+ +L+ +DD + ++ LS+ G++VR + +VV D+++P +N
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 67 QVCREIRRV-ANLPILILTAQEDDLDHILGLESGADDYVIKPIEPPVLLARLRALM---- 121
+ I++ +LP+L+++AQ + I E GA DY+ KP + L+ + +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 122 RRHAPLPASPES 133
RR + L +
Sbjct: 124 RRPSKLEDDSQD 135


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1798PF06580290.026 Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.4 bits (66), Expect = 0.026
Identities = 19/123 (15%), Positives = 37/123 (30%), Gaps = 31/123 (25%)

Query: 315 QIRIEPRFMARAVINLL-----RNAIRHAHS------RVEIALLDQGDSCQIRVNDDGPG 363
+ +I P M V +L N I+H + ++ + + + V + G
Sbjct: 243 ENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSL 302

Query: 364 IPADARQKIFEPFSRLDDSRDRSTGGFGLGLAIVH-RVAQWHGG-YAEALETPQGGASFR 421
+ ++ G GL V R+ +G L QG +
Sbjct: 303 ALKNTKE------------------STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAM 344

Query: 422 LTW 424
+
Sbjct: 345 VLI 347


47PA1778PA1766Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA1778122-4.174899uroporphyrin-III C-methyltransferase
PA1777321-5.028938outer membrane porin F
PA1776-122-2.917257RNA polymerase sigma factor SigX
PA1775013-2.218640hypothetical protein
PA1774212-2.957515hypothetical protein
PA1773313-3.213360transporter
PA1772212-3.813469ribonuclease activity regulator protein RraA
PA1771210-3.738347hydrolase
PA1770112-3.889539phosphoenolpyruvate synthase
PA1769010-4.181032phosphoenolpyruvate synthase regulatory protein
PA176809-3.694130hypothetical protein
PA176708-3.664277hypothetical protein
PA1766110-3.307077hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1777OMPADOMAIN1631e-49 OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 163 bits (415), Expect = 1e-49
Identities = 89/381 (23%), Positives = 140/381 (36%), Gaps = 79/381 (20%)

Query: 1 MKLKNTLGVVIGSLVAASAMNAFAQGQNSVEIEAFGKRYFTD------SVRNMKNADLYG 54
MK K + + + A+ A + G + D + +N G
Sbjct: 1 MK-KTAIAIAVALAGFATVAQAAPKDNTWYTGAKLGWSQYHDTGFINNNGPTHENQLGAG 59

Query: 55 GSIGYFLTDDVELALSYGEY--HDVRGTYETGNKKVHGNLTSLDAIYHFGTPGVGLRPYV 112
GY + V + Y +G+ E G K G + Y T + + +
Sbjct: 60 AFGGYQVNPYVGFEMGYDWLGRMPYKGSVENGAYKAQGVQLTAKLGYPI-TDDLDIYTRL 118

Query: 113 SAGL----AHQNITNINSDSQ-------GRQQMTMANIGAGLKYYFTENFFAKASLDGQY 161
+ N+ N D+ G + I L+Y +T N ++
Sbjct: 119 GGMVWRADTKSNVYGKNHDTGVSPVFAGGVEYAITPEIATRLEYQWTNNIGDAHTIG--- 175

Query: 162 GLEKRDNGHQGEWMAGLGVGFNFGGSKAAP----APEPVADVCSDSDNDGVCDNVDKCPD 217
+ DNG M LGV + FG +AAP AP P +V +
Sbjct: 176 --TRPDNG-----MLSLGVSYRFGQGEAAPVVAPAPAPAPEVQTKH-------------- 214

Query: 218 TPANVTVDANGCPAVAEVVRVQLDVKFDFDKSKVKENSYADIKNLADFMKQY--PSTSTT 275
++ DV F+F+K+ +K A + L + S
Sbjct: 215 ------------------FTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVV 256

Query: 276 VEGHTDSVGTDAYNQKLSERRANAVRDVLVNEYGVEGGRVNAVGYGESRPVADNATAEGR 335
V G+TD +G+DAYNQ LSERRA +V D L+++ G+ +++A G GES PV N +
Sbjct: 257 VLGYTDRIGSDAYNQGLSERRAQSVVDYLISK-GIPADKISARGMGESNPVTGNTCDNVK 315

Query: 336 ---------AINRRVEAEVEA 347
A +RRVE EV+
Sbjct: 316 QRAALIDCLAPDRRVEIEVKG 336


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1770PHPHTRNFRASE317e-101 Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase

signature.
Length = 572

Score = 317 bits (815), Expect = e-101
Identities = 113/446 (25%), Positives = 191/446 (42%), Gaps = 68/446 (15%)

Query: 360 RAIGQRI-GAGPVKVINDVSEMDKVQPGDVLVSDMTDPDWEPVMK-RASAIVTNRGGRTC 417
R + +R+ G ++ + + ++ D+T D + K T+ GGRT
Sbjct: 132 RDVSKRVLGHLIGVETGSLATIA--EETVIIAEDLTPSDTAQLNKQFVKGFATDIGGRTS 189

Query: 418 HAAIIARELGIPAVVGCGNATQILQDGQGVTVSCAEG---------DTGFIFEGELGFDV 468
H+AI++R L IPAVVG T+ +Q G V V EG + E F+
Sbjct: 190 HSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYEEKRAAFEK 249

Query: 469 RKNSVDAMPDLP--------FKIMMNVGNPDRAFDFAQLPNEGVGLARLEFIINRMIGVH 520
+K + P ++ N+G P EG+GL R EF+
Sbjct: 250 QKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFLY------- 302

Query: 521 PKALLNFAGLPADIKESVEKRIAGYPDPVGFYVEKLVEGISTLAAAFWPKKVIVRLSDFK 580
++ LP + E++ Y + + K V++R D
Sbjct: 303 ----MDRDQLP-----TEEEQFEAYKE---------------VVQRMDGKPVVIRTLDIG 338

Query: 581 SNEYANLIGGKLYEPEEENPMLGFRGASRYISESFRDCFELECRALKKVRNEMGLTNVEI 640
++ + L P+E NP LGFR + + +D F + RAL + N+++
Sbjct: 339 GDKELSY----LQLPKELNPFLGFRAIRLCLEK--QDIFRTQLRALLRAS---TYGNLKV 389

Query: 641 MVPFVRTLGEASQVVELLAGNGLKRGENG------LKVIMMCELPSNALLADEFLEFFDG 694
M P + TL E Q ++ K G ++V +M E+PS A+ A+ F + D
Sbjct: 390 MFPMIATLEELRQAKAIMQEEKDKLLSEGVDVSDSIEVGIMVEIPSTAVAANLFAKEVDF 449

Query: 695 FSIGSNDLTQLTLGLDRDSGIVAHLFDERNPAVKKLLANAIAACNKAGKYIGICGQGPSD 754
FSIG+NDL Q T+ DR + V++L+ +PA+ +L+ I A + GK++G+CG+ D
Sbjct: 450 FSIGTNDLIQYTMAADRMNERVSYLYQPYHPAILRLVDMVIKAAHSEGKWVGMCGEMAGD 509

Query: 755 HPDLARWLMEQGIESVSLNPDSVLDT 780
L+ G++ S++ S+L
Sbjct: 510 -EVAIPLLLGLGLDEFSMSATSILPA 534


48PA1725PA1694Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA1725417-0.010990type III secretion system protein
PA1724418-0.816155type III export protein PscK
PA1723318-1.434019type III export protein PscJ
PA17223150.101276type III export protein PscI
PA17211150.316708type III export protein PscH
PA17201160.687469type III export protein PscG
PA17191150.631796type III export protein PscF
PA1718016-0.544588type III export protein PscE
PA1717-116-0.560341type III export protein PscD
PA1716-219-0.884258type III secretion outer membrane protein PscC
PA1715128-2.025533type III export apparatus protein
PA1714225-2.322608hypothetical protein
PA1713325-2.766279exoenzyme S transcriptional regulator ExsA
PA1712222-1.035002exoenzyme S synthesis protein ExsB
PA1711219-1.553346hypothetical protein
PA1710217-1.564270exoenzyme S synthesis protein ExsC
PA1709216-0.632473translocator outer membrane protein PopD
PA1708313-1.043018translocator protein PopB
PA1707314-0.598363regulatory protein PcrH
PA1706415-0.037941type III secretion protein PcrV
PA17055150.964709type III secretion regulator
PA17045160.433366transcriptional regulator PcrR
PA17035150.572129type III secretory apparatus protein PcrD
PA17024122.140346hypothetical protein
PA17013112.062573hypothetical protein
PA17003172.469445hypothetical protein
PA16992172.599647hypothetical protein
PA16981162.159758type III secretion outer membrane protein PopN
PA16970161.821438type III secretion system ATPase
PA16963171.474566translocation protein in type III secretion
PA16952180.344332translocation protein in type III secretion
PA1694211-1.179490type III secretion system protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1725TYPE4SSCAGX300.009 Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein

signature.
Length = 522

Score = 29.8 bits (66), Expect = 0.009
Identities = 27/102 (26%), Positives = 45/102 (44%), Gaps = 8/102 (7%)

Query: 21 LRARDYQDYLSANRLVEAA--------RERAAEIEREAHEVYQEQKRLGWEAGLEEARLR 72
L RDYQ++L +L+ A +++A E E+EA E Q+ ++ E EE
Sbjct: 117 LMTRDYQEFLKTKKLIVDAPDPKELEEQKKALEKEKEAKEQAQKAQKDKREKRKEERAKN 176

Query: 73 QAGLIQETLLRCNRYYRQVDRQLGEVVLQAVRKVLRHYDAVE 114
+A L T N ++ L E++ Q L + +E
Sbjct: 177 RANLENLTNAMSNPQNLSNNKNLSELIKQQRENELDQMERLE 218


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1723FLGMRINGFLIF751e-17 Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 75.0 bits (184), Expect = 1e-17
Identities = 33/165 (20%), Positives = 69/165 (41%), Gaps = 6/165 (3%)

Query: 27 LYTGISQKEGNEMLALLRSEGVSADKQADKDGTVRLLVEESDIAEAVEVLKRKGYPRENF 86
L++ +S ++G ++A L + + V + E L ++G P+
Sbjct: 53 LFSNLSDQDGGAIVAQLTQMNIPYR--FANGSGA-IEVPADKVHELRLRLAQQGLPKGG- 108

Query: 87 STLKDVFPKDGLISSPIEERARLNYAKAQEISHTLSEIDGVLVARVHVVLPEERDGLGRK 146
+ ++ ++ S E+ A E++ T+ + V ARVH+ +P + R+
Sbjct: 109 AVGFELLDQEKFGISQFSEQVNYQRALEGELARTIETLGPVKSARVHLAMP-KPSLFVRE 167

Query: 147 SSPASASVFIKHAADVQLD-AYVPQIKQLVNNGIEGLSYDRISVV 190
SASV + LD + + LV++ + GL +++V
Sbjct: 168 QKSPSASVTVTLEPGRALDEGQISAVVHLVSSAVAGLPPGNVTLV 212


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1721PF090252052e-71 YopR Core
		>PF09025#YopR Core

Length = 143

Score = 205 bits (522), Expect = 2e-71
Identities = 143/143 (100%), Positives = 143/143 (100%)

Query: 1 MSRIDTPPGFAVYPSASPKAANLPAVDQVLAFEQALGGEPPAAGRRLAGLENGALGERLL 60
MSRIDTPPGFAVYPSASPKAANLPAVDQVLAFEQALGGEPPAAGRRLAGLENGALGERLL
Sbjct: 1 MSRIDTPPGFAVYPSASPKAANLPAVDQVLAFEQALGGEPPAAGRRLAGLENGALGERLL 60

Query: 61 QRFAQPLQGLEADRLELKAMLRAELPLGRQQQTFLLQLLGAVEHAPGGEYLAQLARRELQ 120
QRFAQPLQGLEADRLELKAMLRAELPLGRQQQTFLLQLLGAVEHAPGGEYLAQLARRELQ
Sbjct: 61 QRFAQPLQGLEADRLELKAMLRAELPLGRQQQTFLLQLLGAVEHAPGGEYLAQLARRELQ 120

Query: 121 VLIPLNGMLDNLVRNSHKLDLES 143
VLIPLNGMLDNLVRNSHKLDLES
Sbjct: 121 VLIPLNGMLDNLVRNSHKLDLES 143


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1716TYPE3OMGPROT8160.0 Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein

family signature.
Length = 607

Score = 816 bits (2108), Expect = 0.0
Identities = 375/600 (62%), Positives = 472/600 (78%), Gaps = 7/600 (1%)

Query: 1 MRRLLIGGLLALLPGAVLRAQPLDWPSLPYDYVAQGESLRDVLANFGANYDASVIVSDKV 60
+R+L G LL L + AQ LDW +PY YVA+GESLRD+L +FGANYDA+V+VSDK+
Sbjct: 9 FKRVLTGTLLLLSSYSW--AQELDWLPIPYVYVAKGESLRDLLTDFGANYDATVVVSDKI 66

Query: 61 NDQVSGRFDLESPQAFLQLMASLYNLGWYYDGTVLYVFKTTEMQSRLVRLEQVGEAELKR 120
ND+VSG+F+ ++PQ FLQ +ASLYNL WYYDG VLY+FK +E+ SRL+RL++ AELK+
Sbjct: 67 NDKVSGQFEHDNPQDFLQHIASLYNLVWYYDGNVLYIFKNSEVASRLIRLQESEAAELKQ 126

Query: 121 ALTAAGIWEARFGWRADPSGRLVHVSGPGRYLELVEQTAQVLEQQYTLRSEKTGDLSVEI 180
AL +GIWE RFGWR D S RLV+VSGP RYLELVEQTA LEQQ +RSEKTG L++EI
Sbjct: 127 ALQRSGIWEPRFGWRPDASNRLVYVSGPPRYLELVEQTAAALEQQTQIRSEKTGALAIEI 186

Query: 181 FPLRYAVAEDRKIEYRDDEIEAPGIASILSRVLSDANVVAVGDEPGKLRPGP--QSSHAV 238
FPL+YA A DR I YRDDE+ APG+A+IL RVLSDA + V + ++ S+ A
Sbjct: 187 FPLKYASASDRTIHYRDDEVAAPGVATILQRVLSDATIQQVTVDNQRIPQAATRASAQAR 246

Query: 239 VQAEPSLNAVVVRDHKDRLPMYRRLIEALDRPSARIEVGLSIIDINAENLAQLGVDWSAG 298
V+A+PSLNA++VRD +R+PMY+RLI ALD+PSARIEV LSI+DINA+ L +LGVDW G
Sbjct: 247 VEADPSLNAIIVRDSPERMPMYQRLIHALDKPSARIEVALSIVDINADQLTELGVDWRVG 306

Query: 299 IRLGNNKSIQIRTTGQDSEEGGGAGNGAVGSLVDSRGLDFLLAKVTLLQSQGQAQIGSRP 358
IR GNN + I+TTG S A NGA+GSLVD+RGLD+LLA+V LL+++G AQ+ SRP
Sbjct: 307 IRTGNNHQVVIKTTGDQS---NIASNGALGSLVDARGLDYLLARVNLLENEGSAQVVSRP 363

Query: 359 TLLTQENTQAVLDQSETYYVRVTGERVAELKAITYGTMLKMTPRVVTLGDTPEISLSLHI 418
TLLTQEN QAV+D SETYYV+VTG+ VAELK ITYGTML+MTPRV+T GD EISL+LHI
Sbjct: 364 TLLTQENAQAVIDHSETYYVKVTGKEVAELKGITYGTMLRMTPRVLTQGDKSEISLNLHI 423

Query: 419 EDGSQKPNSAGLDKIPTINRTVIDTIARVGHGQSLLIGGIYRDELSQSQRKVPWLGDIPY 478
EDG+QKPNS+G++ IPTI+RTV+DT+ARVGHGQSL+IGGIYRDELS + KVP LGDIPY
Sbjct: 424 EDGNQKPNSSGIEGIPTISRTVVDTVARVGHGQSLIIGGIYRDELSVALSKVPLLGDIPY 483

Query: 479 LGALFRTTADTVRRSVRLFLIEPRLIDDGVGHYLALNNRRDLRGGLLEIDELSNQSLSLR 538
+GALFR ++ RR+VRLF+IEPR+ID+G+ H+LAL N +DLR G+L +DE+SNQS +L
Sbjct: 484 IGALFRRKSELTRRTVRLFIIEPRIIDEGIAHHLALGNGQDLRTGILTVDEISNQSTTLN 543

Query: 539 KLLGSARCQALAPARAEQERLRQAGQGSFLTPCRMGAQEGWRVTDGACPKDGAWCVGAER 598
KLLG ++CQ L A+ Q+ L Q + S+LT C+M GWRV +GAC +WCV A +
Sbjct: 544 KLLGGSQCQPLNKAQEVQKWLSQNNKSSYLTQCKMDKSLGWRVVEGACTPAQSWCVSAPK 603


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1715PF05932932e-27 Tir chaperone protein (CesT)
		>PF05932#Tir chaperone protein (CesT)

Length = 127

Score = 92.6 bits (230), Expect = 2e-27
Identities = 25/120 (20%), Positives = 41/120 (34%), Gaps = 5/120 (4%)

Query: 2 DHLLSGLATRLGQGPFVADRTGSYHLRIDGQSVLLLRQGDDLLLESPLEHAPLDPQRDQQ 61
LL + L P V D G+ ++ ID L L D E L L+P +
Sbjct: 7 KTLLDDFSRSLEMQPLVFDDHGTCNMIIDNTFALTLS--CDYARERLLLIGLLEP--HKD 62

Query: 62 GLLRALLSRVASWSRRYPQAIVLDADGRLLLQA-RLGLDGLDPERLERALAAQVGLLEAL 120
+ LL+ + + LD L + + L L+R +A + +
Sbjct: 63 IPQQCLLAGALNPLLNAGPGLGLDEKSGLYHAYQSIPREKLSVPTLKREMAGLLEWMRGW 122


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1710PF05932477e-10 Tir chaperone protein (CesT)
		>PF05932#Tir chaperone protein (CesT)

Length = 127

Score = 47.5 bits (113), Expect = 7e-10
Identities = 27/118 (22%), Positives = 49/118 (41%), Gaps = 4/118 (3%)

Query: 10 LLAEFAGRIGLPSLSLDEEGMASLLFDEQVGVTLLLLAERERLLLEADVAGIDVLGEGIF 69
LL +F+ + + L D+ G +++ D +TL RERLLL + +
Sbjct: 9 LLDDFSRSLEMQPLVFDDHGTCNMIIDNTFALTLSCDYARERLLLIGLLEP---HKDIPQ 65

Query: 70 RQLASFNRHWHRFDLH-FGFDELTGKVQLYAQILAAQLTLECFEATLANLLDHAEFWQ 126
+ L + + G DE +G Y I +L++ + +A LL+ W+
Sbjct: 66 QCLLAGALNPLLNAGPGLGLDEKSGLYHAYQSIPREKLSVPTLKREMAGLLEWMRGWR 123


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1709PF05844385e-137 YopD protein
		>PF05844#YopD protein

Length = 295

Score = 385 bits (989), Expect = e-137
Identities = 291/295 (98%), Positives = 293/295 (99%)

Query: 1 MIDTQYSLAATQAAIPSEPIAPGAAGRSVGTPQAAADLPQVPAARADRVELNAPRQVLDP 60
MIDTQYSLAATQAAIPSEPIAPGAAGRSVGTPQAAA+LPQVPAARADRVELNAPRQVLDP
Sbjct: 1 MIDTQYSLAATQAAIPSEPIAPGAAGRSVGTPQAAAELPQVPAARADRVELNAPRQVLDP 60

Query: 61 VRMEAAGSELDSSVELLLILFRIAQKARELGVLQRDNENQSIIHAQKAQVDEMRSGATLM 120
VRMEAAGSELDSSVELLLILFRIAQKARELGVLQRDNENQ+IIHAQKAQVDEMRSGATLM
Sbjct: 61 VRMEAAGSELDSSVELLLILFRIAQKARELGVLQRDNENQAIIHAQKAQVDEMRSGATLM 120

Query: 121 IAMAVIAGVGALASAVVGSLGALKNGKAISQEKTLQKNIDGRNELIDAKMQALGKTSDED 180
IAMAVIAGVGALASAVVGSLGALKNGKAISQEKTLQKNIDGRNELIDAKMQALGKTSDED
Sbjct: 121 IAMAVIAGVGALASAVVGSLGALKNGKAISQEKTLQKNIDGRNELIDAKMQALGKTSDED 180

Query: 181 RKIVGKVWAADQVQDSVALRAAGRAFESRNGALQVANTVIQSFVQMANASVQVRQGESQA 240
RKIVGKVWAADQ QDSVALRAAGRAFESRNGALQVANTVIQSFVQMANASVQVRQGESQA
Sbjct: 181 RKIVGKVWAADQAQDSVALRAAGRAFESRNGALQVANTVIQSFVQMANASVQVRQGESQA 240

Query: 241 SAREGEVNATIGQSQKQKVEDQMSFDAGFMKDVLQLIQQYTQSHNQAWRAAAGVV 295
SARE EVNATIGQSQKQKVEDQMSFDAGFMKDVLQLIQQYTQSHNQAWRAAAGVV
Sbjct: 241 SAREEEVNATIGQSQKQKVEDQMSFDAGFMKDVLQLIQQYTQSHNQAWRAAAGVV 295


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1707SYCDCHAPRONE2084e-72 Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD

chaperone signature.
Length = 168

Score = 208 bits (531), Expect = 4e-72
Identities = 95/167 (56%), Positives = 126/167 (75%)

Query: 1 MNQPTPSDTDQQQALEAFLRDGGTLAMLRGLSEDTLEQLYALGFNQYQAGKWDDAQKIFQ 60
M Q T + Q A+E+FL+ GGT+AML +S DTLEQLY+L FNQYQ+GK++DA K+FQ
Sbjct: 1 MQQETTDTQEYQLAMESFLKGGGTIAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQ 60

Query: 61 ALCMLDHYDARYFLGLGACRQSLGLYEQALQSYSYGALMDINEPRFPFHAAECHLQLGDL 120
ALC+LDHYD+R+FLGLGACRQ++G Y+ A+ SYSYGA+MDI EPRFPFHAAEC LQ G+L
Sbjct: 61 ALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGEL 120

Query: 121 DGAESGFYSARALAAAQPAHEALAARAGAMLEAVTARKDRAYESDNA 167
AESG + A+ L A + + L+ R +MLEA+ +K+ +E +
Sbjct: 121 AEAESGLFLAQELIADKTEFKELSTRVSSMLEAIKLKKEMEHECVDN 167


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1706LCRVANTIGEN344e-121 Low calcium response V antigen signature.
		>LCRVANTIGEN#Low calcium response V antigen signature.

Length = 326

Score = 344 bits (884), Expect = e-121
Identities = 115/296 (38%), Positives = 171/296 (57%), Gaps = 32/296 (10%)

Query: 25 ASAEQEELLALLRSERIVLAHAGQPLSEAQVL-------------KALAWLLAANPSAPP 71
S+ EEL+ L++ + I ++ P +++V K LA+ L +
Sbjct: 28 GSSVLEELVQLVKDKNIDISIKYDPRKDSEVFANRVITDDIELLKKILAYFLPEDAILKG 87

Query: 72 GQ-------GLEVLREVLQARRQPGAQWDLREFLVSAYFSLHG-RLDEDVIGVYKDVLQT 123
G G++ ++E L++ P QW+LR F+ +FSL R+D+D++ V D +
Sbjct: 88 GHYDNQLQNGIKRVKEFLES--SPNTQWELRAFMAVMHFSLTADRIDDDILKVIVDSMNH 145

Query: 124 QDGKRKALLDELKALTAELKVYSVIQSQINAALSAKQGIRIDAGGIDLVDPTLYGYAVGD 183
R L +EL LTAELK+YSVIQ++IN LS+ I I I+L+D LYGY +
Sbjct: 146 HGDARSKLREELAELTAELKIYSVIQAEINKHLSSSGTINIHDKSINLMDKNLYGYT-DE 204

Query: 184 PRWKDSPEYALLSNLDTFSGKL--------SIKDFLSGSPKQSGELKGLSDEYPFEKDNN 235
+K S EY +L + + ++ SIKDFL K++G L L + Y + KDNN
Sbjct: 205 EIFKASAEYKILEKMPQTTIQVDGSEKKIVSIKDFLGSENKRTGALGNLKNSYSYNKDNN 264

Query: 236 PVGNFATTVSDRSRPLNDKVNEKTTLLNDTSSRYNSAVEALNRFIQKYDSVLRDIL 291
+ +FATT SD+SRPLND V++KTT L+D +SR+NSA+EALNRFIQKYDSV++ +L
Sbjct: 265 ELSHFATTCSDKSRPLNDLVSQKTTQLSDITSRFNSAIEALNRFIQKYDSVMQRLL 320


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1698PF072012844e-98 Hypersensitivity response secretion protein HrpJ
		>PF07201#Hypersensitivity response secretion protein HrpJ

Length = 293

Score = 284 bits (727), Expect = 4e-98
Identities = 134/294 (45%), Positives = 181/294 (61%), Gaps = 7/294 (2%)

Query: 1 MDILQSSSAAPLA-----PREAANAPAQQAGGSFQGERVHYVSVS-QSLADAAEELTFAF 54
M L + S P A++ Q G F+GE V VS + QS+AD AEE+TF F
Sbjct: 1 MTTLHNLSYGNTPLHNERPEIASSQIVNQTLGQFRGESVQIVSGTLQSIADMAEEVTFVF 60

Query: 55 SERAEKSLAKRRLSDAHARLSEVQAMLQEYWKRIPDLESQQKLEALIAHLGSGQLSSLAQ 114
SER E SL KR+LSD+ AR+S+V+ + +Y ++P+LE +Q + L++ L + SL+Q
Sbjct: 61 SERKELSLDKRKLSDSQARVSDVEEQVNQYLSKVPELEQKQNVSELLSLLSNSPNISLSQ 120

Query: 115 LSAYLEGFSSEISQRFLALSRARDVLAGRPEARAMLALVDQALLRMADEQGLEIELGLRI 174
L AYLEG S E S++F L RD L GRPE + LV+QAL+ MA+EQG I LG RI
Sbjct: 121 LKAYLEGKSEEPSEQFKMLCGLRDALKGRPELAHLSHLVEQALVSMAEEQGETIVLGARI 180

Query: 175 EPLAAEASAAGVGDIQALRDTYRDAVLDYRGLSAAWQDIQARFAATPLERVVAFLQKALS 234
P A S +GV +Q LRDTYRDAV+ Y+G+ A W D+Q RF ++ V+ FLQKALS
Sbjct: 181 TPEAYRESQSGVNPLQPLRDTYRDAVMGYQGIYAIWSDLQKRFPNGDIDSVILFLQKALS 240

Query: 235 ADLDSQSSRLDPVKLERVMSDMHKLRVLGGLAEQVGALWQVLVTGERGHGIRAF 288
ADL SQ S KL V+SD+ KL+ G +++QV WQ G + +G+R F
Sbjct: 241 ADLQSQQSGSGREKLGIVISDLQKLKEFGSVSDQVKGFWQFFSEG-KTNGVRPF 293


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1695IGASERPTASE401e-05 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 40.4 bits (94), Expect = 1e-05
Identities = 29/133 (21%), Positives = 42/133 (31%), Gaps = 18/133 (13%)

Query: 19 APLPPLRAQQIAFEQALPAHRPPAPRPPFDKGDETTEAAATADAPTSTPLADQPAAPAAD 78
+ + P + Q + R P T + T+T AD PA +
Sbjct: 1127 SQVSPKQEQSETVQPQAEPARENDP----------TVNIKEPQSQTNTT-ADT-EQPAKE 1174

Query: 79 RPPTIRQPPMPVAADATPTPTPTPTPTPTPTPTPTPTPTV-SPSGSVARQAPAVSARVAA 137
+ Q PV T + P T T PTV S S + + S R
Sbjct: 1175 TSSNVEQ---PVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVP 1231

Query: 138 STQAREPASVSAP 150
EPA+ S+
Sbjct: 1232 HNV--EPATTSSN 1242


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1694TYPE3OMOPROT832e-20 Type III secretion system outer membrane O protein ...
		>TYPE3OMOPROT#Type III secretion system outer membrane O protein

family signature.
Length = 303

Score = 83.1 bits (205), Expect = 2e-20
Identities = 46/177 (25%), Positives = 73/177 (41%), Gaps = 14/177 (7%)

Query: 130 RLALWLDGDPATLLARLPPRPSAQRLAIPLRLSLQWPGLPLDASELRTLEPGDLLLLPAG 189
R LW + P L A RP R + + L L + GD+LL+
Sbjct: 126 RGGLWFEHLPE-LPAVGGGRPKMLRWPLRFVIGSSDTQRSL----LGRIGIGDVLLIRTS 180

Query: 190 HRPDAALLGVLEGRPWARCQLHSTQL-ELLDMH----DTPSLADGEDLHELDQLPIPVSF 244
A + + ++ + E LD+ + + E L L+QLP+ + F
Sbjct: 181 R----AEVYCYAKKLGHFNRVEGGIIVETLDIQHIEEENNTTETAETLPGLNQLPVKLEF 236

Query: 245 EVGRRTLDLHTLSTLQPGSLLDLDSALDGEVRILANQRCLGIGELVRLQDRLGVRVT 301
+ R+ + L L + LL L + + V I+AN LG GELV++ D LGV +
Sbjct: 237 VLYRKNVTLAELEAMGQQQLLSLPTNAELNVEIMANGVLLGNGELVQMNDTLGVEIH 293


49PA1589PA1581Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA1589223-3.393867succinyl-CoA ligase subunit alpha
PA1588223-3.222138succinyl-CoA ligase subunit beta
PA1587322-3.2677772-oxoglutarate dehydrogenase complex
PA1586221-3.7086202-oxoglutarate dehydrogenase complex
PA1585219-4.4993332-oxoglutarate dehydrogenase subunit E1
PA1584217-4.813957succinate dehydrogenase iron-sulfur subunit
PA1583117-4.399258succinate dehydrogenase flavoprotein subunit
PA1582-113-3.546604succinate dehydrogenase subunit D
PA1581-212-3.514304succinate dehydrogenase subunit C
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1587ABC2TRNSPORT300.024 ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein

signature.
Length = 262

Score = 29.5 bits (66), Expect = 0.024
Identities = 15/51 (29%), Positives = 23/51 (45%), Gaps = 2/51 (3%)

Query: 317 IGDVVRGAMLAHKASEEGVMVAERIAGHKAQMNYDLIPSVIYTHPEIAWVG 367
+GD+V G M A+ + + I A + Y S++Y P IA G
Sbjct: 110 LGDIVLGEMAW--AATKAALAGAGIGVVAAALGYTQWLSLLYALPVIALTG 158


50PA1558PA1554Y        NNGenomic Island
LocusTagDNBiasCDNBias%GCBiasProduct
PA1558414-2.791822hypothetical protein
PA1557517-4.034281cbb3-type cytochrome C oxidase subunit I
PA1556317-4.718991cbb3-type cytochrome C oxidase subunit II
PA1555.1319-4.213396cytochrome C oxidase cbb3-type subunit CcoQ
PA1555217-3.873004cytochrome C oxidase cbb3-type subunit CcoP
PA1554216-3.816497cbb3-type cytochrome C oxidase subunit I
51PA1515PA1503Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA1515115-3.431347allantoicase
PA1514025-5.834300ureidoglycolate hydrolase
PA1513126-5.825023hypothetical protein
PA1512124-5.415960secreted protein Hcp
PA1511126-5.311233hypothetical protein
PA1510230-6.308561hypothetical protein
PA1509018-4.195045hypothetical protein
PA150839-0.142295hypothetical protein
PA15072120.051247transporter
PA1506-112-0.187069hypothetical protein
PA1505015-0.362396cyclic pyranopterin monophosphate synthase
PA15040150.028329transcriptional regulator
PA15032140.635277hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1508OMADHESIN250.048 Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 24.9 bits (53), Expect = 0.048
Identities = 20/72 (27%), Positives = 26/72 (36%), Gaps = 9/72 (12%)

Query: 22 QTDLNGKPMAGVGHQVVCP---------LCKGTFPITEGSALLDVNGVPVALHGMKTACG 72
Q N P G+ + V P KG I G+ G VA+ A G
Sbjct: 38 QISPNADPALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATG 97

Query: 73 ASLIASGPLGAA 84
+ +A GPL A
Sbjct: 98 VNSVAIGPLSKA 109


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1504HTHTETR712e-17 TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 71.2 bits (174), Expect = 2e-17
Identities = 24/166 (14%), Positives = 53/166 (31%), Gaps = 16/166 (9%)

Query: 5 RERNKRLILRAASEEFADKGFAATKTSDIAARAGLPKPNVYYYFQSKENLYRCVLESIVE 64
+ ++ IL A F+ +G ++T +IA AG+ + +Y++F+ K +L+ + E
Sbjct: 9 AQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSES 68

Query: 65 PLLQASA--PFRVEDDPLLALPAYIRSKIRISRELPH----ASKVFASEIMHGAPHLPKE 118
+ + + DPL L + + + +F G
Sbjct: 69 NIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE----MA 124

Query: 119 YLDELNAQAQRNVTCLQTW-----IDRGQL-APVDPHHLLFAIWAA 158
+ + I+ L A + +
Sbjct: 125 VVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGY 170


52PA1480PA1474Y        NNGenomic Island
LocusTagDNBiasCDNBias%GCBiasProduct
PA14802100.495704cytochrome c-type biogenesis protein CcmF
PA14796100.212261cytochrome c-type biogenesis protein CcmE
PA14786100.529180heme exporter protein CcmD
PA1477690.332673heme exporter protein CcmC
PA14768110.773063heme exporter protein CcmB
PA14755101.483241cytochrome c biogenesis ATP-binding export
PA1474291.424758hypothetical protein
53PA1461PA1447Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA1461212-1.202708flagellar motor protein MotD
PA1460112-1.336459flagellar motor protein
PA1459112-1.567433chemotaxis-specific methylesterase
PA1458-111-1.282274two-component sensor
PA145709-1.592002protein phosphatase CheZ
PA1456013-0.459239chemotaxis protein CheY
PA14550150.181610flagellar biosynthesis sigma factor FliA
PA14541150.411224flagellar synthesis regulator FleN
PA14532170.557393flagellar biosynthesis regulator FlhF
PA14523170.118638flagellar biosynthesis protein FlhA
PA14513200.158947hypothetical protein
PA14504190.004032hypothetical protein
PA1449619-0.669460flagellar biosynthesis protein FlhB
PA1448618-1.377531flagellar biosynthesis protein FliR
PA1447316-1.847776flagellar biosynthesis protein FliQ
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1461OMPADOMAIN691e-15 OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 69.2 bits (169), Expect = 1e-15
Identities = 34/125 (27%), Positives = 52/125 (41%), Gaps = 16/125 (12%)

Query: 128 EITLNSSLLFPSGDALPNDAAFDIVEKVAKILAPYKNP---IHVEGFTDDVPIHSPRYPT 184
TL S +LF A ++++ L+ + V G+TD I S Y
Sbjct: 214 HFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDR--IGSDAY-- 269

Query: 185 NWELSAARAASIVRLLGNDGVEPSRMAAVGYGEFQPVADNASAEGR---------AKNRR 235
N LS RA S+V L + G+ +++A G GE PV N + A +RR
Sbjct: 270 NQGLSERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRR 329

Query: 236 VVLVI 240
V + +
Sbjct: 330 VEIEV 334


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1459HTHFIS582e-11 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 58.3 bits (141), Expect = 2e-11
Identities = 37/142 (26%), Positives = 56/142 (39%), Gaps = 6/142 (4%)

Query: 2 AVKVLVVDDSGFFRRRVSEILSADGQIQVVGTGTNGREAIEQALALRPDVITMDYEMPLM 61
+LV DD R +++ LS G V T N A D++ D MP
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAG-YDVRITS-NAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 62 DGITAVRNIMQRCP-TPVLMFSSLTHEGARVTLDALDAGAVDYLPKNFEDISRNPDKVRQ 120
+ + I + P PVL+ S+ + A + GA DYLPK F D++ + +
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAI--KASEKGAYDYLPKPF-DLTELIGIIGR 117

Query: 121 LLCEKVLTIARSNRRSISLPPL 142
L E ++ S PL
Sbjct: 118 ALAEPKRRPSKLEDDSQDGMPL 139


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1458PF06580427e-06 Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 41.8 bits (98), Expect = 7e-06
Identities = 13/69 (18%), Positives = 30/69 (43%), Gaps = 10/69 (14%)

Query: 462 ETDLDKNLVEALADPLV--HLVRNAVDHGIESPEEREAAGKPRVGQVVLSAEQEGDHILL 519
E ++ +++ P++ LV N + HGI P+ G+++L ++ + L
Sbjct: 243 ENQINPAIMDVQVPPMLVQTLVENGIKHGIA--------QLPQGGKILLKGTKDNGTVTL 294

Query: 520 MITDDGKGM 528
+ + G
Sbjct: 295 EVENTGSLA 303


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1456HTHFIS902e-24 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.9 bits (223), Expect = 2e-24
Identities = 32/120 (26%), Positives = 55/120 (45%), Gaps = 3/120 (2%)

Query: 2 KILIVDDFSTMRRIIKNLLRDLGFTNTAEADDGTTALPMLHSGNFDFLVTDWNMPGMTGI 61
IL+ DD + +R ++ L G+ + T + +G+ D +VTD MP
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 DLLRAVRADERLKHLPVLMVTAEAKRDQIIEAAQAGVNGYVVKPFTAQVLKEKIEKIFER 121
DLL ++ + LPVL+++A+ I+A++ G Y+ KPF L I +
Sbjct: 64 DLLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1451cloacin300.022 Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 30.1 bits (67), Expect = 0.022
Identities = 15/48 (31%), Positives = 20/48 (41%)

Query: 398 SAGGSGGGRRRGGDYASSSGSSSSSSSSSSSDSFSGGGGSSGGGGASG 445
+ G GGG G ++S + S S G G+ GG G SG
Sbjct: 25 TGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSG 72


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1450cloacin363e-04 Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 35.8 bits (82), Expect = 3e-04
Identities = 15/37 (40%), Positives = 19/37 (51%)

Query: 380 GGGGSSGSSGGGSSSSSSSSSGGFSGGGGSSGGGGAS 416
GGG SG GG S + G SGGG +GG ++
Sbjct: 47 GGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSA 83



Score = 34.3 bits (78), Expect = 8e-04
Identities = 22/61 (36%), Positives = 24/61 (39%), Gaps = 15/61 (24%)

Query: 373 GQVRLSGGGGGSSGSS--------GGGSSS-------SSSSSSGGFSGGGGSSGGGGASD 417
G L GGG S GS GGGS S S + GG GG SG GG
Sbjct: 23 GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLS 82

Query: 418 S 418
+
Sbjct: 83 A 83



Score = 33.1 bits (75), Expect = 0.002
Identities = 15/38 (39%), Positives = 18/38 (47%)

Query: 379 GGGGGSSGSSGGGSSSSSSSSSGGFSGGGGSSGGGGAS 416
GGG GS GGGS + +G GG G+ G A
Sbjct: 47 GGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAV 84



Score = 30.5 bits (68), Expect = 0.013
Identities = 15/40 (37%), Positives = 15/40 (37%)

Query: 380 GGGGSSGSSGGGSSSSSSSSSGGFSGGGGSSGGGGASDSW 419
GG G GG S S SS GGG SG S
Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61



Score = 30.5 bits (68), Expect = 0.016
Identities = 16/34 (47%), Positives = 21/34 (61%), Gaps = 1/34 (2%)

Query: 385 SGSSGGGSSSSSSSSSGGFSGG-GGSSGGGGASD 417
SG G G ++ + S+SG +GG G GGGASD
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASD 35



Score = 29.7 bits (66), Expect = 0.023
Identities = 13/37 (35%), Positives = 18/37 (48%)

Query: 378 SGGGGGSSGSSGGGSSSSSSSSSGGFSGGGGSSGGGG 414
GG G S G SG G + S+ ++ F S+ G G
Sbjct: 65 GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAG 101


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1449TYPE3IMSPROT336e-116 Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein

family signature.
Length = 354

Score = 336 bits (864), Expect = e-116
Identities = 98/345 (28%), Positives = 183/345 (53%), Gaps = 2/345 (0%)

Query: 9 DKTEEPTEKRRREAREKGQLPRSRELNTLAILMAGAGGLLIYGADLAGALLRLMRSNFEL 68
+KTE+PT K+ R+AR+KGQ+ +S+E+ + A+++A + L+ +LM E
Sbjct: 4 EKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAEQ 63

Query: 69 SRETAMNTESMLQLLGASAYLAAQGLWPILLMLLVAAIVGPIALGGWLFSMDALQPKFSR 128
S ++++ ++ +P+L + + AI + G+L S +A++P +
Sbjct: 64 SYLPF--SQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKK 121

Query: 129 LNPLSGLKRMFSAKSLLELSKALIKFLVVLAVALLVLSADRDALLALAHQPLEQAILHSV 188
+NP+ G KR+FS KSL+E K+++K +++ + +++ + LL L +E
Sbjct: 122 INPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLG 181

Query: 189 RVVGWSAFWMACSLLLIAAVDVPYQIWDNRQKLLMTKQEVRDEYKDSEGKPEVKSKIRQM 248
+++ ++I+ D ++ + ++L M+K E++ EYK+ EG PE+KSK RQ
Sbjct: 182 QILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQF 241

Query: 249 QREMAQRRMMAAVPEADVVITNPTHFAVALKYDPAGGGAPLLLAKGNDFLALKIREVAQE 308
+E+ R M V + VV+ NPTH A+ + Y PL+ K D +R++A+E
Sbjct: 242 HQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEE 301

Query: 309 HKVMVMESPALARAVYYSTELDQEIPAGLYLAVAQVLAYVYQLKQ 353
V +++ LARA+Y+ +D IPA A A+VL ++ +
Sbjct: 302 EGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNI 346


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1448TYPE3IMRPROT1341e-40 Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein

family signature.
Length = 261

Score = 134 bits (340), Expect = 1e-40
Identities = 96/232 (41%), Positives = 144/232 (62%), Gaps = 2/232 (0%)

Query: 1 MLELTNAQIGGWIASFVLPLFRVAALLMTMPVIGTQLVPVRVRLYLALGVCVVLVPNLPS 60
ML++T+ Q W+ + PL RV AL+ T P++ + VP RV+L LA+ + + P+LP+
Sbjct: 1 MLQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60

Query: 61 MPQVDALSMKAMLLIGEQILVGALLGFSLQLLFHAFVIAGQIISMQMGLGFASMVDPANG 120
S A+ L +QIL+G LGF++Q F A AG+II +QMGL FA+ VDPA+
Sbjct: 61 NDVPV-FSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASH 119

Query: 121 VSVPVLGQFFTMLVTLLFLAMNGHLVVFEVIAESFVTLPVGEGLSGNHFWI-IAGKLGWV 179
+++PVL + ML LLFL NGHL + ++ ++F TLP+G ++ ++ + +
Sbjct: 120 LNMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLI 179

Query: 180 MGAALLLALPAITALLVVNLAFGAMTRAAPQLNIFSIGFPLTLVLGLVILWI 231
L+LALP IT LL +NLA G + R APQL+IF IGFPLTL +G+ ++
Sbjct: 180 FLNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAA 231


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1447TYPE3IMQPROT559e-14 Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein

family signature.
Length = 86

Score = 54.8 bits (132), Expect = 9e-14
Identities = 24/75 (32%), Positives = 43/75 (57%)

Query: 7 LDLFREALWLTAMIVGVLVVPSLLVGLVVAMFQAATQINEQTLSFLPRLMVILLTLIVLG 66
+ +AL+L ++ G + + ++GL+V +FQ TQ+ EQTL F +L+ + L L +L
Sbjct: 5 VFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLLS 64

Query: 67 PWLLRQLMEYTQTLI 81
W L+ Y + +I
Sbjct: 65 GWYGEVLLSYGRQVI 79


54PA1408PA1369Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA1408292.003779hypothetical protein
PA1407391.834168hypothetical protein
PA14062101.587043hypothetical protein
PA1405-1102.724027helicase
PA1404-1103.156832hypothetical protein
PA14030103.138356transcriptional regulator
PA14020103.122376hypothetical protein
PA14010102.837402hypothetical protein
PA14000102.570246pyruvate carboxylase
PA13990131.539154transcriptional regulator
PA1398016-1.084702hypothetical protein
PA1397022-2.630059two-component response regulator
PA1396-229-3.350014two-component sensor
PA1395040-5.283136hypothetical protein
PA1394143-5.264326hypothetical protein
PA1393144-5.925527adenylyl-sulfate kinase
PA1392045-5.778384hypothetical protein
PA1391146-7.048675glycosyl transferase family protein
PA1390251-9.405668glycosyl transferase family protein
PA1389254-10.170015glycosyl transferase family protein
PA1388250-10.649001hypothetical protein
PA1387248-10.034908hypothetical protein
PA1386349-10.393835ABC transporter ATP-binding protein
PA1385242-8.220356glycosyl transferase family protein
PA1384133-5.819088UDP-glucose 4-epimerase
PA1383127-4.924545hypothetical protein
PA1382-124-3.250018type II secretion system protein
PA1381-214-0.793694hypothetical protein
PA1380-1141.215340transcriptional regulator
PA1379-2141.055476short-chain dehydrogenase
PA1378-2151.817305hypothetical protein
PA1377429-5.323086hypothetical protein
PA1376636-7.764398bifunctional isocitrate dehydrogenase
PA1375754-11.167212erythronate-4-phosphate dehydrogenase
PA1374866-15.268484hypothetical protein
PA1373563-15.0639003-oxoacyl-ACP synthase
PA1372569-16.760539hypothetical protein
PA1371355-13.945092hypothetical protein
PA1370134-8.816157hypothetical protein
PA1369022-5.523650hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1408RTXTOXIND396e-05 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 39.0 bits (91), Expect = 6e-05
Identities = 25/201 (12%), Positives = 63/201 (31%), Gaps = 13/201 (6%)

Query: 3 RASLHMLLCQFAMALGLLLSLGSEAWAARPAPQAAVDLEAPAALAEDASLDQLNAQLDLI 62
A A + + + L ++ S +++ LI
Sbjct: 133 EADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYF-QNVSEEEVLRLTSLI 191

Query: 63 RQRVTADASDDLLAELRQSALQVQRQ-ADALLALRVADIERLDDQLKVIGPPQPDEAESL 121
+++ + + EL + +R A + +L + SL
Sbjct: 192 KEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLD--------DFSSL 243

Query: 122 AAQRQALTRQKNALLDDERQATQLGQSSRDLAAQIVNLRRSLFNSQISSRAATPFSPSFW 181
++ K+A+L+ E + + R +Q+ + + +++ + T +
Sbjct: 244 LHKQAI---AKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEI 300

Query: 182 STLIRPTDDDLRRLDKLKAEA 202
+R T D++ L A+
Sbjct: 301 LDKLRQTTDNIGLLTLELAKN 321


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1403HTHTETR587e-13 TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 58.5 bits (141), Expect = 7e-13
Identities = 24/149 (16%), Positives = 54/149 (36%)

Query: 14 QPQQARSSELVASILEAAVQVLASEGAQRFTTARVAERAGVSIGSLYQYFPNKAAILFRL 73
+ + + E IL+ A+++ + +G + +A+ AGV+ G++Y +F +K+ + +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 74 QSDEWRRTTRLLGEILEDTTRPPLERLRRLVLAFVRSECEEAAIRVALSDAAPLYRDADE 133
L E PL LR +++ + S E R+ + E
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 134 AREVKAEGARVFQAFLREALPEVAEAERS 162
V+ + + +
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEA 151


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1400RTXTOXIND374e-04 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 36.7 bits (85), Expect = 4e-04
Identities = 32/195 (16%), Positives = 59/195 (30%), Gaps = 23/195 (11%)

Query: 432 RNLLLHPAVQANRVDTRFVESHLETLLAPIPASHPRLRAECPLA--------------ED 477
R L P + + F+ +HLE + P+ PRL A + E
Sbjct: 26 RKQLDTPVREK--DENEFLPAHLELIETPVSR-RPRLVAYFIMGFLVIAFILSVLGQVEI 82

Query: 478 AAPA--RVEAPLGSLPLSAPSSGVLVALEVADGERVRAGQRVAILEAMKMEFEVKAPGGG 535
A A ++ S + + ++ + V +GE VR G + L A+ E +
Sbjct: 83 VATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLK---- 138

Query: 536 IVRRLAASLGEPLEEGATLLFLEPTEDDDEQAPTEQALDLAHIRADLAEVLERQAALGDE 595
L + E +E + + + P E L +
Sbjct: 139 TQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTW 198

Query: 596 RRPQALARRRKTGQR 610
+ + +R
Sbjct: 199 QNQKYQKELNLDKKR 213


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1397HTHFIS575e-12 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 57.1 bits (138), Expect = 5e-12
Identities = 34/157 (21%), Positives = 58/157 (36%), Gaps = 5/157 (3%)

Query: 3 GRIIVADDHPLFREGMLSILQRLLPEARIEEAGDLAGVLRLADEGEQPDSLILDLRFPGL 62
I+VADD R + L R + + A + R G D ++ D+ P
Sbjct: 4 ATILVADDDAAIRTVLNQALSRA--GYDVRITSNAATLWRWIAAG-DGDLVVTDVVMPDE 60

Query: 63 TRIEMLADLRRRFPRTTLIVVSMVDDPQLIGEVMNAGADGFLGKSIAPEELGQAILAIRA 122
++L +++ P ++V+S + + GA +L K EL I RA
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG--RA 118

Query: 123 GEVLVRYEPSGLLPLQPSPRLEGLTERQLDVLRLLAQ 159
R Q L G + ++ R+LA+
Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLAR 155


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1396HTHFIS502e-08 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 49.8 bits (119), Expect = 2e-08
Identities = 31/124 (25%), Positives = 51/124 (41%), Gaps = 9/124 (7%)

Query: 416 LTGLRVCLVEDDRNVLRATSALLERWGCTVQ-AETEADGWRTDC----DILVVDYDLGPH 470
+TG + + +DD + + L R G V+ A WR D++V D + P
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVM-PD 59

Query: 471 ASGVECIERVRRQRGEAIPALVISGH-DIERIQASVEDTDIALLSKPVRPTELRATL-RA 528
+ + + R+++ +P LV+S + E L KP TEL + RA
Sbjct: 60 ENAFDLLPRIKK-ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 529 LRER 532
L E
Sbjct: 119 LAEP 122


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1384NUCEPIMERASE1848e-58 Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 184 bits (468), Expect = 8e-58
Identities = 85/353 (24%), Positives = 142/353 (40%), Gaps = 51/353 (14%)

Query: 1 MRVLVTGGAGFIGSHVLVELLGQGAKVVVLDNLVNGSSESLK--RVERITGHPVGFVLGD 58
M+ LVTG AGFIG HV LL G +VV +DNL + SLK R+E + F D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 59 VRDSLLVERLLIDEKVDAVIHLAGLKAVGESVDDPLEYYESNVQGTISLLRAMQRVGVFK 118
+ D + L + V AV S+++P Y +SN+ G +++L + +
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 119 IVFSSSATIYQMPGTLPISESSKVGGVASPYGRTKLTAEHM------LDDLARSDTRWSI 172
++++SS+++Y + +P S V S Y TK E M L L
Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGL-------PA 173

Query: 173 AVLRYFNPIGAHESGLIGEDPCGTPNNLLPYIAQVAVGRLSRLTVHGGDYPTI--DGTGV 230
LR+F G P G P+ +A+ + ++ + G + G
Sbjct: 174 TGLRFFTVYG----------PWGRPD--------MALFKFTKAMLEGKS-IDVYNYGKMK 214

Query: 231 RDYIHVCDLAAGHTRALEYLGQGHG---------------YHVWNLGTGTGYSVLQVIEA 275
RD+ ++ D+A R + + Y V+N+G + ++ I+A
Sbjct: 215 RDFTYIDDIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQA 274

Query: 276 FERVSGRRIPFTVSGRRPGDVAECWADVSKAERELGWKAGLGLECMIADAWRW 328
E G + +PGDV E AD +G+ ++ + + W
Sbjct: 275 LEDALGIEAKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNW 327


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1382BCTERIALGSPD1973e-56 Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D

signature.
Length = 660

Score = 197 bits (503), Expect = 3e-56
Identities = 129/668 (19%), Positives = 254/668 (38%), Gaps = 109/668 (16%)

Query: 104 SLNVEDVQLAAFINEVFGNILGLPFEIESALKEKTDRVTVRLEQPQTAQMVYEVARQVLV 163
S + + + FIN V L I+ +++ +TVR + Y+ VL
Sbjct: 31 SASFKGTDIQEFINTV-SKNLNKTVIIDPSVRGT---ITVRSYDMLNEEQYYQFFLSVLD 86

Query: 164 NYGVEILHQGDIYRFQIKQVGLSPDEPPILISGEARPSVPIAYRPVFQFVALHSVDPKDV 223
YG +++ + ++ + ++ +A P I V + V L +V +D+
Sbjct: 87 VYGFAVINMNNGVLKVVRSKDAKTA--AVPVASDAAPG--IGDEVVTRVVPLTNVAARDL 142

Query: 224 IPWLN--SAYEKSGLSVMADGARSGLMLKGMSSIVNQATEAVRLLDQPFMRGRHSLRIDP 281
P L + G V + + L++ G ++++ + V +D G S+ P
Sbjct: 143 APLLRQLNDNAGVGSVVHYEPSNV-LLMTGRAAVIKRLLTIVERVDNA---GDRSVVTVP 198

Query: 282 -AFVSAADMASQLKTVIAAQGYSVGIGEAVGSIMLVPLESSNGLIVFANDGQLLDLVREW 340
++ SAAD+ + + S G V +++ E +N ++V ++
Sbjct: 199 LSWASAADVVKLVTELNKDTSKSALPGSMVANVV--ADERTNAVLVSGEPNSRQRIIAM- 255

Query: 341 AQQVDRAPMAVAAGIGEEKEGLFFYEARNTRVTELAKSLRALVSGFAGEGAYGITSGLQS 400
+Q+DR + + ++L + L GI+S +QS
Sbjct: 256 IKQLDRQQATQG--------NTKVIYLKYAKASDLVEVLT------------GISSTMQS 295

Query: 401 SASKRSGGGRRAGDDGAAPAVAPLLQAAGAAALVGGDGANGLLGGLAAGISGSGTIVEDE 460
K++ A D + I
Sbjct: 296 E--KQAAKPVAALDK-------------------------------------NIIIKAHG 316

Query: 461 NRNAILFRGAARTWQQMQGLLREMDKPARQVLIEVTVASVSLSDTQELGVEWEMLNGSFN 520
NA++ A ++ ++ ++D QVL+E +A V +D LG++W N
Sbjct: 317 QTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWANKNAGMT 376

Query: 521 SATSTGSK-GSAGKGGFNYVINT--------------------AGGNTAA-IQAMADNQR 558
T++G +A G Y + GN A + A++ + +
Sbjct: 377 QFTNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQGNWAMLLTALSSSTK 436

Query: 559 VRVLATPRILVKSGEQANINVGRDIPIPTAQVNDDSTTAGSTNLRNEIAYRSTGTILNVA 618
+LATP I+ +A NVG+++P+ T S T N+ N + ++ G L V
Sbjct: 437 NDILATPSIVTLDNMEATFNVGQEVPVLTG-----SQTTSGDNIFNTVERKTVGIKLKVK 491

Query: 619 PVVYSDSRVDLTVSQELSDSGGSSGGGGKASGGGISAPEISRTSLETSLTLKSGGSVLMG 678
P + V L + QE+S ++ G + ++ ++ + SG +V++G
Sbjct: 492 PQINEGDSVLLEIEQEVSSVADAASSTSSDLG-----ATFNTRTVNNAVLVGSGETVVVG 546

Query: 679 GLIRDNITDSNAGVPLLKDIPGIGFLFGRQKAVKTREEVIMLIQPYVLESDADAREVTEK 738
GL+ +++D+ VPLL DIP IG LF ++ +++ I+P V+ + R+ +
Sbjct: 547 GLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYRQASSG 606

Query: 739 LHAMLSKT 746
+ +
Sbjct: 607 QYTAFNDA 614


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1379DHBDHDRGNASE1103e-31 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 110 bits (277), Expect = 3e-31
Identities = 61/185 (32%), Positives = 86/185 (46%), Gaps = 3/185 (1%)

Query: 5 KTLLITGASSGFGQALAREALDAGHRVVGTVRSEEARSALEAVAPGQAFGR---LLDVTD 61
K ITGA+ G G+A+AR G + + E + + +A DV D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 62 LAAIEPTVAAIERDIGPLDVLVNSAGYGHEGILEESPLAEMRRQFEVNLFGAVAMIQAVL 121
AAI+ A IER++GP+D+LVN AG G++ E F VN G ++V
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 122 PYMRRRRRGHILNITSMGGYITMPGIAYYCGSKFALEGVSEALGKEVAGLGIAVTAVAPG 181
YM RR G I+ + S + +A Y SK A ++ LG E+A I V+PG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 182 SFRTD 186
S TD
Sbjct: 189 STETD 193


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1377SACTRNSFRASE379e-06 Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 37.2 bits (86), Expect = 9e-06
Identities = 15/51 (29%), Positives = 23/51 (45%), Gaps = 1/51 (1%)

Query: 93 VAVAWQGKGVGSRLLGELLDIADNWMNLRRVELTVYTDNAPALALYRKFGF 143
VA ++ KGVG+ LL + ++ A + + L N A Y K F
Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKE-NHFCGLMLETQDINISACHFYAKHHF 146


55PA1322PA1316Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA1322212-2.623018TonB-dependent receptor
PA1321212-1.932818protoheme IX farnesyltransferase
PA1320212-1.948975cytochrome o ubiquinol oxidase subunit IV
PA1319111-1.676573cytochrome o ubiquinol oxidase subunit III
PA1318211-0.590805cytochrome o ubiquinol oxidase subunit I
PA13172121.214664cytochrome o ubiquinol oxidase subunit II
PA13162122.240044major facilitator superfamily transporter
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1322ENTEROVIROMP290.034 Enterobacterial virulence outer membrane protein si...
		>ENTEROVIROMP#Enterobacterial virulence outer membrane protein

signature.
Length = 171

Score = 29.5 bits (66), Expect = 0.034
Identities = 17/42 (40%), Positives = 23/42 (54%)

Query: 466 GTSRSTPSGKPTVRADSSDGKLSTRAGLVFKPLENGRVYFSY 507
G + + PT + D+SD S AGL F P+EN + FSY
Sbjct: 109 GYGKFQTTEYPTYKHDTSDYGFSYGAGLQFNPMENVALDFSY 150


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1316TCRTETB1073e-27 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 107 bits (269), Expect = 3e-27
Identities = 82/400 (20%), Positives = 164/400 (41%), Gaps = 15/400 (3%)

Query: 17 TFIASLDISIVNLALPTLQYALDTDLAGLQWVVDAYALCLSAFMLSSGPLSDRYGRKLTW 76
+F + L+ ++N++LP + + A WV A+ L S G LSD+ G K
Sbjct: 23 SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLL 82

Query: 77 LLGVGLFSFGSLLCALATS-LPLLLFGRAVQGIAGALLIPGALSILTQAFHDPGQRAQVI 135
L G+ + FGS++ + S LL+ R +QG GA P + ++ + R +
Sbjct: 83 LFGIIINCFGSVIGFVGHSFFSLLIMARFIQGA-GAAAFPALVMVVVARYIPKENRGKAF 141

Query: 136 GGWTSFSALSLILGPLLGGLLVEHAGWQSIFLINLPLGLLALALGLWGIEETAHPEHAAF 195
G S A+ +GP +GG++ + W + LI + + L +E H F
Sbjct: 142 GLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGH--F 199

Query: 196 DPLGQLLSVVWLGALTYALIAAGEGGWLSPTAWPALLLAGVGLLGFLFVERRTARPLLPL 255
D G +L V + + + L+++ + L F+ R+ P +
Sbjct: 200 DIKGIILMSVGIVFFMLFTTSY---------SISFLIVSVLSFLIFVKHIRKVTDPFVDP 250

Query: 256 GLFRQAGFAVCNLASFVLGFSGYASLFFLSLFFQQVQGASAQQAGF-YLAPQFLAMGALS 314
GL + F + L ++ + + + + V S + G + P +++
Sbjct: 251 GLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFG 310

Query: 315 MLFGRLQRHVPLRRLLVLGYLVIGLAMLALAACGTGTAYPWVGLLLVALGLGMGLAVPGT 374
+ G L +L +G + ++ L + T++ ++ +++V + G+
Sbjct: 311 YIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSW-FMTIIIVFVLGGLSFTKTVI 369

Query: 375 GLAVMASVARERSGMASATMNTLRQAGMAVGIALLGALLS 414
V +S+ ++ +G + +N GIA++G LLS
Sbjct: 370 STIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


56PA1299PA1255Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA12992100.397917hypothetical protein
PA1298190.388755hypothetical protein
PA1297090.615761metal transporter
PA1296-2120.5092372-hydroxyacid dehydrogenase
PA1295-1130.056072hypothetical protein
PA1294-1110.701475ribonuclease D
PA1293112-0.430247hypothetical protein
PA1292111-0.5542333-mercaptopyruvate sulfurtransferase
PA1291290.057115hypothetical protein
PA12902120.016049transcriptional regulator
PA12891140.237517hypothetical protein
PA12880110.056887hypothetical protein
PA1287-1101.825130glutathione peroxidase
PA1286-1112.554338major facilitator superfamily transporter
PA1285-192.408013transcriptional regulator
PA1284-1103.340582acyl-CoA dehydrogenase
PA12831134.663978transcriptional regulator
PA12822114.817121major facilitator superfamily transporter
PA12813125.209323adenosylcobinamide-GDP ribazoletransferase
PA12802115.380681hypothetical protein
PA12793115.447557nicotinate-nucleotide--dimethylbenzimidazole
PA1278194.990218bifunctional adenosylcobinamide
PA1277194.506822cobyric acid synthase
PA1276-183.361384threonine-phosphate decarboxylase
PA1275083.615224cobalamin biosynthesis protein CobD
PA1274-172.7896875,6-dimethylbenzimidazole synthase
PA1273-272.864534hydrogenobyrinate a,c-diamide synthase
PA1272-282.995030cob(I)yrinic acid a,c-diamide
PA1271-194.082427tonB-dependent receptor
PA1270275.429867hypothetical protein
PA12691115.524207transcriptional regulator
PA1268185.6967374-hydroxyproline 2-epimerase
PA1267395.592494hypothetical protein
PA1266275.162387oxidoreductase
PA1265193.666613hypothetical protein
PA1264-1113.760421transcriptional regulator
PA1263-1123.101720hypothetical protein
PA12620133.047869major facilitator superfamily transporter
PA12610132.165530transcriptional regulator
PA12601162.297651amino acid ABC transporter substrate-binding
PA12592152.303320hypothetical protein
PA12581132.061286ABC transporter permease
PA1257292.338153amino acid ABC transporter permease
PA12563102.588815amino acid ABC transporter ATP binding protein
PA1255282.848904trans-3-hydroxy-L-proline dehydratase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1295SSPAMPROTEIN280.003 Salmonella surface presentation of antigen gene typ...
		>SSPAMPROTEIN#Salmonella surface presentation of antigen gene type

M signature.
Length = 147

Score = 28.5 bits (63), Expect = 0.003
Identities = 18/63 (28%), Positives = 33/63 (52%)

Query: 1 MKRICSVYKSPRKSEMYLYVDKREALSRVPEALLVPFGAPQHVFDLLLTPERQLAREDVA 60
++R C+V+ S +S + Y D+ L EA++ + + D L RQL+RE++
Sbjct: 10 LQRRCTVFHSQCESILLRYQDEDRRLQVEEEAIVEQIAGLKLLLDTLRAENRQLSREEIY 69

Query: 61 KVL 63
+L
Sbjct: 70 ALL 72


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1290HTHTETR762e-19 TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 75.8 bits (186), Expect = 2e-19
Identities = 37/202 (18%), Positives = 72/202 (35%), Gaps = 10/202 (4%)

Query: 5 SRQQENAEATREALLESALSAFIEHGYGGVSIDAIAREARVTKGAFYHHFGSKQELLAEC 64
+ ++ A+ TR+ +L+ AL F + G S+ IA+ A VT+GA Y HF K +L +E
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 65 YERQVRTIAEDLDRVPAHVDKWAEAAALA--EAFIDSVMARGKRQL----SLQEVITVVG 118
+E I E A + ++S + +R+L + V
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 119 WE---RWKRIDSRHTLRYVGRLVDELAASGELK-DYRRETLVGQLYGFLTQAAMSLRDAR 174
+ +R + + + + + L D + G+++ + A
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAP 182

Query: 175 NKRQAANEVKAIIRDFLYSLRR 196
E + + L
Sbjct: 183 QSFDLKKEARDYVAILLEMYLL 204


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1286TCRTETB454e-07 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 44.9 bits (106), Expect = 4e-07
Identities = 33/133 (24%), Positives = 62/133 (46%), Gaps = 3/133 (2%)

Query: 53 LVWGLAQPFTGALADRYGAARAVLVGGLLYALGLVLMGLSQSASGLSLSAGLLIGLGLSG 112
L + + G L+D+ G R +L G ++ G V+ + S L + A + G G +
Sbjct: 60 LTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAG-AA 118

Query: 113 TSFSVILGAVGRAVPAEQRSMAMGISSAAGSFGQFAMLPGTLGLIG-WLGWSSALLALGL 171
++++ V R +P E R A G+ + + G+ + P G+I ++ WS LL +
Sbjct: 119 AFPALVMVVVARYIPKENRGKAFGLIGSIVAMGE-GVGPAIGGMIAHYIHWSYLLLIPMI 177

Query: 172 LVALIVPLAGLMK 184
+ + L L+K
Sbjct: 178 TIITVPFLMKLLK 190


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1283HTHTETR542e-11 TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 54.2 bits (130), Expect = 2e-11
Identities = 22/107 (20%), Positives = 41/107 (38%), Gaps = 1/107 (0%)

Query: 1 MGRRRTIDRDQLLDAAEAVIAREGAAGLTIDAVAKEMGITKGGVQYCFGTKDALIDAIFE 60
+ R +LD A + +++G + ++ +AK G+T+G + + F K L I+E
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 61 RWGKAYDSLFEAVAGKQP-TPLTRVRAHAEATQRSDELSSSKAAALM 106
L K P PL+ +R S + +
Sbjct: 65 LSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLME 111


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1282TCRTETB1941e-58 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 194 bits (495), Expect = 1e-58
Identities = 91/398 (22%), Positives = 177/398 (44%), Gaps = 14/398 (3%)

Query: 18 FLIIIDMTVLYTALPRLTHDLGATAAEKLWIVNAYPLVVAGLLPGAGLLSDRLGHKRLFL 77
F +++ VL +LP + +D A W+ A+ L + G LSD+LG KRL L
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 78 AGLPLFGLASLCAAFAPSAAA-LIAARAGLAVGAAVMMPATLSIVRHVFQDERERALAIG 136
G+ + S+ S + LI AR GAA PA + +V + + R A G
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAA-AFPALVMVVVARYIPKENRGKAFG 142

Query: 137 IWASVASAGAALGPVVGGVLLEFFWWGSVFLINVPVVVVALLLALPAIPACGGQSRRPWD 196
+ S+ + G +GP +GG++ + W +L+ +P++ + + L + + + +D
Sbjct: 143 LIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKEVRIKGHFD 200

Query: 197 ALGSLQVMFGLVGVVYAIKELSTRAPDFGLAVLAALGGMLCLYLFVRRQRRAREPMIDFA 256
G + + G+V + L T + +++ +L +FV+ R+ +P +D
Sbjct: 201 IKGIILMSVGIVFFM-----LFTTSYSISFLIVS----VLSFLIFVKHIRKVTDPFVDPG 251

Query: 257 LFRNRRFARGVAVALVATMALLGMELVFSQHLQLVQGLTPLKAG-LFVLPIPLASLVVGP 315
L +N F GV + + G + ++ V L+ + G + + P ++ ++ G
Sbjct: 252 LGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGY 311

Query: 316 LAGWLVPRWGENRVMCASLLFGSAGLLGLALSYQSATGAQLASLVLLGVGFGGAMTAAST 375
+ G LV R G V+ + F S L + ++ + +V + G T ST
Sbjct: 312 IGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVIST 371

Query: 376 AVMLNVDEQSSGMAAAIEDVSYELGGVIGVTLLGSLMS 413
V ++ +Q +G ++ + + L G+ ++G L+S
Sbjct: 372 IVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1271PF00577300.028 Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 30.2 bits (68), Expect = 0.028
Identities = 29/154 (18%), Positives = 38/154 (24%), Gaps = 37/154 (24%)

Query: 153 RRGDGQGAKPFFSAGYGTHQ-----TLEGSAGVSG-------GAGNGWYSLGVSSFDTAG 200
R G+ Q KP F H T+ G ++ G G +LG S D
Sbjct: 384 RSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMTQ 443

Query: 201 INTKRAGT-------------------------AGYEPDRDGYRNLSGNLRGGYRFDNGL 235
N+ GY GY N + N
Sbjct: 444 ANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIE 503

Query: 236 ELDGTLLRAKSHNDYDQVFGNSGFNANADGEQNL 269
DG + DY + N Q L
Sbjct: 504 TQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQL 537


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1267NUCEPIMERASE290.020 Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 29.4 bits (66), Expect = 0.020
Identities = 9/30 (30%), Positives = 15/30 (50%), Gaps = 1/30 (3%)

Query: 5 VIVVG-AGIVGSACAHELARRGLDVLVLDS 33
+V G AG +G + L G V+ +D+
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDN 32


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1262TCRTETB1111e-28 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 111 bits (278), Expect = 1e-28
Identities = 80/418 (19%), Positives = 163/418 (38%), Gaps = 30/418 (7%)

Query: 15 LCILLAGQLLPMIDFSIVNVALDALAHSLGASETELELIVAVYGVAFAVCLAMGGRLGDN 74
LCIL +++ ++NV+L +A+ + + + F++ A+ G+L D
Sbjct: 19 LCIL---SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75

Query: 75 YGRRRLFVLGVALFAVASLLCGLAGS-VWLLLVARALQGVGAALVVPQILATLHVSLSGH 133
G +RL + G+ + S++ + S LL++AR +QG GAA ++ + +
Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKE 135

Query: 134 AHSRALAAYGAIGGLAFVVGQVLGGFLVSADIGGLGWRSVFLINLPICLGILLCSRRWVP 193
+A G+I + VG +GG + + W +L+ +P+ I + +
Sbjct: 136 NRGKAFGLIGSIVAMGEGVGPAIGGMIAHY----IHWS--YLLLIPMITIITVPFLMKLL 189

Query: 194 ETRAEHAARVDAPGTLLLAALILCLLLPLALGPSLHWS-WPCALLLAAAVPLLAWLWRTE 252
+ D G +L++ I+ +L + L+
Sbjct: 190 KKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIF------------- 236

Query: 253 LRQERRQAWPLLPPSLLRLPSIRFGLLLAILFFACWSGFMFALALALQAGAGLSPVQAGN 312
++ R+ P + P L + G+L + F +GF+ + ++ LS + G+
Sbjct: 237 VKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGS 296

Query: 313 AFIALGA-SYFVSALLTARVAARIGPVRLLLLGCVIQMCGLLGLMLTLQRVWPQPGILNL 371
I G S + + + R GP+ +L +G L + +
Sbjct: 297 VIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFL-----LETTSWFM 351

Query: 372 APATLVIGFGQAFIVSSFFRIGLSEVPAAQAGAGSAMLATVQQASLGLGSALLGAVFA 429
+ + G +F + I S + +AGAG ++L S G G A++G + +
Sbjct: 352 TIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


57PA1243PA1228Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA1243-193.514480sensor/response regulator hybrid protein
PA1242084.512510hypothetical protein
PA12411104.120091transcriptional regulator
PA1240294.466159enoyl-CoA hydratase
PA12391104.252447hypothetical protein
PA12381104.070914multidrug efflux pump outer membrane protein
PA12370123.255598multidrug resistance efflux pump
PA12360142.541302major facilitator superfamily transporter
PA12350162.643991transcriptional regulator
PA12340132.564434hypothetical protein
PA12330122.907887hypothetical protein
PA12321133.255663hypothetical protein
PA12310123.029666hypothetical protein
PA12300123.056480hypothetical protein
PA12291102.866703transcriptional regulator
PA12282142.917325hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1243HTHFIS823e-18 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 81.8 bits (202), Expect = 3e-18
Identities = 33/119 (27%), Positives = 52/119 (43%), Gaps = 5/119 (4%)

Query: 742 THVLLVDDDRMVRYTTALLLGDLGYQVSEAASAEEALGEVERGLAPDLLVTDHLMADKTG 801
+L+ DDD +R L GY V ++A + G DL+VTD +M D+
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAG-DGDLVVTDVVMPDENA 62

Query: 802 VQLAEELRQRFPQLPVLVITGYANL----RPEQLNGFEVLTKPFRHNELAERLARLLEA 856
L +++ P LPVLV++ + + ++ L KPF EL + R L
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1242SUBTILISIN883e-21 Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 88.4 bits (219), Expect = 3e-21
Identities = 60/293 (20%), Positives = 104/293 (35%), Gaps = 51/293 (17%)

Query: 256 VRIGVIERDVDFDAPDFADYLGPCKAPAPRTCLYARDAERPDNHGSTVAGILAARWDQGG 315
V++ V++ D D PD + + + + HG+ VAG +AA
Sbjct: 43 VKVAVLDTGCDADHPDLKARIIGGRNFTDDDEGDPEIFKDYNGHGTHVAGTIAA----TE 98

Query: 316 NSGFLRGLDRASQGFEVIVERNSDAGITANVAASVN-LVEDGVRVLNWSWGIHRVGARDV 374
N + G+ + + V +G + + +E V +++ S G
Sbjct: 99 NENGVVGVAPEADLLIIKVLNKQGSGQYDWIIQGIYYAIEQKVDIISMSLG------GPE 152

Query: 375 DGDEVDSLVRSGIAMSGYEELLEEFFLWLRKEHPDVLVVNSAGN-GSSYSGTDEYRLPSS 433
D E+ V+ +A +LV+ +AGN G TDE P
Sbjct: 153 DVPELHEAVKKAVA-------------------SQILVMCAAGNEGDGDDRTDELGYPGC 193

Query: 434 FVTEQLLVVGGHQRSERQGLAVDDPAYAVKRSTSNVDMRVDVTAAACTHASTLERDARGE 493
+ +++ VG A++ +A SN + VD+ A ST+ +
Sbjct: 194 Y--NEVISVG----------AINFDRHAS--EFSNSNNEVDLVAPGEDILSTV-PGGKYA 238

Query: 494 VHCGTSYATPMVAGTVAAMLSLNPRLR-----PEEIRMLLRRSAMTIGGDYDF 541
GTS ATP VAG +A + L E+ L + + +G
Sbjct: 239 TFSGTSMATPHVAGALALIKQLANASFERDLTEPELYAQLIKRTIPLGNSPKM 291


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1241HTHTETR601e-13 TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 60.0 bits (145), Expect = 1e-13
Identities = 25/171 (14%), Positives = 53/171 (30%), Gaps = 11/171 (6%)

Query: 8 RDELLQRCAGTFRRYGYHGTTMEMLSSACGLTKASFYHHYPNKEALLRDVLEWTHQRLAE 67
R +L F + G T++ ++ A G+T+ + Y H+ +K L ++ E + + E
Sbjct: 13 RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGE 72

Query: 68 TLFSIAYDPLLTPRERLEKLGRKAARLFQDDSIGCLMGVVAVDASYGRSELMAPIRSFLD 127
P L ++ + L+ + + +
Sbjct: 73 LELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIF-----HKCEFVGEMAVVQ 127

Query: 128 DWAQAFAQLYRPAFDEA--QALERGRQLVADFEGAILLARIYGEPGYIDGV 176
+ ++ +E L AD + GYI G+
Sbjct: 128 QAQRNLCLESYDRIEQTLKHCIEAK-MLPADLMTRRAAIIMR---GYISGL 174


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1237RTXTOXIND1225e-33 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 122 bits (307), Expect = 5e-33
Identities = 61/368 (16%), Positives = 110/368 (29%), Gaps = 68/368 (18%)

Query: 66 AVSAQVSGYVAEVLVADDADVQAGDLLLRLDPRDFR-------QRLRAAEAREAAAQAAL 118
+ + V E++V + V+ GD+LL+L L A + Q
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILS 157

Query: 119 EAQ-------------------------------RAKLETLDRQLLEQAQTISRARADGE 147
+ + + T Q ++ + + RA+
Sbjct: 158 RSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERL 217

Query: 148 AARAEWRRAETDWR-------RYRQLADEHATSRQRLENADAAHQRARAAARRASAEEGR 200
A R E R + L + A ++ + + + A R ++ +
Sbjct: 218 TVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQ 277

Query: 201 QRAAKDVLKSR--------RREAEAALAQRQAELQEAAAARELARHALDDTEIRAPFAGR 252
+ K + E L Q + + IRAP + +
Sbjct: 278 IESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVK 337

Query: 253 VGQRKVRLR-QYVTPGLPLLAVVPLEQAYVV-ANYKETQLERIRPGQPVELEVDTFGRRW 310
V Q KV VT L+ +VP + V A + + I GQ ++V+ F
Sbjct: 338 VQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTR 397

Query: 311 RGRVDSVAPASGAVFALLPPDNATGNFTKIVQRFPVRIRLDADAAERG----RLLPGMSV 366
G + + D +V F V I ++ + G L GM+V
Sbjct: 398 YGYLV-------GKVKNINLDAIEDQRLGLV--FNVIISIEENCLSTGNKNIPLSSGMAV 448

Query: 367 IATVDTRE 374
A + T
Sbjct: 449 TAEIKTGM 456


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1236TCRTETB1097e-28 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 109 bits (273), Expect = 7e-28
Identities = 79/402 (19%), Positives = 168/402 (41%), Gaps = 17/402 (4%)

Query: 23 FMAGMNVHVTSAALPEIEGALGATFEEGSWISTAYLVAEISMIPLTAWLVEVFSLRRVML 82
F + +N V + +LP+I +W++TA+++ + L + ++R++L
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 83 LGSLVFLLSSLSCALAPN-LSTLILIRVIQGASGAVLIPLSMQLILTELPSSRIPLGMAL 141
G ++ S+ + + S LI+ R IQGA A L M ++ +P L
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 142 FSLSNSVAQAAGPSIGGWLADAYSWRWIFLLQLLPGIALLAAVAWSIRPRDGDRERLRQA 201
++ + GP+IGG +A W ++ L+ ++ I + + + R +
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKL---LKKEVRIK-GHF 199

Query: 202 DWLGIGAMVAGLGALQIVLEEGGRRDWFESGFIRTFAVLAVLALLLFVQRQLWGARPFIN 261
D GI M G+ + F + + +F +++VL+ L+FV+ PF++
Sbjct: 200 DIKGIILMSVGIVFFML----------FTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVD 249

Query: 262 LRLLGSYNFGVSSLAMAVFGAATFGLVFLVPNYLSQLQGFNARQIGDSLILYGLVQLLL- 320
L + F + L + G V +VP + + + +IG +I G + +++
Sbjct: 250 PGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIF 309

Query: 321 APLLPRLMRWLNPKLLVAGGFAIMALGCWMGAHLNADAGRNVIIPSIVVRGIGQPLIMVA 380
+ L+ P ++ G +++ ++ A + + IV G
Sbjct: 310 GYIGGILVDRRGPLYVLNIGVTFLSVS-FLTASFLLETTSWFMTIIIVFVLGGLSFTKTV 368

Query: 381 LSVLAVKGLDKAQAGSASALISMLRNLGGAIGTALLTQLVSL 422
+S + L + +AG+ +L++ L G A++ L+S+
Sbjct: 369 ISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSI 410


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1235HTHFIS339e-04 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 33.3 bits (76), Expect = 9e-04
Identities = 14/103 (13%), Positives = 31/103 (30%), Gaps = 6/103 (5%)

Query: 87 RHDLPRDCRVVDVPPLLRQLIVAAMRIAPDYPPGGRDERVMELILDELRVLPILALHVPQ 146
R + + R + + + ++ + D L + + +
Sbjct: 376 REIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAE 435

Query: 147 PVDPRLAALCRSLRAEPAADWSLGDAARRLGVSPRTLTRAFQR 189
P + L A A + AA LG++ TL + +
Sbjct: 436 MEYPLI------LAALTATRGNQIKAADLLGLNRNTLRKKIRE 472


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1231RTXTOXIND664e-14 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 65.6 bits (160), Expect = 4e-14
Identities = 43/214 (20%), Positives = 75/214 (35%), Gaps = 39/214 (18%)

Query: 79 RSYRLAVRQREAELEQARETLRQRDEQWRRRMQLAGAVSREEVANAGRALRIARARAEAA 138
R Y+ + Q E+E+ A+E + + ++ + LR
Sbjct: 269 RVYKSQLEQIESEILSAKEEYQLVTQLFKNEIL--------------DKLRQTTDNIGLL 314

Query: 139 GAALDQARLDLRRSELRSPVDGYVTQLRVQ-PGDYAAAGRTNIFIV-DRRSFWVTGYFEE 196
L + + S +R+PV V QL+V G T + IV + + VT +
Sbjct: 315 TLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQN 374

Query: 197 TKLRNVQVGAPATIKLMGFD----PLLDGHVASIGRGVADLNESRADSGLPQVSPNFSWI 252
+ + VG A IK+ F L G V +I D+ Q
Sbjct: 375 KDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNIN----------LDAIEDQRLGLV--- 421

Query: 253 RLAQRVPVRIELDRVPA---GVVLAAGMTGSVEV 283
V + IE + + + L++GM + E+
Sbjct: 422 ---FNVIISIEENCLSTGNKNIPLSSGMAVTAEI 452



Score = 47.5 bits (113), Expect = 3e-08
Identities = 18/114 (15%), Positives = 41/114 (35%), Gaps = 3/114 (2%)

Query: 41 VSAQVIRIAPEVSGSVEAVFVADNQRVARGDPLYRIDPRSYRLAVRQREAELEQARETLR 100
S + I P + V+ + V + + V +GD L ++ + ++ L QAR +
Sbjct: 92 HSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLE-Q 150

Query: 101 QRDEQWRRRMQLAGAVSREEVANAGRALRIARARAEAAGAALDQARLDLRRSEL 154
R + R ++L E + +L + + +++
Sbjct: 151 TRYQILSRSIELNKL--PELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQK 202


58PA1084PA1076Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA1084313-1.829681flagellar basal body P-ring protein
PA1083313-2.687494flagellar basal body L-ring protein
PA1082313-3.140516flagellar basal body rod protein FlgG
PA1081112-3.391830flagellar basal body rod protein FlgF
PA1080112-3.379175flagellar hook protein FlgE
PA1079014-3.667849flagellar basal body rod modification protein
PA1078013-3.732849flagellar basal body rod protein FlgC
PA1077112-2.972483flagellar basal-body rod protein FlgB
PA1076212-2.699993hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1084FLGPRINGFLGI436e-155 Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 436 bits (1122), Expect = e-155
Identities = 168/366 (45%), Positives = 224/366 (61%), Gaps = 10/366 (2%)

Query: 7 LLALAALLLAAGAAQAERLKDIASIQGVRTNQLIGYGLVVGLSGSGDQTTQTPFTLQTFN 66
AL L A R+KDIAS+Q R NQLIGYGLVVGL G+GD +PFT Q+
Sbjct: 13 FSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMR 72

Query: 67 NMLAQFGIKVPANVGNVQLKNVAAVSVHADLPPFAKPGQPIDVTVSSIGNAKSLRGGSLL 126
ML GI G KN+AAV V A+LPPFA PG +DVTVSS+G+A SLRGG+L+
Sbjct: 73 AMLQNLGITTQG--GQSNAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLI 130

Query: 127 MTPLKGIDGQVYAVAQGNLVVGGFDAEGRDGSKITVNVPSAGRIPAGATVERAVPSGFDQ 186
MT L G DGQ+YAVAQG L+V GF A+G D + +T V ++ R+P GA +ER +PS F
Sbjct: 131 MTSLSGADGQIYAVAQGALIVNGFSAQG-DAATLTQGVTTSARVPNGAIIERELPSKFKD 189

Query: 187 GNSLTLNLNRPDFTTAKRIVDRINEL----LGPGVAHAVDGGSVRVSAPLDPNQRVDYLS 242
+L L L PDF+TA R+ D +N G +A D + V P ++
Sbjct: 190 SVNLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKP-RVADLTRLMA 248

Query: 243 ILENLDVQPGEAVAKVIINSRTGTIVIGQNVKVSPAAVTHGSLTVSITEDPIVSQPGAFS 302
+ENL V+ + AKV+IN RTGTIVIG +V++S AV++G+LTV +TE P V QP FS
Sbjct: 249 EIENLTVET-DTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQPAPFS 307

Query: 303 NGQTAVVPRSRVNAEEETKPMFKFGPGTTLDDIVRAVNQVGAAPSDLMAILEALKQAGAL 362
GQTAV P++ + A +E + G L +V +N +G ++AIL+ +K AGAL
Sbjct: 308 RGQTAVQPQTDIMAMQEGSKVA-IVEGPDLRTLVAGLNSIGLKADGIIAILQGIKSAGAL 366

Query: 363 QADLIV 368
QA+L++
Sbjct: 367 QAELVL 372


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1083FLGLRINGFLGH1803e-59 Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 180 bits (459), Expect = 3e-59
Identities = 81/224 (36%), Positives = 112/224 (50%), Gaps = 13/224 (5%)

Query: 12 IATALGGCVNPPPKPNDPYYAPVLPRTPLPAAQNNGAIYQAGF-----EQNLYDDRKAFR 66
+ +L GC P P P P P NG+I+Q+ Q L++DR+
Sbjct: 15 LVLSLTGCAWIPSTPLVQGATSAQP-VPGPTPVANGSIFQSAQPINYGYQPLFEDRRPRN 73

Query: 67 VGDIITITLNEKTQASKKANSDIQKDSKTKMGLTSLFGSGMTTNNPIGGGDLSLSAEYGG 126
+GD +TI L E ASK ++++ +D KT G + G + E G
Sbjct: 74 IGDTLTIVLQENVSASKSSSANASRDGKTNFGFDT---VPRYLQGLFGNARADV--EASG 128

Query: 127 SRDAKGDSQAGQSNSLTGSITVTVAEVLPNGILSVRGEKWMTLNTGNELVRIAGLVRADD 186
G A SN+ +G++TVTV +VL NG L V GEK + +N G E +R +G+V
Sbjct: 129 GNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRFSGVVNPRT 188

Query: 187 IATDNTVSSTRVADARITYSGTGAFADASQPGWLDRFF--LSPL 228
I+ NTV ST+VADARI Y G G +A GWL RFF LSP+
Sbjct: 189 ISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 232


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1082FLGHOOKAP1452e-07 Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 44.6 bits (105), Expect = 2e-07
Identities = 13/51 (25%), Positives = 25/51 (49%)

Query: 209 NGLGTVAQNTLENSNVNVVEELVNMITTQRAYEMNSKVISTADQMLSFVTQ 259
N + ++ S VN+ EE N+ Q+ Y N++V+ TA+ + +
Sbjct: 494 NVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544



Score = 39.6 bits (92), Expect = 9e-06
Identities = 20/79 (25%), Positives = 35/79 (44%), Gaps = 14/79 (17%)

Query: 3 SALWVSKTGLSAQDMNLTTISNNLANVSTTGFKRDRAEFQDLLYQIRRQPGGQSTQDSEL 62
S + + +GL+A L T SNN+++ + G+ R + +S L
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTI--------------MAQANSTL 47

Query: 63 PSGLQLGTGVRVVGTQKIF 81
+G +G GV V G Q+ +
Sbjct: 48 GAGGWVGNGVYVSGVQREY 66


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1080FLGHOOKAP1455e-07 Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 44.9 bits (106), Expect = 5e-07
Identities = 17/49 (34%), Positives = 27/49 (55%)

Query: 414 ALQSGALEASNVDISNELVNLIVHQRNYQANAKTIQTEDAVTQTIINLR 462
L + S V++ E NL Q+ Y ANA+ +QT +A+ +IN+R
Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546



Score = 41.1 bits (96), Expect = 8e-06
Identities = 22/69 (31%), Positives = 34/69 (49%), Gaps = 3/69 (4%)

Query: 2 SFNIGLSGIQAASSGLNVTGNNIANAGTVGFKQSRAEFADVYAASVLGSGSNPQGSGVLL 61
N +SG+ AA + LN NNI++ G+ + A A S LG+G G+GV +
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQ--ANSTLGAGGW-VGNGVYV 59

Query: 62 SDVSQMFKQ 70
S V + +
Sbjct: 60 SGVQREYDA 68


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1078FLGHOOKAP1363e-05 Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 36.1 bits (83), Expect = 3e-05
Identities = 8/38 (21%), Positives = 20/38 (52%)

Query: 107 NVNVVEEMADMISASRAFQTNAEMMNTAKQMMQKVLTL 144
VN+ EE ++ + + NA+++ TA + ++ +
Sbjct: 508 GVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545



Score = 29.5 bits (66), Expect = 0.004
Identities = 15/54 (27%), Positives = 25/54 (46%), Gaps = 2/54 (3%)

Query: 4 ASVFNIAGSGMSAQSTRLNTVASNIANAETVSSSVDKTYRARHPVFSTMFQQAQ 57
+S+ N A SG++A LNT ++NI++ + T A ST+
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMA--QANSTLGAGGW 52


59PA1061PA1054Y        NNGenomic Island
LocusTagDNBiasCDNBias%GCBiasProduct
PA10612100.371769hypothetical protein
PA10603100.334044hypothetical protein
PA1059313-0.056775monovalent cation/H+ antiporter subunit G
PA10583120.205796monovalent cation/H+ antiporter subunit F
PA10572110.140893monovalent cation/H+ antiporter subunit E
PA10562111.435556monovalent cation/H+ antiporter subunit D
PA10550100.880215monovalent cation/H+ antiporter subunit C
PA10542111.320763monovalent cation/H+ antiporter subunit A
60PA0999PA0938Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA0999123-3.6851663-oxoacyl-ACP synthase
PA0998230-5.288557hypothetical protein
PA0997333-5.975727hypothetical protein
PA0996436-6.610518anthranilate--CoA ligase
PA0995538-8.257211methylated-DNA--protein-cysteine
PA0994437-8.148888usher CupC3
PA0993239-8.002229chaperone CupC2
PA0992232-6.765278fimbrial subunit CupC1
PA0990448-11.394204hypothetical protein
PA0989754-13.550331hypothetical protein
PA0988959-14.368692hypothetical protein
PA0987858-14.050572hypothetical protein
PA0986874-17.479703hypothetical protein
PA0985873-17.016547pyocin S5
PA0984556-11.805241colicin immunity protein
PA0983245-8.707761hypothetical protein
PA0982141-7.922864hypothetical protein
PA0981238-7.219771hypothetical protein
PA0979125-4.601720hypothetical protein
PA0978020-4.384315hypothetical protein
PA0976121-4.590070*7-cyano-7-deazaguanine synthase
PA0975221-5.131640radical activating enzyme
PA0974121-4.832310hypothetical protein
PA0973121-5.270237peptidoglycan associated lipoprotein OprL
PA0972120-4.941468translocation protein TolB
PA0971223-5.194336translocation protein TolA
PA0970224-4.543055translocation protein TolR
PA0969122-4.214492translocation protein TolQ
PA0968217-3.822335hypothetical protein
PA0967115-3.671872Holliday junction ATP-dependent DNA helicase
PA0966114-3.484782Holliday junction ATP-dependent DNA helicase
PA0965113-3.343513crossover junction endodeoxyribonuclease RuvC
PA0964013-3.366052transcriptional regulator PmpR
PA0963-113-3.806443aspartate--tRNA ligase
PA0962019-4.280600DNA-binding stress protein
PA0961-114-3.889531cold-shock protein
PA0960-213-3.839468hypothetical protein
PA0959-113-3.553487hypothetical protein
PA0958-113-3.651593porin D
PA0957016-3.009211hypothetical protein
PA0956-114-2.999041proline--tRNA ligase
PA0955-125-3.342883hypothetical protein
PA0954229-3.487967acylphosphatase
PA0953125-3.127114thioredoxin
PA0952221-2.723814hypothetical protein
PA0951a220-3.147663hypothetical protein
PA0951120-2.190151hypothetical protein
PA0950016-1.351070arsenate reductase
PA0949016-1.468739NAD(P)H dehydrogenase
PA0948-115-2.462543hypothetical protein
PA0947-117-2.782772DNA replication initiation factor
PA0946017-2.567955hypothetical protein
PA0945-217-3.690831phosphoribosylformylglycinamidine cyclo-ligase
PA0944-124-4.687336phosphoribosylglycinamide formyltransferase
PA0943123-5.309334hypothetical protein
PA0942227-5.294916transcriptional regulator
PA0941320-5.428295hypothetical protein
PA0940419-4.658629hypothetical protein
PA0939115-3.902850hypothetical protein
PA0938117-3.183860hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0999PF04183300.016 IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 29.8 bits (67), Expect = 0.016
Identities = 13/67 (19%), Positives = 27/67 (40%), Gaps = 5/67 (7%)

Query: 214 EGGGEFLMRGRPMFEHASQTLVRIAGEMLAAHELTLD-DIDHVICHQPNLRILDAVQEQL 272
+G + P + Q + + + +A L D H + LR + + +L
Sbjct: 444 QGDMRLVKEEFPEMDSLPQEVRDVTSRL-SADYLIHDLQTGHFVTV---LRFISPLMVRL 499

Query: 273 GIPQHKF 279
G+P+ +F
Sbjct: 500 GVPERRF 506


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0994PF005777880.0 Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 788 bits (2037), Expect = 0.0
Identities = 282/861 (32%), Positives = 443/861 (51%), Gaps = 47/861 (5%)

Query: 14 VYSRSSCLVALGLALPAVTFAVEFNAEFLNNEGGAPVELKYFENGNSVSPGTYSVDIHLN 73
+ R A P + + FN FL ++ A +L FENG + PGTY VDI+LN
Sbjct: 26 FFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYLN 85

Query: 74 QIMIRREDVVFSADSETGSVRPVIRVGLLKEIGVDIARLTRDKLIPANLENNTPLNVAEL 133
+ DV F+ + P + L +G++ A ++ L+ ++ + + +
Sbjct: 86 NGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLA----DDACVPLTSM 141

Query: 134 IPGASVEFDVNSLSLLVSIPQLYVQRHSRGYVDPSLWDDGVTALFSNYQANFTRNTN-FG 192
I A+ + DV L ++IPQ ++ +RGY+ P LWD G+ A NY + N G
Sbjct: 142 IHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIG 201

Query: 193 QNSDYRYLGLRNGFNLFGWRLRNDSSLS-----GGTGMRNKFSSNRTYVERDIRALKGTL 247
NS Y YL L++G N+ WRLR++++ S +G +NK+ T++ERDI L+ L
Sbjct: 202 GNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRL 261

Query: 248 SLGELYTSAQGDAFESVRMRGVQLQSDIGMLPDNEISYTPVVRGIAETNATVEVSQNGFV 307
+LG+ YT GD F+ + RG QL SD MLPD++ + PV+ GIA A V + QNG+
Sbjct: 262 TLGDGYTQ--GDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYD 319

Query: 308 IYSTNVPPGAFEITDIYPSGSNGDLEVKIIEADGRQRSFKQSYSYLPVMTRKGNLRYGLA 367
IY++ VPPG F I DIY +G++GDL+V I EADG + F YS +P++ R+G+ RY +
Sbjct: 320 IYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSIT 379

Query: 368 AGEYHNDG--QPSVNLLQGSAVYGLSDRVTGFGGLLAAEKYNATNLGLGFNT-PLGGLSA 424
AGEY + Q Q + ++GL T +GG A++Y A N G+G N LG LS
Sbjct: 380 AGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSV 439

Query: 425 DVTHSQSRTRRGGRNQGQSLRLLYSKTINATETSFTVVGYRYSTEGYRTLSQH------- 477
D+T + S ++ GQS+R LY+K++N + T+ +VGYRYST GY +
Sbjct: 440 DMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNG 499

Query: 478 --------IDDMSDESYLYGSSSSRQKSRIDLTVNQTLFRRSSLYLTAGETTYWNRPGSS 529
+ + + Y + + ++ ++ LTV Q L R S+LYL+ TYW
Sbjct: 500 YNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVD 559

Query: 530 RRVQFGFSSGIKRASYSLAVSRTQETGSFGRSDTQFTASVSIPLGG--------SARSSQ 581
+ Q G ++ + +++L+ S T+ GR D +V+IP R +
Sbjct: 560 EQFQAGLNTAFEDINWTLSYSLTKNAWQKGR-DQMLALNVNIPFSHWLRSDSKSQWRHAS 618

Query: 582 VYANAVSSQHGDSSLNTGISGYLDEANAFNYSAQANYSKDG----GNSGSVGLGWDTSKA 637
+ +G + G+ G L E N +YS Q Y+ G G++G L +
Sbjct: 619 ASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYG 678

Query: 638 KLSANYSQGRDNKQINLGASGSVVVHPGGVTFGQPVGETFGLVEVPEVGGVGLDGYSSVR 697
+ YS D KQ+ G SG V+ H GVT GQP+ +T LV+ P ++ + VR
Sbjct: 679 NANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGVR 738

Query: 698 TDGRGYAVLPYMQPYRYNWVNLDTNTLGSDTEISDSTQMAVPTRGAVIAKRFSAESGRRV 757
TD RGYAVLPY YR N V LDTNTL + ++ ++ VPTRGA++ F A G ++
Sbjct: 739 TDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKL 798

Query: 758 QFDLSMDSGGKIPFGAQAYDKEERVVGMVDNLSRLLVFGIEDQGRLSIRWSDG---SCSV 814
L+ + +PFGA + + G+V + ++ + G+ G++ ++W + C
Sbjct: 799 LMTLTHN-NKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHCVA 857

Query: 815 DYQLPPRNKDLTYERVALSCR 835
+YQLPP ++ +++ CR
Sbjct: 858 NYQLPPESQQQLLTQLSAECR 878


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0985CHANLCOLICIN1652e-47 Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 165 bits (419), Expect = 2e-47
Identities = 100/394 (25%), Positives = 174/394 (44%), Gaps = 35/394 (8%)

Query: 129 EKQSSLSIYEAWVKIWEKNSWEERKKYPFQQLVRDELERAVAYYK-QDSLSE---AVKVL 184
++ + EA K +++ ++ + +L+ A A K +LSE AV++
Sbjct: 135 AEEKARKEAEAAEKAFQEAEQRRKEIEREKAETERQLKLAEAEEKRLAALSEEAKAVEIA 194

Query: 185 RQELNKQKALKEKED------LSQLERDYRTRKANLE-------------MKVQSELDQA 225
+++L+ ++ K D S+L R A ++ K + +
Sbjct: 195 QKKLSAAQSEVVKMDGEIKTLNSRLSSSIHARDAEMKTLAGKRNELAQASAKYKELDELV 254

Query: 226 GSALPPLVSPTPEQWLERATRLVTQAIADKKQLQTTNNTLIKNSPTPLEKQKAIYNGELL 285
P P + ATR A +++ Q + S T + + A
Sbjct: 255 KKLSPRANDPLQNRPFFEATRRRVGAGKIREEKQKQ----VTASETRINRINADITQI-- 308

Query: 286 VDEIASLQARLVKLNAETTRRRTEAERKAAEE----QALQDAIKFTADFYKEVTEKFGAR 341
+ A Q + E K A+ ++DA+ T FY+ +TEK+G +
Sbjct: 309 --QKAISQVSNNRNAGIARVHEAEENLKKAQNNLLNSQIKDAVDATVSFYQTLTEKYGEK 366

Query: 342 TSEMARQLAEGARGKNIRSSAEAIKSFEKHKDALNKKLSLKDRQAIAKAFDSLDKQMMAK 401
S+MA++LA+ ++GK I + EA+ +FEK+KD LNKK S DR AI A S+ AK
Sbjct: 367 YSKMAQELADKSKGKKIGNVNEALAAFEKYKDVLNKKFSKADRDAIFNALASVKYDDWAK 426

Query: 402 SLEKFSKGFGVVGKAIDAASLYQEFKISTETGDWKPFFVKIETLAAGAAASWLVGIAFAT 461
L++F+K + G + + +TGDWKP F+ +E AA A S++V + F+
Sbjct: 427 HLDQFAKYLKITGHVSFGYDVVSDILKIKDTGDWKPLFLTLEKKAADAGVSYVVALLFSL 486

Query: 462 ATATPIGILGFALVMAVTGAMIDEDLLEKANNLV 495
T +GI G A+V + + ID++ L N ++
Sbjct: 487 LAGTTLGIWGIAIVTGILCSYIDKNKLNTINEVL 520


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0974RTXTOXIND320.002 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 32.1 bits (73), Expect = 0.002
Identities = 10/53 (18%), Positives = 19/53 (35%)

Query: 69 QLQQMQDELARLRGTLEEQQNQIQQLKQESLERYQDLDRRISGGGAPAAQNSA 121
+ + +EL + LE+ +++I K+E Q I N
Sbjct: 260 KYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIG 312


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0973OMPADOMAIN1166e-34 OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 116 bits (291), Expect = 6e-34
Identities = 34/112 (30%), Positives = 53/112 (47%), Gaps = 12/112 (10%)

Query: 68 YFEYDSSDLKPEAMRALDVHA---KDLKGSGQRVVLEGHTDERGTREYNMALGERRAKAV 124
F ++ + LKPE ALD +L VV+ G+TD G+ YN L ERRA++V
Sbjct: 222 LFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQSV 281

Query: 125 QRYLVLQGVSPAQLELVSYGKERPVATGHDEQS---------WAQNRRVELK 167
YL+ +G+ ++ G+ PV + A +RRVE++
Sbjct: 282 VDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIE 333


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0971IGASERPTASE492e-08 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 49.3 bits (117), Expect = 2e-08
Identities = 36/204 (17%), Positives = 71/204 (34%), Gaps = 21/204 (10%)

Query: 54 QLKSKSQATTQTNQKIAGEAKKTASKQYE-----VEQLEQKKLEQQKLEQQKLEQQQVAA 108
Q + TT N + + + +++ + E +Q +
Sbjct: 990 QTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESK 1049

Query: 109 AKAAEQKKADEARKAEAQKAAEAKKADEAKKAAEAKAAEQKKQADIAKKRAEDEAKKKAA 168
++ A E + A EAK +A + ++A ++ E K+
Sbjct: 1050 TVEKNEQDATETTAQNREVAKEAKSNVKA----------NTQTNEVA--QSGSETKETQT 1097

Query: 169 EDAKKKAAEDAKKKAAEEAKKKAAAEAAKKKAAVEAAKKKAAAAAAAARKAAEDKKARAL 228
+ K+ A + ++KA E +K E K + V ++++ A A E+ +
Sbjct: 1098 TETKETATVEKEEKAKVETEKT--QEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNI 1155

Query: 229 AELLS--DTTERQQALADEVGSEV 250
E S +TT + A E S V
Sbjct: 1156 KEPQSQTNTTADTEQPAKETSSNV 1179


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA096960KDINNERMP290.017 60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 29.1 bits (65), Expect = 0.017
Identities = 17/72 (23%), Positives = 28/72 (38%), Gaps = 13/72 (18%)

Query: 12 WSLISNASIVVQLVMLTLVAASVTSWIMIFQRGNAMRAAKKALDAFEERFWS-----GID 66
+S+I + +V+ +M L A TS MR + + A ER +
Sbjct: 356 FSIII-ITFIVRGIMYPLTKAQYTSM-------AKMRMLQPKIQAMRERLGDDKQRISQE 407

Query: 67 LSKLYRQAGSNP 78
+ LY+ NP
Sbjct: 408 MMALYKAEKVNP 419


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0963ANTHRAXTOXNA320.009 Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 31.6 bits (71), Expect = 0.009
Identities = 31/117 (26%), Positives = 51/117 (43%), Gaps = 23/117 (19%)

Query: 212 YYQIAKCFRDEDLRADRQPEFTQIDIETSFLDESDIIGITEKMVRQLFKEVL-------D 264
YY+I K + + D+ + +++ S D+SD ++ + Q FKE L D
Sbjct: 170 YYEIGKGISLDIISKDKSLDPEFLNLIKSLSDDSD---SSDLLFSQKFKEKLELNNKSID 226

Query: 265 VEF-----DEFPHMPFEEAMRRYGSDKPDLRIPLEL-----VDVADQLKEVEFKVFS 311
+ F EF H F A Y + PD R LEL + ++L++ F+ S
Sbjct: 227 INFIKENLTEFQHA-FSLAFSYYFA--PDHRTVLELYAPDMFEYMNKLEKGGFEKIS 280


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0962HELNAPAPROT1573e-52 Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family

signature.
Length = 153

Score = 157 bits (398), Expect = 3e-52
Identities = 50/145 (34%), Positives = 72/145 (49%)

Query: 11 DRAAIAEGLSRLLADTYTLYLKTHNFHWNVTGPMFNTLHLMFEGQYTELAVAVDDIAERI 70
++ + L+ L++ + LY K H FHW V GP F TLH FE Y A VD IAER+
Sbjct: 9 NQTLVENSLNTQLSNWFLLYSKLHRFHWYVKGPHFFTLHEKFEELYDHAAETVDTIAERL 68

Query: 71 RALGFPAPGTYAAYARLSSIKEEEGVPEAEEMIRQLVQGQEAVVRTARSIFPLLDKVSDE 130
A+G T Y +SI + A EM++ LV + + ++ + L ++ D
Sbjct: 69 LAIGGQPVATVKEYTEHASITDGGNETSASEMVQALVNDYKQISSESKFVIGLAEENQDN 128

Query: 131 PTADLLTQRMQVHEKTAWMLRSLLA 155
TADL ++ EK WML S L
Sbjct: 129 ATADLFVGLIEEVEKQVWMLSSYLG 153


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0956ANTHRAXTOXNA300.031 Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 30.1 bits (67), Expect = 0.031
Identities = 10/54 (18%), Positives = 22/54 (40%)

Query: 208 HEFHVLANSGEDDIVFSDSSDYAANIEKAEAVPRESARGSATEDMRLVDTPNTK 261
V E + + DYA N E+++ V E +G + + + + + +
Sbjct: 138 ASRFVFEKKRETPKLIINIKDYAINSEQSKEVYYEIGKGISLDIISKDKSLDPE 191


61PA0830PA0805Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA0830012-3.015659hypothetical protein
PA0829115-2.964620hydrolase
PA0828220-3.679521transcriptional regulator
PA0827329-6.387270hypothetical protein
PA0826343-8.891205hypothetical protein
PA0825243-7.277434hypothetical protein
PA0824341-6.510106hypothetical protein
PA0823535-6.657106hypothetical protein
PA0821128-4.351435hypothetical protein
PA0820118-1.202307hypothetical protein
PA08192152.755566hypothetical protein
PA08182143.822773hypothetical protein
PA08172123.134187ring-cleaving dioxygenase
PA08162113.434507transcriptional regulator
PA08152103.182595transcriptional regulator
PA08144103.465530hypothetical protein
PA0813282.725000hypothetical protein
PA0812191.871033hypothetical protein
PA08111112.385616major facilitator superfamily transporter
PA0810-190.978180haloacid dehalogenase
PA0809-1100.659563divalent metal cation transporter MntH
PA0808-113-0.616838hypothetical protein
PA0807-112-0.019045protein AmpDh3
PA08062111.065125hypothetical protein
PA08052121.087226hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0828HTHTETR522e-10 TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 52.3 bits (125), Expect = 2e-10
Identities = 29/193 (15%), Positives = 70/193 (36%), Gaps = 13/193 (6%)

Query: 39 RTNIQLAAIPVFTRKGVAETTANDLLEAARVSRRTFYKYFAGKLEVLESIYHSAVQLLLA 98
R +I A+ +F+++GV+ T+ ++ +AA V+R Y +F K ++ I+ + +
Sbjct: 13 RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGE 72

Query: 99 RFGGLRSEAGSD-EDWLRAMVSLFFDYHL---AVGPIIRMMQEEALHAGS--PLAAHRQR 152
+++ D LR ++ + + ++ ++ + G + ++
Sbjct: 73 LELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRN 132

Query: 153 AHLKIVELWTERLG---AQGAAHDALTYRVLIWAMEAASLELLN----ASDPLELPRVKR 205
L+ + + L L R M L+ A +L + R
Sbjct: 133 LCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKEAR 192

Query: 206 VLGDLLVGTLCPR 218
+L+
Sbjct: 193 DYVAILLEMYLLC 205


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0811TCRTETA340.001 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 33.6 bits (77), Expect = 0.001
Identities = 30/155 (19%), Positives = 58/155 (37%), Gaps = 7/155 (4%)

Query: 5 LAANPTQRYRWVILLIATFAQACACFFVQGIGAI-----AVFVQNDLQLSSLQIGLLVSA 59
A NP +RW + A F +Q +G + +F ++ + IG+ ++A
Sbjct: 195 EALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAA 254

Query: 60 AQLVPIVG-LLVAGELLDRYSERLVVGLGTLIVALALCASLWATDYLTILLFLVVVGAGY 118
++ + ++ G + R ER + LG + +AT +V++ +G
Sbjct: 255 FGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASG- 313

Query: 119 STAQPGGSKSVSRWFAKTQLGFAMGIRQAGLPLGG 153
P +SR + + G G A L
Sbjct: 314 GIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTS 348


62PA0743PA0715Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA07432121.803370NAD-dependent L-serine dehydrogenase
PA07424111.395858hypothetical protein
PA07412111.549042hypothetical protein
PA07403100.830314SDS hydrolase SdsA1
PA07393120.974413transcriptional regulator
PA07383120.134358hypothetical protein
PA07371130.169082hypothetical protein
PA0736a-113-0.170608hypothetical protein
PA0736013-0.144248hypothetical protein
PA0735014-0.891956hypothetical protein
PA0734116-2.614533hypothetical protein
PA0733117-3.88602916S rRNA pseudouridine(516) synthase
PA0732021-5.197099hypothetical protein
PA0731-126-5.470052hypothetical protein
PA0730-126-6.070397(R)-3-hydroxydecanoyl-ACP:CoA transacylase
PA0729131-5.796972*hypothetical protein
PA0728129-5.100697bacteriophage integrase
PA0727228-4.711650hypothetical protein
PA0726330-3.670429hypothetical protein
PA0725423-3.584509hypothetical protein
PA0724630-2.960911phage coat protein A
PA0723533-2.314679phage coat protein B
PA07221268-14.182695hypothetical protein
PA07211281-18.621185hypothetical protein
PA07201276-17.310948helix destabilizing protein of bacteriophage
PA07191181-18.136060hypothetical protein
PA07181078-18.890334hypothetical protein
PA07161074-17.823382hypothetical protein
PA0715747-10.734761hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0741NUCEPIMERASE351e-04 Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 35.1 bits (81), Expect = 1e-04
Identities = 29/124 (23%), Positives = 45/124 (36%), Gaps = 21/124 (16%)

Query: 1 MKIALIGATGHVGHYFLNEALQRGHAV-----------TALVRDPSKLAARDGLCVVQAD 49
MK + GA G +G + L+ GH V +L + +L A+ G + D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 50 VSDPAQVASAVAGHE---VVISAFNGGWGSADLRARHA------AGSQAILDGVKRSGVP 100
++D + A V IS L HA G IL+G + + +
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLA-VRYSLENPHAYADSNLTGFLNILEGCRHNKIQ 119

Query: 101 RLLV 104
LL
Sbjct: 120 HLLY 123


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0724cloacin432e-06 Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 42.8 bits (100), Expect = 2e-06
Identities = 23/57 (40%), Positives = 29/57 (50%), Gaps = 1/57 (1%)

Query: 248 GSDGGGDGNGGGNNNGGGNDGGTGNGGDGSGGGDGNGGGDGSGDGDGSGTGGDGNGT 304
G G GG ++ G + GG GSG G GGG G G+G G+G G G+GT
Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNPWGG-GSGSGIHWGGGSGHGNGGGNGNSGGGSGT 77



Score = 42.4 bits (99), Expect = 3e-06
Identities = 23/60 (38%), Positives = 29/60 (48%), Gaps = 1/60 (1%)

Query: 248 GSDGGGDGNGGGNNNGGGNDGGTGNGGDGSGGGDGNGGGDGSGDGDGSGTGGDGNGTCDP 307
G DG+G + N G G G G GNGGG+G+ G GSGTGG+ + P
Sbjct: 29 VGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNS-GGGSGTGGNLSAVAAP 87



Score = 39.7 bits (92), Expect = 2e-05
Identities = 22/56 (39%), Positives = 29/56 (51%)

Query: 248 GSDGGGDGNGGGNNNGGGNDGGTGNGGDGSGGGDGNGGGDGSGDGDGSGTGGDGNG 303
G G G G G + +G ++ GG GSG G G G G+G G+G+ GG G G
Sbjct: 23 GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTG 78



Score = 37.8 bits (87), Expect = 8e-05
Identities = 21/70 (30%), Positives = 26/70 (37%), Gaps = 14/70 (20%)

Query: 249 SDGGGDGNGGGNNNGGGNDGGTGNGGDGSGG--------------GDGNGGGDGSGDGDG 294
S G G G+ G ++ GN G G GG G G+G G G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 295 SGTGGDGNGT 304
G GG +
Sbjct: 62 HGNGGGNGNS 71



Score = 28.9 bits (64), Expect = 0.039
Identities = 12/38 (31%), Positives = 16/38 (42%)

Query: 243 DPTTPGSDGGGDGNGGGNNNGGGNDGGTGNGGDGSGGG 280
+P GS G GG + GG +G +G G G
Sbjct: 44 NPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81


63PA0656PA0611Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA065629-3.472457HIT family protein
PA065537-3.1493012-nonaprenyl-3-methyl-6-methoxy-1,4-benzoquinol
PA065438-3.496348S-adenosylmethionine decarboxylase
PA0653111-3.257989hypothetical protein
PA0652114-4.002073cAMP-regulatory protein
PA0651125-4.925276indole-3-glycerol phosphate synthase
PA0650229-6.102127anthranilate phosphoribosyltransferase
PA0649440-8.568711anthranilate synthase component II
PA0648749-9.638724hypothetical protein
PA0647427-4.666980hypothetical protein
PA0646425-4.269665hypothetical protein
PA0645320-2.775403hypothetical protein
PA0644318-2.260406hypothetical protein
PA0643217-1.725130hypothetical protein
PA0641-115-0.870535bacteriophage protein
PA0640-116-0.844320bacteriophage protein
PA0639-216-0.724406hypothetical protein
PA0638-116-1.672284bacteriophage protein
PA0637019-1.875532hypothetical protein
PA0636121-1.192236hypothetical protein
PA06353230.192140hypothetical protein
PA0634121-0.391725hypothetical protein
PA0633123-0.319107hypothetical protein
PA06312240.293895hypothetical protein
PA0630126-0.149443hypothetical protein
PA0629125-0.470073hypothetical protein
PA0628025-1.167690hypothetical protein
PA0627-122-1.371133hypothetical protein
PA0626-125-2.101905hypothetical protein
PA0625034-3.894574hypothetical protein
PA0624132-5.174963hypothetical protein
PA0623032-5.006966bacteriophage protein
PA0622134-4.859984bacteriophage protein
PA0621138-4.784747hypothetical protein
PA0620035-3.693737bacteriophage protein
PA0619-225-0.550107bacteriophage protein
PA0618-124-0.585807bacteriophage protein
PA0617-1220.588942bacteriophage protein
PA06160160.351335hypothetical protein
PA0615-116-0.259380hypothetical protein
PA0614012-1.299963hypothetical protein
PA0613010-1.756428hypothetical protein
PA0612210-1.803555repressor PtrB
PA0611212-2.104543HTH-type transcriptional regulator PrtR
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0641CHANLCOLICIN300.046 Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 30.4 bits (68), Expect = 0.046
Identities = 39/203 (19%), Positives = 77/203 (37%), Gaps = 15/203 (7%)

Query: 886 QALKAVPAKTAPPNAAYWSD-IGQSL-ETANGLAQQVASHTAEISEL----DGSLTAQAS 939
QA +A A A A D + Q L + N + AS T +EL + ++ A+
Sbjct: 69 QAARAKAAAEAQAKAKANRDALTQRLKDIVNEALRHNASRTPSATELAHANNAAMQAEDE 128

Query: 940 RLGVLQA--ATRDDADDGNGAMADALRGWKTVARAAQEETVRATENEAQATRTTLLEART 997
RL + +A R +A+ A +A + K + R + ET R + A+A A
Sbjct: 129 RLRLAKAEEKARKEAEAAEKAFQEAEQRRKEIERE-KAETERQLK-LAEAEEKR--LAAL 184

Query: 998 ADAEGRIATVERV---ATSDRQATAQRLDQLSASIGGTAASLQSEQTARANADSALAQRI 1054
++ + ++ A S+ + L++ + + + +E A + LAQ
Sbjct: 185 SEEAKAVEIAQKKLSAAQSEVVKMDGEIKTLNSRLSSSIHARDAEMKTLAGKRNELAQAS 244

Query: 1055 DTVQARTDTNSAAIQTTSQAVTS 1077
+ + + + +
Sbjct: 245 AKYKELDELVKKLSPRANDPLQN 267


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0630PYOCINKILLER325e-04 Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 31.7 bits (71), Expect = 5e-04
Identities = 14/49 (28%), Positives = 25/49 (51%), Gaps = 2/49 (4%)

Query: 51 LEALLDEQQRALAAVRASAERRAKDVEQALGEARAQAAEQYAAAVRLLQ 99
L+ ++ A A++ A+A +A+ EQA EA+ +A EQ +
Sbjct: 200 LQIRMNTLTAAKASIEAAAANKAR--EQAAAEAKRKAEEQARQQAAIRA 246


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0625PF07132320.011 Harpin protein (HrpN)
		>PF07132#Harpin protein (HrpN)

Length = 356

Score = 31.6 bits (71), Expect = 0.011
Identities = 23/57 (40%), Positives = 30/57 (52%)

Query: 621 GSLAGAALGASIGSVVPVVGTLIGGLVGGAIGAWGGSELGGRLGRSLAGDPPAASDN 677
GS+ G LG +G + +G L GGL+GG +G GS LG LG +L G A
Sbjct: 62 GSMMGGGLGGGLGGLGSSLGGLGGGLLGGGLGGGLGSSLGSGLGSALGGGLGGALGA 118


64PA0596PA0567Y        NYGenomic Island
LocusTagDNBiasCDNBias%GCBiasProduct
PA0596-113-3.009662hypothetical protein
PA0595-213-2.913244organic solvent tolerance protein OstA
PA0594-112-0.102154chaperone SurA
PA0593011-2.1925794-hydroxythreonine-4-phosphate dehydrogenase
PA0592012-3.344605ribosomal RNA small subunit methyltransferase A
PA0591012-4.131123protein ApaG
PA0590012-3.920146bis(5'-nucleosyl)-tetraphosphatase
PA0589112-3.450010thiosulfate sulfurtransferase
PA0588010-3.446908hypothetical protein
PA058708-1.971703hypothetical protein
PA058608-1.014101hypothetical protein
PA05850101.259168hypothetical protein
PA05841110.703287multifunctional tRNA nucleotidyl
PA05830120.302388hypothetical protein
PA058219-0.337190dihydroneopterin aldolase
PA0581210-1.569021glycerol-3-phosphate acyltransferase PlsY
PA0580011-1.415825tRNA N6-adenosine threonylcarbamoyltransferase
PA0579013-2.27033530S ribosomal protein S21
PA0578012-2.300481hypothetical protein
PA0577-210-1.868370DNA primase
PA0576-210-1.667501RNA polymerase sigma factor RpoD
PA0575012-0.970470hypothetical protein
PA0574213-1.053973*hypothetical protein
PA05730100.010899hypothetical protein
PA0572090.319456hypothetical protein
PA0571-1131.380158hypothetical protein
PA05701130.645379hypothetical protein
PA05691131.156657hypothetical protein
PA05681130.058644hypothetical protein
PA05672151.169205hypothetical protein
65PA0502PA0489Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA0502-113-3.541633biotin biosynthesis protein BioH
PA0501015-3.4445018-amino-7-oxononanoate synthase
PA0500118-4.735625biotin synthase
PA0499120-3.364884pili assembly chaperone
PA0498119-3.219750hypothetical protein
PA0497114-0.946667hypothetical protein
PA04962101.348878hypothetical protein
PA0495390.937977hypothetical protein
PA0494282.465754acetyl-CoA carboxylase biotin carboxylase
PA04931112.319301acetyl-CoA carboxylase biotin carboxyl carrier
PA04920122.390027hypothetical protein
PA0491-1112.259624transcriptional regulator
PA04903111.953093hypothetical protein
PA04892112.109950phosphoribosyl transferase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0493RTXTOXIND313e-04 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 31.0 bits (70), Expect = 3e-04
Identities = 9/41 (21%), Positives = 21/41 (51%)

Query: 36 SVIGLIEVMKQFSEVQAGQAGILQAFHVEDGEAIEPGQVLA 76
+ G + + E++ + I++ V++GE++ G VL
Sbjct: 85 TANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLL 125


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0489ACRIFLAVINRP290.019 Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 29.0 bits (65), Expect = 0.019
Identities = 16/66 (24%), Positives = 26/66 (39%), Gaps = 4/66 (6%)

Query: 118 GLPRPARLLPVPLAPRRERRRGFNQAQQLAERLAGEL----DLRCDPHSLRRVLDTPAQQ 173
G + A + V L P ER N A+ + R EL D P ++ +++
Sbjct: 618 GQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGFVIPFNMPAIVELGTAT 677

Query: 174 GLDATV 179
G D +
Sbjct: 678 GFDFEL 683


66PA0475PA0467Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA04752110.346554transcriptional regulator
PA04741110.136739esterase
PA04732100.857956glutathione S-transferase
PA04722111.217160RNA polymerase sigma factor
PA04711111.032098transmembrane sensor
PA0470390.418551ferrichrome receptor FiuA
PA0469192.121113hypothetical protein
PA04682112.352651hypothetical protein
PA04672111.222490hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0475HTHTETR477e-09 TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 46.9 bits (111), Expect = 7e-09
Identities = 30/171 (17%), Positives = 57/171 (33%), Gaps = 8/171 (4%)

Query: 2 PKKSNAAERIVHATASLLASRGYFGTGLSDIIARAEAPKGSLYHYFPEGKPQIASAAIGF 61
+ + I+ L + +G T L +I A +G++Y +F K + S
Sbjct: 7 QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFK-DKSDLFSEIWEL 65

Query: 62 VADEVAS-FLDRAGTQAPHARNVLRQ-FTATLRGWLEHSRFEEACPVLSTSLSIDAELAP 119
+ L+ +VLR+ L + R ++ E+A
Sbjct: 66 SESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAV 125

Query: 120 VHAECRRALRAWHASIERALRADGLAEKLAASR-----AWLILAALEGAVA 165
V R + IE+ L+ A+ L A A ++ + G +
Sbjct: 126 VQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLME 176


67PA0279PA0273Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA02793103.286387transcriptional regulator
PA02784103.425060hypothetical protein
PA02774112.727553hypothetical protein
PA02765122.778528hypothetical protein
PA02756112.628003transcriptional regulator
PA02745132.473962hypothetical protein
PA02732122.808169major facilitator superfamily transporter
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0273TCRTETB290.042 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 28.7 bits (64), Expect = 0.042
Identities = 28/131 (21%), Positives = 47/131 (35%), Gaps = 6/131 (4%)

Query: 228 LAPYYL--EQGWSAQESGLLLGFLTAMEV-LSGLLAPALASRSRDRRPVLVGLTALMLAG 284
+ PY + S E G ++ F M V + G + L R R VL +
Sbjct: 278 MVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDR-RGPLYVLNIGVTFLSVS 336

Query: 285 FLGLAWAPASLPLLWALCLGLGIGGLFPMGLIVC--LDHFDAPQRAGQLAALVQGAGYLI 342
FL ++ + + + +GGL ++ + Q AG +L+ +L
Sbjct: 337 FLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLS 396

Query: 343 AGVSPWIAGLL 353
G I G L
Sbjct: 397 EGTGIAIVGGL 407


68PA0217PA0211Y        NNGenomic Island
LocusTagDNBiasCDNBias%GCBiasProduct
PA02170123.481879transcriptional regulator
PA02160124.525550malonate transporter MadM
PA02150125.333088malonate transporter MadL
PA02142116.141943acyl transferase
PA02133104.596395phosphoribosyl-dephospho-CoA transferase
PA02121104.185795malonate decarboxylase subunit gamma
PA02111103.131146malonate decarboxylase subunit beta
69PA0117PA0106Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA01172111.099413short-chain dehydrogenase
PA01164121.157089hypothetical protein
PA01155101.981798hypothetical protein
PA01145101.934377cytochrome c oxidase assembly protein SenC
PA01134121.483651protoheme IX farnesyltransferase
PA01122121.388589hypothetical protein
PA0111215-0.986837hypothetical protein
PA0110314-1.766586hypothetical protein
PA0109415-2.644690hypothetical protein
PA0108311-1.210910cytochrome C oxidase subunit III
PA0107410-0.955765cytochrome C oxidase assembly protein
PA01063100.263967cytochrome C oxidase subunit I
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0117DHBDHDRGNASE732e-17 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 72.8 bits (178), Expect = 2e-17
Identities = 47/192 (24%), Positives = 87/192 (45%), Gaps = 5/192 (2%)

Query: 5 KAVLVMGAGDATGGAIARRFAREGYVACVARRNAEKLEPLVQAIRDQGGEALACGCDARQ 64
K + GA G A+AR A +G N EKLE +V +++ + A A D R
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 65 EQQVIDLFARIEGEVGALEAVIFNVGANVWFPITETTERVYRKVWEMAAFGGFLTGREAA 124
+ ++ ARIE E+G ++ ++ G I ++ + + + + G F R +
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 125 RVMLPRQRGTIIFTGATASLRGRAHFAAFSGAKFALRALAQSMARELGPKDI--HVAHPI 182
+ M+ R+ G+I+ G+ + R AA++ +K A + + EL +I ++ P
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP- 187

Query: 183 IDGAIDTDFIRE 194
G+ +TD
Sbjct: 188 --GSTETDMQWS 197


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0115SACTRNSFRASE290.006 Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 28.8 bits (64), Expect = 0.006
Identities = 13/48 (27%), Positives = 23/48 (47%), Gaps = 8/48 (16%)

Query: 88 RGQGLGHQLMERALQ-AAER----LWLDTPVYLSAQAHLQAYYGRYGF 130
R +G+G L+ +A++ A E L L+T + H Y ++ F
Sbjct: 102 RKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHF---YAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0114PF06057280.026 Type IV secretory pathway VirJ component
		>PF06057#Type IV secretory pathway VirJ component

Length = 243

Score = 27.9 bits (62), Expect = 0.026
Identities = 18/81 (22%), Positives = 33/81 (40%), Gaps = 10/81 (12%)

Query: 72 GRWHLLFFGYTFCPDVCPTTLAQLRELQGKLPQEVRDDL-QVVFVSVDPNRDTPQQIKQY 130
G ++ GY+F +V P L + +P R ++ V +S + D + +
Sbjct: 115 GTQKVILIGYSFGAEVIPFVLNE-------MPARYRKNVLGAVLLSPSQSSDFEIHVSEM 167

Query: 131 LGYFNAGFQGLTGTPENIQKL 151
+ N + LT PE + K
Sbjct: 168 VTSDNQSARYLTL-PE-VNKQ 186


70PA0083PA0058Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA0083-1103.270276hypothetical protein
PA0082-1133.372991hypothetical protein
PA00810153.519804Fha domain-containing protein
PA00800153.363020hypothetical protein
PA0079-1153.358618hypothetical protein
PA00780153.668814hypothetical protein
PA00770143.815056type VI secretion protein IcmF
PA00760113.392109hypothetical protein
PA00751113.009088serine/threonine phosphatase
PA0074092.922439serine/threonine protein kinase PpkA
PA0073282.468673ABC transporter ATP-binding protein
PA0072191.536245hypothetical protein
PA0071090.487800hypothetical protein
PA0070-1130.764405hypothetical protein
PA00690121.071817hypothetical protein
PA00680101.247664hypothetical protein
PA00671101.360930oligopeptidase A
PA00662111.881688hypothetical protein
PA00652111.6832255'-nucleotidase
PA00642111.561860hypothetical protein
PA00632112.027280hypothetical protein
PA00620151.486779hypothetical protein
PA0061-1142.531317hypothetical protein
PA0060-1121.909274hypothetical protein
PA0059-1122.615817osmotically inducible protein OsmC
PA0058-2133.061886hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0081PF05616424e-06 Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 42.4 bits (99), Expect = 4e-06
Identities = 25/71 (35%), Positives = 32/71 (45%), Gaps = 10/71 (14%)

Query: 232 TPAPSATPVAQPLPTAEPTPLAMPFADPGITQQPQPQPQPQPQPQPQP--------QPQP 283
TP + P AQPLP E +P P +P + P +P P+P P P QP
Sbjct: 316 TPGSAEAPNAQPLP--EVSPAENPANNPAPNENPGTRPNPEPDPDLNPDANPDTDGQPGT 373

Query: 284 QPASVAAPTPP 294
+P S A P P
Sbjct: 374 RPDSPAVPDRP 384



Score = 35.9 bits (82), Expect = 3e-04
Identities = 29/96 (30%), Positives = 34/96 (35%), Gaps = 25/96 (26%)

Query: 180 PRPDHVPAEQHDFRPPEPVIPPPPATTPAPPPAGGAPLIPADWDPFAELLGNTPAPSATP 239
PRPD P P P P +PA PA N PAP+ P
Sbjct: 311 PRPDLTPGSAE-----APNAQPLPEVSPAENPA------------------NNPAPNENP 347

Query: 240 VAQPLPTAEPTPLAMPFADPGITQQPQPQPQPQPQP 275
+P P EP P P A+P QP +P P
Sbjct: 348 GTRPNP--EPDPDLNPDANPDTDGQPGTRPDSPAVP 381


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0078OMPADOMAIN741e-16 OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 74.2 bits (182), Expect = 1e-16
Identities = 40/138 (28%), Positives = 60/138 (43%), Gaps = 16/138 (11%)

Query: 318 AQRVAVEDAVDRSVVTIRGDELFASASASVRDEFQPLLLRIADALRKVK---GQVLVTGH 374
A A V T++ D LF A+++ E Q L ++ L + G V+V G+
Sbjct: 201 APAPAPAPEVQTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGY 260

Query: 375 SDNRPIATLRYPSNWKLSQARAQEVADLLGATTGDAGRFTAEGRSDTEPVATNASAEGRA 434
+D + Y N LS+ RAQ V D L + A + +A G ++ PV N +
Sbjct: 261 TDRI--GSDAY--NQGLSERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQ 316

Query: 435 R---------NRRVEITV 443
R +RRVEI V
Sbjct: 317 RAALIDCLAPDRRVEIEV 334


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0074PF03544387e-05 Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 38.4 bits (89), Expect = 7e-05
Identities = 21/104 (20%), Positives = 32/104 (30%), Gaps = 4/104 (3%)

Query: 260 DRLAPSALEATQIRPLATPQGSPRASNPPPAEPAPLPPADLGGLQPVSIQLPPVTPSAGG 319
+AP+ LE Q P+ P P EP P PP + + P P
Sbjct: 53 TMVAPADLEPPQ-AVQPPPE--PVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVK 109

Query: 320 ATPPPPPPSQAA-KPPSPPPPPLPPAKPRAGGSRTPLIAAAAAA 362
P + P+ P PA+P + + +
Sbjct: 110 KVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSV 153



Score = 30.3 bits (68), Expect = 0.030
Identities = 19/93 (20%), Positives = 28/93 (30%), Gaps = 2/93 (2%)

Query: 270 TQIRPLATPQGSPRASNPPPAEPAPLPPADLGGLQPVSIQLPPVTPSAGGATPPPPPPSQ 329
Q+ L P + PA+ P P A +PV P P P +
Sbjct: 38 HQVIELPAPAQPISVTMVAPADLEP-PQAVQPPPEPVVEPEPEPEPIPE-PPKEAPVVIE 95

Query: 330 AAKPPSPPPPPLPPAKPRAGGSRTPLIAAAAAA 362
KP P P + P+ + A+
Sbjct: 96 KPKPKPKPKPKPVKKVEQPKRDVKPVESRPASP 128



Score = 29.9 bits (67), Expect = 0.046
Identities = 22/111 (19%), Positives = 30/111 (27%), Gaps = 9/111 (8%)

Query: 261 RLAPSALEATQIRPLATPQGSP---------RASNPPPAEPAPLPPADLGGLQPVSIQLP 311
A + P P+ P P +P P P + + +
Sbjct: 63 PQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVE 122

Query: 312 PVTPSAGGATPPPPPPSQAAKPPSPPPPPLPPAKPRAGGSRTPLIAAAAAA 362
S T P P S A + P + PRA P A A A
Sbjct: 123 SRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQA 173


71PA5531PA5524N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA5531110-2.537519transporter TonB
PA5530110-2.819871MFS dicarboxylate transporter
PA5529111-1.541190sodium/proton antiporter
PA5528-1120.017095hypothetical protein
PA5527-2120.169435hypothetical protein
PA5526-2120.162714hypothetical protein
PA5525-1110.711416transcriptional regulator
PA55240100.049212short-chain dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5531TONBPROTEIN1144e-32 Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 114 bits (286), Expect = 4e-32
Identities = 62/190 (32%), Positives = 92/190 (48%), Gaps = 14/190 (7%)

Query: 139 PTPQPPAAAPEPTPPKIEEPKPEPPKPKPVEKPKPKPKPKPKPVENAIPKAKPKPEPKPK 198
P P A +P P + EP+PEP K P KPKP PK K + +PK
Sbjct: 52 PADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPK 111

Query: 199 PEPEPSTEASSQPSPSSAAPPPAPTVGQSTPGAQTAPSGSQGPAGLPSGSLNDSDIKPLR 258
+ +P + P +P + ++ + + + S + S + L
Sbjct: 112 RDVKPV-----------ESRPASPFENTAPARLTSSTATAATSKPVTSVA---SGPRALS 157

Query: 259 MDPPVYPRMAQARGIEGRVKVLFTITSDGRIDDIQVLESVPSRMFDREVRQAMAKWRFEP 318
+ P YP AQA IEG+VKV F +T DGR+D++Q+L + P+ MF+REV+ AM +WR+EP
Sbjct: 158 RNQPQYPARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEP 217

Query: 319 RVSGGKIVAR 328
G IV
Sbjct: 218 GKPGSGIVVN 227


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5530TCRTETA356e-04 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 34.8 bits (80), Expect = 6e-04
Identities = 16/48 (33%), Positives = 26/48 (54%), Gaps = 1/48 (2%)

Query: 286 AATLFLFMLLQPIVGALSDKIGRRPILIAFGVLGTVFTYPILSTLHSV 333
A + P++GALSD+ GRRP+L+ + G Y I++T +
Sbjct: 50 ALYALMQFACAPVLGALSDRFGRRPVLLV-SLAGAAVDYAIMATAPFL 96



Score = 34.4 bits (79), Expect = 9e-04
Identities = 37/192 (19%), Positives = 74/192 (38%), Gaps = 33/192 (17%)

Query: 49 KAFFPQGDMTAQLLNTAAIFAVGFLMRPIGGWLMGIYADRKGRKAALLASVLLMCFGSLI 108
+ D+TA A++A LM+ ++G +DR GR+ LL S+ I
Sbjct: 33 RDLVHSNDVTAHYGILLALYA---LMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAI 89

Query: 109 IALTPSYETIGVAAPILLVVARLLQGLSVGGEYGTSATYLSEMANKEQR----GFFSSFQ 164
+A P +L + R++ G++ G + Y++++ + ++R GF S+
Sbjct: 90 MATAPFLW--------VLYIGRIVAGIT-GATGAVAGAYIADITDGDERARHFGFMSACF 140

Query: 165 YVTLISGQLIALAVLIVLQQTLTVEQLESWGWRVPFFIGA----LCAVVAMFLRRGMEET 220
+++G ++ + + PFF A L + FL +
Sbjct: 141 GFGMVAGPVLG-------------GLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKG 187

Query: 221 ESFSKKKEEPKE 232
E ++E
Sbjct: 188 ERRPLRREALNP 199


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5529RTXTOXINA320.008 Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family

signature.
Length = 1024

Score = 31.9 bits (72), Expect = 0.008
Identities = 11/44 (25%), Positives = 23/44 (52%)

Query: 360 PVAVAVSAITTLLTPYLIRAADPLSQHLANAMPQRMARIFGHYG 403
PV+ V A+T +++ L + + +H+A+ M +A +G
Sbjct: 394 PVSALVGAVTGIISGILEASKQAMFEHVASKMADVIAEWEKKHG 437


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5524DHBDHDRGNASE1045e-29 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 104 bits (260), Expect = 5e-29
Identities = 78/261 (29%), Positives = 119/261 (45%), Gaps = 24/261 (9%)

Query: 9 GQVALISGAGSELGIGFAIARRLAREGVRLL-ITASSERIRQRAEELSACGHDVRAASAD 67
G++A I+GA GIG A+AR LA +G + + + E++ + L A A AD
Sbjct: 8 GKIAFITGAAQ--GIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 68 LTDEAQVQGLLDWAEAQWGRVDILVNNAGMAQLDSAEPFSAVEATSLRDWQLSLSRNLTS 127
+ D A + + E + G +DILVN AG+ + + + S +W+ + S N T
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRP------GLIHSLSDEEWEATFSVNSTG 119

Query: 128 AFLLTRGLLPGMRERGYGRIVNVASTTGTRGSNPGEAAYSAAKAGLVGWSMGLALEVAKS 187
F +R + M +R G IV V S AAY+++KA V ++ L LE+A+
Sbjct: 120 VFNASRSVSKYMMDRRSGSIVTVGSNPAGV-PRTSMAAYASSKAAAVMFTKCLGLELAEY 178

Query: 188 GITVNSVAPG-------WIATASSTAEER-------QAALASPSGRAGRPEEVAAAVAFL 233
I N V+PG W A E+ P + +P ++A AV FL
Sbjct: 179 NIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFL 238

Query: 234 ASPEASFVNGELLVVDGGNCL 254
S +A + L VDGG L
Sbjct: 239 VSGQAGHITMHNLCVDGGATL 259


72PA5487PA5476N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA5487-1100.273331hypothetical protein
PA5486-210-0.644583hypothetical protein
PA5485-19-1.763861protein AmpDh2
PA5484-19-1.499581two-component sensor
PA5483-19-1.777938two-component response regulator AlgB
PA5482-19-1.936703hypothetical protein
PA5481-110-1.653558hypothetical protein
PA5479-110-0.811063glutamate/aspartate:proton symporter
PA5478-180.118314hypothetical protein
PA5477-280.039168hypothetical protein
PA5476-270.407600citrate transporter
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5487FLGHOOKFLIK340.001 Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 34.4 bits (78), Expect = 0.001
Identities = 32/155 (20%), Positives = 50/155 (32%), Gaps = 25/155 (16%)

Query: 142 LSELSRLQRQALAERKGGDAEDGRPSLLQRLFGGKESETTAEPSASVPSVVAASNTPIQ- 200
++ + + + K D + + L LF PS V + P
Sbjct: 104 MALAAVADKNTTKDEKADDLNEDVTASLSALFAMLPGFDNTPKVTDAPSTVLPTEKPTLF 163

Query: 201 PAAAAPSLPVAEHDEAPGGPPQPLPARTVAAIESAPAGWVGVAERGEPNQILLDEPREIW 260
+ L A+ D+APG P QPL T E+ +
Sbjct: 164 TKLTSEQLTTAQPDDAPGTPAQPL---TPLVAEAQS---------------------KAE 199

Query: 261 LDSLPLPAGLSFSETLEEAGAEPSPAMPADVESAP 295
+ S P P + S + +P P + A V SAP
Sbjct: 200 VISTPSPVTAAASPLITPHQTQPLPTVAAPVLSAP 234


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5484PF06580531e-09 Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 53.3 bits (128), Expect = 1e-09
Identities = 28/165 (16%), Positives = 63/165 (38%), Gaps = 27/165 (16%)

Query: 430 LINDLLNFSRYQTGMQKLELASC----DLVDLLTQAQQ-RFIPKGEARRVSLQLELGDEL 484
++ L RY S +VD Q +F R+ + ++ +
Sbjct: 196 MLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQF-----EDRLQFENQINPAI 250

Query: 485 PRLQLDRLQIERVIDNLLENALRHSSEGGQIHLQARRQGDRVLIAVEDNGEGIPFSQQGR 544
+Q+ + ++ +++N +++ + +GG+I L+ + V + VE+ G
Sbjct: 251 MDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSL-------- 302

Query: 545 IFEPFVQVGRKKGGAGLGLALCKEIIQLHGG---RIAVRSQPGQG 586
+ K G GL +E +Q+ G +I + + G+
Sbjct: 303 ------ALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKV 341


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5483HTHFIS448e-157 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 448 bits (1153), Expect = e-157
Identities = 165/476 (34%), Positives = 251/476 (52%), Gaps = 39/476 (8%)

Query: 9 GRILLVDDESAILRTFRYCLEDEGYSVATASSAPQAEALLQRQVFDLCFLDLRLGEDNGL 68
IL+ DD++AI L GY V S+A + DL D+ + ++N
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 69 DVLAQMRVQAPWMRVVIVTAHSAVDTAVDAMQAGAVDYLVKPCSPDQLRLAAAKQLEVRQ 128
D+L +++ P + V++++A + TA+ A + GA DYL KP +L + L +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 129 LTARLEALEDEVRRQGDGLESHSPAMAAVLETARQVAATDANILILGESGSGKGELARAI 188
R + ++ + G L S AM + ++ TD ++I GESG+GK +ARA+
Sbjct: 124 ---RRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARAL 180

Query: 189 HTWSKRAKKPQVTINCPSLTAELMESELFGHSRGAFTGATESTLGRVSQADGGTLFLDEI 248
H + KR P V IN ++ +L+ESELFGH +GAFTGA + GR QA+GGTLFLDEI
Sbjct: 181 HDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEI 240

Query: 249 GDFPLTLQPKLLRFIQDKEYERVGDPVTRRADVRILAATNRDLGAMVAQGQFREDLLYRL 308
GD P+ Q +LLR +Q EY VG R+DVRI+AATN+DL + QG FREDL YRL
Sbjct: 241 GDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRL 300

Query: 309 NVIVLNLPPLRERAEDILGLAERFLARFVKDYGRPARGFSEAAREAMRQYPWPGNVRELR 368
NV+ L LPPLR+RAEDI L F+ + K+ G + F + A E M+ +PWPGNVREL
Sbjct: 301 NVVPLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALELMKAHPWPGNVRELE 359

Query: 369 NVIERASIICNQELVDVDHLGFSAAQSASSAPR---------------IGESLS------ 407
N++ R + + Q+++ + + +P + E++
Sbjct: 360 NLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASF 419

Query: 408 -------------LEDLEKAHITAVMASSA-TLDQAAKTLGIDASTLYRKRKQYGL 449
L ++E I A + ++ +AA LG++ +TL +K ++ G+
Sbjct: 420 GDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5479V8PROTEASE320.007 V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 31.5 bits (71), Expect = 0.007
Identities = 7/41 (17%), Positives = 18/41 (43%)

Query: 292 AYGAPKAISSFVVPTGYSFNLDGSTLYQSIAAIFIAQLYGI 332
+ A ++ + TGY + +T+++S I + +
Sbjct: 186 SNNAETQVNQNITVTGYPGDKPVATMWESKGKITYLKGEAM 226


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5476TCRTETA371e-04 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 36.7 bits (85), Expect = 1e-04
Identities = 59/273 (21%), Positives = 97/273 (35%), Gaps = 31/273 (11%)

Query: 61 LMRPLGAVFLGAYIDRHGRRQGLIITLGLMAMGTLLIAFVPGYATLGVAAPLLVLF-GRL 119
LM+ A LGA DR GRR L+++ L YA + A L VL+ GR+
Sbjct: 54 LMQFACAPVLGALSDRFGRRPVLLVS---------LAGAAVDYAIMATAPFLWVLYIGRI 104

Query: 120 LQGFSAGVELGGVSVYLSEIATPGRKGFFVSWQSASQQVAVVFAGLLGVLLNQWLSPQDM 179
+ G + G Y+++I + + SA +V +LG L M
Sbjct: 105 VAGIT-GATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGL---------M 154

Query: 180 GEWGWRVPFFIGCLIVPALFVIRRSLEETPEFEARTHRPSLSQVLRSIGQNFGVVLAGTA 239
G + PFF + F+ PE RP L + + +F T
Sbjct: 155 GGFSPHAPFFAAAALNGLNFLT--GCFLLPESHKGERRP-LRREALNPLASFRWARGMTV 211

Query: 240 MVVMTTVSFYLI------TAYTPTFGKNELQLSDLDSLLVTMCIGLSN-FIWLPVMGAFS 292
+ + V F + A FG++ + G+ + + G +
Sbjct: 212 VAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVA 271

Query: 293 DRIGRKPLLLGASALALLTAYPALSWLVREPSF 325
R+G + L+ +A T Y L++ R
Sbjct: 272 ARLGERRALM-LGMIADGTGYILLAFATRGWMA 303



Score = 31.3 bits (71), Expect = 0.006
Identities = 16/45 (35%), Positives = 21/45 (46%), Gaps = 2/45 (4%)

Query: 258 FGKNELQLSDLDSLLVTMCIGLSNFIWLPVMGAFSDRIGRKPLLL 302
+ + LL L F PV+GA SDR GR+P+LL
Sbjct: 35 LVHSNDVTAHYGILLA--LYALMQFACAPVLGALSDRFGRRPVLL 77


73PA5365PA5360N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA5365-111-0.726491phosphate uptake regulatory protein PhoU
PA5364-111-0.506789two-component response regulator
PA5363013-0.328574hypothetical protein
PA5362112-0.184065hypothetical protein
PA53610110.669300two-component sensor PhoR
PA5360-290.856754two-component response regulator PhoB
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5365FLGHOOKAP1280.033 Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 28.4 bits (63), Expect = 0.033
Identities = 22/88 (25%), Positives = 41/88 (46%), Gaps = 9/88 (10%)

Query: 11 ISQQFNAELEDVRSHLLAMGGLVEKQVNDAVNALIDADSGLAQQVREIDDQINQMERNID 70
+ QF + +R +KQVN A+ A +D + A+Q+ ++DQI+++
Sbjct: 139 LVNQFKTTDQYLR--------DQDKQVNIAIGASVDQINNYAKQIASLNDQISRLTGVGA 190

Query: 71 EECVR-ILARRQPAASDLRLIISISKSV 97
+L +R S+L I+ + SV
Sbjct: 191 GASPNNLLDQRDQLVSELNQIVGVEVSV 218


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5364HTHFIS918e-23 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 91.4 bits (227), Expect = 8e-23
Identities = 29/139 (20%), Positives = 63/139 (45%), Gaps = 4/139 (2%)

Query: 1 MSKVSALVVDDAPFIRDLMKKGLRDNFPGLHIEEAVNGRKAQQLLSRQNVDLILCDWEMP 60
M+ + LV DD IR ++ + L + G + N + ++ + DL++ D MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQAL--SRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMP 58

Query: 61 EMSGLELLTWCRAQENLKTTPFIMVTSRGDKENVVQAIQAGVSDYIGKPFSNDQLVAKIK 120
+ + +LL + P ++++++ ++A + G DY+ KPF +L+ I
Sbjct: 59 DENAFDLLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116

Query: 121 KALSRSGKLEALAAHAPRR 139
+AL+ + + +
Sbjct: 117 RALAEPKRRPSKLEDDSQD 135


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5361PF06580386e-05 Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.9 bits (88), Expect = 6e-05
Identities = 22/99 (22%), Positives = 36/99 (36%), Gaps = 25/99 (25%)

Query: 333 LVFNAVKY----TPDEGEIRIRWWADEQGAHLSVQDTGIGVDPKHLPRLTERFYRVDSSR 388
LV N +K+ P G+I ++ D L V++TG
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALK----------------- 305

Query: 389 ASNTGGTGLGLAIVKHVLIR---HRARLEISSVPGKGST 424
+ TG GL V+ L A++++S GK +
Sbjct: 306 -NTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNA 343


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5360HTHFIS1002e-26 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 99.5 bits (248), Expect = 2e-26
Identities = 39/124 (31%), Positives = 63/124 (50%), Gaps = 2/124 (1%)

Query: 1 MVGKTILIVDDEAPIREMIAVALEMAGYECLEAENTQQAHAVIVDRKPDLILLDWMLPGT 60
M G TIL+ DD+A IR ++ AL AGY+ N I DL++ D ++P
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 SGIELARRLKRDELTVDIPIIMLTAKGEEDNKIQGLEVGADDYITKPFSPRELVARLKAV 120
+ +L R+K+ D+P+++++A+ I+ E GA DY+ KPF EL+ +
Sbjct: 61 NAFDLLPRIKKARP--DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 121 LRRT 124
L
Sbjct: 119 LAEP 122


74PA5333PA5322N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA5333013-2.111285hypothetical protein
PA5332-212-1.459645catabolite repression control protein
PA5331-211-0.295578orotate phosphoribosyltransferase
PA5330-110-0.791299hypothetical protein
PA5329-19-0.895280hypothetical protein
PA532809-1.018373mono-heme cytochrome C
PA5327-17-0.674806oxidoreductase
PA5326-18-0.461224hypothetical protein
PA5325011-0.239001hypothetical protein
PA5324-180.363107transcriptional regulator
PA5323-290.698128acetylglutamate kinase
PA5322-1100.697942phosphomannomutase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5333ACRIFLAVINRP280.009 Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 28.3 bits (63), Expect = 0.009
Identities = 7/41 (17%), Positives = 18/41 (43%)

Query: 65 FQITVALAMFVSFLLMLVVIGFFLLGLVCLAALVLTIIAGI 105
VA++ V FL + + + + + + + L I+ +
Sbjct: 872 APALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVL 912


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5331PF00577280.041 Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 27.9 bits (62), Expect = 0.041
Identities = 12/46 (26%), Positives = 23/46 (50%)

Query: 105 HGEGGTLVGAPLSGRVLIIDDVITAGTAIREVMQIIDAQGARAAGV 150
H + + +SG VL + +T G + + + ++ A GA+ A V
Sbjct: 686 HSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKV 731


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5330RTXTOXIND300.008 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 29.8 bits (67), Expect = 0.008
Identities = 17/109 (15%), Positives = 36/109 (33%), Gaps = 1/109 (0%)

Query: 82 LRQRKAAQAQASSDAQLLRLYSSLEDVDRARERRLAELDGLSSVARGNLQSLKLQQANLQ 141
L + + + + L +SSL + + E + A L+ K Q ++
Sbjct: 220 LARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIE 279

Query: 142 GQAAN-QERAGRPVAQALVDQLDDLKQEEKRLQGEIGRFQKAREDAERT 189
+ + +E + LD L+Q + K E + +
Sbjct: 280 SEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQAS 328


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5326ALARACEMASE290.041 Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 28.6 bits (64), Expect = 0.041
Identities = 26/147 (17%), Positives = 49/147 (33%), Gaps = 21/147 (14%)

Query: 33 IDLDRLDHNIDVVMRSVRRGGKHLRL--VEKSLPSPGLLAYIARRAGTRRLMSFHQPFLN 90
+DL L N+ VR+ H R+ V K+ + I G
Sbjct: 9 LDLQALKQNL----SIVRQAATHARVWSVVKANAYGHGIERIWSAIGATDGF-------- 56

Query: 91 HDAVAFADADILL---GKPLPVRSAELFYREHKGAFDPARQLQWLIDTPQRLRQYLALAQ 147
A+ + I L G P+ E F+ +L + + +L+
Sbjct: 57 --ALLNLEEAITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNAR- 113

Query: 148 GLGTRMRVNIELDVGLHRGGVADQAAL 174
L + + ++++ G++R G L
Sbjct: 114 -LKAPLDIYLKVNSGMNRLGFQPDRVL 139


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5324HTHFIS290.029 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.0 bits (65), Expect = 0.029
Identities = 19/136 (13%), Positives = 41/136 (30%), Gaps = 18/136 (13%)

Query: 161 RALVSPAFEPLGIELIHAAPPYAGEYLRLLGPQVRFGCLHNRMAIASHWLDMRLPNHNLP 220
R + + E+I + R G L A+ + R +
Sbjct: 364 RLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENM---RQYFASFG 420

Query: 221 ALRQALALLEQESTQVHRKLDLVQAVERAIARDLSLGSQIERISAELNMSSRTLRRRLAE 280
L ++ ++ L ++ A+ + + L ++ TLR+++ E
Sbjct: 421 DALPPSGLYDRVLAEMEYPL-ILAALTAT-------RGNQIKAADLLGLNRNTLRKKIRE 472

Query: 281 HGLTFEALLEQVRRGR 296
G+ V R
Sbjct: 473 LGV-------SVYRSS 481


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5323CARBMTKINASE558e-11 Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 54.8 bits (132), Expect = 8e-11
Identities = 66/301 (21%), Positives = 116/301 (38%), Gaps = 61/301 (20%)

Query: 26 VGKTLVIKYGGNAMESEELKAGF----------ARDVVLMKAVGINPVVVHGGGPQIGDL 75
+GK +VI GGNA++ K + AR + + A G V+ HG GPQ+G L
Sbjct: 1 MGKRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSL 60

Query: 76 LKRLSIESHFIDGMRVTDAATMDVV-----------------EMVLGGQVNKDIVNLINR 118
L L +++ A MDV + + K +V +I +
Sbjct: 61 L--LHMDAG--QATYGIPAQPMDVAGAMSQGWIGYMIQQALKNELRKRGMEKKVVTIITQ 116

Query: 119 -----------HGGSAIG--LTGKDAELIRAKKLTVTRQ---------TPEMTKPEIIDI 156
+ +G + A+ + +K + ++ P ++
Sbjct: 117 TIVDKNDPAFQNPTKPVGPFYDEETAKRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEA 176

Query: 157 GHVGEVTGVNVGLLNMLVKGDFIPVIAPIGVGSNGESYNINADLVAGKVAEALKAEKLML 216
+ V G++ + G +PVI G E+ I+ DL K+AE + A+ M+
Sbjct: 177 ETIK--KLVERGVIVIASGGGGVPVILEDGEIKGVEAV-IDKDLAGEKLAEEVNADIFMI 233

Query: 217 LTNIAGLMDKQG----QVLTGLSTEQVNELIADGT-IYGGMLPKIRCALEAVQGGVTSAH 271
LT++ G G Q L + E++ + +G G M PK+ A+ ++ G A
Sbjct: 234 LTDVNGAALYYGTEKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGERAI 293

Query: 272 I 272
I
Sbjct: 294 I 294


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5322PF03544300.039 Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 29.9 bits (67), Expect = 0.039
Identities = 16/109 (14%), Positives = 30/109 (27%), Gaps = 4/109 (3%)

Query: 317 QRTAKPPVPSLPGFAPLIQALARQPRR-KPEPTSVPSPAKAAPVAPVAVAKAPPREEPAL 375
Q P + A P+ +P P V P P +AP E
Sbjct: 39 QVIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPK 98

Query: 376 ADPLFQNTDILDIDILDEDQDLLGLEQT---PIMSTAKAPTLPASIFRA 421
P + + ++ D + + A+ + A+ +
Sbjct: 99 PKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATS 147


75PA5232PA5226N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA523208-0.078648hypothetical protein
PA523118-0.542823ABC transporter ATP-binding protein/permease
PA5230212-0.873844ABC transporter permease
PA5229213-0.238165hypothetical protein
PA52280100.9250005-formyltetrahydrofolate cyclo-ligase
PA52270110.514583hypothetical protein
PA5226-1101.555024hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5232RTXTOXIND816e-19 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 81.0 bits (200), Expect = 6e-19
Identities = 59/392 (15%), Positives = 126/392 (32%), Gaps = 85/392 (21%)

Query: 1 MKQESKRWLSRALIVAALLGVGVLVWQVSRPTGLGEGFASGNGRI--EATEVDVAAKLPG 58
++ R V + V E A+ NG++ ++
Sbjct: 49 IETPVSRRPRLVAYFIMGFLVIAFILSVLGQV---EIVATANGKLTHSGRSKEIKPIENS 105

Query: 59 RVAEIKVDEGDFVKAGEIVARMDTQVLEAQLAQAQAQVRQAENAKLTATSLVAQRESEKS 118
V EI V EG+ V+ G+++ ++ EA + Q+ + QA + L E K
Sbjct: 106 IVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKL 165

Query: 119 TAQAVVAQRQAELTAAQKRFTRTEALVKRNALPQQQLDDDRATLQSAQAALSAARSQV-- 176
+ + + + ++ T + ++ + Q Q L +A +++
Sbjct: 166 PELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINR 225

Query: 177 ------------------------------------ISAQAAIEAGRSQVIEAQSAIEAA 200
+ A + +SQ+ + +S I +A
Sbjct: 226 YENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSA 285

Query: 201 KASVARLQADIDD-----------------------------SLLKAPRNGRV-QYRVAQ 230
K + + S+++AP + +V Q +V
Sbjct: 286 KEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHT 345

Query: 231 PGEVLPAGGKLLNMVDLADVY-MTFFLPSMQAGRVGLGQEVRLVIDAVPDY---VIPAKV 286
G V+ L+ +V D +T + + G + +GQ + ++A P + KV
Sbjct: 346 EGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKV 405

Query: 287 SYVASVAQFTPKTVETANEREKLMFRVKARLD 318
+ +E ++R L+F V ++
Sbjct: 406 KNIN------LDAIE--DQRLGLVFNVIISIE 429


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5231PF05272320.013 Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 32.0 bits (72), Expect = 0.013
Identities = 13/39 (33%), Positives = 17/39 (43%), Gaps = 2/39 (5%)

Query: 36 MVGLIGPDGVGKSSLLALLAGARKMQDGEIRVLDGDMRD 74
V L G G+GKS+L+ L G D G +D
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHF--DIGTGKD 634


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5230ABC2TRNSPORT444e-07 ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein

signature.
Length = 262

Score = 44.2 bits (104), Expect = 4e-07
Identities = 41/174 (23%), Positives = 67/174 (38%), Gaps = 4/174 (2%)

Query: 196 AALIREREHGTVEHLLVMPLSAFEIMMAKV-WSMGLVVLVAAGLSLQWVVRGWLDVPISG 254
AA R T E +L L +I++ ++ W+ L AG+ + G+
Sbjct: 89 AAFGRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAALGYTQWL--- 145

Query: 255 SVGLFLLGAGLHLFATTSMGIFLGTVARSMPQLGLLTILVLLPLNILSGGTTPRESMPEL 314
S+ L L A S+G+ + +A S LV+ P+ LSG P + +P +
Sbjct: 146 SLLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIV 205

Query: 315 VQNIMLAAPTTHFVSLAQAILFRGAGFDIVWPQFAGIVVIGSAFFFGALWRFRR 368
Q P +H + L + I+ D+ A + I FF RR
Sbjct: 206 FQTAARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALCIYIVIPFFLSTALLRRR 259


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5226ALARACEMASE260.048 Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 25.5 bits (56), Expect = 0.048
Identities = 10/34 (29%), Positives = 15/34 (44%), Gaps = 2/34 (5%)

Query: 8 PARPGHWPRPGTILYSCPITPIPKREPMEDADLQ 41
P W RPG ILY +P + + + L+
Sbjct: 199 PEAHFDWVRPGIILYG--ASPSGQWRDIANTGLR 230


76PA5166PA5158N        Y        YPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA5166-2110.220540two-component response regulator
PA5165-212-0.193362two-component sensor
PA5164-214-0.619913dTDP-4-dehydrorhamnose 3,5-epimerase
PA5163-2140.419573glucose-1-phosphate thymidylyltransferase
PA5162-2130.611556dTDP-4-dehydrorhamnose reductase
PA5161-2130.691704dTDP-D-glucose 4,6-dehydratase
PA5160-1121.015787*drug efflux transporter
PA5159-1121.216830multidrug resistance protein
PA51580110.715591hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5166HTHFIS446e-156 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 446 bits (1150), Expect = e-156
Identities = 176/480 (36%), Positives = 240/480 (50%), Gaps = 48/480 (10%)

Query: 11 TQVLLIDDDPHLRQALRQTLDLAGLKVATLDDARQLDTAQCKDWPGVVVSDIRMPGIDGM 70
+L+ DDD +R L Q L AG V +A L +VV+D+ MP +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 71 ELLRQLHEQDADLPVILITGHGDVPLAVQAMRGGAYDFLEKPFPSDALLDSVRRALEVRR 130
+LL ++ + DLPV++++ A++A GAYD+L KPF L+ + RAL +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 131 LVLENRTLRLALAERHELHGRLIGRSAGMQRLREQVGSLAAIQADVLVLGETGAGKEVVA 190
R + + L+GRSA MQ + + L +++ GE+G GKE+VA
Sbjct: 124 -----RRPSKLEDDSQDGMP-LVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVA 177

Query: 191 RALHDLSSRRDGPFVAINAGALAESVVESELFGHEAGAFTGAQKRRIGKFEYANGGTLFL 250
RALHD RR+GPFVAIN A+ ++ESELFGHE GAFTGAQ R G+FE A GGTLFL
Sbjct: 178 RALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFL 237

Query: 251 DEIESMSLDVQVKLLRLLQERVVERLGSNQLIPLDIRIIAATKEDLRQAADQGRFRADLY 310
DEI M +D Q +LLR+LQ+ +G I D+RI+AAT +DL+Q+ +QG FR DLY
Sbjct: 238 DEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLY 297

Query: 311 YRLNVASLRIPPLRERGEDIPLLFRHFAEAGAMRHGLTPRELDAGQSARLLAYDWPGNVR 370
YRLNV LR+PPLR+R EDIP L RHF + A + GL + D + A+ WPGNVR
Sbjct: 298 YRLNVVPLRLPPLRDRAEDIPDLVRHFVQQ-AEKEGLDVKRFDQEALELMKAHPWPGNVR 356

Query: 371 ELQNAAERFAL-----------------------------------------GLGLSLDD 389
EL+N R +
Sbjct: 357 ELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYF 416

Query: 390 GVLPDAADEPHNLSAKVEAFERSLIAAELERPHNSLRSVAEALGIPRKTLHDKLRKHGLP 449
DA + E LI A L + A+ LG+ R TL K+R+ G+
Sbjct: 417 ASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5162NUCEPIMERASE482e-08 Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 47.9 bits (114), Expect = 2e-08
Identities = 32/158 (20%), Positives = 55/158 (34%), Gaps = 27/158 (17%)

Query: 3 RILLLGANGQVGWELQRALAPLGE--------------LLVCDRR-----------RADL 37
+ L+ GA G +G+ + + L G L R + DL
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 38 ADPEGLARLVRAERPQFIVNAGAYTAVDKAESDADNARLINARAVAVLAEEAAACG-AWL 96
AD EG+ L + + + + AV + + N + E L
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121

Query: 97 VHYSTDYVFDGAGSVPFAEDAPTG-PLSVYGQTKLEGE 133
++ S+ V+ +PF+ D P+S+Y TK E
Sbjct: 122 LYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANE 159


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5161NUCEPIMERASE1769e-55 Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 176 bits (449), Expect = 9e-55
Identities = 86/352 (24%), Positives = 136/352 (38%), Gaps = 42/352 (11%)

Query: 1 MTILVTGSAGFIGANFVLDWLALHDEPVVSLDKLT--YAGNRQNL-ASLDGDARHTFVAG 57
M LVTG+AGFIG + L + VV +D L Y + + L F
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQ-VVGIDNLNDYYDVSLKQARLELLAQPGFQFHKI 59

Query: 58 DIGDSQLVARLLAEHQPRAILNFAAESHVDRSIHGPEDFIQTNIVGTFRLLEEVRAYWGA 117
D+ D + + L A + V S+ P + +N+ G +LE R
Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHN--- 116

Query: 118 LEPEAKAAFRFLHVSTDEVYGSLAPSDPAFTENNRYEPNSPYSASKAASDHLVRAYHHTY 177
+ + L+ S+ VYG L P T+++ P S Y+A+K A++ + Y H Y
Sbjct: 117 -KIQ-----HLLYASSSSVYG-LNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLY 169

Query: 178 GLPVLTTNCSNNYGPYHFPEKLIPLVIHNALAGKPLPIYGDGQQIRDWLYVKDHCSAIRR 237
GLP YGP+ P+ + L GK + +Y G+ RD+ Y+ D AI R
Sbjct: 170 GLPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIR 229

Query: 238 VLEAGQL------------------GETYNVGGWNEKANLDVVETLCAILDQEQPRADGR 279
+ + YN+G + +D ++ L L E A
Sbjct: 230 LQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIE---AK-- 284

Query: 280 SYREQITFVKDRPGHDRRYAIDATRLERELGWKPAETFETGIRKTVRWYLDN 331
+ +PG + D L +G+ P T + G++ V WY D
Sbjct: 285 -----KNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDF 331


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5160TCRTETB1096e-28 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 109 bits (274), Expect = 6e-28
Identities = 75/397 (18%), Positives = 155/397 (39%), Gaps = 16/397 (4%)

Query: 17 IGLSLATFMQVLDTTIANVALPTISGNLGVSSEQGTWVITSFAVSNAIALPLTGWLARRV 76
I L + +F VL+ + NV+LP I+ + WV T+F ++ +I + G L+ ++
Sbjct: 17 IWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQL 76

Query: 77 GEVRLFIAATLLFVLASFLCGIAQSMPSLVGFRALQGFVAGPLYPITQTLLISIY-PPAK 135
G RL + ++ S + + S SL+ +P ++++ Y P
Sbjct: 77 GIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKEN 136

Query: 136 RGMALALLAMVTVVAPIAGPILGGWITDDYSWPWIFFINVPVGLFAAFVVYQQLKARPVV 195
RG A L+ + + GP +GG I W ++ I + + F++ + V
Sbjct: 137 RGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLL---KKEV 193

Query: 196 IKKAPMDYVGLIALVVGVGALQIVLDKGNDLDWFESNFIVGGALIAAIALAFFIIWEFTD 255
K D G+I + VG+ + F +++ + +++ ++ F+
Sbjct: 194 RIKGHFDIKGIILMSVGIVFFML----------FTTSYSISFLIVSVLSFLIFVKHIRKV 243

Query: 256 RHPIVNLRLFAHRNFAAGTLALVLGYAAFFGINLLLPQWLQTQMGYTATWAGLAAAPIGI 315
P V+ L + F G L + + G ++P ++ + G G
Sbjct: 244 TDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGT 303

Query: 316 LPV-FLSPLVGRYANHFDLRMLAGLSFLAMAITCFMRANFTTEVDYQHIAIVQLIMGLGV 374
+ V + G + + + ++++ F+ A+F E + I+ + + G+
Sbjct: 304 MSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVS-FLTASFLLETTSWFMTIIIVFVLGGL 362

Query: 375 AFFFMPILSILLSDLPPDQIADGSGLATFLRTLGGSF 411
+F I +I+ S L + G L F L
Sbjct: 363 SFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGT 399


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5159RTXTOXIND772e-17 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 77.2 bits (190), Expect = 2e-17
Identities = 47/368 (12%), Positives = 105/368 (28%), Gaps = 90/368 (24%)

Query: 54 GNVVQITPQIVGTVVSIGADDGDLVRKGQELVRFDPSDADIALQRAEANLA--------- 104
G +I P V I +G+ VRKG L++ A+ + +++L
Sbjct: 94 GRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRY 153

Query: 105 -----------------------------HTVRQVRGLFSNVDGYRAEVATRKVALAKAE 135
+R + ++ + +++ L K
Sbjct: 154 QILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKR 213

Query: 136 ADYK----RRKNLADDGAISQEELAH----------ARDALDSAKASLTSSEQQLNTNRA 181
A+ R + + + L A+ A+ + + +L ++
Sbjct: 214 AERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKS 273

Query: 182 LVDDTQ---------------------ITSHPDVKAAAAQLRQ----AYLDDARSTIVAP 216
++ + + L S I AP
Sbjct: 274 QLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAP 333

Query: 217 VTGYVAKRSVQ-VGQRVQPGNALMAVVPLDQ-IWIDANFKETQLKHMRIGQPVEIRSDLY 274
V+ V + V G V LM +VP D + + A + + + +GQ I+ + +
Sbjct: 334 VSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAF 393

Query: 275 GSDV--RYSGTVDSLGVGTGSAFSLLPAQNATGNWIKIVQRVPVRIHIDPQELQKHPLRI 332
G V ++ + G ++ + + + PL
Sbjct: 394 PYTRYGYLVGKVKNINLDA-------IEDQRLGLVFNVIISIEE--NCLSTGNKNIPLSS 444

Query: 333 GLSMDVKV 340
G+++ ++
Sbjct: 445 GMAVTAEI 452


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5158RTXTOXIND340.001 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 34.0 bits (78), Expect = 0.001
Identities = 27/213 (12%), Positives = 50/213 (23%), Gaps = 26/213 (12%)

Query: 79 EALQGTPDLQIAEARARQAAATAQAQDAARQPTLDAKASYSGIRAPTSVAPAPLGGRYSA 138
AL D ++ QA + K +
Sbjct: 128 TALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRL 187

Query: 139 IKYLSLGFNYDFDLWGGERAAWEAALGQANAARIDSQAARIGLSASIARAYSDLAHAFTV 198
F W ++ E L + A R+ A S L ++
Sbjct: 188 TSL----IKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSL 243

Query: 199 RD--------LAEEELKRSQRMTELSQKR------MSAGLDSKVQLQQ--------TQTQ 236
+ E+E K + + EL + S L +K + Q +
Sbjct: 244 LHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDK 303

Query: 237 LATARQQLSAAEQDIASARIALAVLLGKGPDRG 269
L + ++A + + P
Sbjct: 304 LRQTTDNIGLLTLELAKNEERQQASVIRAPVSV 336


77PA5044PA5037N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA5044211-3.941056type 4 fimbrial biogenesis protein PilM
PA5043111-3.824147type 4 fimbrial biogenesis protein PilN
PA5042-112-2.481094type 4 fimbrial biogenesis protein PilO
PA5041-19-1.864266type 4 fimbrial biogenesis protein PilP
PA504009-1.762549type 4 fimbrial biogenesis outer membrane
PA5039010-1.085287shikimate kinase
PA5038-110-1.0595213-dehydroquinate synthase
PA5037-210-0.535910hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5044SHAPEPROTEIN320.002 Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein

signature.
Length = 347

Score = 32.4 bits (74), Expect = 0.002
Identities = 40/158 (25%), Positives = 63/158 (39%), Gaps = 38/158 (24%)

Query: 197 VVDIGATMTTLSVLHNGRTIYTREQLFGGRQLTEEI----QRRYGLSVEE--AGLAKKQG 250
VVDIG T ++V+ +Y+ GG + E I +R YG + E A K +
Sbjct: 163 VVDIGGGTTEVAVISLNGVVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEI 222

Query: 251 G--LPDDYDSEV-------------------------LRPFKDAVVQQVSRSLQFF---F 280
G P D E+ L+ +V V +L+
Sbjct: 223 GSAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPEL 282

Query: 281 AAGQFNDVDYIVLAGGTASIQDLDRLIQQKIGTPTLVA 318
A+ +VL GG A +++LDRL+ ++ G P +VA
Sbjct: 283 ASDISER--GMVLTGGGALLRNLDRLLMEETGIPVVVA 318


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5040BCTERIALGSPD3133e-99 Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D

signature.
Length = 660

Score = 313 bits (804), Expect = 3e-99
Identities = 107/419 (25%), Positives = 184/419 (43%), Gaps = 53/419 (12%)

Query: 325 VPWDQALDLVLKTKGLDKRKLGNVLLVAPADEIAARERQEL--------EAQKQIAELAP 376
+ W A D+V L+K + L + + A ER Q+ IA +
Sbjct: 199 LSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSRQRIIAMIKQ 258

Query: 377 LRRE--------LIQVNYAKAADIAKLFQSVTSDGGQEGKEGGRGS--------ITVDDR 420
L R+ +I + YAKA+D+ ++ + S Q K+ + I +
Sbjct: 259 LDRQQATQGNTKVIYLKYAKASDLVEVLTGI-SSTMQSEKQAAKPVAALDKNIIIKAHGQ 317

Query: 421 TNSIIAYQPQERLDELRRIVSQLDIPVRQVMIEARIVEANVGYDKSLGVRWGGAYHKGNW 480
TN++I + +++L R+++QLDI QV++EA I E +LG++W
Sbjct: 318 TNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWANKN----- 372

Query: 481 SGYGKDGNIGIKDEDGMNCGPIAGSCTFPTTGTSKSPSPFVDLGAKDATSGIGIGFITDN 540
+G + N G+ + AG+ + GT S A + +GI GF N
Sbjct: 373 AGMTQFTNSGLPISTAI-----AGANQYNKDGTVSSSLA----SALSSFNGIAAGFYQGN 423

Query: 541 IILDLQLSAMEKTGNGEIVSQPKVVTSDKETAKILKGSEVPYQEASSSGATSTSF----- 595
+ L+A+ + +I++ P +VT D A G EVP S + + F
Sbjct: 424 --WAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGSQTTSGDNIFNTVER 481

Query: 596 KEAALSLEVTPQITPDNRIIVEVK-----VTKDAPDYQNMLNGVPPINKNEVNAKILVND 650
K + L+V PQI + +++E++ V A + L N VN +LV
Sbjct: 482 KTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGAT--FNTRTVNNAVLVGS 539

Query: 651 GETIVIGGVFSNEQSKSVEKVPFLGELPYLGRLFRRDTVTDRKNELLVFLTPRIMNNQA 709
GET+V+GG+ S + +KVP LG++P +G LFR + K L++F+ P ++ ++
Sbjct: 540 GETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRD 598



Score = 52.6 bits (126), Expect = 4e-09
Identities = 31/188 (16%), Positives = 74/188 (39%), Gaps = 13/188 (6%)

Query: 281 GEKLSLNFQDIDVRSVLQLIADFTDLNLVASDTVQGNITLRLQN-VPWDQALDL---VLK 336
E+ S +F+ D++ + ++ + ++ +V+G IT+R + + +Q VL
Sbjct: 27 AEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDMLNEEQYYQFFLSVLD 86

Query: 337 TKGLDKRKLGN-VLLVAPADEIAARERQELEAQKQIAELAPLRRELIQVNYAKAADIAKL 395
G + N VL V + + A + + + ++ + A D+A L
Sbjct: 87 VYGFAVINMNNGVLKVVRSKD-AKTAAVPVASDAAPGIGDEVVTRVVPLTNVAARDLAPL 145

Query: 396 FQSVTSDGGQEGKEGGRGSITVDDRTNSIIAYQPQERLDELRRIVSQLDIPVRQVMIEAR 455
+ + + G GS+ + +N ++ + L IV ++D + ++
Sbjct: 146 LRQLNDN-------AGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGDRSVVTVP 198

Query: 456 IVEANVGY 463
+ A+
Sbjct: 199 LSWASAAD 206


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5039PF05272290.011 Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.9 bits (64), Expect = 0.011
Identities = 11/33 (33%), Positives = 16/33 (48%), Gaps = 2/33 (6%)

Query: 4 LILVGPMGAGKSTIGRLLAKELHLAFKDSDKEI 36
++L G G GKST+ L F D+ +I
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDF--FSDTHFDI 629


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA5037PF03544431e-06 Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 43.0 bits (101), Expect = 1e-06
Identities = 23/104 (22%), Positives = 31/104 (29%), Gaps = 1/104 (0%)

Query: 355 LPSAAVPPTVSSSAPPVTPLANNGVTPMHPVPPAPTEPTAPAATPTPTQTPAPAAPVASA 414
LP+ A P +V+ AP P PV EP P
Sbjct: 43 LPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPK 102

Query: 415 PASKPA-PAPAPAKPAASKPATTAAAKPAPAPAAKPASGGGAGS 457
P KP P + + A+ APA +S A +
Sbjct: 103 PKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAAT 146



Score = 41.5 bits (97), Expect = 3e-06
Identities = 20/108 (18%), Positives = 30/108 (27%)

Query: 355 LPSAAVPPTVSSSAPPVTPLANNGVTPMHPVPPAPTEPTAPAATPTPTQTPAPAAPVASA 414
+ A + P + PP + P PP P P P P V
Sbjct: 55 VAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQP 114

Query: 415 PASKPAPAPAPAKPAASKPATTAAAKPAPAPAAKPASGGGAGSQWYRN 462
PA P + + A A +KP + +G +
Sbjct: 115 KRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSR 162



Score = 33.8 bits (77), Expect = 0.001
Identities = 24/72 (33%), Positives = 26/72 (36%), Gaps = 11/72 (15%)

Query: 383 HPVPPAPTEP-----TAPAATPTPTQTPAPAAPVASAPASKPAPAPAPAKPAASKPATTA 437
PAP +P APA P P PV +P P P P P K A
Sbjct: 40 VIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVV-----EPEPEPEPI-PEPPKEAPVV 93

Query: 438 AAKPAPAPAAKP 449
KP P P KP
Sbjct: 94 IEKPKPKPKPKP 105


78PA4868PA4862N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA48683192.638271urease subunit alpha
PA48670173.193355urease subunit beta
PA4866-1162.667461hypothetical protein
PA4865-1162.650753urease subunit gamma
PA4864-1152.715779urease accessory protein
PA4863-1130.637807hypothetical protein
PA4862-1120.414564ABC transporter ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4868UREASE10960.0 Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 1096 bits (2837), Expect = 0.0
Identities = 423/567 (74%), Positives = 479/567 (84%), Gaps = 2/567 (0%)

Query: 2 KISRQAYADMFGPTVGDRVRLADTDLWIEVERDFTVYGEEVKFGGGKVIRDGMGQSQL-G 60
++SR AYA+MFGPTVGD+VRLADT+L+IEVE+DFT +GEEVKFGGGKVIRDGMGQSQ+
Sbjct: 4 RMSRAAYANMFGPTVGDKVRLADTELFIEVEKDFTTHGEEVKFGGGKVIRDGMGQSQVTR 63

Query: 61 AAQVVDTVITNALILDHWGVVKADVGLKDGRIQAIGKAGNPDIQPGVNIAIGAGTEVIAG 120
VDTVITNALILDHWG+VKAD+GLKDGRI AIGKAGNPD+QPGV I +G GTEVIAG
Sbjct: 64 EGGAVDTVITNALILDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAG 123

Query: 121 EGMILTAGGIDTHIHFICPQQIEEALMSGVTTMIGGGTGPAAGTNATTCTSGPWHMARML 180
EG I+TAGG+D+HIHFICPQQIEEALMSG+T M+GGGTGPA GT ATTCT GPWH+ARM+
Sbjct: 124 EGKIVTAGGMDSHIHFICPQQIEEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHIARMI 183

Query: 181 QAADAFPMNIGFTGKGNASLPLPLEEQVLAGAIGLKLHEDWGSTPAAIDNCLEVAERHDI 240
+AADAFPMN+ F GKGNASLP L E VL GA LKLHEDWG+TPAAID CL VA+ +D+
Sbjct: 184 EAADAFPMNLAFAGKGNASLPGALVEMVLGGATSLKLHEDWGTTPAAIDCCLSVADEYDV 243

Query: 241 QVAIHTDTLNESGFVETTLGAFKGRTIHTYHTEGAGGGHAPDIIKACGFANVLPSSTNPT 300
QV IHTDTLNESGFVE T+ A KGRTIH YHTEGAGGGHAPDII+ CG NV+PSSTNPT
Sbjct: 244 QVMIHTDTLNESGFVEDTIAAIKGRTIHAYHTEGAGGGHAPDIIRICGQPNVIPSSTNPT 303

Query: 301 RPFTRNTIDEHLDMLMVCHHLDPAIAEDVAFAESRIRRETIAAEDILHDLGAFSMISSDS 360
RP+T NT+ EHLDMLMVCHHL P I ED+AFAESRIR+ETIAAEDILHD+GAFS+ISSDS
Sbjct: 304 RPYTVNTLAEHLDMLMVCHHLSPTIPEDIAFAESRIRKETIAAEDILHDIGAFSIISSDS 363

Query: 361 QAMGRVGEVITRTWQTADKMKRQRGRLDGDGARNDNFRARRYIAKYTINPAITHGISHEV 420
QAMGRVGEV RTWQTADKMKRQRGRL + NDNFR +RYIAKYTINPAI HG+SHE+
Sbjct: 364 QAMGRVGEVAIRTWQTADKMKRQRGRLKEETGDNDNFRVKRYIAKYTINPAIAHGLSHEI 423

Query: 421 GSVEAGKWADLVLWRPAFFGVKPSLILKGGAIAASLMGDINGSIPTPQPVHYRPMFASYA 480
GS+E GK ADLVLW PAFFGVKP ++L GG IAA+ MGD N SIPTPQPVHYRPMF +Y
Sbjct: 424 GSLEVGKRADLVLWNPAFFGVKPDMVLLGGTIAAAPMGDPNASIPTPQPVHYRPMFGAYG 483

Query: 481 GSRHATSLTFVSQAAFAAGVPQQLGLRKAIGVVSGCR-GVQKTDLIHNGYLPTIEVDAQN 539
SR +S+TFVSQA+ AG+ +LG+ K + V R G+ K +IHN P IEVD +
Sbjct: 484 RSRTNSSVTFVSQASLDAGLAGRLGVAKELVAVQNTRGGIGKASMIHNSLTPHIEVDPET 543

Query: 540 YQVRADGQLLWCEPADVLPMAQRYFLF 566
Y+VRADG+LL CEPA VLPMAQRYFLF
Sbjct: 544 YEVRADGELLTCEPATVLPMAQRYFLF 570


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4866SACTRNSFRASE422e-07 Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 41.9 bits (98), Expect = 2e-07
Identities = 15/63 (23%), Positives = 26/63 (41%), Gaps = 1/63 (1%)

Query: 81 RGTVEHSVYVRDDQRGKGLGVQLLQALIERARAQGLHVMVAAIESGNAASIGLHRRLGFE 140
+E + V D R KG+G LL IE A+ ++ + N ++ + + F
Sbjct: 89 YALIED-IAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147

Query: 141 ISG 143
I
Sbjct: 148 IGA 150


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4863SACTRNSFRASE361e-05 Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 36.5 bits (84), Expect = 1e-05
Identities = 16/74 (21%), Positives = 34/74 (45%), Gaps = 1/74 (1%)

Query: 57 DGQPVGLLVTRETADGFL-VDNLAVLPECKGQGIGRQLLERAERDATSLGYRSLYLYTNE 115
+ +G + R +G+ ++++AV + + +G+G LL +A A + L L T +
Sbjct: 73 ENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQD 132

Query: 116 RMTENIALYARVGY 129
YA+ +
Sbjct: 133 INISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4862PF05272280.046 Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 27.7 bits (61), Expect = 0.046
Identities = 13/37 (35%), Positives = 19/37 (51%)

Query: 14 SHILRGLSFEAKVGEVTCLLGRNGVGKTTLLRCLMGL 50
H+ R + K L G G+GK+TL+ L+GL
Sbjct: 583 GHVARVMEPGCKFDYSVVLEGTGGIGKSTLINTLVGL 619


79PA4781PA4776N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA4781091.522335cyclic di-GMP phosphodiesterase
PA47801111.686601hypothetical protein
PA47792111.660320hypothetical protein
PA47783111.432110protein CueR
PA47772121.781550two-component regulator system signal sensor
PA47761112.002811two-component regulator system response
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4781HTHFIS742e-16 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 74.1 bits (182), Expect = 2e-16
Identities = 29/120 (24%), Positives = 52/120 (43%), Gaps = 6/120 (5%)

Query: 13 VLVVDDTPDNLLLMRELLE-EQYRVRTAGSGPAGLRAAVEEPRPDLILLDVNMPGMDGYE 71
+LV DD ++ + L Y VR + R + DL++ DV MP + ++
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRW-IAAGDGDLVVTDVVMPDENAFD 64

Query: 72 VCRRLKA-DPLTRDIPLMFLTARADRDDEQQGLALGAVDYLGKPVSPPIVLARVRTHLQL 130
+ R+K P D+P++ ++A+ + GA DYL KP ++ + L
Sbjct: 65 LLPRIKKARP---DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4780IGASERPTASE310.008 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.8 bits (69), Expect = 0.008
Identities = 15/75 (20%), Positives = 28/75 (37%), Gaps = 4/75 (5%)

Query: 218 PELRQTRYAKEMWALYEAGELTAETPLSGTFVEAEEAADVRAVLREIEAAQREEARRQAL 277
+ AKE + +A T E SG+ E +E +E ++EE +
Sbjct: 1061 TTAQNREVAKEAKSNVKANTQTNEVAQSGS--ETKETQ--TTETKETATVEKEEKAKVET 1116

Query: 278 RQADDAPRGEREEPP 292
+ + P+ + P
Sbjct: 1117 EKTQEVPKVTSQVSP 1131


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4778PF07675300.002 Cleaved Adhesin
		>PF07675#Cleaved Adhesin

Length = 1358

Score = 30.4 bits (68), Expect = 0.002
Identities = 17/82 (20%), Positives = 35/82 (42%), Gaps = 2/82 (2%)

Query: 3 IGEAAKKSGLTPKMIRYYESIELLRPAGRSASGYRHYNENDLHTLAFIRRSRDLGFSLDE 62
G + + +G P+ + +++L PAG +RHYN +DL+ + +G S
Sbjct: 933 FGLSTEANGAKPQSVWIERTVDL--PAGTKYVAFRHYNCSDLNYILLDDIQFTMGGSPTP 990

Query: 63 VGKLLTLWQDRQRASADVKALA 84
T+++D + +
Sbjct: 991 TDYTYTVYRDGTKIKEGLTETT 1012


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4777PF06580340.002 Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 33.7 bits (77), Expect = 0.002
Identities = 15/81 (18%), Positives = 31/81 (38%), Gaps = 20/81 (24%)

Query: 360 LVGNALRY----TPAGGQVEIRVENRAQHAVLRVRDNGPGVALEEQQAIFTRFYRSPATS 415
LV N +++ P GG++ ++ L V + G +
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLA----------------LKN 306

Query: 416 SGEGSGLGLPIVKRIVELHFG 436
+ E +G GL V+ +++ +G
Sbjct: 307 TKESTGTGLQNVRERLQMLYG 327


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4776HTHFIS801e-19 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 80.3 bits (198), Expect = 1e-19
Identities = 34/123 (27%), Positives = 58/123 (47%), Gaps = 1/123 (0%)

Query: 2 RILLAEDDLLLGDGIRAGLRLEGDTVEWVTDGVAAENALVTDEFDLLVLDIGLPRRSGLD 61
IL+A+DD + + L G V ++ + + DL+V D+ +P + D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 ILRNLRHQGLLTPVLLLTARDKVADRVAGLDSGADDYLTKPFDLDELQARV-RALTRRTT 120
+L ++ PVL+++A++ + + GA DYL KPFDL EL + RAL
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 121 GRA 123
+
Sbjct: 125 RPS 127


80PA4600PA4589N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA4600-210-0.182542transcriptional regulator NfxB
PA4599-2100.242961resistance-nodulation-cell division (RND)
PA4598-190.055169resistance-nodulation-cell division (RND)
PA4597080.713231multidrug efflux outer membrane protein OprJ
PA4596-180.609900transcriptional regulator
PA4595-180.276267ABC transporter ATP-binding protein
PA4594-2110.384964ABC transporter ATP-binding protein
PA4593-1100.026822ABC transporter permease
PA4592-28-0.560226hypothetical protein
PA4591-19-0.336872hypothetical protein
PA4590-29-0.012306protein activator
PA4589-2100.028701hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4600HTHTETR388e-06 TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 38.1 bits (88), Expect = 8e-06
Identities = 19/141 (13%), Positives = 52/141 (36%), Gaps = 7/141 (4%)

Query: 24 ATLKELAEAAGVSKATLHRFCGTRDNLVQMLEDHGETVLNQIIQACDLEHAEPLEALQRL 83
+L E+A+AAGV++ ++ + +L + + E+ + ++ + ++ R
Sbjct: 32 TSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPGDPLSVLRE 91

Query: 84 IKEHL-------THRELLVFLVFQYRPDFLDPHGEGARWQSYLEALDAFFLRGQQKGVFR 136
I H+ R LL+ ++F + ++ + + +
Sbjct: 92 ILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEA 151

Query: 137 IDITAAVFTELFITLVYGMVD 157
+ A + T ++ G +
Sbjct: 152 KMLPADLMTRRAAIIMRGYIS 172


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4599RTXTOXIND431e-06 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 42.9 bits (101), Expect = 1e-06
Identities = 22/104 (21%), Positives = 41/104 (39%), Gaps = 4/104 (3%)

Query: 99 LKAAVSRAEGELARNRAVLFEAQARVRRYEPLVKIQAVSQQDFDTATADLRSAEAATRSA 158
+ A EL ++ L + ++ + + + Q V+Q + LR
Sbjct: 257 QENKYVEAVNELRVYKSQLEQIESEILSAK--EEYQLVTQLFKNEILDKLRQTTDNIGLL 314

Query: 159 QADLETARLNLGYASVTAPISGRIGRALV-TEGALVGQGEATLM 201
+L + + AP+S ++ + V TEG +V E TLM
Sbjct: 315 TLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE-TLM 357



Score = 37.1 bits (86), Expect = 1e-04
Identities = 22/199 (11%), Positives = 67/199 (33%), Gaps = 28/199 (14%)

Query: 55 PGRIEPV-RVAEVRARVAGIVVRKRFEEGADVKAGDLLFQIDP-------APLKAAVSRA 106
G++ R E++ IV +EG V+ GD+L ++ ++++ +A
Sbjct: 87 NGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQA 146

Query: 107 EGELARNRAVLFEAQARVRRY--------------EPLVKIQAVSQQDFDTATADLRSAE 152
E R + + + E ++++ ++ ++ F T E
Sbjct: 147 RLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKE 206

Query: 153 AATRSAQADLETARLNLGYASVTAPISGR---IGRALVTEGALVGQGEATLMARIQQLDP 209
+A+ T + + + +L+ + A+ + ++ + +
Sbjct: 207 LNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAI---AKHAVLEQENKYVE 263

Query: 210 IYADFTQTAAEALRLRDAL 228
+ ++ ++ +
Sbjct: 264 AVNELRVYKSQLEQIESEI 282


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4598ACRIFLAVINRP11690.0 Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1169 bits (3027), Expect = 0.0
Identities = 518/1028 (50%), Positives = 715/1028 (69%), Gaps = 8/1028 (0%)

Query: 1 MSEFFIKRPNFAWVVALFISLAGLLVISKLPVAQYPNVAPPQITITATYPGASAKVLVDS 60
M+ FFI+RP FAWV+A+ + +AG L I +LPVAQYP +APP ++++A YPGA A+ + D+
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTSVLEESLNGAKGLLYFESTNNSNGTAEIVVTFEPGTDPDLAQVDVQNRLKKAEARMPQ 120
VT V+E+++NG L+Y ST++S G+ I +TF+ GTDPD+AQV VQN+L+ A +PQ
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 AVLTQGLQVEQTSAGFLLIYALSYKEGAQRSDTTALGDYAARNINNELRRLPGVGKLQFF 180
V QG+ VE++S+ +L++ D + DY A N+ + L RL GVG +Q F
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDD--ISDYVASNVKDTLSRLNGVGDVQLF 178

Query: 181 SSEAAMRVWIDPQKLVGFGLSIDDVSNAIRGQNVQVPAGAFGSAPGSSAQELTATLAVKG 240
++ AMR+W+D L + L+ DV N ++ QN Q+ AG G P Q+L A++ +
Sbjct: 179 GAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQT 238

Query: 241 TLDDPQEFGQVVLRANEDGSLVRLADVARLELGKESYNISSRLNGTPTVGGAIQLSPGAN 300
+P+EFG+V LR N DGS+VRL DVAR+ELG E+YN+ +R+NG P G I+L+ GAN
Sbjct: 239 RFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGAN 298

Query: 301 AIQTATLVKQRLAELSAFFPEDMQYSVPYDTSRFVDVAIEKVIHTLIEAMVLVFLVMFLF 360
A+ TA +K +LAEL FFP+ M+ PYDT+ FV ++I +V+ TL EA++LVFLVM+LF
Sbjct: 299 ALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLF 358

Query: 361 LQNVRYTLIPSIVVPVCLLGTLMVMYLLGFSVNMMTMFGMVLAIGILVDDAIVVVENVER 420
LQN+R TLIP+I VPV LLGT ++ G+S+N +TMFGMVLAIG+LVDDAIVVVENVER
Sbjct: 359 LQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVER 418

Query: 421 IMAEEGISPAEATVKAMKQVSGAIVGITLVLSAVFLPLAFMAGSVGVIYQQFSVSLAVSI 480
+M E+ + P EAT K+M Q+ GA+VGI +VLSAVF+P+AF GS G IY+QFS+++ ++
Sbjct: 419 VMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAM 478

Query: 481 LFSGFLALTFTPALCATLLKPIPEGHHE-KRGFFGAFNRGFARVTERYSLLNSKLVARAG 539
S +AL TPALCATLLKP+ HHE K GFFG FN F Y+ K++ G
Sbjct: 479 ALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTG 538

Query: 540 RFMLVYAGLVAMLGYFYLRLPEAFVPAEDLGYMVVDVQLPPGASRVRTDATGEE-LERFL 598
R++L+YA +VA + +LRLP +F+P ED G + +QLP GA++ RT ++ + +L
Sbjct: 539 RYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYL 598

Query: 599 KSREA-VASVFLISGFSFSGQGDNAALAFPTFKDWSER-GAEQSAAAEIAALNEHFALPD 656
K+ +A V SVF ++GFSFSGQ NA +AF + K W ER G E SA A I
Sbjct: 599 KNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIR 658

Query: 657 DGTVMAVSPPPINGLGNSGGFALRLMDRSGVGREALLQARDTLLGEIQTNPKFLYAMM-E 715
DG V+ + P I LG + GF L+D++G+G +AL QAR+ LLG +P L ++
Sbjct: 659 DGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPN 718

Query: 716 GLAEAPQLRLLIDREKARALGVSFETISGTLSAAFGSEVINDFTNAGRQQRVVIQAEQGN 775
GL + Q +L +D+EKA+ALGVS I+ T+S A G +NDF + GR +++ +QA+
Sbjct: 719 GLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKF 778

Query: 776 RMTPESVLELYVPNAAGNLVPLSAFVSVKWEEGPVQLVRYNGYPSIRIVGDAAPGFSTGE 835
RM PE V +LYV +A G +VP SAF + W G +L RYNG PS+ I G+AAPG S+G+
Sbjct: 779 RMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGD 838

Query: 836 AMAEMERLASQLPAGIGYEWTGLSYQEKVSAGQATSLFALAILVVFLLLVALYESWSIPL 895
AMA ME LAS+LPAGIGY+WTG+SYQE++S QA +L A++ +VVFL L ALYESWSIP+
Sbjct: 839 AMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPV 898

Query: 896 SVMLIVPIGAIGAVLAVMVSGMSNDVYFKVGLITIIGLSAKNAILIVEFAKELWE-QGHS 954
SVML+VP+G +G +LA + NDVYF VGL+T IGLSAKNAILIVEFAK+L E +G
Sbjct: 899 SVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKG 958

Query: 955 LRDAAIEAARLRFRPIIMTSMAFILGVIPLALASGAGAASQRAIGTGVIGGMLSATFLGV 1014
+ +A + A R+R RPI+MTS+AFILGV+PLA+++GAG+ +Q A+G GV+GGM+SAT L +
Sbjct: 959 VVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAI 1018

Query: 1015 LFVPICFV 1022
FVP+ FV
Sbjct: 1019 FFVPVFFV 1026



Score = 94.5 bits (235), Expect = 1e-21
Identities = 92/506 (18%), Positives = 179/506 (35%), Gaps = 40/506 (7%)

Query: 541 FMLVYAGLVAMLGYF-YLRLPEAFVPAEDLGYMVVDVQLP-PGAS-RVRTDATGEELERF 597
F V A ++ M G L+LP A P + V V PGA + D + +E+
Sbjct: 11 FAWVLAIILMMAGALAILQLPVAQYP--TIAPPAVSVSANYPGADAQTVQDTVTQVIEQN 68

Query: 598 LKSREAVASVFLISGFSFSGQGDNAALAFPTFKDWSERGAEQSAAAEIAALNEHFALPDD 657
+ + ++ +S S S L F + D A+ ++ LP +
Sbjct: 69 MNG---IDNLMYMSSTSDSAGSVTITLTFQSGTD--PDIAQVQVQNKLQLATP--LLPQE 121

Query: 658 GTVMAVSPPPINGLGNSGGFALRLMDRSGVGREALLQARDTLLGEIQTNPKFLYAMMEGL 717
V I+ +S + + S D + +N K + + G+
Sbjct: 122 -----VQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDY----VASNVKDTLSRLNGV 172

Query: 718 AEAP------QLRLLIDREKARALGVSFETISGTLSAA---FGSEVINDFTNAGRQQRVV 768
+ +R+ +D + ++ + L + + QQ
Sbjct: 173 GDVQLFGAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNA 232

Query: 769 IQAEQGNRMTPESVLELYVP-NAAGNLVPLSAFVSVKW-EEGPVQLVRYNGYPSIRIVGD 826
Q PE ++ + N+ G++V L V+ E + R NG P+ +
Sbjct: 233 SIIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIK 292

Query: 827 AAPGFSTGEA----MAEMERLASQLPAGIGYEWTGLSYQEKVSAGQATSLFAL--AILVV 880
A G + + A++ L P G+ + V + L AI++V
Sbjct: 293 LATGANALDTAKAIKAKLAELQPFFPQGMKVLYP-YDTTPFVQLSIHEVVKTLFEAIMLV 351

Query: 881 FLLLVALYESWSIPLSVMLIVPIGAIGAVLAVMVSGMSNDVYFKVGLITIIGLSAKNAIL 940
FL++ ++ L + VP+ +G + G S + G++ IGL +AI+
Sbjct: 352 FLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIV 411

Query: 941 IVE-FAKELWEQGHSLRDAAIEAARLRFRPIIMTSMAFILGVIPLALASGAGAASQRAIG 999
+VE + + E ++A ++ ++ +M IP+A G+ A R
Sbjct: 412 VVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFS 471

Query: 1000 TGVIGGMLSATFLGVLFVPICFVWLL 1025
++ M + + ++ P LL
Sbjct: 472 ITIVSAMALSVLVALILTPALCATLL 497


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4596HTHTETR358e-05 TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 34.6 bits (79), Expect = 8e-05
Identities = 21/141 (14%), Positives = 50/141 (35%), Gaps = 8/141 (5%)

Query: 7 ATMGELAELAGVSRATLNRHCGTREGL-KRRLESHARSTLERLTHSAALQRLEPREALRE 65
++GE+A+ AGV+R + H + L E + E A +P LRE
Sbjct: 32 TSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPGDPLSVLRE 91

Query: 66 LIREHL-------AQRDLLALLMFEQNPGRQAGHGDASWQSYVEALDAFFLRGQQKRVFR 118
++ L +R L+ ++ + + + ++ + + +
Sbjct: 92 ILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEA 151

Query: 119 IDISAATFSELFIVLIYGMVD 139
+ A + +++ G +
Sbjct: 152 KMLPADLMTRRAAIIMRGYIS 172


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4593PF05844290.035 YopD protein
		>PF05844#YopD protein

Length = 295

Score = 28.8 bits (64), Expect = 0.035
Identities = 34/123 (27%), Positives = 51/123 (41%), Gaps = 21/123 (17%)

Query: 226 VSPGADVYSVGAALGAALTARLPGHEAQV---QVSQQVLDGLKRQTRTFTYLLAGLGIIS 282
++PGA SVG AA ++P A +QVLD R + L + + ++
Sbjct: 20 IAPGAAGRSVGTPQAAAELPQVPAARADRVELNAPRQVLDP-VRMEAAGSELDSSVELLL 78

Query: 283 LLGGGVGVMNVMLMSVAERRREIGVRMALGARQRDIRNLFLIEAVTLTAAGALSGAVLGV 342
+L +A++ RE+GV QRD N +I A SGA L +
Sbjct: 79 IL-----------FRIAQKARELGVL------QRDNENQAIIHAQKAQVDEMRSGATLMI 121

Query: 343 AAA 345
A A
Sbjct: 122 AMA 124


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4591RTXTOXIND584e-11 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 57.5 bits (139), Expect = 4e-11
Identities = 47/373 (12%), Positives = 112/373 (30%), Gaps = 90/373 (24%)

Query: 65 GRVVTLAAPFAGNVEALLVEPGQRVAEGQELL-------RMDTREIAVQVREAQSALLKA 117
GR + V+ ++V+ G+ V +G LL DT + + +A+ +
Sbjct: 94 GRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRY 153

Query: 118 RRTLQDL---------------------RDWERGEDMARVRRALRSAQLAQSST------ 150
+ + + + R + + + + Q Q
Sbjct: 154 QILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKR 213

Query: 151 --------------------ERKLRETRELFQRGIVPRNELDDLEQQASQQRMDLEAARR 190
+ +L + L + + ++ + + E + + +L +
Sbjct: 214 AERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKS 273

Query: 191 EVESTRAKGQGENRQI-----------------AEMDLANASVKYETLQAQLDGRTVRAP 233
++E ++ + ++ +++ + + +RAP
Sbjct: 274 QLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAP 333

Query: 234 FAGIVVAAPGLAGEQGSREPVQAGSKLGQGQALFGLA-SVERLKVSAKVSELDINQLREG 292
+ V + G + + L + + L+V+A V DI + G
Sbjct: 334 VSVKVQ-------QLKVHTE---GGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVG 383

Query: 293 QAVEISGDGFEGTT---LAGVIAALGGQALPGMAQGGSPQFEVTVSV---APLDPRQLQK 346
Q I + F T L G + + A+ G F V +S+ +
Sbjct: 384 QNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLG--LVFNVIISIEENCLSTGNKNIP 441

Query: 347 IRLGMSAKLTVTT 359
+ GM+ + T
Sbjct: 442 LSSGMAVTAEIKT 454


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4589OMADHESIN300.021 Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 29.9 bits (66), Expect = 0.021
Identities = 12/27 (44%), Positives = 19/27 (70%)

Query: 353 YRNTWTLAVGGDYKVTDQWTMRAGVAY 379
YR++ LA+G Y+V + ++AGVAY
Sbjct: 413 YRSSQALAIGSGYRVNENVALKAGVAY 439


81PA4558PA4534N        Y        YPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA4558111-3.985954FkbP-type peptidyl-prolyl cis-trans isomerase
PA4557212-4.1305854-hydroxy-3-methylbut-2-enyl diphosphate
PA4556112-4.276515type 4 fimbrial biogenesis protein PilE
PA4555013-4.354654type 4 fimbrial biogenesis protein PilY2
PA4554013-4.227030type 4 fimbrial biogenesis protein PilY1
PA4553016-2.595296type 4 fimbrial biogenesis protein PilX
PA4552012-1.813231type 4 fimbrial biogenesis protein PilW
PA4551011-1.363845type 4 fimbrial biogenesis protein PilV
PA4550010-1.329337type 4 fimbrial biogenesis protein FimU
PA454909-0.887469type 4 fimbrial biogenesis protein FimT
PA4548-18-0.058989D-amino acid oxidase
PA454708-0.793360two-component response regulator PilR
PA4546-111-1.180825two-component sensor PilS
PA4545-212-1.061708competence protein ComL
PA4544-213-1.129514pseudouridine synthase
PA4543-212-1.101810hypothetical protein
PA4542-213-1.495530chaperone protein ClpB
PA4541018-1.218273***hypothetical protein
PA4540-115-0.980588hypothetical protein
PA4539-116-1.004086hypothetical protein
PA4538-313-0.402196NADH dehydrogenase
PA4537015-0.629946hypothetical protein
PA4536017-0.836586hypothetical protein
PA4535118-1.211985hypothetical protein
PA4534016-0.757201hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4558INFPOTNTIATR341e-04 Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 33.8 bits (77), Expect = 1e-04
Identities = 21/54 (38%), Positives = 33/54 (61%), Gaps = 5/54 (9%)

Query: 8 GQESRVTLHFALKLEDGNVVDSTFDK--QPASFKVGDGNLLPGFEQALFGLKAG 59
G+ VT+ + L DG V DST +K +PA+F+V ++PG+ +AL + AG
Sbjct: 142 GKSDTVTVEYTGTLIDGTVFDST-EKAGKPATFQV--SQVIPGWTEALQLMPAG 192


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4557PF06704280.033 DspF/AvrF protein
		>PF06704#DspF/AvrF protein

Length = 129

Score = 27.5 bits (61), Expect = 0.033
Identities = 10/28 (35%), Positives = 18/28 (64%), Gaps = 2/28 (7%)

Query: 194 KNDIC--YATQNRQDAVKELADQCDMVL 219
+N +C Y +Q+ + AV E+ D +MV+
Sbjct: 26 QNGVCALYDSQDNEAAVIEMPDHSEMVI 53


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4556BCTERIALGSPG421e-07 Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 41.8 bits (98), Expect = 1e-07
Identities = 18/49 (36%), Positives = 31/49 (63%), Gaps = 3/49 (6%)

Query: 4 RQKGFTLLEMVVVVAVIGILLGIAIPSYQNYVIRSNRTEGQALLSDAAA 52
+Q+GFTLLE++VV+ +IG+L + +P N + + + Q +SD A
Sbjct: 6 KQRGFTLLEIMVVIVIIGVLASLVVP---NLMGNKEKADKQKAVSDIVA 51


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4551PilS_PF08805300.003 PilS N terminal
		>PilS_PF08805#PilS N terminal

Length = 185

Score = 30.3 bits (68), Expect = 0.003
Identities = 11/58 (18%), Positives = 24/58 (41%)

Query: 3 LKSRHRSLHQSGFSMIEVLVALLLISIGVLGMIAMQGKTIQYTADSVERNKAAMLGSN 60
L +R + G +++EVL+ + +I + + S E+N + +N
Sbjct: 16 LSARRKKEQDKGATLMEVLLVVGVIVVLAASAYKLYSMVQSNIQSSNEQNNVLTVIAN 73


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4550BCTERIALGSPG415e-07 Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 40.6 bits (95), Expect = 5e-07
Identities = 14/45 (31%), Positives = 30/45 (66%)

Query: 8 TGFTLIELLIIVVLLAIMASFAIPNFKQLTERNELQSAAEELNAM 52
GFTL+E+++++V++ ++AS +PN E+ + Q A ++ A+
Sbjct: 8 RGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVAL 52


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4549BCTERIALGSPG332e-04 Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 32.9 bits (75), Expect = 2e-04
Identities = 15/40 (37%), Positives = 25/40 (62%), Gaps = 4/40 (10%)

Query: 4 RSQRALTLTELLFALVLLGILGSLALPGMAAWLDGNRERS 43
QR TL E++ +V++G+L SL +P L GN+E++
Sbjct: 5 DKQRGFTLLEIMVVIVIIGVLASLVVPN----LMGNKEKA 40


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4547HTHFIS5250.0 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 525 bits (1355), Expect = 0.0
Identities = 176/477 (36%), Positives = 261/477 (54%), Gaps = 33/477 (6%)

Query: 1 MSRQKALIVDDEPDIRELLEITLGRMKLDTRSARNVKEARELLAREPFDLCLTDMRLPDG 60
M+ L+ DD+ IR +L L R D R N +A DL +TD+ +PD
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 SGLDLVQYIQQRHPQTPVAMITAYGSLDTAIQALKAGAFDFLTKPVDLGRLRELVATALR 120
+ DL+ I++ P PV +++A + TAI+A + GA+D+L KP DL L ++ AL
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 121 LRNPEAEEAPVDNR----LLGESPPMRALRNQIGKLARSQAPVYISGESGSGKELVARLI 176
+ D++ L+G S M+ + + +L ++ + I+GESG+GKELVAR +
Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARAL 180

Query: 177 HEQGPRIERPFVPVNCGAIPSELMESEFFGHKKGSFTGAIEDKQGLFQAASGGTLFLDEV 236
H+ G R PFV +N AIP +L+ESE FGH+KG+FTGA G F+ A GGTLFLDE+
Sbjct: 181 HDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEI 240

Query: 237 ADLPMAMQVKLLRAIQEKAVRAVGGQQEVAVDVRILCATHKDLAAEVGAGRFRQDLYYRL 296
D+PM Q +LLR +Q+ VGG+ + DVRI+ AT+KDL + G FR+DLYYRL
Sbjct: 241 GDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRL 300

Query: 297 NVIELRVPPLRERREDIPLLAERILKRLAGDTGLPAARLTGDAQEKLKNYRFPGNVRELE 356
NV+ LR+PPLR+R EDIP L +++ + GL R +A E +K + +PGNVRELE
Sbjct: 301 NVVPLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALELMKAHPWPGNVRELE 359

Query: 357 NMLERAYTLCEDDQIQPHDLRL---------ADAPGASQEGAASLSEI------------ 395
N++ R L D I + A++ G+ S+S+
Sbjct: 360 NLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASF 419

Query: 396 -------DNLEDYLEDIERKLIMQALEETRWNRTAAAQRLGLTFRSMRYRLKKLGID 445
+ L ++E LI+ AL TR N+ AA LGL ++R ++++LG+
Sbjct: 420 GDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4542HTHFIS434e-06 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 43.3 bits (102), Expect = 4e-06
Identities = 50/266 (18%), Positives = 94/266 (35%), Gaps = 45/266 (16%)

Query: 551 MLEGEREKLLRMEQELHRRVIGQDEAVVAVSNAVRRSRAGLADPNRPSGSFLFLGPTGVG 610
+ KL Q+ ++G+ A+ + + R L + + G +G G
Sbjct: 121 EPKRRPSKLEDDSQDGMP-LVGRSAAMQEIYRVLAR----LMQTDLT---LMITGESGTG 172

Query: 611 KTELCKALAEFLFDTEEALVRIDMSEFMEKHSVARLIGAPPGYVGFEEGGYLTEAIRRKP 670
K + +AL ++ V I+M+ + L G E G T A R
Sbjct: 173 KELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGH--------EKGAFTGAQTRST 224

Query: 671 YSV-------VLLDEVEKAHPDVFNILLQVLEDG---RLTDSHGRTVDFRNTVVVMTSNL 720
+ LDE+ D LL+VL+ G + D R +V +N
Sbjct: 225 GRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVR---IVAATN- 280

Query: 721 GSAQIQELAGDREAQRAAVMDAVNAHFRPEFINRIDEVVVFEPLAREQIAGIAEIQLGRL 780
++L + ++ FR + R++ V + P R++ I ++ +
Sbjct: 281 -----KDL-------KQSINQ---GLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFV 325

Query: 781 RKRLAERELSLELSQEALDKLIAVGF 806
++ E QEAL+ + A +
Sbjct: 326 QQAEKEGLDVKRFDQEALELMKAHPW 351



Score = 34.4 bits (79), Expect = 0.002
Identities = 25/177 (14%), Positives = 59/177 (33%), Gaps = 32/177 (18%)

Query: 49 LLMQVGFDIAALRSGLNKELDALPKIQSPTGDVNLSQDLARLLNQADRLAQQKGDQFISS 108
L + G+D+ + I + GD+ ++ D+ + + +
Sbjct: 22 ALSRAGYDVRITSNAA----TLWRWIAAGDGDLVVT-DV--------VMPDENAFDLLPR 68

Query: 109 ELVLLAAMDENTRLGKLLLGQGVSRKALENAVANLRGGEA-------VNDPNVEESRQAL 161
+ + + ++ Q A+ G + +AL
Sbjct: 69 ----IKKARPDLPV-LVMSAQN----TFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119

Query: 162 DKYTVDMTKRAEEG-KLDPVIGRDDEIRRTIQVLQRRTKNN-PVLI-GEPGVGKTAI 215
+ +K ++ P++GR ++ +VL R + + ++I GE G GK +
Sbjct: 120 AEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELV 176


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4541PF05860651e-14 haemagglutination activity domain.
		>PF05860#haemagglutination activity domain.

Length = 117

Score = 65.2 bits (159), Expect = 1e-14
Identities = 31/93 (33%), Positives = 52/93 (55%), Gaps = 5/93 (5%)

Query: 71 TDGRHMVID---QQSHKLITNWNEFSVRADERVSFHQPGQDAVALNRVIGRNGSDIQGRI 127
T+G +I+ Q L ++ EFSV F+ P ++RV G + S+I G I
Sbjct: 17 TEGNTRIIERGTQAGSNLFHSFQEFSVPTSGTAFFNNPTNIQNIISRVTGGSVSNIDGLI 76

Query: 128 DANGK--VFLVNPNGVVFGKSAQVNVGGLVAST 158
AN +FL+NPNG++FG++A++++GG +
Sbjct: 77 RANATANLFLINPNGIIFGQNARLDIGGSFVGS 109


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4540PF00577372e-04 Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 37.1 bits (86), Expect = 2e-04
Identities = 32/226 (14%), Positives = 69/226 (30%), Gaps = 19/226 (8%)

Query: 256 QRYYRAAYQLPLGSRGTRIGLAHAETTYRLVRDFSRLDAHGRAITDSLFVSQPLLRSRSL 315
Y+ A G I + A+ + L V+Q L R+ +L
Sbjct: 484 SGYFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTL 543

Query: 316 SLS-TQLQYENKRLRDDQERTG-RHSRKEIRLWTASISGNAQDRLFGGGQS-----GFSL 368
LS + Y D+Q + G + ++I ++S + + G+ ++
Sbjct: 544 YLSGSHQTYWGTSNVDEQFQAGLNTAFEDINW---TLSYSLTKNAWQKGRDQMLALNVNI 600

Query: 369 AYAHGQLAIDSGEERLLDRYTIGTAGSFDKIMLNAVRLQHLGDRLQLFAQLNAQWSGGNL 428
++H + + R + ++ A L + L + ++GG
Sbjct: 601 PFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGD 660

Query: 429 DSAEQFDMG-----GPYGVRAFPLGSYKGYGDEGWQASAELRYSLA 469
++ G YG + D+ Q + +
Sbjct: 661 GNSGSTGYATLNYRGGYGN----ANIGYSHSDDIKQLYYGVSGGVL 702


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA45362FE2SRDCTASE270.041 Ferric iron reductase signature.
		>2FE2SRDCTASE#Ferric iron reductase signature.

Length = 262

Score = 26.5 bits (58), Expect = 0.041
Identities = 10/24 (41%), Positives = 12/24 (50%)

Query: 36 DHPHPPRQVTLVQWEHIEALGTLL 59
D P P +TL QW L +LL
Sbjct: 47 DEPAPLNAMTLAQWSSPNVLSSLL 70


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4534SACTRNSFRASE280.013 Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 27.6 bits (61), Expect = 0.013
Identities = 5/25 (20%), Positives = 10/25 (40%)

Query: 66 RRGYLQHLVVDPGYRGLGLARRMLD 90
++ + V YR G+ +L
Sbjct: 88 GYALIEDIAVAKDYRKKGVGTALLH 112


82PA4529PA4524N        Y        YPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA4529-113-3.663605dephospho-CoA kinase
PA4528-114-4.058296type 4 prepilin peptidase PilD
PA4526-115-3.537049type 4 fimbrial biogenesis protein PilB
PA4525-314-2.079031type 4 fimbrial protein PilA
PA4524-18-1.736565*nicotinate-nucleotide pyrophosphorylase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4529DHBDHDRGNASE300.005 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 30.0 bits (67), Expect = 0.005
Identities = 23/88 (26%), Positives = 32/88 (36%), Gaps = 11/88 (12%)

Query: 5 WILGLTGGIGSGKSAAAEHFISLGVHLVDADHAARW--VVEPGRPALAKIVERFGDGILL 62
+I G GIG A A S G H+ D+ V A A+ E F
Sbjct: 12 FITGAAQGIGE---AVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF------ 62

Query: 63 PDGQLDRAALRERIFQAPEERRWLEQLL 90
P D AA+ E + E ++ L+
Sbjct: 63 PADVRDSAAIDEITARIEREMGPIDILV 90


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4528PREPILNPTASE353e-125 Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family

signature.
Length = 290

Score = 353 bits (908), Expect = e-125
Identities = 165/283 (58%), Positives = 195/283 (68%), Gaps = 1/283 (0%)

Query: 3 LLDYLASHPLAFVLCTILLGLLVGSFLNVVVHRLPKMMERNWKAEAREALGLEPE-PKQA 61
LL+ P + L L++GSFLNVV+HRLP M+ER W+AE R + E +
Sbjct: 4 LLELAHGLPWLYFSLVFLFSLMIGSFLNVVIHRLPIMLEREWQAEYRSYFNPDDEGVDEP 63

Query: 62 TYNLVLPNSACPRCGHEIRPWENIPLVSYLALGGKCSSCKAAIGKRYPLVELATALLSGY 121
YNL++P S CP C H I ENIPL+S+L L G+C C+A I RYPLVEL TALLS
Sbjct: 64 PYNLMVPRSCCPHCNHPITALENIPLLSWLWLRGRCRGCQAPISARYPLVELLTALLSVA 123

Query: 122 VAWHFGFTWQAGAMLLLTWGLLAMSLIDADHQLLPDVLVLPLLWLGLIANHFGLFASLDD 181
VA W A LLLTW L+A++ ID D LLPD L LPLLW GL+ N G F SL D
Sbjct: 124 VAMTLAPGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGD 183

Query: 182 ALFGAVFGYLSLWSVFWLFKLVTGKEGMGYGDFKLLAMLGAWGGWQILPLTILLSSLVGA 241
A+ GA+ GYL LWS++W FKL+TGKEGMGYGDFKLLA LGAW GWQ LP+ +LLSSLVGA
Sbjct: 184 AVIGAMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGA 243

Query: 242 ILGVIMLRLRNAESGTPIPFGPYLAIAGWIALLWGDQITRTYL 284
+G+ ++ LRN PIPFGPYLAIAGWIALLWGD ITR YL
Sbjct: 244 FMGIGLILLRNHHQSKPIPFGPYLAIAGWIALLWGDSITRWYL 286


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4525BCTERIALGSPG501e-10 Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 49.9 bits (119), Expect = 1e-10
Identities = 17/54 (31%), Positives = 34/54 (62%)

Query: 1 MKAQKGFTLIELMIVVAIIGILAAIAIPQYQNYVARSEGASALATINPLKTTVE 54
Q+GFTL+E+M+V+ IIG+LA++ +P +++ A++ I L+ ++
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALD 57


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4524RTXTOXIND290.020 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 29.0 bits (65), Expect = 0.020
Identities = 23/141 (16%), Positives = 45/141 (31%), Gaps = 4/141 (2%)

Query: 75 QVEDGQRVEPNQMLFQLKGP-ARALLTGERSALNFLQLLSGTATRSQHYADLVAGTAVKL 133
V++G+ V +L +L A A +S+L +L TR Q + + +
Sbjct: 111 IVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARL---EQTRYQILSRSIELNKLPE 167

Query: 134 LDTRKTLPGLRLAQKYAVTCGGCHNHRIGLYDAFLIKENHIAACGGIDRAIAEARRIAPG 193
L ++++ + + + ++ +R AR
Sbjct: 168 LKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYE 227

Query: 194 KPVEVEVENLDELRQALEAGA 214
VE LD+ L A
Sbjct: 228 NLSRVEKSRLDDFSSLLHKQA 248


83PA4446PA4440N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA4446010-1.822335AlgW protein
PA4445010-2.016779hypothetical protein
PA4444-29-1.607464murein hydrolase B
PA4443-110-1.877233sulfate adenylyltransferase subunit 2
PA4442-210-1.319202bifunctional sulfate adenylyltransferase subunit
PA444109-1.213639hypothetical protein
PA4440-18-1.853908hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4446V8PROTEASE612e-12 V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 60.8 bits (147), Expect = 2e-12
Identities = 33/163 (20%), Positives = 52/163 (31%), Gaps = 35/163 (21%)

Query: 118 LLTNNHVTAGADQIIVALR------------DGRETIAQLVGSDPETDLAVLKIDL---- 161
LLTN HV AL+ +G T Q+ E DLA++K
Sbjct: 114 LLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEGDLAIVKFSPNEQN 173

Query: 162 ----KNLPAMTLGRSDGIRTGDVCLAIGNPFGVGQTVTMGIISATGRNQLGLNTYEDFIQ 217
+ + T+ + + G P TM + G+ L +Q
Sbjct: 174 KHIGEVVKPATMSNNAETQVNQNITVTGYPGD-KPVATMW--ESKGK-ITYLKGE--AMQ 227

Query: 218 TDAAINPGNSGGALVDAAGNLIGINTAIFSKSGGSQGIGFAIP 260
D + GNSG + + +IGI+ G+
Sbjct: 228 YDLSTTGGNSGSPVFNEKNEVIGIHWG---------GVPNEFN 261


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4443TCRTETOQM280.046 Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 28.3 bits (63), Expect = 0.046
Identities = 17/90 (18%), Positives = 33/90 (36%), Gaps = 14/90 (15%)

Query: 94 GVAQG-INPFTHGSAKHTDVMKTEGLKQALDKYGFDAAFGGARRDEEKSRAKERVYSFRD 152
+ P HGSAK G+ ++ F + + +++ F
Sbjct: 207 RFHNCSLFPVYHGSAK-----NNIGIDNLIE--VITNKFYSS---THRGQSELCGKVF-- 254

Query: 153 SKHRWDPKNQRPELWNIYNGKVKKGESIRV 182
K + K QR +Y+G + +S+R+
Sbjct: 255 -KIEYSEKRQRLAYIRLYSGVLHLRDSVRI 283


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4442TCRTETOQM685e-14 Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 68.0 bits (166), Expect = 5e-14
Identities = 53/150 (35%), Positives = 67/150 (44%), Gaps = 17/150 (11%)

Query: 33 VDDGKSTLIGRLLHDSKMIYEDHLEAITRDSKKVGTTGDDVDLALLVDGLQAEREQGITI 92
VD GK+TL LL++S I E K T D+ L ER++GITI
Sbjct: 12 VDAGKTTLTESLLYNSGAITE------LGSVDKGTTRTDNTLL---------ERQRGITI 56

Query: 93 DVAYRYFSTAKRKFIIADTPGHEQYTRNMATGASTCDLAIILIDARYGVQTQTRRHSFIA 152
F K I DTPGH + + S D AI+LI A+ GVQ QTR
Sbjct: 57 QTGITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHAL 116

Query: 153 SLLGIRHIVVAINKMDLKDFD-QGVFEQIK 181
+GI I INK+D D V++ IK
Sbjct: 117 RKMGIPTIFF-INKIDQNGIDLSTVYQDIK 145


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4440FLAGELLIN310.004 Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 30.8 bits (69), Expect = 0.004
Identities = 13/42 (30%), Positives = 20/42 (47%), Gaps = 5/42 (11%)

Query: 55 SARD--AGLA---TLRFNFRGVGQSAGSYGEGIGEIDDAEAA 91
SA+D AG A N +G+ Q++ + +GI E A
Sbjct: 39 SAKDDAAGQAIANRFTSNIKGLTQASRNANDGISIAQTTEGA 80


84PA4310PA4296N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA4310-19-1.031105chemotactic transducer PctB
PA4309-17-0.671721chemotactic transducer PctA
PA4308-170.215325hypothetical protein
PA4307080.045758chemotactic transducer PctC
PA43060131.084976type IVb pilin Flp
PA43052131.304791hypothetical protein
PA43042131.435001type II/III secretion system protein
PA43033121.711720hypothetical protein
PA43021150.101584ATPase TadA
PA4301116-0.432600type II secretion system protein TadB
PA4300020-2.319386type II secretion system protein TadC
PA4299024-2.949108type II secretion system protein TadD
PA4298-120-1.589414hypothetical protein
PA4297-215-1.925337hypothetical protein
PA4296-115-1.838349two-component response regulator PprB
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4310GPOSANCHOR300.030 Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 30.0 bits (67), Expect = 0.030
Identities = 25/209 (11%), Positives = 63/209 (30%), Gaps = 11/209 (5%)

Query: 342 RFVERIHESIREVAGTARQLHDVAQLVVNASNSSMANSDEQSNRTNSVAAAINELGAAAQ 401
+ +E + + L + + N + + +A I L A
Sbjct: 127 KALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKA 186

Query: 402 EIARNAADASHHASDANHQ-AEDGKQVVEQTIRAMNELSEKISASCANIEALNSRTVNIG 460
+A + + A + I+ + ++A A++E +N
Sbjct: 187 A-----LEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFS 241

Query: 461 QILEVIKGISEQTN--LLALNAAIEAARAGEAGRGFAVVADEVRNLAHRAQESAQQIQKM 518
E L A A +E A A + +++ L + +
Sbjct: 242 TADSAKIKTLEAEKAALEARQAELEKALE-GAMNFSTADSAKIKTLEAEKAALEAEKADL 300

Query: 519 IEELQI--GAQEAVSTMTESQRYSLESVE 545
+ Q+ ++++ ++ R + + +E
Sbjct: 301 EHQSQVLNANRQSLRRDLDASREAKKQLE 329


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4309RTXTOXINA310.018 Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family

signature.
Length = 1024

Score = 31.1 bits (70), Expect = 0.018
Identities = 24/167 (14%), Positives = 58/167 (34%), Gaps = 8/167 (4%)

Query: 308 GRAMQDIAQGEGDLTKRLAVTSRDEFGVLGDAFN---QFVERIHRSIREVAGTAHKLHDV 364
G ++ D+ + +L + ++ + F + + R + A KL
Sbjct: 61 GSSLNDLVRTADELGIEVQYDEKNGTAITKQVFGTAEKLIGLTERGVTIFAPQLDKLLQK 120

Query: 365 SQLVVNASNSSMANSDEQSNRTNSVAAAI-NELGAAAQEIARNAADASHHASDANHQAED 423
Q N N + + + + N LG A + + + +E
Sbjct: 121 YQKAGNILGGGAENIGDNLGKAGGILSTFQNFLGTALSSMKIDELIKKQKSGGNVSSSEL 180

Query: 424 GKQVVEQTIRAMNELSEKISASCANIEALNSRTVNIGQILEVIKGIS 470
K +E N+L + +++ N+ + + + +G +L K ++
Sbjct: 181 AKASIELI----NQLVDTVASLNNNVNSFSQQLNTLGSVLSNTKHLN 223


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4307RTXTOXINA310.019 Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family

signature.
Length = 1024

Score = 30.7 bits (69), Expect = 0.019
Identities = 29/179 (16%), Positives = 65/179 (36%), Gaps = 8/179 (4%)

Query: 299 LIRVLMQPLTDMGRAMQDIAQGEGDLTKRLKVTSNDEFGTLANAFNRFVERIHESIREVA 358
LI ++ + G ++ D+ + +L ++ + F + I + R V
Sbjct: 49 LILLIPKDYKGQGSSLNDLVRTADELGIEVQYDEKNGTAITKQVFGTAEKLIGLTERGVT 108

Query: 359 GTARQLHDVAQLVVNASN---SSMANSDEQSNRTNSVAAAI-NELGAAAQEIARNAADAS 414
A QL + Q A N N + + + + N LG A + +
Sbjct: 109 IFAPQLDKLLQKYQKAGNILGGGAENIGDNLGKAGGILSTFQNFLGTALSSMKIDELIKK 168

Query: 415 HHASDANHQAEDGKQVVEQTIRAMNELSEKISASCANIEALNSRTVNIGQILEVIKGIS 473
+ +E K +E N+L + +++ N+ + + + +G +L K ++
Sbjct: 169 QKSGGNVSSSELAKASIELI----NQLVDTVASLNNNVNSFSQQLNTLGSVLSNTKHLN 223


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4304BCTERIALGSPD1462e-40 Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D

signature.
Length = 660

Score = 146 bits (369), Expect = 2e-40
Identities = 67/253 (26%), Positives = 109/253 (43%), Gaps = 15/253 (5%)

Query: 131 PNQVQTDIRFVEVSRSKLKQASTSFVRRGGNLWVLG------APGSLGDIKVNADGSGLG 184
QV + EV + + + + + G + N DG+ +
Sbjct: 344 RPQVLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTNSGLPISTAIAGANQYNKDGT-VS 402

Query: 185 GTFGTGSSGFNLIFGG---GKWLSFMNALEGSGFAYTLARPSLVAMSGQSASFLAGGEFP 241
+ + S FN I G G W + AL S LA PS+V + A+F G E P
Sbjct: 403 SSLASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVP 462

Query: 242 IPVP--NGTNDNV--TIEYKEFGIRLTLTPTVMNNRRIALKVAPEVSELDYSAGIQSGGV 297
+ + DN+ T+E K GI+L + P + + L++ EVS + +A S +
Sbjct: 463 VLTGSQTTSGDNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDL 522

Query: 298 AVPALRVRRTDTSVMLADGESFVISGLTSSNSVSNVDKFPWLGDIPILGAFFRSTKLDKD 357
R + +V++ GE+ V+ GL + DK P LGDIP++GA FRST
Sbjct: 523 GA-TFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVS 581

Query: 358 DRELLMIVTPHLV 370
R L++ + P ++
Sbjct: 582 KRNLMLFIRPTVI 594


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4303HTHFIS330.002 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 32.9 bits (75), Expect = 0.002
Identities = 22/105 (20%), Positives = 40/105 (38%), Gaps = 10/105 (9%)

Query: 18 LQNSLASAG-QVVPAGSASLEELLALLDVTAAGVLFISL---GKSNLVSQGALVEGLVSA 73
L +L+ AG V +A L + ++ + ++ L+ + A
Sbjct: 19 LNQALSRAGYDVRITSNA--ATLWRWIAAGDGDLVVTDVVMPDENAF----DLLPRIKKA 72

Query: 74 RPMLSVVAIGDGLDNQLVLAAMRAGARDFITYGARASELTGLIRR 118
RP L V+ + + A GA D++ +EL G+I R
Sbjct: 73 RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4298TYPE3OMGPROT270.013 Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein

family signature.
Length = 607

Score = 26.8 bits (59), Expect = 0.013
Identities = 10/20 (50%), Positives = 13/20 (65%)

Query: 4 RILFGVLLLLSGTAWAADTP 23
R+L G LLLLS +WA +
Sbjct: 11 RVLTGTLLLLSSYSWAQELD 30


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4297BCTERIALGSPC300.031 Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C

signature.
Length = 272

Score = 29.5 bits (66), Expect = 0.031
Identities = 17/100 (17%), Positives = 34/100 (34%), Gaps = 1/100 (1%)

Query: 20 LLLALICLLLVVDTGRLYLEQRNLQRVADVAALESASQGALCGDQSSAQATSFAKASAML 79
LL+ L C L + R+ L + ++ Q D + + + L
Sbjct: 21 LLMLLFCQQLAMIFWRIGLPDNAPVSSVQITPAQARQQPVTLNDFTLFGVSPEKNKAGAL 80

Query: 80 N-GFDADAAGSSLSAEVGGVLSAGGLRSFIASASNAAVAN 118
+ ++ S+L+ + GV++ IA S
Sbjct: 81 DASQMSNLPPSTLNLSLTGVMAGDDDSRSIAIISKDNEQF 120


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4296HTHFIS744e-17 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 74.1 bits (182), Expect = 4e-17
Identities = 31/156 (19%), Positives = 66/156 (42%), Gaps = 4/156 (2%)

Query: 10 SVLIIDDEPQVTSELRELLENSGYRCVTSTHRESAIASFQADPNIGLVICDLYLGQDNGI 69
++L+ DD+ + + L + L +GY +++ + A LV+ D+ + +N
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDG-DLVVTDVVMPDENAF 63

Query: 70 RLIESLKEVAGNGRFFESIILTGHDGRQEVIEAMRVGAADYYQKPVAPQELLHGLERLES 129
L+ +K+ + ++++ + I+A GA DY KP EL+ + R +
Sbjct: 64 DLLPRIKKARPDLPV---LVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 130 RLHERVRSQLSLSHVNQRLEYLAESLNSIYRDIHKI 165
R S L + ++ IYR + ++
Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARL 156


85PA4282PA4276N        Y        YPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA4282320-2.678091exonuclease
PA4281-130-5.272952exonuclease SbcD
PA4280035-6.972959**biotin--protein ligase
PA4279033-7.220978pantothenate kinase
PA4278134-7.461012hypothetical protein
PA4277338-7.789055***elongation factor Tu
PA4276340-8.003651*preprotein translocase subunit SecE
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4282RTXTOXIND443e-06 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 44.0 bits (104), Expect = 3e-06
Identities = 39/231 (16%), Positives = 75/231 (32%), Gaps = 27/231 (11%)

Query: 628 DQEQVRAEQSLERLRQTLVGLREGYSSQRERLNQSRQEQQELTGQLAALDR-QLDQWTLP 686
+ E VR L +L G + L Q+R EQ +++ +L + LP
Sbjct: 114 EGESVRKGDVLLKLTAL--GAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLP 171

Query: 687 EELRLLQPSAQLEWLAQRLDDLAGQRQQCQRDFDRLIARQRQTQQLQQELRAAETILQQR 746
+E S + L ++ Q Q Q + L +
Sbjct: 172 DEPYFQNVSEEEVLRLTSLIK------------EQFSTWQNQKYQKELNLDK----KRAE 215

Query: 747 QQALTEQRQRYEHLQQQVEEDSQQLRPLLSDEHWQRWQADPLRTFQALGESIEQRRQQQA 806
+ + + RYE+L + + LL + + L E++ + R ++
Sbjct: 216 RLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAV--LEQENKYVEAVNELRVYKS 273

Query: 807 RLQQIEQRLQELKQRCDESSWQLKQSDEQRNEARQAEERAQAELAELNGRL 857
+L+QIE + K+ + +NE + + L L
Sbjct: 274 QLEQIESEILSAKEE------YQLVTQLFKNEILDKLRQTTDNIGLLTLEL 318



Score = 38.3 bits (89), Expect = 2e-04
Identities = 24/178 (13%), Positives = 60/178 (33%), Gaps = 13/178 (7%)

Query: 878 AQAAQSAVETLQAPLDSLREEQLRLAEALEHLQQQRQRQQDEFQRLQADWQAWRERQDNL 937
Q ++E + P L +E + E + + +++F Q ++ Q L
Sbjct: 153 YQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQN-----QKYQKEL 207

Query: 938 DDSRLDALLGLSEEQATQWREQLQRLQEEITRQQTLEAER---QAQLLQHRRQRPETDRE 994
+ + A + ++ + + + +L ++ + +L+ + E E
Sbjct: 208 NLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNE 267

Query: 995 -----ALEDNLRQQRERLAASEQAYLETYSQLQADNQRREQSQALLAELERARAEFRR 1047
+ + + + Q + + D R+ L LE A+ E R+
Sbjct: 268 LRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQ 325



Score = 36.7 bits (85), Expect = 5e-04
Identities = 25/164 (15%), Positives = 63/164 (38%), Gaps = 11/164 (6%)

Query: 881 AQSAVETLQAPLDSLREEQLRLAEALEHLQQQRQRQQDEFQRLQADWQAWRERQDNLDDS 940
A++ Q+ L R EQ R R + ++ L+ + + + +
Sbjct: 132 AEADTLKTQSSLLQARLEQTRYQILS------RSIELNKLPELKLPDEPYFQNVSEEEVL 185

Query: 941 RLDALLGLSEEQATQWREQLQRLQEEITRQQTLEAERQAQLLQHRRQRPETDREALEDNL 1000
RL +L+ +EQ + W+ Q + + + +++ A++ ++ ++ L+D
Sbjct: 186 RLTSLI---KEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLS-RVEKSRLDD-F 240

Query: 1001 RQQRERLAASEQAYLETYSQLQADNQRREQSQALLAELERARAE 1044
+ A ++ A LE ++ ++ L ++E
Sbjct: 241 SSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILS 284



Score = 36.7 bits (85), Expect = 6e-04
Identities = 33/210 (15%), Positives = 63/210 (30%), Gaps = 13/210 (6%)

Query: 716 QRDFDRLIARQRQTQQLQQELRAAETILQQRQQALTEQRQRYEHLQQQVEEDSQQLRPLL 775
+ D L + Q ++ R L E + E Q V E+ L
Sbjct: 131 GAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSL 190

Query: 776 SDEHWQRWQADPLRTFQALGESIEQRRQQQARLQQIEQRLQELKQRCDESSWQLKQSDEQ 835
E + WQ + L + +R AR+ + E + K R D+ +
Sbjct: 191 IKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDD----FSSLLHK 246

Query: 836 RNEARQAEERAQAELAELNGRLGAHLGQHACAQDWQLSLEHAAQAAQSAVETLQAPLDSL 895
+ A+ A + + E L + Q + ++ + E
Sbjct: 247 QAIAKHAVLEQENKYVEAVNELRVYKSQLEQIE---------SEILSAKEEYQLVTQLFK 297

Query: 896 REEQLRLAEALEHLQQQRQRQQDEFQRLQA 925
E +L + +++ +R QA
Sbjct: 298 NEILDKLRQTTDNIGLLTLELAKNEERQQA 327



Score = 35.2 bits (81), Expect = 0.001
Identities = 19/166 (11%), Positives = 56/166 (33%), Gaps = 33/166 (19%)

Query: 656 RERLNQSRQEQQELTGQLAALDRQLDQWTLPEELRLLQPSAQLEWLAQRLDDLAGQRQQC 715
+E+ + + ++ + L + T+ + + RLDD + +
Sbjct: 192 KEQFSTWQNQKYQKELNLDKKRAERL--TVLARINRYE--NLSRVEKSRLDDFSSLLHKQ 247

Query: 716 QRDFDRLIARQRQTQQLQQELRAAETILQQRQQALTEQRQRYEHLQQQVEEDSQQLRPLL 775
++ ++ + + ELR ++ L+Q + + ++ Y+ + Q +
Sbjct: 248 AIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKN--------- 298

Query: 776 SDEHWQRWQADPLRTFQALGESIEQRRQQQARLQQIEQRLQELKQR 821
E +++ RQ + + L + ++R
Sbjct: 299 --------------------EILDKLRQTTDNIGLLTLELAKNEER 324



Score = 32.9 bits (75), Expect = 0.008
Identities = 27/214 (12%), Positives = 57/214 (26%), Gaps = 10/214 (4%)

Query: 253 QALQRLEGQQQWFTEEQRLLQSCEHAQGQLAEARQAWDALATERETLQWLERLAPVRGLI 312
L +L E L Q +L + R + + E L L+
Sbjct: 122 DVLLKLTALG---AEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQN 178

Query: 313 ERLKQLEQELRHSEQQQRQRTEQQAAGTERLQGLQARLQEARERQAQADNHLRQAQAPLR 372
+++ + ++Q Q+ L +A R N +
Sbjct: 179 VSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARI----NRYENLSRVEK 234

Query: 373 EAFQLESEARRLERTLAERQELHRQSNQRHAQQSDAARQL-DMEQQRHVAEQAQLQAALR 431
+L+ + L + + + Q N+ ++ +EQ A+ + L
Sbjct: 235 S--RLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLV 292

Query: 432 DSQALAALGDAWVTHQGQLATFVQRRQRALESQA 465
+ D + + E Q
Sbjct: 293 TQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQ 326



Score = 30.6 bits (69), Expect = 0.044
Identities = 30/210 (14%), Positives = 59/210 (28%), Gaps = 14/210 (6%)

Query: 120 ADGALQKSQQSLQDLETQQMLAANKKSEFREQLEQKL-------GLNFAQFTRAVLLAQS 172
AD +S LE + ++ E + E KL ++ + R L +
Sbjct: 134 ADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKE 193

Query: 173 EFSAFLKASDNDRGALLEKLTDTGLYSQLSKAAYQRASQADEQRKQLEQ-RLEGSLPL-- 229
+FS + L +K + + + + ++
Sbjct: 194 QFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHA 253

Query: 230 ---AEQARAGLEAALESHAQARLQEQQALQRLEGQQQWFTEEQRLLQSCEHAQGQLAEAR 286
E L + Q + + + + Q T+ + + Q
Sbjct: 254 VLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTD-NIG 312

Query: 287 QAWDALATERETLQWLERLAPVRGLIERLK 316
LA E Q APV +++LK
Sbjct: 313 LLTLELAKNEERQQASVIRAPVSVKVQQLK 342


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4279PF033091123e-32 Bvg accessory factor
		>PF03309#Bvg accessory factor

Length = 271

Score = 112 bits (283), Expect = 3e-32
Identities = 58/260 (22%), Positives = 98/260 (37%), Gaps = 23/260 (8%)

Query: 1 MILELDCGNSLIKWRVIEGAA---------RSVAGGLAESDD--ALVEQLTSQQALPVRA 49
M+L +D N+ +I G+ R +D+ ++ L A +
Sbjct: 1 MLLAIDVRNTHTVVGLISGSGDHAKVVQQWRIRTEPEVTADELALTIDGLIGDDAERLTG 60

Query: 50 CRLVSVRSEQETSQLVARLEQLFPVSALVASSGKQLAGVRNGYLDYQRLGLDRWLALVAA 109
+S V LEQ +P V G+ + + +G DR + +AA
Sbjct: 61 ASGLSTVPSVLHEVRVM-LEQYWPNVPHVLIEPGVRTGIPLLVDNPKEVGADRIVNCLAA 119

Query: 110 HHLAKKACLVIDLGTAVTSDLVAADGVHLGGYICPGMTLMRSQLRTHTRRI-RYDDAEAR 168
+H A +V+D G+++ D+V+A G LGG I PG+ + + + R + R
Sbjct: 120 YHKYGTAAIVVDFGSSICVDVVSAKGEFLGGAIAPGVQVSSDAAAARSAALRRVELTRPR 179

Query: 169 RALASLQPGQATAEAVERGCLL----MLRGFVREQYAMACELLGPDCEIFLTGGDAELVR 224
+ G+ T E ++ G + ++ G V G D + TG A LV
Sbjct: 180 SVI-----GKNTVECMQAGAVFGFAGLVDGLVNRIRDDVDGFSGADVAVVATGHTAPLVL 234

Query: 225 DELAGA-RIMPDLVFVGLAL 243
+L L GL L
Sbjct: 235 PDLRTVEHYDRHLTLDGLRL 254


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4277TCRTETOQM772e-17 Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 77.2 bits (190), Expect = 2e-17
Identities = 51/152 (33%), Positives = 73/152 (48%), Gaps = 13/152 (8%)

Query: 13 VNVGTIGHVDHGKTTLTAAL------TKVCSDTWGGSARAFDQIDNAPEEKARGITINTS 66
+N+G + HVD GKTTLT +L G+ R DN E+ RGITI T
Sbjct: 4 INIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRT----DNTLLERQRGITIQTG 59

Query: 67 HVEYDSAVRHYAHVDCPGHADYVKNMITGAAQMDGAILVCSAADGPMPQTREHILLSRQV 126
+ +D PGH D++ + + +DGAIL+ SA DG QTR R++
Sbjct: 60 ITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKM 119

Query: 127 GVPYIVVFLNKADMVDDAELLELVEMEVRDLL 158
G+P I F+NK D L V ++++ L
Sbjct: 120 GIPTI-FFINKIDQN--GIDLSTVYQDIKEKL 148


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4276SECETRNLCASE1302e-42 Bacterial translocase SecE signature.
		>SECETRNLCASE#Bacterial translocase SecE signature.

Length = 127

Score = 130 bits (327), Expect = 2e-42
Identities = 56/121 (46%), Positives = 80/121 (66%)

Query: 2 NAKAEAKESRFDLLKWLLVAVLVVVAVVGNQYFSAQPILYRVLGILVLAVIAAFLALQTA 61
N +A+ + +KW++V L++VA+VGN + + R L +++L A +AL T
Sbjct: 4 NTEAQGSGRGLEAMKWVVVVALLLVAIVGNYLYRDIMLPLRALAVVILIAAAGGVALLTT 63

Query: 62 KGQAFFSLAKEARVEIRKVVWPSRQETTQTTLIVVAVVLVMALLLWGLDSLLGWLVSMIV 121
KG+A + A+EAR E+RKV+WP+RQET TTLIV AV VM+L+LWGLD +L LVS I
Sbjct: 64 KGKATVAFAREARTEVRKVIWPTRQETLHTTLIVAAVTAVMSLILWGLDGILVRLVSFIT 123

Query: 122 G 122
G
Sbjct: 124 G 124


86PA4213PA4206N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA4213191.137015phenazine biosynthesis protein PhzD
PA4212-190.774700phenazine biosynthesis protein PhzC
PA4211-170.295312phenazine biosynthesis protein
PA4210-281.246014phenazine biosynthesis protein
PA4209-281.719543phenazine-specific methyltransferase
PA4208-1112.362523hypothetical protein
PA4207-1102.137810resistance-nodulation-cell division (RND) efflux
PA4206-293.120789resistance-nodulation-cell division (RND) efflux
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4213ISCHRISMTASE351e-125 Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 351 bits (901), Expect = e-125
Identities = 102/207 (49%), Positives = 136/207 (65%), Gaps = 2/207 (0%)

Query: 3 GIPEITAYPLPTAQQLPANLARWSLEPRRAVLLVHDMQRYFLRPLPESLRA--GLVANAA 60
IP I Y +PTA +P N W +P RAVLL+HDMQ YF+ L AN
Sbjct: 2 AIPAIQPYQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANIR 61

Query: 61 RLRRWCVEQGVQIAYTAQPGSMTEEQRGLLKDFWGPGMRASPADREVVEELAPGPDDWLL 120
+L+ CV+ G+ + YTAQPGS + R LL DFWGPG+ + P + +++ ELAP DD +L
Sbjct: 62 KLKNQCVQLGIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLVL 121

Query: 121 TKWRYSAFFHSDLLQRMRAAGRDQLVLCGVYAHVGVLISTVDAYSNDIQPFLVADAIADF 180
TKWRYSAF ++LL+ MR GRDQL++ G+YAH+G L++ +A+ DI+ F V DA+ADF
Sbjct: 122 TKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVADF 181

Query: 181 SEAHHRMALEYAASRCAMVVTTDEVLE 207
S H+MALEYAA RCA V TD +L+
Sbjct: 182 SLEKHQMALEYAAGRCAFTVMTDSLLD 208


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4208RTXTOXIND290.032 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 29.4 bits (66), Expect = 0.032
Identities = 18/120 (15%), Positives = 35/120 (29%), Gaps = 4/120 (3%)

Query: 334 LGSASRAFEL--APSVSWPAF-RLGNVRARLRAVEAQ-SDAALARYQRSLLLAQEDVGNA 389
SR+ EL P + P NV + +Q + ++
Sbjct: 153 YQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKK 212

Query: 390 LNQLAEHQRRLVALFQSATHGANALEIANERYRAGAGSYLAVLENQRALYQIREELAQAE 449
+ R+ + + L+ + A + AVLE + + EL +
Sbjct: 213 RAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYK 272


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4207ACRIFLAVINRP8020.0 Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 802 bits (2072), Expect = 0.0
Identities = 316/1029 (30%), Positives = 530/1029 (51%), Gaps = 29/1029 (2%)

Query: 5 DLFVRRPVLALVVSTLILLLGLFSLGKLPIRQYPLLESSTITVTTEYPGASADLMQGFVT 64
+ F+RRP+ A V++ ++++ G ++ +LP+ QYP + ++V+ YPGA A +Q VT
Sbjct: 3 NFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVT 62

Query: 65 QPIAQAVSSVEGIDYLSSTSVQ-GRSVVTIRMLLNRDSTQAMTETMAKVNSVRYKLPERA 123
Q I Q ++ ++ + Y+SSTS G +T+ D A + K+ LP+
Sbjct: 63 QVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEV 122

Query: 124 YDSVIERSSGETTAVAYVGFSS--KTLPIPALTDYLSRVVEPMFSSIDGVAKVQTFGGQR 181
I ++ + GF S ++DY++ V+ S ++GV VQ FG Q
Sbjct: 123 QQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY 182

Query: 182 LAMRLWLDADRLAGRGLTASDVAEAIRRNNYQAAPG------MVKGQYVLSNVRVNTDLT 235
AMR+WLDAD L LT DV ++ N Q A G + GQ + +++ T
Sbjct: 183 -AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241

Query: 236 NVDDFREMVIRNDGNG-LVRLRDVGTVELGAAATETSALMDGDPAVHLGLFPTPTGNPLV 294
N ++F ++ +R + +G +VRL+DV VELG A ++G PA LG+ N L
Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301

Query: 295 IVDGIRKLLPEIQKTLPPDVRVDLAYETSRFIQASIDEVVRTLVEALLIVVLVIYLCLGS 354
I+ L E+Q P ++V Y+T+ F+Q SI EVV+TL EA+++V LV+YL L +
Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361

Query: 355 LRSVLIPVATIPLSMLGAAALMLAFGFSVNLLTLLAMVLAIGLVVDDAIVVVENVHRHIE 414
+R+ LIP +P+ +LG A++ AFG+S+N LT+ MVLAIGL+VDDAIVVVENV R +
Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421

Query: 415 EGKS-PVAAALIGAREVAGPVIAMTITLAAVYTPIGLMGGLTGALFREFALTLAGAVIVS 473
E K P A ++ G ++ + + L+AV+ P+ GG TGA++R+F++T+ A+ +S
Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481

Query: 474 GVVALTLSPVMSSLLLQA-----HQNEGRMGRAAEWFFGGLTRRYGQVLEFSLGHRWLTG 528
+VAL L+P + + LL+ H+N+G F Y + LG
Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541

Query: 529 GLALLVCISLPLLYSMPKRELAPTEDQAAVLTAIKAPQHANLDYVELFARKLDQVYTSIP 588
+ L+ + +L+ P EDQ LT I+ P A + + ++ Y
Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601

Query: 589 E------TVSTWIINGTDGPAASFGGINLAAWEKRERD---ASAIQSELQGKVGDVEGSS 639
+ A ++L WE+R D A A+ + ++G +
Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661

Query: 640 IFAFQLAA--LPGSTGGLPVQMVLRSPQDYPVLYRTMEEIKQKARQSGLFVV-VDSDLDY 696
+ F + A G+ G +++ ++ + L + ++ A Q +V V +
Sbjct: 662 VIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLE 721

Query: 697 NNPVVQVRIDRAKANSLGIRMQDIGESLAVLVGENYVNRFGMEGRSYDVIPQSLRDQRFT 756
+ ++ +D+ KA +LG+ + DI ++++ +G YVN F GR + Q+ R
Sbjct: 722 DTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRML 781

Query: 757 PQALARQFVRTQDGNLVPLSTVVRVALQVEPNKLIQFDQQNAATLQAIPAPGVSMGQAVA 816
P+ + + +VR+ +G +VP S +L +++ + +Q APG S G A+A
Sbjct: 782 PEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMA 841

Query: 817 FLDDVARGLPAGFSHDWQSDSRQYTQEGNTLVFAFLAALVVIYLVLAAQYESLADPLIIL 876
++++A LPAG +DW S Q GN + VV++L LAA YES + P+ ++
Sbjct: 842 LMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVM 901

Query: 877 ITVPLSICGALLPLALGYATMNIYTQIGLVTLIGLISKHGILMVEFANELQLHERLDRRA 936
+ VPL I G LL L ++Y +GL+T IGL +K+ IL+VEFA +L E
Sbjct: 902 LVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVE 961

Query: 937 AILRAAQIRLRPVLMTTAAMVFGLVPLLFASGAGAASRFGLGVVIVSGMLVGTLFTLFVL 996
A L A ++RLRP+LMT+ A + G++PL ++GAG+ ++ +G+ ++ GM+ TL +F +
Sbjct: 962 ATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFV 1021

Query: 997 PTVYTLLAR 1005
P + ++ R
Sbjct: 1022 PVFFVVIRR 1030



Score = 92.6 bits (230), Expect = 4e-21
Identities = 69/327 (21%), Positives = 135/327 (41%), Gaps = 13/327 (3%)

Query: 701 VQVRIDRAKANSLGIRMQDIGESLAV----LVGENYVNRFGMEGRSYDVIPQSLRDQRFT 756
+++ +D N + D+ L V + + G+ + + + +
Sbjct: 184 MRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIA-QTRFKN 242

Query: 757 PQALARQFVRT-QDGNLVPLSTVVRVALQVEP-NKLIQFDQQNAATLQAIPAPGVSMGQA 814
P+ + +R DG++V L V RV L E N + + + + AA L A G +
Sbjct: 243 PEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDT 302

Query: 815 V----AFLDDVARGLPAGFSHDWQSDSRQYTQEG-NTLVFAFLAALVVIYLVLAAQYESL 869
A L ++ P G + D+ + Q + +V A+++++LV+ +++
Sbjct: 303 AKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNM 362

Query: 870 ADPLIILITVPLSICGALLPLALGYATMNIYTQIGLVTLIGLISKHGILMVEFANELQLH 929
LI I VP+ + G LA ++N T G+V IGL+ I++VE + +
Sbjct: 363 RATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMME 422

Query: 930 ERLDRRAAILRAAQIRLRPVLMTTAAMVFGLVPLLFASGAGAASRFGLGVVIVSGMLVGT 989
++L + A ++ ++ + +P+ F G+ A + IVS M +
Sbjct: 423 DKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSV 482

Query: 990 LFTLFVLPTV-YTLLARNHAEVDKSPR 1015
L L + P + TLL AE ++
Sbjct: 483 LVALILTPALCATLLKPVSAEHHENKG 509


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4206RTXTOXIND461e-07 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 46.4 bits (110), Expect = 1e-07
Identities = 18/106 (16%), Positives = 43/106 (40%), Gaps = 2/106 (1%)

Query: 65 AGRQVQVAAEAAGRITRIAFESGQQVQQGQLLVQLNDAVEQAELIRLKAQLRNAEILHAR 124
+GR ++ + I + G+ V++G +L++L +A+ ++ ++ L A + +
Sbjct: 93 SGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARL--EQ 150

Query: 125 ARKLVERNVASQEQLDNAVAARDMALGAVRQTQALIDQKAIRAPFS 170
R + +L + V + + L I+ FS
Sbjct: 151 TRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFS 196



Score = 40.2 bits (94), Expect = 9e-06
Identities = 25/134 (18%), Positives = 60/134 (44%), Gaps = 6/134 (4%)

Query: 102 AVEQAELIRLKAQLRNAEILHARARKLVERNVASQ-EQLDNAVAARDMALGAVRQTQALI 160
V +++L ++++++ +A+ + +L + + + Q + + + L + Q
Sbjct: 269 RVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQAS 328

Query: 161 DQKAIRAPFSGQLGIRRVH-LGQYLGVAEPVASLV-DARTLKSNFSLDESTSPELKLGQP 218
IRAP S ++ +VH G + AE + +V + TL+ + + +GQ
Sbjct: 329 V---IRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQN 385

Query: 219 LEVLVDAYPGRSFP 232
+ V+A+P +
Sbjct: 386 AIIKVEAFPYTRYG 399


87PA4166PA4159N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA41660114.096075acetyltransferase
PA41650105.193538transcriptional regulator
PA4164094.882827hypothetical protein
PA4163184.894122amidase
PA41622104.905111short-chain dehydrogenase
PA4161-2113.295108ferric enterobactin transporter FepG
PA4160-2112.782094ferric enterobactin transporter FepD
PA4159-1111.936628iron-enterobactin transporter periplasmic
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4166SACTRNSFRASE391e-06 Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 39.2 bits (91), Expect = 1e-06
Identities = 18/61 (29%), Positives = 26/61 (42%), Gaps = 2/61 (3%)

Query: 76 RSTWAAQDVCYLEDLYVSPDVRGQQIGKQLIEYVRRQAEERRCARLYWHTQESNHRAQRL 135
RS W +ED+ V+ D R + +G L+ A+E L TQ+ N A
Sbjct: 83 RSNW--NGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHF 140

Query: 136 Y 136
Y
Sbjct: 141 Y 141


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4162DHBDHDRGNASE1196e-35 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 119 bits (299), Expect = 6e-35
Identities = 75/258 (29%), Positives = 117/258 (45%), Gaps = 32/258 (12%)

Query: 5 RTALVTGATRGIGLALARRLAASGWSVVGI-----------------ARHASDDFPGRLL 47
+ A +TGA +GIG A+AR LA+ G + + ARHA + FP
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHA-EAFP---- 63

Query: 48 CCDLADPAQTAETLRGLLSESA-VDALVNNAGIALPQSLENLDLAALQQVFDLNVRVAVQ 106
D+ D A E + E +D LVN AG+ P + +L + F +N
Sbjct: 64 -ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 107 LAQACLPGLKRSPAGRIVNLCSRAIHGAR-ERTAYAAAKSALVGVTRTWALELAPLGITV 165
+++ + +G IV + S R AYA++K+A V T+ LELA I
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 166 NAVAPGPIETELFRQTRPVGGEEERRILST-------IPMQRLGRPDEVAALIEFLLSEG 218
N V+PG ET++ E+ I + IP+++L +P ++A + FL+S
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 219 ASFVTGQVIGVDGGGSLG 236
A +T + VDGG +LG
Sbjct: 243 AGHITMHNLCVDGGATLG 260


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4160PF04335300.010 VirB8 type IV secretion protein
		>PF04335#VirB8 type IV secretion protein

Length = 227

Score = 30.2 bits (68), Expect = 0.010
Identities = 16/68 (23%), Positives = 25/68 (36%), Gaps = 9/68 (13%)

Query: 7 RRRRLRAWGLLAGALLLALA---ALASLALGSRPVPLAVTLDALQAVDPHDDRHLVVREL 63
R + AW + A LA A A+A+L P +T VD + + +L
Sbjct: 29 ERSKKLAWVVAGVAGALATAGVVAVAALTPLKTVEPYVIT------VDRNTGEASIAAKL 82

Query: 64 RLPRTLVA 71
T+
Sbjct: 83 HGDATITY 90


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4159FERRIBNDNGPP375e-05 Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 37.2 bits (86), Expect = 5e-05
Identities = 53/289 (18%), Positives = 97/289 (33%), Gaps = 28/289 (9%)

Query: 2 PTRRRSALPLLALALSLFAI-LAAAGEPKPARIVSTTPSVTGILLAMDAPLVASAATTPS 60
RR L +AL+ L+ + A A P RIV+ +LLA+ A T
Sbjct: 6 LISRRRLLTAMALSPLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGIVPYGVADTINY 65

Query: 61 RLTDAKGFFSQWAKVADQRGVEVLYRNLRFD--IEAVIAQDPDLLVASA---TGADSAAP 115
RL + S+ V+ LR + +E + P +V SA + A
Sbjct: 66 RL-----WVSEPPLPDS-----VIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEMLAR 115

Query: 116 Y-RAELEAQGVPTLVVDYSKHSWQELATELGRHTGLERQAQAAIQRFDAYTAEVAA-AIA 173
+ ++ S E+A L + A+ + +++ + + +
Sbjct: 116 IAPGRGFNFSDGKQPLAMARKSLTEMADLLNL----QSAAETHLAQYEDFIRSMKPRFVK 171

Query: 174 PPQGPVSVVGYNIAGSYSIGRQASPQARLLEALGFQVAELPEALAGKVTRASDFQFISRE 233
P+ + + S +L+ G +P A G+ +S +
Sbjct: 172 RGARPLLLTTLIDPRHMLVFGPNSLFQEILDEYG-----IPNAWQGETNFWG-STAVSID 225

Query: 234 NLPAAIAGDSVFLLGASDDDVQAFLADPVLANLSAVREKRVYALGPSSF 282
L A D + + D+ A +A P+ + VR R + F
Sbjct: 226 RLAAYKDVDVLCFDHDNSKDMDALMATPLWQAMPFVRAGRFQRVPAVWF 274


88PA4148PA4142N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA4148-182.640354short-chain dehydrogenase
PA41470102.361639transcriptional regulator AcoR
PA4145a0111.637716hypothetical protein
PA4145-1121.659886transcriptional regulator
PA4144-1131.365157hypothetical protein
PA4143-1100.830148toxin transporter
PA4142-1100.600961secretion protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4148DHBDHDRGNASE1272e-37 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 127 bits (319), Expect = 2e-37
Identities = 75/262 (28%), Positives = 126/262 (48%), Gaps = 14/262 (5%)

Query: 11 LSSRVALVTGAGRGIGRGIALALARAGADVAVADLDPQVAEETAAAIRSLGRRSLALGVD 70
+ ++A +TGA +GIG +A LA GA +A D +P+ E+ +++++ R + A D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 71 VSDGDSVRAMVERVATEFGRLDVAVNNAGVISIRKVAELSLADWDRVMNVNARGVFLCCQ 130
V D ++ + R+ E G +D+ VN AGV+ + LS +W+ +VN+ GVF +
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 131 AELPLMQAQRWGRIVNLSSIAGKVGLPDLAHYCASKFAVIGFSNALAKEVARDGVTVNAL 190
+ M +R G IV + S V +A Y +SK A + F+ L E+A + N +
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 191 CPGIVGTGM----WRGEDGLSGRWRQAGESEAQSWERHQASLLPQGEAQTVEDMGQLVVY 246
PG T M W E+G + + E+ +P + D+ V++
Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTG--------IPLKKLAKPSDIADAVLF 237

Query: 247 LAC--APHVTGQAIAVDGGFSL 266
L A H+T + VDGG +L
Sbjct: 238 LVSGQAGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4147HTHFIS339e-112 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 339 bits (870), Expect = e-112
Identities = 134/390 (34%), Positives = 192/390 (49%), Gaps = 59/390 (15%)

Query: 273 FDLDALHAAADQAPCLLRGQAGELHVRLSAPRAKARRLEREVPDDAAL---DPRIAESLR 329
FDL L +A L+ P+ + +LE + D L + E R
Sbjct: 106 FDLTELIGIIGRA--------------LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYR 151

Query: 330 LAVRVKDRNLPVLIQGETGAGKEVFARQLHQASARRDKPFVALNCAAIPESLIESELFGY 389
+ R+ +L ++I GE+G GKE+ AR LH RR+ PFVA+N AAIP LIESELFG+
Sbjct: 152 VLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGH 211

Query: 390 VGGAFTGAAAKGMRGLLQQADGGTLFLDEIGDMPLGLQTRLLRVLAEGEVAPLGAARRQA 449
GAFTGA + G +QA+GGTLFLDEIGDMP+ QTRLLRVL +GE +G
Sbjct: 212 EKGAFTGAQTRST-GRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIR 270

Query: 450 VDIQVICATHRDLAALVAAGGFREDLYFRLGGARFELPPLRERSDRLALIRRILDEETAH 509
D++++ AT++DL + G FREDLY+RL LPPLR+R++ + + R ++
Sbjct: 271 SDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEK 330

Query: 510 CGVRI-ELGEAALECLLGYRWPGNVRQLRHVLRYACALCGGATLQLADLPAELRGERRTP 568
G+ + + ALE + + WPGNVR+L +++R AL + + ELR E P
Sbjct: 331 EGLDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSE--IP 388

Query: 569 ASACESGGGP--------------------------------------ERDALLDALVRH 590
S E E +L AL
Sbjct: 389 DSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTAT 448

Query: 591 RWKPMAAARELGISRATLYRRVRRHGIRMP 620
R + AA LG++R TL +++R G+ +
Sbjct: 449 RGNQIKAADLLGLNRNTLRKKIRELGVSVY 478


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4145aFLGHOOKFLIK290.025 Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 28.6 bits (63), Expect = 0.025
Identities = 19/71 (26%), Positives = 29/71 (40%), Gaps = 3/71 (4%)

Query: 1 MPVGFPCSVPGSGATPSVAPRQEP---TEETHARPAPASPQSDRGVRPMPLALLLAMGAF 57
M GF + + A +V P ++P T+ T + A P G PL L+A
Sbjct: 137 MLPGFDNTPKVTDAPSTVLPTEKPTLFTKLTSEQLTTAQPDDAPGTPAQPLTPLVAEAQS 196

Query: 58 SLSLSISPGPV 68
+ +P PV
Sbjct: 197 KAEVISTPSPV 207


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4144GPOSANCHOR300.015 Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 30.4 bits (68), Expect = 0.015
Identities = 41/184 (22%), Positives = 65/184 (35%), Gaps = 5/184 (2%)

Query: 144 SAALRNAQQLLLAANASQDATLQNTFALAAQAYYDALAAQRSLAASRQVAELAAQNLEAA 203
+A + AA A++ A L+ A A ++L A + E LE A
Sbjct: 209 DSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKA 268

Query: 204 DAKY---RAGAAALSDRLQAQTALSQASLAQVRDEGALSNALGVIALRMGLAPDTPLRLS 260
+A L+A+ A +A A + + + NA +LR L +
Sbjct: 269 LEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNA-NRQSLRRDLDASREAKKQ 327

Query: 261 GELEAQPDTGFVKAIDEMLAEARREHPALLAAQARLKAAAASVEESRAAGRPSLA-LSAN 319
E E Q K + RR+ A A+ +L+A +EE S L +
Sbjct: 328 LEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRD 387

Query: 320 LARS 323
L S
Sbjct: 388 LDAS 391


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4142RTXTOXIND1565e-45 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 156 bits (396), Expect = 5e-45
Identities = 79/431 (18%), Positives = 175/431 (40%), Gaps = 55/431 (12%)

Query: 22 RPVSFTFLTLLAAAMALLVVGF--FLFGSYTKRSTVSGQLVPASGQVKVHAPQAGIVLRK 79
PVS + M LV+ F + G +T +G+L + ++ + IV
Sbjct: 51 TPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEI 110

Query: 80 FVQEGQAVRRGERLMVLSSERYGSDAGPVQAG--ISRRLEQRRDSLRDELEKLRRLQDD- 136
V+EG++VR+G+ L+ L++ +D Q+ +R + R L +E + +
Sbjct: 111 IVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKL 170

Query: 137 ------------------------------ERDSLTSKVASLQRELTTLAAQTDSQQRLL 166
++ + + E T+ A+ + + L
Sbjct: 171 PDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLS 230

Query: 167 ALASDAAARYQGLMDKGYISMDQLQQRQAELLGQRQTLQGLERERTSLRQQLTERRNELA 226
+ + L+ K I+ + +++ + + L+ + + + ++ + E
Sbjct: 231 RVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQ 290

Query: 227 GLSAR----QANQLAETRRQLSAVEQDLAESEAKRTLL-VTAPESGIATAVLAEA-GQTV 280
++ ++L +T + + +LA++E ++ + AP S + G V
Sbjct: 291 LVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVV 350

Query: 281 DSSRPLLSIVPADTPLQAELYAPSKSIGFIRPGDAVLIRYQAYPYQKFGQYHGKVQSISR 340
++ L+ IVP D L+ +K IGFI G +I+ +A+PY ++G GKV++I+
Sbjct: 351 TTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINL 410

Query: 341 ASVSYAELSSMVGGVPGLGQDGEQLYRLRVTLDDQAVTAYGQPRPLQSGMLLDADILQDT 400
++ Q ++ + +++++ ++ + PL SGM + A+I
Sbjct: 411 DAI--------------EDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGM 456

Query: 401 RRLYEWVLEPL 411
R + ++L PL
Sbjct: 457 RSVISYLLSPL 467


89PA4084PA4078N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA4084019-0.171325usher CupB3
PA40830140.256542chaperone CupB4
PA4082014-0.167937adhesive protein CupB5
PA4081014-0.538678fimbrial subunit CupB6
PA40800130.278887response regulator
PA4079-1101.787676short-chain dehydrogenase
PA4078-281.876689nonribosomal peptide synthetase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4084PF005777400.0 Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 740 bits (1911), Expect = 0.0
Identities = 268/868 (30%), Positives = 412/868 (47%), Gaps = 55/868 (6%)

Query: 1 MAVASPAGGLDAPSRRIVFDAQMLALGPGGRSIDTSRFERGDVIEPGRYRLDLLLNSRWR 60
+ A S + F+ + LA D SRFE G + PG YR+D+ LN+ +
Sbjct: 31 FVACAFAAQAPLSSAELYFNPRFLA-DDPQAVADLSRFENGQELPPGTYRVDIYLNNGYM 89

Query: 61 GVEEVELRRQPGRESAVFCYDRGLLERAGIDLEKSARGQDRSSARDPLPEGLHCDPLERY 120
+V + V C R L G++ S + L C PL
Sbjct: 90 ATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTA--------SVSGMNLLADDACVPLTSM 141

Query: 121 VPGARVKLDIAEQSIYVSVPSYYLSLDSSKTYVDPASWDSGISAALLNYNSNL-HVRENH 179
+ A +LD+ +Q + +++P ++S + ++ Y+ P WD GI+A LLNYN + V+
Sbjct: 142 IHDATAQLDVGQQRLNLTIPQAFMS-NRARGYIPPELWDPGINAGLLNYNFSGNSVQNRI 200

Query: 180 GRSATSGYAGMNAGFNFGRARLRHNGTATWSRRMGS-----HYQRSATYVQTDLPAWRAQ 234
G ++ Y + +G N G RLR N T +++ S +Q T+++ D+ R++
Sbjct: 201 GGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSR 260

Query: 235 LLLGENSTSSEFFDAVSFRGVQLSSDDRMLPDSLRYYAPVVRGTASTNARVSVYQRGYLI 294
L LG+ T + FD ++FRG QL+SDD MLPDS R +APV+ G A A+V++ Q GY I
Sbjct: 261 LTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDI 320

Query: 295 YETTVAPGAFALDELQTASYGGDLEVRVTEASGEVRSFIVPFATTVQLLRPGTTRYSLTA 354
Y +TV PG F ++++ A GDL+V + EA G + F VP+++ L R G TRYS+TA
Sbjct: 321 YNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITA 380

Query: 355 GRL-NDPSLERRPNMLQGVYQRGLGNDVTAYAGGAFTGSYMSGLMGAALNT-PVGGFSGD 412
G + + + +P Q GL T Y G Y + G N +G S D
Sbjct: 381 GEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVD 440

Query: 413 VTLARTEVPGDDRLSGSSYRLAYSKNLPNTGTNFSLLAYRYSTGGYLGLRDAAFMQDRVE 472
+T A + +P D + G S R Y+K+L +GTN L+ YRYST GY D + +
Sbjct: 441 MTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGY 500

Query: 473 RGEPLE--------------SFSRLRNRLDANISQQLGNGGNLYLNGSSQRYWSGGGRAV 518
E + R +L ++QQLG LYL+GS Q YW
Sbjct: 501 NIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDE 560

Query: 519 NFSVGYSNQWRDVSYSISAQRLRSQYEGFSSGDRRGETSTLFSLNLSIPLGG-------A 571
F G + + D+++++S ++ + + + +LN++IP +
Sbjct: 561 QFQAGLNTAFEDINWTLSYSLTKNAW--------QKGRDQMLALNVNIPFSHWLRSDSKS 612

Query: 572 GRGSPTLSSYLTRDSNSGTQLTSGVSGMLGKRGEASYSLSASHDRDSRQTSKS---ASLD 628
+ S ++ D N +GV G L + SYS+ + S S A+L+
Sbjct: 613 QWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLN 672

Query: 629 YRLPQVELGSSLSQGPGYRQLSVKAAGGLVAHSGGITAAQTLGETIGLVHAPNARGAA-A 687
YR S +QL +GG++AH+ G+T Q L +T+ LV AP A+ A
Sbjct: 673 YRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVE 732

Query: 688 GYSGSRIDRHGYAVIPNLLPYQLNSVDLDPNGMADEIELRSSSRNVAPTAGAVVRLDYPT 747
+G R D GYAV+P Y+ N V LD N +AD ++L ++ NV PT GA+VR ++
Sbjct: 733 NQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKA 792

Query: 748 RVARPLLVDSRMPSGEPLPFAAEVLDAHSGQSVGAVGQGSRLVLRVEQDRGSVRVRWGNE 807
RV LL+ + +PLPF A V S QS G V ++ L G V+V+WG E
Sbjct: 793 RVGIKLLMTLT-HNNKPLPFGAMVTSE-SSQSSGIVADNGQVYLSGMPLAGKVQVKWGEE 850

Query: 808 PQQQCLVDYALGPRETTPPVLQLA--CR 833
C+ +Y L P + QL+ CR
Sbjct: 851 ENAHCVANYQLPPESQQQLLTQLSAECR 878


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4082PF05860791e-19 haemagglutination activity domain.
		>PF05860#haemagglutination activity domain.

Length = 117

Score = 79.5 bits (196), Expect = 1e-19
Identities = 31/108 (28%), Positives = 49/108 (45%), Gaps = 9/108 (8%)

Query: 54 LPSGGTVVGGSANGEIHLSGGNSLSVNQKVDKLIANWDSFSVAAGERVIFNQPSSSSIAL 113
LP + I Q L ++ FSV FN P++ +
Sbjct: 9 LPINSNITTEGNTRII-------ERGTQAGSNLFHSFQEFSVPTSGTAFFNNPTNIQNII 61

Query: 114 NRVIGTKASDIQGRIDANG--QVFLVNPNGVLFGRGAQVNVGGLVAST 159
+RV G S+I G I AN +FL+NPNG++FG+ A++++GG +
Sbjct: 62 SRVTGGSVSNIDGLIRANATANLFLINPNGIIFGQNARLDIGGSFVGS 109


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4080HTHFIS561e-11 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 56.4 bits (136), Expect = 1e-11
Identities = 30/126 (23%), Positives = 54/126 (42%), Gaps = 5/126 (3%)

Query: 5 RIRVMVADDHPAISLGISYELSQCGSLEMLGQVSNSTELIGRLNEGDCDVVIVDYTMPGG 64
++VADD AI ++ LS+ G + SN+ L + GD D+V+ D MP
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAG--YDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 65 KYGDGLALLSLLRRRYPHLQLVVFTMLNNPGLIRAILKQGINCILSKSDSTSHLLAAVSA 124
+ LL +++ P L ++V + N ++G L K + L+ +
Sbjct: 61 ---NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117

Query: 125 AYSRNQ 130
A + +
Sbjct: 118 ALAEPK 123


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4079DHBDHDRGNASE642e-14 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 64.3 bits (156), Expect = 2e-14
Identities = 42/190 (22%), Positives = 72/190 (37%), Gaps = 9/190 (4%)

Query: 3 NVLIVGASRGIGLGLADAFLQRGAQVFAVARRPQGSPGLQALAERAGERLQAVTGDLNQH 62
I GA++GIG +A +GA + AV P+ + + + +A D+
Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69

Query: 63 DCAERIGEMLGER--RIDRLIVNAGIYGPQQQDVAEIDAEQTAQLFLTNAIAPLRLARAL 120
+ I + ID L+ AG+ P + E+ F N+ +R++
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIH--SLSDEEWEATFSVNSTGVFNASRSV 127

Query: 121 SG--RVSRGGVVAFMSSQMASLALGLSATMPLYGASKAALNSLVRSWEGEFEELPFSLLL 178
S R G + + S A + +M Y +SKAA + E E +
Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVP---RTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184

Query: 179 LHPGWVRTEM 188
+ PG T+M
Sbjct: 185 VSPGSTETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA4078NUCEPIMERASE451e-06 Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 44.8 bits (106), Expect = 1e-06
Identities = 49/199 (24%), Positives = 80/199 (40%), Gaps = 29/199 (14%)

Query: 621 ILLTGASGLMGAHLLAELLASREADLHCPVRAQNDAHALERLRQAARQHRIELAETDWRR 680
L+TGA+G +G H+ LL + ND + + +Q R+EL
Sbjct: 3 YLVTGAAGFIGFHVSKRLL--EAGHQVVGIDNLNDYYDVSL-----KQARLELLAQP--G 53

Query: 681 VRAYAADLAEPGFGLPAETYRELAGSVDQVFHSA--SAVNF-IQ-PYSYMKRDNVEGLGQ 736
+ + DLA+ + + G ++VF S AV + ++ P++Y N+ G
Sbjct: 54 FQFHKIDLAD--REGMTDLFAS--GHFERVFISPHRLAVRYSLENPHAYADS-NLTGFLN 108

Query: 737 VLRFCASGRCKPLMLLSSISVYSWGHLHTGKRLMREDDDIDQNLPAVVTDMGYVRSKWVM 796
+L C + + L+ SS SVY K DD +D P + Y +K
Sbjct: 109 ILEGCRHNKIQHLLYASSSSVYGLNR----KMPFSTDDSVDH--PVSL----YAATKKAN 158

Query: 797 EKIADLAAE-RGLPLMTFR 814
E +A + GLP R
Sbjct: 159 ELMAHTYSHLYGLPATGLR 177


90PA3974PA3969aN        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA39740112.016218lost adherence sensor LadS
PA39730112.740643transcriptional regulator
PA39721132.525641acyl-CoA dehydrogenase
PA39711122.259460hypothetical protein
PA39700101.475862AMP nucleosidase
PA3969a-1110.110519hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3974HTHFIS732e-15 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 72.9 bits (179), Expect = 2e-15
Identities = 32/114 (28%), Positives = 50/114 (43%), Gaps = 2/114 (1%)

Query: 669 TVLVVEDNAINQLVTRGMLLKLGYRVRTADNGSEALELLARERPDGVLLDCQMPVMDGFA 728
T+LV +D+A + V L + GY VR N + +A D V+ D MP + F
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 729 TCRAIRALPGCAELPVLALTAHSHSGDRERCLAAGMSDYMAKPVKFEELQTLLH 782
I+ +LPVL ++A + + G DY+ KP EL ++
Sbjct: 65 LLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3973HTHTETR602e-13 TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 60.4 bits (146), Expect = 2e-13
Identities = 33/170 (19%), Positives = 63/170 (37%), Gaps = 8/170 (4%)

Query: 11 QRDSALRERILQLGLRRVVEGGFAALTMQALADDAGIATGSLYRHFRGKGELAAEIFRRA 70
Q R+ IL + LR + G ++ ++ +A AG+ G++Y HF+ K +L +EI+ +
Sbjct: 7 QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELS 66

Query: 71 SQREVDALAVVL-RGPGAPAWRLAEGLRRF--AARAWSSQRLAFALI-----AEPVDPEV 122
+ + PG P L E L + +RL +I V
Sbjct: 67 ESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVV 126

Query: 123 DEQRLRYREAYAALFVELLEEGRRSGAFQLSLVPLAAACLVGAIAEALVG 172
+ + + L+ + L+ AA ++ L+
Sbjct: 127 QQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLME 176


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3970MYCMG045320.007 Hypothetical mycoplasma lipoprotein (MG045) signature.
		>MYCMG045#Hypothetical mycoplasma lipoprotein (MG045) signature.

Length = 483

Score = 31.6 bits (71), Expect = 0.007
Identities = 31/124 (25%), Positives = 50/124 (40%), Gaps = 19/124 (15%)

Query: 130 QDIPYPYVVEQGDELAGSGVTAAELARVFPSTDLSAASDDIADGLYEWERADQLPLALFD 189
Q++ + Y E+ EL V+ ++ + + +R + L D
Sbjct: 149 QNLVFVYRGEKISELEQENVSWTDVIKAI---------------VKHKDRFNDNRLVFID 193

Query: 190 AARVDFSLRRLVHYTGSDWRHVQPWILLTNYHRYV-DQFIRLGLTRLREDPRFVRMVLPG 248
AR FSL +V+ T ++ V P Y V + F RLGLT+ D FV
Sbjct: 194 DARTIFSLANIVN-TNNNSADVNPKEDGIGYFTNVYESFQRLGLTKSNLDSIFVNS--DS 250

Query: 249 NVII 252
N++I
Sbjct: 251 NIVI 254


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3969aSECA411e-06 SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 41.0 bits (96), Expect = 1e-06
Identities = 14/22 (63%), Positives = 16/22 (72%), Gaps = 1/22 (4%)

Query: 162 GRGDQACPCGSGKRYRNCCSRL 183
GR D CPCGSGK+Y+ C RL
Sbjct: 880 GRND-PCPCGSGKKYKQCHGRL 900


91PA3953PA3946N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA3953-1100.841912hypothetical protein
PA3952-112-0.614991hypothetical protein
PA3951012-1.550572hypothetical protein
PA3950113-1.504763ATP-dependent RNA helicase
PA3949214-1.707888hypothetical protein
PA3948216-1.365554two-component response regulator RocA1
PA3947115-0.973979DNA-binding response regulator RocR
PA39461140.006425two-component sensor
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3953ISCHRISMTASE462e-08 Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 45.8 bits (108), Expect = 2e-08
Identities = 47/198 (23%), Positives = 68/198 (34%), Gaps = 33/198 (16%)

Query: 11 SQVALLIVDLQRGMQRHDLPPRNNPGAE--ARIVELLAAWRAAGWPVVHVRHVSRQPGSP 68
++ LLI D+Q +P E A I +L G PVV+ + QPGS
Sbjct: 29 NRAVLLIHDMQNYFVDA-FTAGASPVTELSANIRKLKNQCVQLGIPVVY----TAQPGSQ 83

Query: 69 -----------FAPGQPG----VEFQPALAPRDDEAVFEKNVPDAFINSGLQRWLHVRDI 113
+ PG + LAP DD+ V K AF + L +
Sbjct: 84 NPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEGR 143

Query: 114 RQVALVGVATENSVEASARSAGNLGFQTWVVADACFTFAKPDFHGTPRSADEVHAMALAN 173
Q+ + G+ +A A + + V DA F+ H MAL
Sbjct: 144 DQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVADFSLEK-----------HQMALEY 192

Query: 174 LHGEYAVVLRAAELLQRL 191
G A + LL +L
Sbjct: 193 AAGRCAFTVMTDSLLDQL 210


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3950TONBPROTEIN320.003 Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 32.3 bits (73), Expect = 0.003
Identities = 24/104 (23%), Positives = 35/104 (33%), Gaps = 16/104 (15%)

Query: 352 EVELLAAIETLIGQTLQRREEPDFEPEHRVPQTA----PGGVVLKKPKKPKKPKAAESVG 407
V ++ + Q +Q EP EPE VV++KPK KPK
Sbjct: 46 SVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKK 105

Query: 408 ---------KPGKIHLGSWFDSSAP---TVKAVRKAPGFGAGAA 439
KP + S F+++AP T A +
Sbjct: 106 VQEQPKRDVKPVESRPASPFENTAPARLTSSTATAATSKPVTSV 149


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3948HTHFIS734e-17 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 72.6 bits (178), Expect = 4e-17
Identities = 29/111 (26%), Positives = 51/111 (45%), Gaps = 1/111 (0%)

Query: 3 TVLIVDDHPVIRLAVRVLLEKHGLQVVAETDNGVDAIQLVREHEPDVVILDIGIPKLDGL 62
T+L+ DD IR + L + G V T N + + + D+V+ D+ +P +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 63 TVISRIKSLGLRSQVLVLTSQSAEAFCKRCIQVGARGFVNKEEDLNNLINA 113
++ RIK VLV+++Q+ + + GA ++ K DL LI
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGI 114


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3947HTHFIS531e-09 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 52.9 bits (127), Expect = 1e-09
Identities = 27/140 (19%), Positives = 51/140 (36%), Gaps = 9/140 (6%)

Query: 1 MNDLNVLVLEDEPFQRLVAVTALKKVVPGSILEAADGKEAVAILESCGHVDIAICDLQMS 60
M +LV +D+ R V AL + + ++ + + G D+ + D+ M
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRA-GYDVRITSNAATLWRWIAA-GDGDLVVTDVVMP 58

Query: 61 GMDGLAFLRHASLSGKVHSVILSSEVDPILRQATI-SMIECLGLNFLGDLGKPFSLERIT 119
+ L + V++ S Q T + I+ L KPF L +
Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSA------QNTFMTAIKASEKGAYDYLPKPFDLTELI 112

Query: 120 ALLTRYNARRQDLPRQIEVA 139
++ R A + P ++E
Sbjct: 113 GIIGRALAEPKRRPSKLEDD 132


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3946HTHFIS642e-12 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 64.1 bits (156), Expect = 2e-12
Identities = 31/112 (27%), Positives = 49/112 (43%), Gaps = 5/112 (4%)

Query: 957 RLQVLVVDDHAVNRQILHQQLSFLGHDVEEAENGLSALNLWHGQPFDMVITDCHMPLMSG 1016
+LV DD A R +L+Q LS G+DV N + D+V+TD MP +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 1017 SDLARSIRQEERENGEEPVVIIGLTADAQPEEIERCIQAGMNECLIKPIGLD 1068
DL I+ + + PV+++ +A + + G + L KP L
Sbjct: 63 FDLLPRIK---KARPDLPVLVM--SAQNTFMTAIKASEKGAYDYLPKPFDLT 109


92PA3883PA3876N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA38831132.327333short-chain dehydrogenase
PA38820130.890690hypothetical protein
PA3881-1130.561384hypothetical protein
PA38800150.265195hypothetical protein
PA3879016-0.138623transcriptional regulator NarL
PA3878-1150.027867two-component sensor NarX
PA3877-115-0.383248nitrite extrusion protein 1
PA3876-1140.045579nitrite extrusion protein 2
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3883DHBDHDRGNASE873e-22 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 87.0 bits (215), Expect = 3e-22
Identities = 54/180 (30%), Positives = 81/180 (45%), Gaps = 7/180 (3%)

Query: 5 VAFVTGCSSGIGRALADAFQRAGYRVWA----SARKEDDVRALAEAGFQAVQ--LDVNDA 58
+AF+TG + GIG A+A G + A + E V +L A DV D+
Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69

Query: 59 AALARLAEELGVEAAGLDVLVNNAGYGAMGPLLDGGVEAMRRQFETNVFAVVGVTRALFP 118
AA+ + + E +D+LVN AG G + E F N V +R++
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSK 129

Query: 119 -LLRRKSGLVVNVGSVSGVLVTPFAGAYCASKAAVHALSDALRLELAPFGVEVLEVQPGA 177
++ R+SG +V VGS + AY +SKAA + L LELA + + V PG+
Sbjct: 130 YMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGS 189


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3879HTHFIS801e-19 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.9 bits (197), Expect = 1e-19
Identities = 44/197 (22%), Positives = 76/197 (38%), Gaps = 18/197 (9%)

Query: 13 RLLLVDDHPMMRKGVAQLLELEDDLSVVGEAGSGEEALRLAAELDPDMILLDLNMKGMNG 72
+L+ DD +R + Q L V + R A D D+++ D+ M N
Sbjct: 5 TILVADDDAAIRTVLNQALSRAG-YDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 73 LDTLRALREAGVDARIVVFTVSDDKGDVVNVLRAGADGYLLKDMEPERLLEHIRQAATGQ 132
D L +++A D ++V + + + GA YL K + L+ I +A
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA-- 120

Query: 133 MTLSPQLTQILAQALRGDD---RSKSLDELTERERQILRQIAHGYSNKMIARKLDITE-G 188
+ +++ + G RS ++ E+ ++L ++ MI E G
Sbjct: 121 -EPKRRPSKLEDDSQDGMPLVGRSAAMQEI----YRVLARLMQTDLTLMI-----TGESG 170

Query: 189 TVKVHVKRVLHKLGMRS 205
T K V R LH G R
Sbjct: 171 TGKELVARALHDYGKRR 187


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3878PF06580432e-06 Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 43.3 bits (102), Expect = 2e-06
Identities = 26/111 (23%), Positives = 48/111 (43%), Gaps = 12/111 (10%)

Query: 495 FGERGEVTIELDNRLQHVPLSPNEEIHVLQIVREALSNVVRHSQAQR---AWVRLSSQAD 551
F +R + +++ + V + P ++Q + E N ++H AQ + L D
Sbjct: 236 FEDRLQFENQINPAIMDVQVPP----MLVQTLVE---NGIKHGIAQLPQGGKILLKGTKD 288

Query: 552 -GQVSIAVEDDGVGFDPQQNRSGHYGLTIMQERGQTL-GSQLRFEARAPHG 600
G V++ VE+ G S GL ++ER Q L G++ + + G
Sbjct: 289 NGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQG 339


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3877TCRTETA415e-06 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 41.3 bits (97), Expect = 5e-06
Identities = 61/350 (17%), Positives = 114/350 (32%), Gaps = 30/350 (8%)

Query: 39 ELGLSESQ---FGLMVALPILTGSLVRLPLGLITDRFGGRIVFFIHMLLVAIPIYGLAFA 95
+L S +G+++AL L LG ++DRFG R V + + A+ +A A
Sbjct: 34 DLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATA 93

Query: 96 SQYWHYLVLGLFVGLAGGSFAVGIAYTSAWFEKERQGTAMGIFGAGNAGAAITNLVAPMI 155
W + + G+ G + AV AY + + + + G A + V +
Sbjct: 94 PFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGL 153

Query: 156 VVAFGWRMVPQVYSVAMLVTAVLFWLFTWTDPAHLKGAAEASQRPNLAKQLAPLAELRVW 215
+ F P + A+ L F + + + N + V
Sbjct: 154 MGGFSPHA-PFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVV 212

Query: 216 RFGLYYFFVFG--GFVALALWLPKYYIAEYGLDLKTASFITMLFTLPSGLIRA-LGGWFS 272
+ FF+ G V ALW+ + + D T F + L +A + G +
Sbjct: 213 AALMAVFFIMQLVGQVPAALWV-IFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVA 271

Query: 273 DHYGARS-VNWGVFWVCLVCLFFLSYPQTTMTIHGIQGDLSLGIGLNVWLFTFLVFVVGI 331
G R + G+ + + W+ F + V+
Sbjct: 272 ARLGERRALMLGMIADGTGYILLAFATR-------------------GWMA-FPIMVLLA 311

Query: 332 AQGFGKASVYRIIHDYYPSN-MGTVGGMVGVIGGLGGFCLPILFGYAADH 380
+ G G ++ ++ G + G + + L P+LF
Sbjct: 312 SGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAA 361


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3876TCRTETA300.019 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 30.2 bits (68), Expect = 0.019
Identities = 28/128 (21%), Positives = 47/128 (36%), Gaps = 11/128 (8%)

Query: 52 AVWMIWSTVTVRLNSAGFAFSNDQLFLLAALPSISGATLRVFYSFMVPIFGGRRWTALST 111
A+W+I+ ++ S L L S++ A + + G RR L
Sbjct: 231 ALWVIFGEDRFHWDATTIGIS---LAAFGILHSLAQA---MITGPVAARLGERRALMLGM 284

Query: 112 ASMLIPCIWLGFAVQDPSTPYWVFALIALLCGFGGGNFASSMSNISFFYPKSQQGTALGL 171
+ I L FA + W+ I +L GG + + +S + +QG G
Sbjct: 285 IADGTGYILLAFATR-----GWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGS 339

Query: 172 NAGLGNLG 179
A L +L
Sbjct: 340 LAALTSLT 347


93PA3846PA3838N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA3846-113-0.091179hypothetical protein
PA3845011-0.208751transcriptional regulator
PA3844-19-0.556070hypothetical protein
PA384219-0.729270chaperone
PA384106-0.620006exoenzyme S
PA384008-0.523137SAM-dependent methyltransferase
PA3839113-1.125282sodium:sulfate symporter
PA3838015-1.483069ABC transporter ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3846ISCHRISMTASE431e-07 Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 43.5 bits (102), Expect = 1e-07
Identities = 41/183 (22%), Positives = 64/183 (34%), Gaps = 29/183 (15%)

Query: 3 IRAATSTLLVVDIQERLLPAIDDG----PALVEYSQWLLRVARALDVPVLASEQ------ 52
+ LL+ D+Q + A G L + L L +PV+ + Q
Sbjct: 26 PDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCVQLGIPVVYTAQPGSQNP 85

Query: 53 ---------YSKGL--GPTVAALRDELEPTQ---ILEKLDFSAAADGALL---RAPGGDR 95
+ GL GP + EL P +L K +SA LL R G R
Sbjct: 86 DDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEG--R 143

Query: 96 RQFVVCGSEAHVCVLQTVLDLLGRGREVFVVEEAIGSRRPSDKALAVERMRQAGAMIVSR 155
Q ++ G AH+ L T + + F V +A+ +A+E A V
Sbjct: 144 DQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVADFSLEKHQMALEYAAGRCAFTVMT 203

Query: 156 EMV 158
+ +
Sbjct: 204 DSL 206


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3842SYCECHAPRONE1694e-58 Gram-negative bacterial type III secretion SycE cha...
		>SYCECHAPRONE#Gram-negative bacterial type III secretion SycE

chaperone signature.
Length = 130

Score = 169 bits (430), Expect = 4e-58
Identities = 51/115 (44%), Positives = 65/115 (56%), Gaps = 3/115 (2%)

Query: 5 YRAAIHQLFLALDLPTPNDEESVLSLQVGPHLCHLAEHPTDHLLMFT--RLEGQGDA-TA 61
+ AI QLF L L P+ E V+ ++VG CH+ EHP +LMFT L+ + T
Sbjct: 4 FEQAITQLFQQLSLSIPDTIEPVIGVKVGEFACHITEHPVGQILMFTLPSLDNNDEKETL 63

Query: 62 NEQNLFSQDPCKPILGRDPESGERLLWNRQPLQLLDRAQIHHQLEQLVAAAEELR 116
N+FSQD KPIL D G +LWNRQPL LD ++ QLE LV AE L+
Sbjct: 64 LSHNIFSQDILKPILSWDEVGGHPVLWNRQPLNSLDNNSLYTQLEMLVQGAERLQ 118


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3841YERSINIAYOPE2187e-71 Yersinia virulence determinant YopE protein signature.
		>YERSINIAYOPE#Yersinia virulence determinant YopE protein signature.

Length = 219

Score = 218 bits (555), Expect = 7e-71
Identities = 55/220 (25%), Positives = 99/220 (45%), Gaps = 23/220 (10%)

Query: 9 SPSFAVELHQAASGRLGQIEARQVATPSE---AQQLAQRQDAPKGEGLLARLGAALVRPF 65
S S + + S +G++ R V+ + A LA R ++P+G L +R+ L
Sbjct: 8 STSLPLPTSVSGSSSVGEMSGRSVSQQTSDQYANNLAGRTESPQGSSLASRIIERLSSVA 67

Query: 66 VAIMDWLGKLL--GSHA---RTGPQPSQDAQPAVMSSAVVFKQMVLQQALPMTLKGLDKA 120
+++ ++ ++ GSH P P+Q P S ++ + + + LP ++
Sbjct: 68 HSVIGFIQRMFSEGSHKPVVTPAPTPAQMPSPTSFSDSI---KQLAAETLPKYMQ----- 119

Query: 121 SELATLTPEGLAREHSRLASGDGALRSLSTALAGIRAGSQVEESRIQAGRLLERSIGGIA 180
+L +L E L + H + A+G G LR T G+ E + +A +L + GI
Sbjct: 120 -QLNSLDAEMLQKNHDQFATGSGPLRGSITQCQGLMQFCG-GELQAEASAILNTPVCGIP 177

Query: 181 LQQWGTTGGAASQLV-----LDASPELRREITDQLHQVMS 215
QWGT GGAAS V L + + + Q+ +++S
Sbjct: 178 FSQWGTIGGAASAYVASGVDLTQAANEIKGLAQQMQKLLS 217


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3838BINARYTOXINB300.008 Binary toxin B family signature.
		>BINARYTOXINB#Binary toxin B family signature.

Length = 764

Score = 30.4 bits (68), Expect = 0.008
Identities = 23/107 (21%), Positives = 41/107 (38%), Gaps = 7/107 (6%)

Query: 3 SAKNLKITFNPGTPIETRALRGLSLDIPAGQFVTVIGSNGAGKSTFLNAVSGDLP-IDS- 60
S+ + + +N +E L D G T NG + + S LP I
Sbjct: 457 SSTPITMNYNQFLELEKTKQLRLDTDQVYGNIATYNFENGRVRVDTGSNWSEVLPQIQET 516

Query: 61 -GQILIDDEDVTRKPVWARANRVARVFQDPMAGTCEDLTIEENMALA 106
+I+ + +D+ A DP+ T D+T++E + +A
Sbjct: 517 TARIIFNGKDLN----LVERRIAAVNPSDPLETTKPDMTLKEALKIA 559


94PA3724PA3714N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA3724-17-0.895214elastase LasB
PA37230130.808709FMN oxidoreductase
PA37221180.142910hypothetical protein
PA3721-213-0.577477transcriptional regulator
PA3720-212-0.275463hypothetical protein
PA3719-211-0.591063MexR antirepressor ArmR
PA3718-210-0.398092major facilitator superfamily transporter
PA3717-111-0.703735FkbP-type peptidyl-prolyl cis-trans isomerase
PA3716-112-0.353009hypothetical protein
PA3715-1100.604393hypothetical protein
PA3714-180.254141two-component response regulator
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3724THERMOLYSIN399e-136 Thermolysin metalloprotease (M4) family signature.
		>THERMOLYSIN#Thermolysin metalloprotease (M4) family signature.

Length = 544

Score = 399 bits (1027), Expect = e-136
Identities = 138/488 (28%), Positives = 206/488 (42%), Gaps = 59/488 (12%)

Query: 51 GAGGADELKAIRSTTLPNGKQVTRYEQFHNGVRVVGEAITEVKGPGKSVAAQRSGHFVAN 110
G + L I + G V R+EQ +G + G+ + SG + N
Sbjct: 69 GGQARERLSLIGNKLDELGHTVMRFEQAIAASLCMGAVLVAHVNDGE--LSSLSGTLIPN 126

Query: 111 IAADLPGSTTAAVSAEQVLAQAKS------LKAQGRKTENDKVELVIRLGENNIAQLVYN 164
+ T AA+S +Q AK K + E LVI E +L Y
Sbjct: 127 LDKRTL-KTEAAISIQQAEMIAKQDVADRVTKERPAAEEGKPTRLVIYPDEET-PRLAYE 184

Query: 165 VSYLIPGEGLSRPHFVIDAKTGEVLDQWEGLAHAEAGGPG---------------GNQKI 209
V+ ++IDA G+VL++W + A+ GG G+QK
Sbjct: 185 VNVRFLTPVPGNWIYMIDAADGKVLNKWNQMDEAKPGGAQPVAGTSTVGVGRGVLGDQKY 244

Query: 210 GKYTYGSDYGPLIVNDRCEMDDGNVITVDMNSSTDDSKTTPFRFACPTNTYKQVNGAYSP 269
TY S YG + D + T D + T + + Q +Y
Sbjct: 245 INTTYSSYYGYYYLQDNTR--GSGIFTYDGRNRTVLPGSLW------ADGDNQFFASYDA 296

Query: 270 -LNDAHFFGGVVFKLYRDWFG---TSPLTHKLYMKVHYGRSVENAYWDGTAMLFGDG-AT 324
DAH++ GVV+ Y++ G + VHYGR NA+W+G+ M++GDG
Sbjct: 297 AAVDAHYYAGVVYDYYKNVHGRLSYDGSNAAIRSTVHYGRGYNNAFWNGSQMVYGDGDGQ 356

Query: 325 MFYPLV-SLDVAAHEVSHGFTEQNSGLIYRGQSGGMNEAFSDMAGEAAEFYMRGKNDFLI 383
F P +DV HE++H T+ +GL+Y+ +SG +NEA SD+ G EFY D+ I
Sbjct: 357 TFLPFSGGIDVVGHELTHAVTDYTAGLVYQNESGAINEAMSDIFGTLVEFYANRNPDWEI 416

Query: 384 GYDIKK---GSGALRYMDQPSRDGRSIDNASQYYNGID----VHHSSGVYNRAFYLLANS 436
G DI ALR M P++ G D+ S+ Y G VH +SG+ N+A YLL+
Sbjct: 417 GEDIYTPGVAGDALRSMSDPAKYGDP-DHYSKRYTGTQDNGGVHTNSGIINKAAYLLSQG 475

Query: 437 --------PGWDTRKAFEVFVDANRYYWTATSNYNSGACGVIRSAQNRNYS----AADVT 484
G K ++F A YY T TSN++ +++A + S V
Sbjct: 476 GVHYGVSVTGIGRDKMGKIFYRALVYYLTPTSNFSQLRAACVQAAADLYGSTSQEVNSVK 535

Query: 485 RAFSTVGV 492
+AF+ VGV
Sbjct: 536 QAFNAVGV 543


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3721HTHTETR647e-15 TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 64.3 bits (156), Expect = 7e-15
Identities = 25/96 (26%), Positives = 44/96 (45%), Gaps = 5/96 (5%)

Query: 10 ERGRQRRRAMLDAATQAFLEHGFEGTTLDMVIERAGGSRGTLYSSFGGKEGLFAAVIA-- 67
+ ++ R+ +LD A + F + G T+L + + AG +RG +Y F K LF+ +
Sbjct: 7 QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELS 66

Query: 68 -HMIGEIFDDSADQPR--PAATLSATLEHFGRRFLT 100
IGE+ + + P + L L H +T
Sbjct: 67 ESNIGELELEYQAKFPGDPLSVLREILIHVLESTVT 102


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3718TCRTETB591e-11 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 59.1 bits (143), Expect = 1e-11
Identities = 63/379 (16%), Positives = 131/379 (34%), Gaps = 55/379 (14%)

Query: 40 IALPSLQRSFGGDLAALSWIMSAFPFVGVFGGIAAGLLVRRWGDRRLLTGGLAILGGASL 99
++LP + F A+ +W+ +AF G G L + G +RLL G+ I S+
Sbjct: 35 VSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSV 94

Query: 100 LGA-SMQDFTWLLATRFVEGLGFLIVVVAAPAVLHRITSETRRSVVFGLWSTFMAGGIAL 158
+G F+ L+ RF++G G V+ R + R FGL + +A G +
Sbjct: 95 IGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGV 154

Query: 159 SMLFGPLLADW-RADWQLSALLVLVAALLLPLSVPADDGCRAAGVRPAGLGTLLKVPAIT 217
G ++A + + L ++ + + + + + G+ +L I
Sbjct: 155 GPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGI--ILMSVGIV 212

Query: 218 LLALGFTTYNLQFFALMTF----------------------------------------- 236
L T+Y++ F +
Sbjct: 213 FFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTV 272

Query: 237 -----LPVFLMQR---LGVALETAGLIGAAIVAANALGNVAAGFILSRGIRPGALLASTA 288
+ ++M+ L A + +I ++ G + + RG + T
Sbjct: 273 AGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTF 332

Query: 289 ILMGLTGAAFFHAAMPGLLAIALGFVFSAVAGMLPTTVLATAPLASPAPSLTPLAIGWVM 348
+ + A+F + I + FV ++ TV++T +S + +
Sbjct: 333 LSVSFLTASFLLETTSWFMTIIIVFVLGGLSFT--KTVISTIVSSSLKQQEAGAGMSLLN 390

Query: 349 QGNYLGQVIGPLLIGLIVS 367
++L + G ++G ++S
Sbjct: 391 FTSFLSEGTGIAIVGGLLS 409


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3717INFPOTNTIATR805e-22 Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 80.4 bits (198), Expect = 5e-22
Identities = 42/104 (40%), Positives = 59/104 (56%), Gaps = 2/104 (1%)

Query: 5 LQIEDLLLGDGKEVVKGALITTQYKGTLEDGTLFDSSYERGRPFQCVIGTGRVIKGWDQG 64
LQ + + G G + K +T +Y GTL DGT+FDS+ + G+P +VI GW +
Sbjct: 128 LQYKIIDAGTGAKPGKSDTVTVEYTGTLIDGTVFDSTEKAGKP--ATFQVSQVIPGWTEA 185

Query: 65 LMGMKVGGKRRLFVPSHLAYGERQVGAHIKPHSNLLFEIELLEV 108
L M G +FVP+ LAYG R VG I P+ L+F+I L+ V
Sbjct: 186 LQLMPAGSTWEVFVPADLAYGPRSVGGPIGPNETLIFKIHLISV 229


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3714HTHFIS613e-13 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.4 bits (149), Expect = 3e-13
Identities = 28/130 (21%), Positives = 57/130 (43%), Gaps = 11/130 (8%)

Query: 2 KTRVILVDDHALTLIGMRYLLSAYD-DLRIVAQAQDADGLLAQLEAHPCDLLITDLMMPG 60
+++ DD A + LS D+RI + A + A DL++TD++MP
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATL---WRWIAAGDGDLVVTDVVMPD 59

Query: 61 SQQADGLRLVQKVRRRYPDLPIIVVTMLGNPALVSSLLKLGIHGLVSK----RGMLDDLP 116
+ L+ ++++ PDLP++V++ + G + + K ++ +
Sbjct: 60 ---ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116

Query: 117 KAIRHAGRRP 126
+A+ RRP
Sbjct: 117 RALAEPKRRP 126


95PA3709PA3699N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA3709-180.681026major facilitator superfamily transporter
PA37080100.817924chemotaxis transducer
PA37071111.397791hypothetical protein
PA37060210.701956biofilm formation methyltransferase WspC
PA3705015-0.367061hypothetical protein
PA3704015-0.432563chemotaxis sensor/effector fusion protein
PA3703018-1.767154chemotaxis-specific methylesterase
PA3702216-1.885049two-component response regulator
PA3701117-2.141288peptide chain release factor 1
PA3700012-1.345495lysine--tRNA ligase
PA3699-111-0.951658transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3709TCRTETA356e-04 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 35.2 bits (81), Expect = 6e-04
Identities = 15/27 (55%), Positives = 17/27 (62%)

Query: 304 VAGWLSDRIGRKPVLLAGLLLATLFYF 330
V G LSDR GR+PVLL L A + Y
Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYA 88



Score = 32.1 bits (73), Expect = 0.005
Identities = 24/113 (21%), Positives = 45/113 (39%), Gaps = 17/113 (15%)

Query: 63 IFALMAFAAGFLVRPFGALVFGRLGDMIGRKYTFLVTILLMGLSTFAVGLLPTYASIGVA 122
++ALM FA A V G L D GR+ LV++ + + P
Sbjct: 51 LYALMQFAC--------APVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLW----- 97

Query: 123 APIILVTLRMLQGLALGGEYGGAAIYVAEHAPANKRGSYTSWIQSTATLGLLL 175
+L R++ G+ G A Y+A+ ++R + ++ + G++
Sbjct: 98 ---VLYIGRIVAGIT-GATGAVAGAYIADITDGDERARHFGFMSACFGFGMVA 146


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3706FLGHOOKFLIK290.036 Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 29.0 bits (64), Expect = 0.036
Identities = 18/76 (23%), Positives = 25/76 (32%), Gaps = 3/76 (3%)

Query: 249 QPIGVPLSFVFRRTSEAPRGARPKAVSDGARPVVAAAVERASIRPSPPPPAKPRQRLSSL 308
P P F + + A A+P+ E S P+ S L
Sbjct: 155 LPTEKPTLFTKLTSEQLTTAQPDDAPGTPAQPLTPLVAEAQSKAEVISTPSPVTAAASPL 214

Query: 309 VPPASGQPL---ASPV 321
+ P QPL A+PV
Sbjct: 215 ITPHQTQPLPTVAAPV 230


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3704HTHFIS747e-16 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 74.1 bits (182), Expect = 7e-16
Identities = 30/113 (26%), Positives = 52/113 (46%), Gaps = 2/113 (1%)

Query: 644 QRKRILVVDDSLTVRELERKLLLGRGYDVAVAVDGMDGWNALRSEHFDLLITDIDMPRMD 703
ILV DD +R + + L GYDV + + W + + DL++TD+ MP +
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 704 GIELVTLVRRDSRLQSLPVMVVSYKDREEDRRRGLDAGADYYLAKASFHDEAL 756
+L+ ++ LPV+V+S ++ + + GA YL K E +
Sbjct: 62 AFDLLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELI 112


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3703HTHFIS522e-09 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 51.8 bits (124), Expect = 2e-09
Identities = 31/141 (21%), Positives = 53/141 (37%), Gaps = 13/141 (9%)

Query: 2 RIGIVNDMPLAVEALRRALAFEPQHQIVWVASNGAEAVTQCAADTPDVVLMDLLMPVMDG 61
I + +D L +AL+ V + SN A AA D+V+ D++MP +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGY--DVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 62 VEATRRIMAESPCAIVIVTVDIEQNVHRVFEAMGYGALDAVNTP----------ALGIGN 111
+ RI P V+V + + +A GA D + P +
Sbjct: 63 FDLLPRIKKARPDLPVLV-MSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 112 PQTAAAPLLRKIQNVGWLIGQ 132
P+ + L Q+ L+G+
Sbjct: 122 PKRRPSKLEDDSQDGMPLVGR 142


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3702HTHFIS681e-14 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 67.5 bits (165), Expect = 1e-14
Identities = 34/129 (26%), Positives = 53/129 (41%), Gaps = 3/129 (2%)

Query: 21 VLLVDDQAMIGEAVRRSLASEAGIDFHFCSDPQQAVAVANQIKPTVILQDLVMPGVDGLT 80
+L+ DD A I + ++L S AG D S+ +++ D+VMP +
Sbjct: 6 ILVADDDAAIRTVLNQAL-SRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 81 LLAAYRGNPATRDIPIIVLSTKEEPTVKSAAFAAGANDYLVKLPDAIELVARIRYHSRSY 140
LL + A D+P++V+S + A GA DYL K D EL+ I
Sbjct: 65 LLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 141 IALQQRDEA 149
+ E
Sbjct: 123 KRRPSKLED 131


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3699HTHTETR524e-10 TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 51.6 bits (123), Expect = 4e-10
Identities = 29/137 (21%), Positives = 58/137 (42%), Gaps = 7/137 (5%)

Query: 27 KASRQGSEQRRQAILDAAMRLIVRDGVRAVRHRAVAAEAQVPLSATTYYFKDIDDLITDT 86
+ ++Q +++ RQ ILD A+RL + GV + +A A V A ++FKD DL ++
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 87 FALFVERNAEALSAFWSSVEGDLQEMAAVLADD-------PGARGSLVERIVELAVQYVQ 139
+ L E + + GD + + R L+E I +
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 140 VQLTERREHLLAEQAFR 156
+ + ++ + L +++
Sbjct: 123 MAVVQQAQRNLCLESYD 139


96PA3467PA3459N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA34671101.718731major facilitator superfamily transporter
PA34660111.611104ATP-dependent RNA helicase
PA34650112.081092hypothetical protein
PA3464-1101.733958hypothetical protein
PA3463-1101.476811hypothetical protein
PA3462-1111.967381sensor/response regulator hybrid protein
PA3461-2113.060323hypothetical protein
PA34600102.446086acetyltransferase
PA3459092.024746glutamine amidotransferase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3467TCRTETA583e-11 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 57.9 bits (140), Expect = 3e-11
Identities = 46/209 (22%), Positives = 86/209 (41%), Gaps = 4/209 (1%)

Query: 26 VIIALAFFFDSMDLAMMTFLLGSIKAEFGLDSAQA---GLLASSSFFGMVIGAALSGMLA 82
++I D++ + ++ +L + + + G+L + A + G L+
Sbjct: 8 IVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALS 67

Query: 83 DRFGRKPVFQASIVLWGLASYLCSTAGDLDSLTFYRVLLGIGMGMEFPIAQSLLSEMIPA 142
DRFGR+PV S+ + + +TA L L R++ GI G +A + ++++
Sbjct: 68 DRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGI-TGATGAVAGAYIADITDG 126

Query: 143 SRRGKYIALMDGFWPLGFVAAGCLSYFLLPLTGWRSIFLVLALPAVFVLAIRFLIPESPR 202
R ++ M + G VA L + + F AL + L FL+PES +
Sbjct: 127 DERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHK 186

Query: 203 WLEQAGRREQADRVLRDIEARVMRSLGLT 231
+ RRE + + AR M +
Sbjct: 187 GERRPLRREALNPLASFRWARGMTVVAAL 215



Score = 30.6 bits (69), Expect = 0.012
Identities = 37/167 (22%), Positives = 61/167 (36%), Gaps = 9/167 (5%)

Query: 286 LSALLQQSGFAVTQSVYYTVLISLAGIPGFLCAAWL---VESWGRKPSCVLMLLGGGAMA 342
L LL+ + + +Y +L++L + F CA L + +GR+P ++ L G
Sbjct: 28 LPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDY 87

Query: 343 YAYGQTAVFGGSLALLIGFGLAMQFFLFGMWAVLYTYTPELYPTSARATGSGFASAVGRI 402
L +L G + AV Y ++ RA GF SA
Sbjct: 88 AIMA----TAPFLWVLY-IGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGF 142

Query: 403 GSLLGPLVTGLVLPLTGQGGVFTLGALCFGVAALVVWAFGIETRGRT 449
G + GP++ GL+ + F A G+ L E+
Sbjct: 143 GMVAGPVLGGLMGGFSPHAP-FFAAAALNGLNFLTGCFLLPESHKGE 188


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3465TCRTETA433e-06 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 42.5 bits (100), Expect = 3e-06
Identities = 57/281 (20%), Positives = 94/281 (33%), Gaps = 13/281 (4%)

Query: 79 ALPLVLLSILSGVIADNHDRRKIMLWGLSFEMTGAMFATLLAFLGYLDPVLLIISILWIS 138
AL + + G ++D RR ++L L A + + P L ++ I I
Sbjct: 53 ALMQFACAPVLGALSDRFGRRPVLLVSL-------AGAAVDYAIMATAPFLWVLYIGRIV 105

Query: 139 LGGS-VTIPAWQAAVNEQVPARMVSDAVLLNSVNYNVARAAGPALGGLLLSAVGPAWVFL 197
G + T A + + + S + AGP LGGL+ P F
Sbjct: 106 AGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLM-GGFSPHAPFF 164

Query: 198 FNSFCY-MALIWAIWQWRRDVPKRSLPPEGILEGVTAALRFTQYSTVTRLVMMRSFAFGL 256
+ + + + P A+ R+ + TV +M F L
Sbjct: 165 AAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQL 224

Query: 257 SASAVWALLPLLAHRNPDGDAAIYGYMLGALG-LGAILGSTQVSRLRQRIGSSRLISLAG 315
AL + DA G L A G L ++ + + R+G R + L
Sbjct: 225 VGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGM 284

Query: 316 FTLALILLTLGLVDNLWVLFPVLIL--GGGCWIGALATYNS 354
+ L W+ FP+++L GG + AL S
Sbjct: 285 IADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAMLS 325



Score = 40.6 bits (95), Expect = 1e-05
Identities = 32/189 (16%), Positives = 63/189 (33%), Gaps = 12/189 (6%)

Query: 12 PLKPEGQAAKPERTGTWAPFSIQAFRIIWICNLFANLGTWA--QSVAAAWVVTDA---HA 66
K E + + E A F + + Q AA WV+ H
Sbjct: 184 SHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHW 243

Query: 67 SPLMVA-MIQVAAALPLVLLSILSGVIADNHDRRKIMLWGLSFEMTGAMFATLLAFLGYL 125
+ + L + ++++G +A R+ ++ G+ + TG L +
Sbjct: 244 DATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTG------YILLAFA 297

Query: 126 DPVLLIISILWISLGGSVTIPAWQAAVNEQVPARMVSDAVLLNSVNYNVARAAGPALGGL 185
+ I+ + G + +PA QA ++ QV + ++ GP L
Sbjct: 298 TRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTA 357

Query: 186 LLSAVGPAW 194
+ +A W
Sbjct: 358 IYAASITTW 366



Score = 34.4 bits (79), Expect = 0.001
Identities = 32/142 (22%), Positives = 51/142 (35%), Gaps = 8/142 (5%)

Query: 277 AAIYGYMLGALGLGAILGSTQVSRLRQRIGSSR--LISLAGFTLALILLTLGLVDNLWVL 334
A YG +L L + + L R G L+SLAG + ++ LWVL
Sbjct: 42 TAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATA--PFLWVL 99

Query: 335 FPVLILGG-GCWIGALATYNSAVQILVPDWIKARALALYQTALYGGLALGSFLWGHLAET 393
+ I+ G GA+A + + + +AR G+ G L G +
Sbjct: 100 YIGRIVAGITGATGAVAG--AYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGG- 156

Query: 394 MTVHGALLAAGCLLLASVILLY 415
+ H AA L + +
Sbjct: 157 FSPHAPFFAAAALNGLNFLTGC 178


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3464PRPHPHLPASEC384e-05 Prokaryotic zinc-dependent phospholipase C signature.
		>PRPHPHLPASEC#Prokaryotic zinc-dependent phospholipase C signature.

Length = 398

Score = 38.5 bits (89), Expect = 4e-05
Identities = 13/38 (34%), Positives = 20/38 (52%)

Query: 241 QYFGLSRFAFANGHPYWGYRFLGWGMHYIQDITQPYHS 278
++ L+R+ + G+ +LG MHY DI PYH
Sbjct: 128 KFSALARYEWQRGNYKQATFYLGEAMHYFGDIDTPYHP 165


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3462HTHFIS692e-14 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 69.5 bits (170), Expect = 2e-14
Identities = 34/131 (25%), Positives = 56/131 (42%), Gaps = 7/131 (5%)

Query: 788 DAPCILVAEDNPVNQLVVRGFLAKRGYAVRLAGNGRLALDEYLRDPNGIQLILMDGEMPE 847
ILVA+D+ + V+ L++ GY VR+ N L++ D MP+
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGD--LVVTDVVMPD 59

Query: 848 MDGFEATRLIRREERAQGWPRVPIVALTAHILDEHRRAGIEAGMDAYLGKPVDRAELYAT 907
+ F+ I++ P +P++ ++A E G YL KP D EL
Sbjct: 60 ENAFDLLPRIKKA-----RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGI 114

Query: 908 LERLLGQPSRQ 918
+ R L +P R+
Sbjct: 115 IGRALAEPKRR 125


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3460SACTRNSFRASE353e-04 Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 35.3 bits (81), Expect = 3e-04
Identities = 15/53 (28%), Positives = 19/53 (35%)

Query: 197 LAVDPQCSRPGVGEALVRHLVEHFMSRELAYLDLSVLHNNQQAKALYRKLGFR 249
+AV + GVG AL+ +E L L N A Y K F
Sbjct: 95 IAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3459ANTHRAXTOXNA330.003 Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 33.2 bits (75), Expect = 0.003
Identities = 25/68 (36%), Positives = 38/68 (55%), Gaps = 7/68 (10%)

Query: 192 APHTLLEGVKKLPPATW-MSVDLDGSCEQRTWWT---LDYG--PRPDERELTLDDWQERV 245
AP +L E K++P W V+ S E++ T + YG +PD + TL +WQ+++
Sbjct: 498 AP-SLTEIKKQIPQKEWDKVVNTPNSLEKQKGVTNLLIKYGIERKPDSTKGTLSNWQKQM 556

Query: 246 LDGLREAV 253
LD L EAV
Sbjct: 557 LDRLNEAV 564


97PA3407PA3400N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA34072133.352576heme acquisition protein HasA
PA34062123.310902transporter HasD
PA34051121.936538metalloprotease secretion protein
PA34042130.620261hypothetical protein
PA3403a113-0.397989hypothetical protein
PA34031140.186406hypothetical protein
PA3402115-0.094635hypothetical protein
PA34012150.063146hypothetical protein
PA34002160.897137hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3407PF064382761e-97 Heme acquisition protein HasAp
		>PF06438#Heme acquisition protein HasAp

Length = 205

Score = 276 bits (707), Expect = 1e-97
Identities = 205/205 (100%), Positives = 205/205 (100%)

Query: 1 MSISISYSTTYSGWTVADYLADWSAYFGDVNHRPGQVVDGSNTGGFNPGPFDGSQYALKS 60
MSISISYSTTYSGWTVADYLADWSAYFGDVNHRPGQVVDGSNTGGFNPGPFDGSQYALKS
Sbjct: 1 MSISISYSTTYSGWTVADYLADWSAYFGDVNHRPGQVVDGSNTGGFNPGPFDGSQYALKS 60

Query: 61 TASDAAFIAGGDLHYTLFSNPSHTLWGKLDSIALGDTLTGGASSGGYALDSQEVSFSNLG 120
TASDAAFIAGGDLHYTLFSNPSHTLWGKLDSIALGDTLTGGASSGGYALDSQEVSFSNLG
Sbjct: 61 TASDAAFIAGGDLHYTLFSNPSHTLWGKLDSIALGDTLTGGASSGGYALDSQEVSFSNLG 120

Query: 121 LDSPIAQGRDGTVHKVVYGLMSGDSSALQGQIDALLKAVDPSLSINSTFDQLAAAGVAHA 180
LDSPIAQGRDGTVHKVVYGLMSGDSSALQGQIDALLKAVDPSLSINSTFDQLAAAGVAHA
Sbjct: 121 LDSPIAQGRDGTVHKVVYGLMSGDSSALQGQIDALLKAVDPSLSINSTFDQLAAAGVAHA 180

Query: 181 TPAAAAAEVGVVGVQELPHDLALAA 205
TPAAAAAEVGVVGVQELPHDLALAA
Sbjct: 181 TPAAAAAEVGVVGVQELPHDLALAA 205


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3405RTXTOXIND417e-145 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 417 bits (1073), Expect = e-145
Identities = 96/435 (22%), Positives = 170/435 (39%), Gaps = 8/435 (1%)

Query: 15 AALELDEK---RFSRLGWGLVLLGFVGFLLWAGLAPLDKGVGVSGTVMVAGSRKAVQHPT 71
A LEL E R RL ++ V + + L ++ +G + +G K ++
Sbjct: 44 AHLELIETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIE 103

Query: 72 GGLVRHIRVHEGERVEAGQVLLEMDATQARAQADGLFAQYLAALASLARLSAERDEKARI 131
+V+ I V EGE V G VLL++ A A A + L A R
Sbjct: 104 NSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELN 163

Query: 132 EFPAELLALDDPRLPTLLEQQ----RQLHDSRRRALRLELDGLAETVAGSQAQLDGLQAA 187
+ P EL D+P + E++ L + + + + +A+ + A
Sbjct: 164 KLP-ELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLAR 222

Query: 188 LRSKEQQRAALEEQLRGLRQLASEGYVPRNRLLDSERLLAQVNGEIAGDLGSLGSTRRQI 247
+ E + +L L + + ++ +L+ E + E+ L +I
Sbjct: 223 INRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEI 282

Query: 248 LELRLRMAQRREKFQEEVRASLADAQVRAEELRNRLASARFDLANSEVRAPVAGLVVGQE 307
L + + F+ E+ L L LA S +RAPV+ V +
Sbjct: 283 LSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLK 342

Query: 308 VFTEGGVIAPGQQLMEILPERQPLLVDARLPVEMVDKVRVGLPVELMFSAFNQSTTPRVE 367
V TEGGV+ + LM I+PE L V A + + + + VG + AF + +
Sbjct: 343 VHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLV 402

Query: 368 GEVTLVSADRLLDERSEAPYYRVRIRVGEEGVRRLAGLEIRPGMPVEAFVRSGERSLLNY 427
G+V ++ D + D+R + + + + GM V A +++G RS+++Y
Sbjct: 403 GKVKNINLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISY 462

Query: 428 LFKPLADRTHLALGE 442
L PL + +L E
Sbjct: 463 LLSPLEESVTESLRE 477


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3404RTXTOXIND320.007 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 31.7 bits (72), Expect = 0.007
Identities = 20/171 (11%), Positives = 49/171 (28%), Gaps = 11/171 (6%)

Query: 60 LPSLRYDYNKARNDSTVSQGDARVERDYRSYASTLSLEQPLFDYEAYARYRQ-GEAQAL- 117
L +L + + + S++ Q R S + P ++ E + L
Sbjct: 127 LTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLR 186

Query: 118 ---FADEQFRGRSQELA---VRLFAAYSETLFAREQVVLAEAQRRALETQLAFNQRAFEE 171
EQF + + L +E L ++ E R +++L +
Sbjct: 187 LTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHK 246

Query: 172 GEGTRTDLLE---TRARLSLTRAEEIAASDRAAAARRTLEAMLGQALEDRE 219
+ +LE + ++ + + + + +
Sbjct: 247 QAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFK 297


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3402RTXTOXIND566e-11 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 56.0 bits (135), Expect = 6e-11
Identities = 25/161 (15%), Positives = 59/161 (36%), Gaps = 17/161 (10%)

Query: 41 IVSSKAKGRVQVLHVRRGDEVKQGDLLISLDSPELEAQLDALHAARNQAQAQLDESLHGT 100
+ V+ + V+ G+ V++GD+L+ L + EA ++ QA+ + +
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILS 157

Query: 101 REESIRALKASLAQAEAELRNAESDFQRNQQMVERGFLSRTQFDLSRRERDVARDRVAEA 160
R + L E +N ++++ L + QF + ++ + +
Sbjct: 158 RSIELNKLPELKLPDEPYFQNV-----SEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKK 212

Query: 161 RANLDEGLKGDREERRQALQAAVRRADAQIAELQAQIDDLQ 201
RA + A + R + ++++DD
Sbjct: 213 RAER------------LTVLARINRYENLSRVEKSRLDDFS 241



Score = 52.9 bits (127), Expect = 7e-10
Identities = 29/205 (14%), Positives = 77/205 (37%), Gaps = 24/205 (11%)

Query: 75 LEAQLDALHAARNQAQAQLDESLHGTREESIRALKASLAQAEAELRNAESDFQRNQQMVE 134
++ Q + Q + LD+ + + A + + E R +S ++
Sbjct: 191 IKEQFSTWQNQKYQKELNLDK-----KRAERLTVLARINRYENLSRVEKSRLDDFSSLLH 245

Query: 135 RGFLSRTQFDLSRRERDVARDRVAEARANLDE------GLKGDREERRQALQAAV----R 184
+ +++ + A + + ++ L++ K + + Q + + R
Sbjct: 246 KQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLR 305

Query: 185 RADAQIAELQAQI----DDLQ---VRAPVNGEVGPIPA-EQGELINAYSPLLTLVRLDDS 236
+ I L ++ + Q +RAPV+ +V + +G ++ L+ +V DD+
Sbjct: 306 QTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDT 365

Query: 237 YFV-FNLREDILAKVRKGDRIVMQV 260
V ++ + + G +++V
Sbjct: 366 LEVTALVQNKDIGFINVGQNAIIKV 390


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3400ABC2TRNSPORT280.039 ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein

signature.
Length = 262

Score = 28.4 bits (63), Expect = 0.039
Identities = 27/122 (22%), Positives = 50/122 (40%), Gaps = 1/122 (0%)

Query: 246 LGYRQSASFFMLLGIVLPFLIAVIALSEFIAELLPTEESVYLTMTFITLPLFYMAGYSWP 305
LGY Q S L ++ +A +L + L P+ + T + P+ +++G +P
Sbjct: 139 LGYTQWLSLLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFP 198

Query: 306 EQAMPDWVRWLADAIPSTWAIRAIAEMNQMDLPLREVSDHALVLLGMAATYALLGTLLYQ 365
+P + A +P + +I I + + P+ +V H L L T L +
Sbjct: 199 VDQLPIVFQTAARFLPLSHSIDLIRPI-MLGHPVVDVCQHVGALCIYIVIPFFLSTALLR 257

Query: 366 YR 367
R
Sbjct: 258 RR 259


98PA3206PA3202N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA32062160.589747two-component sensor
PA32051130.155566hypothetical protein
PA32040130.178246two-component response regulator
PA3203114-0.744538hypothetical protein
PA3202311-0.128251hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3206PF06580290.026 Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.4 bits (66), Expect = 0.026
Identities = 16/100 (16%), Positives = 35/100 (35%), Gaps = 17/100 (17%)

Query: 341 VDNLLRNAVRFNPVGQPLEVRASSAGDYLRLSVRDHGPGIAAELQEQLGEPFFRAPNQSS 400
V+N +++ + P G + ++ + + L V + G +E
Sbjct: 264 VENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE-------------- 309

Query: 401 PGHGLGLA-IARRAIERHGGHLRLG-NHPDGGFIATLSLP 438
G GL + R +G ++ + G A + +P
Sbjct: 310 -STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3204HTHFIS1039e-28 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 103 bits (258), Expect = 9e-28
Identities = 42/117 (35%), Positives = 63/117 (53%)

Query: 4 LLLIDDDRELCELLGTWLVQEGFSVRASHDGAQARRALAEQTPDAVVLDVMLPDGSGLEL 63
+L+ DDD + +L L + G+ VR + + A R +A D VV DV++PD + +L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 64 LKQLRGDHPDLPVLMLSARGEPLDRILGLELGADDYLAKPCDPRELTARLRAVLRRT 120
L +++ PDLPVL++SA+ + I E GA DYL KP D EL + L
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3203IGASERPTASE280.010 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 28.1 bits (62), Expect = 0.010
Identities = 16/76 (21%), Positives = 31/76 (40%), Gaps = 8/76 (10%)

Query: 22 EEPAPAPIPAAQPSITQATAELERRLVETERQRDELVSRMRQENRQLREQ--------LQ 73
E P P P PA T+ AE ++ +T + ++ + +NR++ ++ Q
Sbjct: 1022 EAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQ 1081

Query: 74 AAQAQRQPPLLTEEQT 89
+ + E QT
Sbjct: 1082 TNEVAQSGSETKETQT 1097


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3202adhesinmafb309e-04 Neisseria meningitidis: adhesin MafB signature.
		>adhesinmafb#Neisseria meningitidis: adhesin MafB signature.

Length = 467

Score = 30.4 bits (68), Expect = 9e-04
Identities = 13/45 (28%), Positives = 18/45 (40%)

Query: 53 AAGFTGSLIVAEFDSLAAAQSWAEADPYRAAGVYAEVVVKPFKKV 97
G GS+ E ++ A W + +P A V A V KV
Sbjct: 278 VIGGLGSVAGFEKNTREAVDRWIQENPNAAETVEAVFNVAAAAKV 322


99PA3147PA3141N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA3147249-8.723325glycosyl transferase WbpJ
PA3146246-7.501433NAD-dependent epimerase/dehydratase
PA3145031-5.735572glycosyltransferase WbpL
PA3143018-3.636019hypothetical protein
PA3142011-3.375712hypothetical protein
PA3141-18-2.697242nucleotide sugar epimerase/dehydratase WbpM
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3147ANTHRAXTOXNA340.002 Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 33.6 bits (76), Expect = 0.002
Identities = 10/36 (27%), Positives = 16/36 (44%), Gaps = 2/36 (5%)

Query: 363 DITAAIFRLLLLSEDERRTMGQRGRDAVL-EHYTYE 397
D+ L LSE+E+ +M RG + +E
Sbjct: 110 DLVEHK-ELQDLSEEEKNSMNSRGEKVPFASRFVFE 144


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3146NUCEPIMERASE791e-18 Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 78.7 bits (194), Expect = 1e-18
Identities = 57/342 (16%), Positives = 109/342 (31%), Gaps = 64/342 (18%)

Query: 1 MVTGASGFVGSALCCELARTGYAVIAV-------------VRRVVERIPSVTYIEADLTD 47
+VTGA+GF+G + L G+ V+ + R + P + + DL D
Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDLAD 63

Query: 48 PATFAGEFPT--VDCIIHLAGRAHILTDKVADPLAAFREVNRDATVRLATRALEAGVKRF 105
F + + + R + + +P A+ + N + + ++
Sbjct: 64 REGMTDLFASGHFERVFISPHRLAV-RYSLENP-HAYADSNLTGFLNILEGCRHNKIQHL 121

Query: 106 VFVSSIGVNGNSTRQQAFNEDSPAG-PHAPYAISKYEAEQELGTLLRGKGMELVVVRPPL 164
++ SS V G R+ F+ D P + YA +K E T G+ +R
Sbjct: 122 LYASSSSVYG-LNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFT 180

Query: 165 IYANDAPGNFGR-------LLKLVASGLPLPLDG------------------VRNARSLV 199
+Y G +GR K + G + + +R +
Sbjct: 181 VY-----GPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIP 235

Query: 200 SRRNIVGFLSLCAEHPDAAGELFLVADGEDVSIAQMIEALSRGMGRRPALFTFPAVLLKL 259
+ A ++ + + V + I+AL +G K
Sbjct: 236 HADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIE----------AKK 285

Query: 260 VMCLLGKASMHEQLCGSLQVDASKARRLLGWVPVETIGAGLQ 301
M L + E D ++G+ P T+ G++
Sbjct: 286 NMLPLQPGDVLETSA-----DTKALYEVIGFTPETTVKDGVK 322


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3145RTXTOXIND290.031 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 29.0 bits (65), Expect = 0.031
Identities = 4/24 (16%), Positives = 8/24 (33%)

Query: 44 PTPRGGGVAIVLVFLAALVWMLSA 67
PR I+ + A + +
Sbjct: 55 RRPRLVAYFIMGFLVIAFILSVLG 78


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3141NUCEPIMERASE579e-11 Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 57.1 bits (138), Expect = 9e-11
Identities = 46/292 (15%), Positives = 103/292 (35%), Gaps = 56/292 (19%)

Query: 301 VMVTGAGGSIGSELCRQIMSCSPSVLILFEHSEYNLYSIHQELERRIKRESLSVNLLPIL 360
+VTGA G IG + ++++ V+ + ++Y S+ Q + +
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQP----GFQFHK 58

Query: 361 GSVRNPERLVDVMRTWKVNTVYHAAAYKHVPIVEHNIAEGVLNNVIGTLHAVQAAVQVGV 420
+ + E + D+ + V+ + V N +N+ G L+ ++ +
Sbjct: 59 IDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKI 118

Query: 421 QNFVLIST---------------DKAVRPTNVMGSTKRLAEMVLQALSNESAPVLFGDRK 465
Q+ + S+ D P ++ +TK+ E++ S
Sbjct: 119 QHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYS------------ 166

Query: 466 DVHHVNKTRFTMVRFGNVLGSSGS---VIPLFREQIKRGGPVTV-THPSITRYFMTIPEA 521
H+ T +RF V G G + F + + G + V + + R F I +
Sbjct: 167 ---HLYGLPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDI 223

Query: 522 AQLVIQA----------GSMGQGGD--------VFVLDMGPPVKILELAEKM 555
A+ +I+ ++ G V+ + PV++++ + +
Sbjct: 224 AEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQAL 275


100PA3110PA3087N        Y        YPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA311017-0.338986hypothetical protein
PA310917-0.801781hypothetical protein
PA310827-0.254626amidophosphoribosyltransferase
PA3107050.674419O-succinylhomoserine sulfhydrylase
PA3106060.144770oxidoreductase
PA3105070.122476type II secretion system protein D
PA31041101.076434type II secretion system protein N
PA31031131.512249type II secretion system protein E
PA31020141.002840type II secretion system protein F
PA31012141.690577type II secretion system protein G
PA31004162.628003type II secretion system protein H
PA30993112.624324type II secretion system protein I
PA30982112.557691type II secretion system protein J
PA3097-1102.013545type II secretion system protein K
PA3096092.314993type II secretion system protein L
PA3095-182.258096type II secretion system protein M
PA3094-182.403175***transcriptional regulator
PA3093-181.863780hypothetical protein
PA3092-181.9503612,4-dienoyl-CoA reductase
PA3091092.261164hypothetical protein
PA3089a0101.8978301-aminocyclopropane-1-carboxylate deaminase
PA3089080.687222hypothetical protein
PA3088-19-0.690458NAD kinase
PA3087-110-1.006930hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3110PERTACTIN310.003 Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 31.2 bits (70), Expect = 0.003
Identities = 29/85 (34%), Positives = 34/85 (40%), Gaps = 3/85 (3%)

Query: 84 AAGQPSQPIGGLPATPPATQPPAQAQAPAASLPPSQPQPPAAPPSP-PPAEKRLD--ANN 140
A P+ P P QPP Q P PP PQ P+P PPA + L AN
Sbjct: 565 AKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAGRELSAAANA 624

Query: 141 LPQSWSVQLASLSNRARAEELQKTL 165
+ V LAS A + L K L
Sbjct: 625 AVNTGGVGLASTLWYAESNALSKRL 649


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3106DHBDHDRGNASE1154e-33 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 115 bits (288), Expect = 4e-33
Identities = 74/254 (29%), Positives = 111/254 (43%), Gaps = 18/254 (7%)

Query: 10 GKVALVTGAARGIGLGISAWLIAEGWQVVLADNDRERGARVAE---ALGEHAWFVAMDVA 66
GK+A +TGAA+GIG ++ L ++G + D + E+ +V A HA DV
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67

Query: 67 QEGQVAMSVAEVLGQFGRLDGLVCNAAIANPRNTPLEALSLGEWTRTLAVNLTGPMLLAK 126
+ A + + G +D LV A + P + +LS EW T +VN TG ++
Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRP--GLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 127 YCTPYLRA-HNGAIVNIASTRAHQSEPDSEAYAASKGGLLALTHALAASLGP-DIRVNAL 184
+ Y+ +G+IV + S A AYA+SK + T L L +IR N +
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 185 SPG----------WIDTREAAEREAAPLTELDHDQHLVGRVGTVEDVASLVAWLLSEDAG 234
SPG W D A + L L ++ D+A V +L+S AG
Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPL-KKLAKPSDIADAVLFLVSGQAG 244

Query: 235 FVTGQEFLVDGGMT 248
+T VDGG T
Sbjct: 245 HITMHNLCVDGGAT 258


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3105BCTERIALGSPD5950.0 Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D

signature.
Length = 660

Score = 595 bits (1536), Expect = 0.0
Identities = 217/631 (34%), Positives = 345/631 (54%), Gaps = 35/631 (5%)

Query: 41 AFVPAGNQQEAHWTINLKDADIREFIDQISEITGETFVVDPRVKGQVSVVSKAQLSLSEV 100
F PA ++ ++ + K DI+EFI+ +S+ +T ++DP V+G ++V S L+ +
Sbjct: 21 LFRPAAAEE---FSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDMLNEEQY 77

Query: 101 YQLFLSVMSTHGFTVVAQGDQA-RIVPNAEAKTEAG--GGQSAP---DRLETRVIQVQQS 154
YQ FLSV+ +GF V+ + ++V + +AKT A +AP D + TRV+ +
Sbjct: 78 YQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVVTRVVPLTNV 137

Query: 155 PVSELIPLIRPLVPQYGHLAAV--PSANALIISDRSANIARIEDVIRQLDQKGSHDYSVI 212
+L PL+R L G + V +N L+++ R+A I R+ ++ ++D G +
Sbjct: 138 AARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGDRSVVTV 197

Query: 213 NLRYGWVMDAAEV---LNNAMSRGQAKGAAGAQVIADARTNRLIILGPPQARAKLVQLAQ 269
L + D ++ LN S+ G+ A V+AD RTN +++ G P +R +++ + +
Sbjct: 198 PLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSRQRIIAMIK 257

Query: 270 SLDTPTARSANTRVIRLRHNDAKTLAETLGQISEGMKNNGGQGGEQTGGGRPSNILIRAD 329
LD A NT+VI L++ A L E L IS M++ + NI+I+A
Sbjct: 258 QLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPVAALDK--NIIIKAH 315

Query: 330 ESTNALVLLADPDTVNALEDIVRQLDVPRAQVLVEAAIVEISGDIQDAVGVQWAINKGGM 389
TNAL++ A PD +N LE ++ QLD+ R QVLVEA I E+ +G+QWA GM
Sbjct: 316 GQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWANKNAGM 375

Query: 390 GGTKTNFANTGLSIGTLLQSLESNKAPESIP----------DGAIVGIGSSSFGALVTAL 439
T F N+GL I T + ++ +G G ++ L+TAL
Sbjct: 376 ----TQFTNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQGNWAMLLTAL 431

Query: 440 SANTKSNLLSTPSLLTLDNQKAEILVGQNVPFQTGSYTTNSEGSSNPFTTVERKDIGVSL 499
S++TK+++L+TPS++TLDN +A VGQ VP TGS TT+ + N F TVERK +G+ L
Sbjct: 432 SSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGSQTTSGD---NIFNTVERKTVGIKL 488

Query: 500 KVTPHINDGAALRLEIEQEISALLPNAQQRNNT-DLITSKRSIKSTILAENGQVIVIGGL 558
KV P IN+G ++ LEIEQE+S++ A ++ + R++ + +L +G+ +V+GGL
Sbjct: 489 KVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGL 548

Query: 559 IQDDVSQAESKVPLLGDIPLLGRLFRSTKDTHTKRNLMVFLRPTVVRDSAGLAALSGKKY 618
+ VS KVPLLGDIP++G LFRST +KRNLM+F+RPTV+RD S +Y
Sbjct: 549 LDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYRQASSGQY 608

Query: 619 SDIR-VIDGTRGPEGRPSILPTNANQLFDGQ 648
+ RG E ++L + +++ Q
Sbjct: 609 TAFNDAQSKQRGKENNDAMLNQDLLEIYPRQ 639


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3104BCTERIALGSPC493e-09 Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C

signature.
Length = 272

Score = 49.2 bits (117), Expect = 3e-09
Identities = 37/148 (25%), Positives = 57/148 (38%), Gaps = 19/148 (12%)

Query: 32 APALLAVALIIAMSISLAWQAAG--WLRLQRSPVAVAASPVSHESIRSDPTRLAR--LFG 87
+P+++ L + + Q A W V++ ++ R P L LFG
Sbjct: 10 SPSVIRRILFYLLMLLFCQQLAMIFWRIGLPDNAPVSSVQITPAQARQQPVTLNDFTLFG 69

Query: 88 TSAQDPNAPP----------PATNLDLVLKGSFVQSDPKLSSAIIQRQGDKPHRYAVGGE 137
S + N P + L+L L G D S AII + ++ V E
Sbjct: 70 VS-PEKNKAGALDASQMSNLPPSTLNLSLTGVMAGDDDSRSIAIISKDNEQ-FSRGVNEE 127

Query: 138 ISDG--VKLHAVYRDRVELQRGGRLESL 163
+ G K+ ++ DRV LQ GR E L
Sbjct: 128 V-PGYNAKIVSIRPDRVVLQYQGRYEVL 154


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3102BCTERIALGSPF501e-180 Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F

signature.
Length = 408

Score = 501 bits (1292), Expect = e-180
Identities = 213/406 (52%), Positives = 278/406 (68%), Gaps = 2/406 (0%)

Query: 1 MAAFEYLALDPSGRQQKGVLEADSARQVRQLLRERQLAPLDVKPTRTREQSGQGGRLTFA 60
MA + Y ALD G++ +G EADSARQ RQLLRER L PL V R +Q L+
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 61 RG--LSARDLALVTRQLATLVQAALPIEEALRAAAAQSTSQRIQSMLLAVRAKVLEGHSL 118
R LS DLAL+TRQLATLV A++P+EEAL A A QS + ++ AVR+KV+EGHSL
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 119 AGSLREFPTAFPELYRATVAAGEHAGHLGPVLEQLADYTEQRQQSRQKIQLALLYPVILM 178
A +++ FP +F LY A VAAGE +GHL VL +LADYTEQRQQ R +IQ A++YP +L
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 179 VASLAIVGFLLGYVVPDVVRVFIDSGQTLPLLTRVLIGVSDWVKAWGALAFVAAIGGVIG 238
V ++A+V LL VVP VV FI Q LPL TRVL+G+SD V+ +G +A + G +
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240

Query: 239 FRYALRKDAFRERWHGFLLRVPLVGRLVRSTDTARFASTLAILTRSGVPLVEALAIAAEV 298
FR LR++ R +H LL +PL+GR+ R +TAR+A TL+IL S VPL++A+ I+ +V
Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 299 IANRIIRNEVVKAAQKVREGASLTRSLEATGQFPPMMLHMIASGERSGELDQMLARTARN 358
++N R+ + A VREG SL ++LE T FPPMM HMIASGERSGELD ML R A N
Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360

Query: 359 QENDLAAQIGLMVGLFEPFMLIFMGAVVLVIVLAILLPILSLNQLV 404
Q+ + ++Q+ L +GLFEP +++ M AVVL IVLAIL PIL LN L+
Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3101BCTERIALGSPG2123e-74 Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 212 bits (541), Expect = 3e-74
Identities = 75/138 (54%), Positives = 95/138 (68%), Gaps = 3/138 (2%)

Query: 10 RQQSGFTLIEIMVVVVILGILAALVVPQVMSRPDQAKVTVAKGDIKAIAAALDMYKLDNF 69
+Q GFTL+EIMVV+VI+G+LA+LVVP +M ++A A DI A+ ALDMYKLDN
Sbjct: 5 DKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDNH 64

Query: 70 AYPSTQQGLEALVKKPTGNPQPKNWNKDGYLKKLPVDPWGNPYQYLAPGTKGPFDLYSLG 129
YP+T QGLE+LV+ PT P N+NK+GY+K+LP DPWGN Y + PG G +DL S G
Sbjct: 65 HYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLLSAG 124

Query: 130 ADGKEGGSDNDADIGNWD 147
DG+ G D DI NW
Sbjct: 125 PDGEMGTED---DITNWG 139


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3100BCTERIALGSPH1433e-46 Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H

signature.
Length = 170

Score = 143 bits (361), Expect = 3e-46
Identities = 50/183 (27%), Positives = 85/183 (46%), Gaps = 32/183 (17%)

Query: 5 RGFTLIELMVVMVIISVLIGLAVLSTGFASTSRELDSEAERLAGL---IGVLTDEAVLDN 61
RGFTL+E+M++++++ V G+ +L+ SR+ DS A+ LA + + +
Sbjct: 4 RGFTLLEMMLILLLMGVSAGMVLLAFP---ASRD-DSAAQTLARFEAQLRFVQQRGLQTG 59

Query: 62 REYGLRLERDAYQVLRY------DEAKA-------RWLPVARDSHRLPEWAELTFELDGQ 108
+ +G+ + D +Q L D A A RWLP+ + + G
Sbjct: 60 QFFGVSVHPDRWQFLVLEARDGADPAPADDGWSGYRWLPLRAGR------VATSGSIAGG 113

Query: 109 PLVLAGSKGEKEQKKGTDQPQLLILSSGELSPFRLRLAERGPEGRALSLSSDGFRLPRVE 168
L LA ++GE D P +LI GE++PFRL L E ++ ++ G LP +
Sbjct: 114 KLNLAFAQGEAWTPG--DNPDVLIFPGGEMTPFRLTLG----EAPGIAFNARGESLPEPQ 167

Query: 169 VAR 171
A+
Sbjct: 168 EAQ 170


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3099BCTERIALGSPG352e-05 Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 35.2 bits (81), Expect = 2e-05
Identities = 13/68 (19%), Positives = 30/68 (44%), Gaps = 4/68 (5%)

Query: 1 MKRARGFTLLEVLVALAIF----AMVAASVLSASARSLQNASRLEDKTLAMWIADNRLNE 56
+ RGFTLLE++V + I ++V +++ ++ + + + L + +L+
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDN 63

Query: 57 LQLEQTPP 64
T
Sbjct: 64 HHYPTTNQ 71


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3098PilS_PF08805367e-05 PilS N terminal
		>PilS_PF08805#PilS N terminal

Length = 185

Score = 35.7 bits (82), Expect = 7e-05
Identities = 11/45 (24%), Positives = 24/45 (53%)

Query: 1 MRLQRGFTLLELLIAIAIFALLALATYRMFDSVMQTDQATRVQEQ 45
+G TL+E+L+ + + +LA + Y+++ V Q++ Q
Sbjct: 22 KEQDKGATLMEVLLVVGVIVVLAASAYKLYSMVQSNIQSSNEQNN 66


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3095PYOCINKILLER280.027 Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 27.8 bits (61), Expect = 0.027
Identities = 23/113 (20%), Positives = 33/113 (29%), Gaps = 5/113 (4%)

Query: 55 AERHLQSARQYFTEQRALHAYIQQQAPNVRQADAAAPQAQIDPAALQGMVTASAAQAGLS 114
A+R + + RA + Y +V A Q+ A S A A L
Sbjct: 230 AKRKAEEQARQQAAIRAANTYAMPANGSVVATAAGRGLIQVAQGAASLAQAISDAIAVLG 289

Query: 115 VERLDNEGEGAVQVALQPAPFAKLLPWLEQLNGQ-----GVQVAEAGLDRQVD 162
AV A W +Q G+ A+ GL V+
Sbjct: 290 RVLASAPSVMAVGFASLTYSSRTAEQWQDQTPDSVRYALGMDAAKLGLPPSVN 342


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3091RTXTOXIND514e-09 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 51.4 bits (123), Expect = 4e-09
Identities = 34/206 (16%), Positives = 67/206 (32%), Gaps = 18/206 (8%)

Query: 165 AAVEPQRLQMAAEEQWYAAGPAAPKAPPAEPPRKQEDEQTARLAQLVKQQRQQLAALARQ 224
A +E R Q+ + P K P + +E+ RL L+K+Q Q
Sbjct: 146 ARLEQTRYQILSRSIELNKLPEL-KLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQ 204

Query: 225 QEQRLAGLARQHEEELARREQDARGQLDILRSEVLSLQQALERQARENAELQQRLLEQGE 284
+E L + + R + +S + + A + +LEQ
Sbjct: 205 KELNLDKKRAE-RLTVLARINRYENLSRVEKSRL----DDFSSLLHKQAIAKHAVLEQ-- 257

Query: 285 QFQRNREELTRQLRFIENQGRNETDLLRSEFADELEARVAAAVAGYKEQVSIRDVELAYR 344
+ E +LR +++ + + SE + +K ++ +L
Sbjct: 258 --ENKYVEAVNELR----VYKSQLEQIESEIL-SAKEEYQLVTQLFKNEILD---KLRQT 307

Query: 345 NELDQQLEQELAELRAERDRLAAQGP 370
+ L ELA+ + + P
Sbjct: 308 TDNIGLLTLELAKNEERQQASVIRAP 333


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3089FLGFLIJ290.020 Flagellar FliJ protein signature.
		>FLGFLIJ#Flagellar FliJ protein signature.

Length = 147

Score = 28.6 bits (63), Expect = 0.020
Identities = 16/37 (43%), Positives = 23/37 (62%), Gaps = 2/37 (5%)

Query: 36 QRHPLAASRWRQEPERLAAWLREQERQPQHLAAWLAQ 72
Q+ +A + WR++ +RL AW QERQ AA LA+
Sbjct: 92 QKVDIALNSWREKKQRLQAWQTLQERQST--AALLAE 126


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3088PF06057290.028 Type IV secretory pathway VirJ component
		>PF06057#Type IV secretory pathway VirJ component

Length = 243

Score = 28.7 bits (64), Expect = 0.028
Identities = 13/58 (22%), Positives = 23/58 (39%), Gaps = 7/58 (12%)

Query: 65 LVVVVGGDGSML----GAARALARHKVPVLGINRGSLG-FLTDIRPDELEAKVGEVLD 117
LV+ + GDG L + PV+G + SL + P ++ ++D
Sbjct: 53 LVIFLSGDGGWATLDKAVGGILQQQGWPVVGWS--SLKYYWKQKDPKDVTQDTLAIID 108


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3087ANTHRAXTOXNA310.008 Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 30.9 bits (69), Expect = 0.008
Identities = 13/36 (36%), Positives = 19/36 (52%)

Query: 231 GDIVFQPDALPEAIAREPLSEEQKSSLLTYGADEPL 266
G+I F L E + LSEE+K+S+ + G P
Sbjct: 102 GEIYFTDIDLVEHKELQDLSEEEKNSMNSRGEKVPF 137


101PA3079PA3074N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA30791111.097761hypothetical protein
PA30782132.954193two-component sensor
PA30773133.713310two-component response regulator
PA30763113.767853hypothetical protein
PA30752113.287898hypothetical protein
PA30742112.810831hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3079ACRIFLAVINRP711e-14 Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 71.4 bits (175), Expect = 1e-14
Identities = 36/175 (20%), Positives = 77/175 (44%), Gaps = 11/175 (6%)

Query: 613 IEAATNEVIKQSELII-LVLVYICVAAMCMITFRSFAATLCIVLPLILTSVLGNALMAAL 671
++ + +EV+K L ++LV++ + + ++ ATL + + + + A++AA
Sbjct: 333 VQLSIHEVVK--TLFEAIMLVFLVMY----LFLQNMRATLIPTIAVPVVLLGTFAILAAF 386

Query: 672 GIGVKVATLPVIALGVGIGVDYGIYIYTRLESFLRM-GLPLQEAYYETLRSTGKAVLFTG 730
G + T+ + L +G+ VD I + +E + LP +EA +++ A++
Sbjct: 387 GYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIA 446

Query: 731 LCLAIGVATWIF---SAIKFQADMGLMLTFMLLWNMFGALWLLPALARFLINPAK 782
+ L+ F S + + + ++ AL L PAL L+ P
Sbjct: 447 MVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVS 501



Score = 42.9 bits (101), Expect = 6e-06
Identities = 35/221 (15%), Positives = 80/221 (36%), Gaps = 15/221 (6%)

Query: 251 LITLVLLYWFTKCIRSTIAVLITTLVAVLWQLGLLNLVGFGLDPYSMLVPFLIFAIGISH 310
++ +++Y F + +R+T+ I V +L +L G+ ++ +M L + +
Sbjct: 349 MLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDD 408

Query: 311 GVQKINGIA-LQSSGADNALMAARLTFRQLFLPGMIAILADAVGFITLLVID--IGVI-R 366
+ + + + A + Q+ + + + FI + G I R
Sbjct: 409 AIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYR 468

Query: 367 ELAIGASIGVAVIVFTNLILLPVAISYI--GISKKAVQRSKDDAVREHPFWRLLSNFSSP 424
+ +I +A+ V LIL P + + +S + + + + N +
Sbjct: 469 QFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTN 528

Query: 425 KVAPV------SIAIALLMLGGGLWYGKHLKIG---DLDQG 456
V + + I L++ G + L + DQG
Sbjct: 529 SVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQG 569



Score = 33.3 bits (76), Expect = 0.005
Identities = 23/113 (20%), Positives = 46/113 (40%), Gaps = 5/113 (4%)

Query: 626 LIILVLVYICVAAMCMITFRSFAATLCIVLPLILTSVLGNALMAALGIGVKVATLPVIAL 685
I V+V++C+AA+ + S++ + ++L + L V V + +
Sbjct: 877 AISFVVVFLCLAAL----YESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLT 932

Query: 686 GVGIGVDYGIYIYTRLESFLRM-GLPLQEAYYETLRSTGKAVLFTGLCLAIGV 737
+G+ I I + + G + EA +R + +L T L +GV
Sbjct: 933 TIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGV 985


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3078PF07675320.005 Cleaved Adhesin
		>PF07675#Cleaved Adhesin

Length = 1358

Score = 32.4 bits (73), Expect = 0.005
Identities = 23/88 (26%), Positives = 36/88 (40%), Gaps = 6/88 (6%)

Query: 64 PAPDSYYFKGSVGTAGLPPKLREMLDTPPYKSIGAMQLLGNWDDDDEEEDDDAPSDDAYV 123
PA + G G P + + K M+ G D D E +DD+P+ Y
Sbjct: 480 PASGKMWIAGDGGNQ--PARYDDFAFEAGKKYTFTMRRAGMGDGTDMEVEDDSPASYTYT 537

Query: 124 VVR--QPLADGKTLYLYDND--AAGSID 147
V R + +G T ++ D AAG+ +
Sbjct: 538 VYRDGTKIKEGLTATTFEEDGVAAGNHE 565


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3077HTHFIS845e-21 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 84.1 bits (208), Expect = 5e-21
Identities = 30/129 (23%), Positives = 59/129 (45%)

Query: 3 IHVLVVEDNFDLAGTVIDYLEAAGVVCDHARDGQAGLNLARANRYDVILLDIMLPRINGR 62
+LV +D+ + + L AG + A D+++ D+++P N
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 63 QVCRQLREAGLQTPVLMLTALDTLQDKLDGFDAGADDYLLKPFELPELLVRLQALSRRRS 122
+ ++++A PVL+++A +T + + GA DYL KPF+L EL+ +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 123 GQAQRLQVD 131
+ +L+ D
Sbjct: 124 RRPSKLEDD 132


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA3074TYPE4SSCAGX372e-04 Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein

signature.
Length = 522

Score = 37.1 bits (85), Expect = 2e-04
Identities = 39/158 (24%), Positives = 73/158 (46%), Gaps = 13/158 (8%)

Query: 340 LMLSLPQPAMAFQFEDLWLRPDQQGQRLLQRGQADEAAKRFEDFRWKGLSLYQARDYAAA 399
L++ P P + + L +++ + Q+ Q D+ KR E+ R K + +
Sbjct: 131 LIVDAPDPK-ELEEQKKALEKEKEAKEQAQKAQKDKREKRKEE-RAKNRA-----NLENL 183

Query: 400 AQAFAQGDQADDHYNRGNALARQGELEAAVDAYEQALERQPQLVAAQRNK-ALVEELLRQ 458
A + ++ N + +Q E E +D E+ + Q Q AQ N +EEL ++
Sbjct: 184 TNAMSNPQNLSNNKNLSELIKQQRENE--LDQMERLEDMQEQ---AQANALKQIEELNKK 238

Query: 459 RQEQAAQQQAGENKEQRQEASQQSPPSGSSQRPPRDAA 496
+ E+A +Q+A + + + SQ+SP S + P D+A
Sbjct: 239 QAEEAVRQRAKDKISIKTDKSQKSPEDNSIELSPSDSA 276


102PA2918PA2913N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA29180112.643386short-chain dehydrogenase
PA29172113.667815transcriptional regulator
PA29161132.888109hypothetical protein
PA29150132.524181hypothetical protein
PA29140133.248567ABC transporter permease
PA2913-2133.144189hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2918DHBDHDRGNASE694e-16 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 69.3 bits (169), Expect = 4e-16
Identities = 58/191 (30%), Positives = 91/191 (47%), Gaps = 6/191 (3%)

Query: 6 IKGKTVLITGGAKNLGGLIARDLAAHGAKAITIHYNSAASKADADATVAALQAAGAKAVA 65
I+GK ITG A+ +G +AR LA+ GA + YN + V++L+A A A
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKL----EKVVSSLKAEARHAEA 61

Query: 66 LQGDLTSAAAMEKLFADAIAAVGKPDIAINTVGKVLKKPITEISETEYDEMSAVNSKSAF 125
D+ +AA++++ A +G DI +N G + I +S+ E++ +VNS F
Sbjct: 62 FPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVF 121

Query: 126 FFLREAGKHVND--NGKICTLVTSLLGAYTPYYAAYAGTKAPVEHFTRAASKEFGARGIS 183
R K++ D +G I T+ ++ G AAYA +KA FT+ E I
Sbjct: 122 NASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIR 181

Query: 184 VTAVGPGPMDT 194
V PG +T
Sbjct: 182 CNIVSPGSTET 192


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2917PF05272280.034 Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.5 bits (63), Expect = 0.034
Identities = 6/16 (37%), Positives = 8/16 (50%)

Query: 258 FRRAYGMTPAAYRRQC 273
+R AYG + RQ
Sbjct: 672 YRGAYGRYVQDHPRQV 687


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2915PF05932270.049 Tir chaperone protein (CesT)
		>PF05932#Tir chaperone protein (CesT)

Length = 127

Score = 27.1 bits (60), Expect = 0.049
Identities = 9/53 (16%), Positives = 18/53 (33%), Gaps = 8/53 (15%)

Query: 74 ADHLSAAIFLQRELGGCLAIGARITQVQAKFSGLFNLGEAFPVDGRQFEHLFE 126
L+ A+ G L + + SGL++ ++ P + L
Sbjct: 66 QCLLAGALNPLLNAGPGLGLDEK--------SGLYHAYQSIPREKLSVPTLKR 110


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2913FERRIBNDNGPP383e-05 Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 38.0 bits (88), Expect = 3e-05
Identities = 49/266 (18%), Positives = 96/266 (36%), Gaps = 43/266 (16%)

Query: 43 PSRAVSHDINLTEMMVALGLQTRMVGYTGISGW--WKNADPGLIAALKPLPELV-----A 95
P+R V+ + E+++ALG+ G + W + +P PLP+ V
Sbjct: 35 PNRIVALEWLPVELLLALGIVP--YGVADTINYRLWVS-EP-------PLPDSVIDVGLR 84

Query: 96 RYPTAETLLDVDADFFFAGWGYGMRVGGDLTPASLEPLG-VKVYELSESCAQIGEPRRAS 154
P E L ++ F GYG +P L + + + S+ + R++
Sbjct: 85 TEPNLELLTEMKPSFMVWSAGYGP------SPEMLARIAPGRGFNFSDGKQPLAMARKS- 137

Query: 155 LDELYRDLRNLGRIFDVEPRAERLVASLQARIERARAGIPANAEAPRVF--LYDSGEDRP 212
L + + +++ AE +A + I + P + L D
Sbjct: 138 -------LTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLV 190

Query: 213 FTSGRLGMPQALIEAAGGRSVTDDVAASW--TQVNWESVVA-RDPQVIVIVDYGETSAAQ 269
F + Q +++ G + W T V+ + + A +D V+ D+ +
Sbjct: 191 FGPN--SLFQEILDEYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCF-DHDNSKDMD 247

Query: 270 KQRFLEENPALRSLTAIRERRFIVLP 295
L P +++ +R RF +P
Sbjct: 248 A---LMATPLWQAMPFVRAGRFQRVP 270


103PA2894PA2881N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA2894-1141.705669hypothetical protein
PA2893-2151.703180long-chain-acyl-CoA synthetase
PA2892-1162.113229short-chain dehydrogenase
PA2891-1142.180578geranyl-CoA carboxylase subunit alpha
PA2890-1141.466632isohexenylglutaconyl-CoA hydratase
PA2889-1130.786285citronellyl-CoA dehydrogenase
PA2888-1111.295181geranyl-CoA carboxylase subunit beta
PA28871101.200188citronellol catabolism dehydrogenase
PA28862111.469588hypothetical protein
PA28853120.443743atu genes repressor
PA28841111.136457hypothetical protein
PA28831111.277783hypothetical protein
PA28820101.542783two-component sensor
PA28810101.794705two-component response regulator
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2894IGASERPTASE333e-04 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 33.1 bits (75), Expect = 3e-04
Identities = 19/100 (19%), Positives = 29/100 (29%), Gaps = 9/100 (9%)

Query: 4 APKTASKKVAPAAEQVAEPKPPAKPKPAAAPPKPASRPVAKDKPAPAKRASTARLDPEVR 63
PK S+ V+P EQ +P A+P P P ++ V
Sbjct: 1122 VPKVTSQ-VSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVE 1180

Query: 64 KPLPSAKLDLRLPK-------ELVQKMAPPGTEETH-KPK 95
+P+ + P E+ KPK
Sbjct: 1181 QPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPK 1220


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2892DHBDHDRGNASE813e-20 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 81.3 bits (200), Expect = 3e-20
Identities = 54/191 (28%), Positives = 85/191 (44%), Gaps = 9/191 (4%)

Query: 3 LHGKTLFITGASRGIGREIALRAARDGANLVIAAKSAEPHPKLEGTIFSVAAEVEAAGGQ 62
+ GK FITGA++GIG +A A GA++ + E K+ ++ + A EA
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEA---- 61

Query: 63 ALPLQLDVRDEQAVAAAMARAAERFGGIDALVNNAGAIRLVGVEKLEPKRFDLMYQINTR 122
DVRD A+ AR G ID LVN AG +R + L + ++ + +N+
Sbjct: 62 ---FPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNST 118

Query: 123 AVLVCSQAALPYLRRSANGHILSLSPPINLAGRWFAQHGPYTVTKYGMSMLTLGMHEEFG 182
V S++ Y+ +G I+++ N AG Y +K M T + E
Sbjct: 119 GVFNASRSVSKYMMDRRSGSIVTVGS--NPAGVPRTSMAAYASSKAAAVMFTKCLGLELA 176

Query: 183 KYAISVNALWP 193
+Y I N + P
Sbjct: 177 EYNIRCNIVSP 187


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2891RTXTOXIND382e-04 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 37.5 bits (87), Expect = 2e-04
Identities = 25/85 (29%), Positives = 42/85 (49%), Gaps = 6/85 (7%)

Query: 579 AGASAQVGASSGTLK-APMDGAIV-EVLVGEGERVGKGQLLLVLEAMKMEHPLKAGVDGV 636
A A+ ++ S + + P++ +IV E++V EGE V KG +LL L A+ E A
Sbjct: 84 ATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAE----ADTLKT 139

Query: 637 VRRVQVGRGEQVRNRQVLVEVEADA 661
+ R EQ R + + +E +
Sbjct: 140 QSSLLQARLEQTRYQILSRSIELNK 164


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2890adhesinmafb290.016 Neisseria meningitidis: adhesin MafB signature.
		>adhesinmafb#Neisseria meningitidis: adhesin MafB signature.

Length = 467

Score = 29.3 bits (65), Expect = 0.016
Identities = 28/132 (21%), Positives = 47/132 (35%), Gaps = 3/132 (2%)

Query: 11 LEPIEGVLRITLNRPQSRNAMSLAMVGELRAVLAAVRDDRSVRALVLRGADGHFCAGGDI 70
+E I GV LN S A +G++ D ++R + A+G F G +
Sbjct: 225 MEFINGVAAGALNPFIS--AGEALGIGDILYGTRYAIDKAAMRNIAPLPAEGKFAVIGGL 282

Query: 71 KDMAGARAAGAEAYRTLNRAFGSLLEEAQAAPQLLVAL-VEGAVLGGGFGLACVSDVAIA 129
+AG EA + + E +A + A V G A VS
Sbjct: 283 GSVAGFEKNTREAVDRWIQENPNAAETVEAVFNVAAAAKVAKLAKAAKPGKAAVSGDFAD 342

Query: 130 AADAQFGLPETS 141
+ + L +++
Sbjct: 343 SYKKKLALSDSA 354


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2887DHBDHDRGNASE1193e-34 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 119 bits (299), Expect = 3e-34
Identities = 74/255 (29%), Positives = 121/255 (47%), Gaps = 10/255 (3%)

Query: 13 DGQTIIVTGGGSGIGRCTAHELAALGAHVVLVGRKAEKLEKTAGEIVEDGGSVSWHACDI 72
+G+ +TG GIG A LA+ GAH+ V EKLEK + + D+
Sbjct: 7 EGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADV 66

Query: 73 REEEAVKTLVANILAERGTIHHLVNNAGGQYPSPLASISQKGFETVLRTNLVGGFLVARE 132
R+ A+ + A I E G I LVN AG P + S+S + +E N G F +R
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 133 VFNQSMSKTGGSIVNMLADMWGGMP--GMGHSGAARSGMENFTRTAAVEWGHAGVRVNAV 190
V M + GSIV + ++ G+P M ++++ FT+ +E +R N V
Sbjct: 127 VSKYMMDRRSGSIVTVGSNP-AGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 191 APG-------WIASSGMDTYEGAFKAVIPTLREHVPLKRIGSESEVAAAIVFLLSPGAAF 243
+PG W + + E K + T + +PLK++ S++A A++FL+S A
Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245

Query: 244 VSGNTIRIDGAASQG 258
++ + + +DG A+ G
Sbjct: 246 ITMHNLCVDGGATLG 260


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2885HTHTETR703e-17 TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 70.0 bits (171), Expect = 3e-17
Identities = 32/190 (16%), Positives = 65/190 (34%), Gaps = 8/190 (4%)

Query: 14 ESARGKLLQTAAHLFRSKGYERTTVRDLASAVGIQSGSIFHHFKSKDEILRSVMEETILY 73
+ R +L A LF +G T++ ++A A G+ G+I+ HFK K ++ + E +
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 74 NTALMRAALAD-AEDLRERVLGLIRCELQSIMGGTGEAMAVLVYEWRSLSAEGQAYILGL 132
L A D + ++ L+S + + + + + A +
Sbjct: 70 IGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQA 129

Query: 133 RDIYEQMWLD----VLGEARLAGYCQG--DPFILRRFLTGALSWT-TTWFRPEGPMSLDQ 185
+ D L A + G +S W L +
Sbjct: 130 QRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKK 189

Query: 186 LAEEALALVI 195
A + +A+++
Sbjct: 190 EARDYVAILL 199


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2882PF06580452e-07 Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 45.2 bits (107), Expect = 2e-07
Identities = 35/172 (20%), Positives = 72/172 (41%), Gaps = 24/172 (13%)

Query: 198 QIGELVSGLKDFAR--LDRAFSEEVDLND---CVRNAVLIARTAIKDKAEISSQLGELPL 252
+ E+++ L + R L + + +V L D V + + +A +D+ + +Q+ +
Sbjct: 192 KAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIM 251

Query: 253 IACAPSQINQVLL-NLLTNAAQAMERFGRILLKSWADERQVFLSVQDNGKGMPAEVLGRI 311
P + Q L+ N + + + + G+ILLK D V L V++ G
Sbjct: 252 DVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLA-------- 303

Query: 312 FDPFFTTKPVGQGTGLGLSISYKIIQQHGG---TIRVASEPGRGTRFLISLP 360
K + TG GL + +Q G I+++ + G+ ++ +P
Sbjct: 304 ------LKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVN-AMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2881HTHFIS985e-25 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 97.6 bits (243), Expect = 5e-25
Identities = 29/136 (21%), Positives = 57/136 (41%), Gaps = 2/136 (1%)

Query: 7 RILFVDDEERILRSLAMQF-RRHYEVLTESDPRRALERLKTERIQVLVSDQRMPQMSGAE 65
IL DD+ I L R Y+V S+ + ++V+D MP + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 66 LLAQARERYPETLRILLTGYSDLDAAVDALNDGGIFRYLTKPWNPQEMAFTLRQAAEIAS 125
LL + ++ P+ ++++ + A+ A + G + YL KP++ E+ + +A
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKAS-EKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 126 RQGLPAPPAATLAAPL 141
R+ + PL
Sbjct: 124 RRPSKLEDDSQDGMPL 139



Score = 54.8 bits (132), Expect = 1e-10
Identities = 27/139 (19%), Positives = 55/139 (39%), Gaps = 5/139 (3%)

Query: 142 SVLLLDDDPETLDCVGAFCHAGGHRLLRARNLAEALVWLNTEPVEVLVSDLKLAGEHTAP 201
++L+ DDD + G+ + N A W+ +++V+D+ + E+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 202 LLKSLAQAHPRLLSLVVTPFRDTQALLELINQAQIFRYLPKPIRRGLFEKGLKAAAEQAL 261
LL + +A P L LV++ ++ + + YLPKP L +AL
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKAS-EKGAYDYLPKP----FDLTELIGIIGRAL 119

Query: 262 LWRGRSLPEVDRLAEVPRD 280
R +++ ++
Sbjct: 120 AEPKRRPSKLEDDSQDGMP 138


104PA2678PA2665N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA2678-132-3.333553ABC transporter permease
PA2677133-2.351316type II secretion protein
PA2676137-1.868813type II secretion system protein
PA2675142-2.066609type II secretion system protein
PA2674243-2.474295type II secretion system protein
PA2673239-3.049280type II secretion system protein
PA2672236-3.928556type II secretion system protein
PA2671125-1.790608hypothetical protein
PA2670017-1.485216hypothetical protein
PA2669-113-1.304852hypothetical protein
PA26671110.205018hypothetical protein
PA2666091.0554006-carboxytetrahydropterin synthase QueD
PA2665-172.284721anaerobic nitric oxide reductase transcriptional
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2678ABC2TRNSPORT345e-04 ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein

signature.
Length = 262

Score = 33.8 bits (77), Expect = 5e-04
Identities = 22/121 (18%), Positives = 42/121 (34%)

Query: 116 LTPLLAAFFNAMLGYLVLCIFLLFSGVEPGWQLVLLPLALLPFLLCVTGLAWFLAGLGVY 175
L + A A L + + G L+ + L L + L
Sbjct: 115 LGEMAWAATKAALAGAGIGVVAAALGYTQWLSLLYALPVIALTGLAFASLGMVVTALAPS 174

Query: 176 VRDIGQFVQFLLVLLLFISPVFYPLSSLPPVMQPYLYLNPLTIPVEMVRAILFDAPYPTL 235
+ ++ +LF+S +P+ LP V Q PL+ ++++R I+ P +
Sbjct: 175 YDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQTAARFLPLSHSIDLIRPIMLGHPVVDV 234

Query: 236 G 236

Sbjct: 235 C 235


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2676BCTERIALGSPF1893e-58 Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F

signature.
Length = 408

Score = 189 bits (481), Expect = 3e-58
Identities = 105/404 (25%), Positives = 188/404 (46%), Gaps = 11/404 (2%)

Query: 2 NFIYQAVDRKGRRVRGELCLPTRQDALRQLQRQGLTPLSLEVKR----------RNLGSR 51
+ YQA+D +G++ RG + + A + L+ +GL PLS++ R +L +
Sbjct: 3 QYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLRRK 62

Query: 52 RRLKAEELNMAIHELATMLAAGVSMADAVEAQERGARHPKLISALQAMANGLRQGQSFPV 111
RL +L + +LAT++AA + + +A++A + + P L + A+ + + +G S
Sbjct: 63 IRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLAD 122

Query: 112 VLESAGLDLPRYVYQLVAAGEMTGNLAGALRDCATQMEYERRTRAELQGALIYPAILVLS 171
++ R +VAAGE +G+L L A E ++ R+ +Q A+IYP +L +
Sbjct: 123 AMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLTVV 182

Query: 172 GVLAVATLFVFVVPKFANLLNET-AQLPWLAWAVLSIGVWSNESSGLLAFAVLLLAGGIA 230
+ V+ L VVPK LP ++ + + A+L
Sbjct: 183 AIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMAFR 242

Query: 231 VALRNPALRAHALDQLVRLPVVGEWLMQAEIAQWSKVLGTLLGNRVPLVEALALSADGVR 290
V LR R +L+ LP++G A++++ L L + VPL++A+ +S D +
Sbjct: 243 VMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDVMS 302

Query: 291 IARQRRTLERVTQDVRAGIALSAALEERQAVTSIGSSLVRVGEASGQLAEMLQSLATLYG 350
R L T VR G++L ALE+ + ++ GE SG+L ML+ A
Sbjct: 303 NDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADNQD 362

Query: 351 EVGQARMKKALVLIEPLAILLIGSVFGLIITGVVLAITSANDMV 394
++M AL L EPL ++ + +V I+ ++ I N ++
Sbjct: 363 REFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2675BCTERIALGSPG1183e-37 Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 118 bits (297), Expect = 3e-37
Identities = 47/128 (36%), Positives = 67/128 (52%), Gaps = 1/128 (0%)

Query: 6 QQGFTLLEMIVVLVIIGMLMGLVGPRLFNQADKAKAQTADTQVKMLKGALLTMRLDIGRL 65
Q+GFTLLE++VV+VIIG+L LV P L +KA Q A + + L+ AL +LD
Sbjct: 7 QRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDNHHY 66

Query: 66 PTEEEGLALLNTPPSDERLGAFWHGPYLEGGVPLDPWNRPYLYSDRPSAEQPFTLYSQGA 125
PT +GL L P+ L A ++ +P DPW Y+ + P + L S G
Sbjct: 67 PTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVN-PGEHGAYDLLSAGP 125

Query: 126 DGQPGGKG 133
DG+ G +
Sbjct: 126 DGEMGTED 133


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2674BCTERIALGSPG391e-06 Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 38.7 bits (90), Expect = 1e-06
Identities = 17/43 (39%), Positives = 31/43 (72%)

Query: 12 QAAFTLLELLVVLVIVGAIAAVALPGLVRMQETWARRTALDDL 54
Q FTLLE++VV+VI+G +A++ +P L+ +E ++ A+ D+
Sbjct: 7 QRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDI 49


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2673PilS_PF08805300.003 PilS N terminal
		>PilS_PF08805#PilS N terminal

Length = 185

Score = 29.5 bits (66), Expect = 0.003
Identities = 5/31 (16%), Positives = 16/31 (51%)

Query: 7 GFTLLEAVVALTLLAVVGGALFAWLNSAFRS 37
G TL+E ++ + ++ V+ + + + +
Sbjct: 27 GATLMEVLLVVGVIVVLAASAYKLYSMVQSN 57


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2672BCTERIALGSPG300.003 Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 29.9 bits (67), Expect = 0.003
Identities = 15/56 (26%), Positives = 31/56 (55%), Gaps = 5/56 (8%)

Query: 2 IRSRRKQGAFTLLEMIVVLLVVSFIGTLLM-QGLSYASKANQSLHQNLGRGQVRAL 56
+R+ KQ FTLLE++VV++++ + +L++ + KA+ + + AL
Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKAD----KQKAVSDIVAL 52


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2667PYOCINKILLER290.005 Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 28.6 bits (63), Expect = 0.005
Identities = 24/84 (28%), Positives = 33/84 (39%), Gaps = 10/84 (11%)

Query: 21 LEKLKSDSSLKQELEFKDKLQALMDKYGMTLHNIIAILDPKAPVTVSAAPQRRA------ 74
+E L + ++K E LQ M+ +I A KA +A +R+A
Sbjct: 181 MEGLTAAYNVKLFTEAISSLQIRMNTLTAAKASIEAAAANKAREQAAAEAKRKAEEQARQ 240

Query: 75 ----RALKVYKNPNNGEVVETKGG 94
RA Y P NG VV T G
Sbjct: 241 QAAIRAANTYAMPANGSVVATAAG 264


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2665HTHFIS386e-132 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 386 bits (994), Expect = e-132
Identities = 135/369 (36%), Positives = 198/369 (53%), Gaps = 16/369 (4%)

Query: 164 ERMRDLTRRAEDEHQRAEAYLEAAGERPREMIGQSAAHKALLEEIRLVANSDLSVLITGE 223
+ + RA E +R + LE + ++G+SAA + + + + +DL+++ITGE
Sbjct: 109 TELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGE 168

Query: 224 TGVGKELVAQSIHRHSMRSGKPMISLNCAALPETLVESELFGHVRGAFSGAVNERRGKFE 283
+G GKELVA+++H + R P +++N AA+P L+ESELFGH +GAF+GA G+FE
Sbjct: 169 SGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFE 228

Query: 284 LADGGSLFLDEVGELPLAVQAKLLRVLQSGQLQRVGSDREHRVDVRLIAATNRDLAEEVR 343
A+GG+LFLDE+G++P+ Q +LLRVLQ G+ VG R DVR++AATN+DL + +
Sbjct: 229 QAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSIN 288

Query: 344 AGRFRADLYHRLSVYPLRVPPLRERGRDILLLAGYFLEENRPRLGLRSLRLDHEAQAALL 403
G FR DLY+RL+V PLR+PPLR+R DI L +F+++ + GL R D EA +
Sbjct: 289 QGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAE-KEGLDVKRFDQEALELMK 347

Query: 404 DYRWPGNVRELEHLVGRASLKALGHNPERSRIVTLHRQDLDLPAETPAPPRSVVEPAPAA 463
+ WPGNVRELE+LV R + R I R ++ A RS A
Sbjct: 348 AHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQA 407

Query: 464 PTTGHS---------------LRAATDAYQRRLIQDCLARHQGSWASAARELGLDRANLA 508
+ LI L +G+ AA LGL+R L
Sbjct: 408 VEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLR 467

Query: 509 RLARRLQLR 517
+ R L +
Sbjct: 468 KKIRELGVS 476


105PA2659PA2654N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA26590101.004709hypothetical protein
PA2658080.673980hypothetical protein
PA2657090.661346two-component response regulator
PA2656180.134170two-component sensor
PA2655090.138129hypothetical protein
PA2654111-0.170868chemotaxis transducer
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2659THERMOLYSIN270.010 Thermolysin metalloprotease (M4) family signature.
		>THERMOLYSIN#Thermolysin metalloprotease (M4) family signature.

Length = 544

Score = 27.3 bits (60), Expect = 0.010
Identities = 21/83 (25%), Positives = 32/83 (38%), Gaps = 6/83 (7%)

Query: 21 QARDLGPDEALKLRDAGTIKSFEELNKNAIAKHPGSSVHDTELE----EEYGRYIYQVEL 76
R L + A+ ++ A I + ++ + T L EE R Y+V +
Sbjct: 128 DKRTLKTEAAISIQQAEMIAKQDVADRVTKERPAAEEGKPTRLVIYPDEETPRLAYEVNV 187

Query: 77 R--DPQGVKWDLELDAATGAVLK 97
R P W +DAA G VL
Sbjct: 188 RFLTPVPGNWIYMIDAADGKVLN 210


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2657HTHFIS789e-19 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 77.6 bits (191), Expect = 9e-19
Identities = 31/117 (26%), Positives = 54/117 (46%)

Query: 2 RLLLVEDHVPLADELMASLTRQGYAVDWLADGRDAAVQGASEPYDLIILDLGLPGRPGLE 61
+L+ +D + L +L+R GY V ++ A+ DL++ D+ +P +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 ILQEWRGLGLATPVLILTARGSWAERIDGLKAGADDYLTKPFHPEELALRIQALLRR 118
+L + PVL+++A+ ++ I + GA DYL KPF EL I L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2656PF06580385e-05 Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 38.3 bits (89), Expect = 5e-05
Identities = 44/263 (16%), Positives = 87/263 (33%), Gaps = 71/263 (26%)

Query: 187 RLQIAQLQQGQRSQLDNQAPEELEPLVEQIN-HLLAHTEETLKRSRNALGNLGHALKTPL 245
+ A++ Q + + + +A +L L QIN H + + + L + A +
Sbjct: 143 NYKQAEIDQWKMASMAQEA--QLMALKAQINPHFMFNALNNI--RALILEDPTKARE--- 195

Query: 246 AVLVSLAE--REEMARQPELQQVLREQLEQIQQRLGRELGKARLVGEALPGAHFDCAEEL 303
+L SL+E R + Q L ++L + L +L +
Sbjct: 196 -MLTSLSELMRYSLRYSNARQVSLADELTVVDSYL--QLASIQ----------------- 235

Query: 304 PSLCDTLRLIHGPHLQVSWSAPPGL---RLPWDREDLLEMLGNLLDNACKWA------DS 354
LQ P + ++P +L + L++N K
Sbjct: 236 ----------FEDRLQFENQINPAIMDVQVP----PML--VQTLVENGIKHGIAQLPQGG 279

Query: 355 EVRLTVAQGEGMVRLKVDDDGPGILPDQRQAVLERGTRLDEQVSGHGLGLGIARD-IAEA 413
++ L + G V L+V++ G L + ++ G GL R+ +
Sbjct: 280 KILLKGTKDNGTVTLEVENTGSLALKNTKE--------------STGTGLQNVRERLQML 325

Query: 414 CGGRLSLE-DSPLGGLRVSVELP 435
G ++ G + V +P
Sbjct: 326 YGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2654FLAGELLIN300.041 Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 30.0 bits (67), Expect = 0.041
Identities = 17/98 (17%), Positives = 39/98 (39%), Gaps = 3/98 (3%)

Query: 438 DVKVSVRDARSTADQSAAISSQTSAGMQQQFREIDQVATASHEMTATAQDVARSAAQAAD 497
K+S +A + + I+ + + +A + + TA V+ + A
Sbjct: 352 SAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAA 411

Query: 498 AARGADQATRDGLALIDRTTQSIDSLAANLTSAMGQVE 535
AA+ + LA ID +D++ ++L + + +
Sbjct: 412 AAKKSTANP---LASIDSALSKVDAVRSSLGAIQNRFD 446


106PA2528PA2520N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA25282140.878520resistance-nodulation-cell division (RND) efflux
PA25272141.300406resistance-nodulation-cell division (RND) efflux
PA25261121.840520resistance-nodulation-cell division (RND) efflux
PA25250121.818934hypothetical protein
PA2524-1111.712362two-component sensor
PA2523-1121.387825two-component response regulator
PA25220121.393460outer membrane protein CzcC
PA2521-1121.146264resistance-nodulation-cell division (RND)
PA25200130.982726resistance-nodulation-cell division (RND)
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2528RTXTOXIND455e-07 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 44.8 bits (106), Expect = 5e-07
Identities = 31/172 (18%), Positives = 65/172 (37%), Gaps = 16/172 (9%)

Query: 124 TYKAALAQAEGTLMQNQAQLKNAEIDLQRYKGLYAEDSIAKQTLDTQEAQVRQLQGTIRT 183
L + L Q ++++ +A+ + Q L+ + + ++RQ I
Sbjct: 263 EAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILD---------KLRQTTDNIGL 313

Query: 184 NQGQVDDARLNLTFTEVRAPISGR-LGLRQVDIGNLVTSGDTTPLVVITQVKPISVVFSL 242
++ + +RAP+S + L+ G +VT+ +T +V++ + + V +
Sbjct: 314 LTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETL-MVIVPEDDTLEVTALV 372

Query: 243 PQQQIGTVVEQMNGPGKLTVTALDRNQDKVLAEGTLT--TLDNQIDTTTGTV 292
+ IG + + V A + L G + LD D G V
Sbjct: 373 QNKDIGFINVGQ--NAIIKVEAFPYTRYGYL-VGKVKNINLDAIEDQRLGLV 421



Score = 41.4 bits (97), Expect = 6e-06
Identities = 26/125 (20%), Positives = 49/125 (39%), Gaps = 8/125 (6%)

Query: 80 ALGTVTAF-NTVNVKPRVNGELVKVLFQEGQEVKAGDLLAVVDPRTYKAALAQAEGTLMQ 138
A G +T + +KP N + +++ +EG+ V+ GD+L + +A + + +L+Q
Sbjct: 86 ANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQ 145

Query: 139 NQAQL--KNAEIDLQRYKGLYAEDSIAKQTLDT-QEAQVRQLQGTIR----TNQGQVDDA 191
+ + L + E +V +L I+ T Q Q
Sbjct: 146 ARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQK 205

Query: 192 RLNLT 196
LNL
Sbjct: 206 ELNLD 210


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2527ACRIFLAVINRP8400.0 Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 840 bits (2171), Expect = 0.0
Identities = 301/1036 (29%), Positives = 514/1036 (49%), Gaps = 29/1036 (2%)

Query: 4 SRPFILRPVATTLLMVAILLSGLIAYRFLPISALPEVDYPTIQVVTLYPGASPEIMTSSI 63
+ FI RP+ +L + ++++G +A LP++ P + P + V YPGA + + ++
Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61

Query: 64 TAPLENQLGQIPGLNEMSSSS-SGGASVITLQFSLQSNLDVAEQEVQAAINAAQSLLPND 122
T +E + I L MSS+S S G+ ITL F ++ D+A+ +VQ + A LLP +
Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121

Query: 123 LPNQPVFSKVNPADAPILTLAVMSDG--MPLPQIQDLVDTRLAQKISQISGVGLVSISGG 180
+ Q + S + + ++ +SD I D V + + +S+++GVG V + G
Sbjct: 122 VQQQGI-SVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 QRPAVRVRANPTALAAAGLSLEDLRSTVTSNNLNGPKGSFDGPTRAS------TLDANDQ 234
Q A+R+ + L L+ D+ + + N G G ++ A +
Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 235 LRSADAYRDLII-AYKNGSPLRIRDVASVEDDAENVRLAAWANNLPAVVLNIQRQPGANV 293
++ + + + + +GS +R++DVA VE EN + A N PA L I+ GAN
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 294 IEVVDRIKALLPQLQSTLPGNLDVQVLTDRTTTIRASVKDVQFELALAVALVVMVTFLFL 353
++ IKA L +LQ P + V D T ++ S+ +V L A+ LV +V +LFL
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 354 RNVYATLIPSFAVPLSLIGTFGVMYLSGFSINNLTLMALTIATGFVVDDAIVMVENIARY 413
+N+ ATLIP+ AVP+ L+GTF ++ G+SIN LT+ + +A G +VDDAIV+VEN+ R
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 414 L-EQGDSPLEAALKGSKQIGFTIISLTFSLIAVLIPLLFMGDVAGRLFREFAITLAVAIL 472
+ E P EA K QI ++ + L AV IP+ F G G ++R+F+IT+ A+
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 473 ISGFVSLTLTPMLSAKLLRHIDEDQQ---GRFARAAGRVIDGLIAQYAKALRVVLRHQPL 529
+S V+L LTP L A LL+ + + G F D + Y ++ +L
Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 530 TLLVAIATLALTALLYLAMPKGFFPVQDTGVIQGVAEAPQSISFQAMSERQRALAEVVLK 589
LL+ +A +L+L +P F P +D GV + + P + + + + + LK
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 590 DPA--VASLSSYIGVDGSNPTLNTGRLLINLKPHSERDV---TASEVIQRLQPELDHLPG 644
+ V S+ + G S N G ++LKP ER+ +A VI R + EL +
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659

Query: 645 IKLYMQPVQDLTIEDRVARTQYQFTLQD---ADPDVLAEWVPKLVARLQELP-QLADVAS 700
+ P I + T + F L D D L + +L+ + P L V
Sbjct: 660 GFVI--PFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRP 717

Query: 701 DWQDKGLQAYLNIDRDTASRLGVKLSDIDSVLYNAFGQRLISTIFTQATQYRVVLEVAPQ 760
+ + Q L +D++ A LGV LSDI+ + A G ++ + ++ ++ +
Sbjct: 718 NGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAK 777

Query: 761 FQLGPQALEQLYVPSSDGTQVRLSSLAKVEERHTLLAINHIAQFPSATLSFNLAKGYSLG 820
F++ P+ +++LYV S++G V S+ + + PS + A G S G
Sbjct: 778 FRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSG 837

Query: 821 EAVEAIRGVEASLELPLSMQGSFRGAALAFEASLSNTLLLILASVVTMYIVLGILYESFI 880
+A+ + + + +LP + + G + S + L+ S V +++ L LYES+
Sbjct: 838 DAMALMENLAS--KLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWS 895

Query: 881 HPVTILSTLPSAGVGALLALMLAGQEIGIVAIIGIILLIGIVKKNAIMMIDFALDAERNE 940
PV+++ +P VG LLA L Q+ + ++G++ IG+ KNAI++++FA D E
Sbjct: 896 IPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKE 955

Query: 941 GKPPHEAIYQACLLRFRPILMTTMAALLGALPLMLAGGAGAELRQPLGITMVGGLLLSQV 1000
GK EA A +R RPILMT++A +LG LPL ++ GAG+ + +GI ++GG++ + +
Sbjct: 956 GKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATL 1015

Query: 1001 LTLFTTPVIYLYFDRL 1016
L +F PV ++ R
Sbjct: 1016 LAIFFVPVFFVVIRRC 1031


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2526ACRIFLAVINRP8160.0 Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 816 bits (2109), Expect = 0.0
Identities = 290/1034 (28%), Positives = 512/1034 (49%), Gaps = 31/1034 (2%)

Query: 7 FIRRPVATTLLTLALLLAGTLSFGLLPVAPLPNVDFPAIVVSASLPGASPETMASSVATP 66
FIRRP+ +L + L++AG L+ LPVA P + PA+ VSA+ PGA +T+ +V
Sbjct: 5 FIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQV 64

Query: 67 LERSLGRIAGISEMTSSS-SLGSTTVVLVFDLEKDIDGAAREVQAAINGAMSLLPSGMPN 125
+E+++ I + M+S+S S GS T+ L F D D A +VQ + A LLP +
Sbjct: 65 IEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQ 124

Query: 126 NPSYRKANPSDMPIMVLTLTSET--QSRGEMYDLASTVLAPKLSQVQGVGQVSIGGSSLP 183
S +MV S+ ++ ++ D ++ + LS++ GVG V + G+
Sbjct: 125 -QGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY- 182

Query: 184 AVRVDLNPDAMSQYGLSLDSVRTAIAAANSNGPKG------AVEKDDKHWQVDANDQLRK 237
A+R+ L+ D +++Y L+ V + N G A+ + + A + +
Sbjct: 183 AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKN 242

Query: 238 AREYEPLVIHYNADNGAAVRLGDVAKVSDSVEDVRNAGFSDDLPAVLLIVTRQPGANIIE 297
E+ + + N+D G+ VRL DVA+V E+ + PA L + GAN ++
Sbjct: 243 PEEFGKVTLRVNSD-GSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301

Query: 298 ATDAIHAQLPVLQELLGPQVKLNVMDDRSPSIRASLEEAELTLLISVALVILVVFLFLRN 357
AI A+L LQ +K+ D +P ++ S+ E TL ++ LV LV++LFL+N
Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361

Query: 358 GRATLIPSLAVPVSLIGTFAVMYLCDFSLNNLSLMALIIATGFVVDDAIVVVENIARRI- 416
RATLIP++AVPV L+GTFA++ +S+N L++ +++A G +VDDAIVVVEN+ R +
Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421

Query: 417 EEGDPPIQAAITGARQVGFTVLSMTLSLVAVFIPLLLMGGLTGRLFREFAVTLSAAILVS 476
E+ PP +A Q+ ++ + + L AVFIP+ GG TG ++R+F++T+ +A+ +S
Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481

Query: 477 LVVSLTLTPMLCARLLRPLKRPEG---ASLARRSDRFFAAFMLRYRASLGWALEHSRLMV 533
++V+L LTP LCA LL+P+ + F + Y S+G L + +
Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541

Query: 534 VIMLACIAMNLWLFVVVPKGFLPQQDSGRLRGYAVADQSISFQSLSAKMGEYRKILSSDP 593
+I +A + LF+ +P FLP++D G + + + + +
Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601

Query: 594 AVE-----NVVGFIGGGRWQSSNTGSFFVTLKPIGERDP----VEKVLTRLRERIAKVPG 644
V GF G Q+ N G FV+LKP ER+ E V+ R + + K+
Sbjct: 602 KANVESVFTVNGFSFSG--QAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659

Query: 645 AALYLNAGQDVRLGGRDSNAQYEFTLRS-DDLTLLREWAPKVEAAMRKLP-QLVDVNSDS 702
+ + G + +E ++ L + ++ + P LV V +
Sbjct: 660 GFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNG 719

Query: 703 QDKGVQTRLVIDRDRAATLGINVEMVDAVLNDSFGQRQVSTIFNPLNQYRVVMEVDQQYQ 762
+ Q +L +D+++A LG+++ ++ ++ + G V+ + ++ ++ D +++
Sbjct: 720 LEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFR 779

Query: 763 QSPEILRQVQVIGNDGQRVPLSAFSHYEPSRAPLEVNHQGQFAATTLSFNLAPGAQIGPT 822
PE + ++ V +G+ VP SAF+ + + + APG G
Sbjct: 780 MLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDA 839

Query: 823 REAIMQALEPLHIPVDVQTSFEGNAGAVQDTQNQMPWLILLALLAVYIVLGILYESYVHP 882
+ L P + + G + + + NQ P L+ ++ + V++ L LYES+ P
Sbjct: 840 MALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIP 897

Query: 883 LTILSTLPSAGVGALLALILCRSELSLIALIGIILLIGIVKKNAIMMIDFALEAERNHGL 942
++++ +P VG LLA L + + ++G++ IG+ KNAI++++FA + G
Sbjct: 898 VSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGK 957

Query: 943 SPREAILEACMMRFRPIMMTTLAALLGALPLIFGIGGDAALRRPLGITIVGGLIGSQLLT 1002
EA L A MR RPI+MT+LA +LG LPL G + + +GI ++GG++ + LL
Sbjct: 958 GVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLA 1017

Query: 1003 LYTTPVVYLYLDRL 1016
++ PV ++ + R
Sbjct: 1018 IFFVPVFFVVIRRC 1031



Score = 80.7 bits (199), Expect = 2e-17
Identities = 72/366 (19%), Positives = 135/366 (36%), Gaps = 15/366 (4%)

Query: 665 QYEFTLRSDDLTL--LREWAPK-VEAAMRKLPQLVDVNSDSQDKGVQTRLVIDRDRAATL 721
F + T + ++ V+ + +L + DV R+ +D D
Sbjct: 139 VAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY--AMRIWLDADLLNKY 196

Query: 722 GINVEMVDAVL---NDSFGQRQVSTIFNPLNQYRVVMEVDQQYQQSPEILRQVQVIGN-D 777
+ V L ND Q+ Q + Q ++PE +V + N D
Sbjct: 197 KLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNPEEFGKVTLRVNSD 256

Query: 778 GQRVPLSAFSHYEPSRAPLE--VNHQGQFAATTLSFNLAPGAQIGPTREAIMQALEPLH- 834
G V L + E G+ A L LA GA T +AI L L
Sbjct: 257 GSVVRLKDVARVELGGENYNVIARINGK-PAAGLGIKLATGANALDTAKAIKAKLAELQP 315

Query: 835 -IPVDVQ-TSFEGNAGAVQDTQNQMPWLILLALLAVYIVLGILYESYVHPLTILSTLPSA 892
P ++ VQ + +++ + A++ V++V+ + ++ L +P
Sbjct: 316 FFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVV 375

Query: 893 GVGALLALILCRSELSLIALIGIILLIGIVKKNAIMMIDFALEAERNHGLSPREAILEAC 952
+G L ++ + + G++L IG++ +AI++++ L P+EA ++
Sbjct: 376 LLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSM 435

Query: 953 MMRFRPIMMTTLAALLGALPLIFGIGGDAALRRPLGITIVGGLIGSQLLTLYTTPVVYLY 1012
++ + +P+ F G A+ R ITIV + S L+ L TP +
Sbjct: 436 SQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCAT 495

Query: 1013 LDRLRH 1018
L +
Sbjct: 496 LLKPVS 501


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2525RTXTOXIND386e-05 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 38.3 bits (89), Expect = 6e-05
Identities = 25/216 (11%), Positives = 62/216 (28%), Gaps = 30/216 (13%)

Query: 229 RADVAQARTQLKSTQAQAIDLKYQ--RAQLEHAIAVLVGLPPAQFNLPPVASVPKLPDLP 286
+ A TQ+ + + + R Q+ L LP + P ++
Sbjct: 126 KLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVL 185

Query: 287 AVVP----------SQLLERRPDIASAERKVISANAQIGVAKAAY------FPDLTLSAA 330
+ +Q ++ ++ + ++ A+I + D +
Sbjct: 186 RLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLH 245

Query: 331 GGYRSGSLSNWISTPNRFWSIGPQFAMTLFDGGLIGSQVDQAEATYDQTVATYRQTVLDG 390
+ + N++ + + I S++ A+ Y ++ +LD
Sbjct: 246 KQA--IAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDK 303

Query: 391 FREVEDYLVQLSVLDEESGVQREALESAREALRLAE 426
R+ D + L L E + +
Sbjct: 304 LRQTTDNIGLL----------TLELAKNEERQQASV 329



Score = 31.3 bits (71), Expect = 0.009
Identities = 18/150 (12%), Positives = 43/150 (28%), Gaps = 18/150 (12%)

Query: 171 ASAADLAAVRLSQQSQLAQNYLQLRVMDEQIRLLNDTVTAYERSLKVAENK-------YR 223
+ + + Q+Q Q L L + + + YE +V +++
Sbjct: 186 RLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLH 245

Query: 224 AGIVTRADVAQARTQLKSTQAQAIDLKYQRAQLEHAIAVLVGLPPAQFNLPPVASVPKLP 283
+ + V + + + K Q Q+E I A+ V
Sbjct: 246 KQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILS------AKEEYQLVTQ----- 294

Query: 284 DLPAVVPSQLLERRPDIASAERKVISANAQ 313
+ +L + +I ++ +
Sbjct: 295 LFKNEILDKLRQTTDNIGLLTLELAKNEER 324


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2524PF06580363e-04 Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 35.6 bits (82), Expect = 3e-04
Identities = 25/105 (23%), Positives = 40/105 (38%), Gaps = 24/105 (22%)

Query: 370 LVSNAVRH----TPQGGRIDVRIGERAGHTEVRVSNDGPGIPPEYLPHLFERFYRRAGRQ 425
LV N ++H PQGG+I ++ + G + V N G
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN---------------- 306

Query: 426 TGAQAGTGLGLAIV-QSIMAYHGGRAEAE-SVPQQKTHLRLLFPS 468
+ TG GL V + + +G A+ + S Q K + +L P
Sbjct: 307 --TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIPG 349


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2523HTHFIS817e-20 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 80.6 bits (199), Expect = 7e-20
Identities = 34/145 (23%), Positives = 63/145 (43%), Gaps = 8/145 (5%)

Query: 2 RILIIEDEVKTADYLHQGLTESGYIVDRANDGIDGLHMALQHPYELVILDVNLPGIDGWD 61
IL+ +D+ L+Q L+ +GY V ++ +LV+ DV +P + +D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 LLRRLRER-SSARVMMLTGHGRLTDKVRGLDLGADDFMVKPFQFPELLARVRSLLRRHDQ 120
LL R+++ V++++ ++ + GA D++ KPF EL+ + L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE--- 121

Query: 121 APMQDVLRVADLELDASRHRAFRGR 145
R + LE D+ GR
Sbjct: 122 ----PKRRPSKLEDDSQDGMPLVGR 142


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2521RTXTOXIND462e-07 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 46.0 bits (109), Expect = 2e-07
Identities = 40/212 (18%), Positives = 82/212 (38%), Gaps = 22/212 (10%)

Query: 216 ISSPQLSDQRSEFAAAQRRLSLAQSTYKREQQLWKEGISAEQEFLLARQGLQ-EAEIALN 274
I+ + +Q +++ A L + +S + +Q+ E +SA++E+ L Q + E L
Sbjct: 249 IAKHAVLEQENKYVEAVNELRVYKS---QLEQIESEILSAKEEYQLVTQLFKNEILDKLR 305

Query: 275 NARAKIAALGG--NPSLQGGNRYELRAPFAGVLVE-KHLTQGEPVDGTANVFTLS-DLSS 330
I L + + +RAP + + + K T+G V + + + +
Sbjct: 306 QTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDT 365

Query: 331 VWATFNVPAQLLGQVRVGSKVKVLAQALDS----EVEGTVSYIG-DLLGEQTRAATARVT 385
+ T V + +G + VG + +A + G V I D + +Q V
Sbjct: 366 LEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVI 425

Query: 386 LSNPEST---------WRPGLFVSVQVAEATR 408
+S E+ G+ V+ ++ R
Sbjct: 426 ISIEENCLSTGNKNIPLSSGMAVTAEIKTGMR 457



Score = 31.3 bits (71), Expect = 0.010
Identities = 19/119 (15%), Positives = 42/119 (35%), Gaps = 13/119 (10%)

Query: 168 LAQVVSLPGEIRFNEDRTAHIVPRLPGIVDSVPANLGQAVKQGELLAVISSPQLSDQRSE 227
+ V + G++ + I P IV + G++V++G++L +++ ++
Sbjct: 80 VEIVATANGKLTHSGRSKE-IKPIENSIVKEIIVKEGESVRKGDVLLKLTALGA---EAD 135

Query: 228 FAAAQRRLSLAQSTYKREQQLWKE---------GISAEQEFLLARQGLQEAEIALNNAR 277
Q L A+ R Q L + + E F + +L +
Sbjct: 136 TLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQ 194


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2520ACRIFLAVINRP8110.0 Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 811 bits (2097), Expect = 0.0
Identities = 237/1055 (22%), Positives = 435/1055 (41%), Gaps = 56/1055 (5%)

Query: 5 IIQFAIEQRWLVLLAVLGMAGVGIGSYQKLSIDAVPDITNVQVQINTAAPGYSPLEVEQR 64
+ F I + + + + G + +L + P I V ++ PG V+
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 65 ITYPVETVMAGLPGLQETRSLS-RPGISQVTVIFEEGTDIYFARQQVNERLSTAREQLPE 123
+T +E M G+ L S S G +T+ F+ GTD A+ QV +L A LP+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 124 DISPTLGPISTGLGEIYLWTVEAEEGATKEDGSAYTPTDLRTIQDWIIRPQLRNVKGVAE 183
++ + YL D T D+ ++ L + GV +
Sbjct: 121 EVQQQGISVEKSSS-SYLMVA-----GFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGD 174

Query: 184 INTIGGYAKQFLIAPDPKKLAAYKLTLGDLQNAVLRNNENVGAGYI------ERRGEQLL 237
+ G I D L YKLT D+ N + N+ + AG + +
Sbjct: 175 VQLFGA-QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNAS 233

Query: 238 IRAPGQVKDMDDIRGIIV-SNVDGVPIRIRDVAEVGLGKELRTGAATENGREVVLGTVFM 296
I A + K+ ++ + + N DG +R++DVA V LG E A NG+ + +
Sbjct: 234 IIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKL 293

Query: 297 LIGENSREVAQAVGQRLEEINRTLPKGVKAITVYDRTTLVDKAVATVKKNLVEGAALVIA 356
G N+ + A+A+ +L E+ P+G+K + YD T V ++ V K L E LV
Sbjct: 294 ATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFL 353

Query: 357 VLFLFLGNIRAALITATIIPLSMLFTFTGMVGNRVSANLMSLG--ALDFGIIVDGAVVIV 414
V++LFL N+RA LI +P+ +L TF + S N +++ L G++VD A+V+V
Sbjct: 354 VMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVV 413

Query: 415 ENAIRRLAHAQAHHGRQLTRAERFHEVFAASREARRALVFGQIIIMVVYLPIFALTGVEG 474
EN + + + + ALV +++ V++P+ G G
Sbjct: 414 EN---------VERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTG 464

Query: 475 KMFHPMAFTVVTALLGAMILSVTFVPAAIALFITGKVKEEE----------NFVMRRARL 524
++ + T+V+A+ ++++++ PA A + E N +
Sbjct: 465 AIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVN 524

Query: 525 AYEPALRWVLGHRALVVGGALGAILLTGLVASRMGSEFIPSLSEGDFAMQGLRVPGTSL- 583
Y ++ +LG + + ++ R+ S F+P +G F G +
Sbjct: 525 HYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQE 584

Query: 584 -TQSVEMQQTLERKLMGKFPEIERVFARTGTAEIASDLMPPNASDSYVMLKPQSQWPDPK 642
TQ V + Q + L + +E VF G + NA ++V LKP + +
Sbjct: 585 RTQKV-LDQVTDYYLKNEKANVESVFTVNG---FSFSGQAQNAGMAFVSLKPWEERNGDE 640

Query: 643 KSREALLEELQAAALEVP-GSVYEFSQPIQLRFNELISGVRSDVA-VKVFGDDMQVLNDT 700
S EA++ + ++ G V F+ P EL + D + G L
Sbjct: 641 NSAEAVIHRAKMELGKIRDGFVIPFNMP---AIVELGTATGFDFELIDQAGLGHDALTQA 697

Query: 701 AEKI-SKVLQGIDGASEVKVEQTTGLPVLTVDIDRDKAARFGLNVGDIQDTVATALGGRN 759
++ Q V+ +++D++KA G+++ DI T++TALGG
Sbjct: 698 RNQLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTY 757

Query: 760 AGTLFEGDRRFDIVIRLPETLRADLPALSNLLIPLPPNNLARIDFIPLSDVARLDLSPGP 819
+ R + ++ R + L + + +P S G
Sbjct: 758 VNDFIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANG-----EMVPFSAFTTSHWVYGS 812

Query: 820 NQISRENGKRRIVVSANVRGRDIGSFVLEAQQKLQDGVKIPAGYWTTWGGQFEQLQSAAK 879
++ R NG + + + + L K+PAG W G Q + +
Sbjct: 813 PRLERYNGLPSMEIQGEAAPGTSSGDAMALMENLAS--KLPAGIGYDWTGMSYQERLSGN 870

Query: 880 RLQVVVPVALLLVFTLLFAMFNNVKDGLLVFTGIPFALTGGVLALWLRGIPLSISAAVGF 939
+ +V ++ ++VF L A++ + + V +P + G +LA L + VG
Sbjct: 871 QAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGL 930

Query: 940 IALSGVAVLNGLVMISFIRNLL-QEGRSLDQAVWEGAITRLRPVLMTALVASLGFVPMAL 998
+ G++ N ++++ F ++L+ +EG+ + +A RLRP+LMT+L LG +P+A+
Sbjct: 931 LTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAI 990

Query: 999 ATGTGAEVQRPLATVVIGGILSSTMLTLLVLPVLY 1033
+ G G+ Q + V+GG++S+T+L + +PV +
Sbjct: 991 SNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFF 1025



Score = 70.6 bits (173), Expect = 2e-14
Identities = 70/527 (13%), Positives = 160/527 (30%), Gaps = 46/527 (8%)

Query: 2 FERIIQFAIEQRWLVLLAVLGMAGVGIGSYQKLSIDAVPDITNVQVQINTAAPGYSPLEV 61
+ + + LL + + + +L +P+ P + E
Sbjct: 526 YTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQER 585

Query: 62 EQRI---------TYPVETVMA--GLPGLQETRSLSRPGISQVTV-IFEEGTDIYFARQQ 109
Q++ V + + G + G++ V++ +EE + +
Sbjct: 586 TQKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEA 645

Query: 110 VNERLSTAREQLPED-ISPTLGPISTGLGEIYLWTVEAEEGATKEDGSAYTPTDLRTIQD 168
V R ++ + + P P LG D + L ++
Sbjct: 646 VIHRAKMELGKIRDGFVIPFNMPAIVELG------TATGFDFELIDQAGLGHDALTQARN 699

Query: 169 WIIRPQLRNVKGVAEINTIGGY-AKQFLIAPDPKKLAAYKLTLGDLQNAVLRNNENVGAG 227
++ ++ + + G QF + D +K A ++L D+ +
Sbjct: 700 QLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVN 759

Query: 228 YIERRGEQ--LLIRAPGQ-VKDMDDIRGIIVSNVDGVPIRIRDVAEVGLGKELRTGAAT- 283
RG L ++A + +D+ + V + +G + G+
Sbjct: 760 DFIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTS----HWVYGSPRL 815

Query: 284 --ENGREVVLGTVFMLIGENSREVAQAVGQRLEEINRTLPKGVKAITVYDRTTLVDKAVA 341
NG + G +S + + E + LP G+ + +
Sbjct: 816 ERYNGLPSMEIQGEAAPGTSSGDAMALM----ENLASKLPAGI-GYDWTGMSYQERLSGN 870

Query: 342 TVKKNLVEGAALVIAVLFLFLGNIRAALITATIIPLSMLFTFTGMVGNRVSANLMSLGAL 401
+ +V L + + ++PL ++ ++ + L
Sbjct: 871 QAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGL 930

Query: 402 --DFGIIVDGAVVIVENAIRRLAHAQAHHGRQLTRAERFHEVFAASREARRALVFGQIII 459
G+ A++IVE A G+ + A A R R ++ +
Sbjct: 931 LTTIGLSAKNAILIVEFAK----DLMEKEGKGVVEA-----TLMAVRMRLRPILMTSLAF 981

Query: 460 MVVYLPIFALTGVEGKMFHPMAFTVVTALLGAMILSVTFVPAAIALF 506
++ LP+ G + + V+ ++ A +L++ FVP +
Sbjct: 982 ILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVI 1028


107PA2021PA2018N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA20210110.952274hypothetical protein
PA20200110.690465transcriptional regulator
PA20191120.921620multidrug efflux lipoprotein
PA20180120.368888multidrug efflux protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2021PF05272280.002 Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.1 bits (62), Expect = 0.002
Identities = 18/68 (26%), Positives = 26/68 (38%), Gaps = 11/68 (16%)

Query: 14 VEIEGSRHRAPVDSLRIGTDAEARLSVLYIDGKRLHISEED---------AQRLVVAGAE 64
V + G + + R AEA LY+ G+R S ED RLV G +
Sbjct: 711 VLVPGRANLVWLQKFRGQLFAEAL--HLYLAGERYFPSPEDEEIYFRPEQELRLVETGVQ 768

Query: 65 DQRRHLMA 72
+ L+
Sbjct: 769 GRLWALLT 776


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2020HTHTETR973e-27 TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 97.0 bits (241), Expect = 3e-27
Identities = 57/209 (27%), Positives = 100/209 (47%), Gaps = 5/209 (2%)

Query: 1 MARKTKEESQKTRDGILDAAERVFLEKGVGTTAMADLADAAGVSRGAVYGHYKNKIEVCL 60
MARKTK+E+Q+TR ILD A R+F ++GV +T++ ++A AAGV+RGA+Y H+K+K ++
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 61 AMCDRAFGQI-EVPDENA--RVPALDILLRAGM-GFLRQCCEPGSVQRVLEILYLKCERS 116
+ + + I E+ E +LR + L + ++EI++ KCE
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120

Query: 117 DENEPLLRRRELLEKQGQRFGLRQIRRAVERGELPARLDVELASIYLQSLWDGICGTLAW 176
E + + + L + + ++ +E LPA L A+I ++ G+ +
Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180

Query: 177 TERLRDDPWNRAERMFRAGLDSLRSSPYL 205
+ D A L+ P L
Sbjct: 181 APQSFDLK-KEARDYVAILLEMYLLCPTL 208


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2019RTXTOXIND423e-06 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 42.1 bits (99), Expect = 3e-06
Identities = 14/83 (16%), Positives = 30/83 (36%), Gaps = 3/83 (3%)

Query: 117 ASHAAAADKLKRYADLIKDRAISERE--YTEAQTDARQALAQIASAKAELEQARLRLGYA 174
+ + ++++ K+ + E RQ I EL + R +
Sbjct: 269 RVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQAS 328

Query: 175 TVTAPIDGR-ARRALVTEGALVG 196
+ AP+ + + + TEG +V
Sbjct: 329 VIRAPVSVKVQQLKVHTEGGVVT 351



Score = 41.7 bits (98), Expect = 3e-06
Identities = 24/137 (17%), Positives = 47/137 (34%), Gaps = 7/137 (5%)

Query: 67 EVRARVAGIVTRRLYEEGQDVRAGTVLFQIDPAPLKAALDISRGALARAEASHAAAADKL 126
E++ IV + +EG+ VR G VL ++ +A ++ +L +A L
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLE-QTRYQIL 156

Query: 127 KRYADLIKDRAISEREYT------EAQTDARQALAQIASAKAELEQARLRLGYATVTAPI 180
R +L K + + E + +L + + + ++ + L A
Sbjct: 157 SRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAER 216

Query: 181 DGRARRALVTEGALVGE 197
R E E
Sbjct: 217 LTVLARINRYENLSRVE 233


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA2018ACRIFLAVINRP10910.0 Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1091 bits (2822), Expect = 0.0
Identities = 507/1033 (49%), Positives = 709/1033 (68%), Gaps = 6/1033 (0%)

Query: 1 MARFFIDRPVFAWVISLLIVLAGVLAIRFLPVAQYPDIAPPVVNVSASYPGASAKVVEEA 60
MA FFI RP+FAWV+++++++AG LAI LPVAQYP IAPP V+VSA+YPGA A+ V++
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTAIIEREMNGAPGLLYTKATS-STGQASLTLTFRQGVNADLAAVEVQNRLKIVESRLPE 119
VT +IE+ MNG L+Y +TS S G ++TLTF+ G + D+A V+VQN+L++ LP+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 120 SVRRDGIYVEKAADSIQLIVTLTSSSGRYDAMELGEIASSNVLQALRRVEGVGKVETWGA 179
V++ GI VEK++ S ++ S + ++ + +SNV L R+ GVG V+ +GA
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 180 EYAMRIWPDPAKLTSMNLSASDLVNAVRRHNARLTVGDIGNLGVPDSAPISATVKVDDTL 239
+YAMRIW D L L+ D++N ++ N ++ G +G ++A++
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 240 VTPEQFGEIPLRIRADGGAIRLRDVARVEFGQSEYGFVSRVNQMTATGLAVKMAPGSNAV 299
PE+FG++ LR+ +DG +RL+DVARVE G Y ++R+N A GL +K+A G+NA+
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 300 ATAKRIRATLDELSRYFPEGVSYNIPYDTSAFVEISIRKVVSTLLEAMLLVFAVMYLFMQ 359
TAK I+A L EL +FP+G+ PYDT+ FV++SI +VV TL EA++LVF VMYLF+Q
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 360 NFRATLIPTLVVPVALLGTFTVMLGLGFSINVLTMFGMVLAIGILVDDAIIVVENVERLM 419
N RATLIPT+ VPV LLGTF ++ G+SIN LTMFGMVLAIG+LVDDAI+VVENVER+M
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 420 AEEGLSPHDATVKAMRQISGAIVGITVVLVSVFVPMAFFSGAVGNIYRQFAVTLAVSIGF 479
E+ L P +AT K+M QI GA+VGI +VL +VF+PMAFF G+ G IYRQF++T+ ++
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 480 SAFLALSLTPALCATLLRPIDADHHE-KRGFFGWFNRAFLRLTGRYRNAVAGILARPIRW 538
S +AL LTPALCATLL+P+ A+HHE K GFFGWFN F Y N+V IL R+
Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540

Query: 539 MLVYTLVIGVVALLFVRLPQAFLPEEDQGDFMIMVMQPEGTPMAETMANVGDVERYLAEH 598
+L+Y L++ + +LF+RLP +FLPEEDQG F+ M+ P G T + V Y ++
Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600

Query: 599 EP--VAYAYAVGGFSLYGDGTSSAMIFATLKDWSERREASQHVGAIVERINQRFAGLPNR 656
E V + V GFS G ++ M F +LK W ER A++ R + +
Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660

Query: 657 TVYAMNSPPLPDLGSTSGFDFRLQDRGGVGYEALVKARDQLLARAAEDP-RLANVMFAGQ 715
V N P + +LG+ +GFDF L D+ G+G++AL +AR+QLL AA+ P L +V G
Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720

Query: 716 GEAPQIRLDIDRRKAETLGVSMDEINTTLAVMFGSDYIGDFMHGSQVRKVVVQADGAKRL 775
+ Q +L++D+ KA+ LGVS+ +IN T++ G Y+ DF+ +V+K+ VQAD R+
Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780

Query: 776 GIDDIGRLHVRNEQGEMVPLATFAKAAWTLGPPQLTRYNGYPSFNLEGQAAPGYSSGEAM 835
+D+ +L+VR+ GEMVP + F + W G P+L RYNG PS ++G+AAPG SSG+AM
Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840

Query: 836 QAMEQLMQGLPEGIAHEWSGQSFEERLSGAQAPALFALSVLIVFLALAALYESWSIPLAV 895
ME L LP GI ++W+G S++ERLSG QAPAL A+S ++VFL LAALYESWSIP++V
Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900

Query: 896 ILVVPLGVLGALLGVSLRGLPNDIYFKVGLITIIGLSAKNAILIIEVAKD-HYQEGMSLL 954
+LVVPLG++G LL +L ND+YF VGL+T IGLSAKNAILI+E AKD +EG ++
Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960

Query: 955 QATLEAARLRLRPIVMTSLAFGFGVVPLALSSGAGSGAQVAIGTGVLGGIVTATVLAVFL 1014
+ATL A R+RLRPI+MTSLAF GV+PLA+S+GAGSGAQ A+G GV+GG+V+AT+LA+F
Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020

Query: 1015 VPLFFLVVGRLFR 1027
VP+FF+V+ R F+
Sbjct: 1021 VPVFFVVIRRCFK 1033


108PA1728PA1721N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA1728-27-0.086857hypothetical protein
PA1727-18-0.027073signaling protein
PA1726090.061638beta-glucosidase
PA1725417-0.010990type III secretion system protein
PA1724418-0.816155type III export protein PscK
PA1723318-1.434019type III export protein PscJ
PA17223150.101276type III export protein PscI
PA17211150.316708type III export protein PscH
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1728SECBCHAPRONE260.025 Bacterial protein-transport SecB chaperone protein ...
		>SECBCHAPRONE#Bacterial protein-transport SecB chaperone protein

signature.
Length = 170

Score = 26.4 bits (58), Expect = 0.025
Identities = 8/30 (26%), Positives = 15/30 (50%)

Query: 19 EGGFDFARIHPIDFFAIFPSEREARQAAGQ 48
G F + P++F A+F + ++ A Q
Sbjct: 131 RGTFPALNLSPVNFDALFMDYLQRQEQAEQ 160


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1725TYPE4SSCAGX300.009 Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein

signature.
Length = 522

Score = 29.8 bits (66), Expect = 0.009
Identities = 27/102 (26%), Positives = 45/102 (44%), Gaps = 8/102 (7%)

Query: 21 LRARDYQDYLSANRLVEAA--------RERAAEIEREAHEVYQEQKRLGWEAGLEEARLR 72
L RDYQ++L +L+ A +++A E E+EA E Q+ ++ E EE
Sbjct: 117 LMTRDYQEFLKTKKLIVDAPDPKELEEQKKALEKEKEAKEQAQKAQKDKREKRKEERAKN 176

Query: 73 QAGLIQETLLRCNRYYRQVDRQLGEVVLQAVRKVLRHYDAVE 114
+A L T N ++ L E++ Q L + +E
Sbjct: 177 RANLENLTNAMSNPQNLSNNKNLSELIKQQRENELDQMERLE 218


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1723FLGMRINGFLIF751e-17 Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 75.0 bits (184), Expect = 1e-17
Identities = 33/165 (20%), Positives = 69/165 (41%), Gaps = 6/165 (3%)

Query: 27 LYTGISQKEGNEMLALLRSEGVSADKQADKDGTVRLLVEESDIAEAVEVLKRKGYPRENF 86
L++ +S ++G ++A L + + V + E L ++G P+
Sbjct: 53 LFSNLSDQDGGAIVAQLTQMNIPYR--FANGSGA-IEVPADKVHELRLRLAQQGLPKGG- 108

Query: 87 STLKDVFPKDGLISSPIEERARLNYAKAQEISHTLSEIDGVLVARVHVVLPEERDGLGRK 146
+ ++ ++ S E+ A E++ T+ + V ARVH+ +P + R+
Sbjct: 109 AVGFELLDQEKFGISQFSEQVNYQRALEGELARTIETLGPVKSARVHLAMP-KPSLFVRE 167

Query: 147 SSPASASVFIKHAADVQLD-AYVPQIKQLVNNGIEGLSYDRISVV 190
SASV + LD + + LV++ + GL +++V
Sbjct: 168 QKSPSASVTVTLEPGRALDEGQISAVVHLVSSAVAGLPPGNVTLV 212


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1721PF090252052e-71 YopR Core
		>PF09025#YopR Core

Length = 143

Score = 205 bits (522), Expect = 2e-71
Identities = 143/143 (100%), Positives = 143/143 (100%)

Query: 1 MSRIDTPPGFAVYPSASPKAANLPAVDQVLAFEQALGGEPPAAGRRLAGLENGALGERLL 60
MSRIDTPPGFAVYPSASPKAANLPAVDQVLAFEQALGGEPPAAGRRLAGLENGALGERLL
Sbjct: 1 MSRIDTPPGFAVYPSASPKAANLPAVDQVLAFEQALGGEPPAAGRRLAGLENGALGERLL 60

Query: 61 QRFAQPLQGLEADRLELKAMLRAELPLGRQQQTFLLQLLGAVEHAPGGEYLAQLARRELQ 120
QRFAQPLQGLEADRLELKAMLRAELPLGRQQQTFLLQLLGAVEHAPGGEYLAQLARRELQ
Sbjct: 61 QRFAQPLQGLEADRLELKAMLRAELPLGRQQQTFLLQLLGAVEHAPGGEYLAQLARRELQ 120

Query: 121 VLIPLNGMLDNLVRNSHKLDLES 143
VLIPLNGMLDNLVRNSHKLDLES
Sbjct: 121 VLIPLNGMLDNLVRNSHKLDLES 143


109PA1716PA1706N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA1716-219-0.884258type III secretion outer membrane protein PscC
PA1715128-2.025533type III export apparatus protein
PA1714225-2.322608hypothetical protein
PA1713325-2.766279exoenzyme S transcriptional regulator ExsA
PA1712222-1.035002exoenzyme S synthesis protein ExsB
PA1711219-1.553346hypothetical protein
PA1710217-1.564270exoenzyme S synthesis protein ExsC
PA1709216-0.632473translocator outer membrane protein PopD
PA1708313-1.043018translocator protein PopB
PA1707314-0.598363regulatory protein PcrH
PA1706415-0.037941type III secretion protein PcrV
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1716TYPE3OMGPROT8160.0 Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein

family signature.
Length = 607

Score = 816 bits (2108), Expect = 0.0
Identities = 375/600 (62%), Positives = 472/600 (78%), Gaps = 7/600 (1%)

Query: 1 MRRLLIGGLLALLPGAVLRAQPLDWPSLPYDYVAQGESLRDVLANFGANYDASVIVSDKV 60
+R+L G LL L + AQ LDW +PY YVA+GESLRD+L +FGANYDA+V+VSDK+
Sbjct: 9 FKRVLTGTLLLLSSYSW--AQELDWLPIPYVYVAKGESLRDLLTDFGANYDATVVVSDKI 66

Query: 61 NDQVSGRFDLESPQAFLQLMASLYNLGWYYDGTVLYVFKTTEMQSRLVRLEQVGEAELKR 120
ND+VSG+F+ ++PQ FLQ +ASLYNL WYYDG VLY+FK +E+ SRL+RL++ AELK+
Sbjct: 67 NDKVSGQFEHDNPQDFLQHIASLYNLVWYYDGNVLYIFKNSEVASRLIRLQESEAAELKQ 126

Query: 121 ALTAAGIWEARFGWRADPSGRLVHVSGPGRYLELVEQTAQVLEQQYTLRSEKTGDLSVEI 180
AL +GIWE RFGWR D S RLV+VSGP RYLELVEQTA LEQQ +RSEKTG L++EI
Sbjct: 127 ALQRSGIWEPRFGWRPDASNRLVYVSGPPRYLELVEQTAAALEQQTQIRSEKTGALAIEI 186

Query: 181 FPLRYAVAEDRKIEYRDDEIEAPGIASILSRVLSDANVVAVGDEPGKLRPGP--QSSHAV 238
FPL+YA A DR I YRDDE+ APG+A+IL RVLSDA + V + ++ S+ A
Sbjct: 187 FPLKYASASDRTIHYRDDEVAAPGVATILQRVLSDATIQQVTVDNQRIPQAATRASAQAR 246

Query: 239 VQAEPSLNAVVVRDHKDRLPMYRRLIEALDRPSARIEVGLSIIDINAENLAQLGVDWSAG 298
V+A+PSLNA++VRD +R+PMY+RLI ALD+PSARIEV LSI+DINA+ L +LGVDW G
Sbjct: 247 VEADPSLNAIIVRDSPERMPMYQRLIHALDKPSARIEVALSIVDINADQLTELGVDWRVG 306

Query: 299 IRLGNNKSIQIRTTGQDSEEGGGAGNGAVGSLVDSRGLDFLLAKVTLLQSQGQAQIGSRP 358
IR GNN + I+TTG S A NGA+GSLVD+RGLD+LLA+V LL+++G AQ+ SRP
Sbjct: 307 IRTGNNHQVVIKTTGDQS---NIASNGALGSLVDARGLDYLLARVNLLENEGSAQVVSRP 363

Query: 359 TLLTQENTQAVLDQSETYYVRVTGERVAELKAITYGTMLKMTPRVVTLGDTPEISLSLHI 418
TLLTQEN QAV+D SETYYV+VTG+ VAELK ITYGTML+MTPRV+T GD EISL+LHI
Sbjct: 364 TLLTQENAQAVIDHSETYYVKVTGKEVAELKGITYGTMLRMTPRVLTQGDKSEISLNLHI 423

Query: 419 EDGSQKPNSAGLDKIPTINRTVIDTIARVGHGQSLLIGGIYRDELSQSQRKVPWLGDIPY 478
EDG+QKPNS+G++ IPTI+RTV+DT+ARVGHGQSL+IGGIYRDELS + KVP LGDIPY
Sbjct: 424 EDGNQKPNSSGIEGIPTISRTVVDTVARVGHGQSLIIGGIYRDELSVALSKVPLLGDIPY 483

Query: 479 LGALFRTTADTVRRSVRLFLIEPRLIDDGVGHYLALNNRRDLRGGLLEIDELSNQSLSLR 538
+GALFR ++ RR+VRLF+IEPR+ID+G+ H+LAL N +DLR G+L +DE+SNQS +L
Sbjct: 484 IGALFRRKSELTRRTVRLFIIEPRIIDEGIAHHLALGNGQDLRTGILTVDEISNQSTTLN 543

Query: 539 KLLGSARCQALAPARAEQERLRQAGQGSFLTPCRMGAQEGWRVTDGACPKDGAWCVGAER 598
KLLG ++CQ L A+ Q+ L Q + S+LT C+M GWRV +GAC +WCV A +
Sbjct: 544 KLLGGSQCQPLNKAQEVQKWLSQNNKSSYLTQCKMDKSLGWRVVEGACTPAQSWCVSAPK 603


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1715PF05932932e-27 Tir chaperone protein (CesT)
		>PF05932#Tir chaperone protein (CesT)

Length = 127

Score = 92.6 bits (230), Expect = 2e-27
Identities = 25/120 (20%), Positives = 41/120 (34%), Gaps = 5/120 (4%)

Query: 2 DHLLSGLATRLGQGPFVADRTGSYHLRIDGQSVLLLRQGDDLLLESPLEHAPLDPQRDQQ 61
LL + L P V D G+ ++ ID L L D E L L+P +
Sbjct: 7 KTLLDDFSRSLEMQPLVFDDHGTCNMIIDNTFALTLS--CDYARERLLLIGLLEP--HKD 62

Query: 62 GLLRALLSRVASWSRRYPQAIVLDADGRLLLQA-RLGLDGLDPERLERALAAQVGLLEAL 120
+ LL+ + + LD L + + L L+R +A + +
Sbjct: 63 IPQQCLLAGALNPLLNAGPGLGLDEKSGLYHAYQSIPREKLSVPTLKREMAGLLEWMRGW 122


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1710PF05932477e-10 Tir chaperone protein (CesT)
		>PF05932#Tir chaperone protein (CesT)

Length = 127

Score = 47.5 bits (113), Expect = 7e-10
Identities = 27/118 (22%), Positives = 49/118 (41%), Gaps = 4/118 (3%)

Query: 10 LLAEFAGRIGLPSLSLDEEGMASLLFDEQVGVTLLLLAERERLLLEADVAGIDVLGEGIF 69
LL +F+ + + L D+ G +++ D +TL RERLLL + +
Sbjct: 9 LLDDFSRSLEMQPLVFDDHGTCNMIIDNTFALTLSCDYARERLLLIGLLEP---HKDIPQ 65

Query: 70 RQLASFNRHWHRFDLH-FGFDELTGKVQLYAQILAAQLTLECFEATLANLLDHAEFWQ 126
+ L + + G DE +G Y I +L++ + +A LL+ W+
Sbjct: 66 QCLLAGALNPLLNAGPGLGLDEKSGLYHAYQSIPREKLSVPTLKREMAGLLEWMRGWR 123


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1709PF05844385e-137 YopD protein
		>PF05844#YopD protein

Length = 295

Score = 385 bits (989), Expect = e-137
Identities = 291/295 (98%), Positives = 293/295 (99%)

Query: 1 MIDTQYSLAATQAAIPSEPIAPGAAGRSVGTPQAAADLPQVPAARADRVELNAPRQVLDP 60
MIDTQYSLAATQAAIPSEPIAPGAAGRSVGTPQAAA+LPQVPAARADRVELNAPRQVLDP
Sbjct: 1 MIDTQYSLAATQAAIPSEPIAPGAAGRSVGTPQAAAELPQVPAARADRVELNAPRQVLDP 60

Query: 61 VRMEAAGSELDSSVELLLILFRIAQKARELGVLQRDNENQSIIHAQKAQVDEMRSGATLM 120
VRMEAAGSELDSSVELLLILFRIAQKARELGVLQRDNENQ+IIHAQKAQVDEMRSGATLM
Sbjct: 61 VRMEAAGSELDSSVELLLILFRIAQKARELGVLQRDNENQAIIHAQKAQVDEMRSGATLM 120

Query: 121 IAMAVIAGVGALASAVVGSLGALKNGKAISQEKTLQKNIDGRNELIDAKMQALGKTSDED 180
IAMAVIAGVGALASAVVGSLGALKNGKAISQEKTLQKNIDGRNELIDAKMQALGKTSDED
Sbjct: 121 IAMAVIAGVGALASAVVGSLGALKNGKAISQEKTLQKNIDGRNELIDAKMQALGKTSDED 180

Query: 181 RKIVGKVWAADQVQDSVALRAAGRAFESRNGALQVANTVIQSFVQMANASVQVRQGESQA 240
RKIVGKVWAADQ QDSVALRAAGRAFESRNGALQVANTVIQSFVQMANASVQVRQGESQA
Sbjct: 181 RKIVGKVWAADQAQDSVALRAAGRAFESRNGALQVANTVIQSFVQMANASVQVRQGESQA 240

Query: 241 SAREGEVNATIGQSQKQKVEDQMSFDAGFMKDVLQLIQQYTQSHNQAWRAAAGVV 295
SARE EVNATIGQSQKQKVEDQMSFDAGFMKDVLQLIQQYTQSHNQAWRAAAGVV
Sbjct: 241 SAREEEVNATIGQSQKQKVEDQMSFDAGFMKDVLQLIQQYTQSHNQAWRAAAGVV 295


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1707SYCDCHAPRONE2084e-72 Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD

chaperone signature.
Length = 168

Score = 208 bits (531), Expect = 4e-72
Identities = 95/167 (56%), Positives = 126/167 (75%)

Query: 1 MNQPTPSDTDQQQALEAFLRDGGTLAMLRGLSEDTLEQLYALGFNQYQAGKWDDAQKIFQ 60
M Q T + Q A+E+FL+ GGT+AML +S DTLEQLY+L FNQYQ+GK++DA K+FQ
Sbjct: 1 MQQETTDTQEYQLAMESFLKGGGTIAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQ 60

Query: 61 ALCMLDHYDARYFLGLGACRQSLGLYEQALQSYSYGALMDINEPRFPFHAAECHLQLGDL 120
ALC+LDHYD+R+FLGLGACRQ++G Y+ A+ SYSYGA+MDI EPRFPFHAAEC LQ G+L
Sbjct: 61 ALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGEL 120

Query: 121 DGAESGFYSARALAAAQPAHEALAARAGAMLEAVTARKDRAYESDNA 167
AESG + A+ L A + + L+ R +MLEA+ +K+ +E +
Sbjct: 121 AEAESGLFLAQELIADKTEFKELSTRVSSMLEAIKLKKEMEHECVDN 167


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1706LCRVANTIGEN344e-121 Low calcium response V antigen signature.
		>LCRVANTIGEN#Low calcium response V antigen signature.

Length = 326

Score = 344 bits (884), Expect = e-121
Identities = 115/296 (38%), Positives = 171/296 (57%), Gaps = 32/296 (10%)

Query: 25 ASAEQEELLALLRSERIVLAHAGQPLSEAQVL-------------KALAWLLAANPSAPP 71
S+ EEL+ L++ + I ++ P +++V K LA+ L +
Sbjct: 28 GSSVLEELVQLVKDKNIDISIKYDPRKDSEVFANRVITDDIELLKKILAYFLPEDAILKG 87

Query: 72 GQ-------GLEVLREVLQARRQPGAQWDLREFLVSAYFSLHG-RLDEDVIGVYKDVLQT 123
G G++ ++E L++ P QW+LR F+ +FSL R+D+D++ V D +
Sbjct: 88 GHYDNQLQNGIKRVKEFLES--SPNTQWELRAFMAVMHFSLTADRIDDDILKVIVDSMNH 145

Query: 124 QDGKRKALLDELKALTAELKVYSVIQSQINAALSAKQGIRIDAGGIDLVDPTLYGYAVGD 183
R L +EL LTAELK+YSVIQ++IN LS+ I I I+L+D LYGY +
Sbjct: 146 HGDARSKLREELAELTAELKIYSVIQAEINKHLSSSGTINIHDKSINLMDKNLYGYT-DE 204

Query: 184 PRWKDSPEYALLSNLDTFSGKL--------SIKDFLSGSPKQSGELKGLSDEYPFEKDNN 235
+K S EY +L + + ++ SIKDFL K++G L L + Y + KDNN
Sbjct: 205 EIFKASAEYKILEKMPQTTIQVDGSEKKIVSIKDFLGSENKRTGALGNLKNSYSYNKDNN 264

Query: 236 PVGNFATTVSDRSRPLNDKVNEKTTLLNDTSSRYNSAVEALNRFIQKYDSVLRDIL 291
+ +FATT SD+SRPLND V++KTT L+D +SR+NSA+EALNRFIQKYDSV++ +L
Sbjct: 265 ELSHFATTCSDKSRPLNDLVSQKTTQLSDITSRFNSAIEALNRFIQKYDSVMQRLL 320


110PA1698PA1690N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA16981162.159758type III secretion outer membrane protein PopN
PA16970161.821438type III secretion system ATPase
PA16963171.474566translocation protein in type III secretion
PA16952180.344332translocation protein in type III secretion
PA1694211-1.179490type III secretion system protein
PA1693110-1.744888type III secretion system protein
PA1692110-1.650617translocation protein in type III secretion
PA169109-0.854465translocation protein in type III secretion
PA1690-18-0.826011translocation protein in type III secretion
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1698PF072012844e-98 Hypersensitivity response secretion protein HrpJ
		>PF07201#Hypersensitivity response secretion protein HrpJ

Length = 293

Score = 284 bits (727), Expect = 4e-98
Identities = 134/294 (45%), Positives = 181/294 (61%), Gaps = 7/294 (2%)

Query: 1 MDILQSSSAAPLA-----PREAANAPAQQAGGSFQGERVHYVSVS-QSLADAAEELTFAF 54
M L + S P A++ Q G F+GE V VS + QS+AD AEE+TF F
Sbjct: 1 MTTLHNLSYGNTPLHNERPEIASSQIVNQTLGQFRGESVQIVSGTLQSIADMAEEVTFVF 60

Query: 55 SERAEKSLAKRRLSDAHARLSEVQAMLQEYWKRIPDLESQQKLEALIAHLGSGQLSSLAQ 114
SER E SL KR+LSD+ AR+S+V+ + +Y ++P+LE +Q + L++ L + SL+Q
Sbjct: 61 SERKELSLDKRKLSDSQARVSDVEEQVNQYLSKVPELEQKQNVSELLSLLSNSPNISLSQ 120

Query: 115 LSAYLEGFSSEISQRFLALSRARDVLAGRPEARAMLALVDQALLRMADEQGLEIELGLRI 174
L AYLEG S E S++F L RD L GRPE + LV+QAL+ MA+EQG I LG RI
Sbjct: 121 LKAYLEGKSEEPSEQFKMLCGLRDALKGRPELAHLSHLVEQALVSMAEEQGETIVLGARI 180

Query: 175 EPLAAEASAAGVGDIQALRDTYRDAVLDYRGLSAAWQDIQARFAATPLERVVAFLQKALS 234
P A S +GV +Q LRDTYRDAV+ Y+G+ A W D+Q RF ++ V+ FLQKALS
Sbjct: 181 TPEAYRESQSGVNPLQPLRDTYRDAVMGYQGIYAIWSDLQKRFPNGDIDSVILFLQKALS 240

Query: 235 ADLDSQSSRLDPVKLERVMSDMHKLRVLGGLAEQVGALWQVLVTGERGHGIRAF 288
ADL SQ S KL V+SD+ KL+ G +++QV WQ G + +G+R F
Sbjct: 241 ADLQSQQSGSGREKLGIVISDLQKLKEFGSVSDQVKGFWQFFSEG-KTNGVRPF 293


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1695IGASERPTASE401e-05 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 40.4 bits (94), Expect = 1e-05
Identities = 29/133 (21%), Positives = 42/133 (31%), Gaps = 18/133 (13%)

Query: 19 APLPPLRAQQIAFEQALPAHRPPAPRPPFDKGDETTEAAATADAPTSTPLADQPAAPAAD 78
+ + P + Q + R P T + T+T AD PA +
Sbjct: 1127 SQVSPKQEQSETVQPQAEPARENDP----------TVNIKEPQSQTNTT-ADT-EQPAKE 1174

Query: 79 RPPTIRQPPMPVAADATPTPTPTPTPTPTPTPTPTPTPTV-SPSGSVARQAPAVSARVAA 137
+ Q PV T + P T T PTV S S + + S R
Sbjct: 1175 TSSNVEQ---PVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVP 1231

Query: 138 STQAREPASVSAP 150
EPA+ S+
Sbjct: 1232 HNV--EPATTSSN 1242


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1694TYPE3OMOPROT832e-20 Type III secretion system outer membrane O protein ...
		>TYPE3OMOPROT#Type III secretion system outer membrane O protein

family signature.
Length = 303

Score = 83.1 bits (205), Expect = 2e-20
Identities = 46/177 (25%), Positives = 73/177 (41%), Gaps = 14/177 (7%)

Query: 130 RLALWLDGDPATLLARLPPRPSAQRLAIPLRLSLQWPGLPLDASELRTLEPGDLLLLPAG 189
R LW + P L A RP R + + L L + GD+LL+
Sbjct: 126 RGGLWFEHLPE-LPAVGGGRPKMLRWPLRFVIGSSDTQRSL----LGRIGIGDVLLIRTS 180

Query: 190 HRPDAALLGVLEGRPWARCQLHSTQL-ELLDMH----DTPSLADGEDLHELDQLPIPVSF 244
A + + ++ + E LD+ + + E L L+QLP+ + F
Sbjct: 181 R----AEVYCYAKKLGHFNRVEGGIIVETLDIQHIEEENNTTETAETLPGLNQLPVKLEF 236

Query: 245 EVGRRTLDLHTLSTLQPGSLLDLDSALDGEVRILANQRCLGIGELVRLQDRLGVRVT 301
+ R+ + L L + LL L + + V I+AN LG GELV++ D LGV +
Sbjct: 237 VLYRKNVTLAELEAMGQQQLLSLPTNAELNVEIMANGVLLGNGELVQMNDTLGVEIH 293


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1693TYPE3IMPPROT2463e-85 Type III secretion system inner membrane P protein ...
		>TYPE3IMPPROT#Type III secretion system inner membrane P protein

family signature.
Length = 224

Score = 246 bits (629), Expect = 3e-85
Identities = 92/217 (42%), Positives = 142/217 (65%), Gaps = 7/217 (3%)

Query: 6 DELGLILGLALLALVPFIAVMATSFIKMTVVFSLLRNALGVQQIPPNMAMYGLAIILSLY 65
+++ LI LA L+PFI T F+K ++VF ++RNALG+QQIP NM + G+A++LS++
Sbjct: 3 NDISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLSMF 62

Query: 66 VMAPVGFATRDYLRNHDVSLSDSASVERFLDEGMAPYRNFLKRQIQEREHTFFMESTRQV 125
VM P+ Y + DV+ +D +S+ + +DEG+ YR++L + FF + +
Sbjct: 63 VMWPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQLKR 122

Query: 126 WPSEYAERLDPD-------SLLILLPAFTVSELTRAFEIGFLIYLPFIAIDLIISNILLA 178
E E + D S+ LLPA+ +SE+ AF+IGF +YLPF+ +DL++S++LLA
Sbjct: 123 QYGEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVLLA 182

Query: 179 MGMMMVSPMTISLPFKLLLFVLLDGWARLTHGLVISY 215
+GMMM+SP+TIS P KL+LFV LDGW L+ GL++ Y
Sbjct: 183 LGMMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQY 219


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1692TYPE3IMQPROT684e-19 Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein

family signature.
Length = 86

Score = 68.3 bits (167), Expect = 4e-19
Identities = 35/78 (44%), Positives = 48/78 (61%)

Query: 5 DILHFTNQTLWLVLVLSLPPVLVAALIGTLVSLVQALTQIQEQTLGFVAKLVAVVVVLFA 64
D++ N+ L+LVL+LS P +VA +IG LV L Q +TQ+QEQTL F KL+ V + LF
Sbjct: 3 DLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFL 62

Query: 65 TSGWLGGELYRFAEMTLL 82
SGW G L + +
Sbjct: 63 LSGWYGEVLLSYGRQVIF 80


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1691TYPE3IMRPROT1415e-43 Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein

family signature.
Length = 261

Score = 141 bits (357), Expect = 5e-43
Identities = 46/245 (18%), Positives = 99/245 (40%), Gaps = 4/245 (1%)

Query: 9 LLLTYSLLLPRIISCFVVLPVLAKQTLGGGLVRNGVACSLALFAYPIVAGSLPPALGALD 68
L Y L R+++ P+L+++++ V+ G+A + P + + + +
Sbjct: 12 WLNLYFWPLLRVLALISTAPILSERSVPK-RVKLGLAMMITFAIAPSLPAN-DVPVFSFF 69

Query: 69 IALLIGKEVLLGLLIGFVATIPFWAMEATGFIIDNQRGAALASTFNPSLGSQTSPTGLLL 128
L +++L+G+ +GF F A+ G II Q G + A+ +P+ ++
Sbjct: 70 ALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLARIM 129

Query: 129 TQTLITLFFSGGAFLALVGSLFRSYASWPVSSFFPQLGSQWVAFFYAQFSQMLMLCALFA 188
+ LF + L L+ L ++ + P+ S S + + + A
Sbjct: 130 DMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPL--NSNAFLALTKAGSLIFLNGLMLA 187

Query: 189 APLLIAMFLAEFGLALVSRFAPSLNVFILAMPIKSLVASLLLVLYLGILMEHAYDALLLA 248
PL+ + L L++R AP L++F++ P+ V L+ + ++
Sbjct: 188 LPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFCEHLFSEI 247

Query: 249 VDPLR 253
+ L
Sbjct: 248 FNLLA 252


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1690TYPE3IMSPROT422e-150 Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein

family signature.
Length = 354

Score = 422 bits (1087), Expect = e-150
Identities = 232/349 (66%), Positives = 294/349 (84%)

Query: 1 MSAEKTEQPTAKKLRDARRQGQVVKSKEIVSSALILSLVALLMGFSDYYLEHLGKLLLLP 60
MS EKTEQPT KK+RDAR++GQV KSKE+VS+ALI++L A+LMG SDYY EH KL+L+P
Sbjct: 1 MSGEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIP 60

Query: 61 AEYIDLPFRQALETILENLLQELLYLLAPVLLVAALVVVLSHVGQYGFLLSLDSVKPDLK 120
AE LPF QAL +++N+L E YL P+L VAAL+ + SHV QYGFL+S +++KPD+K
Sbjct: 61 AEQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIK 120

Query: 121 KINPVEGAKKIFSIRSLVEFLKSTLKVALLSLLVWLTLQGNLASLLRIPACGLDCVAPVS 180
KINP+EGAK+IFSI+SLVEFLKS LKV LLS+L+W+ ++GNL +LL++P CG++C+ P+
Sbjct: 121 KINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLL 180

Query: 181 GLMLRQLMLVCAVGFLAIAVADYAFERHQHYKQLRMSKDEVKREYKEMEGSPEIKSKRRQ 240
G +LRQLM++C VGF+ I++ADYAFE +Q+ K+L+MSKDE+KREYKEMEGSPEIKSKRRQ
Sbjct: 181 GQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQ 240

Query: 241 FHQELQSSNLRADVRRSSVIVANPTHVAIGIRYRRGETPLPLVTLKHTDALALRVRRIAE 300
FHQE+QS N+R +V+RSSV+VANPTH+AIGI Y+RGETPLPLVT K+TDA VR+IAE
Sbjct: 241 FHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAE 300

Query: 301 EEGIPVLQRIPLARALLRDGNVDQYIPADLIQATAEVLRWLESQQTDTP 349
EEG+P+LQRIPLARAL D VD YIPA+ I+ATAEVLRWLE Q +
Sbjct: 301 EEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNIEKQ 349


111PA1461PA1432N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA1461212-1.202708flagellar motor protein MotD
PA1460112-1.336459flagellar motor protein
PA1459112-1.567433chemotaxis-specific methylesterase
PA1458-111-1.282274two-component sensor
PA145709-1.592002protein phosphatase CheZ
PA1456013-0.459239chemotaxis protein CheY
PA14550150.181610flagellar biosynthesis sigma factor FliA
PA14541150.411224flagellar synthesis regulator FleN
PA14532170.557393flagellar biosynthesis regulator FlhF
PA14523170.118638flagellar biosynthesis protein FlhA
PA14513200.158947hypothetical protein
PA14504190.004032hypothetical protein
PA1449619-0.669460flagellar biosynthesis protein FlhB
PA1448618-1.377531flagellar biosynthesis protein FliR
PA1447316-1.847776flagellar biosynthesis protein FliQ
PA1446011-0.556070flagellar biosynthesis protein FliP
PA144519-0.232770flagellar protein FliO
PA144408-0.842109flagellar motor switch protein FliN
PA1443-1100.238968flagellar motor switch protein FliM
PA14420100.645061flagellar basal body protein FliL
PA14410121.065403hypothetical protein
PA14401131.186448hypothetical protein
PA14391131.477253hypothetical protein
PA14380111.616378two-component sensor
PA1437-1100.519259two-component response regulator
PA1436-290.217486resistance-nodulation-cell division (RND) efflux
PA1435-113-1.193488resistance-nodulation-cell division (RND) efflux
PA1434-210-0.661788hypothetical protein
PA1433-211-0.894224hypothetical protein
PA1432-311-1.298525acyl-homoserine-lactone synthase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1461OMPADOMAIN691e-15 OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 69.2 bits (169), Expect = 1e-15
Identities = 34/125 (27%), Positives = 52/125 (41%), Gaps = 16/125 (12%)

Query: 128 EITLNSSLLFPSGDALPNDAAFDIVEKVAKILAPYKNP---IHVEGFTDDVPIHSPRYPT 184
TL S +LF A ++++ L+ + V G+TD I S Y
Sbjct: 214 HFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDR--IGSDAY-- 269

Query: 185 NWELSAARAASIVRLLGNDGVEPSRMAAVGYGEFQPVADNASAEGR---------AKNRR 235
N LS RA S+V L + G+ +++A G GE PV N + A +RR
Sbjct: 270 NQGLSERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRR 329

Query: 236 VVLVI 240
V + +
Sbjct: 330 VEIEV 334


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1459HTHFIS582e-11 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 58.3 bits (141), Expect = 2e-11
Identities = 37/142 (26%), Positives = 56/142 (39%), Gaps = 6/142 (4%)

Query: 2 AVKVLVVDDSGFFRRRVSEILSADGQIQVVGTGTNGREAIEQALALRPDVITMDYEMPLM 61
+LV DD R +++ LS G V T N A D++ D MP
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAG-YDVRITS-NAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 62 DGITAVRNIMQRCP-TPVLMFSSLTHEGARVTLDALDAGAVDYLPKNFEDISRNPDKVRQ 120
+ + I + P PVL+ S+ + A + GA DYLPK F D++ + +
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAI--KASEKGAYDYLPKPF-DLTELIGIIGR 117

Query: 121 LLCEKVLTIARSNRRSISLPPL 142
L E ++ S PL
Sbjct: 118 ALAEPKRRPSKLEDDSQDGMPL 139


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1458PF06580427e-06 Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 41.8 bits (98), Expect = 7e-06
Identities = 13/69 (18%), Positives = 30/69 (43%), Gaps = 10/69 (14%)

Query: 462 ETDLDKNLVEALADPLV--HLVRNAVDHGIESPEEREAAGKPRVGQVVLSAEQEGDHILL 519
E ++ +++ P++ LV N + HGI P+ G+++L ++ + L
Sbjct: 243 ENQINPAIMDVQVPPMLVQTLVENGIKHGIA--------QLPQGGKILLKGTKDNGTVTL 294

Query: 520 MITDDGKGM 528
+ + G
Sbjct: 295 EVENTGSLA 303


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1456HTHFIS902e-24 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.9 bits (223), Expect = 2e-24
Identities = 32/120 (26%), Positives = 55/120 (45%), Gaps = 3/120 (2%)

Query: 2 KILIVDDFSTMRRIIKNLLRDLGFTNTAEADDGTTALPMLHSGNFDFLVTDWNMPGMTGI 61
IL+ DD + +R ++ L G+ + T + +G+ D +VTD MP
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 DLLRAVRADERLKHLPVLMVTAEAKRDQIIEAAQAGVNGYVVKPFTAQVLKEKIEKIFER 121
DLL ++ + LPVL+++A+ I+A++ G Y+ KPF L I +
Sbjct: 64 DLLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1451cloacin300.022 Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 30.1 bits (67), Expect = 0.022
Identities = 15/48 (31%), Positives = 20/48 (41%)

Query: 398 SAGGSGGGRRRGGDYASSSGSSSSSSSSSSSDSFSGGGGSSGGGGASG 445
+ G GGG G ++S + S S G G+ GG G SG
Sbjct: 25 TGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSG 72


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1450cloacin363e-04 Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 35.8 bits (82), Expect = 3e-04
Identities = 15/37 (40%), Positives = 19/37 (51%)

Query: 380 GGGGSSGSSGGGSSSSSSSSSGGFSGGGGSSGGGGAS 416
GGG SG GG S + G SGGG +GG ++
Sbjct: 47 GGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSA 83



Score = 34.3 bits (78), Expect = 8e-04
Identities = 22/61 (36%), Positives = 24/61 (39%), Gaps = 15/61 (24%)

Query: 373 GQVRLSGGGGGSSGSS--------GGGSSS-------SSSSSSGGFSGGGGSSGGGGASD 417
G L GGG S GS GGGS S S + GG GG SG GG
Sbjct: 23 GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLS 82

Query: 418 S 418
+
Sbjct: 83 A 83



Score = 33.1 bits (75), Expect = 0.002
Identities = 15/38 (39%), Positives = 18/38 (47%)

Query: 379 GGGGGSSGSSGGGSSSSSSSSSGGFSGGGGSSGGGGAS 416
GGG GS GGGS + +G GG G+ G A
Sbjct: 47 GGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAV 84



Score = 30.5 bits (68), Expect = 0.013
Identities = 15/40 (37%), Positives = 15/40 (37%)

Query: 380 GGGGSSGSSGGGSSSSSSSSSGGFSGGGGSSGGGGASDSW 419
GG G GG S S SS GGG SG S
Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61



Score = 30.5 bits (68), Expect = 0.016
Identities = 16/34 (47%), Positives = 21/34 (61%), Gaps = 1/34 (2%)

Query: 385 SGSSGGGSSSSSSSSSGGFSGG-GGSSGGGGASD 417
SG G G ++ + S+SG +GG G GGGASD
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASD 35



Score = 29.7 bits (66), Expect = 0.023
Identities = 13/37 (35%), Positives = 18/37 (48%)

Query: 378 SGGGGGSSGSSGGGSSSSSSSSSGGFSGGGGSSGGGG 414
GG G S G SG G + S+ ++ F S+ G G
Sbjct: 65 GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAG 101


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1449TYPE3IMSPROT336e-116 Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein

family signature.
Length = 354

Score = 336 bits (864), Expect = e-116
Identities = 98/345 (28%), Positives = 183/345 (53%), Gaps = 2/345 (0%)

Query: 9 DKTEEPTEKRRREAREKGQLPRSRELNTLAILMAGAGGLLIYGADLAGALLRLMRSNFEL 68
+KTE+PT K+ R+AR+KGQ+ +S+E+ + A+++A + L+ +LM E
Sbjct: 4 EKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAEQ 63

Query: 69 SRETAMNTESMLQLLGASAYLAAQGLWPILLMLLVAAIVGPIALGGWLFSMDALQPKFSR 128
S ++++ ++ +P+L + + AI + G+L S +A++P +
Sbjct: 64 SYLPF--SQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKK 121

Query: 129 LNPLSGLKRMFSAKSLLELSKALIKFLVVLAVALLVLSADRDALLALAHQPLEQAILHSV 188
+NP+ G KR+FS KSL+E K+++K +++ + +++ + LL L +E
Sbjct: 122 INPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLG 181

Query: 189 RVVGWSAFWMACSLLLIAAVDVPYQIWDNRQKLLMTKQEVRDEYKDSEGKPEVKSKIRQM 248
+++ ++I+ D ++ + ++L M+K E++ EYK+ EG PE+KSK RQ
Sbjct: 182 QILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQF 241

Query: 249 QREMAQRRMMAAVPEADVVITNPTHFAVALKYDPAGGGAPLLLAKGNDFLALKIREVAQE 308
+E+ R M V + VV+ NPTH A+ + Y PL+ K D +R++A+E
Sbjct: 242 HQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEE 301

Query: 309 HKVMVMESPALARAVYYSTELDQEIPAGLYLAVAQVLAYVYQLKQ 353
V +++ LARA+Y+ +D IPA A A+VL ++ +
Sbjct: 302 EGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNI 346


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1448TYPE3IMRPROT1341e-40 Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein

family signature.
Length = 261

Score = 134 bits (340), Expect = 1e-40
Identities = 96/232 (41%), Positives = 144/232 (62%), Gaps = 2/232 (0%)

Query: 1 MLELTNAQIGGWIASFVLPLFRVAALLMTMPVIGTQLVPVRVRLYLALGVCVVLVPNLPS 60
ML++T+ Q W+ + PL RV AL+ T P++ + VP RV+L LA+ + + P+LP+
Sbjct: 1 MLQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60

Query: 61 MPQVDALSMKAMLLIGEQILVGALLGFSLQLLFHAFVIAGQIISMQMGLGFASMVDPANG 120
S A+ L +QIL+G LGF++Q F A AG+II +QMGL FA+ VDPA+
Sbjct: 61 NDVPV-FSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASH 119

Query: 121 VSVPVLGQFFTMLVTLLFLAMNGHLVVFEVIAESFVTLPVGEGLSGNHFWI-IAGKLGWV 179
+++PVL + ML LLFL NGHL + ++ ++F TLP+G ++ ++ + +
Sbjct: 120 LNMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLI 179

Query: 180 MGAALLLALPAITALLVVNLAFGAMTRAAPQLNIFSIGFPLTLVLGLVILWI 231
L+LALP IT LL +NLA G + R APQL+IF IGFPLTL +G+ ++
Sbjct: 180 FLNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAA 231


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1447TYPE3IMQPROT559e-14 Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein

family signature.
Length = 86

Score = 54.8 bits (132), Expect = 9e-14
Identities = 24/75 (32%), Positives = 43/75 (57%)

Query: 7 LDLFREALWLTAMIVGVLVVPSLLVGLVVAMFQAATQINEQTLSFLPRLMVILLTLIVLG 66
+ +AL+L ++ G + + ++GL+V +FQ TQ+ EQTL F +L+ + L L +L
Sbjct: 5 VFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLLS 64

Query: 67 PWLLRQLMEYTQTLI 81
W L+ Y + +I
Sbjct: 65 GWYGEVLLSYGRQVI 79


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1446FLGBIOSNFLIP2642e-91 Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP

signature.
Length = 245

Score = 264 bits (676), Expect = 2e-91
Identities = 140/242 (57%), Positives = 176/242 (72%), Gaps = 3/242 (1%)

Query: 11 LAALCLLLLAPWPALAADPTSISAITVTTNGQGQQEYSVSLQILLIMTALSFIPAFVMLM 70
L+ +LL P A + IT G Q +S+ +Q L+ +T+L+FIPA +++M
Sbjct: 5 LSVAPVLLWLITPLAFA---QLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLMM 61

Query: 71 TSFTRIIIVFSILRQALGLQSTPSNQVLVGLALFLTMFVMAPVFDKINSQALQPYLNEQI 130
TSFTRIIIVF +LR ALG S P NQVL+GLALFLT F+M+PV DKI A QP+ E+I
Sbjct: 62 TSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEKI 121

Query: 131 PAQEALQKAEVPLKAFMLAQTRTSDLELFVRLSKRTDIGSPEATPLTILVPAFVTSELKT 190
QEAL+K PL+ FML QTR +DL LF RL+ + PEA P+ IL+PA+VTSELKT
Sbjct: 122 SMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELKT 181

Query: 191 AFQIGFMIFIPFLIIDLVVSSVLMAMGMMMLSPLIISLPFKIMLFVLVDGWALIIGTLAG 250
AFQIGF IFIPFLIIDLV++SVLMA+GMMM+ P I+LPFK+MLFVLVDGW L++G+LA
Sbjct: 182 AFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLAQ 241

Query: 251 SF 252
SF
Sbjct: 242 SF 243


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1444FLGMOTORFLIN1208e-38 Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 120 bits (301), Expect = 8e-38
Identities = 62/145 (42%), Positives = 90/145 (62%), Gaps = 24/145 (16%)

Query: 13 ALADEWAAALAE-AGDASQDDIDALMAQGGATPVAEPSTPRAPMEEFGASPKAPTISGLE 71
AL D WA AL E ++ DA+ Q G V+
Sbjct: 14 ALDDLWADALNEQKATTTKSAADAVFQQLGGGDVSGAMQ--------------------- 52

Query: 72 GPNLDVILDIPVTISMEVGHTDISIRNLLQLNQGSVIELDRLAGEPLDVLVNGTLIAHGE 131
++D+I+DIPV +++E+G T ++I+ LL+L QGSV+ LD LAGEPLD+L+NG LIA GE
Sbjct: 53 --DIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGE 110

Query: 132 VVVVNEKFGIRLTDVISPSERIKKL 156
VVVV +K+G+R+TD+I+PSER+++L
Sbjct: 111 VVVVADKYGVRITDIITPSERMRRL 135


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1443FLGMOTORFLIM2592e-87 Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 259 bits (664), Expect = 2e-87
Identities = 98/326 (30%), Positives = 167/326 (51%), Gaps = 13/326 (3%)

Query: 5 DLLSQDEIDALLHGVDDGLVETEVEATPG-----SVKSYDLTSQDRIVRGRMPTLEMINE 59
++LSQDEID LL + G + +E + YD D+ + +M TL +++E
Sbjct: 3 EVLSQDEIDQLLTAISSG--DASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHE 60

Query: 60 RFARYTRISMFNLLRRSADVAVGGVQVMKFGEYVHSLYVPTSLNLVKMKPLRGTALFILD 119
FAR T S+ LR V V V + + E++ S+ P++L ++ M PL+G A+ +D
Sbjct: 61 TFARLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEVD 120

Query: 120 AKLVFKLVDNFFGGDGRHAKIEGREFTPTELRVVRMVLEQAFVDLKEAWQAVLEMNFEYV 179
+ F ++D FGG G+ AK++ R+ T E V+ V+ + +++E+W V+++
Sbjct: 121 PSITFSIIDRLFGGTGQAAKVQ-RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRLG 179

Query: 180 NSEVNPAMANIVSPSEVVVVSTFHIELDGGGGDLHITMPYSMIEPIREMLDAGF--QSDH 237
E NP A IV PSE+VV+ T ++ G ++ +PY IEPI L + F S
Sbjct: 180 QIETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSSVR 239

Query: 238 DDQDERWIKALREDVLDVQVPLGATVVRRQLKLRDILHMQPGDVIPVE---MPEHMVMRA 294
+++ LR+ + V + + A V +L +RDIL ++ GD+I + + + V+
Sbjct: 240 RSSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVLSI 299

Query: 295 NGVPAFKVKLGAHKGNLALQILEAVE 320
F + G +A QILE +E
Sbjct: 300 GNRKKFLCQPGVVGKKIAAQILERIE 325


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1441FLGHOOKFLIK522e-09 Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 52.1 bits (124), Expect = 2e-09
Identities = 73/302 (24%), Positives = 115/302 (38%), Gaps = 14/302 (4%)

Query: 126 LLDENTQATLLPPAVPTASSAPASLTEASSDPTLVKLNGVPAVNMALEQGAQDAAQTAKG 185
+ DE + +T L A A +A A + V A AL T K
Sbjct: 88 INDEQSTSTPLTTAQTMALAAVADKNTTKDEKADDLNEDVTASLSALFAMLPGFDNTPKV 147

Query: 186 GPAKSADPRQANLGDALAGLTSDSLTKAVDGKALEAQLQQTAEPAVASAASESLLESKAE 245
A S LTS+ LT A A Q P VA A S++ + S
Sbjct: 148 TDAPST-VLPTEKPTLFTKLTSEQLTTAQPDDAPGTPAQPLT-PLVAEAQSKAEVISTPS 205

Query: 246 PRGEPFAAKLNGLTQAMAQQALTNRPVNGTVPGQPVAMQQNGWSEAVVDRVMWMSSQNLK 305
P AA +T Q T P + + W +++ + + Q +
Sbjct: 206 PVT---AAASPLITPHQTQPLPT-----VAAPVLSAPLGSHEWQQSLSQHISLFTRQGQQ 257

Query: 306 SAEIQLDPAELGRLDVRIHMTADQTQVTFASPNAGVRDALESQMHRLRDMFSQQGMNQLD 365
SAE++L P +LG + + + + +Q Q+ SP+ VR ALE+ + LR ++ G+
Sbjct: 258 SAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAALPVLRTQLAESGIQLGQ 317

Query: 366 VNVSDQSLARGWQGQQQGEGGSARGRGLAGEASGDEETLAGVSEIRSRPGASAARGLVDY 425
N+S +S + G Q + S R A D++TL ++ G VD
Sbjct: 318 SNISGESFS-GQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVSLQ---GRVTGNSGVDI 373

Query: 426 YA 427
+A
Sbjct: 374 FA 375


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1438PF06580290.035 Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.1 bits (65), Expect = 0.035
Identities = 17/93 (18%), Positives = 30/93 (32%), Gaps = 5/93 (5%)

Query: 316 EETSLAGEIATTVDFLEVI----FDEAGVGIEVRGEAR-ALVERALFQRAVTNLLYNAAQ 370
+ SLA E+ +L++ D ++ V L Q V N + +
Sbjct: 214 RQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGIA 273

Query: 371 HTAAGGTLRVGVERRGDEVRVAVSNPGVPIADE 403
GG + + + V + V N G
Sbjct: 274 QLPQGGKILLKGTKDNGTVTLEVENTGSLALKN 306


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1437HTHFIS861e-21 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 86.0 bits (213), Expect = 1e-21
Identities = 33/129 (25%), Positives = 62/129 (48%), Gaps = 2/129 (1%)

Query: 2 RVLIVEDEAKTADYLNRGLSEQGFTVDLADNGIDGRHLALHGEYDVIVLDVMLPGVDGYG 61
+L+ +D+A LN+ LS G+ V + N G+ D++V DV++P + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VLRALRERR-QTPVIMLTARERVEDRVRGLREGADDYLIKPFSFLELVARL-QALTRRGG 119
+L +++ R PV++++A+ ++ +GA DYL KPF EL+ + +AL
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 120 NHESHSQMR 128

Sbjct: 125 RPSKLEDDS 133


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1436ACRIFLAVINRP7620.0 Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 762 bits (1968), Expect = 0.0
Identities = 289/1040 (27%), Positives = 490/1040 (47%), Gaps = 36/1040 (3%)

Query: 7 ISGWCVRHPIATALLTLASLLLGLLAFLRLGVAPLPEADFPTIQINALLPGGSPETMASS 66
++ + +R PI +L + ++ G LA L+L VA P P + ++A PG +T+ +
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 67 VATPLEVQFSAIPGITEMTSSSA-LGTTTLTLQFSLDKSIDVAAQEVQAAINAAAGRLPV 125
V +E + I + M+S+S G+ T+TL F D+A +VQ + A LP
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 126 DMPNLPTWRKVNPADSPIMILRVNSE--MMPLIELSDYAETILARQLSQVNGVGQIFVVG 183
++ + S +M+ S+ ++SDY + + LS++NGVG + + G
Sbjct: 121 EVQQ-QGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG 179

Query: 184 QQRPAIRIQAQPEKLAAYQLTLADLRQSLQSASVNLAKGALYGEGRVS------TLAAND 237
Q A+RI + L Y+LT D+ L+ + +A G L G + ++ A
Sbjct: 180 AQY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQT 238

Query: 238 QLFNASDYDDLVV-AYRQGAPVFLKDVARIVSAPEDDYVQAWPNGVPGVALVILRQPGAN 296
+ N ++ + + G+ V LKDVAR+ E+ V A NG P L I GAN
Sbjct: 239 RFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGAN 298

Query: 297 IVDTADAIQAALPRLREMLPATIEVDVLNDRTRTIRSSLHEVELTLLLTIGLVVLVMGLF 356
+DTA AI+A L L+ P ++V D T ++ S+HEV TL I LV LVM LF
Sbjct: 299 ALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLF 358

Query: 357 LRQLSATLIVATVLAVSLSASFAAMYVLGFTLNNLTLVALIIAVGFIVDDAIVVVENIHR 416
L+ + ATLI + V L +FA + G+++N LT+ +++A+G +VDDAIVVVEN+ R
Sbjct: 359 LQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVER 418

Query: 417 HL-EAGASKVEAALKGAAEIGFTVISISFSLIAAFIPLLFMGGIVGRLFREFAVSVTVAI 475
+ E EA K ++I ++ I+ L A FIP+ F GG G ++R+F++++ A+
Sbjct: 419 VMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAM 478

Query: 476 LISVVASLTLAPMLASRFM-PALRHAEAPRKGFAEW-------LTGGYERGLRWALGHQR 527
+SV+ +L L P L + + P + GF W Y + LG
Sbjct: 479 ALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTG 538

Query: 528 LMLVGFAFTVLVAVAGYVGIPKGFFPLQDTAFVFGTSQAAEDISYDDMVAKHRQLAEIIA 587
L+ +A V V ++ +P F P +D Q + + Q+ +
Sbjct: 539 RYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYL 598

Query: 588 SDPA--VQSYNHAVGVTGGSQSLANGRFWIVLKDRGERDV---SVGEFIDRLRPQLAKVP 642
+ V+S G + Q+ G ++ LK ER+ S I R + +L K+
Sbjct: 599 KNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIR 658

Query: 643 GIMLYLRAAQDINLSSGPSRTQYQYAL---RSSDSTQLALWAQRLTERLKQVPG-LMDVS 698
+ I + T + + L L +L Q P L+ V
Sbjct: 659 DGFVIPFNMPAI--VELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVR 716

Query: 699 NDLQVGASVTALDIDRVAAARFGLSAEDVSQTLYDAFGQRQVGEYQTEVNQYKVVLELDA 758
+ + L++D+ A G+S D++QT+ A G V ++ K+ ++ DA
Sbjct: 717 PNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADA 776

Query: 759 RQRGRAESLDWFYLRSPLSGEMVPLSAIAKVAAPRSGPLQINHNGMFPAVNLSFNLAAGV 818
+ R E +D Y+RS +GEMVP SA G ++ P++ + A G
Sbjct: 777 KFRMLPEDVDKLYVRSA-NGEMVPFSAFTTSH-WVYGSPRLERYNGLPSMEIQGEAAPGT 834

Query: 819 SLGEAVQAVQRAQEEIGMPSTIIGVFQGAAQAFQSSLASQPLLILAALIAVYIILGVLYE 878
S G+A+ ++ + +P+ I + G + + S P L+ + + V++ L LYE
Sbjct: 835 SSGDAMALMENLASK--LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYE 892

Query: 879 SFVHPLTILSTLPSAGIGAVFLLWAWGQDFSIMALIGIVLLIGIVKKNGILMVDFAIVAQ 938
S+ P++++ +P +G + + Q + ++G++ IG+ KN IL+V+FA
Sbjct: 893 SWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLM 952

Query: 939 REQGMSAEQAIYQACLTRFRPIMMTTLAALLGAIPLMIGFGTGSELRQPLGIAVVGGLLV 998
++G +A A R RPI+MT+LA +LG +PL I G GS + +GI V+GG++
Sbjct: 953 EKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVS 1012

Query: 999 SQVLTLFSTPVVYLALERLF 1018
+ +L +F PV ++ + R F
Sbjct: 1013 ATLLAIFFVPVFFVVIRRCF 1032


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1435RTXTOXIND507e-09 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 50.2 bits (120), Expect = 7e-09
Identities = 33/205 (16%), Positives = 65/205 (31%), Gaps = 60/205 (29%)

Query: 10 RVLVGVLAAGLVAFGGWAWLGGDAGAKAAPAPARVPVIVARVERRDVEQQVSGIGTVTSL 69
R++ + LV + LG VE + G +T
Sbjct: 58 RLVAYFIMGFLVIAFILSVLG------------------------QVEIVATANGKLTHS 93

Query: 70 HNV-VIRTQIDGQLTRLLVSEGQMVEAGELLATIDD-------RAVVAALEQAQASRASN 121
I+ + + ++V EG+ V G++L + ++L QA+ +
Sbjct: 94 GRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRY 153

Query: 122 QAQLKS--------------------AEQDLQRYRSLYAER--------AVSRQLLDQQQ 153
Q +S +E+++ R SL E+ LD+++
Sbjct: 154 QILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKR 213

Query: 154 ATVDQLRATLKANDATINAERVRLS 178
A + A + + E+ RL
Sbjct: 214 AERLTVLARINRYENLSRVEKSRLD 238



Score = 39.0 bits (91), Expect = 2e-05
Identities = 42/207 (20%), Positives = 78/207 (37%), Gaps = 16/207 (7%)

Query: 110 ALEQAQASRASNQAQLKSAEQDLQRYRSLYAERAVSRQLLDQ--QQATVDQLR-ATLKAN 166
A+ + + +L+ + L++ S QL+ Q + +D+LR T
Sbjct: 253 AVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIG 312

Query: 167 DAT--INAERVRLSYTRITSPVSGKVGIRNV-DVGNLVRVGDSLGLFSVTQIAPISVVFS 223
T + R + I +PVS KV V G +V ++L + V + + V
Sbjct: 313 LLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETL-MVIVPEDDTLEVTAL 371

Query: 224 LQQEQLLQLQALLGGEAAVRAY-SRDGGSALGEGRLLTIDNQIDSSTGTI-RVRASFD-- 279
+Q + + + V A+ G +G+ + + +D D G + V S +
Sbjct: 372 VQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEEN 431

Query: 280 -----NRQARLWPGQFVAVSLHTGVRR 301
N+ L G V + TG+R
Sbjct: 432 CLSTGNKNIPLSSGMAVTAEIKTGMRS 458


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1432AUTOINDCRSYN1535e-49 Autoinducer synthesis protein signature.
		>AUTOINDCRSYN#Autoinducer synthesis protein signature.

Length = 216

Score = 153 bits (387), Expect = 5e-49
Identities = 41/177 (23%), Positives = 74/177 (41%), Gaps = 6/177 (3%)

Query: 14 KLLGEMHKLRAQVFKERKGWDVSVIDEMEIDGYDALSPYYMLIQEDTPEAQVFGCWRILD 73
GE+ LR + FK+R W V D ME D YD + Y+ +D V R ++
Sbjct: 15 TKSGELFTLRKETFKDRLNWAVQCTDGMEFDQYDNNNTTYLFGIKDN---TVICSLRFIE 71

Query: 74 TTGPYMLKNTFPELLHGKEAPCSPHIWELSRFAINSGQKGSLGFSDCTLEAMRALARYSL 133
T P M+ TF P + E SRF ++ + + ++ + +M L+ +
Sbjct: 72 TKYPNMITGTFFPYFKEINIPEGNY-LESSRFFVDKSRAKDILGNEYPISSMLFLSMINY 130

Query: 134 QND--IQTLVTVTTVGVEKMMIRAGLDVSRFGPHLKIGIERAVALRIELNAKTQIAL 188
D + T+ + + ++ R+G + L ER + + ++ + Q AL
Sbjct: 131 SKDKGYDGIYTIVSHPMLTILKRSGWGIRVVEQGLSEKEERVYLVFLPVDDENQEAL 187


112PA1403PA1396N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA14030103.138356transcriptional regulator
PA14020103.122376hypothetical protein
PA14010102.837402hypothetical protein
PA14000102.570246pyruvate carboxylase
PA13990131.539154transcriptional regulator
PA1398016-1.084702hypothetical protein
PA1397022-2.630059two-component response regulator
PA1396-229-3.350014two-component sensor
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1403HTHTETR587e-13 TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 58.5 bits (141), Expect = 7e-13
Identities = 24/149 (16%), Positives = 54/149 (36%)

Query: 14 QPQQARSSELVASILEAAVQVLASEGAQRFTTARVAERAGVSIGSLYQYFPNKAAILFRL 73
+ + + E IL+ A+++ + +G + +A+ AGV+ G++Y +F +K+ + +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 74 QSDEWRRTTRLLGEILEDTTRPPLERLRRLVLAFVRSECEEAAIRVALSDAAPLYRDADE 133
L E PL LR +++ + S E R+ + E
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 134 AREVKAEGARVFQAFLREALPEVAEAERS 162
V+ + + +
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEA 151


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1400RTXTOXIND374e-04 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 36.7 bits (85), Expect = 4e-04
Identities = 32/195 (16%), Positives = 59/195 (30%), Gaps = 23/195 (11%)

Query: 432 RNLLLHPAVQANRVDTRFVESHLETLLAPIPASHPRLRAECPLA--------------ED 477
R L P + + F+ +HLE + P+ PRL A + E
Sbjct: 26 RKQLDTPVREK--DENEFLPAHLELIETPVSR-RPRLVAYFIMGFLVIAFILSVLGQVEI 82

Query: 478 AAPA--RVEAPLGSLPLSAPSSGVLVALEVADGERVRAGQRVAILEAMKMEFEVKAPGGG 535
A A ++ S + + ++ + V +GE VR G + L A+ E +
Sbjct: 83 VATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLK---- 138

Query: 536 IVRRLAASLGEPLEEGATLLFLEPTEDDDEQAPTEQALDLAHIRADLAEVLERQAALGDE 595
L + E +E + + + P E L +
Sbjct: 139 TQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTW 198

Query: 596 RRPQALARRRKTGQR 610
+ + +R
Sbjct: 199 QNQKYQKELNLDKKR 213


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1397HTHFIS575e-12 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 57.1 bits (138), Expect = 5e-12
Identities = 34/157 (21%), Positives = 58/157 (36%), Gaps = 5/157 (3%)

Query: 3 GRIIVADDHPLFREGMLSILQRLLPEARIEEAGDLAGVLRLADEGEQPDSLILDLRFPGL 62
I+VADD R + L R + + A + R G D ++ D+ P
Sbjct: 4 ATILVADDDAAIRTVLNQALSRA--GYDVRITSNAATLWRWIAAG-DGDLVVTDVVMPDE 60

Query: 63 TRIEMLADLRRRFPRTTLIVVSMVDDPQLIGEVMNAGADGFLGKSIAPEELGQAILAIRA 122
++L +++ P ++V+S + + GA +L K EL I RA
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG--RA 118

Query: 123 GEVLVRYEPSGLLPLQPSPRLEGLTERQLDVLRLLAQ 159
R Q L G + ++ R+LA+
Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLAR 155


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1396HTHFIS502e-08 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 49.8 bits (119), Expect = 2e-08
Identities = 31/124 (25%), Positives = 51/124 (41%), Gaps = 9/124 (7%)

Query: 416 LTGLRVCLVEDDRNVLRATSALLERWGCTVQ-AETEADGWRTDC----DILVVDYDLGPH 470
+TG + + +DD + + L R G V+ A WR D++V D + P
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVM-PD 59

Query: 471 ASGVECIERVRRQRGEAIPALVISGH-DIERIQASVEDTDIALLSKPVRPTELRATL-RA 528
+ + + R+++ +P LV+S + E L KP TEL + RA
Sbjct: 60 ENAFDLLPRIKK-ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 529 LRER 532
L E
Sbjct: 119 LAEP 122


113PA1384PA1377N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA1384133-5.819088UDP-glucose 4-epimerase
PA1383127-4.924545hypothetical protein
PA1382-124-3.250018type II secretion system protein
PA1381-214-0.793694hypothetical protein
PA1380-1141.215340transcriptional regulator
PA1379-2141.055476short-chain dehydrogenase
PA1378-2151.817305hypothetical protein
PA1377429-5.323086hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1384NUCEPIMERASE1848e-58 Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 184 bits (468), Expect = 8e-58
Identities = 85/353 (24%), Positives = 142/353 (40%), Gaps = 51/353 (14%)

Query: 1 MRVLVTGGAGFIGSHVLVELLGQGAKVVVLDNLVNGSSESLK--RVERITGHPVGFVLGD 58
M+ LVTG AGFIG HV LL G +VV +DNL + SLK R+E + F D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 59 VRDSLLVERLLIDEKVDAVIHLAGLKAVGESVDDPLEYYESNVQGTISLLRAMQRVGVFK 118
+ D + L + V AV S+++P Y +SN+ G +++L + +
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 119 IVFSSSATIYQMPGTLPISESSKVGGVASPYGRTKLTAEHM------LDDLARSDTRWSI 172
++++SS+++Y + +P S V S Y TK E M L L
Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGL-------PA 173

Query: 173 AVLRYFNPIGAHESGLIGEDPCGTPNNLLPYIAQVAVGRLSRLTVHGGDYPTI--DGTGV 230
LR+F G P G P+ +A+ + ++ + G + G
Sbjct: 174 TGLRFFTVYG----------PWGRPD--------MALFKFTKAMLEGKS-IDVYNYGKMK 214

Query: 231 RDYIHVCDLAAGHTRALEYLGQGHG---------------YHVWNLGTGTGYSVLQVIEA 275
RD+ ++ D+A R + + Y V+N+G + ++ I+A
Sbjct: 215 RDFTYIDDIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQA 274

Query: 276 FERVSGRRIPFTVSGRRPGDVAECWADVSKAERELGWKAGLGLECMIADAWRW 328
E G + +PGDV E AD +G+ ++ + + W
Sbjct: 275 LEDALGIEAKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNW 327


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1382BCTERIALGSPD1973e-56 Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D

signature.
Length = 660

Score = 197 bits (503), Expect = 3e-56
Identities = 129/668 (19%), Positives = 254/668 (38%), Gaps = 109/668 (16%)

Query: 104 SLNVEDVQLAAFINEVFGNILGLPFEIESALKEKTDRVTVRLEQPQTAQMVYEVARQVLV 163
S + + + FIN V L I+ +++ +TVR + Y+ VL
Sbjct: 31 SASFKGTDIQEFINTV-SKNLNKTVIIDPSVRGT---ITVRSYDMLNEEQYYQFFLSVLD 86

Query: 164 NYGVEILHQGDIYRFQIKQVGLSPDEPPILISGEARPSVPIAYRPVFQFVALHSVDPKDV 223
YG +++ + ++ + ++ +A P I V + V L +V +D+
Sbjct: 87 VYGFAVINMNNGVLKVVRSKDAKTA--AVPVASDAAPG--IGDEVVTRVVPLTNVAARDL 142

Query: 224 IPWLN--SAYEKSGLSVMADGARSGLMLKGMSSIVNQATEAVRLLDQPFMRGRHSLRIDP 281
P L + G V + + L++ G ++++ + V +D G S+ P
Sbjct: 143 APLLRQLNDNAGVGSVVHYEPSNV-LLMTGRAAVIKRLLTIVERVDNA---GDRSVVTVP 198

Query: 282 -AFVSAADMASQLKTVIAAQGYSVGIGEAVGSIMLVPLESSNGLIVFANDGQLLDLVREW 340
++ SAAD+ + + S G V +++ E +N ++V ++
Sbjct: 199 LSWASAADVVKLVTELNKDTSKSALPGSMVANVV--ADERTNAVLVSGEPNSRQRIIAM- 255

Query: 341 AQQVDRAPMAVAAGIGEEKEGLFFYEARNTRVTELAKSLRALVSGFAGEGAYGITSGLQS 400
+Q+DR + + ++L + L GI+S +QS
Sbjct: 256 IKQLDRQQATQG--------NTKVIYLKYAKASDLVEVLT------------GISSTMQS 295

Query: 401 SASKRSGGGRRAGDDGAAPAVAPLLQAAGAAALVGGDGANGLLGGLAAGISGSGTIVEDE 460
K++ A D + I
Sbjct: 296 E--KQAAKPVAALDK-------------------------------------NIIIKAHG 316

Query: 461 NRNAILFRGAARTWQQMQGLLREMDKPARQVLIEVTVASVSLSDTQELGVEWEMLNGSFN 520
NA++ A ++ ++ ++D QVL+E +A V +D LG++W N
Sbjct: 317 QTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWANKNAGMT 376

Query: 521 SATSTGSK-GSAGKGGFNYVINT--------------------AGGNTAA-IQAMADNQR 558
T++G +A G Y + GN A + A++ + +
Sbjct: 377 QFTNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQGNWAMLLTALSSSTK 436

Query: 559 VRVLATPRILVKSGEQANINVGRDIPIPTAQVNDDSTTAGSTNLRNEIAYRSTGTILNVA 618
+LATP I+ +A NVG+++P+ T S T N+ N + ++ G L V
Sbjct: 437 NDILATPSIVTLDNMEATFNVGQEVPVLTG-----SQTTSGDNIFNTVERKTVGIKLKVK 491

Query: 619 PVVYSDSRVDLTVSQELSDSGGSSGGGGKASGGGISAPEISRTSLETSLTLKSGGSVLMG 678
P + V L + QE+S ++ G + ++ ++ + SG +V++G
Sbjct: 492 PQINEGDSVLLEIEQEVSSVADAASSTSSDLG-----ATFNTRTVNNAVLVGSGETVVVG 546

Query: 679 GLIRDNITDSNAGVPLLKDIPGIGFLFGRQKAVKTREEVIMLIQPYVLESDADAREVTEK 738
GL+ +++D+ VPLL DIP IG LF ++ +++ I+P V+ + R+ +
Sbjct: 547 GLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYRQASSG 606

Query: 739 LHAMLSKT 746
+ +
Sbjct: 607 QYTAFNDA 614


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1379DHBDHDRGNASE1103e-31 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 110 bits (277), Expect = 3e-31
Identities = 61/185 (32%), Positives = 86/185 (46%), Gaps = 3/185 (1%)

Query: 5 KTLLITGASSGFGQALAREALDAGHRVVGTVRSEEARSALEAVAPGQAFGR---LLDVTD 61
K ITGA+ G G+A+AR G + + E + + +A DV D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 62 LAAIEPTVAAIERDIGPLDVLVNSAGYGHEGILEESPLAEMRRQFEVNLFGAVAMIQAVL 121
AAI+ A IER++GP+D+LVN AG G++ E F VN G ++V
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 122 PYMRRRRRGHILNITSMGGYITMPGIAYYCGSKFALEGVSEALGKEVAGLGIAVTAVAPG 181
YM RR G I+ + S + +A Y SK A ++ LG E+A I V+PG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 182 SFRTD 186
S TD
Sbjct: 189 STETD 193


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1377SACTRNSFRASE379e-06 Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 37.2 bits (86), Expect = 9e-06
Identities = 15/51 (29%), Positives = 23/51 (45%), Gaps = 1/51 (1%)

Query: 93 VAVAWQGKGVGSRLLGELLDIADNWMNLRRVELTVYTDNAPALALYRKFGF 143
VA ++ KGVG+ LL + ++ A + + L N A Y K F
Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKE-NHFCGLMLETQDINISACHFYAKHHF 146


114PA1250PA1231N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA1250-1100.917413alkaline proteinase inhibitor AprI
PA1249-2100.728896alkaline metalloproteinase
PA1248-391.904279alkaline protease secretion protein AprF
PA1247-292.305410alkaline protease secretion protein AprE
PA1246-182.252545alkaline protease secretion ATP-binding protein
PA1245092.074047hypothetical protein
PA1244-192.408923hypothetical protein
PA1243-193.514480sensor/response regulator hybrid protein
PA1242084.512510hypothetical protein
PA12411104.120091transcriptional regulator
PA1240294.466159enoyl-CoA hydratase
PA12391104.252447hypothetical protein
PA12381104.070914multidrug efflux pump outer membrane protein
PA12370123.255598multidrug resistance efflux pump
PA12360142.541302major facilitator superfamily transporter
PA12350162.643991transcriptional regulator
PA12340132.564434hypothetical protein
PA12330122.907887hypothetical protein
PA12321133.255663hypothetical protein
PA12310123.029666hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1250MPTASEINHBTR1295e-42 Metalloprotease inhibitor signature.
		>MPTASEINHBTR#Metalloprotease inhibitor signature.

Length = 122

Score = 129 bits (325), Expect = 5e-42
Identities = 40/118 (33%), Positives = 58/118 (49%), Gaps = 9/118 (7%)

Query: 12 CLLCGFFSTGI-SMASSLILLSASDLAGQWTLQQDEAPAICHLELRDSEVAEASGYDLGG 70
F S G +MASS ++ S + +AGQ ++ +C +E A A L G
Sbjct: 11 VWQVLFVSAGAQAMASSFVVPSTAQMAGQLGIEATG-SGVC---AGPAEQANA----LAG 62

Query: 71 DTACLTRWLPSEPRAWRPTPAGIALLERGGLTLMLLGRQGEGDYRVQKGDGGQLVLRR 128
D AC +WL +P +W PTP GI L+ G + L RQ EG+Y + G + L+R
Sbjct: 63 DVACAEQWLGDKPVSWSPTPDGIWLMNAEGTGITHLNRQKEGEYTGRTPSGADVTLQR 120


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1249CABNDNGRPT418e-145 NodO calcium binding signature.
		>CABNDNGRPT#NodO calcium binding signature.

Length = 479

Score = 418 bits (1077), Expect = e-145
Identities = 255/480 (53%), Positives = 319/480 (66%), Gaps = 29/480 (6%)

Query: 10 GRSDAYTQVDNFLHAYARGGDELVNGHPSYTVDQAAEQILREQASWQKAPGDSVLTLSYS 69
S AY V +FL + RG VNG SY++DQAA QI RE SW G +V S
Sbjct: 19 NTSSAYNSVYDFLRYHDRGDGLTVNGKTSYSIDQAAAQITRENVSWN---GTNVFGKSA- 74

Query: 70 FLTKPNDFFNTPWKYVSDIYSLGK----FSAFSAQQQAQAKLSLQSWSDVTNIHFVDAGQ 125
N +K++ + S+ F F+A+Q QAKLSLQSWSDV N+ F +
Sbjct: 75 ---------NLTFKFLQSVSSIPSGDTGFVKFNAEQIEQAKLSLQSWSDVANLTFTEVTG 125

Query: 126 GDQGDLTFGNFSSSVGG------AAFAFLPDVPDALKGQSWYLINSSYSANVNPANGNYG 179
++TFGN++ G A+A+ P G SWY N S NP + YG
Sbjct: 126 NKSANITFGNYTRDASGNLDYGTQAYAYYPGNYQG-AGSSWYNYNQSN--IRNPGSEEYG 182

Query: 180 RQTLTHEIGHTLGLSHPGDYNAGEGDPTYADATYAEDTRAYSVMSYWEEQNTGQDFKGAY 239
RQT THEIGH LGL+HPG+YNAGEGDP+Y DA YAED+ +S+MSYW E TG D+ G Y
Sbjct: 183 RQTFTHEIGHALGLAHPGEYNAGEGDPSYNDAVYAEDSYQFSIMSYWGENETGADYNGHY 242

Query: 240 SSAPLLDDIAAIQKLYGANLTTRTGDTVYGFNSNTERDFYSATSSSSKLVFSVWDAGGND 299
AP++DDIAAIQ+LYGAN+TTRTGD+VYGFNSNT+RDFY+AT SS L+FSVWDAGG D
Sbjct: 243 GGAPMIDDIAAIQRLYGANMTTRTGDSVYGFNSNTDRDFYTATDSSKALIFSVWDAGGTD 302

Query: 300 TLDFSGFSQNQKINLNEKALSDVGGLKGNVSIAAGVTVENAIGGSGSDLLIGNDVANVLK 359
T DFSG+S NQ+INLNE + SDVGGLKGNVSIA GVT+ENAIGGSG+D+L+GN N+L+
Sbjct: 303 TFDFSGYSNNQRINLNEGSFSDVGGLKGNVSIAHGVTIENAIGGSGNDILVGNSADNILQ 362

Query: 360 GGAGNDILYGGLGADQLWGGAGADTFVYGDIAESSAAAPDTLRDFVSGQDKIDLSGLDAF 419
GGAGND+LYGG GAD L+GGAG DTFVYG +S+ AA D + DF G DKIDLS AF
Sbjct: 363 GGAGNDVLYGGAGADTLYGGAGRDTFVYGSGQDSTVAAYDWIADFQKGIDKIDLS---AF 419

Query: 420 VNGGLVLQYVDAFAGKAGQAILSYDAASKAGSLAIDFSGDAHADFAINLIGQATQADIVV 479
N G + D F GK + +L +DAA+ +L + +G + DF + ++GQA Q+DI+V
Sbjct: 420 RNEGQLSFVQDQFTGKGQEVMLQWDAANSITNLWLHEAGHSSVDFLVRIVGQAAQSDIIV 479


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1247RTXTOXIND438e-154 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 438 bits (1128), Expect = e-154
Identities = 99/423 (23%), Positives = 181/423 (42%), Gaps = 2/423 (0%)

Query: 11 AYARLGWLLVLFGFGGALLWAAFAPLDQGVAVPATVIISGQRKSVQHPLGGVVKHILVRD 70
RL ++ A + + ++ + SG+ K ++ +VK I+V++
Sbjct: 55 RRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKE 114

Query: 71 GQHVEAGEPLIRMEPTQARANVDSLLNRYANARLNQARLQAEYDGRRTLEMPA-GLAEQA 129
G+ V G+ L+++ A A+ + ARL Q R Q ++P L ++
Sbjct: 115 GESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEP 174

Query: 130 PLPTLGERLEL-QRQLLHSRQTALANELSALRANIEGLRAQLEGLRQTEGNQRLQQRLLN 188
+ E L L+ + + N+ N++ RA+ + R+
Sbjct: 175 YFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEK 234

Query: 189 SQLSGARDLAEEGYMPRNQLLEQERQLAEVNARLSESSGRFGQIRQSIAEAQMRIAQREE 248
S+L L + + ++ +LEQE + E L + QI I A+ +
Sbjct: 235 SRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQ 294

Query: 249 EYRKEVNGQLAETQVNARTLWEELSSARYELRHAEIRAPVSGYVAGLKVFTDGGVIGPGE 308
++ E+ +L +T N L EL+ + + IRAPVS V LKV T+GGV+ E
Sbjct: 295 LFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE 354

Query: 309 LLMYIVPNSDSLEVEGQLAVNLVDRIHSGLPVEMLFTAFNQSKTPRVTGEVTMVSADRLL 368
LM IVP D+LEV + + I+ G + AF ++ + G+V ++ D +
Sbjct: 355 TLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIE 414

Query: 369 DEQNKQPYYALRAQVDAAAMGKLKGLQIRPGMAVQVFVRTGERSLLNYLFKPLFDRAHVA 428
D++ + + + + K + + GMAV ++TG RS+++YL PL + +
Sbjct: 415 DQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISYLLSPLEESVTES 474

Query: 429 LAE 431
L E
Sbjct: 475 LRE 477


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1243HTHFIS823e-18 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 81.8 bits (202), Expect = 3e-18
Identities = 33/119 (27%), Positives = 52/119 (43%), Gaps = 5/119 (4%)

Query: 742 THVLLVDDDRMVRYTTALLLGDLGYQVSEAASAEEALGEVERGLAPDLLVTDHLMADKTG 801
+L+ DDD +R L GY V ++A + G DL+VTD +M D+
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAG-DGDLVVTDVVMPDENA 62

Query: 802 VQLAEELRQRFPQLPVLVITGYANL----RPEQLNGFEVLTKPFRHNELAERLARLLEA 856
L +++ P LPVLV++ + + ++ L KPF EL + R L
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1242SUBTILISIN883e-21 Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 88.4 bits (219), Expect = 3e-21
Identities = 60/293 (20%), Positives = 104/293 (35%), Gaps = 51/293 (17%)

Query: 256 VRIGVIERDVDFDAPDFADYLGPCKAPAPRTCLYARDAERPDNHGSTVAGILAARWDQGG 315
V++ V++ D D PD + + + + HG+ VAG +AA
Sbjct: 43 VKVAVLDTGCDADHPDLKARIIGGRNFTDDDEGDPEIFKDYNGHGTHVAGTIAA----TE 98

Query: 316 NSGFLRGLDRASQGFEVIVERNSDAGITANVAASVN-LVEDGVRVLNWSWGIHRVGARDV 374
N + G+ + + V +G + + +E V +++ S G
Sbjct: 99 NENGVVGVAPEADLLIIKVLNKQGSGQYDWIIQGIYYAIEQKVDIISMSLG------GPE 152

Query: 375 DGDEVDSLVRSGIAMSGYEELLEEFFLWLRKEHPDVLVVNSAGN-GSSYSGTDEYRLPSS 433
D E+ V+ +A +LV+ +AGN G TDE P
Sbjct: 153 DVPELHEAVKKAVA-------------------SQILVMCAAGNEGDGDDRTDELGYPGC 193

Query: 434 FVTEQLLVVGGHQRSERQGLAVDDPAYAVKRSTSNVDMRVDVTAAACTHASTLERDARGE 493
+ +++ VG A++ +A SN + VD+ A ST+ +
Sbjct: 194 Y--NEVISVG----------AINFDRHAS--EFSNSNNEVDLVAPGEDILSTV-PGGKYA 238

Query: 494 VHCGTSYATPMVAGTVAAMLSLNPRLR-----PEEIRMLLRRSAMTIGGDYDF 541
GTS ATP VAG +A + L E+ L + + +G
Sbjct: 239 TFSGTSMATPHVAGALALIKQLANASFERDLTEPELYAQLIKRTIPLGNSPKM 291


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1241HTHTETR601e-13 TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 60.0 bits (145), Expect = 1e-13
Identities = 25/171 (14%), Positives = 53/171 (30%), Gaps = 11/171 (6%)

Query: 8 RDELLQRCAGTFRRYGYHGTTMEMLSSACGLTKASFYHHYPNKEALLRDVLEWTHQRLAE 67
R +L F + G T++ ++ A G+T+ + Y H+ +K L ++ E + + E
Sbjct: 13 RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGE 72

Query: 68 TLFSIAYDPLLTPRERLEKLGRKAARLFQDDSIGCLMGVVAVDASYGRSELMAPIRSFLD 127
P L ++ + L+ + + +
Sbjct: 73 LELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIF-----HKCEFVGEMAVVQ 127

Query: 128 DWAQAFAQLYRPAFDEA--QALERGRQLVADFEGAILLARIYGEPGYIDGV 176
+ ++ +E L AD + GYI G+
Sbjct: 128 QAQRNLCLESYDRIEQTLKHCIEAK-MLPADLMTRRAAIIMR---GYISGL 174


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1237RTXTOXIND1225e-33 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 122 bits (307), Expect = 5e-33
Identities = 61/368 (16%), Positives = 110/368 (29%), Gaps = 68/368 (18%)

Query: 66 AVSAQVSGYVAEVLVADDADVQAGDLLLRLDPRDFR-------QRLRAAEAREAAAQAAL 118
+ + V E++V + V+ GD+LL+L L A + Q
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILS 157

Query: 119 EAQ-------------------------------RAKLETLDRQLLEQAQTISRARADGE 147
+ + + T Q ++ + + RA+
Sbjct: 158 RSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERL 217

Query: 148 AARAEWRRAETDWR-------RYRQLADEHATSRQRLENADAAHQRARAAARRASAEEGR 200
A R E R + L + A ++ + + + A R ++ +
Sbjct: 218 TVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQ 277

Query: 201 QRAAKDVLKSR--------RREAEAALAQRQAELQEAAAARELARHALDDTEIRAPFAGR 252
+ K + E L Q + + IRAP + +
Sbjct: 278 IESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVK 337

Query: 253 VGQRKVRLR-QYVTPGLPLLAVVPLEQAYVV-ANYKETQLERIRPGQPVELEVDTFGRRW 310
V Q KV VT L+ +VP + V A + + I GQ ++V+ F
Sbjct: 338 VQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTR 397

Query: 311 RGRVDSVAPASGAVFALLPPDNATGNFTKIVQRFPVRIRLDADAAERG----RLLPGMSV 366
G + + D +V F V I ++ + G L GM+V
Sbjct: 398 YGYLV-------GKVKNINLDAIEDQRLGLV--FNVIISIEENCLSTGNKNIPLSSGMAV 448

Query: 367 IATVDTRE 374
A + T
Sbjct: 449 TAEIKTGM 456


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1236TCRTETB1097e-28 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 109 bits (273), Expect = 7e-28
Identities = 79/402 (19%), Positives = 168/402 (41%), Gaps = 17/402 (4%)

Query: 23 FMAGMNVHVTSAALPEIEGALGATFEEGSWISTAYLVAEISMIPLTAWLVEVFSLRRVML 82
F + +N V + +LP+I +W++TA+++ + L + ++R++L
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 83 LGSLVFLLSSLSCALAPN-LSTLILIRVIQGASGAVLIPLSMQLILTELPSSRIPLGMAL 141
G ++ S+ + + S LI+ R IQGA A L M ++ +P L
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 142 FSLSNSVAQAAGPSIGGWLADAYSWRWIFLLQLLPGIALLAAVAWSIRPRDGDRERLRQA 201
++ + GP+IGG +A W ++ L+ ++ I + + + R +
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKL---LKKEVRIK-GHF 199

Query: 202 DWLGIGAMVAGLGALQIVLEEGGRRDWFESGFIRTFAVLAVLALLLFVQRQLWGARPFIN 261
D GI M G+ + F + + +F +++VL+ L+FV+ PF++
Sbjct: 200 DIKGIILMSVGIVFFML----------FTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVD 249

Query: 262 LRLLGSYNFGVSSLAMAVFGAATFGLVFLVPNYLSQLQGFNARQIGDSLILYGLVQLLL- 320
L + F + L + G V +VP + + + +IG +I G + +++
Sbjct: 250 PGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIF 309

Query: 321 APLLPRLMRWLNPKLLVAGGFAIMALGCWMGAHLNADAGRNVIIPSIVVRGIGQPLIMVA 380
+ L+ P ++ G +++ ++ A + + IV G
Sbjct: 310 GYIGGILVDRRGPLYVLNIGVTFLSVS-FLTASFLLETTSWFMTIIIVFVLGGLSFTKTV 368

Query: 381 LSVLAVKGLDKAQAGSASALISMLRNLGGAIGTALLTQLVSL 422
+S + L + +AG+ +L++ L G A++ L+S+
Sbjct: 369 ISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSI 410


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1235HTHFIS339e-04 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 33.3 bits (76), Expect = 9e-04
Identities = 14/103 (13%), Positives = 31/103 (30%), Gaps = 6/103 (5%)

Query: 87 RHDLPRDCRVVDVPPLLRQLIVAAMRIAPDYPPGGRDERVMELILDELRVLPILALHVPQ 146
R + + R + + + ++ + D L + + +
Sbjct: 376 REIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAE 435

Query: 147 PVDPRLAALCRSLRAEPAADWSLGDAARRLGVSPRTLTRAFQR 189
P + L A A + AA LG++ TL + +
Sbjct: 436 MEYPLI------LAALTATRGNQIKAADLLGLNRNTLRKKIRE 472


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1231RTXTOXIND664e-14 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 65.6 bits (160), Expect = 4e-14
Identities = 43/214 (20%), Positives = 75/214 (35%), Gaps = 39/214 (18%)

Query: 79 RSYRLAVRQREAELEQARETLRQRDEQWRRRMQLAGAVSREEVANAGRALRIARARAEAA 138
R Y+ + Q E+E+ A+E + + ++ + LR
Sbjct: 269 RVYKSQLEQIESEILSAKEEYQLVTQLFKNEIL--------------DKLRQTTDNIGLL 314

Query: 139 GAALDQARLDLRRSELRSPVDGYVTQLRVQ-PGDYAAAGRTNIFIV-DRRSFWVTGYFEE 196
L + + S +R+PV V QL+V G T + IV + + VT +
Sbjct: 315 TLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQN 374

Query: 197 TKLRNVQVGAPATIKLMGFD----PLLDGHVASIGRGVADLNESRADSGLPQVSPNFSWI 252
+ + VG A IK+ F L G V +I D+ Q
Sbjct: 375 KDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNIN----------LDAIEDQRLGLV--- 421

Query: 253 RLAQRVPVRIELDRVPA---GVVLAAGMTGSVEV 283
V + IE + + + L++GM + E+
Sbjct: 422 ---FNVIISIEENCLSTGNKNIPLSSGMAVTAEI 452



Score = 47.5 bits (113), Expect = 3e-08
Identities = 18/114 (15%), Positives = 41/114 (35%), Gaps = 3/114 (2%)

Query: 41 VSAQVIRIAPEVSGSVEAVFVADNQRVARGDPLYRIDPRSYRLAVRQREAELEQARETLR 100
S + I P + V+ + V + + V +GD L ++ + ++ L QAR +
Sbjct: 92 HSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLE-Q 150

Query: 101 QRDEQWRRRMQLAGAVSREEVANAGRALRIARARAEAAGAALDQARLDLRRSEL 154
R + R ++L E + +L + + +++
Sbjct: 151 TRYQILSRSIELNKL--PELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQK 202


115PA1165PA1157N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA1165-1142.7149424'-phosphopantetheinyl transferase
PA11640141.938589hypothetical protein
PA1163-1101.934683glycosyl transferase
PA1162-1112.151691succinyl-diaminopimelate desuccinylase
PA11610150.297162rRNA methyltransferase
PA1160016-1.135680hypothetical protein
PA1159114-1.596278cold-shock protein
PA1158013-1.037022two-component sensor
PA1157013-2.713373two-component response regulator
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1165ENTSNTHTASED892e-23 Enterobactin synthetase component D signature.
		>ENTSNTHTASED#Enterobactin synthetase component D signature.

Length = 234

Score = 88.5 bits (219), Expect = 2e-23
Identities = 62/200 (31%), Positives = 93/200 (46%), Gaps = 12/200 (6%)

Query: 15 LDDRWPLPVALPGVQLRSTRFDPALLQPGDFALAGIQPPANILRAVAKRQAEFLAGRLCA 74
L +PLP A G +L FD + + D L + + A KR+AE LAGR+ A
Sbjct: 2 LTSHFPLPFA--GHRLHIVDFDASSFREHD--LLWLPHHDRLRSAGRKRKAEHLAGRIAA 57

Query: 75 RAALFALDGRAQTPAVGEDRAPVWPAAISGSITHGDRWAAALVAARGDWRGLGLDVETLL 134
AL + G P +G+ R P+WP + GSI+H A A+++ + +G+D+E ++
Sbjct: 58 VHALREV-GVRTVPGMGDKRQPLWPDGLFGSISHCATTALAVISR----QRIGIDIEKIM 112

Query: 135 EAERARYLHGEILTEGERLRFADDLERRTGLLVTLAFSLKESLFKALYPLVGKRFYFEHA 194
A L I+ ER L L +TLAFS KES++KA + F A
Sbjct: 113 SQHTATELAPSIIDSDERQILQASL-LPFPLALTLAFSAKESVYKA-FSDRVTLPGFNSA 170

Query: 195 ELLEWRADGQARLRLLTDLS 214
++ A L LL +
Sbjct: 171 KVTSLTA-THISLHLLPAFA 189


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1164ISCHRISMTASE300.009 Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 30.0 bits (67), Expect = 0.009
Identities = 14/51 (27%), Positives = 18/51 (35%), Gaps = 3/51 (5%)

Query: 109 MAEYIVDF--DYLIDCIDSVAAKAALIAWCKRRKIPVITTGGAGGQVDPTQ 157
M Y VD + A L C + IPV+ T G Q +P
Sbjct: 38 MQNYFVDAFTAGASPVTELSANIRKLKNQCVQLGIPVVYTAQPGSQ-NPDD 87


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1163PF05704310.024 Capsular polysaccharide synthesis protein
		>PF05704#Capsular polysaccharide synthesis protein

Length = 307

Score = 30.6 bits (69), Expect = 0.024
Identities = 7/32 (21%), Positives = 20/32 (62%), Gaps = 1/32 (3%)

Query: 430 YNEPPELLKQTLDALARLDYPDYEVLVIDNNT 461
+ P +++Q + ++ + + D++V++ID N
Sbjct: 79 IEKAPYIVQQCVASV-KKNSGDFKVIIIDGNN 109


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1158PF06580423e-06 Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 41.8 bits (98), Expect = 3e-06
Identities = 21/107 (19%), Positives = 35/107 (32%), Gaps = 25/107 (23%)

Query: 345 LQNLLTNALRHA------DRRVRISYRVSLERCRVDVEDDGPGVPEAQWERLFTPFLRLD 398
+Q L+ N ++H ++ + ++VE+ G + E
Sbjct: 260 VQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE---------- 309

Query: 399 DSRTRASGGHGLGLSIVR-RIVYWHGGRASIGRSETLGGACFTLAWP 444
G GL VR R+ +G A I SE G + P
Sbjct: 310 --------STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1157HTHFIS793e-19 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.5 bits (196), Expect = 3e-19
Identities = 37/148 (25%), Positives = 66/148 (44%), Gaps = 3/148 (2%)

Query: 7 RILIVEDDRRLAELTREYLEGNGLKVDIEANGALAAARILAERPDLVVLDLMLPGEDGLS 66
IL+ +DD + + + L G V I +N A I A DLVV D+++P E+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 67 ICRQVR-PQFDGPILMLTARTDDMDEVLGLEMGADDYVCKPVRPRVLLARIRALLRRSEA 125
+ +++ + D P+L+++A+ M + E GA DY+ KP L+ I L +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 126 PEAGAPAADSKRLAFGRLVIDNAMREAW 153
+ + + AM+E +
Sbjct: 125 RPSKLEDD--SQDGMPLVGRSAAMQEIY 150


116PA1108PA1078N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA1108-181.431365major facilitator superfamily transporter
PA1107010-0.055397hypothetical protein
PA11061100.162995hypothetical protein
PA110509-0.054294flagellar biosynthesis chaperone
PA1104080.387801flagellum-specific ATP synthase
PA1103-190.270646flagellar assembly protein FliH
PA1102-19-0.116485flagellar motor switch protein FliG
PA1101-29-0.044761flagellar MS-ring protein
PA1100013-0.704181flagellar hook-basal body complex protein FliE
PA1099-112-1.584539two-component response regulator
PA1098014-2.990456two-component sensor
PA1097016-4.404052transcriptional regulator FleQ
PA1096-115-1.441699hypothetical protein
PA1095-114-1.425869B-type flagellar protein FliS
PA1094013-1.258632B-type flagellar hook-associated protein
PA1093014-0.723586hypothetical protein
PA1092-112-0.913244B-type flagellin
PA1091-112-0.632063flagellar glycosyl transferase FgtA
PA1090-19-2.200345hypothetical protein
PA108908-1.609569hypothetical protein
PA1088-19-1.426983hypothetical protein
PA108709-1.368030flagellar hook-associated protein FlgL
PA1086010-1.019546flagellar hook-associated protein FlgK
PA1085113-1.129379peptidoglycan hydrolase FlgJ
PA1084313-1.829681flagellar basal body P-ring protein
PA1083313-2.687494flagellar basal body L-ring protein
PA1082313-3.140516flagellar basal body rod protein FlgG
PA1081112-3.391830flagellar basal body rod protein FlgF
PA1080112-3.379175flagellar hook protein FlgE
PA1079014-3.667849flagellar basal body rod modification protein
PA1078013-3.732849flagellar basal body rod protein FlgC
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1108TCRTETA574e-11 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 57.1 bits (138), Expect = 4e-11
Identities = 82/337 (24%), Positives = 125/337 (37%), Gaps = 34/337 (10%)

Query: 4 RPRPPLLLVLALLALPQVAETILSPALPALASHWRLDDATSQWT------MALFFVGFAP 57
+P PL+++L+ +AL V ++ P LP L + + AL AP
Sbjct: 2 KPNRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAP 61

Query: 58 GIWLWGWLADRLGRRPALLGGLGLAALATFGAWASTDYSYLLACRLVQGLGLATCSVTVQ 117
+ G L+DR GRRP LL L AA+ + L R+V G+ AT +V
Sbjct: 62 ---VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAV-AG 117

Query: 118 ASLRDVLQGPALMSYFVTLGAVLAWSPAVGPLGGQWLADLGGH-PAVFATLAVLLASLAA 176
A + D+ G +F + A + GP+ G + H P A L L
Sbjct: 118 AYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTG 177

Query: 177 LVV---PAWPETRPLLAGTPEPATLAIFRRVLADRPLQTRALLVAVLNVLVFSFYAAGPF 233
+ E RPL P FR + + V + LV AA
Sbjct: 178 CFLLPESHKGERRPLRREALNPLAS--FRWARGMTVVAA-LMAVFFIMQLVGQVPAALWV 234

Query: 234 MVGDLPGLGFGW----IGLAIAIAGSLGAL----LNRRLPRTWNSARRVRLGLALAAAGA 285
+ G+ F W IG+++A G L +L + + R + LG+ A
Sbjct: 235 IFGEDR---FHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGM---IADG 288

Query: 286 TAQTLLAAVGYAEGLYWALPALPIFIGFGVAIPNLLG 322
T LLA + A P + + G+ +P L
Sbjct: 289 TGYILLAFATRG---WMAFPIMVLLASGGIGMPALQA 322


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1105FLGFLIJ542e-12 Flagellar FliJ protein signature.
		>FLGFLIJ#Flagellar FliJ protein signature.

Length = 147

Score = 54.4 bits (130), Expect = 2e-12
Identities = 46/134 (34%), Positives = 74/134 (55%)

Query: 8 LAPVVDMASKAERDAATQLGRCQQQLLAAQQKLAELERYRNDYQQQWISQGQKGVSGQWL 67
LA + D+A K DAA LG ++ A+++L L Y+N+Y+ S G++
Sbjct: 7 LATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAGITSNRW 66

Query: 68 MNYQRFLSQLETAVAQQANSVTWHREAVDKARLNWQERYARLEGLRKLVERYLEEARQAE 127
+NYQ+F+ LE A+ Q + + VD A +W+E+ RL+ + L ER A AE
Sbjct: 67 INYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQSTAALLAE 126

Query: 128 DKREQKQLDELAQR 141
++ +QK++DE AQR
Sbjct: 127 NRLDQKKMDEFAQR 140


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1103FLGFLIH561e-11 Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 56.3 bits (135), Expect = 1e-11
Identities = 47/202 (23%), Positives = 93/202 (46%), Gaps = 11/202 (5%)

Query: 40 VAAPQVPAVAEPAPAPPAVEEVELETVKPPTLEEIEAIRQDAYNEGFATGERDGFHAGQL 99
+A PQ V P +EE E + +++A Q Y G A G + G G
Sbjct: 15 LAPPQAEFVPIVEPEETIIEEAEPSLEQQLAQLQMQAHEQ-GYQAGIAEGRQQGHKQGYQ 73

Query: 100 KARQEAEEALKERLQS--------LERLMTQLLEPIAEQDALIEQGMVNLVNHVARQVIQ 151
+ + E +S +++L+++ + D++I ++ + ARQVI
Sbjct: 74 EGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRLMQMALEAARQVIG 133

Query: 152 RELHMDSSHVRQVLREALKLLPMGAANIRIHVNPQDFERVKAL--RERHEESWRILEDDS 209
+ +D+S + + +++ L+ P+ + ++ V+P D +RV + WR+ D +
Sbjct: 134 QTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGATLSLHGWRLRGDPT 193

Query: 210 LLPGGCRIETEHSRIDATIETR 231
L PGGC++ + +DA++ TR
Sbjct: 194 LHPGGCKVSADEGDLDASVATR 215


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1102FLGMOTORFLIG305e-105 Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 305 bits (784), Expect = e-105
Identities = 109/330 (33%), Positives = 204/330 (61%)

Query: 9 KLTKVDKAAILLLSLGETDAAQVLRHMGPKEVQRVGVAMASMRNVHREQVEQVMGEFVEV 68
LT KAAILL+S+G +++V +++ +E++ + +A + + E + V+ EF E+
Sbjct: 14 ALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFKEL 73

Query: 69 VGDQTSLGVGADGYIRKMLTQALGEDKANNLIDRILLGGSTSGLDSLKWMEPRAVADVIR 128
+ Q + G Y R++L ++LG KA ++I+ + + + ++ +P + + I+
Sbjct: 74 MMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFVRRADPANILNFIQ 133

Query: 129 YEHPQIQAIVVAYLDPDQAAEVLSHFDHKVRLDIVLRVSSLNTVQPSALKELNLILEKQF 188
EHPQ A++++YLDP +A+ +LS +V+ ++ R++ ++ P ++E+ +LEK+
Sbjct: 134 QEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLEKKL 193

Query: 189 AGNSNATRTTMGGVKRAADIMNYLDSSIEGQLMDSIREVDEDLSGQIEDLMFVFDNLADV 248
A S+ T+ GGV +I+N D E +++S+ E D +L+ +I+ MFVF+++ +
Sbjct: 194 ASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDIVLL 253

Query: 249 DDRGIQALLREVSSDVLVLALKGSDEAIREKVFKNMSKRAAELLRDDLEAKGPVRVSEVE 308
DDR IQ +LRE+ L ALK D ++EK+FKNMSKRAA +L++D+E GP R +VE
Sbjct: 254 DDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRKDVE 313

Query: 309 GAQKEILTIARRMAESGDIVLGGKGGEEMI 338
+Q++I+++ R++ E G+IV+ G E+++
Sbjct: 314 ESQQKIVSLIRKLEEQGEIVISRGGEEDVL 343


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1101FLGMRINGFLIF6080.0 Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 608 bits (1569), Expect = 0.0
Identities = 206/576 (35%), Positives = 311/576 (53%), Gaps = 39/576 (6%)

Query: 30 LDNLSEMTMLRQIGLLVGLAASVAIGFAVVLWSQQPDYKPLYGSLNGVDANRVVEALTAA 89
L+ L+ + +I L+V +A+VAI A+VLW++ PDY+ L+ +L+ D +V LT
Sbjct: 13 LEWLNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQM 72

Query: 90 DIPYKVEPNSGALLVKADDLGRARMKVASAGVAPTDNNVGFEILDKEQALGTSQFMEATN 149
+IPY+ SGA+ V AD + R+++A G+ P VGFE+LD+E G SQF E N
Sbjct: 73 NIPYRFANGSGAIEVPADKVHELRLRLAQQGL-PKGGAVGFELLDQE-KFGISQFSEQVN 130

Query: 150 YRRGLEGELARTVSSLNNVKAARVHLAIPKSSVFVRDDRKPSASVLVELYPGRSLEPSQV 209
Y+R LEGELART+ +L VK+ARVHLA+PK S+FVR+ + PSASV V L PGR+L+ Q+
Sbjct: 131 YQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQI 190

Query: 210 MAIVNLVATSVPELDKSQVTVVDQKGNLLSDQQELSELTMAGKQFDFTRRMEGLLTQRVH 269
A+V+LV+++V L VT+VDQ G+LL+ Q S + Q F +E + +R+
Sbjct: 191 SAVVHLVSSAVAGLPPGNVTLVDQSGHLLT-QSNTSGRDLNDAQLKFANDVESRIQRRIE 249

Query: 270 NILQPVLGNGRYKAEVSADVDFSAVESTSEMYNPDQPA----LRSEQRNNEERQNSSGPQ 325
IL P++GNG A+V+A +DF+ E T E Y+P+ A LRS Q N E+ + P
Sbjct: 250 AILSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPG 309

Query: 326 GVPGALSNQPPGPASAPQQATASAPADYVAPGQPLKDANGQTIIDPKTGKPELAPYPTDK 385
GVPGALSNQP P AP + P P N Q T + P
Sbjct: 310 GVPGALSNQPAPPNEAP----IATP--------PTNQQNAQNTPQTSTSTNSNSAGPRST 357

Query: 386 RDQTTRNYELDRSISYTKQQQGRLRRLSVAVVLDDQMKVDAKTGEVSHQPWSADELARFT 445
+ T NYE+DR+I +TK G + RLSVAVV++ + D K P +AD++ +
Sbjct: 358 QRNETSNYEVDRTIRHTKMNVGDIERLSVAVVVNYKTLADGKP-----LPLTADQMKQIE 412

Query: 446 RLVQDSVGYDASRGDSVSVINAPFAPAQAEEIDSIPFYSQPWFWDIVKQVLGVLFILVLV 505
L ++++G+ RGD+++V+N+PF A +PF+ Q F D + L +LV+
Sbjct: 413 DLTREAMGFSDKRGDTLNVVNSPF-SAVDNTGGELPFWQQQSFIDQLLAAGRWLLVLVVA 471

Query: 506 F----GVLRPVLSNITGGGKGKSLAGGGGRDGDLALGESGLEGSLADDRVSIGGPSSILL 561
+ +RP L+ K ++ E +E L+ D ++ L
Sbjct: 472 WILWRKAVRPQLTRRVEEAKAAQEQAQVRQE-----TEEAVEVRLSKDEQLQQRRANQRL 526

Query: 562 PSPTEGYDAQLNAIKNLVAQDPGRVAQVVKEWINAD 597
G + I+ + DP VA V+++W++ D
Sbjct: 527 -----GAEVMSQRIREMSDNDPRVVALVIRQWMSND 557


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1100FLGHOOKFLIE933e-28 Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE

signature.
Length = 103

Score = 92.8 bits (230), Expect = 3e-28
Identities = 41/92 (44%), Positives = 55/92 (59%)

Query: 18 QMEAMAKAKPVQAPAEVGAPSFSEMLSQAVDKVNETQQASTAMANAFEVGQSGVDLTDVM 77
Q++A A + Q SF+ L A+D++++TQ A+ A F +G+ GV L DVM
Sbjct: 12 QLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTLGEPGVALNDVM 71

Query: 78 IASQKASVSFQAMTQVRNKLVQAYQDIMQMPV 109
QKASVS Q QVRNKLV AYQ++M M V
Sbjct: 72 TDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1099HTHFIS504e-179 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 504 bits (1300), Expect = e-179
Identities = 173/482 (35%), Positives = 255/482 (52%), Gaps = 18/482 (3%)

Query: 2 AAKVLLVEDDRALREALSDTLLLGGHEFVAVDSAEAALPVLAREAFSLVISDVNMPGMDG 61
A +L+ +DD A+R L+ L G++ +A +A LV++DV MP +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 62 HQLLGLIRTRYPHLPVLLMTAYGAVDRAVEAMRQGAADYLVKPF--------EARALLDL 113
LL I+ P LPVL+M+A A++A +GA DYL KPF RAL +
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 114 VARHALGQLPGSEEDGPVALEPASRQLLELAARVARSDSTVLISGESGTGKEVLANYIHQ 173
R + + + V A +++ + AR+ ++D T++I+GESGTGKE++A +H
Sbjct: 123 KRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHD 182

Query: 174 QSPRAGKPFIAINCAAIPDNMLEATLFGHEKGSFTGAIAAQPGKFELADGGTILLDEISE 233
R PF+AIN AAIP +++E+ LFGHEKG+FTGA G+FE A+GGT+ LDEI +
Sbjct: 183 YGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGD 242

Query: 234 MPLGLQAKLLRVLQEREVERVGARKPINLDIRVLATTNRDLAAEVAAGRFREDLYYRLSV 293
MP+ Q +LLRVLQ+ E VG R PI D+R++A TN+DL + G FREDLYYRL+V
Sbjct: 243 MPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNV 302

Query: 294 FPLAWRPLRERPADILPLAERLLRKHSRKMNLGAVALGPEAAQCLVRHAWPGNVRELDNA 353
PL PLR+R DI L +++ + K L EA + + H WPGNVREL+N
Sbjct: 303 VPLRLPPLRDRAEDIPDLVRHFVQQ-AEKEGLDVKRFDQEALELMKAHPWPGNVRELENL 361

Query: 354 IQRALILQQGGLIQPADLCLTAPIGMPLAAPVPVPMPAMPPATPPSVE------IPSPAA 407
++R L +I + +P + + + +VE S
Sbjct: 362 VRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGD 421

Query: 408 GQDASGALGDDLRRREFQVIIDTLRTERGRRKEAAERLGISPRTLRYKLAQMRDAGMDVE 467
SG L E+ +I+ L RG + +AA+ LG++ TLR K +R+ G+ V
Sbjct: 422 ALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKK---IRELGVSVY 478

Query: 468 AY 469

Sbjct: 479 RS 480


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1098PF06580371e-04 Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 36.8 bits (85), Expect = 1e-04
Identities = 20/97 (20%), Positives = 32/97 (32%), Gaps = 19/97 (19%)

Query: 299 LVENA----IQACGPELRLKVHLYARADSLRLSVSDNGPGMDPATLARLGEPFFTTKTTG 354
LVEN I ++ + ++ L V + G T
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNT------------KES 310

Query: 355 TGLGLAVVKAVARAHQG---QLQLRSRPGRGTCATLI 388
TG GL V+ + G Q++L + G+ LI
Sbjct: 311 TGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLI 347


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1097HTHFIS5100.0 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 510 bits (1314), Expect = 0.0
Identities = 181/489 (37%), Positives = 256/489 (52%), Gaps = 14/489 (2%)

Query: 5 TKLLLIDDNLDRSRDLAVILNFLGEDQLTCNS--EDWREVAAGLSNSREALCVLLGSVES 62
+L+ DD+ L L+ G D ++ WR +AAG + V+ V
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGD-----LVVTDVVMP 58

Query: 63 KGGAVELLKQLASWDEYLPILLI-GEPAPADWPEELRRRVLASLEMPPSYNKLLDSLHRA 121
A +LL ++ LP+L++ + + + L P +L+ + RA
Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 122 QVYREMYDQARERGRSREPNLFRSLVGTSRAIQQVRQMMQQVADTDASVLILGESGTGKE 181
+ R + LVG S A+Q++ +++ ++ TD +++I GESGTGKE
Sbjct: 119 LAEP----KRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKE 174

Query: 182 VVARNLHYHSKRREGPFVPVNCGAIPAELLESELFGHEKGAFTGAITSRAGRFELANGGT 241
+VAR LH + KRR GPFV +N AIP +L+ESELFGHEKGAFTGA T GRFE A GGT
Sbjct: 175 LVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGT 234

Query: 242 LFLDEIGDMPLPMQVKLLRVLQERTFERVGSNKTQNVDVRIIAATHKNLEKMIEDGTFRE 301
LFLDEIGDMP+ Q +LLRVLQ+ + VG DVRI+AAT+K+L++ I G FRE
Sbjct: 235 LFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFRE 294

Query: 302 DLYYRLNVFPIEMAPLRERVEDIALLLNELISRMEHEKRGSIRFNSAAIMSLCRHDWPGN 361
DLYYRLNV P+ + PLR+R EDI L+ + + E E RF+ A+ + H WPGN
Sbjct: 295 DLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGN 354

Query: 362 VRELANLVERLAIMHPYGVIGVGELPKKFR-HVDDEDEQLASSLREELEERAAINAGLPG 420
VREL NLV RL ++P VI + + R + D + A++ L A+ +
Sbjct: 355 VRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQ 414

Query: 421 MDAPAM-LPAEGLDLKDYLANLEQGLIQQALDDAGGVVARAAERLRIRRTTLVEKMRKYG 479
A LA +E LI AL G +AA+ L + R TL +K+R+ G
Sbjct: 415 YFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELG 474

Query: 480 MSRRDDDLS 488
+S S
Sbjct: 475 VSVYRSSRS 483


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1092FLAGELLIN2047e-62 Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 204 bits (519), Expect = 7e-62
Identities = 171/508 (33%), Positives = 243/508 (47%), Gaps = 22/508 (4%)

Query: 2 ALTVNTNIASLNTQRNLNASSNDLNTSLQRLTTGYRINSAKDDAAGLQISNRLSNQISGL 61
A +NTN SL TQ NLN S + L+++++RL++G RINSAKDDAAG I+NR ++ I GL
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 NVATRNANDGISLAQTAEGALQQSTNILQRIRDLALQSANGSNSDADRAALQKEVAAQQA 121
A+RNANDGIS+AQT EGAL + N LQR+R+L++Q+ NG+NSD+D ++Q E+ +
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 ELTRISDTTTFGGRKLLDGSFGTTSFQVGSNAYETIDISLQNASASAIGSYQVGSNGAGT 181
E+ R+S+ T F G K+L QVG+N ETI I LQ ++G NG
Sbjct: 121 EIDRVSNQTQFNGVKVLSQDN-QMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 182 VAS---------VAGTATASGIASGTVNLVGGGQVKNIAIAAGDSAKAIAEKMDGAIPNL 232
V G T + A+ V G V A K +G +
Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTD 239

Query: 233 SARART-VFTADVSGVTGGSLNFDVTVGSNTVSLAGVTSTQDLADQLNSNSSKLGITASI 291
A T V + T G+ G+ G T + +
Sbjct: 240 DAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKV 299

Query: 292 NDKGVLTITSATGENVKFGAQTGTATAGQVAVKVQGS---------DGKFEAAAKNVVAA 342
+ + T ++ GA A Q + V S D +AK
Sbjct: 300 STTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLE 359

Query: 343 GTAATTTIVTGYVQLNSPTAYSVSGTGTQA--SQVFGNASAAQKSSVASVDISTADGAQN 400
A V TA + T A + ++ + + + N
Sbjct: 360 ANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTAN 419

Query: 401 AIAVVDNALAAIDAQRADLGAVQNRFKNTIDNLTNISENATNARSRIKDTDFAAETAALS 460
+A +D+AL+ +DA R+ LGA+QNRF + I NL N N +ARSRI+D D+A E + +S
Sbjct: 420 PLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMS 479

Query: 461 KNQVLQQAGTAILAQANQLPQAVLSLLR 488
K Q+LQQAGT++LAQANQ+PQ VLSLLR
Sbjct: 480 KAQILQQAGTSVLAQANQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1088PHPHTRNFRASE290.024 Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase

signature.
Length = 572

Score = 28.6 bits (64), Expect = 0.024
Identities = 17/81 (20%), Positives = 33/81 (40%), Gaps = 11/81 (13%)

Query: 174 IYLREPVAVEQRLTLDRFFSKELEHEYSAVYRTVAELKDMLVQAGGGEGP---AILE--- 227
I+L V +E+ D S E+E +A+ ++ EL+ + Q G I
Sbjct: 21 IHLEPNVDIEKTSITD--VSTEIEKLTAALEKSKEELRAIKDQTEASMGADKAEIFAAHL 78

Query: 228 ---EDCLFAEALEKRVETRQY 245
+D + ++ ++E Q
Sbjct: 79 LVLDDPELVDGIKGKIENEQM 99


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1087FLAGELLIN522e-09 Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 52.4 bits (125), Expect = 2e-09
Identities = 31/141 (21%), Positives = 60/141 (42%)

Query: 1 MRISTIQAFNNSVSGISRNYADLTRTQAEISAGKRLLTPADDPVGAVRLLQLNQEQALNS 60
I+T + + ++++ + L+ +S+G R+ + DD G + +
Sbjct: 2 QVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLT 61

Query: 61 QYKSGITAAKNSLQQEETILNSVGTVIHRIREIAVQAGNGGLDASDKNALATELAQREDE 120
Q + Q E LN + + R+RE++VQA NG SD ++ E+ QR +E
Sbjct: 62 QASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEE 121

Query: 121 LLNLLNSRDASGKYLFSGSQG 141
+ + N +G + S
Sbjct: 122 IDRVSNQTQFNGVKVLSQDNQ 142



Score = 31.9 bits (72), Expect = 0.005
Identities = 15/77 (19%), Positives = 35/77 (45%)

Query: 349 DAVGVAISNLDSSNSQILTGQGRIGARMNVAESTETFIDDVTLVNTAVISQIQDLDYPEA 408
+ ++++DS+ S++ + +GA N +S T + + + S+I+D DY
Sbjct: 415 KSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATE 474

Query: 409 LSRLTLQSTIMDAAQQS 425
+S ++ + A
Sbjct: 475 VSNMSKAQILQQAGTSV 491


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1086FLGHOOKAP12462e-75 Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 246 bits (629), Expect = 2e-75
Identities = 141/450 (31%), Positives = 232/450 (51%), Gaps = 19/450 (4%)

Query: 2 SDLLSIGLSGLGTSQTWLTVTGHNITNVKTPGYSRQDAIQQTRIPQFSGAGYMGSGSQIV 61
S L++ +SGL +Q L +NI++ GY+RQ I G++G+G +
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 60

Query: 62 DVRRLASDFLTGQLRNATSQNSELNAFLGQIDQLNSLLADNTTGVSPAMQRFFSALQTAA 121
V+R F+T QLR A +Q+S L A Q+ +++++L+ +T+ ++ MQ FF++LQT
Sbjct: 61 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 120

Query: 122 QNPSSTEAREAVLAQAQGLSKTFNTLYDQLDKQNSLINQQLGALTSQVNNLSQSVAEYND 181
N AR+A++ +++GL F T L Q+ +N +GA Q+NN ++ +A ND
Sbjct: 121 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 180

Query: 182 AIAK--AKSAGAVPNDLLDARDEAVRKLSEMVGVTAVTQDDNSVSLFIGSGQPLVVGNTV 239
I++ AGA PN+LLD RD+ V +L+++VGV QD + ++ + +G LV G+T
Sbjct: 181 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA 240

Query: 240 STLSVVPGLDDPTRYQVQLTLGDS--TQNVTRLVSGGQMGGLLAYRDTVLDSSYNKLGQL 297
L+ VP DP+R V G + + +L++ G +GG+L +R LD + N LGQL
Sbjct: 241 RQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL 300

Query: 298 ALTFADTVNKQLGQGLDLAGKAGANLFGDINDPDITALRVLAKNGNTGNVHANLNITDTS 357
AL FA+ N Q G D G AG + F I VL N G+V +TD S
Sbjct: 301 ALAFAEAFNTQHKAGFDANGDAGEDFF------AIGKPAVLQNTKNKGDVAIGATVTDAS 354

Query: 358 KLNSSDFRLDFDGTNFTARRLGDDASMQVTVSGTGPYTLSFKDANGVDQGFSVTLDQLPA 417
+ ++D+++ FD + RL + + VT + G +T PA
Sbjct: 355 AVLATDYKISFDNNQWQVTRLASNTTFTVT---------PDANGKVAFDGLELTFTGTPA 405

Query: 418 AGDRFTLQPTRRGASDIETTLKNASQLAFA 447
D FTL+P +++ + + +++A A
Sbjct: 406 VNDSFTLKPVSDAIVNMDVLITDEAKIAMA 435



Score = 83.1 bits (205), Expect = 8e-19
Identities = 51/156 (32%), Positives = 71/156 (45%), Gaps = 20/156 (12%)

Query: 545 GSGQTYTYEFNLSNVPQTGDSFTLSFNKDGIAD--------------------NRNALNL 584
G E + P DSFTL D I + + + N
Sbjct: 389 GKVAFDGLELTFTGTPAVNDSFTLKPVSDAIVNMDVLITDEAKIAMASEEDAGDSDNRNG 448

Query: 585 NALQTKPTVGGTDSTGSTYNDAYGGLVERVGTLTAQARASADASQTVLKQAQDSRDSLSG 644
AL + T ++NDAY LV +G TA + S+ V+ Q + + S+SG
Sbjct: 449 QALLDLQSNSKTVGGAKSFNDAYASLVSDIGNKTATLKTSSATQGNVVTQLSNQQQSISG 508

Query: 645 VSLDEEAANLIQFQQYYSASAQVIQVARSLFDTLIG 680
V+LDEE NL +FQQYY A+AQV+Q A ++FD LI
Sbjct: 509 VNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1085FLGFLGJ1481e-43 Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 148 bits (374), Expect = 1e-43
Identities = 79/195 (40%), Positives = 114/195 (58%), Gaps = 7/195 (3%)

Query: 198 LPAQSYPAASRRGFSTDGVDSQGSRRIAQP-----PLARGKSMFASADEFIATMLPMAQK 252
LP +S PAA F + V ++ ++Q P S+ + F+A + AQ
Sbjct: 104 LPEESTPAA-PMKFPLETVVRYQNQALSQLVQKAVPRNYDDSLPGDSKAFLAQLSLPAQL 162

Query: 253 AAERIGVDARYLVAQAALETGWGKSIIRQQDGGSSHNLFGIKTGSRWDGASARALTTEYE 312
A+++ GV ++AQAALE+GWG+ IR+++G S+NLFG+K W G TTEYE
Sbjct: 163 ASQQSGVPHHLILAQAALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYE 222

Query: 313 GGKAVKEIAAFRSYSSFEQSFHDYVSFLQGNDRYQNALDSAANPERFMQELQRAGYATDP 372
G+A K A FR YSS+ ++ DYV L N RY A+ +AA+ E+ Q LQ AGYATDP
Sbjct: 223 NGEAKKVKAKFRVYSSYLEALSDYVGLLTRNPRYA-AVTTAASAEQGAQALQDAGYATDP 281

Query: 373 QYARKVAQIARQMQT 387
YARK+ + +QM++
Sbjct: 282 HYARKLTNMIQQMKS 296



Score = 68.2 bits (166), Expect = 7e-15
Identities = 46/160 (28%), Positives = 78/160 (48%), Gaps = 10/160 (6%)

Query: 20 DLNRLNQLKVGKDRDGEANIRKVAQEFESLFLNEMLKSMRSANEALGDGNFMNSQTTKQY 79
D LN+LK D ANIR VA++ E +F+ MLKSMR +AL +S+ T+ Y
Sbjct: 14 DAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMR---DALPKDGLFSSEHTRLY 70

Query: 80 QDMYDQQLSVSLSKNAGGIGLADVLVRQLSKMKQGSRGNGENPFARVAENGAGRWPSNPS 139
MYDQQ++ ++ G+GLA+++V+Q++ + + + R+ +
Sbjct: 71 TSMYDQQIAQQMT-AGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLETVVRYQNQAL 129

Query: 140 AQAGKALPMPEAGRDDSKLLNQR----RLALPGKLAERML 175
+Q DDS + + +L+LP +LA +
Sbjct: 130 SQL--VQKAVPRNYDDSLPGDSKAFLAQLSLPAQLASQQS 167


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1084FLGPRINGFLGI436e-155 Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 436 bits (1122), Expect = e-155
Identities = 168/366 (45%), Positives = 224/366 (61%), Gaps = 10/366 (2%)

Query: 7 LLALAALLLAAGAAQAERLKDIASIQGVRTNQLIGYGLVVGLSGSGDQTTQTPFTLQTFN 66
AL L A R+KDIAS+Q R NQLIGYGLVVGL G+GD +PFT Q+
Sbjct: 13 FSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMR 72

Query: 67 NMLAQFGIKVPANVGNVQLKNVAAVSVHADLPPFAKPGQPIDVTVSSIGNAKSLRGGSLL 126
ML GI G KN+AAV V A+LPPFA PG +DVTVSS+G+A SLRGG+L+
Sbjct: 73 AMLQNLGITTQG--GQSNAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLI 130

Query: 127 MTPLKGIDGQVYAVAQGNLVVGGFDAEGRDGSKITVNVPSAGRIPAGATVERAVPSGFDQ 186
MT L G DGQ+YAVAQG L+V GF A+G D + +T V ++ R+P GA +ER +PS F
Sbjct: 131 MTSLSGADGQIYAVAQGALIVNGFSAQG-DAATLTQGVTTSARVPNGAIIERELPSKFKD 189

Query: 187 GNSLTLNLNRPDFTTAKRIVDRINEL----LGPGVAHAVDGGSVRVSAPLDPNQRVDYLS 242
+L L L PDF+TA R+ D +N G +A D + V P ++
Sbjct: 190 SVNLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKP-RVADLTRLMA 248

Query: 243 ILENLDVQPGEAVAKVIINSRTGTIVIGQNVKVSPAAVTHGSLTVSITEDPIVSQPGAFS 302
+ENL V+ + AKV+IN RTGTIVIG +V++S AV++G+LTV +TE P V QP FS
Sbjct: 249 EIENLTVET-DTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQPAPFS 307

Query: 303 NGQTAVVPRSRVNAEEETKPMFKFGPGTTLDDIVRAVNQVGAAPSDLMAILEALKQAGAL 362
GQTAV P++ + A +E + G L +V +N +G ++AIL+ +K AGAL
Sbjct: 308 RGQTAVQPQTDIMAMQEGSKVA-IVEGPDLRTLVAGLNSIGLKADGIIAILQGIKSAGAL 366

Query: 363 QADLIV 368
QA+L++
Sbjct: 367 QAELVL 372


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1083FLGLRINGFLGH1803e-59 Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 180 bits (459), Expect = 3e-59
Identities = 81/224 (36%), Positives = 112/224 (50%), Gaps = 13/224 (5%)

Query: 12 IATALGGCVNPPPKPNDPYYAPVLPRTPLPAAQNNGAIYQAGF-----EQNLYDDRKAFR 66
+ +L GC P P P P P NG+I+Q+ Q L++DR+
Sbjct: 15 LVLSLTGCAWIPSTPLVQGATSAQP-VPGPTPVANGSIFQSAQPINYGYQPLFEDRRPRN 73

Query: 67 VGDIITITLNEKTQASKKANSDIQKDSKTKMGLTSLFGSGMTTNNPIGGGDLSLSAEYGG 126
+GD +TI L E ASK ++++ +D KT G + G + E G
Sbjct: 74 IGDTLTIVLQENVSASKSSSANASRDGKTNFGFDT---VPRYLQGLFGNARADV--EASG 128

Query: 127 SRDAKGDSQAGQSNSLTGSITVTVAEVLPNGILSVRGEKWMTLNTGNELVRIAGLVRADD 186
G A SN+ +G++TVTV +VL NG L V GEK + +N G E +R +G+V
Sbjct: 129 GNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRFSGVVNPRT 188

Query: 187 IATDNTVSSTRVADARITYSGTGAFADASQPGWLDRFF--LSPL 228
I+ NTV ST+VADARI Y G G +A GWL RFF LSP+
Sbjct: 189 ISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 232


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1082FLGHOOKAP1452e-07 Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 44.6 bits (105), Expect = 2e-07
Identities = 13/51 (25%), Positives = 25/51 (49%)

Query: 209 NGLGTVAQNTLENSNVNVVEELVNMITTQRAYEMNSKVISTADQMLSFVTQ 259
N + ++ S VN+ EE N+ Q+ Y N++V+ TA+ + +
Sbjct: 494 NVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544



Score = 39.6 bits (92), Expect = 9e-06
Identities = 20/79 (25%), Positives = 35/79 (44%), Gaps = 14/79 (17%)

Query: 3 SALWVSKTGLSAQDMNLTTISNNLANVSTTGFKRDRAEFQDLLYQIRRQPGGQSTQDSEL 62
S + + +GL+A L T SNN+++ + G+ R + +S L
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTI--------------MAQANSTL 47

Query: 63 PSGLQLGTGVRVVGTQKIF 81
+G +G GV V G Q+ +
Sbjct: 48 GAGGWVGNGVYVSGVQREY 66


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1080FLGHOOKAP1455e-07 Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 44.9 bits (106), Expect = 5e-07
Identities = 17/49 (34%), Positives = 27/49 (55%)

Query: 414 ALQSGALEASNVDISNELVNLIVHQRNYQANAKTIQTEDAVTQTIINLR 462
L + S V++ E NL Q+ Y ANA+ +QT +A+ +IN+R
Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546



Score = 41.1 bits (96), Expect = 8e-06
Identities = 22/69 (31%), Positives = 34/69 (49%), Gaps = 3/69 (4%)

Query: 2 SFNIGLSGIQAASSGLNVTGNNIANAGTVGFKQSRAEFADVYAASVLGSGSNPQGSGVLL 61
N +SG+ AA + LN NNI++ G+ + A A S LG+G G+GV +
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQ--ANSTLGAGGW-VGNGVYV 59

Query: 62 SDVSQMFKQ 70
S V + +
Sbjct: 60 SGVQREYDA 68


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA1078FLGHOOKAP1363e-05 Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 36.1 bits (83), Expect = 3e-05
Identities = 8/38 (21%), Positives = 20/38 (52%)

Query: 107 NVNVVEEMADMISASRAFQTNAEMMNTAKQMMQKVLTL 144
VN+ EE ++ + + NA+++ TA + ++ +
Sbjct: 508 GVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545



Score = 29.5 bits (66), Expect = 0.004
Identities = 15/54 (27%), Positives = 25/54 (46%), Gaps = 2/54 (3%)

Query: 4 ASVFNIAGSGMSAQSTRLNTVASNIANAETVSSSVDKTYRARHPVFSTMFQQAQ 57
+S+ N A SG++A LNT ++NI++ + T A ST+
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMA--QANSTLGAGGW 52


117PA0974PA0969N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA0974121-4.832310hypothetical protein
PA0973121-5.270237peptidoglycan associated lipoprotein OprL
PA0972120-4.941468translocation protein TolB
PA0971223-5.194336translocation protein TolA
PA0970224-4.543055translocation protein TolR
PA0969122-4.214492translocation protein TolQ
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0974RTXTOXIND320.002 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 32.1 bits (73), Expect = 0.002
Identities = 10/53 (18%), Positives = 19/53 (35%)

Query: 69 QLQQMQDELARLRGTLEEQQNQIQQLKQESLERYQDLDRRISGGGAPAAQNSA 121
+ + +EL + LE+ +++I K+E Q I N
Sbjct: 260 KYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIG 312


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0973OMPADOMAIN1166e-34 OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 116 bits (291), Expect = 6e-34
Identities = 34/112 (30%), Positives = 53/112 (47%), Gaps = 12/112 (10%)

Query: 68 YFEYDSSDLKPEAMRALDVHA---KDLKGSGQRVVLEGHTDERGTREYNMALGERRAKAV 124
F ++ + LKPE ALD +L VV+ G+TD G+ YN L ERRA++V
Sbjct: 222 LFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQSV 281

Query: 125 QRYLVLQGVSPAQLELVSYGKERPVATGHDEQS---------WAQNRRVELK 167
YL+ +G+ ++ G+ PV + A +RRVE++
Sbjct: 282 VDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIE 333


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0971IGASERPTASE492e-08 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 49.3 bits (117), Expect = 2e-08
Identities = 36/204 (17%), Positives = 71/204 (34%), Gaps = 21/204 (10%)

Query: 54 QLKSKSQATTQTNQKIAGEAKKTASKQYE-----VEQLEQKKLEQQKLEQQKLEQQQVAA 108
Q + TT N + + + +++ + E +Q +
Sbjct: 990 QTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESK 1049

Query: 109 AKAAEQKKADEARKAEAQKAAEAKKADEAKKAAEAKAAEQKKQADIAKKRAEDEAKKKAA 168
++ A E + A EAK +A + ++A ++ E K+
Sbjct: 1050 TVEKNEQDATETTAQNREVAKEAKSNVKA----------NTQTNEVA--QSGSETKETQT 1097

Query: 169 EDAKKKAAEDAKKKAAEEAKKKAAAEAAKKKAAVEAAKKKAAAAAAAARKAAEDKKARAL 228
+ K+ A + ++KA E +K E K + V ++++ A A E+ +
Sbjct: 1098 TETKETATVEKEEKAKVETEKT--QEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNI 1155

Query: 229 AELLS--DTTERQQALADEVGSEV 250
E S +TT + A E S V
Sbjct: 1156 KEPQSQTNTTADTEQPAKETSSNV 1179


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA096960KDINNERMP290.017 60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 29.1 bits (65), Expect = 0.017
Identities = 17/72 (23%), Positives = 28/72 (38%), Gaps = 13/72 (18%)

Query: 12 WSLISNASIVVQLVMLTLVAASVTSWIMIFQRGNAMRAAKKALDAFEERFWS-----GID 66
+S+I + +V+ +M L A TS MR + + A ER +
Sbjct: 356 FSIII-ITFIVRGIMYPLTKAQYTSM-------AKMRMLQPKIQAMRERLGDDKQRISQE 407

Query: 67 LSKLYRQAGSNP 78
+ LY+ NP
Sbjct: 408 MMALYKAEKVNP 419


118PA0756PA0749N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA0756-2110.466253two-component response regulator
PA0755-211-0.205242cis-aconitate porin OpdH
PA0754-190.234457hypothetical protein
PA0753-170.386453hypothetical protein
PA0752-190.400283hypothetical protein
PA0751-1100.800681hypothetical protein
PA0750-1100.704980uracil-DNA glycosylase
PA0749-1111.395401hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0756HTHFIS838e-21 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 83.3 bits (206), Expect = 8e-21
Identities = 35/127 (27%), Positives = 62/127 (48%)

Query: 2 RILLVEDHPQLAESVVQALKGAGWTVDLLQDGVAADLALASEEYALAILDVGLPRMDGFE 61
IL+ +D + + QAL AG+ V + + +A+ + L + DV +P + F+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VLARLRGRGKTLPVLMLTARGEVKDRVHGLNLGADDYLAKPFELSELEARVKALLRRSVL 121
+L R++ LPVL+++A+ + GA DYL KPF+L+EL + L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 GGEQLQR 128
+L+
Sbjct: 125 RPSKLED 131


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0754AEROLYSIN290.028 Aerolysin signature.
		>AEROLYSIN#Aerolysin signature.

Length = 493

Score = 28.8 bits (64), Expect = 0.028
Identities = 16/40 (40%), Positives = 23/40 (57%)

Query: 2 MMKLSFRPLALVAAGLLLAGAAVAEPKRPECIAPASPGGG 41
M K+ L+L+ +GLL+A A AEP P+ + S G G
Sbjct: 1 MQKIKLTGLSLIISGLLMAQAQAAEPVYPDQLRLFSLGQG 40


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0753ACRIFLAVINRP270.043 Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 26.7 bits (59), Expect = 0.043
Identities = 15/58 (25%), Positives = 25/58 (43%), Gaps = 5/58 (8%)

Query: 99 LGFILSAALVGSCMAILYGARPIPAVV-----TASLLGIGLYWLFDRALDVPLPLGVL 151
+S +V C+A LY + IP V + + LF++ DV +G+L
Sbjct: 874 ALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLL 931


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0749PF05043300.010 Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 29.9 bits (67), Expect = 0.010
Identities = 7/27 (25%), Positives = 16/27 (59%)

Query: 47 YRYIKHTSKLIRRLGDSDLALQRNKVV 73
YR I +K+I+R +++L +++
Sbjct: 118 YRIISQINKVIKRQFQFEVSLTPVQII 144


119PA0687PA0677N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA06871113.462955type II secretion system protein
PA06860112.964710type II secretion system protein HxcR
PA0685-1122.691211type II secretion system protein
PA06842212.892275type II secretion system protein
PA06831193.124510type II secretion system protein
PA06821162.708585HxcX atypical pseudopilin
PA06811152.865486HxcT pseudopilin
PA0680-1152.418488HxcV pseudopilin
PA0679-1172.553433hypothetical protein
PA0678-1162.151263HxcU pseudopilin
PA06770141.572854HxcW pseudopilin
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0687BCTERIALGSPF378e-131 Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F

signature.
Length = 408

Score = 378 bits (972), Expect = e-131
Identities = 187/406 (46%), Positives = 253/406 (62%), Gaps = 3/406 (0%)

Query: 1 MQTFRYEAADAQGRIETGTLEADSQRGALGQLRARGLTPLEVREQAGGGTGQGAGALFAP 60
M + Y+A DAQG+ GT EADS R A LR RGL PL V E G G+ L
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 61 R---LSDGDLAWATRQLASLLAASLPLEAALSATLDQAERKHIAQTLSAVRSDVRGGMRL 117
R LS DLA TRQLA+L+AAS+PLE AL A Q+E+ H++Q ++AVRS V G L
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 118 ADALAARPRDFPEIYRALVAAGEESGDLAQVMERLADYIEERNALRGKILTAFIYPAVVG 177
ADA+ P F +Y A+VAAGE SG L V+ RLADY E+R +R +I A IYP V+
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 178 VVSIGIVIFLLGYVVPQVVSAFSQARQDLPALTRAMLQASDFVRAWGWLCAGAIGSAYWG 237
VV+I +V LL VVP+VV F +Q LP TR ++ SD VR +G A+ + +
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240

Query: 238 WRLYLRDPQARLGWHRRVLRLPLLGRFVLGVNTARFASTLAILGSAGVPLLRALDAARQT 297
+R+ LR + R+ +HRR+L LPL+GR G+NTAR+A TL+IL ++ VPLL+A+ +
Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 298 LANDCLAQAVEEATAQVREGVSLASALRTRQVFPPILTHLIASGEKTGALPPMLDRAAQT 357
++ND + AT VREGVSL AL +FPP++ H+IASGE++G L ML+RAA
Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360

Query: 358 LSRDIERRAMGMTALLEPLMIVVMGGVVLTIVMAVLMPIIEMNQLV 403
R+ + L EPL++V M VVL IV+A+L PI+++N L+
Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0685BCTERIALGSPD2557e-77 Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D

signature.
Length = 660

Score = 255 bits (654), Expect = 7e-77
Identities = 151/571 (26%), Positives = 257/571 (45%), Gaps = 50/571 (8%)

Query: 230 PGNNTVVVTDYAENLDRVAGIIASIDIPSASD---TDVVPIQNGIAVDIASTVSELLDSQ 286
NN V+ +++ A +AS P D T VVP+ N A D+A + +L D+
Sbjct: 94 NMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVVTRVVPLTNVAARDLAPLLRQLNDNA 153

Query: 287 GSGGAEQGQKTVVLADPRSNSIVIRSPSPERTQLARDLIGKLDSVQSNPGNLHVVYLRNA 346
G G +VV +P SN +++ + +L ++ ++D+ ++ V L A
Sbjct: 154 GVG-------SVVHYEP-SNVLLMTGRAAVIKRL-LTIVERVDNAGDR--SVVTVPLSWA 202

Query: 347 QATRLAQALRGLITGDSGGEGNE--------GDQQRARLSGGG---------MLGGGNSG 389
A + + + L S ++ A L G M+ +
Sbjct: 203 SAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSRQRIIAMIKQLDRQ 262

Query: 390 TGSQGLGSSGNTTGSGSSGLGGSNRSGGAYGAMGSGQGGAGPGAMGEENSAFSAGGVTVQ 449
+QG + +S L Q A+ + + ++
Sbjct: 263 QATQGNTKVIYLKYAKASDLVEVLTGIS-STMQSEKQAAKPVAALDK--------NIIIK 313

Query: 450 ADATTNTLLISAPEPLYRNLREVIDLLDQRRAQVVIESLIVEVSEDDSSEFGIQWQAGNL 509
A TN L+++A + +L VI LD RR QV++E++I EV + D GIQW N
Sbjct: 314 AHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWANKNA 373

Query: 510 GGNGVFG-GVNFGQSALNTAGKNTIDVLPKGLNIGLVDGTVDIPGIGKILDLKVLARALK 568
G G+ + N + L L G + + +L AL
Sbjct: 374 GMTQFTNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQG-NWAMLLTALS 432

Query: 569 SRGGTNVLSTPNLLTLDNESASIMVGQTIPFVSGQYVTDGGGTSNNPFQTIQREDVGLKL 628
S ++L+TP+++TLDN A+ VGQ +P ++G T G N F T++R+ VG+KL
Sbjct: 433 SSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGSQTTSGD----NIFNTVERKTVGIKL 488

Query: 629 NIRPQISEGGTVKLDVYQEVSSVDERASTAA---GVVTNKRAIDTSILLDDGQIMVLGGL 685
++PQI+EG +V L++ QEVSSV + AS+ + G N R ++ ++L+ G+ +V+GGL
Sbjct: 489 KVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGL 548

Query: 686 LQDNVQDNTDGVPGLSSLPGVGSLFRYQKRSRTKTNLMVFLRPYIVRDAAAGRSITLNRY 745
L +V D D VP L +P +G+LFR + +K NLM+F+RP ++RD R + +Y
Sbjct: 549 LDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYRQASSGQY 608

Query: 746 DFIRRAQ-QRVQPRHDWSVGDMQAPVLPPAQ 775
AQ ++ ++ ++ + + P Q
Sbjct: 609 TAFNDAQSKQRGKENNDAMLNQDLLEIYPRQ 639



Score = 159 bits (404), Expect = 6e-43
Identities = 72/276 (26%), Positives = 127/276 (46%), Gaps = 7/276 (2%)

Query: 87 VAPVSATAAELGEQPVSLNFVDTEVEAVVRALSRATGRQFLVDPRVKGKLTLVSEGQVPA 146
A + A E +F T+++ + +S+ + ++DP V+G +T+ S +
Sbjct: 17 FAALLFRPAAAEEFSA--SFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDMLNE 74

Query: 147 RTAYRMLTSALRMQGFSVVDVD-GVSQVVPEADAKLLGGPVYGADRPA-ANGMVTRTFRL 204
Y+ S L + GF+V++++ GV +VV DAK PV P + +VTR L
Sbjct: 75 EQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVVTRVVPL 134

Query: 205 RYENAVNLIPVLRPIVAQNNPINA--YPGNNTVVVTDYAENLDRVAGIIASIDIPSASDT 262
A +L P+LR + + Y +N +++T A + R+ I+ +D
Sbjct: 135 TNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGDRSV 194

Query: 263 DVVPIQNGIAVDIASTVSELLDSQGSGGAEQGQKTVVLADPRSNSIVIRSPSPERTQLAR 322
VP+ A D+ V+EL V+AD R+N++++ P Q
Sbjct: 195 VTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGE-PNSRQRII 253

Query: 323 DLIGKLDSVQSNPGNLHVVYLRNAQATRLAQALRGL 358
+I +LD Q+ GN V+YL+ A+A+ L + L G+
Sbjct: 254 AMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGI 289



Score = 50.7 bits (121), Expect = 2e-08
Identities = 44/299 (14%), Positives = 103/299 (34%), Gaps = 56/299 (18%)

Query: 194 ANGMVTRTFRLRYENAVNLIPVLRPI----------VAQNNPINAYPGNNTVVVTDYAEN 243
A T L + +A +++ ++ + + + A N V+V+ +
Sbjct: 189 AGDRSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNS 248

Query: 244 LDRVAGIIASIDIPSAS--DTDVVPIQNGIAVDIASTVSELL-----DSQGSGGAEQGQK 296
R+ +I +D A+ +T V+ ++ A D+ ++ + + Q + K
Sbjct: 249 RQRIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPVAALDK 308

Query: 297 TV-VLADPRSNSIVIRSPSPERTQLARDLIGKLDSVQSNPGNLHVVYLRNAQATRLAQAL 355
+ + A ++N++++ + P+ +I +LD +R Q +
Sbjct: 309 NIIIKAHGQTNALIVTAA-PDVMNDLERVIAQLD-------------IRRPQV-----LV 349

Query: 356 RGLITGDSGGEGNEGDQQRARLSGGGMLG--GGNSGTGSQGLGSSGNTTGSGSSGLGGSN 413
+I +G LG N G +SG + +G N
Sbjct: 350 EAIIAEVQDADGLN-------------LGIQWANKNAGMTQFTNSGLPISTAIAGANQYN 396

Query: 414 RSGGAYGAMGSGQGGAGPGAMGEENSAFSAGGVTVQA-DATTNTLLISAPEPLYRNLRE 471
+ G ++ S A G + + + A ++T +++ P + + E
Sbjct: 397 KDGTVSSSLASALSSFNGIAAGFYQGNW---AMLLTALSSSTKNDILATPSIVTLDNME 452



Score = 44.1 bits (104), Expect = 2e-06
Identities = 20/84 (23%), Positives = 35/84 (41%), Gaps = 16/84 (19%)

Query: 190 DRPAANGMVTRTFRLRYENAVNLIPVLR----------------PIVAQNNPINAYPGNN 233
DR A T+ L+Y A +L+ VL + +N I A+ N
Sbjct: 260 DRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPVAALDKNIIIKAHGQTN 319

Query: 234 TVVVTDYAENLDRVAGIIASIDIP 257
++VT + ++ + +IA +DI
Sbjct: 320 ALIVTAAPDVMNDLERVIAQLDIR 343


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0681BCTERIALGSPG1671e-56 Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 167 bits (425), Expect = 1e-56
Identities = 63/142 (44%), Positives = 87/142 (61%), Gaps = 6/142 (4%)

Query: 11 KGHRGQRGFTLIEIMVVVVILGILAAMVVPKVLDRPDQARATAARQDISGLMQALKLYRL 70
+ QRGFTL+EIMVV+VI+G+LA++VVP ++ ++A A DI L AL +Y+L
Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKL 61

Query: 71 DQGRYPSQAQGLKVLAERP-ADASASNWRS--YLERLPNDPWGKPYQYLNPGVNGEIDVF 127
D YP+ QGL+ L E P A+N+ Y++RLP DPWG Y +NPG +G D+
Sbjct: 62 DNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLL 121

Query: 128 SLGADGQPGGEGINADIGSWQL 149
S G DG+ G E DI +W L
Sbjct: 122 SAGPDGEMGTED---DITNWGL 140


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0680BCTERIALGSPG316e-04 Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 31.0 bits (70), Expect = 6e-04
Identities = 19/62 (30%), Positives = 34/62 (54%), Gaps = 3/62 (4%)

Query: 8 RGFTLIEVLVALAIVAIALAAAIRAVGLMTDGNGLLRDKSLA-LLAAESRLAELRLGVGA 66
RGFTL+E++V IV I + A++ LM + + K+++ ++A E+ L +L
Sbjct: 8 RGFTLLEIMV--VIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDNHH 65

Query: 67 AP 68
P
Sbjct: 66 YP 67


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0678BCTERIALGSPH511e-10 Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H

signature.
Length = 170

Score = 50.7 bits (121), Expect = 1e-10
Identities = 30/129 (23%), Positives = 46/129 (35%), Gaps = 7/129 (5%)

Query: 5 RQSGFTLIELMVVLVIVGIATAAISLSARPDPTGLLRQDAARLARLLEIAQGEARVRGTP 64
RQ GFTL+E+M++L+++G++ + L+ Q AR L Q G
Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQF 61

Query: 65 ILWQPSAKGYRFSPQAYRGKTDAFAADTELRARDWQAAPLRVSVRPPRPVLLDAEWIGAP 124
++F R D AD W PLR V G
Sbjct: 62 FGVSVHPDRWQFLVLEARDGADPAPADDGWSGYRWL--PLR-----AGRVATSGSIAGGK 114

Query: 125 LRITLSDGQ 133
L + + G+
Sbjct: 115 LNLAFAQGE 123


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0677BCTERIALGSPG326e-04 Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 32.2 bits (73), Expect = 6e-04
Identities = 18/60 (30%), Positives = 32/60 (53%), Gaps = 3/60 (5%)

Query: 12 RRQAGFTLIEVMVAIMLMAIV-SLMAWRGLDSIARASAHLEDSTEQGAALLRALNQLERD 70
+Q GFTL+E+MV I+++ ++ SL+ + + +A S AL AL+ + D
Sbjct: 5 DKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVS--DIVALENALDMYKLD 62


120PA0603PA0599N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA06030130.082999ABC transporter ATP-binding protein
PA06020140.517403ABC transporter
PA0601-1130.581335two-component response regulator
PA0600-115-1.698400two-component sensor
PA0599014-2.533586hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0603PF05272340.001 Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 33.9 bits (77), Expect = 0.001
Identities = 15/88 (17%), Positives = 26/88 (29%), Gaps = 20/88 (22%)

Query: 40 LTLLGPSGSGKTTSLMMLAGFETPTAGEILLAGRSINNVPPHKRDIGMVFQNYALFPHMT 99
+ L G G GK+T + L G + + + +D
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDI---------GTGKDSYEQIAGIVA---YE 646

Query: 100 VAENLAFPLSVRGMSKTDVKERVKRALS 127
++E + + D E VK S
Sbjct: 647 LSE-------MTAFRRADA-EAVKAFFS 666


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0602BICOMPNTOXIN290.028 Staphylococcal bi-component toxin signature.
		>BICOMPNTOXIN#Staphylococcal bi-component toxin signature.

Length = 315

Score = 28.7 bits (64), Expect = 0.028
Identities = 14/61 (22%), Positives = 24/61 (39%), Gaps = 11/61 (18%)

Query: 135 KVAGSPQGWADFWDVKKFPGKRGLRWGAKYSLEFALMADGV-----APK------DVYQT 183
K+ G +++ KK + +RW +Y++ V PK +V QT
Sbjct: 80 KMQGFISSRTTYYNYKKTNHVKAMRWPFQYNIGLKTNDKYVSLINYLPKNKIESTNVSQT 139

Query: 184 L 184
L
Sbjct: 140 L 140


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0601HTHFIS644e-14 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 63.7 bits (155), Expect = 4e-14
Identities = 31/116 (26%), Positives = 51/116 (43%), Gaps = 2/116 (1%)

Query: 3 IRVLVAEDHTIVREGIKQLIGMAKDLQVVGEATNGEQLLETLRGTPCEVVLLDISMPGVN 62
+LVA+D +R + Q + A V +N L + ++V+ D+ MP N
Sbjct: 4 ATILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 63 GLEAIPRIRALNEPPAILVLSMHDEAQMAARALKIGAAGYATKDSDPALLLTAIRR 118
+ +PRI+ +LV+S + A +A + GA Y K D L+ I R
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0599CHANLCOLICIN290.031 Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 28.9 bits (64), Expect = 0.031
Identities = 31/124 (25%), Positives = 47/124 (37%), Gaps = 17/124 (13%)

Query: 120 AGWQTLSLALPDPQSTAPVTRPAESAASASADKDA---------------SAADSASKPD 164
A W T L + A AE+ A A A++DA +A+ + S +
Sbjct: 55 AKWSTAQLKKTQAEQAARAKAAAEAQAKAKANRDALTQRLKDIVNEALRHNASRTPSATE 114

Query: 165 VKGESGNA--PAPESTAEAGSGEPAQSEDQAPPPAIDPVEQRKAHAERVMARLQASIDLA 222
+ + A E A + E A+ E +A A EQR+ ER A + + LA
Sbjct: 115 LAHANNAAMQAEDERLRLAKAEEKARKEAEAAEKAFQEAEQRRKEIEREKAETERQLKLA 174

Query: 223 LQHE 226
E
Sbjct: 175 EAEE 178


121PA0463PA0458N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA04630140.127169DNA-binding response regulator CreB
PA0462113-0.078842hypothetical protein
PA04611120.014618acyltransferase
PA04600110.127630hypothetical protein
PA04590120.453229chaperone protein ClpB
PA04582100.670561major facilitator superfamily transporter
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0463HTHFIS815e-20 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 81.4 bits (201), Expect = 5e-20
Identities = 37/134 (27%), Positives = 64/134 (47%), Gaps = 1/134 (0%)

Query: 2 PHILIVEDEAAIADTLLYALQAEGFATTWVTLAGEALALQERQPADLLILDVGLPDISGF 61
IL+ +D+AAI L AL G+ + A DL++ DV +PD + F
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 EACKRLR-RFSEVPVIFLTARDAEIDRVVGLEIGADDYVVKPFSPREVAARVKAILKRMA 120
+ R++ ++PV+ ++A++ + + E GA DY+ KPF E+ + L
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 121 PRPAALEEAAPSGP 134
RP+ LE+ + G
Sbjct: 124 RRPSKLEDDSQDGM 137


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0462NEISSPPORIN280.027 Neisseria sp. porin signature.
		>NEISSPPORIN#Neisseria sp. porin signature.

Length = 348

Score = 27.6 bits (61), Expect = 0.027
Identities = 14/25 (56%), Positives = 18/25 (72%), Gaps = 1/25 (4%)

Query: 1 MKRALALLSLFALPVLA-AEPNLYG 24
MK++L L+L ALPV A A+ LYG
Sbjct: 1 MKKSLIALTLAALPVAAMADVTLYG 25


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0459HTHFIS496e-08 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 49.1 bits (117), Expect = 6e-08
Identities = 66/351 (18%), Positives = 116/351 (33%), Gaps = 48/351 (13%)

Query: 494 TAEEREKLLQMEERLHQRVIG---QQEAITAVSDAVRLARAGLRQGSRPIATFLFLGPTG 550
+ L +R ++ + S A++ L + + T + G +G
Sbjct: 111 LIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESG 170

Query: 551 VGKTELAKALAEVVFGDEDAMIRIDMSEYMERHAVSRLIGAPPGYVGYDEGGQLTERVRR 610
GK +A+AL + + I+M+ S L G E G T R
Sbjct: 171 TGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGH--------EKGAFTGAQTR 222

Query: 611 RPYSV-------ILLDEIEKAHADVNNILLQVFDDGRLTDGKGRVVDFTNTIIIATSNLG 663
+ LDEI D LL+V G T GR ++ I+A +N
Sbjct: 223 STGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATN-- 280

Query: 664 SELIMKNAQAGEFAQPPEKLKRELMTTLRGHFRPEFLNRLDEVIVFESLSKAQIEDIVRL 723
+ ++ + G FR + RL+ V + + + EDI L
Sbjct: 281 --------------KDLKQSINQ------GLFREDLYYRLNVVPLRLPPLRDRAEDIPDL 320

Query: 724 QLERVKRAAHAQDIYLHIDDSLVGHLAEEAYQPEFGARELKRQIRQQLETRLATAMLKGE 783
V++A D + + +A+ REL+ +R+ TA+ +
Sbjct: 321 VRHFVQQAEKEGLDVKRFDQEALELM--KAHPWPGNVRELENLVRR------LTALYPQD 372

Query: 784 VKEGETVTFFYDAKDGVGYRKGAAPNPAARKKSGAGETPRGRATAARKPAA 834
V E + ++ + AA + S A E + A+ A
Sbjct: 373 VITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDAL 423


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0458TCRTETB1209e-32 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 120 bits (302), Expect = 9e-32
Identities = 91/416 (21%), Positives = 177/416 (42%), Gaps = 17/416 (4%)

Query: 6 QLTPRIARQLPWLVAVAFFMQALDGTILNTALPSMASSLNENPLRMQAVVIAYLLTVALL 65
Q R + L WL ++FF L+ +LN +LP +A+ N+ P V A++LT ++
Sbjct: 7 QSNLRHNQILIWLCILSFF-SVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIG 65

Query: 66 IPASGWIADRFGTRRVFLGAVLLFSLGSLLCALSPS-LELLVGARIVQGVGGALMMPVGR 124
G ++D+ G +R+ L +++ GS++ + S LL+ AR +QG G A +
Sbjct: 66 TAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVM 125

Query: 125 LVILRVYPRQDLVRVLSFVTIPGLLGPLAGPTLGGWLVEYASWHWIFLINLP-VGLLGCL 183
+V+ R P+++ + + +G GP +GG + Y HW +L+ +P + ++
Sbjct: 126 VVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIPMITIITVP 183

Query: 184 VAMKLMPDLRSPVPSRFDSIGFLLFGGSMVLISIALEGLGELHLSHLRVVLLLIGGLVLL 243
MKL+ + FD G +L +V + L + V+ LI
Sbjct: 184 FLMKLLKKEVR-IKGHFDIKGIILMSVGIVFFMLFTTSYSISFLI-VSVLSFLI------ 235

Query: 244 TAYWLRALRIDKPLFPPSLFKARTFAVGILGNLFARLGSGALPFLTPLLLQVGLGYPPST 303
+ ++ P P L K F +G+L + P +++ +
Sbjct: 236 --FVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAE 293

Query: 304 AG-MTMIPLALFAMVAKPMAKPLLDFFGYRKLLVGNTLILGCLIAGFGLVDQDTPYVWLL 362
G + + P + ++ + L+D G +L L + + T + +
Sbjct: 294 IGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTI 353

Query: 363 LHLSLLGAVNSLQFTAMNTLTLIDLQDSNASSGNSLMSVVVQLSISLGVACAAALL 418
+ + +LG ++ + T ++T+ L+ A +G SL++ LS G+A LL
Sbjct: 354 IIVFVLGGLSFTK-TVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLL 408


122PA0428PA0421N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA0428012-1.110334ATP-dependent RNA helicase
PA0427013-1.728614outer membrane protein OprM
PA0426-113-2.021413multidrug resistance protein MexB
PA0425-110-1.446958multidrug resistance protein MexA
PA0424-39-0.873026multidrug resistance operon repressor MexR
PA0423-280.498054hypothetical protein
PA0422-281.232363hypothetical protein
PA0421-271.964034hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0428SECA381e-04 SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 37.9 bits (88), Expect = 1e-04
Identities = 28/108 (25%), Positives = 49/108 (45%), Gaps = 7/108 (6%)

Query: 212 IEVTPPNTTVERIEQ--RVFRLPAPQKRALLAHLVTVGAWEQ-VLVFTRTKHGANRLAEY 268
V P N + R + V+ A + +A++ + A Q VLV T + + ++
Sbjct: 409 TVVVPTNRPMIRKDLPDLVYMTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSELVSNE 468

Query: 269 LTKHGLPAAAIHG-NKSQNARTKALADFKANDVRILVATDIAARGLDI 315
LTK G+ ++ + A A A + A + +AT++A RG DI
Sbjct: 469 LTKAGIKHNVLNAKFHANEAAIVAQAGYPAA---VTIATNMAGRGTDI 513


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0426ACRIFLAVINRP13530.0 Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1353 bits (3503), Expect = 0.0
Identities = 692/1034 (66%), Positives = 838/1034 (81%), Gaps = 3/1034 (0%)

Query: 1 MSKFFIDRPIFAWVIALVIMLAGGLSILSLPVNQYPAIAPPAIAVQVSYPGASAETVQDT 60
M+ FFI RPIFAWV+A+++M+AG L+IL LPV QYP IAPPA++V +YPGA A+TVQDT
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VVQVIEQQMNGIDNLRYISSESNSDGSMTITVTFEQGTDPDIAQVQVQNKLQLATPLLPQ 120
V QVIEQ MNGIDNL Y+SS S+S GS+TIT+TF+ GTDPDIAQVQVQNKLQLATPLLPQ
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 EVQRQGIRVTKAVKNFLMVVGVVSTDGSMTKEDLSNYIVSNIQDPLSRTKGVGDFQVFGS 180
EVQ+QGI V K+ ++LMV G VS + T++D+S+Y+ SN++D LSR GVGD Q+FG+
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 QYSMRIWLDPAKLNSYQLTPGDVSSAIQAQNVQISSGQLGGLPAVKGQQLNATIIGKTRL 240
QY+MRIWLD LN Y+LTP DV + ++ QN QI++GQLGG PA+ GQQLNA+II +TR
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 241 QTAEQFENILLKVNPDGSQVRLKDVADVGLGGQDYSINAQFNGSPASGIAIKLATGANAL 300
+ E+F + L+VN DGS VRLKDVA V LGG++Y++ A+ NG PA+G+ IKLATGANAL
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 301 DTAKAIRQTIANLEPFMPQGMKVVYPYDTTPVVSASIHEVVKTLGEAILLVFLVMYLFLQ 360
DTAKAI+ +A L+PF PQGMKV+YPYDTTP V SIHEVVKTL EAI+LVFLVMYLFLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 361 NFRATLIPTIAVPVVLLGTFGVLAAFGFSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420
N RATLIPTIAVPVVLLGTF +LAAFG+SINTLTMFGMVLAIGLLVDDAIVVVENVERVM
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 421 AEEGLSPREAARKSMGQIQGALVGIAMVLSAVFLPMAFFGGSTGVIYRQFSITIVSAMAL 480
E+ L P+EA KSM QIQGALVGIAMVLSAVF+PMAFFGGSTG IYRQFSITIVSAMAL
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 481 SVIVALILTPALCATMLKPIEKGDHGEHKGGFFGWFNRMFLSTTHGYERGVASILKHRAP 540
SV+VALILTPALCAT+LKP+ H E+KGGFFGWFN F + + Y V IL
Sbjct: 481 SVLVALILTPALCATLLKPVSAE-HHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 541 YLLIYVVIVAGMIWMFTRIPTAFLPDEDQGVLFAQVQTPPGSSAERTQVVVDSMREYLLE 600
YLLIY +IVAGM+ +F R+P++FLP+EDQGV +Q P G++ ERTQ V+D + +Y L+
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 601 KESSSVSSVFTVTGFNFAGRGQSSGMAFIMLKPWEERPGGENSVFELAKRAQMHFFSFKD 660
E ++V SVFTV GF+F+G+ Q++GMAF+ LKPWEER G ENS + RA+M +D
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659

Query: 661 AMVFAFAPPSVLELGNATGFDLFLQDQAGVGHEVLLQARNKFLMLAAQNPA-LQRVRPNG 719
V F P+++ELG ATGFD L DQAG+GH+ L QARN+ L +AAQ+PA L VRPNG
Sbjct: 660 GFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNG 719

Query: 720 MSDEPQYKLEIDDEKASALGVSLADINSTVSIAWGSSYVNDFIDRGRVKRVYLQGRPDAR 779
+ D Q+KLE+D EKA ALGVSL+DIN T+S A G +YVNDFIDRGRVK++Y+Q R
Sbjct: 720 LEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFR 779

Query: 780 MNPDDLSKWYVRNDKGEMVPFNAFATGKWEYGSPKLERYNGVPAMEILGEPAPGLSSGDA 839
M P+D+ K YVR+ GEMVPF+AF T W YGSP+LERYNG+P+MEI GE APG SSGDA
Sbjct: 780 MLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDA 839

Query: 840 MAAVEEIVKQLPKGVGYSWTGLSYEERLSGSQAPALYALSLLVVFLCLAALYESWSIPFS 899
MA +E + +LP G+GY WTG+SY+ERLSG+QAPAL A+S +VVFLCLAALYESWSIP S
Sbjct: 840 MALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVS 899

Query: 900 VMLVVPLGVIGALLATSMRGLSNDVFFQVGLLTTIGLSAKNAILIVEFAKELHE-QGKGI 958
VMLVVPLG++G LLA ++ NDV+F VGLLTTIGLSAKNAILIVEFAK+L E +GKG+
Sbjct: 900 VMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGV 959

Query: 959 VEAAIEACRMRLRPIVMTSLAFILGVVPLAISTGAGSGSQHAIGTGVIGGMVTATVLAIF 1018
VEA + A RMRLRPI+MTSLAFILGV+PLAIS GAGSG+Q+A+G GV+GGMV+AT+LAIF
Sbjct: 960 VEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIF 1019

Query: 1019 WVPLFYVAVSTLFK 1032
+VP+F+V + FK
Sbjct: 1020 FVPVFFVVIRRCFK 1033


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0425RTXTOXIND478e-08 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 47.1 bits (112), Expect = 8e-08
Identities = 24/93 (25%), Positives = 43/93 (46%), Gaps = 1/93 (1%)

Query: 62 RIAEVRPQVNGIILKRLFKEGSDVKAGQQLYQIDPATYEADYQSAQANLASTQEQAQRYK 121
R E++P N I+ + + KEG V+ G L ++ EAD Q++L + + RY+
Sbjct: 95 RSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQ 154

Query: 122 LLVADQAVSKQQYADANA-AYLQSKAAVEQARI 153
+L ++K Y Q+ + E R+
Sbjct: 155 ILSRSIELNKLPELKLPDEPYFQNVSEEEVLRL 187



Score = 42.9 bits (101), Expect = 1e-06
Identities = 42/268 (15%), Positives = 93/268 (34%), Gaps = 37/268 (13%)

Query: 37 EVGIVTLEAQTVTLNTELPGRTNAFRIAEVRPQVNGIILKRLFKEGSDVKAGQQLYQIDP 96
E+ + A+ +T+ + N R+ + R L + + K +
Sbjct: 206 ELNLDKKRAERLTVLARINRYENLSRVEKSRLD----DFSSLLHKQAIAKHAVLEQENKY 261

Query: 97 ATYEADYQSAQANLASTQEQAQRYK--LLVADQAVSKQ---QYADANAAYLQSKAAVEQA 151
+ + ++ L + + K + Q + + + +
Sbjct: 262 VEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKN 321

Query: 152 RINLRYTKVLSPISGRIGRSAV-TEGALVTNGQANAMATVQQLDPIYVDVTQPSTALLRL 210
+ + + +P+S ++ + V TEG +VT + M V + D + V + + +
Sbjct: 322 EERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE-TLMVIVPEDDTLEVTALVQNKDIGFI 380

Query: 211 RRELASGQLERAGDNAAKVSLKLE--DGSQYP-LEGRLE--FSEVSVDEGTGSVT--IRA 263
GQ +K+E ++Y L G+++ + D+ G V I +
Sbjct: 381 N----VGQ---------NAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIIS 427

Query: 264 V------FPNPNNELLPGMFVHAQLQEG 285
+ N N L GM V A+++ G
Sbjct: 428 IEENCLSTGNKNIPLSSGMAVTAEIKTG 455


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0421FLGFLGJ300.016 Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 30.5 bits (68), Expect = 0.016
Identities = 24/128 (18%), Positives = 43/128 (33%), Gaps = 6/128 (4%)

Query: 171 NLSPTAR----LLVNQRIRSRYDEPSRLSLLYLAQQGRAYRGVDDRDLRAARLPGGSQVL 226
N+ P AR + V ++S D + L+ ++ R Y + D+ + G L
Sbjct: 32 NIRPVARQVEGMFVQMMLKSMRDALPK-DGLFSSEHTRLYTSMYDQQIAQQMTAGKGLGL 90

Query: 227 AEAFVKQIKTIKTKSKVSSIVQAKDGVAVKAGSETYKADYVVLAVPLKALGQIQMTPSLS 286
AE VKQ+ + + S+ A ++ L P S
Sbjct: 91 AEMMVKQMTPEQPLPEEST-PAAPMKFPLETVVRYQNQALSQLVQKAVPRNYDDSLPGDS 149

Query: 287 GTQMSALK 294
++ L
Sbjct: 150 KAFLAQLS 157


123PA0414PA0402N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA0414190.903620methylesterase
PA0413180.497700chemotactic signal transduction system protein
PA0412211-2.550658methyltransferase PilK
PA0411212-2.328213twitching motility protein PilJ
PA0410012-1.938277twitching motility protein PilI
PA0409014-1.572132twitching motility protein PilH
PA0408014-0.853585pilus biosynthesis/twitching motility protein
PA0407-212-0.150935glutathione synthetase
PA0406-1131.487865transporter TonB
PA0405-1131.940718hypothetical protein
PA04040121.775867Holliday junction resolvase
PA04031121.127266bifunctional pyrimidine regulatory protein
PA04022130.911422aspartate carbamoyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0414HTHFIS300.013 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.2 bits (68), Expect = 0.013
Identities = 21/82 (25%), Positives = 34/82 (41%), Gaps = 3/82 (3%)

Query: 7 PRVAVIADTSLQRHVLQQALLGHGYEVVLNADPARVDDAALECAPDLWLVDLTQQDDS-- 64
+ V D + R VL QAL GY+V + ++ A + DL + D+ D++
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 65 PLLDSLLEQD-RAPVLFGEGHA 85
LL + + PVL
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQN 85


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0413HTHFIS682e-13 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 68.3 bits (167), Expect = 2e-13
Identities = 26/113 (23%), Positives = 54/113 (47%), Gaps = 2/113 (1%)

Query: 2353 VMVVDDSVTVRKVTTRLLERNGMNVLTAKDGVDAIAQLQEHRPDILLLDIEMPRMDGFEV 2412
++V DD +R V + L R G +V + + D+++ D+ MP + F++
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 2413 ATLVRHDERLGNLPIIMITSRTGEKHRERALGIGVNQYLGKPYQETELLEAIQ 2465
++ + +LP+++++++ +A G YL KP+ TEL+ I
Sbjct: 66 LPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0409HTHFIS808e-21 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.9 bits (197), Expect = 8e-21
Identities = 33/119 (27%), Positives = 51/119 (42%), Gaps = 2/119 (1%)

Query: 2 ARILIVDDSPTEMYKLTAMLEKHGHQVLKAENGGDGVALARQEKPDVVLMDIVMPGLNGF 61
A IL+ DD L L + G+ V N D+V+ D+VMP N F
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 QATRQLTKDAETSAIPVIIVTTKDQETDKVWGKRQGARDYLTKPVDEETLLKTINAVLA 120
++ K +PV++++ ++ + +GA DYL KP D L+ I LA
Sbjct: 64 DLLPRIKKARPD--LPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0408HTHFIS733e-18 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 72.9 bits (179), Expect = 3e-18
Identities = 32/117 (27%), Positives = 51/117 (43%), Gaps = 2/117 (1%)

Query: 6 DGLKVMVIDDSKTIRRTAETLLKKVGCDVITAIDGFDALAKIADTHPNIIFVDIMMPRLD 65
G ++V DD IR L + G DV + IA +++ D++MP +
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 66 GYQTCALIKNNSAFKSTPVIMLSSKDGLFDKAKGRIVGSDQYLTKPFSKEELLGAIK 122
+ IK A PV+++S+++ K G+ YL KPF EL+G I
Sbjct: 62 AFDLLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0406PF03544592e-12 Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 59.2 bits (143), Expect = 2e-12
Identities = 31/183 (16%), Positives = 58/183 (31%), Gaps = 14/183 (7%)

Query: 117 APFQDNQVKKVAPPAT--------PKQARSEEAPKVAVTTTRQRQQKAPSKTQAQKAEQV 168
AP Q V VAP P + E P+ ++ + K +
Sbjct: 45 APAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPK 104

Query: 169 AKPAPHFDSTQLSAEIASLEADLAKEQQAYAKRPRIHRLSAASTMRDKGAWYKEDWRKKI 228
KP Q ++ + P S A+ K + +
Sbjct: 105 PKPVK--KVEQPKRDVKP--VESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRAL 160

Query: 229 ERIGNLNYPDEARRQKLYGSLRLLVSINRDGTIYEVQVLESSGEPILDQAAQRIVRLAAP 288
R YP A+ ++ G +++ + DG + VQ+L + + ++ + +R
Sbjct: 161 SRN-QPQYPARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNAMR-RWR 218

Query: 289 YAP 291
Y P
Sbjct: 219 YEP 221


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0402TYPE3IMPPROT290.032 Type III secretion system inner membrane P protein ...
		>TYPE3IMPPROT#Type III secretion system inner membrane P protein

family signature.
Length = 224

Score = 28.6 bits (64), Expect = 0.032
Identities = 10/41 (24%), Positives = 17/41 (41%)

Query: 293 ADGAQSVILNQVTYGIAIRMAVLSMAMSGQNTQRQLEQEDA 333
A G Q + N G+A+ +++ M + E ED
Sbjct: 40 ALGLQQIPSNMTLNGVALLLSMFVMWPIMHDAYVYFEDEDV 80


124PA0373PA0367N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA03730121.395682signal recognition particle receptor FtsY
PA03720110.810403zinc protease
PA0371-1110.766829hypothetical protein
PA0370-190.934947hypothetical protein
PA0368-290.913200hypothetical protein
PA0367a-39-0.273948acetyltransferase
PA0367-210-0.993440transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0373TONBPROTEIN474e-08 Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 47.3 bits (112), Expect = 4e-08
Identities = 23/95 (24%), Positives = 32/95 (33%), Gaps = 11/95 (11%)

Query: 59 SLTEQPGRQQP-----SAAEPAEPPAPVAEAPLAGDEPASAEEHSPRPEAPVAQPEPILA 113
+ E P QP EPP V P E P PE P+
Sbjct: 34 QVIELPAPAQPISVTMVTPADLEPPQAVQP------PPEPVVEPEPEPEPIPEPPKEAPV 87

Query: 114 AEPEPEPEPEPEPEPEPVAPLAAAPAVSEPATRPG 148
+P+P+P+P+P+P V +RP
Sbjct: 88 VIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESRPA 122



Score = 35.0 bits (80), Expect = 4e-04
Identities = 28/123 (22%), Positives = 40/123 (32%), Gaps = 16/123 (13%)

Query: 28 QAGEQPA-DQPVEPVSETAAAEQRAPADDVAQSLTEQPGRQQPSAAEPAEPPAPVAEAPL 86
Q E PA QP+ T A + A EP P P+ E P
Sbjct: 34 QVIELPAPAQPISVTMVTPADLEPPQA----------VQPPPEPVVEPEPEPEPIPEPP- 82

Query: 87 AGDEPASAEEHSPRPE-APVAQPEPILAAEPEPEPEPEPEPEPEPVAPLAAAPAVSEPAT 145
+ A P+P+ P +P + +P+ + +P P A A S AT
Sbjct: 83 ---KEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESRPASPFENTAPARLTSSTAT 139

Query: 146 RPG 148

Sbjct: 140 AAT 142


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0371PHPHTRNFRASE340.002 Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase

signature.
Length = 572

Score = 33.6 bits (77), Expect = 0.002
Identities = 27/109 (24%), Positives = 46/109 (42%), Gaps = 16/109 (14%)

Query: 228 PTISREQLQAFHKKAYAAGN--VVIALVGDLS--RQEAEAIAAEVSKALPQGPALAKTVQ 283
I R QL+A +A GN V+ ++ L RQ + E K L +G ++ +++
Sbjct: 368 QDIFRTQLRAL-LRASTYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGVDVSDSIE 426

Query: 284 P----ETPKPGLT------HIDFPSEQTH-LMLAQLGIDRQDPDYAALY 321
E P + +DF S T+ L+ + DR + + LY
Sbjct: 427 VGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLY 475


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0368PF06057290.024 Type IV secretory pathway VirJ component
		>PF06057#Type IV secretory pathway VirJ component

Length = 243

Score = 29.0 bits (65), Expect = 0.024
Identities = 14/73 (19%), Positives = 28/73 (38%), Gaps = 15/73 (20%)

Query: 79 GLQRALLERGWASVALN-----WRGCSGEPNRLPRGYHSGVSDDLAEVVAHLRARRPQAP 133
+ L ++GW V + W+ + P+ V+ D ++ +A
Sbjct: 69 AVGGILQQQGWPVVGWSSLKYYWK------QKDPKD----VTQDTLAIIDKYQAEFGTQK 118

Query: 134 LYAVGYSLGGNVL 146
+ +GYS G V+
Sbjct: 119 VILIGYSFGAEVI 131


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0367HTHTETR595e-13 TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 59.3 bits (143), Expect = 5e-13
Identities = 30/170 (17%), Positives = 63/170 (37%), Gaps = 11/170 (6%)

Query: 7 TRDRIAQASLELFNAQGERSVTTNHIATHLGISPGNLYYHYPNKQAIIAELFAEYESHVE 66
TR I +L LF+ QG S + IA G++ G +Y+H+ +K + +E++ ES++
Sbjct: 12 TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIG 71

Query: 67 SFLRLPEGRGLTVDDKTF--YLEALLAAMWRYRFLHRDLEHLLESD------PELAARYR 118
+ + L +L + +E + + R
Sbjct: 72 ELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQR 131

Query: 119 AFAQRCLVNAKAIYRGFTEAGILR-MNETQLEALTLNAWI--ILTSWVRF 165
+ + EA +L T+ A+ + +I ++ +W+
Sbjct: 132 NLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFA 181


125PA0167PA0156N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PA0167-1120.877365transcriptional regulator
PA0166-1100.692228transporter
PA01651100.211279hypothetical protein
PA01640110.116830gamma-glutamyltranspeptidase
PA016309-0.204333transcriptional regulator
PA0162190.015461histidine porin OpdC
PA0160290.630402hypothetical protein
PA01592100.712671transcriptional regulator
PA01581110.653933resistance-nodulation-cell division (RND) efflux
PA01572122.405382resistance-nodulation-cell division (RND) efflux
PA01561112.051556resistance-nodulation-cell division (RND) efflux
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0167HTHTETR662e-15 TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 66.2 bits (161), Expect = 2e-15
Identities = 32/219 (14%), Positives = 68/219 (31%), Gaps = 30/219 (13%)

Query: 15 KPAGRIRQKNEEAILAAAEEEFARHGFKGTSMNTIAQNVGLPKANLHYYFGNKLGLYTAV 74
+ + Q+ + IL A F++ G TS+ IA+ G+ + ++++F +K L++ +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 75 LSNILELWDSTFNTLGVD--DDPAEALARYIRAKMEFSRRYPLASRIFA----------- 121
DP L + +E + +
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 122 MEIISGGECLTAHFNQDYRSWFRGRAAVFEAWIAAGRMDP-VDPVHLIFLLWGSTQHYAD 180
M ++ + + + + I A + + ++ G Y
Sbjct: 123 MAVVQQAQ---RNLCLESYDRI---EQTLKHCIEAKMLPADLMTRRAAIIMRG----YIS 172

Query: 181 FASQIGLVTGR-KRMSRQDFAAAADNLVRIILKGCGLTP 218
GL+ D A + V I+L+ L P
Sbjct: 173 -----GLMENWLFAPQSFDLKKEARDYVAILLEMYLLCP 206


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0165CHANNELTSX467e-08 Nucleoside-specific channel-forming protein Tsx signa...
		>CHANNELTSX#Nucleoside-specific channel-forming protein Tsx

signature.
Length = 294

Score = 45.8 bits (108), Expect = 7e-08
Identities = 38/135 (28%), Positives = 58/135 (42%), Gaps = 9/135 (6%)

Query: 14 LLAAGQAVAEDHDMTPTHETDSGPLL---WHNESLTYLYGKNFKINPPIQQTFTLEHAS- 69
LLAAG VA + P W ++S+ + + + P I+ LE+ +
Sbjct: 5 LLAAGAVVALSTTFAAGAAENDKPQYLSDWWHQSVNVVGSYHTRFGPQIRNDTYLEYEAF 64

Query: 70 -GWTWGDLFIFFDQ-INYNGKEDAS---NGKNTYYGEITPRLSFGKLTGADLSFGPVKDV 124
W D + + D + + G A N + + EI PR S KLT DLSFGP K+
Sbjct: 65 AKKDWFDFYGYIDAPVFFGGNSTAKGIWNKGSPLFMEIEPRFSIDKLTNTDLSFGPFKEW 124

Query: 125 LLAGTYEFGEGDTEA 139
A Y + G ++
Sbjct: 125 YFANNYIYDMGRNDS 139


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0164PF09025300.015 YopR Core
		>PF09025#YopR Core

Length = 143

Score = 29.6 bits (66), Expect = 0.015
Identities = 26/90 (28%), Positives = 33/90 (36%), Gaps = 10/90 (11%)

Query: 134 LPFEQLL---RPAIELARDGFPVSPVIARLWQSGLDKFRAALPQRPELRAWFDEFLIDGR 190
L FEQ L PA G + RL Q + R EL+A L GR
Sbjct: 30 LAFEQALGGEPPAAGRRLAGLENGALGERLLQRFAQPLQGLEADRLELKAMLRAELPLGR 89

Query: 191 APRA------GEVFRQPGQADTLDELARSQ 214
+ G V PG + L +LAR +
Sbjct: 90 QQQTFLLQLLGAVEHAPG-GEYLAQLARRE 118


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0163PHAGEIV300.009 Gene IV protein signature.
		>PHAGEIV#Gene IV protein signature.

Length = 426

Score = 30.3 bits (68), Expect = 0.009
Identities = 15/76 (19%), Positives = 29/76 (38%), Gaps = 3/76 (3%)

Query: 105 TRCRVLEVTPLARELIKSFCELPVDYPEGDSAESRLVQVLLDQLRLLPEVAFSLPMPREP 164
R ++ + +KS + D + +V D L LP+ ++ +P +
Sbjct: 138 NNVRAKDLIRVVELFVKSNTSKSSNVLSVDGSNLLVVSAPKDILDNLPQFLSTVDLPTDQ 197

Query: 165 RLLRLCQALIDEPTQS 180
L+ + LI E Q
Sbjct: 198 ILI---EGLIFEVQQG 210


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0158ACRIFLAVINRP490e-159 Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 490 bits (1263), Expect = e-159
Identities = 240/1052 (22%), Positives = 444/1052 (42%), Gaps = 69/1052 (6%)

Query: 7 LSDWALRHQSLVWYLMAVSLVMGVFSYLNLGREEDPSFAIKTMVIQTRWPGATVDDTLEQ 66
++++ +R W L + ++ G + L L + P+ A + + +PGA +
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 67 VTDRIEKKLEELDSLDYVKSYT-RPGESTVFVYLKDTTKAGDIPDIWYQVRKKISDIQGE 125
VT IE+ + +D+L Y+ S + G T+ + + T D QV+ K+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGT---DPDIAQVQVQNKLQLATPL 117

Query: 126 FPQGIQGPG-FNDEFGDVFGSVYAFTADGLDFRQ--LRDYVEKVRLD-IRSVKDLGKVQM 181
PQ +Q G ++ + V F +D Q + DYV D + + +G VQ+
Sbjct: 118 LPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQL 177

Query: 182 IGAQNEV-IYLNFSTRKLAALGLDQRQVVQSLQAQNAVTPSGVVEAGPE------RISVR 234
GAQ + I+L+ L L V+ L+ QN +G + P S+
Sbjct: 178 FGAQYAMRIWLDAD--LLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASII 235

Query: 235 TSGNFRSEKDLQAVNLRVNDRFY--RLSDLASISRDFVDPPTSLFRYKGEPAIGLAVAMK 292
F++ ++ V LRVN RL D+A + + + R G+PA GL + +
Sbjct: 236 AQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARV-ELGGENYNVIARINGKPAAGLGIKLA 294

Query: 293 EGGNILEFGEALNARMQEITGELPVGVGVHQVSNQAQVVKKAVGGFTRALFEAVVIVLIV 352
G N L+ +A+ A++ E+ P G+ V + V+ ++ + LFEA+++V +V
Sbjct: 295 TGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLV 354

Query: 353 SFVSLG-LRAGLVVACSIPLVLAMVFVFMEYTDITMQRVSLGALIIALGLLVDDAMITVE 411
++ L +RA L+ ++P+VL F + ++ +++ +++A+GLLVDDA++ VE
Sbjct: 355 MYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVE 414

Query: 412 MMITRLELGDSLHDSATY-AYTSTAFPMLTGTLVTVAGFVPIGLNASSAGEYTFTLFAVI 470
+ + AT + + ++ +V A F+P+ S G I
Sbjct: 415 NVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITI 474

Query: 471 AVALLLSWIVAVLFAPVIAVHILPKTLKHKSEQKKG---RIAERFDSLLHLA-------M 520
A+ LS +VA++ P + +L E K G FD ++ +
Sbjct: 475 VSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKIL 534

Query: 521 RRRWTTIFLTALLFGVSLFLMKFVQHQFFPSSDRPELLVDLNLPQNSSIHETRAVMDR-L 579
+ + AL+ + L + F P D+ L + LP ++ T+ V+D+
Sbjct: 535 GSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVT 594

Query: 580 EATLKDDEDID-HWSAYVGEGAIRFYLPLDQQLQNNFYGQLVIVTKDLEAR---ERVAAR 635
+ LK+++ G Q QN + K E R E A
Sbjct: 595 DYYLKNEKANVESVFTVNGFS-------FSGQAQNAGMAF--VSLKPWEERNGDENSAEA 645

Query: 636 LRDRLRKDYVGI-STYVQPLEMGPPV--------GRPIQYRVSGPQIDKVREYAMGLAGV 686
+ R + + I +V P M P + +G D + + L G+
Sbjct: 646 VIHRAKMELGKIRDGFVIPFNM-PAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGM 704

Query: 687 LDGNP-NIGDIVYDWNEPGKMLKIDIAQDKARQLGLSSEDVAQIMNSVVTGSAVTQVRDD 745
+P ++ + + E K+++ Q+KA+ LG+S D+ Q +++ + G+ V D
Sbjct: 705 AAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDR 764

Query: 746 IYLVNVIGRAEDSERGSLETLESLQIVTPSGTSIPLKAFAKVSYELEQPLVWRRDRKPTI 805
+ + +A+ R E ++ L + + +G +P AF + P + R + P++
Sbjct: 765 GRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSM 824

Query: 806 TVKASLRGEIQPTDLVARLAPEVKRFADGLPANYRIEVGGTVEESGKAEGPIAKVVPLML 865
+ +GE P ++ A LPA + G + + +V +
Sbjct: 825 EI----QGEAAPGTSSGDAMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISF 880

Query: 866 FLMATFLMIQLQSVQKLFLVASVAPLGLIGVVAALLPTGTPMGFVAILGILALIGIIIRN 925
++ L +S V V PLG++GV+ A ++G+L IG+ +N
Sbjct: 881 VVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKN 940

Query: 926 SVILVTQI-DAFEKDGKTPWEAVLEATHHRTRPILLTAAAASLGMIPIA------REVFW 978
++++V D EK+GK EA L A R RPIL+T+ A LG++P+A
Sbjct: 941 AILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQN 1000

Query: 979 GPMAYAMIGGIVAATLLTLIFLPALYVAWYRI 1010
+ ++GG+V+ATLL + F+P +V R
Sbjct: 1001 -AVGIGVMGGMVSATLLAIFFVPVFFVVIRRC 1031



Score = 79.1 bits (195), Expect = 5e-17
Identities = 79/517 (15%), Positives = 172/517 (33%), Gaps = 35/517 (6%)

Query: 518 LAMRRRWTTIFLTALLFGVSLFLMKFVQHQFFPSSDRPELLVDLNLPQNSSIHETRAVMD 577
+RR L +L + + +P+ P + V N P + V
Sbjct: 4 FFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQ 63

Query: 578 RLEATLKDDEDIDHWSAYVGEGAIRFYLPLDQQLQNNFYGQLVIVTKDLEARERVAARLR 637
+E + +++ + S+ + A + L Q + V V + L
Sbjct: 64 VIEQNMNGIDNLMYMSS-TSDSAGSVTITLTFQSGTDPDIAQVQVQ---NKLQLATPLLP 119

Query: 638 DRLRKDYVGISTYVQPLEMG----PPVGRPIQYRVSGPQIDKVREYAMGLAGVLDGNPNI 693
+++ + + M Q +S V++ L GV D
Sbjct: 120 QEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG 179

Query: 694 GDIVYDWNEPGKMLKIDIAQDKARQLGLSSEDVAQIMNS----VVTGSAVTQVRDDIYLV 749
++I + D + L+ DV + + G +
Sbjct: 180 AQ---------YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQL 230

Query: 750 NVIGRAEDSERGSLETLESLQI-VTPSGTSIPLKAFAKVSYELE-QPLVWRRDRKPTITV 807
N A+ + E + + V G+ + LK A+V E ++ R + KP +
Sbjct: 231 NASIIAQT-RFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGL 289

Query: 808 KASLRGEIQPTDLVARLAPEVKRFADGLPANYRIEVGGTVEESGKAEGPIAKVVPLML-- 865
L D + ++ P ++ + + + I +VV +
Sbjct: 290 GIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLY--PYDTTPFVQLSIHEVVKTLFEA 347

Query: 866 -FLMATFLMIQLQSVQKLFLVASVAPLGLIGVVAALLPTGTPMGFVAILGILALIGIIIR 924
L+ + + LQ+++ + P+ L+G A L G + + + G++ IG+++
Sbjct: 348 IMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVD 407

Query: 925 NSVILVTQI-DAFEKDGKTPWEAVLEATHHRTRPILLTAAAASLGMIPIA-----REVFW 978
+++++V + +D P EA ++ ++ A S IP+A +
Sbjct: 408 DAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIY 467

Query: 979 GPMAYAMIGGIVAATLLTLIFLPALYVAWYRIPEPGR 1015
+ ++ + + L+ LI PAL +
Sbjct: 468 RQFSITIVSAMALSVLVALILTPALCATLLKPVSAEH 504


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0157RTXTOXIND401e-05 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 39.8 bits (93), Expect = 1e-05
Identities = 16/103 (15%), Positives = 32/103 (31%), Gaps = 3/103 (2%)

Query: 63 TNGRIASRLFDVGDFVGKGALLATLDPTDQQNQLRASQGDLASAEAQLIDAQANARRQEE 122
N + + G+ V KG +L L + +Q L A + Q +R E
Sbjct: 103 ENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIEL 162

Query: 123 --LFARSVTAQARLDDARTR-LKTSQASFDQAKAAVQQARDQL 162
L + + + + + + + Q + Q
Sbjct: 163 NKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQK 205



Score = 39.0 bits (91), Expect = 2e-05
Identities = 23/182 (12%), Positives = 59/182 (32%), Gaps = 31/182 (17%)

Query: 51 IQARYESVLGFRTNGRIASRLFDVGDFVGKGALLATLDPTDQQNQLRASQGDLASAEAQL 110
I ++ S L + K A+L +Q+N+ + +L ++QL
Sbjct: 223 INRYENLSRVEKSRLDDFSSLLHKQ-AIAKHAVL------EQENKYVEAVNELRVYKSQL 275

Query: 111 IDAQANARRQEELFARSVTAQARLDDARTRLKTSQASFDQAKAAVQQARDQLSYTRLVTD 170
++ +E + Q ++ +L+ + + + + ++ + +
Sbjct: 276 EQIESEILSAKE--EYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAP 333

Query: 171 FDGVITTW--HAEAGQVVSAGQAVVTLARPEVREAVFDLPTEVAESLPADARFLVSAQLD 228
+ H E G VV+ + ++ + P D V+A +
Sbjct: 334 VSVKVQQLKVHTE-GGVVTTAETLMVIV-------------------PEDDTLEVTALVQ 373

Query: 229 PQ 230
+
Sbjct: 374 NK 375


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PA0156RTXTOXIND491e-08 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 49.1 bits (117), Expect = 1e-08
Identities = 23/129 (17%), Positives = 41/129 (31%), Gaps = 9/129 (6%)

Query: 75 VGGKIVERLVDVGDHVAAGQVLARLDP-------QDQRSNVENAQAAVAAQQAQSKLADL 127
+ E +V G+ V G VL +L +S++ A+ Q S+ +L
Sbjct: 103 ENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIEL 162

Query: 128 NYQRQKALLPKGYTSQSEYDQALASVRSAQSSLKAAQAQLANARDLLSYTELRASDAGVI 187
N + L + Y ++ L + Q Q L+ + RA V+
Sbjct: 163 NKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKE--LNLDKKRAERLTVL 220

Query: 188 TARQAEVGQ 196

Sbjct: 221 ARINRYENL 229



Score = 42.9 bits (101), Expect = 2e-06
Identities = 33/216 (15%), Positives = 71/216 (32%), Gaps = 26/216 (12%)

Query: 58 SITGDIQARVQADQSFRVGGKIVERLVDVGDHVAAGQVLARLDPQDQRSNVENAQAAVAA 117
++ I + + L+ A A L+ +++ N +
Sbjct: 218 TVLARINRYENLSRVEKSRLDDFSSLLHKQ----AIAKHAVLEQENKYVEAVNELRVYKS 273

Query: 118 QQAQSKLADLNYQRQKALLPKGYTSQSEYDQALASVRSAQSSLKAAQAQLANARDLLSYT 177
Q Q + L+ + + L+ + + ++ L +R ++ +LA + +
Sbjct: 274 QLEQIESEILSAKEEYQLVTQLFKNE-----ILDKLRQTTDNIGLLTLELAKNEERQQAS 328

Query: 178 ELRASDAGVITARQA-EVGQVVQATVPIFTLARDGERDAVFNVYESLFSHDVDGQRITVS 236
+RA + + + G VV + + + D V + + D+ I V
Sbjct: 329 VIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPE---DDTLEVTALVQNKDIG--FINVG 383

Query: 237 LLGKPEVTA---------SGKVREITP--TVDERSG 261
+V A GKV+ I D+R G
Sbjct: 384 QNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLG 419



 
Contact Sachin Pundhir for Bugs/Comments.
For best view 1024 x 768 resolution & IE 6.0 or above recommended.