PredictBias

Identification of genomic and pathogenicity islands in prokaryotic genomes
 
A) Input parameters
Genome: 2807.gbk
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic):
 
 
C) Potential GIs and PAIs in NC_010159
S.No  Start           End             Bias  Virulence  Insertion elements  Prediction
1     YpAngola_A0101  YpAngola_A0106  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
YpAngola_A0101  -2      17       -4.679040   glycerol uptake facilitator protein
YpAngola_A0102  -2      20       -5.834694   hypothetical protein
YpAngola_A0103  -2      21       -6.286023   IS1541 transposase
YpAngola_A0104  -2      20       -5.361195   hypothetical protein
YpAngola_A0105  -2      19       -5.205074   bacteriocin ABC transporter ATP-binding
YpAngola_A0106  -1      19       -3.949169   MFP family transporter
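Rows of the per-ORF table above can be split into typed fields for downstream filtering. The Python sketch below is illustrative and not part of PredictBias: the names `OrfRow` and `parse_orf_row` are hypothetical, and the pattern assumes a single-digit DNBias, a two-digit CDNBias, and a %GCBias printed with six decimals, which appears to hold for the rows on this page. It accepts rows with or without whitespace between the columns.

```python
import re
from typing import NamedTuple, Optional


class OrfRow(NamedTuple):
    locus_tag: str
    dn_bias: int
    cdn_bias: int
    gc_bias: float
    product: str


# Assumed layout: locus tag, DNBias (one digit, optional sign),
# CDNBias (two digits), %GCBias (six decimals), free-text product.
ROW_RE = re.compile(
    r"^(?P<tag>[A-Za-z]+_A\d{4})\s*"
    r"(?P<dn>-?\d)\s*"
    r"(?P<cdn>\d{2})\s*"
    r"(?P<gc>-?\d+\.\d{6})\s*"
    r"(?P<product>.*)$"
)


def parse_orf_row(line: str) -> Optional[OrfRow]:
    """Return the parsed row, or None for headers and other non-data lines."""
    match = ROW_RE.match(line.strip())
    if match is None:
        return None
    return OrfRow(
        locus_tag=match["tag"],
        dn_bias=int(match["dn"]),
        cdn_bias=int(match["cdn"]),
        gc_bias=float(match["gc"]),
        product=match["product"].strip(),
    )
```

If a future export prints wider bias values, the digit counts in `ROW_RE` would need to be relaxed and the columns separated explicitly.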
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0105  TYPE3IMQPROT  28     0.034    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.
Length = 86

Score = 27.8 bits (62), Expect = 0.034
Identities = 11/32 (34%), Positives = 17/32 (53%)

Query: 302 IGVFIMMFLYGGWLVWVVLGFTAMYMILRLAT 333
+GV + +FL GW V+L + + L LA
Sbjct: 54 LGVCLCLFLLSGWYGEVLLSYGRQVIFLALAK 85
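The statistics line of each alignment (Identities, Positives, and, when present, Gaps) can be read programmatically. The Python sketch below is an illustration only; `parse_alignment_stats` is a hypothetical helper name. It recomputes each percentage with integer division, because the percentages printed in this report appear to be truncated rather than rounded (for example 81/430 is shown as 18%).

```python
import re

# Matches e.g. "Identities = 11/32 (34%)" or "Gaps = 62/430 (14%)".
STATS_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_alignment_stats(line: str) -> dict:
    """Parse one BLAST statistics line into a dict keyed by statistic name."""
    stats = {}
    for name, count, length, pct in STATS_RE.findall(line):
        count, length = int(count), int(length)
        stats[name] = {
            "count": count,
            "length": length,
            "reported_pct": int(pct),
            # Truncating division, assumed to mirror the report's formatting.
            "recomputed_pct": 100 * count // length,
        }
    return stats
```

Comparing `reported_pct` with `recomputed_pct` gives a quick sanity check that a line was not damaged during extraction.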


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
YpAngola_A0106  RTXTOXIND  108    7e-28    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 108 bits (271), Expect = 7e-28
Identities = 81/430 (18%), Positives = 159/430 (36%), Gaps = 62/430 (14%)

Query: 33 FIAALCAIFLVLLITLIIYGTYTRRINVNGEVISQPHPINIFSPQQGFITKKWVEVGDIV 92
+A FLV+ L + G NG++ I + + + V+ G+ V
Sbjct: 59 LVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESV 118

Query: 93 RKGQHLYQIDV--SRTTFSGNVSLNSLEAINNQLSQIDSIINNTQKNKELTLLN------ 144
RKG L ++ + S + QI S K EL L +
Sbjct: 119 RKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQN 178

Query: 145 ------------LRQQLAQYQKAHKKSQELVDNAGKGMDDMRRTMASYGTYQRQGLITKD 192
+++Q + +Q + + +D + + Y + + K
Sbjct: 179 VSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRY---ENLSRVEKS 235

Query: 193 QLTNQRSLF----------YQQQNAFQSLNTQLIQESLQIAKLESEIS-------TRASD 235
+L + SL +Q+N + +L Q+ ++ESEI
Sbjct: 236 RLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQL 295

Query: 236 FDNDISQYLFQKGDLKRQLAE-----VDASGMLLINSPSDGKIENMSV-TQGQMVNVNDS 289
F N+I L Q D L + +I +P K++ + V T+G +V ++
Sbjct: 296 FKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET 355

Query: 290 LVQLTPSDNPYYCLVLWVPNNSVPYINTGDKVNIRYDAFPFEKFGQFPGRIISISNVPVS 349
L+ + P D+ L V N + +IN G I+ +AFP+ ++G G++ +I+ +
Sbjct: 356 LMVIVPEDDTLEVTAL-VQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIE 414

Query: 350 QQEIASYNIAPRLPNGGLIEPYYKVIVALDDIHFRYQSKPLMLSNGLKANVTLFLEKRPL 409
Q + + VI+++++ +K + LS+G+ + R +
Sbjct: 415 DQRLG---------------LVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSV 459

Query: 410 YQWMLSPFYD 419
++LSP +
Sbjct: 460 ISYLLSPLEE 469


S.No  Start           End             Bias  Virulence  Insertion elements  Prediction
2     YpAngola_A0124  YpAngola_A0145  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
YpAngola_A0124  4       20       -0.244210   glutamate racemase
YpAngola_A0125  5       25       1.223918    hypothetical protein
YpAngola_A0132  3       22       1.010828    **isrso12-transposase orfa protein
YpAngola_A0133  1       18       0.953709    transposase B
YpAngola_A0134  -1      18       1.122128    hypothetical protein
YpAngola_A0135  0       17       1.046968    transposase/IS protein
YpAngola_A0136  -1      16       1.023017    insertion sequence transposase
YpAngola_A0137  0       14       0.673008    hypothetical protein
YpAngola_A0138  1       14       0.836585    hypothetical protein
YpAngola_A0139  -1      13       0.932434    pyrroline-5-carboxylate reductase
YpAngola_A0140  4       24       -8.027959   YggT family membrane protein
YpAngola_A0141  4       24       -7.709557   hypothetical protein
YpAngola_A0142  3       25       -7.885507   putative deoxyribonucleotide triphosphate
YpAngola_A0143  4       24       -7.882433   coproporphyrinogen III oxidase
YpAngola_A0144  4       26       -8.639860   IS285 transposase
YpAngola_A0145  4       27       -8.638322   putative phage minor structural protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
YpAngola_A0136  HTHTETR  28     0.047    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.1 bits (62), Expect = 0.047
Identities = 12/106 (11%), Positives = 31/106 (29%), Gaps = 14/106 (13%)

Query: 16 QGMSSRAIARELGISRNTVKRYLQAKSEPPKYTPRPAVASLLDEYRDYIRQRIAD----- 70
S IA+ G++R + + + KS+ + + + I + +
Sbjct: 30 SSTSLGEIAKAAGVTRGAIYWHFKDKSD--------LFSEIWELSESNIGELELEYQAKF 81

Query: 71 -AHPYKIPATVIAREIRDQGYRGGMTILRAFIRSLSVPQEQEPAVR 115
P + ++ + +L I + V+
Sbjct: 82 PGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits        Score  E-value  Comments
YpAngola_A0145  CABNDNGRPT  53     1e-08    NodO calcium binding signature.
		>CABNDNGRPT#NodO calcium binding signature.

Length = 479

Score = 53.1 bits (127), Expect = 1e-08
Identities = 39/161 (24%), Positives = 63/161 (39%), Gaps = 22/161 (13%)

Query: 2138 DVAALFDLGGGDDVAKGYHKKKNIFTIGSGFKQYQGGENADTFILTSAVASKSHILSGGE 2197
D+AA+ L G + + G + + D + T + + +
Sbjct: 250 DIAAIQRLYGANMTTRT----------GDSVYGFNSNTDRDFYTATDSSKALIFSVWDAG 299

Query: 2198 GNDTVALGEVLGNEIDSIIDISNGYYSQVNGGVEKQV-ALLYDFENILGHENVNDTIIGN 2256
G DT + G + I+++ G +S V G A EN +G ND ++GN
Sbjct: 300 GTDTF---DFSGYSNNQRINLNEGSFSDVGGLKGNVSIAHGVTIENAIGGSG-NDILVGN 355

Query: 2257 DVDNYLNGMGGDDKIWGNGGNDLLALQSGLAQGGTGLDSYH 2297
DN L G G+D ++G G D L GG G D++
Sbjct: 356 SADNILQGGAGNDVLYGGAGADTLY-------GGAGRDTFV 389



Score = 45.0 bits (106), Expect = 4e-06
Identities = 31/137 (22%), Positives = 47/137 (34%), Gaps = 21/137 (15%)

Query: 2631 SSGNDEVVITSATFLPGNYIDTGDGNDAIIYIRGHEGT-MLKGGGGDDTYYYSAGSGAIN 2689
SGND +V SA N + G GND + G G L GG G DT+ Y +G +
Sbjct: 346 GSGNDILVGNSA----DNILQGGAGNDVLY---GGAGADTLYGGAGRDTFVYGSGQDSTV 398

Query: 2690 IADTSGLDHLY-----------LDKHILLHTLSAERRENNLVLNIADNTSGRIIFVDWYL 2738
A D + + + ++L S I + +
Sbjct: 399 AAYDWIADFQKGIDKIDLSAFRNEGQLSFVQDQFTGKGQEVMLQWDAANS--ITNLWLHE 456

Query: 2739 ADENKVEFIWVEDSQIT 2755
A + V+F+ Q
Sbjct: 457 AGHSSVDFLVRIVGQAA 473


S.No  Start           End             Bias  Virulence  Insertion elements  Prediction
3     YpAngola_A0160  YpAngola_A0183  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
YpAngola_A0160  0       23       -5.493085   hypothetical protein
YpAngola_A0163  -1      17       -5.518248   *phage integrase family site specific
YpAngola_A0164  -2      17       -0.729561   Rhs element Vgr protein
YpAngola_A0165  -2      21       2.258000    insertion sequence protein
YpAngola_A0166  -1      23       2.384731    IS1 family transposase orfA
YpAngola_A0167  -2      24       3.196309    hypothetical protein
YpAngola_A0168  -1      27       5.724864    M23 peptidase domain-containing protein
YpAngola_A0169  1       40       8.616613    hypothetical protein
YpAngola_A0170  1       39       8.395797    AAA ATPase
YpAngola_A0171  0       38       7.304373    hypothetical protein
YpAngola_A0173  0       35       7.461781    hypothetical protein
YpAngola_A0174  0       32       5.787618    hypothetical protein
YpAngola_A0175  1       30       1.981258    hypothetical protein
YpAngola_A0176  2       22       -2.944245   hypothetical protein
YpAngola_A0177  2       23       -5.910948   hypothetical protein
YpAngola_A0178  5       25       -7.318803   IS1, transposase orfA
YpAngola_A0179  1       19       -4.853891   putative lipoprotein
YpAngola_A0180  0       18       -2.909180   hypothetical protein
YpAngola_A0181  0       18       -3.080695   CAAX amino terminal protease family protein
YpAngola_A0183  1       19       -4.611847   N-acylhomoserine lactone synthase YtbI
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
YpAngola_A0165  MALTOSEBP  29     0.004    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 29.3 bits (65), Expect = 0.004
Identities = 16/48 (33%), Positives = 23/48 (47%), Gaps = 8/48 (16%)

Query: 31 KKTFRQLLGLLSGFNIVFWCTDNFSAY-------EMLPDEKHIRSKLY 71
++ F Q+ G +I+FW D F Y E+ PD K + KLY
Sbjct: 70 EEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPD-KAFQDKLY 116


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0170  DPTHRIATOXIN  30     0.035    Diphtheria toxin signature.
		>DPTHRIATOXIN#Diphtheria toxin signature.

Length = 567

Score = 30.5 bits (68), Expect = 0.035
Identities = 19/54 (35%), Positives = 24/54 (44%), Gaps = 9/54 (16%)

Query: 622 GIGKTETALALADSLFGGEKSLITINLSEYQEAHTVSQLKGSPPGYVGYGQGGV 675
GIG +A A AD + KS + N S Y G+ PGYV Q G+
Sbjct: 23 GIGAPPSAHAGADDVVDSSKSFVMENFSSYH---------GTKPGYVDSIQKGI 67


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0183  AUTOINDCRSYN  254    2e-89    Autoinducer synthesis protein signature.
		>AUTOINDCRSYN#Autoinducer synthesis protein signature.

Length = 216

Score = 254 bits (651), Expect = 2e-89
Identities = 90/175 (51%), Positives = 121/175 (69%)

Query: 1 MEFDEYDNSDTRYLLGIYQGQLICSVRFIELHLPNMITHTFNALFDDVALPKRGYIESSR 60
MEFD+YDN++T YL GI +ICS+RFIE PNMIT TF F ++ +P+ Y+ESSR
Sbjct: 42 MEFDQYDNNNTTYLFGIKDNTVICSLRFIETKYPNMITGTFFPYFKEINIPEGNYLESSR 101

Query: 61 FFVDKTRAKLLFGNHYPISYLFFLSIINYSRHNGYTGIYTIVSRAMLTILKRSGWQVEVI 120
FFVDK+RAK + GN YPIS + FLS+INYS+ GY GIYTIVS MLTILKRSGW + V+
Sbjct: 102 FFVDKSRAKDILGNEYPISSMLFLSMINYSKDKGYDGIYTIVSHPMLTILKRSGWGIRVV 161

Query: 121 KEAHITEKERIYLLHLPIDRDNQARLLLQVNQRLQDPCSVLSTWPISLPVMPESA 175
++ ++ER+YL+ LP+D +NQ L ++N+ + L WP+ +P A
Sbjct: 162 EQGLSEKEERVYLVFLPVDDENQEALARRINRSGTFMSNELKQWPLRVPAAIAQA 216


S.No  Start           End             Bias  Virulence  Insertion elements  Prediction
4     YpAngola_A0195  YpAngola_A0230  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
YpAngola_A0195  0       13       -3.351883   sugar (glycoside-Pentoside-hexuronide)
YpAngola_A0196  2       14       -3.194046   integrase core subunit
YpAngola_A0197  3       16       -4.011411   IS3 family transposase
YpAngola_A0199  1       16       -2.604446   hypothetical protein
YpAngola_A0200  1       16       -1.434758   hypothetical protein
YpAngola_A0201  1       18       -2.276044   flagellar motor protein MotA
YpAngola_A0202  2       22       -3.796094   flagellar biosynthesis sigma factor
YpAngola_A0203  1       16       -2.536551   putative flagellar protein
YpAngola_A0204  2       14       -1.185022   flagellar hook-length control protein
YpAngola_A0205  3       16       -1.568742   putative flagellar protein lafD
YpAngola_A0206  3       16       -1.559115   flagellar protein FliS
YpAngola_A0207  4       18       -1.083088   flagellar hook-associated protein 2
YpAngola_A0209  5       25       0.551743    IS285 transposase
YpAngola_A0208  4       23       -0.183545   lateral flagellin
YpAngola_A0210  3       19       -1.012096   lateral flagellin
YpAngola_A0211  2       16       -0.933528   lateral flagellin
YpAngola_A0212  0       15       -2.541391   hypothetical protein
YpAngola_A0213  -1      15       -2.770558   lateral flagellin
YpAngola_A0214  -1      18       -3.393985   transcriptional regulator
YpAngola_A0215  1       21       -1.374497   hypothetical protein
YpAngola_A0216  0       19       -0.307165   hypothetical protein
YpAngola_A0217  1       17       -0.159575   flagellar hook-associated protein FlgL
YpAngola_A0218  2       21       2.422026    flagellar hook-associated protein FlgK
YpAngola_A0219  2       24       4.046436    peptidoglycan hydrolase
YpAngola_A0220  3       21       3.988855    flagellar basal body P-ring protein
YpAngola_A0221  1       20       3.618941    flagellar basal body L-ring protein
YpAngola_A0222  1       18       2.894361    flagellar basal body rod protein FlgG
YpAngola_A0223  1       18       2.329201    flagellar basal body rod protein FlgF
YpAngola_A0224  1       18       1.183173    flagellar hook protein FlgE
YpAngola_A0225  3       20       0.853170    flagellar basal body rod modification protein
YpAngola_A0226  0       21       3.222786    flagellar basal body rod protein FlgC
YpAngola_A0227  0       21       3.456462    flagellar basal body rod protein FlgB
YpAngola_A0228  -1      22       3.579356    flagellar basal body P-ring biosynthesis protein
YpAngola_A0229  1       21       3.372010    hypothetical protein
YpAngola_A0230  2       22       3.659557    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits        Score  E-value  Comments
YpAngola_A0200  OMPADOMAIN  36     1e-04    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 36.1 bits (83), Expect = 1e-04
Identities = 25/116 (21%), Positives = 41/116 (35%), Gaps = 17/116 (14%)

Query: 171 FQRSSAVLTPFFSRLLGELAPAFNEM---DNKIIITGHTDASRYRDQLLYNNWNLSGERA 227
F + A L P L +L + + D +++ G+TD D N LS RA
Sbjct: 223 FNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTD-RIGSDAY---NQGLSERRA 278

Query: 228 LMAHKALVNGGLDEGRVLQI----------NAMADQMLLDPTDPLAAKNRRIEIMV 273
L++ G+ ++ N + A +RR+EI V
Sbjct: 279 QSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEV 334


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
YpAngola_A0204  FLGHOOKFLIK  30     0.017    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 29.8 bits (66), Expect = 0.017
Identities = 27/114 (23%), Positives = 47/114 (41%), Gaps = 11/114 (9%)

Query: 253 QHATIRLDPPDMGKIDISIHFEGGKLQVNINANQGEVYRALQ-----------QSSAELR 301
Q A +RL P D+G++ IS+ + + Q+ + + V AL+ +S +L
Sbjct: 257 QSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAALPVLRTQLAESGIQLG 316

Query: 302 QTLIGQNSTEVNVQVSANSQQQQQQPRHSNHHGQADILAAQHFESQAEINADDG 355
Q+ I S Q ++ QQ Q+ H G+ D Q + + G
Sbjct: 317 QSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVSLQGRVTGNSG 370


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
YpAngola_A0208  FLAGELLIN  106    2e-27    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 106 bits (266), Expect = 2e-27
Identities = 66/329 (20%), Positives = 120/329 (36%), Gaps = 8/329 (2%)

Query: 5 IHTNASAKTAINSLSNAGLANAKSSQRLSTGFRINSPADNAAGLQITNRMEKFLNSAGQA 64
I+TN+ + N+L+ + + + + +RLS+G RINS D+AAG I NR + QA
Sbjct: 4 INTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQA 63

Query: 65 KQNIQESIAMLQIADGGLAESVKTLNAMKKLATQAANDTNSAADREAIQKEFSELGKELQ 124
+N + I++ Q +G L E L +++L+ QA N TNS +D ++IQ E + +E+
Sbjct: 64 SRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEID 123

Query: 125 NALNNTEYNSEKLFADGGKMRKELNFQSGTDAESSLKLDLNSVIAELTESVTKQAPKITG 184
N T++N K+ + +M Q G + ++ +DL + +
Sbjct: 124 RVSNQTQFNGVKVLSQDNQM----KIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 185 KSSSATGSLEKQAYDLDKAVTDTKSLVAGAEGVQKTLEHDFAASGNKAVAEIKIPEYKDA 244
+ S K D + +K
Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAAN----GQ 235

Query: 245 LGKTVPEVVIALGAVITSANSNQMKDAVAALKTTHDAAVKAEATFQAKNSTGGGVMNMQL 304
L E A+ T+ ++ +A A ++ T
Sbjct: 236 LTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDG 295

Query: 305 ADKDLAMKADKKLSDVIDAYGAFRATLGA 333
K +K++ + A A + A
Sbjct: 296 NGKVSTTINGEKVTLTVADITAGAANVDA 324



Score = 44.6 bits (105), Expect = 5e-07
Identities = 46/302 (15%), Positives = 88/302 (29%), Gaps = 10/302 (3%)

Query: 64 AKQNIQESIAMLQIADGGLAESVKTLNAMKKLATQAANDTNSAADREAIQKEFSELGKEL 123
+++ S + D + K + A + D+ + +L +
Sbjct: 181 TVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDD 240

Query: 124 QNALNNTEYNSEKLFADGGKMRKELNFQSGTDAESSLKLDLNSVIAELTESVTKQAPKIT 183
+ G K + E T++ K++
Sbjct: 241 AENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVS 300

Query: 184 GKSSSATGSLEKQAYDLDKAVTDTKSLVAGAEGVQKTLEHDFAASGNKAVAEIKIPEYKD 243
+ +L A D +L + + F K+ + +
Sbjct: 301 TTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDL-E 359

Query: 244 ALGKTVPEVVIALGAVITSANSNQMKDAVAALKTTHDAAVKAEATFQAKNSTGGGVMNMQ 303
A E I + +AN+ K +A D +T +++
Sbjct: 360 ANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAA-------- 411

Query: 304 LADKDLAMKADKKLSDVIDAYGAFRATLGANQNRLQSSSNNLDNMISNTAQALGSIKDTD 363
A K + + A R++LGA QNR S+ NL N ++N A I+D D
Sbjct: 412 -AAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDAD 470

Query: 364 FA 365
+A
Sbjct: 471 YA 472


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
YpAngola_A0210  FLAGELLIN  98     2e-24    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 98.2 bits (244), Expect = 2e-24
Identities = 67/327 (20%), Positives = 119/327 (36%), Gaps = 6/327 (1%)

Query: 5 IHTNASAKTAINSLSNAGLSNAKSSQRLSTGFRINSPADNAAGLQITNRMEKFLNSAGQA 64
I+TN+ + N+L+ + S + + +RLS+G RINS D+AAG I NR + QA
Sbjct: 4 INTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQA 63

Query: 65 KQNIQESIAMLQIADGGLAESVKTLNAMKKLATQAANDTNSAADREAIQKEFSELGKELQ 124
+N + I++ Q +G L E L +++L+ QA N TNS +D ++IQ E + +E+
Sbjct: 64 SRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEID 123

Query: 125 NALNNTEYNSEKLFADGGKMRKELNFQSGTDAESSLKLDLNSVIAELTESVTKKATPVKA 184
N T++N K+ + +M Q G + ++ +DL + + K
Sbjct: 124 RVSNQTQFNGVKVLSQDNQM----KIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 185 DVAGSTLEKEADVLDKATKAAKKAKEAAEGVQKTLETDFAVAGNKASAKITIPEYKDALG 244
G K + + K+ + L
Sbjct: 180 ATVGDL--KSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLT 237

Query: 245 KTVPEVVINSGTAITPANSTQMKDAVAALKATHDAAVKAEATFQAKNSTGGGVMNMQLAD 304
E T ++ +A A A ++ T
Sbjct: 238 TDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNG 297

Query: 305 KDLAMKADKKLSDVIDAYGAFRATLGA 331
K +K++ + A A + A
Sbjct: 298 KVSTTINGEKVTLTVADITAGAANVDA 324



Score = 61.2 bits (148), Expect = 3e-12
Identities = 57/336 (16%), Positives = 111/336 (33%), Gaps = 10/336 (2%)

Query: 64 AKQNIQESIAMLQIADGGLAESVKTLNAMKKLATQAANDTNSAADREAIQKEFSELGKEL 123
+++ S + D + K + A + D+ + +L +
Sbjct: 181 TVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDD 240

Query: 124 QNALNNTEYNSEKLFADGGKMRKELNFQSGTDAESSLKLDLNSVIAELTESVTKKATPVK 183
+ G K + E T++ V
Sbjct: 241 AENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVS 300

Query: 184 ADVAGSTLE-KEADVLDKATKAAKKAKEAAEGVQKTLETDFAVAGNKASAKITIPEYKDA 242
+ G + AD+ A ++++ V ++ +K + +A
Sbjct: 301 TTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEA 360

Query: 243 LGKTVPEVVINSGTAITPANSTQMKDAVAALKATHDAAVKAEATFQAKNSTGGGVMNMQL 302
E I A AN+ K +A D +T +++
Sbjct: 361 NNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAA--------- 411

Query: 303 ADKDLAMKADKKLSDVIDAYGAFRATLGANQNRLQSSSNNLDNMISNTAQALGSIKDTDF 362
A K + + A R++LGA QNR S+ NL N ++N A I+D D+
Sbjct: 412 AAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADY 471

Query: 363 ADEMKNHAQSEMLMQSSVMMLKKANAATQLISTLLQ 398
A E+ N +++++L Q+ +L +AN Q + +LL+
Sbjct: 472 ATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
YpAngola_A0211  FLAGELLIN  105    7e-27    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 105 bits (263), Expect = 7e-27
Identities = 56/204 (27%), Positives = 96/204 (47%), Gaps = 4/204 (1%)

Query: 5 IHTNGSAKTAINSLSKAGLANAKSSQRLSTGFRINSPADNAAGLQITNRMEKFLNSAGQA 64
I+TN + N+L+K+ + + + +RLS+G RINS D+AAG I NR + QA
Sbjct: 4 INTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQA 63

Query: 65 KQNIQESIAMLQIADGGLAESVKTLNTMKKLATQAANDTNSAADREAIQKEFTELGQELQ 124
+N + I++ Q +G L E L +++L+ QA N TNS +D ++IQ E + +E+
Sbjct: 64 SRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEID 123

Query: 125 NALNNTEYNAEKLFADGGKMRKELNFQSGTDANSSLKLDLNKVIEELTESVTEERKKVTG 184
N T++N K+ + +M Q G + ++ +DL K+ +
Sbjct: 124 RVSNQTQFNGVKVLSQDNQM----KIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 185 TSASATGSLEKQAFDLNEATTKAN 208
+ S K + AN
Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGAN 203



Score = 60.8 bits (147), Expect = 3e-12
Identities = 51/334 (15%), Positives = 102/334 (30%), Gaps = 7/334 (2%)

Query: 64 AKQNIQESIAMLQIADGGLAESVKTLNTMKKLATQAANDTNSAADREAIQKEFTELGQEL 123
+++ S + D + K + A + D+ + +L +
Sbjct: 181 TVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDD 240

Query: 124 QNALNNTEYNAEKLFADGGKMRKELNFQSGTDANSSLKLDLNKVIEELTESVTEERKKVT 183
+ G K + T++ + KV+
Sbjct: 241 AENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVS 300

Query: 184 GTSASATGSLEKQAFDLNEATTKANTALKEAEILQEKITTNLTKTFPASVDIPGYINAKG 243
T +L A A L+ ++ + + + N
Sbjct: 301 TTINGEKVTL-TVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTK------NESA 353

Query: 244 VPVAHEIIPSGTPINTGHIGKIQTAVAALRATHDTAAKTEDEFQAEHSTGGGVMNLLLRN 303
E + + + + A A KT + +
Sbjct: 354 KLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAA 413

Query: 304 KDRAMEADKKLSDVIDAYGAFRATLGANQNRLQSSSNNLDNMISNTTQALGSIKDTDFAD 363
K + + A R++LGA QNR S+ NL N ++N A I+D D+A
Sbjct: 414 KKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYAT 473

Query: 364 EMKNHAQSEMLMQSSVMMLKKANAATQLISTLLQ 397
E+ N +++++L Q+ +L +AN Q + +LL+
Sbjct: 474 EVSNMSKAQILQQAGTSVLAQANQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
YpAngola_A0213  FLAGELLIN  100    3e-25    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 100 bits (250), Expect = 3e-25
Identities = 64/328 (19%), Positives = 120/328 (36%), Gaps = 10/328 (3%)

Query: 5 IHTNASAKTAINSLSNEGLANAKSSQRLSTGFRINSPADNAAGLQITNRMEKFLNSAGQA 64
I+TN+ + N+L+ + + + +RLS+G RINS D+AAG I NR + QA
Sbjct: 4 INTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQA 63

Query: 65 KQNIQESIAMLQIADGGLAESVKTLNAMKKLATQAANDTNSAADREAIQKEFSELGKELQ 124
+N + I++ Q +G L E L +++L+ QA N TNS +D ++IQ E + +E+
Sbjct: 64 SRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEID 123

Query: 125 NALNNTEYNSEKLFADGGKMRKELNFQSGTDAESSLKLDLNSV------IAELTESVTKP 178
N T++N K+ + +M Q G + ++ +DL + + + K
Sbjct: 124 RVSNQTQFNGVKVLSQDNQM----KIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 179 GLKANSGGTAEEKELARLEGLAKDAKSTAATTKSAETTLLVDDATGKGGKGGNASIDIII 238
+ + + + + + + T K
Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTD 239

Query: 239 PAHKDTTGKDVAEKKIASGTAITPANITSMADAKAYWDKQEIETPKAVNEYVVKHSADSG 298
A +T K +GTA A ++ K ++
Sbjct: 240 DAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKV 299

Query: 299 VMNMQLADKDLAMKADKKLSDVIDAYGA 326
+ L + + +DA
Sbjct: 300 STTINGEKVTLTVADITAGAANVDAATL 327



Score = 63.9 bits (155), Expect = 4e-13
Identities = 56/338 (16%), Positives = 105/338 (31%), Gaps = 12/338 (3%)

Query: 64 AKQNIQESIAMLQIADGGLAESVKTLNAMKKLATQAANDTNSAADREAIQKEFSELGKEL 123
+++ S + D + K + A + D+ + +L +
Sbjct: 181 TVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDD 240

Query: 124 QNALNNTEYNSEKLFADGGKMRKELNFQSGTDAESSLKLDLNSVIAELTESVTKPGLKAN 183
+ G K + E T++ K +
Sbjct: 241 AENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVS 300

Query: 184 SGGTAEEKELARLEGLAKDAKSTAATTKSAETTLLVDDATGKGGKGGNASIDIIIPAHKD 243
+ E+ L + A A AAT +S++ +
Sbjct: 301 TTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESA------- 353

Query: 244 TTGKDVAEKKIASGTAITPANITSMADAKAYWDKQEIETPKAVNEYVVKHSADSGVMNMQ 303
A + + IT A+A +T + +
Sbjct: 354 KLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAA 413

Query: 304 LADKDLAM-KADKKLSDVIDAYGAFRATLGANQNRLQSSSNNLDNMISNTAQALGSIKDT 362
+ D LS V R++LGA QNR S+ NL N ++N A I+D
Sbjct: 414 KKSTANPLASIDSALSKVDAV----RSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDA 469

Query: 363 DFADEMKNHAQSEMLMQSSVMMLKKANAATQLISTLLQ 400
D+A E+ N +++++L Q+ +L +AN Q + +LL+
Sbjct: 470 DYATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits        Score  E-value  Comments
YpAngola_A0218  FLGHOOKAP1  158    4e-45    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 158 bits (402), Expect = 4e-45
Identities = 93/324 (28%), Positives = 157/324 (48%), Gaps = 8/324 (2%)

Query: 4 IRTAFSGMQATQAHLNATSMNIANMHTPGYSRQRAEQSAIGADGQGGVNAGNGVNVDGIR 63
I A SG+ A QA LN S NI++ + GY+RQ + + G GNGV V G++
Sbjct: 4 INNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGVQ 63

Query: 64 RLSQQYVVMQEWRANSQQQYYDAGEQYLNAVELMVSNESTSLATGLNNFFSSLSAATQLP 123
R ++ Q A +Q A + ++ ++ M+S ++SLAT + +FF+SL
Sbjct: 64 REYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLVSNA 123

Query: 124 DSPPMRQQIIESANAMALRFNNVNNFIVQQKKSIGQQRDITVKEINSLTRSIADYNQQIL 183
+ P RQ +I + + +F + ++ Q K + +V +IN+ + IA N QI
Sbjct: 124 EDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLNDQIS 183

Query: 184 K--NRSDGNNINDLLDKQELQIKKLSGLIETQVNQAEDGTYRISVKQGQPLVNGAVAAEL 241
+ G + N+LLD+++ + +L+ ++ +V+ + GTY I++ G LV G+ A +L
Sbjct: 184 RLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTARQL 243

Query: 242 AVDTSSVDTKITLHFSGATQGMNMSC------GGQLGGINDYELTTLKKLQDSTQEMAKT 295
A SS D T N+ G LGGI + L + +++ ++A
Sbjct: 244 AAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQLALA 303

Query: 296 VADKFNDQLGKGTDFTGAPGQDLF 319
A+ FN Q G D G G+D F
Sbjct: 304 FAEAFNTQHKAGFDANGDAGEDFF 327



Score = 61.9 bits (150), Expect = 2e-12
Identities = 47/182 (25%), Positives = 79/182 (43%), Gaps = 8/182 (4%)

Query: 275 NDYELTTLKKLQDSTQEMAKTVADKFNDQLGKGTDFTGAPG-QDLFVFNPSDPNGMLQLS 333
N +++T L +T A+ G FTG P D F P + ++ +
Sbjct: 368 NQWQVTRLA---SNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVS-DAIVNMD 423

Query: 334 AITAEQLALAAHGK-PAG--DNSNLFELLDIRKTPVTGMKNVPLDDAATALVGYIAITSN 390
+ ++ +A + AG DN N LLD++ T +DA +LV I +
Sbjct: 424 VLITDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGNKTA 483

Query: 391 RNHSELENAENTLNQATRYHESFSGVNNDEEAMNLMEYQRAYQSNMKVIATGDKLFSDLL 450
+ N + Q + +S SGVN DEE NL +Q+ Y +N +V+ T + +F L+
Sbjct: 484 TLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALI 543

Query: 451 AL 452
+
Sbjct: 544 NI 545


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
YpAngola_A0219  FLGFLGJ  45     4e-09    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 45.5 bits (107), Expect = 4e-09
Identities = 19/79 (24%), Positives = 41/79 (51%), Gaps = 4/79 (5%)

Query: 18 GDLQPQDLEQAAVQFEAVFMRTLLQQMRKAAEVLAADDDPFNSKQQRMMRDFYDDKLAST 77
G+ ++ A Q E +F++ +L+ MR A D F+S+ R+ YD ++A
Sbjct: 26 GEDPAANIRPVARQVEGMFVQMMLKSMRDAL----PKDGLFSSEHTRLYTSMYDQQIAQQ 81

Query: 78 LASQRSSGIANLLIQQLGS 96
+ + + G+A ++++Q+
Sbjct: 82 MTAGKGLGLAEMMVKQMTP 100


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0220  FLGPRINGFLGI  330    e-113    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 330 bits (848), Expect = e-113
Identities = 146/359 (40%), Positives = 210/359 (58%), Gaps = 12/359 (3%)

Query: 39 LVLPTASAQP--LGSLVDIQGVRGNQLVGYSLVVGLDGSGDK-NQVKFTGQSMANMLRQF 95
L P A A + + +Q R NQL+GY LVVGL G+GD FT QSM ML+
Sbjct: 19 LSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMRAMLQNL 78

Query: 96 GVQLPEKMDPKVKNVAAVAISATLPPGYGRGQSIDITVSSIGDAKSLRGGTLLLTQLRGA 155
G+ KN+AAV ++A LPP G +D+TVSS+GDA SLRGG L++T L GA
Sbjct: 79 GITTQGG-QSNAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLIMTSLSGA 137

Query: 156 DGEVYALAQGNVVVGGIKAEGDSGSSVTVNTPTVGRIPNGASIERQIPSDFQTNNQVVLN 215
DG++YA+AQG ++V G A+GD +++T T R+PNGA IER++PS F+ + +VL
Sbjct: 138 DGQIYAVAQGALIVNGFSAQGD-AATLTQGVTTSARVPNGAIIERELPSKFKDSVNLVLQ 196

Query: 216 LKRPSFKSANNVALALNR----AFGANTATAQSATNVMVNAPQDAGARVAFMSLLEDVQI 271
L+ P F +A VA +N +G A + + + V P+ A M+ +E++ +
Sbjct: 197 LRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPRVA-DLTRLMAEIENLTV 255

Query: 272 NAGEQSPRVVFNARTGTVVIGEGVMVRAAAVSHGNLTVNIREQKNVSQPNPLGGGKTVTT 331
+ +VV N RTGT+VIG V + AVS+G LTV + E V QP P G+T
Sbjct: 256 ET-DTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQPAPFSRGQTAVQ 314

Query: 332 PESDIEVTKGKNQMVMVPAGTRLRSIVNTINSLGASPDDIMAILQALYEAGALDAELVV 390
P++DI + +++ +V G LR++V +NS+G D I+AILQ + AGAL AELV+
Sbjct: 315 PQTDIMAMQEGSKVAIVE-GPDLRTLVAGLNSIGLKADGIIAILQGIKSAGALQAELVL 372
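Very small E-values are printed in this report in an abbreviated form such as `e-113` (that is, 1e-113), which `float()` in Python rejects. The sketch below shows one way to normalise such values and compare them with the RPSBlast E-value setting of 0.05 from the input parameters. The helper names `parse_evalue` and `passes_cutoff` are hypothetical, and treating the cutoff as inclusive is an assumption rather than documented PredictBias behaviour.

```python
def parse_evalue(text: str) -> float:
    """Convert a BLAST-style E-value string to a float.

    Handles the abbreviated form "e-113", which stands for 1e-113.
    """
    text = text.strip()
    if text.lower().startswith("e"):
        text = "1" + text
    return float(text)


def passes_cutoff(evalue_text: str, cutoff: float = 0.05) -> bool:
    """Return True when the hit's E-value is at or below the cutoff.

    The inclusive comparison is an assumption made for this sketch.
    """
    return parse_evalue(evalue_text) <= cutoff
```

For instance, `passes_cutoff("e-113")` and `passes_cutoff("0.047")` both hold under the default cutoff, while a hit reported at `0.06` would be rejected.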


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0221  FLGLRINGFLGH  153    8e-49    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 153 bits (389), Expect = 8e-49
Identities = 74/221 (33%), Positives = 109/221 (49%), Gaps = 13/221 (5%)

Query: 4 FLILTPMVLALCGCESPALLVQKDDAEFAPPANLIQPATVTEGGGLFQPANS-----WSL 58
+ I + +VL+L GC A A P P G +FQ A L
Sbjct: 9 YAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVA---NGSIFQSAQPINYGYQPL 65

Query: 59 LQDRRAYRIGDILTVILDESTQSSKQAKTNFGKKNDMSLGVPEVLGKKLNKFGGSI---- 114
+DRR IGD LT++L E+ +SK + N + + G V FG +
Sbjct: 66 FEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNARADVE 125

Query: 115 -SGKRDFDGSATSAQQNMLRGSITVAVHQVLPNGVLVIRGEKWLTLNQGDEYMRVTGLVR 173
SG F+G + N G++TV V QVL NG L + GEK + +NQG E++R +G+V
Sbjct: 126 ASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRFSGVVN 185

Query: 174 ADDVARDNSVSSQRIANARISYAGRGALSDANSAGWLTRFF 214
++ N+V S ++A+ARI Y G G +++A + GWL RFF
Sbjct: 186 PRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFF 226


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits        Score  E-value  Comments
YpAngola_A0222  FLGHOOKAP1  42     2e-06    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 41.9 bits (98), Expect = 2e-06
Identities = 11/42 (26%), Positives = 20/42 (47%)

Query: 213 QLEQGALEGSNVQVVEEMVDMITVQRAYEMNAKMVSAADDML 254
QL S V + EE ++ Q+ Y NA+++ A+ +
Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIF 539



Score = 40.7 bits (95), Expect = 3e-06
Identities = 20/78 (25%), Positives = 35/78 (44%), Gaps = 14/78 (17%)

Query: 2 NSALWVSKTGLAAQDAKMGAISNNLANVNTDGFKRDRVVFADLFYQNQRTPGAPLDQNNT 61
+S + + +GL A A + SNN+++ N G+ R + A N+T
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMA--------------QANST 46

Query: 62 TPSGIQFGSGVQIVGTQK 79
+G G+GV + G Q+
Sbjct: 47 LGAGGWVGNGVYVSGVQR 64


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits        Score  E-value  Comments
YpAngola_A0224  FLGHOOKAP1  40     1e-05    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 39.9 bits (93), Expect = 1e-05
Identities = 20/60 (33%), Positives = 27/60 (45%), Gaps = 5/60 (8%)

Query: 2 SFSIANTALNAHTEQLNTISNNIANSATKGFKASR----TEFSSMYAQSQ-PLGVAVSGV 56
+ A + LNA LNT SNNI++ G+ S++ A GV VSGV
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGV 62



Score = 34.2 bits (78), Expect = 8e-04
Identities = 10/42 (23%), Positives = 22/42 (52%)

Query: 371 LENSNVDITAELVGLMTAQRNYQASTKIISTNDSMMNALFQV 412
S V++ E L Q+ Y A+ +++ T +++ +AL +
Sbjct: 504 QSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits        Score  E-value  Comments
YpAngola_A0226  FLGHOOKAP1  30     0.004    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 29.9 bits (67), Expect = 0.004
Identities = 6/37 (16%), Positives = 19/37 (51%)

Query: 102 VNVVSEMADMMSASRSFETNVEVLNSVKSMQQSVLKL 138
VN+ E ++ + + N +VL + ++ +++ +
Sbjct: 509 VNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


S.No  Start           End             Bias  Virulence  Insertion elements  Prediction
5     YpAngola_A0240  YpAngola_A0265  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
YpAngola_A0240  0       14       -3.959486   flagellar biosynthetic protein FliQ
YpAngola_A0241  0       14       -4.371909   flagellar biosynthetic protein FliR
YpAngola_A0242  0       14       -4.444608   flagellar biosynthesis protein FlhB
YpAngola_A0243  2       16       -3.468786   flagellar biosynthesis protein FlhA
YpAngola_A0244  2       16       -2.421037   putative lipoprotein
YpAngola_A0247  0       13       -0.796414   hypothetical protein
YpAngola_A0248  1       15       -0.449105   hypothetical protein
YpAngola_A0249  1       14       -0.326383   iron-enterobactin transporter periplasmic
YpAngola_A0250  1       16       -1.845344   fimbrial family protein
YpAngola_A0251  1       16       -3.212268   fimbrial chaperone protein
YpAngola_A0252  3       18       -6.076413   fimbrial usher protein
YpAngola_A0253  6       27       -10.457557  IS1541 transposase
YpAngola_A0254  7       30       -12.223158  type IV prepilin peptidase family protein
YpAngola_A0255  5       31       -13.589836  hypothetical protein
YpAngola_A0258  4       31       -13.573382  hypothetical protein
YpAngola_A0259  4       31       -13.061083  hypothetical protein
YpAngola_A0260  3       29       -11.322719  type II/IV secretion system protein
YpAngola_A0261  5       33       -13.257669  type II secretion system protein F domain
YpAngola_A0262  4       31       -11.618013  hypothetical protein
YpAngola_A0263  2       24       -7.244552   hypothetical protein
YpAngola_A0264  1       20       -4.772180   hypothetical protein
YpAngola_A0265  0       18       -3.642105   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0240  TYPE3IMQPROT  46     3e-10    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.
Length = 86

Score = 45.5 bits (108), Expect = 3e-10
Identities = 25/74 (33%), Positives = 37/74 (50%)

Query: 14 GLHLVLMISIVAIVPSLLIGLLVSIFQATTQINEQTLSFLPRLVMTMLVLIFAGKWMMIK 73
L+LVL++S + + +IGLLV +FQ TQ+ EQTL F +L+ L L W
Sbjct: 11 ALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLLSGWYGEV 70

Query: 74 LSDFTVSIFQQAAQ 87
L + + A
Sbjct: 71 LLSYGRQVIFLALA 84


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0241  TYPE3IMRPROT  105    3e-29    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.
Length = 261

Score = 105 bits (263), Expect = 3e-29
Identities = 72/237 (30%), Positives = 128/237 (54%), Gaps = 3/237 (1%)

Query: 19 LPFVRILSFLHFCPVIRHKAFTRKAKIGTALLLAILITPMISQPVVSGELLSIENLLLAG 78
P +R+L+ + P++ ++ ++ K+G A+++ I P + V + S L LA
Sbjct: 18 WPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPANDV--PVFSFFALWLAV 75

Query: 79 EQILWGWLFGSMLHLVLAALEAAGQILSMNMGLGMAMMNDPTSGASTAVISQIIFTFSVL 138
+QIL G G + AA+ AG+I+ + MGL A DP S + V+++I+ ++L
Sbjct: 76 QQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLARIMDMLALL 135

Query: 139 IFFTLNGHLLFVTILLKSFSSWPIG-EAINDFSLRSLALSLGWILSSATLLALPTTFIML 197
+F T NGHL +++L+ +F + PIG E +N + +L + I + +LALP ++L
Sbjct: 136 LFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIFLNGLMLALPLITLLL 195

Query: 198 IVQGSFGLLNRISPTLNLFSLGFPIGMLFGLLCLLLLAINIPDHYLHLTNEILTQFE 254
+ + GLLNR++P L++F +GFP+ + G+ + L I HL +EI
Sbjct: 196 TLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFCEHLFSEIFNLLA 252


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0242  TYPE3IMSPROT  298    e-101    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 298 bits (764), Expect = e-101
Identities = 97/344 (28%), Positives = 173/344 (50%)

Query: 5 SGEKSEKPTAGKLSKARKKGDIPRSKDVTMAAGLVTSFILLSLFLPYYKALVSQSFVSVA 64
SGEK+E+PT K+ ARKKG + +SK+V A +V +L YY S+ + A
Sbjct: 2 SGEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPA 61

Query: 65 QLASQLDDQGALEQFLLANLFIFAKFLATLIPIPLFSMLATLIPGGWNFTPVKLIPDLKK 124
+ + Q L F L L ++ + ++ G+ + + PD+KK
Sbjct: 62 EQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKK 121

Query: 125 LSPLAGIKRIFSASNGTEVLKMLAKCSIVLYTLYLVVHSSLDDLLHLQTLPLEEAITQGF 184
++P+ G KRIFS + E LK + K ++ +++++ +L LL L T +E
Sbjct: 122 INPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLG 181

Query: 185 AQYHHILLYFIAIVVVFAAIDIPLSHHLFTKKMKMTKQEVKQEHKNNDGNPEIKSRVRQL 244
+++ VV + D ++ + K++KM+K E+K+E+K +G+PEIKS+ RQ
Sbjct: 182 QILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQF 241

Query: 245 QRQYAIGQINKTVPSADVIITNPTHFSVALKYAPEKASAPYIVAKGKDDIALYIRSIAQK 304
++ + + V + V++ NPTH ++ + Y + P + K D +R IA++
Sbjct: 242 HQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEE 301

Query: 305 HKIEIVEFPPLARAIYHTTKVNQQIPAQLYRAIAQVLTYVMQIK 348
+ I++ PLARA+Y V+ IPA+ A A+VL ++ +
Sbjct: 302 EGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQN 345


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0249  FERRIBNDNGPP  50     7e-09    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 49.6 bits (118), Expect = 7e-09
Identities = 21/97 (21%), Positives = 40/97 (41%), Gaps = 12/97 (12%)

Query: 129 QTEPNIKAVAKMRPDLIIISATGDDSTLELYDQLSAIAPTLVINYDDKS-----WQELTL 183
+TEPN++ + +M+P ++ SA S + L+ IAP N+ D ++
Sbjct: 84 RTEPNLELLTEMKPSFMVWSAGYGPS----PEMLARIAPGRGFNFSDGKQPLAMARKSLT 139

Query: 184 QLGQATGHEGDAEQVI---DKFARRLNEVKQKITLPP 217
++ + AE + + F R + K P
Sbjct: 140 EMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARP 176


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
YpAngola_A0252  PF00577  671    0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 671 bits (1732), Expect = 0.0
Identities = 229/875 (26%), Positives = 374/875 (42%), Gaps = 67/875 (7%)

Query: 2 RIIKKIPIAMTTSLIMLSGAVSA--------IDFNTDAMDANDKQNIDLSHFTNVGYIMP 53
I+K +A + ++ A +A + FN + + + DLS F N + P
Sbjct: 16 LHIRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPP 75

Query: 54 GEYRLEINVNNHRIPEQVIAFYARDDEPNSSEVCLPEAVVEQFGLKPDVLQKITFWHEGQ 113
G YR++I +NN + + + F D E CL A + GL + + +
Sbjct: 76 GTYRVDIYLNNGYMATRDVTFNTGDSE-QGIVPCLTRAQLASMGLNTASVSGMNLLADDA 134

Query: 114 CADLREL-AGLTTEVDLATSTLAINVPQDWMEYSDSNWVPSSQWDEGIPGSLLDYNVNSL 172
C L + T ++D+ L + +PQ +M ++P WD GI LL+YN +
Sbjct: 135 CVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGN 194

Query: 173 FSKPKESGSTRNISLNGTSGLNAGPWRLRGDYQGNYSHNSGEQNSSTSTFDWSRIYMYRA 232
+ + G++ LN SGLN G WRLR + +Y+ + S + ++ R
Sbjct: 195 SVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNK-WQHINTWLERD 253

Query: 233 IKSLAATLSVGENYFASSLFDTFRYAGASLSSDERMLPPNLRGYAPEVSGIARTNAKVTV 292
I L + L++G+ Y +FD + GA L+SD+ MLP + RG+AP + GIAR A+VT+
Sbjct: 254 IIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTI 313

Query: 293 SQQGRILYQTTVASGPFRIQELSD-SVSGRLDVSVEEQDGTVQTFQVETAAVPYLTRPGA 351
Q G +Y +TV GPF I ++ SG L V+++E DG+ Q F V ++VP L R G
Sbjct: 314 KQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGH 373

Query: 352 IRYKTSVGQPSTLNHGTEGPVFASGEFSWGVSNRWSLFGGAIGSGDYNAVSVGVGRDLYA 411
RY + G+ + N E P F G+ W+++GG + Y A + G+G+++ A
Sbjct: 374 TRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGA 433

Query: 412 FGAISTDITQTRASGLPNQETQSGKSLRVRYAKRFDELNSDISLAGNRFFEREFMSMNQY 471
GA+S D+TQ ++ LP+ G+S+R Y K +E ++I L G R+ + +
Sbjct: 434 LGALSVDMTQANST-LPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADT 492

Query: 472 LGTRYFDNDL--------------------GRNKEMYTVTASKNFPDIQTNINFSYSYQN 511
+R ++ + +T ++ T + S S+Q
Sbjct: 493 TYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTST-LYLSGSHQT 551

Query: 512 YWDQP-TSNSYSATVSHAFDAFSLKDMTVNLSASRSKNNGV--NDDVLYLSFSVPLGNQ- 567
YW + A ++ AF +D+ LS S +KN D +L L+ ++P +
Sbjct: 552 YWGTSNVDEQFQAGLNTAF-----EDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWL 606

Query: 568 ----------QTLSYSGQH-NGQGNNQTVNYSNSSAIDS--SYRLSAGVNNSNDNGARGQ 614
+ SYS H + D+ SY + G D +
Sbjct: 607 RSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGST 666

Query: 615 FSGFYIHRSSIAETSLNVAYAQDDFTSTGVSMRGGATVTAKGAALHGPGMSGGTRLMVNT 674
+R ++ DD + GG A G L P T ++V
Sbjct: 667 GYATLNYRGGYGNANIG-YSHSDDIKQLYYGVSGGVLAHANGVTLGQP--LNDTVVLVKA 723

Query: 675 DDIAGVPLEERNI-RSNRFGIAVLNNINSYYRTDTRIDINQLADDVEVKQSAVEFALTEG 733
+E + R++ G AVL Y +D N LAD+V++ + T G
Sbjct: 724 PGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRG 783

Query: 734 AIGYRRFAMMKGEKVLATISLTDSSHPPFGSLVISAKGQELGIVSDDGFTYLSGVEPGET 793
AI F G K+L T++ ++ PFG++V S Q GIV+D+G YLSG+
Sbjct: 784 AIVRAEFKARVGIKLLMTLT-HNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGK 842

Query: 794 LDVVW--SGAKQCQV--AIPAVIQPQA--QILLPC 822
+ V W C +P Q Q Q+ C
Sbjct: 843 VQVKWGEEENAHCVANYQLPPESQQQLLTQLSAEC 877


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0254  PREPILNPTASE  36     8e-06    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 35.9 bits (83), Expect = 8e-06
Identities = 13/78 (16%), Positives = 33/78 (42%), Gaps = 1/78 (1%)

Query: 15 FIGYIIFHFNVMGGGDVKLITVLLLALTAEQSLNFIIYTAVMGGVVMVVGLLINRVDIQK 74
+ ++ MG GD KL+ L L + ++ ++++G + + +L+ K
Sbjct: 200 WAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGIGLILLRNHHQSK 259

Query: 75 RGVPYAVAITAGFLSSVL 92
+P+ + ++L
Sbjct: 260 -PIPFGPYLAIAGWIALL 276


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
YpAngola_A0264  MICOLLPTASE  28     0.022    Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 27.8 bits (61), Expect = 0.022
Identities = 14/54 (25%), Positives = 21/54 (38%), Gaps = 2/54 (3%)

Query: 62 DAENVLSYQQLFEHNFNRQVTVLGSLINTAPSAELTVNFSHSVADLINGNSEEN 115
D Y +F H N T +N P A + + S V + IN + E+
Sbjct: 747 DGNGNYVYDVVF-HGMN-TDTNTDVHVNKEPKAVIKSDSSVIVEEEINFDGTES 798


6  YpAngola_A0311  YpAngola_A0317  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias     %GCBias  Product
YpAngola_A0311       5       28    2.555127  hypothetical protein
YpAngola_A0310       4       27    1.965197  filamentation induced by cAMP protein fic
YpAngola_A0312       4       27    1.421591  hypothetical protein
YpAngola_A0313       4       27    0.315427  hypothetical protein
YpAngola_A0314       2       21    0.116228  insertion sequence transposase
YpAngola_A0315       2       23   -1.213773  transposase/IS protein
YpAngola_A0316       2       18   -2.187224  phage family integrase
YpAngola_A0317       2       17   -2.938049  transposase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
YpAngola_A0314  HTHTETR  28     0.047    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.1 bits (62), Expect = 0.047
Identities = 12/106 (11%), Positives = 31/106 (29%), Gaps = 14/106 (13%)

Query: 16 QGMSSRAIARELGISRNTVKRYLQAKSEPPKYTPRPAVASLLDEYRDYIRQRIAD----- 70
S IA+ G++R + + + KS+ + + + I + +
Sbjct: 30 SSTSLGEIAKAAGVTRGAIYWHFKDKSD--------LFSEIWELSESNIGELELEYQAKF 81

Query: 71 -AHPYKIPATVIAREIRDQGYRGGMTILRAFIRSLSVPQEQEPAVR 115
P + ++ + +L I + V+
Sbjct: 82 PGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0316  FLGPRINGFLGI  28     0.013    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 28.4 bits (63), Expect = 0.013
Identities = 11/24 (45%), Positives = 18/24 (75%)

Query: 134 LKARTLIQVLEPIKARGALETDLL 157
LKA +I +L+ IK+ GAL+ +L+
Sbjct: 348 LKADGIIAILQGIKSAGALQAELV 371


7  YpAngola_A0415  YpAngola_A0429  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias     %GCBias  Product
YpAngola_A0415       2       19    0.089361  outer membrane protein assembly complex subunit
YpAngola_A0416       0       17   -0.181912  hypothetical protein
YpAngola_A0417       0       16   -0.083047  histidyl-tRNA synthetase
YpAngola_A0418      -2       19   -3.873279  4-hydroxy-3-methylbut-2-en-1-yl diphosphate
YpAngola_A0419       0       15   -2.224622  cytoskeletal protein RodZ
YpAngola_A0420       0       15   -2.750696  type IV pilus biogenesis/stability protein PilW
YpAngola_A0421      -1       14   -2.508306  ribosomal RNA large subunit methyltransferase N
YpAngola_A0422       0       14   -1.821498  nucleoside diphosphate kinase
YpAngola_A0423       0       14   -1.840404  hypothetical protein
YpAngola_A0425       3       17    1.070320  pertactin family protein
YpAngola_A0424       0       17    3.848254  hypothetical protein
YpAngola_A0427       0       18    4.013081  enhanced serine sensitivity protein SseB
YpAngola_A0428       0       17    3.970968  aminopeptidase B
YpAngola_A0429       1       15    3.070228  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0420  SYCDCHAPRONE  30     0.008    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 29.5 bits (66), Expect = 0.008
Identities = 17/89 (19%), Positives = 25/89 (28%)

Query: 39 LGLAYLAQGDLTAARKNLEKAVEADPQDYRTQLGMAFYAQRIGENSAAEQRYQQAMKLAP 98
L G A K + D D R LG+ Q +G+ A Y +
Sbjct: 42 LAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDI 101

Query: 99 GNGTVLNNYGAFLCSLGQYVSAQQQFSAA 127
+ L G+ A+ A
Sbjct: 102 KEPRFPFHAAECLLQKGELAEAESGLFLA 130


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0425  PRTACTNFAMLY  149    2e-38    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 149 bits (376), Expect = 2e-38
Identities = 119/438 (27%), Positives = 183/438 (41%), Gaps = 44/438 (10%)

Query: 1048 LTMASLNGTGNFNLGSVMQSDSVAPLNVSGDANGDFIIAMNSSGQAPTNLN----VVNTN 1103
LT+ +L G+G F + L V DA+G + + +SG P + N V
Sbjct: 473 LTVNTLAGSGLFRMNVFADLGLSDKLVVMQDASGQHRLWVRNSGSEPASANTLLLVQTPL 532

Query: 1104 GGDARFALAN--GPVALGNYMTNLAKDANGNFVLTADKSAMTPGTAGIL----------- 1150
G A F LAN G V +G Y LA + NG + L K+ P A
Sbjct: 533 GSAATFTLANKDGKVDIGTYRYRLAANGNGQWSLVGAKAPPAPKPAPQPGPQPPQPPQPQ 592

Query: 1151 -------------------AVANTTPV-----IFNAELSSIQQRLDKQSTETNQSGMWGS 1186
A NT V ++ AE +++ +RL + + G WG
Sbjct: 593 PEAPAPQPPAGRELSAAANAAVNTGGVGLASTLWYAESNALSKRLGELRLNPDAGGAWGR 652

Query: 1187 YLNNNFAVKGRAAN-FDQKLNGMTLGGDKATALADGVLSVGGFASYSSSDIKTDYQSKGK 1245
+ RA FDQK+ G LG D A A+A G +GG A Y+ D G
Sbjct: 653 GFAQRQQLDNRAGRRFDQKVAGFELGADHAVAVAGGRWHLGGLAGYTRGDRGFTGDGGGH 712

Query: 1246 VDSHSFGAYAQYLANSGYYMNAVVKNNQFSQDVNITSINGSA-SGVSNFSGMGIALKAGK 1304
DS G YA Y+A+SG+Y++A ++ ++ D + +G A G G+G +L+AG+
Sbjct: 713 TDSVHVGGYATYIADSGFYLDATLRASRLENDFKVAGSDGYAVKGKYRTHGVGASLEAGR 772

Query: 1305 HFNFNEA-YVSPYVAMSAFSSGKSNISLSNGMEAQSSSTRSAMGTLGVNAGYRFVMNNGA 1363
F + ++ P ++ F +G +NG+ + S +G LG+ G R + G
Sbjct: 773 RFTHADGWFLEPQAELAVFRAGGGAYRAANGLRVRDEGGSSVLGRLGLEVGKRIELAGGR 832

Query: 1364 ELKPYAIFAVDHEFAKNNQVTVNQEVFDNNLSGTRVNTGAGMNVNITPNLSVGSEVKLSS 1423
+++PY +V EF V N L GTR G GM + S+ + + S
Sbjct: 833 QVQPYIKASVLQEFDGAGTVHTNGIAHRTELRGTRAELGLGMAAALGRGHSLYASYEYSK 892

Query: 1424 GKDIKTPVTINLNVGYSF 1441
G + P T + YS+
Sbjct: 893 GPKLAMPWTFHAGYRYSW 910


8  YpAngola_A0480  YpAngola_A0502  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias     %GCBias  Product
YpAngola_A0480      -2       16    3.819497  hypothetical protein
YpAngola_A0481      -2       16    4.138885  Mg chelatase-like protein
YpAngola_A0482      -2       17    2.680217  acetolactate synthase 2 catalytic subunit
YpAngola_A0483      -2       15    0.749713  acetolactate synthase 2 regulatory subunit
YpAngola_A0484      -1       15   -0.470158  branched-chain amino acid aminotransferase
YpAngola_A0485       1       13   -2.341448  dihydroxy-acid dehydratase
YpAngola_A0486       3       15   -5.754260  threonine dehydratase
YpAngola_A0487       4       20   -8.054361  putative entero membrane protein
YpAngola_A0488       1       14   -4.121240  putative entero membrane protein
YpAngola_A0489       0       14   -3.096536  putative entero membrane protein
YpAngola_A0490      -2       14   -1.386282  hypothetical protein
YpAngola_A0491      -1       15   -0.561862  hypothetical protein
YpAngola_A0492       0       16   -0.897187  DNA-binding transcriptional regulator IlvY
YpAngola_A0493       1       19   -2.139906  ketol-acid reductoisomerase
YpAngola_A0494       3       24   -3.527094  hypothetical protein
YpAngola_A0495       1       20   -2.083132  S-type pyocin family protein
YpAngola_A0496       2       16   -2.272221  hypothetical protein
YpAngola_A0497       2       16   -2.306272  colicin-E7 immunity protein
YpAngola_A0498       2       15   -2.047455  hypothetical protein
YpAngola_A0499       2       16   -3.481065  pili assembly chaperone
YpAngola_A0500       2       16   -3.819942  hypothetical protein
YpAngola_A0501       1       18   -4.996301  fimbrial usher protein
YpAngola_A0502       0       15   -4.712746  pili assembly chaperone
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
YpAngola_A0481  HTHFIS  44     1e-06    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 43.7 bits (103), Expect = 1e-06
Identities = 42/185 (22%), Positives = 66/185 (35%), Gaps = 47/185 (25%)

Query: 180 EPAPSPDNHLDLHDIIGQSQA----KRALEIAAAGGHNLLLLGPPGTGKTMLATRLTGLL 235
P+ D+ D ++G+S A R L L++ G GTGK ++A R
Sbjct: 125 RPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVA-RALHDY 183

Query: 236 PPLTDQE--ALEAAAIT-GLLHSNALPTQWRCRAFRAPHHSASMAALIG-------GGSI 285
+ A+ AAI L+ S L G G
Sbjct: 184 GKRRNGPFVAINMAAIPRDLIES----------------------ELFGHEKGAFTGAQT 221

Query: 286 PRPGEISLAHNGVLFLDEL----PEFERRVLDSLREPLESGEIIISRAAAKICFPAKVQL 341
G A G LFLDE+ + + R+L L++ GE + + + V++
Sbjct: 222 RSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQ----GE--YTTVGGRTPIRSDVRI 275

Query: 342 IAAMN 346
+AA N
Sbjct: 276 VAATN 280


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
YpAngola_A0488  RTXTOXIND  29     0.011    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 28.6 bits (64), Expect = 0.011
Identities = 11/45 (24%), Positives = 16/45 (35%), Gaps = 3/45 (6%)

Query: 2 RLPGA---VMKAKSKKIICALLLLGSILLGYFFWLSLRPVEIVAI 43
LP + S++ + L+ F L VEIVA
Sbjct: 41 FLPAHLELIETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVAT 85


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
YpAngola_A0489  RTXTOXIND  30     0.003    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 30.2 bits (68), Expect = 0.003
Identities = 10/36 (27%), Positives = 13/36 (36%)

Query: 1 MKAKSKKTLYALLLIGSVLLGYFFWLSLRPVEIVAV 36
S++ I L+ F L VEIVA
Sbjct: 50 ETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVAT 85


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0495  PYOCINKILLER  94     1e-22    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 94.5 bits (234), Expect = 1e-22
Identities = 96/354 (27%), Positives = 143/354 (40%), Gaps = 55/354 (15%)

Query: 6 QQQRVNADLETAKITEPQRVENARLTAEAAEKAARDRRISEEIAATEAKRQRMENERLAE 65
N L T I+ Q N A+A+ +AA + E+ AA EAKR+ E R
Sbjct: 184 LTAAYNVKLFTEAISSLQIRMNTLTAAKASIEAAAANKAREQAAA-EAKRKAEEQARQQA 242

Query: 66 QERQRVEGTKQQVSEASCAQQASAWQNRFTLPALQPSGSAQYSFAASGMSAVGE-AAELH 124
R N + +PA +GS + A G+ V + AA L
Sbjct: 243 AIRA---------------------ANTYAMPA---NGSVVATAAGRGLIQVAQGAASLA 278

Query: 125 NSFLAAQEQLSAIATISASGSVAAMIALGIYQTKVGESSERPPGWNVSPKFVGSISLSAM 184
+ A L + SA +A A Y ++ + + S ++ + + +
Sbjct: 279 QAISDAIAVLGRVLA-SAPSVMAVGFASLTYSSRT--AEQWQDQTPDSVRYALGMDAAKL 335

Query: 185 GLPATESL----ASQGEMALPVRMRIIDAKDWIGCTEIYAVKTGVAGVLPK-VKVGAAQY 239
GLP + +L + G + LP MR+ + G T +V + +PK V V A Y
Sbjct: 336 GLPPSVNLNAVAKASGTVDLP--MRLTNEAR--GNTTTLSVVSTDGVSVPKAVPVRMAAY 391

Query: 240 DESTGVYTFTTDST----PPRTLIFTPAQPPGAETRPILAPPGSTPATLQHTGEM---II 292
+ +TG+Y T ST PP L +TPA PPG + P +TP + +
Sbjct: 392 NATTGLYEVTVPSTTAEAPPLILTWTPASPPGNQN-----PSSTTPVVPKPVPVYEGATL 446

Query: 293 KPVITPTILPLPQLYARDFHDYIIWFPADSGLEPVYVYLNSPY---GKTTAKGK 343
PV T P + D II FPADSG++P+YV P G T KG+
Sbjct: 447 TPV-KATPETYPGVITLP-EDLIIGFPADSGIKPIYVMFRDPRDVPGAATGKGQ 498


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
YpAngola_A0501  PF00577  762    0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 762 bits (1970), Expect = 0.0
Identities = 252/900 (28%), Positives = 399/900 (44%), Gaps = 79/900 (8%)

Query: 15 RRKALTLCITLILHIDTAFGQEEP---QNFEFDESLFLGTKYASG-LTQLNKKNSITAGN 70
RK + L + AF + P F+ A L++ + G
Sbjct: 18 IRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGT 77

Query: 71 YDAVDVLVNNKLFKRMSVQFIKDANSSEVYPCLSDELLTAAGVELGRENSTPPKEPHVTE 130
Y VD+ +NN V F + + PCL+ L + G+ +
Sbjct: 78 Y-RVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSG---------- 126

Query: 131 ANTPITETHAPTNQCLPLSTRVKGASFRFDQAKLRLELSIPQALLQKRPRGYIERAEWQE 190
+ C+PL++ + A+ + D + RL L+IPQA + R RGYI W
Sbjct: 127 ------MNLLADDACVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDP 180

Query: 191 GEKLAFINYSANAYRSDTRGQQKRTSDFGFIGLKSGINLGLWQVRQQSNVRYASN--DSG 248
G +NY+ + R S + ++ L+SG+N+G W++R + Y S+ SG
Sbjct: 181 GINAGLLNYNFSGNSVQNRIGG--NSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSG 238

Query: 249 SDTQWNSIRTYVQRPIPQLDSQLTLGETFTDSTLFGSMSFLGAKMATDQRMWPVSMRGFS 308
S +W I T+++R I L S+LTLG+ +T +F ++F GA++A+D M P S RGF+
Sbjct: 239 SKNKWQHINTWLERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFA 298

Query: 309 PEVRGVASTNARVIIRQNGREIYETNVAPGPFVINDLFSTSSQGDLNVEVIEANGSRSTF 368
P + G+A A+V I+QNG +IY + V PGPF IND+++ + GDL V + EA+GS F
Sbjct: 299 PVIHGIARGTAQVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIF 358

Query: 369 TVPFSAVPDSMRPGVSRYNAVIGESRDFTN--IDNYFTDFTYERGLTNQLTANSGVRLAK 426
TVP+S+VP R G +RY+ GE R F T GL T G +LA
Sbjct: 359 TVPYSSVPLLQREGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLAD 418

Query: 427 DYTALLAGGVLGT-PVGALGLNATYSHAKVENDKTQDGWRMQATYSQTFNQTGTTFSLAG 485
Y A G +GAL ++ T +++ + +D DG ++ Y+++ N++GT L G
Sbjct: 419 RYRAFNFGIGKNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVG 478

Query: 486 YRYSTKGYRDLNDVFGVRSMQKNGGTWD-------------SSTYKQRSQFTTTINQDLG 532
YRYST GY + D R N T D + Y +R + T+ Q LG
Sbjct: 479 YRYSTSGYFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLG 538

Query: 533 NWGQLYASASTSDYYNDTARDTQLQLGYSNSYQQISYNLAVSRQRSVYTSTLYNWDSPDT 592
LY S S Y+ + D Q Q G + +++ I++ L+ S ++ +
Sbjct: 539 RTSTLYLSGSHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQ----------- 587

Query: 593 DETATTTRYGNTENIATFTVSIPL--------NIGSNNQYLSMSASRNPKSGNNYQTSLS 644
+ + V+IP + S S S + +
Sbjct: 588 ---------KGRDQMLALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVY 638

Query: 645 GTAGERNSFNYALNAGYDDSNFGSSSNNWGANVQKQFPNATVNGSYSRGNNYTQYGAGAR 704
GT E N+ +Y++ GY G+S + A + + N YS ++ Q G
Sbjct: 639 GTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDDIKQLYYGVS 698

Query: 705 GAAVIHRQGVTLGPYLGETFGLIEANGAQGARI--------DSNGFALVPALTPYNYNTI 756
G + H GVTLG L +T L++A GA+ A++ D G+A++P T Y N +
Sbjct: 699 GGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRV 758

Query: 757 GLDTKGINRNTELKENQGRVVPYAGAAVKVKFETLTGYAVLI--QAEGEGLPLGADVYNS 814
LDT + N +L VVP GA V+ +F+ G +L+ + LP GA V +
Sbjct: 759 ALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLMTLTHNNKPLPFGAMVTSE 818

Query: 815 KDELVGMVGQGNQIYARIADNKGTLDVRWGESSGDQCQLPYAFNRQDTEQDIIHITASCR 874
+ G+V Q+Y G + V+WGE C Y + +Q + ++A CR
Sbjct: 819 SSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHCVANYQLPPESQQQLLTQLSAECR 878


9  YpAngola_A0661  YpAngola_A0695  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias     %GCBias  Product
YpAngola_A0661       2       17    0.377501  hypothetical protein
YpAngola_A0662       3       18    0.058909  tellurium resistance protein
YpAngola_A0663       2       16   -0.235106  tellurite resistance protein
YpAngola_A0664       2       20   -1.539515  tellurite resistance protein
YpAngola_A0665      -1       14   -1.973975  tellurium resistance protein
YpAngola_A0666       1       15   -2.732645  tellurium resistance protein
YpAngola_A0667       3       17   -3.470668  tellurium resistance protein
YpAngola_A0668       3       16   -1.812574  IS1541 transposase
YpAngola_A0669       4       16   -2.175963  hypothetical protein
YpAngola_A0670       2       15   -1.781118  fimbrial usher protein
YpAngola_A0671       2       18   -0.928283  putative periplasmic chaperone protein
YpAngola_A0673       2       18   -0.189617  hypothetical protein
YpAngola_A0674       1       18    0.081885  hypothetical protein
YpAngola_A0675      -2       15   -1.068035  hypothetical protein
YpAngola_A0676      -2       15   -0.523790  putative oxidoreductase, FAD-binding
YpAngola_A0678       0       23    0.003636  chorismate pyruvate lyase
YpAngola_A0679      -1       21   -0.150361  4-hydroxybenzoate octaprenyltransferase
YpAngola_A0680       2       22   -0.011016  IS1541 transposase
YpAngola_A0681       2       26    0.791499  50S ribosomal protein L9
YpAngola_A0682      -2       18    2.477022  30S ribosomal protein S18
YpAngola_A0683      -1       17    2.519878  30S ribosomal protein S6
YpAngola_A0684       1       18    2.884077  esterase
YpAngola_A0685       2       16    3.590851  hypothetical protein
YpAngola_A0686       1       16    3.427988  putative lipoprotein
YpAngola_A0687       1       13    3.476958  isovaleryl CoA dehydrogenase
YpAngola_A0688       1       15    2.664471  transposase/IS protein
YpAngola_A0689       1       16    2.690667  insertion sequence transposase
YpAngola_A0690       0       17    2.365167  23S rRNA (guanosine-2'-O-)-methyltransferase
YpAngola_A0691       2       19    1.846836  hypothetical protein
YpAngola_A0692       2       19    1.906032  exoribonuclease R
YpAngola_A0693       2       20    0.847683  transcriptional repressor NsrR
YpAngola_A0694       3       21    1.071621  adenylosuccinate synthetase
YpAngola_A0695       2       19    1.036394  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
YpAngola_A0663  TONBPROTEIN  32     0.003    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 32.3 bits (73), Expect = 0.003
Identities = 11/42 (26%), Positives = 13/42 (30%)

Query: 161 VVVEDEPAAPTPVPTPVSTPAPTAPPVAKPINLSKVSLTKEK 202
VVE EP P P P KP K ++
Sbjct: 67 PVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQE 108


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
YpAngola_A0666  PF07824  33     3e-04    Type III secretion chaperone
		>PF07824#Type III secretion chaperone

Length = 120

Score = 32.6 bits (74), Expect = 3e-04
Identities = 15/85 (17%), Positives = 34/85 (40%), Gaps = 13/85 (15%)

Query: 85 EGDDESLKIKLPLI--PADVDKIVFVVTIHDAQARRQSFGQVANAFIRLVNDDNGVEIAR 142
E + +S+ + P P +++ +++ ++++ + +D+ G IAR
Sbjct: 35 EKEGDSINLLCPFCALPENINDLIYALSLN-----------YSEKICLATDDEGGSLIAR 83

Query: 143 YDLSEDASTETAMLFGELYRHNAEW 167
DL+ E + E Y W
Sbjct: 84 LDLTGINEFEDIYVNTEYYISRVRW 108


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
YpAngola_A0670  PF00577  358    e-113    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 358 bits (921), Expect = e-113
Identities = 181/876 (20%), Positives = 319/876 (36%), Gaps = 97/876 (11%)

Query: 1 MVARCINLQCIAFLFSFFPTLAFPVTEKG-EVVFDIETLERLGYSAELAKFFSGQDRFLP 59
+ R L A E+ F+ L + F P
Sbjct: 16 LHIRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPP 75

Query: 60 GQHDVTIIINASKTYRIAATFDSE-----GKLCMDKALLMALKLR-------NTESDGSC 107
G + V I +N TF++ C+ +A L ++ L N +D +C
Sbjct: 76 GTYRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDAC 135

Query: 108 ENMEARWPGMVVKLFPGQFRVEITLPQEAFDPEMEG----SEYQQGGHALLLNYNIFGQR 163
+ + +L GQ R+ +T+PQ G + G +A LLNYN G
Sbjct: 136 VPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNS 195

Query: 164 VESNNS-RFNLVQGQFEPGINFKNWVLRNRGSYSYNQGVSQ------YYNQETSALRAVE 216
V++ + + G+N W LR+ ++SYN S + + T R +
Sbjct: 196 VQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDII 255

Query: 217 SLKSVVQLGEFGLVGNTFSGLPVTGIQLYSDNAQRDDTQ--LIVPIEGIANTNATIEIRQ 274
L+S + LG+ G+ F G+ G QL SD+ D+Q I GIA A + I+Q
Sbjct: 256 PLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQ 315

Query: 275 RGRVIYRTIVAPGPFSLSNISNFSSGVNTDVSIIEEDGTQQNFTV-TSALDINAEQQASI 333
G IY + V PGPF++++I + + V+I E DG+ Q FTV S++ + + +
Sbjct: 316 NGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTR 375

Query: 334 YQLAVGRYRDMFTGEDRPSPLLLSGEMS--FNPAATFYMTSAGLLSSGYQNIRVQNLYSG 391
Y + G YR + P + T Y L+ Y+ N G
Sbjct: 376 YSITAGEYRS--GNAQQEKPRFFQSTLLHGLPAGWTIY--GGTQLADRYRAF---NFGIG 428

Query: 392 WDQAWF---SAAASYANTKDAGQGYQFSVQNQMTINGNFGVSWSSV------YGSANYWL 442
+ S + AN+ + N + S +++ Y ++ Y+
Sbjct: 429 KNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFN 488

Query: 443 PDDALSSSNNLNDL------------------MFGKLKNATSVAVSWVHPRWGAFSYALS 484
D S N ++ + + + V+ R + S
Sbjct: 489 FADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGS 548

Query: 485 NNMYYQASGR-TYHIFSISEQFGRATTILS-----SQLSSQGQNSLYVGINMPLG----- 533
+ Y+ S ++ F LS + L + +N+P
Sbjct: 549 HQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRS 608

Query: 534 -------NGTLSGRVQR-NNGNVALGSTYQGRWGDNKDYSVGISGD-------NRQRRIN 578
+ + S + NG + + G ++ + S + N
Sbjct: 609 DSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGY 668

Query: 579 GSMNIRTAYSQLTGGVSQATNNSRSAYLSSRGSVAYVNNTFATSSSSVGDTFAVVNIPNQ 638
++N R Y G S + ++ + Y G V + T + DT +V P
Sbjct: 669 ATLNYRGGYGNANIGYSHS-DDIKQLYYGVSGGVL-AHANGVTLGQPLNDTVVLVKAPGA 726

Query: 639 PGLRVSSPSSGIAITDYAGIALLPLVRPYTASKVQISTQTLPLNIRLNNTSADLLMTRGS 698
+V + + TD+ G A+LP Y ++V + T TL N+ L+N A+++ TRG+
Sbjct: 727 KDAKVENQT--GVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGA 784

Query: 699 VATHHFETTETRQLLLTIRGSDGEMLPIGANVLDEKGNFLGTIIGDGNFMLENKAIGVTL 758
+ F+ +LL+T+ + + LP GA V E G + +G L + +
Sbjct: 785 IVRAEFKARVGIKLLMTLT-HNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKV 843

Query: 759 RVKANNRDE--CRVNYREPEKFDPDVLYEVADAVCQ 792
+VK + C NY+ P + +L ++ A C+
Sbjct: 844 QVKWGEEENAHCVANYQLPPESQQQLLTQL-SAECR 878


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
YpAngola_A0689  HTHTETR  28     0.047    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.1 bits (62), Expect = 0.047
Identities = 12/106 (11%), Positives = 31/106 (29%), Gaps = 14/106 (13%)

Query: 16 QGMSSRAIARELGISRNTVKRYLQAKSEPPKYTPRPAVASLLDEYRDYIRQRIAD----- 70
S IA+ G++R + + + KS+ + + + I + +
Sbjct: 30 SSTSLGEIAKAAGVTRGAIYWHFKDKSD--------LFSEIWELSESNIGELELEYQAKF 81

Query: 71 -AHPYKIPATVIAREIRDQGYRGGMTILRAFIRSLSVPQEQEPAVR 115
P + ++ + +L I + V+
Sbjct: 82 PGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127


10  YpAngola_A0726  YpAngola_A0777  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias     %GCBias  Product
YpAngola_A0726       0       14    3.850977  anaerobic C4-dicarboxylate transporter
YpAngola_A0727       0       14    3.940804  divalent-cation tolerance protein CutA
YpAngola_A0728       0       10   -0.914402  thiol:disulfide interchange protein
YpAngola_A0729       0       13   -2.337883  formate dehydrogenase H
YpAngola_A0730       1       17   -4.709747  4Fe-4S ferredoxin
YpAngola_A0731       1       14   -4.982639  putative oxidoreductase Fe-S binding subunit
YpAngola_A0732       3       21   -7.405832  putative transcriptional regulator
YpAngola_A0734       3       24   -8.334361  *enhancing factor
YpAngola_A0735       0       15   -2.074230  hypothetical protein
YpAngola_A0736      -2       14   -1.299852  IS1541 transposase
YpAngola_A0737       0       14   -0.954731  rhamnose-proton symporter
YpAngola_A0738       0       17    0.981092  transcriptional activator RhaR
YpAngola_A0739       0       18    2.116258  hypothetical protein
YpAngola_A0740       1       18    2.769279  transcriptional activator RhaS
YpAngola_A0741       0       17    3.288966  hypothetical protein
YpAngola_A0742       1       17    3.477715  hypothetical protein
YpAngola_A0743       0       17    3.393080  rhamnulokinase
YpAngola_A0744       1       15    3.652643  L-rhamnose isomerase
YpAngola_A0745       1       13    3.023132  rhamnulose-1-phosphate aldolase
YpAngola_A0746      -3       12    2.858773  lactaldehyde reductase
YpAngola_A0748      -2       15    2.627590  hypothetical protein
YpAngola_A0747      -3       13    2.463670  hypothetical protein
YpAngola_A0749      -3       12    3.214236  single-stranded DNA-binding protein
YpAngola_A0750      -1       14    2.833422  hypothetical protein
YpAngola_A0751      -1       14    3.228626  excinuclease ABC subunit A
YpAngola_A0752      -1       16    3.293031  hypothetical protein
YpAngola_A0753      -1       15    3.425077  aromatic amino acid aminotransferase
YpAngola_A0754      -1       17    3.487707  alanine racemase
YpAngola_A0755      -1       17    2.795197  replicative DNA helicase
YpAngola_A0756      -2       16    1.895821  putative quinone oxidoreductase
YpAngola_A0758      -1       15    0.753349  hypothetical protein
YpAngola_A0759       0       16   -1.538227  hypothetical protein
YpAngola_A0760       2       19   -4.862250  hypothetical protein
YpAngola_A0761       4       20   -5.762819  hypothetical protein
YpAngola_A0762       2       21   -5.652137  hypothetical protein
YpAngola_A0763       2       26   -6.217037  hypothetical protein
YpAngola_A0764       0       15   -4.128833  ImpA domain-containing protein
YpAngola_A0765      -2       11   -2.241702  hypothetical protein
YpAngola_A0766      -1       11   -0.469954  hypothetical protein
YpAngola_A0767      -1       10   -0.186639  ribosomal large subunit pseudouridine synthase
YpAngola_A0768      -1       11   -0.243413  Dna-J like membrane chaperone protein
YpAngola_A0769      -1       13   -0.477513  organic solvent tolerance protein
YpAngola_A0770      -2       15   -1.409240  peptidyl-prolyl cis-trans isomerase SurA
YpAngola_A0771       0       17   -2.307838  4-hydroxythreonine-4-phosphate dehydrogenase
YpAngola_A0772       2       23   -5.005311  dimethyladenosine transferase
YpAngola_A0773       3       27   -5.309990  ApaG protein
YpAngola_A0774       3       27   -5.532039  diadenosine tetraphosphatase
YpAngola_A0775       4       23   -5.025275  hypothetical protein
YpAngola_A0776       3       18   -3.576870  hypothetical protein
YpAngola_A0777       2       15   -1.949924  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0727  AUTOINDCRSYN  28     0.007    Autoinducer synthesis protein signature.
		>AUTOINDCRSYN#Autoinducer synthesis protein signature.

Length = 216

Score = 27.9 bits (62), Expect = 0.007
Identities = 15/46 (32%), Positives = 23/46 (50%), Gaps = 1/46 (2%)

Query: 60 EGKLEQEYEVQLLFKSNTDH-QQALLTYIKQHHPYQTPELLVLPVR 104
+G E+E V L+F D Q+AL I + + + EL P+R
Sbjct: 163 QGLSEKEERVYLVFLPVDDENQEALARRINRSGTFMSNELKQWPLR 208


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
YpAngola_A0732  HTHTETR  48     3e-09    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 48.1 bits (114), Expect = 3e-09
Identities = 34/173 (19%), Positives = 60/173 (34%), Gaps = 11/173 (6%)

Query: 3 REQVLSNALNLLEQQGLANTTLEMLAKALSVEVSDLTRFWPDREALLYDCLRYHSQQIDT 62
R+ +L AL L QQG+++T+L +AKA V + + D+ L + I
Sbjct: 13 RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGE 72

Query: 63 WRRQLQLDETLSPQQKLLARY-QTLSEQVQNQRYPGCLFIAACSFYPDTEH----PIHQL 117
+ Q P L L V +R + F+ + Q
Sbjct: 73 LELEYQAKFPGDPLSVLREILIHVLESTVTEERRRL---LMEIIFHKCEFVGEMAVVQQA 129

Query: 118 AEQQKQASLHYTKALLQEMDAD---DADMVAQQMELILEGCLSKLLIKRQLAD 167
S + L+ AD++ ++ +I+ G +S L+ A
Sbjct: 130 QRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAP 182


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
YpAngola_A0746  PF07520  30     0.027    Virulence protein SrfB
		>PF07520#Virulence protein SrfB

Length = 1041

Score = 29.6 bits (66), Expect = 0.027
Identities = 16/83 (19%), Positives = 31/83 (37%), Gaps = 5/83 (6%)

Query: 282 ILLPVIEEYNRP---QATRRFARIAQAMGVDTQDMSDE-QASHQAIAAIRQLSLQVGIPA 337
++ VI P + + A + D Q + RQ S++V +P
Sbjct: 639 LVHRVISAIVLPRLQDSIAQAGGQFVAERMRELFGGDIGGQEQQTVQRRRQFSIRVLVPL 698

Query: 338 GFSAL-GIEESDIEGWLDKALAD 359
+ L E+++ +D +AD
Sbjct: 699 AEAILSACEDAEEADRIDIPVAD 721


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0754  ALARACEMASE   449    e-161    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 449 bits (1156), Expect = e-161
Identities = 147/357 (41%), Positives = 217/357 (60%), Gaps = 4/357 (1%)

Query: 2 KAATAVIDRHALRHNLQQIRRLAPQSRLVAVVKANAYGHGLLAAAHTLQDADCYGVARIS 61
+ A +D AL+ NL +R+ A +R+ +VVKANAYGHG+ + D + + +
Sbjct: 3 RPIQASLDLQALKQNLSIVRQAATHARVWSVVKANAYGHGIERIWSAIGATDGFALLNLE 62

Query: 62 EALMLRAGGIVKPILLLEGFFDAEDLPVLVANHIETAVHSLEQLVALEAATLSAPINVWM 121
EA+ LR G PIL+LEGFF A+DL + + + T VHS QL AL+ A L AP+++++
Sbjct: 63 EAITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARLKAPLDIYL 122

Query: 122 KLDTGMHRLGVRPDQAEAFYQRLSACRNVIQPVNIMSHFSRADEPEVAATQQQLACFDAF 181
K+++GM+RLG +PD+ +Q+L A NV + + +MSHF+ A+ P+ +A +
Sbjct: 123 KVNSGMNRLGFQPDRVLTVWQQLRAMANVGE-MTLMSHFAEAEHPD--GISGAMARIEQA 179

Query: 182 AAGKPGKQSIAASGGILRWPQAHRDWVRPGIVLYGVSPF-DAPYGRDFGLLPAMTLKSSL 240
A G ++S++ S L P+AH DWVRPGI+LYG SP + GL P MTL S +
Sbjct: 180 AEGLECRRSLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWRDIANTGLRPVMTLSSEI 239

Query: 241 IAVREHKAGESVGYGGTWVSERDTRLGVIAIGYGDGYPRSAPSGTPVWLNGREVSIVGRV 300
I V+ KAGE VGYGG + + + R+G++A GY DGYPR AP+GTPV ++G VG V
Sbjct: 240 IGVQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPTGTPVLVDGVRTMTVGTV 299

Query: 301 SMDMISIDLGPESTDKVGDEALMWGAELPVERVAACTGISAYELITNLTSRVAMEYL 357
SMDM+++DL P +G +WG E+ ++ VAA G YEL+ L RV + +
Sbjct: 300 SMDMLAVDLTPCPQAGIGTPVELWGKEIKIDDVAAAAGTVGYELMCALALRVPVVTV 356


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0759  PERTACTIN     30     0.026    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 30.5 bits (68), Expect = 0.026
Identities = 35/109 (32%), Positives = 47/109 (43%), Gaps = 22/109 (20%)

Query: 102 SPQWHSRVVLPKGSRVTLSDSSLNNRLANFSTGRTLKIQPLVIENAECAST-PPAYLPLS 160
+PQ + + +G+RVT+S SL+ N VIE A PP PLS
Sbjct: 309 APQLGAAIRAGRGARVTVSGGSLSAPHGN------------VIETGGGARRFPPPASPLS 356

Query: 161 VASQLQAGQAHLRLRLTTQGVASLSELDFAPMNLTLAGGIIQSNQLITT 209
+ LQAG QG A L + P+ LTLAGG ++ T
Sbjct: 357 I--TLQAGA-------RAQGRALLYRVLPEPVKLTLAGGAQGQGDIVAT 396


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0771  PF07520       30     0.016    Virulence protein SrfB
		>PF07520#Virulence protein SrfB

Length = 1041

Score = 29.9 bits (67), Expect = 0.016
Identities = 19/106 (17%), Positives = 30/106 (28%), Gaps = 11/106 (10%)

Query: 2 HNHNNRLVITPGEPAGVGPDLAITLAQQDWPVELVVCADPALLLARASQLNLPLQLREYQ 61
+ E G + + L DW E+ + A R+ + E+
Sbjct: 184 DPGAMSWFLQRLEADEDGNAVDLQLWVSDWLKEMFLDFKRAERPGRSISEENLPHMFEHW 243

Query: 62 ADQPAIAQQAGSLTILPVKTAVNVVPGK-----------LDVGNSH 96
A + Q P N V + LD+GNS
Sbjct: 244 ARYLSYLQVIQRAVAPPKMRFANTVAPRDAVAPVEVDLVLDIGNSR 289


11  YpAngola_A0794  YpAngola_A0802  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
YpAngola_A0794  -1      19       -4.771412   transcriptional activator NhaR
YpAngola_A0795  -1      20       -4.516763   pH-dependent sodium/proton antiporter
YpAngola_A0796  -1      21       -4.915619   chaperone protein DnaJ
YpAngola_A0797  -1      22       -5.968654   molecular chaperone DnaK
YpAngola_A0798  -1      21       -6.854540   hypothetical protein
YpAngola_A0799  -1      22       -7.392178   acetyltransferase domain-containing protein
YpAngola_A0800  -1      15       -3.162114   major facilitator transporter
YpAngola_A0801  2       16       -2.836641   molybdenum cofactor biosynthesis protein MogA
YpAngola_A0802  2       15       -1.281529   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0797  SHAPEPROTEIN  143    4e-40    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 143 bits (363), Expect = 4e-40
Identities = 81/387 (20%), Positives = 149/387 (38%), Gaps = 84/387 (21%)

Query: 5 IGIDLGTTNSCVAIMDGTKARVLENSEGDRTTPSIIAYTQDGET------LVGQPAKRQA 58
+ IDLGT N+ + + + VL PS++A QD VG AK+
Sbjct: 13 LSIDLGTANTLIYVKG--QGIVLNE-------PSVVAIRQDRAGSPKSVAAVGHDAKQML 63

Query: 59 VTNPQNTLFAIKRLIGRRFQDEEAQRDKDIMPYKIIAADNGDAWLEVKGQKMAPPQISAE 118
P N + AI+ + +D I + + +
Sbjct: 64 GRTPGN-IAAIRPM-----------KDGVIADFFVTEK------------------MLQH 93

Query: 119 VLKKMKKTAEDYLGEPVTEAVITVPAYFNDAQRQATKDAGRIAGLEVKRIINEPTAAALA 178
+K++ + P ++ VP +R+A +++ + AG +I EP AAA+
Sbjct: 94 FIKQVHS---NSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIG 150

Query: 179 YGL--DKEVGNRTIAVYDLGGGTFDISIIEIDEVDGEKTFEVLATNGDTHLGGEDFDSRL 236
GL + G+ V D+GGGT ++++I ++ V + +GG+ FD +
Sbjct: 151 AGLPVSEATGS---MVVDIGGGTTEVAVISLNGV---------VYSSSVRIGGDRFDEAI 198

Query: 237 INYLVEEFKKDQGMDLRTDPLAMQRLKEAAEKAKIELSSA----QQTDVNLPYITADGSG 292
INY+ + G + AE+ K E+ SA + ++ +
Sbjct: 199 INYVRRNYGSLIG-------------EATAERIKHEIGSAYPGDEVREIEVRGRNLAEGV 245

Query: 293 PKHMNIKVTRAKLESLVEDLVNRSIEPLKVALQD-AGLSVSDIQD--VILVGGQTRMPMV 349
P+ + + LE+L E + + + VAL+ SDI + ++L GG + +
Sbjct: 246 PRGFTLN-SNEILEALQEP-LTGIVSAVMVALEQCPPELASDISERGMVLTGGGALLRNL 303

Query: 350 QKKVADFFGKEPRKDVNPDEAVAIGAA 376
+ + + G +P VA G
Sbjct: 304 DRLLMEETGIPVVVAEDPLTCVARGGG 330


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0800  TCRTETB       38     5e-05    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 38.3 bits (89), Expect = 5e-05
Identities = 39/280 (13%), Positives = 99/280 (35%), Gaps = 37/280 (13%)

Query: 95 LGGIIMAHFGDLVGRKKMFTLSILLMALPTLAIGMLPTYATIGITAPLLLLLMRVLQGAA 154
+G + D +G K++ I++ ++ + ++ ++ L++ R +QGA
Sbjct: 64 IGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSL-------LIMARFIQGAG 116

Query: 155 IGGEVPGAWVFVAEHVPRKRIGIACGTLTAGLTAGILLGSLVATVMNTTLGHQAIL---- 210
V VA ++P++ G A G + + + G +G + ++ + +L
Sbjct: 117 AAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM 176

Query: 211 ---------------EGGWRIPFFLGGIFGLFA----------MYLRRWLQETPIFKEMQ 245
E + F + GI + Y +L + + +
Sbjct: 177 ITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIF 236

Query: 246 ARKTLAEELPLKSVVVNHKKEVVVSMLLTWLLSAGIVVVILMTPTYLQKQFNVPP-ELAL 304
+ P + ++ +L ++ + + M P ++ + E+
Sbjct: 237 VKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGS 296

Query: 305 QANSLAIIALVIGCVVAGLAIDRFGASKTFIVGSLMLAMS 344
++++I + G+ +DR G +G L++S
Sbjct: 297 VIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVS 336


12  YpAngola_A0853  YpAngola_A0886  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
YpAngola_A0853  1       14       3.323717    hypothetical protein
YpAngola_A0854  1       18       4.049335    hypothetical protein
YpAngola_A0855  1       19       3.714973    autoinducer-2 (AI-2) kinase
YpAngola_A0857  2       20       3.173959    hypothetical protein
YpAngola_A0858  2       20       3.573627    autoinducer AI-2 ABC transporter ATP binding
YpAngola_A0859  1       22       2.866607    autoinducer AI-2 ABC transporter permease LsrC
YpAngola_A0860  -1      18       3.019779    autoinducer AI-2 ABC transporter permease LsrD
YpAngola_A0861  -1      15       2.919159    autoinducer AI-2 ABC transporter periplasmic
YpAngola_A0862  -1      15       3.080714    aldolase
YpAngola_A0863  -1      14       2.533323    autoinducer-2 (AI-2) modifying protein LsrG
YpAngola_A0864  -2      12       1.065159    hypothetical protein
YpAngola_A0865  0       14       0.485866    HPr family phosphocarrier protein
YpAngola_A0866  -1      15       -2.823234   putative fructose-like permease EIIC subunit 2
YpAngola_A0867  -1      15       -3.237421   putative PTS system fructose-like transporter
YpAngola_A0869  -1      16       -3.505054   putative fructose-like phosphotransferase EIIB
YpAngola_A0868  0       16       -1.992814   AraC family transcriptional regulator
YpAngola_A0870  1       12       0.549888    hypothetical protein
YpAngola_A0871  0       12       1.517783    hypothetical protein
YpAngola_A0872  1       15       2.765937    M48 family peptidase
YpAngola_A0873  2       17       2.219555    hypothetical protein
YpAngola_A0874  1       17       2.356426    hypothetical protein
YpAngola_A0875  2       18       1.687381    hypothetical protein
YpAngola_A0876  3       23       0.812169    hypothetical protein
YpAngola_A0878  3       23       0.180148    IS285 transposase
YpAngola_A0879  3       20       -0.259866   50S ribosomal protein L19
YpAngola_A0880  0       14       -0.875664   tRNA (guanine-N(1)-)-methyltransferase
YpAngola_A0881  0       13       -1.922014   16S rRNA-processing protein RimM
YpAngola_A0882  0       17       -1.220827   30S ribosomal protein S16
YpAngola_A0883  0       16       -0.265971   hypothetical protein
YpAngola_A0884  0       14       -0.301932   signal recognition particle protein
YpAngola_A0885  2       15       -0.375189   hypothetical protein
YpAngola_A0886  3       15       0.021014    putative membrane protein, truncation
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0865  PHPHTRNFRASE  579    0.0      Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 579 bits (1494), Expect = 0.0
Identities = 193/587 (32%), Positives = 309/587 (52%), Gaps = 22/587 (3%)

Query: 73 PTLLRARSVSPGTACGKLLSLIRADLNA--LGDLPVAQGIEREQQMLADGVAQLGKAWES 130
+ + S G A K + +++ V+ IE+ L +L
Sbjct: 2 HHKITGIAASSGVAIAKAFIHLEPNVDIEKTSITDVSTEIEKLTAALEKSKEELRAI--- 58

Query: 131 LLVANSSTAANSSTTENNSTTENNSTTRAIREVHRSLLRDGTFRQRLLSHIVAGESCATA 190
++ + + I H +L D + I + A
Sbjct: 59 ---------------KDQTEASMGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEY 103

Query: 191 IVATAA-YFSQQLALAANTYLRERELDIRDVSFQLLQQIYGEQRFPSQQALSEDSLCIAD 249
+ + F N Y++ER DIRDVS ++L + G + S ++E+++ IA+
Sbjct: 104 ALKEVSDMFVSMFESMDNEYMKERAADIRDVSKRVLGHLIGVET-GSLATIAEETVIIAE 162

Query: 250 ELTPSQFLALDKRYLKGLLLGRGGSTSHTVILARSFNIPTLVGVDAAALQPYINQSLQID 309
+LTPS L+K+++KG GG TSH+ I++RS IP +VG + + +D
Sbjct: 163 DLTPSDTAQLNKQFVKGFATDIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVD 222

Query: 310 GELGLVVCLLDEPVRRYYRQEQWLHDQLREQQSRYQNMPGRTLDGVRMVVAANITHAVEV 369
G G+V+ E + Y +++ ++ +++ ++ P T DG + +AANI +V
Sbjct: 223 GIEGIVIVNPTEEEVKAYEEKRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDV 282

Query: 370 EGAFNQGAESIGLFRTEILYMDRAAAPSEEELYTLYAQALGAAKGKPIIIRTIDIGGDKP 429
+G G E IGL+RTE LYMDR P+EEE + Y + + GKP++IRT+DIGGDK
Sbjct: 283 DGVLANGGEGIGLYRTEFLYMDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTLDIGGDKE 342

Query: 430 VSYLNIPAESNPFLGYRAVRIYHEFLSLFHTQLRAILRASMHGPLKIMIPMISSMEEILW 489
+SYL +P E NPFLG+RA+R+ E +F TQLRA+LRAS +G LK+M PMI+++EE+
Sbjct: 343 LSYLQLPKELNPFLGFRAIRLCLEKQDIFRTQLRALLRASTYGNLKVMFPMIATLEELRQ 402

Query: 490 VKDQLAEVKQSLRISHLQFDETVPLGMMLEVPSVMFIIDQCCEEMDFLSIGSNDLTQYLL 549
K + E K L + +++ +G+M+E+PS + +E+DF SIG+NDL QY +
Sbjct: 403 AKAIMQEEKDKLLSEGVDVSDSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTM 462

Query: 550 AVDRDNAKVSEHYHCLPPALLRALDYAVCEVHRHGKWIGLCGELAAKDSVLPLLVAMGLD 609
A DR N +VS Y PA+LR +D + H GKW+G+CGE+A + +PLL+ +GLD
Sbjct: 463 AADRMNERVSYLYQPYHPAILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLD 522

Query: 610 EISMSASFIGATKARLAKLDRGECRLLLNRVMACRTSREVEYLLVQY 656
E SMSA+ I +++L KL + E + + + T+ EVE L+ +
Sbjct: 523 EFSMSATSILPARSQLLKLSKEELKPFAQKALMLDTAEEVEQLVKKT 569


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0875  RTXTOXIND     30     0.018    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 30.2 bits (68), Expect = 0.018
Identities = 5/71 (7%), Positives = 26/71 (36%), Gaps = 4/71 (5%)

Query: 14 LQEQANALAHIQALNFES-IDLPTAQRQLEELQARLDRLTHPQSDIAIAKAALDEAEARQ 72
+ + + + + + + + + E L +S + ++ + A+
Sbjct: 233 EKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVY---KSQLEQIESEILSAKEEY 289

Query: 73 KELERQYQQEV 83
+ + + ++ E+
Sbjct: 290 QLVTQLFKNEI 300


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0876  TYPE3OMOPROT  27     0.005    Type III secretion system outer membrane O protein ...
		>TYPE3OMOPROT#Type III secretion system outer membrane O protein family signature.

Length = 303

Score = 26.9 bits (59), Expect = 0.005
Identities = 18/54 (33%), Positives = 24/54 (44%), Gaps = 2/54 (3%)

Query: 11 ELPSYITGANSIRLNHSVPRSVDSTDKTSRSLMALTGITDSGDVPTSRLLAYCS 64
ELP+ G ++ R V + T RSL+ GI D + TSR YC
Sbjct: 136 ELPAVGGG--RPKMLRWPLRFVIGSSDTQRSLLGRIGIGDVLLIRTSRAEVYCY 187


13  YpAngola_A0991  YpAngola_A1013  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
YpAngola_A0991  2       12       2.532845    hypothetical protein
YpAngola_A0992  1       12       2.522707    iron-sulfur cluster insertion protein ErpA
YpAngola_A0993  1       12       2.811094    chloride channel protein
YpAngola_A0994  0       11       3.578236    hypothetical protein
YpAngola_A0995  -1      12       4.113429    glutamate-1-semialdehyde aminotransferase
YpAngola_A0996  -1      15       5.406550    iron-hydroxamate transporter permease subunit
YpAngola_A0997  -1      15       5.414320    iron-hydroxamate transporter substrate-binding
YpAngola_A0999  -1      15       4.900467    hypothetical protein
YpAngola_A0998  -1      13       4.727588    iron-hydroxamate transporter ATP-binding
YpAngola_A1000  -1      11       4.166407    penicillin-binding protein 1b
YpAngola_A1001  0       11       4.138859    ATP-dependent RNA helicase HrpB
YpAngola_A1002  0       12       1.892060    2'-5' RNA ligase
YpAngola_A1003  0       14       1.385448    sugar fermentation stimulation protein A
YpAngola_A1005  0       16       1.959432    RNA polymerase-binding transcription factor
YpAngola_A1006  -2      15       2.456153    glutamyl-Q tRNA(Asp) synthetase
YpAngola_A1007  -3      19       3.213090    hypothetical protein
YpAngola_A1008  -2      18       2.617807    poly(A) polymerase I
YpAngola_A1009  -2      16       -0.005965   2-amino-4-hydroxy-6-
YpAngola_A1010  -2      16       -0.583849   hypothetical protein
YpAngola_A1011  -1      17       -0.358189   3-methyl-2-oxobutanoate
YpAngola_A1012  -1      16       -1.208333   pantoate--beta-alanine ligase
YpAngola_A1013  -1      14       -3.034798   aspartate alpha-decarboxylase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0997  FERRIBNDNGPP  400    e-143    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 400 bits (1030), Expect = e-143
Identities = 143/262 (54%), Positives = 182/262 (69%), Gaps = 1/262 (0%)

Query: 39 IDTKRVVALEWLPVELLLALGVTPFGVADIHNYRLWVGEPALPADVINVGQRTEPNLELL 98
ID R+VALEWLPVELLLALG+ P+GVAD NYRLWV EP LP VI+VG RTEPNLELL
Sbjct: 33 IDPNRIVALEWLPVELLLALGIVPYGVADTINYRLWVSEPPLPDSVIDVGLRTEPNLELL 92

Query: 99 QQMAPSLILLSQGYGPSPEKLAPIAPTMSFAFNEQGSSPLAVGKNSLQTLGQRLGLETAA 158
+M PS ++ S GYGPSPE LA IAP F F++ G PLA+ + SL + L L++AA
Sbjct: 93 TEMKPSFMVWSAGYGPSPEMLARIAPGRGFNFSD-GKQPLAMARKSLTEMADLLNLQSAA 151

Query: 159 QQHLADFDHFMLAARARLSGDTQTPLLMFSLLDPRHALIIGNGSLFQDVLSTLNIENAWQ 218
+ HLA ++ F+ + + R PLL+ +L+DPRH L+ G SLFQ++L I NAWQ
Sbjct: 152 ETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVFGPNSLFQEILDEYGIPNAWQ 211

Query: 219 GETNFWGSAVVGIERLATIKTARAVCFGHGNNEMLQQVARTPLWQSLSFVRENQLRLLPP 278
GETNFWGS V I+RLA K +CF H N++ + + TPLWQ++ FVR + + +P
Sbjct: 212 GETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKDMDALMATPLWQAMPFVRAGRFQRVPA 271

Query: 279 VWFYGATLSAMRFVRLLEQAWG 300
VWFYGATLSAM FVR+L+ A G
Sbjct: 272 VWFYGATLSAMHFVRVLDNAIG 293


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1005  PF01540       27     0.043    Adhesin lipoprotein
		>PF01540#Adhesin lipoprotein

Length = 475

Score = 27.0 bits (59), Expect = 0.043
Identities = 18/88 (20%), Positives = 31/88 (35%), Gaps = 11/88 (12%)

Query: 66 AQLSHFKLILEAWRNQLRDEVDRTVSHMQDEAANFPDPVDRAAQEEEFSL-----ELRNR 120
++L FK +W ++ E + E A D+ EE + EL
Sbjct: 196 SELESFKEFNTSWLEKIVSEWEEVKKAWSKELAEIKAEDDKKLAEENQKIKEGAKELLKL 255

Query: 121 DRE-RKLIKKIEKTLKKVE-----DDDF 142
+ + I T+ K+E D+ F
Sbjct: 256 SEKIQSFADTIALTITKLERKFQIDEKF 283


14  YpAngola_A1028  YpAngola_A1086  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
YpAngola_A1028  2       26       1.326046    bifunctional aconitate hydratase
YpAngola_A1029  3       26       0.718399    IS1541 transposase
YpAngola_A1031  3       28       0.693654    dihydrolipoamide dehydrogenase
YpAngola_A1032  2       24       0.617127    hypothetical protein
YpAngola_A1033  1       22       0.619367    dihydrolipoamide acetyltransferase
YpAngola_A1034  0       17       0.041223    pyruvate dehydrogenase subunit E1
YpAngola_A1035  -1      12       -0.363204   transcriptional regulator PdhR
YpAngola_A1036  0       14       -1.168383   hypothetical protein
YpAngola_A1037  0       16       -1.329484   aromatic amino acid transporter
YpAngola_A1038  1       21       -1.995819   regulatory protein AmpE
YpAngola_A1039  1       20       -2.127856   N-acetyl-anhydromuranmyl-L-alanine amidase
YpAngola_A1040  2       22       -2.287293   quinolinate phosphoribosyltransferase
YpAngola_A1041  1       22       -2.825522   putative major pilin subunit
YpAngola_A1042  1       22       -2.700045   hypothetical protein
YpAngola_A1043  1       21       -2.657487   type IV pilin biogenesis protein
YpAngola_A1044  1       19       -0.836587   guanosine 5'-monophosphate oxidoreductase
YpAngola_A1045  1       23       -0.149760   dephospho-CoA kinase
YpAngola_A1046  3       23       0.184811    hypothetical protein
YpAngola_A1047  3       26       0.653845    zinc-binding protein
YpAngola_A1048  1       24       0.999560    transposase
YpAngola_A1049  1       26       0.027633    insertion sequence transposase
YpAngola_A1050  2       29       -3.333469   transposase/IS protein
YpAngola_A1051  0       29       -5.081746   hypothetical protein
YpAngola_A1052  2       28       -6.366472   hypothetical protein
YpAngola_A1053  1       29       -6.407318   L-PSP family endoribonuclease
YpAngola_A1054  2       28       -6.894941   putative endoribonuclease L-PSP
YpAngola_A1056  0       21       -3.546233   hypothetical protein
YpAngola_A1057  -1      14       -0.948709   Na+/H+ antiporter family protein
YpAngola_A1058  0       14       1.245119    putative aspartate aminotransferase
YpAngola_A1059  0       19       2.804303    hypothetical protein
YpAngola_A1060  0       19       2.185178    hypothetical protein
YpAngola_A1061  -1      19       2.456002    multidrug resistance protein MdtN
YpAngola_A1063  1       20       1.346492    hypothetical protein
YpAngola_A1065  3       29       2.248523    hypothetical protein
YpAngola_A1066  2       27       1.629902    sugar ABC transporter permease
YpAngola_A1067  3       26       1.604825    sugar ABC transporter permease
YpAngola_A1068  3       25       2.851376    sugar ABC transporter periplasmic sugar-binding
YpAngola_A1069  4       24       2.574370    LacI family sugar-binding transcriptional
YpAngola_A1070  3       24       2.827737    hypothetical protein
YpAngola_A1071  2       18       1.386263    ABC transporter ATP-binding protein
YpAngola_A1072  0       18       2.851737    hypothetical protein
YpAngola_A1074  1       18       3.346950    putative autotransporter protein
YpAngola_A1077  2       18       0.554599    hypothetical protein
YpAngola_A1078  -1      15       -0.026346   hypothetical protein
YpAngola_A1081  -2      13       0.774600    hypothetical protein
YpAngola_A1084  -2      16       1.479026    ShlB/FhaC/HecB family hemolysin
YpAngola_A1085  2       18       0.537987    hypothetical protein
YpAngola_A1086  2       18       0.702294    putative tellurium resistance protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1033  RTXTOXIND     34     0.001    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 34.0 bits (78), Expect = 0.001
Identities = 18/83 (21%), Positives = 34/83 (40%), Gaps = 2/83 (2%)

Query: 26 DTVEAEQSLITVEGDKASMEVPSPQAGVVKEIKIAVGDKVATGSLIMVFDATGAAAAPVK 85
+ V +T G S E+ + +VKEI + G+ V G +++ A GA A +K
Sbjct: 81 EIVATANGKLTHSGR--SKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLK 138

Query: 86 AEEKPAAPAQVAAPAASAAKNVE 108
+ ++++E
Sbjct: 139 TQSSLLQARLEQTRYQILSRSIE 161



Score = 30.6 bits (69), Expect = 0.017
Identities = 10/49 (20%), Positives = 21/49 (42%), Gaps = 1/49 (2%)

Query: 26 DTVEAEQSLITVEGDKASMEVPSPQAGVVKEIKI-AVGDKVATGSLIMV 73
+ L E + + + +P + V+++K+ G V T +MV
Sbjct: 310 NIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMV 358



Score = 29.8 bits (67), Expect = 0.026
Identities = 12/44 (27%), Positives = 21/44 (47%), Gaps = 1/44 (2%)

Query: 133 EQSLITVEGDKASMEVPAPFAGIVKEIKIST-GDKVKTGSLIMV 175
L E + + + AP + V+++K+ T G V T +MV
Sbjct: 315 TLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMV 358


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1041  BCTERIALGSPG  41     3e-07    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 41.0 bits (96), Expect = 3e-07
Identities = 21/66 (31%), Positives = 38/66 (57%)

Query: 10 QKGFTLIELMVAVAIIAVLSGIGIPSYQRYIQKAALTDMLQAIVPYKMAVELCALEQSNL 69
Q+GFTL+E+MV + II VL+ + +P+ +KA + IV + A+++ L+ +
Sbjct: 7 QRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDNHHY 66

Query: 70 DSCNAG 75
+ N G
Sbjct: 67 PTTNQG 72


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1043  BCTERIALGSPF  274    5e-91    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 274 bits (702), Expect = 5e-91
Identities = 107/405 (26%), Positives = 204/405 (50%), Gaps = 13/405 (3%)

Query: 6 LFNWTALNKTGELQTGMLLATERNSVYEHIIQHGLQPLGV-----KGGRRLSARYWQGER 60
+++ AL+ G+ G A + + + GL PL V + S +
Sbjct: 3 QYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLRRK 62

Query: 61 -------LVAMTRQLATLLQAGLPLVNSLQLLAKEADDSAWRCLLDEISQQVAQGQSLSE 113
L +TRQLATL+ A +PL +L +AK+++ L+ + +V +G SL++
Sbjct: 63 IRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLAD 122

Query: 114 VMEQYPHVFPRLYPPVVAVGELTGNLEQCCTQLVHHQERQQNLHKKVIKALKYPVVVCIV 173
M+ +P F RLY +VA GE +G+L+ +L + E++Q + ++ +A+ YP V+ +V
Sbjct: 123 AMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLTVV 182

Query: 174 ALVVSVIMLVMVLPEFAQIYQSFDTPLPGLTASLLWLSTFLTFYGPYLALIIAIVCIGYF 233
A+ V I+L +V+P+ + + LP T L+ +S + +GP++ L + + +
Sbjct: 183 AIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMAFR 242

Query: 234 YTLRKKSRWQQWEQTILLSIPLVSTLIRGSCLSQIFQTLAITQQAGLPLSAGLDAAARSI 293
LR++ R + + LL +PL+ + RG ++ +TL+I + +PL + + +
Sbjct: 243 VMLRQEKRRVSFHRR-LLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDVM 301

Query: 294 HNYNYQQALRCIQKQISQGIPLYTTLNQHPLFPAICQQLIRVGEESGSQDVLLEKLACWH 353
N + L + +G+ L+ L Q LFP + + +I GE SG D +LE+ A
Sbjct: 302 SNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADNQ 361

Query: 354 QQQTQNLADNVTQMLEPLLMLIIGSIVGVLVIAMYLPIFQLGDVI 398
++ + + EPLL++ + ++V +V+A+ PI QL ++
Sbjct: 362 DREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1049  HTHTETR       28     0.047    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.1 bits (62), Expect = 0.047
Identities = 12/106 (11%), Positives = 31/106 (29%), Gaps = 14/106 (13%)

Query: 16 QGMSSRAIARELGISRNTVKRYLQAKSEPPKYTPRPAVASLLDEYRDYIRQRIAD----- 70
S IA+ G++R + + + KS+ + + + I + +
Sbjct: 30 SSTSLGEIAKAAGVTRGAIYWHFKDKSD--------LFSEIWELSESNIGELELEYQAKF 81

Query: 71 -AHPYKIPATVIAREIRDQGYRGGMTILRAFIRSLSVPQEQEPAVR 115
P + ++ + +L I + V+
Sbjct: 82 PGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1061  RTXTOXIND     52     2e-09    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 51.8 bits (124), Expect = 2e-09
Identities = 61/417 (14%), Positives = 113/417 (27%), Gaps = 96/417 (23%)

Query: 7 SGRKRQLALIVAGVIIIAAAISGWLSVRQTTLNPLSEDAELGASVVH------IASSVPG 60
S R R +A + G ++IA +S + A + H I
Sbjct: 54 SRRPRLVAYFIMGFLVIAFILSVL--------GQVEIVATANGKLTHSGRSKEIKPIENS 105

Query: 61 RIISINVEENSKVRRGDLLFSIEPDLYRLQVEQAQAELKMAEAT---------------- 104
+ I V+E VR+GD+L + + Q+ L A
Sbjct: 106 IVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKL 165

Query: 105 -----------HDTQQRTVVAERSN--AAITNEQIVRAQANLKLATQT------------ 139
+ + V+ S + Q + Q L L +
Sbjct: 166 PELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINR 225

Query: 140 -----------LARLQPLRPKGYVTAQQVDDAATAKHDAEVSLKQALKQSVAAEALVSST 188
L L K + V + +A L+ Q E+ + S
Sbjct: 226 YENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSA 285

Query: 189 -------------------ASSEALVVARRAALAIAERELANTQIHAPNDGRVVGLTV-S 228
+ + LA E + I AP +V L V +
Sbjct: 286 KEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHT 345

Query: 229 AGEFVAPDQAIFTLINTEH-WHASAFFRETELKHIKVGDCATVYVMADRQRAIQGRVEGI 287
G V + + ++ + +A + ++ I VG A + V A G + G
Sbjct: 346 EGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRY-GYLVGK 404

Query: 288 GWGVSSEDMLNIPRGLPYVPKSLNWVRVVQRFPVRISLEKPPEDLMRIGATAVVIVR 344
++ + + + GL + V+ + G ++
Sbjct: 405 VKNINLDAIEDQRLGLVF--------NVIISIEENCLSTGNKNIPLSSGMAVTAEIK 453


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1071  PF05272       32     0.007    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 31.6 bits (71), Expect = 0.007
Identities = 11/35 (31%), Positives = 17/35 (48%)

Query: 34 MVIVGPSGCAKSTMLRMIAGLEEISSGELTIADRK 68
+V+ G G KST++ + GL+ S I K
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGK 633


15  YpAngola_A1236  YpAngola_A1257  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
YpAngola_A1236  2       19       1.414795    DNA-damage-inducible protein J
YpAngola_A1237  2       16       -1.834753   aliphatic sulfonates transport ATP-binding
YpAngola_A1238  3       19       -4.488480   transposase
YpAngola_A1239  4       24       -8.649151   hypothetical protein
YpAngola_A1240  3       25       -9.274541   hypothetical protein
YpAngola_A1241  6       32       -13.291780  IS66 family Orf1
YpAngola_A1242  5       35       -14.046535  modification methylase
YpAngola_A1243  3       31       -12.486054  hypothetical protein
YpAngola_A1245  2       31       -10.782542  hypothetical protein
YpAngola_A1244  2       30       -9.982998   IS285 transposase
YpAngola_A1248  4       40       -11.910449  putative mannosyltransferase WbyJ
YpAngola_A1250  3       37       -10.578392  glycosyl transferase WbyK, group 1 family
YpAngola_A1253  2       32       -8.655273   GDP-L-fucose synthetase
YpAngola_A1254  2       24       -6.408317   mannose-1-phosphate guanylyltransferase
YpAngola_A1255  2       20       -5.880231   glycosyltransferase
YpAngola_A1256  1       18       -4.896902   phosphomannomutase
YpAngola_A1257  1       15       -3.515953   ferric enterobactin transport protein FepE
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1237  PF05272       31     0.006    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.8 bits (69), Expect = 0.006
Identities = 13/30 (43%), Positives = 15/30 (50%)

Query: 44 VGRSGCGKSTLLRLLAGLEAASDGTLLSGN 73
G G GKSTL+ L GL+ SD G
Sbjct: 602 EGTGGIGKSTLINTLVGLDFFSDTHFDIGT 631


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1241  PF05043       28     0.006    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 27.6 bits (61), Expect = 0.006
Identities = 6/24 (25%), Positives = 13/24 (54%)

Query: 20 GVSARELCRKHAISDATFYTLRKK 43
G A +C++ IS ++ Y + +
Sbjct: 100 GCQAESICKEFYISSSSLYRIISQ 123


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1253  NUCEPIMERASE  80     3e-19    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 80.2 bits (198), Expect = 3e-19
Identities = 58/352 (16%), Positives = 122/352 (34%), Gaps = 59/352 (16%)

Query: 5 RVFIAGHRGMVGSAIVRQLENRND--------------------IELIIRDR---TELDL 41
+ + G G +G + ++L +EL+ + ++DL
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 42 MSQSAVQKFFATEKIDEIYLAAAKVGGIQANNNYPAEFIYQNLMIECNIIHAAHLAGIQK 101
+ + FA+ + ++++ ++ + N P + NL NI+ IQ
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLEN-PHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 102 LLFLGASCIYPKLAAQPMTEEALLTGVLEPTNEP---YAIAKIAGIKLCESYNRQYGRDY 158
LL+ +S +Y P + + + + P YA K A + +Y+ YG
Sbjct: 121 LLYASSSSVYGLNRKMPFSTD-------DSVDHPVSLYAATKKANELMAHTYSHLYGLPA 173

Query: 159 RSVMPTNLYGENDNFHPENSHVIPALLRRFHEAKIRNDKEMVVWGTGKPMREFLHVDDMA 218
+ +YG P + + K + V+ GK R+F ++DD+A
Sbjct: 174 TGLRFFTVYGPWGR---------PDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIA 224

Query: 219 AASVHVMELSDQIYQTNTQPMLSH------------INVGTGVDCTIRELAETMAKVVGF 266
A ++ L D I +TQ + N+G + + + + +G
Sbjct: 225 EA---IIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGI 281

Query: 267 TGNLVFDSTKPDGTPRKLMDVSRLAK-LGWCYQISLEVGLTMTYQWFLAHQN 317
+P D L + +G+ + +++ G+ W+
Sbjct: 282 EAKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDFYK 333


16  YpAngola_A1268  YpAngola_A1273  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
YpAngola_A1268  -1      14       3.215619    hypothetical protein
YpAngola_A1269  -1      13       4.001047    putative thioredoxin
YpAngola_A1270  -1      13       4.544642    short chain dehydrogenase
YpAngola_A1271  0       12       3.884146    multifunctional acyl-CoA thioesterase I/protease
YpAngola_A1272  -1      12       3.601805    putative ABC transporter ATP-binding protein
YpAngola_A1273  0       12       3.093573    efflux ABC transporter permease
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1268  CHANLCOLICIN  29     0.021    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 29.3 bits (65), Expect = 0.021
Identities = 33/153 (21%), Positives = 61/153 (39%), Gaps = 20/153 (13%)

Query: 130 SQRDNINSRLLHIVDEATNPWGIKITRIEIRDVRPP--TELISAMNAQMKAERTKRADIL 187
+ RD + RL IV+EA + R P TEL A NA M+AE +
Sbjct: 85 ANRDALTQRLKDIVNEA----------LRHNASRTPSATELAHANNAAMQAEDERLRLAK 134

Query: 188 EAEGVRQAAILRAEGEKQSQILKAEGERQSA-------FLQAEARERAAEAEAQATKMVS 240
E R+ A + ++++ + E ER+ A +AE + AA +E ++
Sbjct: 135 AEEKARKEAEAAEKAFQEAEQRRKEIEREKAETERQLKLAEAEEKRLAALSEEAKAVEIA 194

Query: 241 EAIAAGDIQAINYFVAQKYTDALQHIGSANNSK 273
+ + + + T + S+ +++
Sbjct: 195 QKKLSAAQSEVVKMDGEIKT-LNSRLSSSIHAR 226


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1269  PF06057       29     0.013    Type IV secretory pathway VirJ component
		>PF06057#Type IV secretory pathway VirJ component

Length = 243

Score = 29.4 bits (66), Expect = 0.013
Identities = 15/68 (22%), Positives = 25/68 (36%), Gaps = 12/68 (17%)

Query: 20 QSMSVPV-----LFYFWSERSQHCLQLTPTLDKLAAEYAGQFILARVDCDAQPMVASQFG 74
Q PV L Y+W ++ +T + +Y +F +V ++ FG
Sbjct: 75 QQQGWPVVGWSSLKYYWKQKDPK--DVTQDTLAIIDKYQAEFGTQKV-----ILIGYSFG 127

Query: 75 LRSIPAVY 82
IP V
Sbjct: 128 AEVIPFVL 135


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1270  DHBDHDRGNASE  81     3e-20    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 80.9 bits (199), Expect = 3e-20
Identities = 51/191 (26%), Positives = 85/191 (44%), Gaps = 7/191 (3%)

Query: 3 KAVLITGCSSGIGLVAAQDLKNRGYRVLAACRKPDDVAKMVQ-LGLEG-----IELDLDD 56
K ITG + GIG A+ L ++G + A P+ + K+V L E D+ D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 57 SASVERAAAQVIELTGGRLYGLFNNGGFGLYGSLHTISRQQLEKQFSTNLFGTHQLTQLL 116
SA+++ A++ G + L N G G +H++S ++ E FS N G ++ +
Sbjct: 69 SAAIDEITARIEREMG-PIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 117 LPAMLPHGEGRIIQTSSVMGLVSTAGRGAYAASKYALEAWSDALRMELQSSGIHVSLIEP 176
M+ G I+ S V AYA+SK A ++ L +EL I +++ P
Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 177 GPISTHFTQNV 187
G T ++
Sbjct: 188 GSTETDMQWSL 198


17  YpAngola_A1338  YpAngola_A1369  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
YpAngola_A1338  2       13       -1.408636   hydrophobic amino acid ABC transporter permease
YpAngola_A1339  1       15       -2.841927   hydrophobic amino acid ABC transporter permease
YpAngola_A1340  1       17       -4.821909   urea ABC transporter urea binding protein
YpAngola_A1341  0       23       -6.637968   hypothetical protein
YpAngola_A1342  2       22       -5.444473   phosphonate ABC transporter permease
YpAngola_A1343  3       21       -5.236357   phosphonate ABC transporter permease
YpAngola_A1344  3       23       -5.451971   phosphonate ABC transporter periplasmic
YpAngola_A1345  4       24       -4.890757   phosphonate ABC transporter ATP-binding protein
YpAngola_A1346  1       17       -3.683230   hypothetical protein
YpAngola_A1347  1       16       -3.482368   hypothetical protein
YpAngola_A1348  1       17       -4.357282   hypothetical protein
YpAngola_A1349  0       17       -4.513532   hypothetical protein
YpAngola_A1350  0       17       -3.432150   adenylate cyclase 2
YpAngola_A1351  1       21       -5.342282   D-lactate dehydrogenase
YpAngola_A1352  1       19       -5.965186   D-alanyl-D-alanine endopeptidase
YpAngola_A1353  -1      25       -6.851060   hypothetical protein
YpAngola_A1354  0       24       -6.282364   hypothetical protein
YpAngola_A1355  -1      20       -2.738405   tRNA-dihydrouridine synthase C
YpAngola_A1356  0       24       -3.438043   hypothetical protein
YpAngola_A1357  1       22       0.565464    IS1541 transposase
YpAngola_A1358  1       21       2.163497    putative DNA-binding prophage protein
YpAngola_A1359  1       20       2.842274    hypothetical protein
YpAngola_A1360  1       20       2.993771    hypothetical protein
YpAngola_A1362  2       22       0.671514    putative prophage protein
YpAngola_A1363  4       23       -5.821561   AlpA family transcriptional regulator
YpAngola_A1364  3       25       -4.734650   putative DNA-binding prophage protein
YpAngola_A1365  4       26       -6.664021   hypothetical protein
YpAngola_A1366  2       25       -7.117431   hypothetical protein
YpAngola_A1367  2       26       -6.946067   hypothetical protein
YpAngola_A1369  1       24       -6.732206   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1346  HTHTETR       27     0.010    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 26.5 bits (58), Expect = 0.010
Identities = 5/41 (12%), Positives = 18/41 (43%), Gaps = 6/41 (14%)

Query: 4 LSWIIFGLIAGILAKWIMP------GEDGGGFIMTIILGII 38
+ I+ G I+G++ W+ ++ ++ ++ +
Sbjct: 163 AAIIMRGYISGLMENWLFAPQSFDLKKEARDYVAILLEMYL 203


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1352  BLACTAMASEA   47     4e-08    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 46.7 bits (111), Expect = 4e-08
Identities = 33/196 (16%), Positives = 67/196 (34%), Gaps = 24/196 (12%)

Query: 5 IRFALLSFLLLSTGISVAPLAIARGSAVEVKGTAPLELASGSAM---VVDLQTNKVIYAN 61
+R+ L + L + +A S ++ E + +DL + + + A
Sbjct: 1 MRYIRLCIISLLATLPLA----VHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAW 56

Query: 62 NADKVVPIASITKLMTAMVVLD----AKLPLDEILSVDIDQTKELKGVFSRVRVNSEISR 117
AD+ P+ S K++ VL L+ + + V S + ++
Sbjct: 57 RADERFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSPV-SEKHLADGMTV 115

Query: 118 KDMLLLTLMSSENRAAASLAHHY--PGGYNAFIKAMNAKAKSL-----GMNSTHYVEPTG 170
++ + S+N AA L P G AF++ + L +N +
Sbjct: 116 GELCAAAITMSDNSAANLLLATVGGPAGLTAFLRQIGDNVTRLDRWETELNEALPGDAR- 174

Query: 171 LSINNVSTARDLAKLL 186
+ +T +A L
Sbjct: 175 ----DTTTPASMAATL 186


18  YpAngola_A1390  YpAngola_A1396  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
YpAngola_A1390  10      18       0.716924    cyd operon protein YbgT
YpAngola_A1391  6       17       0.615001    hypothetical protein
YpAngola_A1393  6       20       0.331084    hypothetical protein
YpAngola_A1394  6       18       0.093585    colicin uptake protein TolQ
YpAngola_A1395  6       14       0.481720    colicin uptake protein TolR
YpAngola_A1396  5       13       0.256621    cell envelope integrity inner membrane protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1393  BINARYTOXINA  25     0.013    Clostridial binary toxin A signature.
		>BINARYTOXINA#Clostridial binary toxin A signature.

Length = 454

Score = 25.0 bits (54), Expect = 0.013
Identities = 13/45 (28%), Positives = 22/45 (48%)

Query: 5 QLLLMKISLAQHFSSRPFIKGNVARMVNHATSIGIFKVDSYRPSK 49
++ + K S + S+ P G ++NH + I KVDSY+
Sbjct: 398 RINIPKDSPGAYLSAIPGYAGEYEVLLNHGSKFKINKVDSYKDGT 442


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1396  IGASERPTASE   60     7e-12    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 60.1 bits (145), Expect = 7e-12
Identities = 29/193 (15%), Positives = 63/193 (32%), Gaps = 4/193 (2%)

Query: 64 YNRQQQQQTDAKRAEQQRQKKAEQQAEELQQKQAAEQQRLKELEKERLQAQEDAK---LA 120
YN + +++ Q E R+ E ++
Sbjct: 981 YNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETV 1040

Query: 121 AEEQKKQVAEQQKQIAEQQKQAAEQQKIAAAAVAKAKEEQKQAETAAAQAKAEADKIVKA 180
AE K++ +K + + A+ +++A A + K + E A + ++ + + +
Sbjct: 1041 AENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTET 1100

Query: 181 QAEAQKKAEAEAKKEAA-VAAAAKKQADADAKKAVEVAEKAAADAAEKKAAADAEKKAAA 239
+ A + E +AK E K + K+ + A+ A + K+ +
Sbjct: 1101 KETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQS 1160

Query: 240 AKKVAAAAEAKKK 252
A E K
Sbjct: 1161 QTNTTADTEQPAK 1173



Score = 52.4 bits (125), Expect = 2e-09
Identities = 22/199 (11%), Positives = 68/199 (34%), Gaps = 5/199 (2%)

Query: 67 QQQQQTDAKRAEQQRQKKAEQQAEELQQKQAAEQQRLKELEKERLQAQ-EDAKLAAEEQK 125
+Q+ ++ EQ + Q E ++ ++ + + E + ++ ++ + ++
Sbjct: 1044 SKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKET 1103

Query: 126 KQVAEQQKQIAEQQKQAAEQQKIAAAAVAKAKEEQKQAETAAAQAKAEADKIVKAQAEAQ 185
V +++K E +K + + + + + E Q + A+ I + Q++
Sbjct: 1104 ATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTN 1163

Query: 186 KKAEAEAKKE----AAVAAAAKKQADADAKKAVEVAEKAAADAAEKKAAADAEKKAAAAK 241
A+ E + + VE E + +++ K
Sbjct: 1164 TTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRH 1223

Query: 242 KVAAAAEAKKKAAAEAAAS 260
+ + + A +++
Sbjct: 1224 RRSVRSVPHNVEPATTSSN 1242



Score = 44.7 bits (105), Expect = 5e-07
Identities = 32/218 (14%), Positives = 65/218 (29%), Gaps = 10/218 (4%)

Query: 47 GEVIDAVMVDPGAVTEQYNRQQQQQTDAKRAEQQRQKKAEQQAEELQQKQAAEQQRLKEL 106
EV + A T+ Q + + ++ A + EE + + + Q
Sbjct: 1066 REVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQ----- 1120

Query: 107 EKERLQAQEDAKLAAEEQKKQVAEQQKQ--IAEQQKQAAEQQKIAAAAVAKAKEEQKQAE 164
E ++ +Q K E + AE ++ K+ Q A AKE E
Sbjct: 1121 EVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVE 1180

Query: 165 TAAAQAKAEADKIVKAQAEAQKKAEAEAKKEAAVAAAAKKQADADAKKAVEVAEKA-AAD 223
++ + E + + + ++ K + + V A
Sbjct: 1181 QPVTESTTV--NTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPAT 1238

Query: 224 AAEKKAAADAEKKAAAAKKVAAAAEAKKKAAAEAAAST 261
+ + A + A ++A+ KA A
Sbjct: 1239 TSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVG 1276


19  YpAngola_A1416  YpAngola_A1426  Y  N  N  Genomic Island
LocusTag        DNBias  CDNBias  %GCBias     Product
YpAngola_A1416  2       20       2.583122    DNA-binding transcriptional regulator ModE
YpAngola_A1417  1       21       3.311679    hypothetical protein
YpAngola_A1418  1       20       4.013939    hypothetical protein
YpAngola_A1419  0       20       3.840510    molybdate transporter periplasmic protein
YpAngola_A1420  0       20       5.092393    molybdate ABC transporter permease
YpAngola_A1421  1       19       5.039585    molybdate transporter ATP-binding protein
YpAngola_A1422  0       18       5.012088    phosphotransferase
YpAngola_A1424  0       18       4.815753    adenosylmethionine-8-amino-7-oxononanoate
YpAngola_A1425  -2      15       3.706630    biotin synthase
YpAngola_A1426  -1      16       3.676187    8-amino-7-oxononanoate synthase
20  YpAngola_A1469  YpAngola_A1477  Y  N  N  Genomic Island
LocusTag        DNBias  CDNBias  %GCBias     Product
YpAngola_A1469  2       21       4.735391    putative DNA circulation protein
YpAngola_A1470  2       16       2.997961    bacteriophage Mu P protein
YpAngola_A1471  3       16       0.296146    phage baseplate assembly protein
YpAngola_A1472  1       13       -0.884918   phage protein GP46
YpAngola_A1473  2       14       -2.349521   baseplate J-like protein
YpAngola_A1475  1       16       -5.371448   putative bacteriophage protein GP48
YpAngola_A1476  1       15       -4.832919   tail collar domain-containing protein
YpAngola_A1477  -1      10       -3.606234   RpiR family transcriptional regulator
21  YpAngola_A1643  YpAngola_A1685  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
YpAngola_A1643  2       15       1.564595    2-oxo-hepta-3-ene-1,7-dioic acid hydratase
YpAngola_A1644  0       12       0.766720    5-carboxymethyl-2-hydroxymuconate
YpAngola_A1645  0       12       0.915314    3,4-dihydroxyphenylacetate 2,3-dioxygenase
YpAngola_A1646  0       12       -0.317736   5-carboxymethyl-2-hydroxymuconate semialdehyde
YpAngola_A1649  -1      14       -1.538732   homoprotocatechuate degradative operon
YpAngola_A1650  1       13       -0.804327   hypothetical protein
YpAngola_A1651  2       14       -1.687029   PTS system mannose/fructose/sorbose family
YpAngola_A1652  2       11       -1.191240   PTS system mannose/fructose/sorbose family
YpAngola_A1654  1       10       -1.973414   hypothetical protein
YpAngola_A1655  0       9        -0.711222   hypothetical protein
YpAngola_A1656  0       10       -0.672073   ferrichrome receptor FcuA
YpAngola_A1657  -2      15       -1.339686   hypothetical protein
YpAngola_A1658  -3      17       -1.885266   putative sensory box-containing diguanylate
YpAngola_A1659  1       21       -2.716785   hypothetical protein
YpAngola_A1660  1       18       -1.995116   23S rRNA methyltransferase A
YpAngola_A1661  2       20       -2.659907   hypothetical protein
YpAngola_A1662  3       19       -3.457745   hypothetical protein
YpAngola_A1663  2       20       -2.927211   hypothetical protein
YpAngola_A1664  2       18       -2.420558   palmitoyl transferase
YpAngola_A1665  -1      17       -2.990725   aromatic amino acid transporter
YpAngola_A1666  0       25       -4.869217   hypothetical protein
YpAngola_A1667  -1      26       -5.937224   hypothetical protein
YpAngola_A1668  -1      17       -3.607614   hypothetical protein
YpAngola_A1669  -1      13       -2.997193   hypothetical protein
YpAngola_A1671  -1      14       -2.852670   hypothetical protein
YpAngola_A1672  -2      13       -3.328164   hypothetical protein
YpAngola_A1673  -3      14       -1.419190   hypothetical protein
YpAngola_A1674  -3      14       -1.254758   ABC transporter permease/ATP-binding protein
YpAngola_A1675  -2      15       -1.503778   hypothetical protein
YpAngola_A1676  -1      11       -1.848028   YCII domain-containing protein
YpAngola_A1680  -1      12       -2.245576   *galactoside permease
YpAngola_A1681  1       13       -1.491089   alpha-galactosidase
YpAngola_A1682  3       16       -2.702045   hypothetical protein
YpAngola_A1683  2       17       -2.205782   hypothetical protein
YpAngola_A1684  1       17       -2.310602   4'-phosphopantetheinyl transferase
YpAngola_A1685  1       17       -4.030008   TRAP transporter solute receptor DctP family
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1680  TCRTETA       44     1e-06    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 43.7 bits (103), Expect = 1e-06
Identities = 65/297 (21%), Positives = 108/297 (36%), Gaps = 38/297 (12%)

Query: 35 PFFPIWLHDI--NNLSKTDTGIVFGSISLFALAFQPIMGPLSDKLGLRKTLMWIIVGLLV 92
P P L D+ +N GI+ +L A P++G LSD+ G R L +V L
Sbjct: 26 PVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVL---LVSLAG 82

Query: 93 LFAPFFIYVFSPLLKYNIFIGAIVGGCYLGFVFTGGSHAI-EAYIEKVSRHSNFEYGRVR 151
+ I +P L + ++IG IV G TG + A+ AYI ++ R R
Sbjct: 83 AAVDYAIMATAPFL-WVLYIGRIVAG------ITGATGAVAGAYIADITDGDE----RAR 131

Query: 152 MFG----CIGWALCATVV--GILYTVNNQLIFWMASGCALILAVLLFFARPDRQSTAFVV 205
FG C G+ + A V G++ + F+ A+ + + F P+
Sbjct: 132 HFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKG---- 187

Query: 206 DTLGANKAVFNLKNALAL---LRKRELWFFVMYIVGVACIYDVFDQQFANFFTSFFATK- 261
+ + N + + V +I+ + Q A + F +
Sbjct: 188 ERRPLRREALNPLASFRWARGMTVVAALMAVFFIM------QLVGQVPAALWVIFGEDRF 241

Query: 262 QQGTEIFGFVTTGGEILNATV-MFFAPVIIARIGSKNALLLAGTIMSVRILGSAFAT 317
G IL++ + AR+G + AL+L + AFAT
Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFAT 298


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1684  ENTSNTHTASED  99     2e-27    Enterobactin synthetase component D signature.
		>ENTSNTHTASED#Enterobactin synthetase component D signature.

Length = 234

Score = 99.3 bits (247), Expect = 2e-27
Identities = 70/196 (35%), Positives = 96/196 (48%), Gaps = 22/196 (11%)

Query: 24 PFHG-LLAKCDFEVNEYR--DELFAAYGIPFPGSLNKAVIKRRAEYLAGRFVARQVLNLL 80
PF G L DF+ + +R D L+ +P L A KR+AE+LAGR A L +
Sbjct: 9 PFAGHRLHIVDFDASSFREHDLLW----LPHHDRLRSAGRKRKAEHLAGRIAAVHALREV 64

Query: 81 DIRDYPLATGMDRAPQWPTNLIGSISHNNQRALCAAQMIEPRGVESSTLHGIGLDIESHI 140
+R P G R P WP L GSISH CA + + IG+DIE +
Sbjct: 65 GVRTVP-GMGDKRQPLWPDGLFGSISH------CAT-----TALAVISRQRIGIDIEKIM 112

Query: 141 AEEKAQEIWSGIISDEEYSLLQQGPLPFNQALTLVFSAKESLFKAVYPQSGRYFDFIEAR 200
++ A E+ II +E +LQ LPF ALTL FSAKES++KA + F A+
Sbjct: 113 SQHTATELAPSIIDSDERQILQASLLPFPLALTLAFSAKESVYKA-FSDRVTLPGFNSAK 171

Query: 201 LLSYSLVSGNFELQLL 216
+ S + + + L LL
Sbjct: 172 VTSLT--ATHISLHLL 185


22  YpAngola_A1706  YpAngola_A1716  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
YpAngola_A1706  -1      13       -3.488276   N-methyltryptophan oxidase
YpAngola_A1707  -1      14       -4.196794   LuxR family transcriptional regulator
YpAngola_A1708  -1      15       -2.885402   hypothetical protein
YpAngola_A1709  -2      15       -4.197806   putative transport protein
YpAngola_A1710  -2      19       -7.049901   hypothetical protein
YpAngola_A1711  0       23       -8.372151   hypothetical protein
YpAngola_A1712  1       22       -6.108088   hypothetical protein
YpAngola_A1714  1       22       -6.555455   hypothetical protein
YpAngola_A1716  1       20       -5.056614   N-acylhomoserine lactone synthase YpsI
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1716  AUTOINDCRSYN  301    e-107    Autoinducer synthesis protein signature.
		>AUTOINDCRSYN#Autoinducer synthesis protein signature.

Length = 216

Score = 301 bits (773), Expect = e-107
Identities = 114/211 (54%), Positives = 156/211 (73%)

Query: 5 MLKVFNVNFDRMSENKLDEIFTLRKITFKDRLDWKVTCIDGKESDQYDDENTNYLLGTID 64
ML++F+VN +SE K E+FTLRK TFKDRL+W V C DG E DQYD+ NT YL G D
Sbjct: 1 MLEIFDVNHTLLSETKSGELFTLRKETFKDRLNWAVQCTDGMEFDQYDNNNTTYLFGIKD 60

Query: 65 DTLVCSVRFVEMQYPTMITGPFAPYFRDLDLPIDGFIESSRFFVEKALARDKLGNNGSLS 124
+T++CS+RF+E +YP MITG F PYF+++++P ++ESSRFFV+K+ A+D LGN +S
Sbjct: 61 NTVICSLRFIETKYPNMITGTFFPYFKEINIPEGNYLESSRFFVDKSRAKDILGNEYPIS 120

Query: 125 AILFLSMVNYARNRGYKGILTVVSRGMYTILKRSGWGITVINQGESEKNEVIYLLHLSID 184
++LFLSM+NY++++GY GI T+VS M TILKRSGWGI V+ QG SEK E +YL+ L +D
Sbjct: 121 SMLFLSMINYSKDKGYDGIYTIVSHPMLTILKRSGWGIRVVEQGLSEKEERVYLVFLPVD 180

Query: 185 SNSQQQLIRKIQRVHNIDTHTLASWPLVVPS 215
+Q+ L R+I R ++ L WPL VP+
Sbjct: 181 DENQEALARRINRSGTFMSNELKQWPLRVPA 211


23  YpAngola_A1730  YpAngola_A1763  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
YpAngola_A1730  3       14       -3.242451   hypothetical protein
YpAngola_A1731  4       15       -3.178677   hypothetical protein
YpAngola_A1732  3       15       -2.569390   hypothetical protein
YpAngola_A1733  2       14       -3.249934   hypothetical protein
YpAngola_A1734  2       15       -4.428184   carbohydrate ABC transporter permease
YpAngola_A1735  2       16       -4.342310   carbohydrate ABC transporter permease
YpAngola_A1737  2       17       -3.965636   LacI family transcriptional regulator
YpAngola_A1738  1       19       -4.833304   phosphomannomutase
YpAngola_A1739  2       25       -7.281921   carbohydrate ABC transporter ATP-binding
YpAngola_A1740  3       25       -1.577626   hypothetical protein
YpAngola_A1741  3       24       -1.749644   hypothetical protein
YpAngola_A1743  2       25       -3.103496   hypothetical protein
YpAngola_A1744  -2      24       5.313950    hypothetical protein
YpAngola_A1745  -2      23       4.908987    hypothetical protein
YpAngola_A1746  -2      22       4.639365    hypothetical protein
YpAngola_A1749  -2      21       4.260671    hypothetical protein
YpAngola_A1748  -2      21       4.311938    hypothetical protein
YpAngola_A1750  -2      21       4.861353    hemagglutination activity domain-containing
YpAngola_A1752  -2      12       -0.072987   oxidoreductase
YpAngola_A1753  -1      12       -1.143129   Rieske (2Fe-2S) domain-containing protein
YpAngola_A1754  1       13       -0.365901   putative transporter
YpAngola_A1755  1       15       -0.407789   hypothetical protein
YpAngola_A1756  1       14       -0.679843   tartrate dehydrogenase
YpAngola_A1757  1       13       -0.778466   LysR family substrate-binding transcriptional
YpAngola_A1758  1       14       -3.036834   LacI family sugar-binding transcriptional
YpAngola_A1759  1       14       -2.449781   ribose ABC transporter permease
YpAngola_A1760  0       17       -4.409447   ribose ABC transporter ATP-binding protein
YpAngola_A1761  0       17       -5.435931   sugar binding protein
YpAngola_A1762  1       17       -5.445448   oxidoreductase, zinc-binding dehydrogenase
YpAngola_A1763  1       21       -5.343687   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1731  ENTEROVIROMP  29     0.017    Enterobacterial virulence outer membrane protein si...
		>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature.

Length = 171

Score = 29.1 bits (65), Expect = 0.017
Identities = 19/76 (25%), Positives = 29/76 (38%), Gaps = 5/76 (6%)

Query: 176 GYEKNRTTGGDNNIGGDGYGFRPYYRYQVSDR-LSVNTDVKMLVEDKDARGADNGRFQFY 234
GY ++ G N +GG F YRY+ + L V + + A D + Q+Y
Sbjct: 31 GYAQSDAQGQMNKMGG----FNLKYRYEEDNSPLGVIGSFTYTEKSRTASSGDYNKNQYY 86

Query: 235 EALVNVNYRVADNVHA 250
YR+ D
Sbjct: 87 GITAGPAYRINDWASI 102


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1739  PF05272       34     7e-04    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 34.3 bits (78), Expect = 7e-04
Identities = 16/49 (32%), Positives = 23/49 (46%), Gaps = 1/49 (2%)

Query: 32 IVLVGPSGCGKSTLLRMIAGLEDVNSGEIKI-EDKDVTQTNAGARGVSM 79
+VL G G GKSTL+ + GL+ + I KD + AG +
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYEL 647


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1750  PF05860       83     1e-20    haemagglutination activity domain.
		>PF05860#haemagglutination activity domain.

Length = 117

Score = 82.9 bits (205), Expect = 1e-20
Identities = 21/141 (14%), Positives = 41/141 (29%), Gaps = 24/141 (17%)

Query: 68 AAIVADASAPGNQQPTIINSANGTPQVNIQAPSSGGVSRNVYSQFDVDGRGVILNNGHGV 127
A I D + P N + I + T + + + + + +F V G N
Sbjct: 1 AQITPDTTLPIN---SNITTEGNTRIIERGTQAGSNLFHS-FQEFSVPTSGTAFFN---- 52

Query: 128 NQTELGGFIDGNPWLARGEASIILNEVNSRDPSKLNGYIEVAGRKAQVVIANSAGITCEG 187
I++ V S ++G I A + + N GI
Sbjct: 53 ---------------NPTNIQNIISRVTGGSVSNIDGLIRANAT-ANLFLINPNGIIFGQ 96

Query: 188 CGFINANRVTLTTGQAQLNNG 208
++ + + +L
Sbjct: 97 NARLDIGGSFVGSTANRLKFA 117


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1761  FLGHOOKAP1    29     0.024    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 29.2 bits (65), Expect = 0.024
Identities = 11/60 (18%), Positives = 23/60 (38%), Gaps = 4/60 (6%)

Query: 40 NDYFVSMKEALEQAANDIGAKVYIADAGHDVSKQINDVED---MLQKKIDILLINPTDSV 96
D+F S++ L A D A+ + + Q + K+++I + D +
Sbjct: 110 QDFFTSLQT-LVSNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQI 168


24  YpAngola_A1777  YpAngola_A1784  Y  N  N  Genomic Island
LocusTag        DNBias  CDNBias  %GCBias     Product
YpAngola_A1777  0       16       4.316228    putative SAM-dependent methyltransferase
YpAngola_A1779  0       15       4.929075    hypothetical protein
YpAngola_A1778  0       16       5.447807    hypothetical protein
YpAngola_A1780  0       15       6.533239    glycosyl transferase family protein
YpAngola_A1781  0       14       6.267308    O-succinylbenzoic acid--CoA ligase
YpAngola_A1782  0       14       4.743344    O-succinylbenzoate synthase
YpAngola_A1783  0       13       3.855209    naphthoate synthase
YpAngola_A1784  0       14       3.390274    acyl-CoA thioester hydrolase YfbB
25  YpAngola_A1798  YpAngola_A1811  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
YpAngola_A1798  1       18       -3.682980   putative lipoprotein
YpAngola_A1799  -1      14       -2.632256   hypothetical protein
YpAngola_A1800  -1      13       -1.136536   gluconate 5-dehydrogenase
YpAngola_A1801  -1      17       0.918772    thermosensitive gluconokinase
YpAngola_A1802  0       18       1.072623    hypothetical protein
YpAngola_A1803  0       19       1.247824    hypothetical protein
YpAngola_A1804  1       26       4.145788    NADH dehydrogenase subunit N
YpAngola_A1805  -1      27       3.720918    NADH dehydrogenase subunit M
YpAngola_A1806  0       27       4.623120    NADH dehydrogenase subunit L
YpAngola_A1807  0       25       4.507130    NADH dehydrogenase subunit K
YpAngola_A1808  1       25       4.004145    NADH dehydrogenase subunit J
YpAngola_A1809  0       24       3.871592    NADH dehydrogenase subunit I
YpAngola_A1810  0       22       3.762064    NADH dehydrogenase subunit H
YpAngola_A1811  0       17       3.205335    NADH dehydrogenase subunit G
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1800  DHBDHDRGNASE  144    3e-44    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 144 bits (363), Expect = 3e-44
Identities = 84/260 (32%), Positives = 137/260 (52%), Gaps = 8/260 (3%)

Query: 3 NLFSLENRKVLITGSAQGIGFLLAKGLAEFGAEIIINDITAERAEKAVAELRASGFIAHA 62
N +E + ITG+AQGIG +A+ LA GA I D E+ EK V+ L+A A A
Sbjct: 2 NAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEA 61

Query: 63 AAFNVTNHDEVNEAIEKIESHIGAIDVLINNAGIQRRHAFTEFPEKDWDDVIAVNQKSVF 122
+V + ++E +IE +G ID+L+N AG+ R +++W+ +VN VF
Sbjct: 62 FPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVF 121

Query: 123 LVSQAVARYMVKRQRGKIINICSMQSELGRDTITPYAASKGAVKMLTRGMCVELARYNIQ 182
S++V++YM+ R+ G I+ + S + + R ++ YA+SK A M T+ + +ELA YNI+
Sbjct: 122 NASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIR 181

Query: 183 VNGIAPGYFKTDMTKALVDDQ--------AFTDWLCKRTPAARWGDPEELIGAAVYLSSK 234
N ++PG +TDM +L D+ + P + P ++ A ++L S
Sbjct: 182 CNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSG 241

Query: 235 ASDFVNGHLLFVDGGMLVAV 254
+ + H L VDGG + V
Sbjct: 242 QAGHITMHNLCVDGGATLGV 261


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1808  TYPE3OMBPROT  29     0.011    Type III secretion system outer membrane B protein ...
		>TYPE3OMBPROT#Type III secretion system outer membrane B protein family signature.

Length = 538

Score = 28.9 bits (64), Expect = 0.011
Identities = 17/45 (37%), Positives = 23/45 (51%)

Query: 102 LLSVLIYAISSVSDQGISGEMVDAKAVGISLFGPYVLAVELASML 146
L+S +Y+ + Q +SG+ VD K V SL P L SML
Sbjct: 251 LVSAALYSRPELLSQALSGKTVDLKIVSTSLLTPTSLTGGEESML 295


26  YpAngola_A1855  YpAngola_A1885  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
YpAngola_A1855  0       18       -3.265656   lipoyl synthase
YpAngola_A1856  1       21       -8.284668   twin arginine translocase protein A
YpAngola_A1857  2       21       -9.443607   camphor resistance protein CrcB
YpAngola_A1858  3       22       -10.288000  cold shock protein CspE
YpAngola_A1859  3       22       -10.764222  cold-shock DNA-binding domain-containing
YpAngola_A1860  2       21       -10.210823  hypothetical protein
YpAngola_A1861  1       15       -6.901961   LuxR family transcriptional regulator
YpAngola_A1862  2       16       -5.679785   hypothetical protein
YpAngola_A1863  1       20       -2.344790   hypothetical protein
YpAngola_A1864  0       19       -0.666679   hypothetical protein
YpAngola_A1865  0       20       2.301066    antibiotic biosynthesis monooxygenase
YpAngola_A1866  0       20       3.631985    heavy metal ABC transporter (HMT) family
YpAngola_A1867  0       24       5.741313    hypothetical protein
YpAngola_A1868  -1      23       6.053591    hypothetical protein
YpAngola_A1869  0       23       6.032533    AP endonuclease
YpAngola_A1870  -1      25       6.674103    PfkB family kinase
YpAngola_A1871  0       25       6.660588    hypothetical protein
YpAngola_A1872  -1      25       6.569795    sugar transport system permease
YpAngola_A1873  -1      23       6.056884    sugar transport ATP-binding protein
YpAngola_A1874  -2      17       3.660550    ribose ABC transporter periplasmic
YpAngola_A1876  -2      16       3.622256    thiamine pyrophosphate-dependent enzyme
YpAngola_A1877  -2      15       2.836678    methylmalonate-semialdehyde dehydrogenase
YpAngola_A1878  -1      14       1.261716    RpiR family transcriptional regulator
YpAngola_A1879  0       14       -0.111566   hypothetical protein
YpAngola_A1880  0       14       -0.336618   alpha-2-macroglobulin domain-containing protein
YpAngola_A1881  3       19       -3.272153   penicillin-binding protein 1C
YpAngola_A1882  3       25       -7.001120   sugar fermentation stimulation protein B
YpAngola_A1884  3       24       -6.285130   hypothetical protein
YpAngola_A1885  0       19       -4.486966   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1867  PF03627       26     0.024    PapG
		>PF03627#PapG

Length = 336

Score = 26.1 bits (57), Expect = 0.024
Identities = 10/46 (21%), Positives = 20/46 (43%)

Query: 5 SNNSRAHCSKPFLYRQNQWHFNQAISEYRLPAPLSAQDLTDSVNHI 50
+ ++ C KP + F+ I + LPA L D + ++ +
Sbjct: 131 AFDAGNLCQKPGETTRLTEKFDDIIFKVALPADLPLGDYSVTIPYT 176


27  YpAngola_A1941  YpAngola_A1950  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
YpAngola_A1941  2       22       -0.593627   hypothetical protein
YpAngola_A1942  0       20       -0.747299   3-hydroxydecanoyl-ACP dehydratase
YpAngola_A1943  1       19       -0.373787   hypothetical protein
YpAngola_A1944  0       19       -1.378708   hypothetical protein
YpAngola_A1945  -1      17       -2.184576   IS1541 transposase
YpAngola_A1946  0       16       -3.764455   formate acetyltransferase 1
YpAngola_A1947  0       16       -4.314109   formate transporter
YpAngola_A1948  0       14       -3.305313   hypothetical protein
YpAngola_A1949  1       18       -3.785067   L-asparaginase II
YpAngola_A1950  1       20       -3.402895   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1950  OMADHESIN     112    9e-30    Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 112 bits (281), Expect = 9e-30
Identities = 101/341 (29%), Positives = 171/341 (50%), Gaps = 23/341 (6%)

Query: 40 TAVGNNNSLGGSTNGVVVGNGGSLSNSINGVVIG-NGSVSDGDGVSVGGGTSTNG----G 94
+AV + +GV +G S S++ GV +G N + V++G +
Sbjct: 113 SAVTYGAASTAQKDGVAIGARASTSDT--GVAVGFNSKADAKNSVAIGHSSHVAANHGYS 170

Query: 95 IAIGSGSNATRSDEMNIG----DRQITGVKAGVADTDAANVGQL-----------VAKAG 139
IAIG S R + ++IG +RQ+T + AG DTDA NV QL ++
Sbjct: 171 IAIGDRSKTDRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQLKKEIEKTQENTNKRSA 230

Query: 140 ETLNSANIYVDNQATETLNNANIYTDNKATETINNANTYTDNKSSETLNSANSYTDNKSS 199
E L +AN Y DN+++ L AN YTD+K+ ET+ NA +S + LN A +++++ +
Sbjct: 231 ELLANANAYADNKSSSVLGIANNYTDSKSAETLENARKEAFAQSKDVLNMAKAHSNSVAR 290

Query: 200 ETLNSANTYTDSKTAEIFNTTKTYMDGKSKETLNNTYDYVDSKVSSIVYDVNSYTDKTVN 259
TL +A + +S T + + + KS E L + Y DSK S + NSYTD TV+
Sbjct: 291 TTLETAEEHANSVARTTLETAEEHANKKSAEALASANVYADSKSSHTLKTANSYTDVTVS 350

Query: 260 TAFETSLSDAKSYVDDKYNQLSDKVNKNFNKTNAGISGAMAMSGIPQKFGYEK-SFGMAI 318
+ + ++ ++ Y D K+ QL ++++K + + G++ + A++ + Q +G K +F +
Sbjct: 351 NSTKKAIRESNQYTDHKFRQLDNRLDKLDTRVDKGLASSAALNSLFQPYGVGKVNFTAGV 410

Query: 319 GAYRGQSALAVGGDWNINHKTITRVNVSADTEGGVGVAAGF 359
G YR ALA+G + +N + V+ V A F
Sbjct: 411 GGYRSSQALAIGSGYRVNENVALKAGVAYAGSSDVMYNASF 451


28  YpAngola_A1994  YpAngola_A2023  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
YpAngola_A1994  2       14       2.137916    invasin domain-containing protein
YpAngola_A1996  2       16       2.522018    anti-sigma-28 factor FlgM
YpAngola_A1997  1       16       3.790282    flagellar basal body P-ring biosynthesis protein
YpAngola_A1998  2       18       3.784086    flagellar basal body rod protein FlgB
YpAngola_A1999  3       21       4.442013    flagellar basal body rod protein FlgC
YpAngola_A2000  3       19       4.557962    flagellar basal body rod modification protein
YpAngola_A2001  1       19       3.750276    flagellar hook protein FlgE
YpAngola_A2002  2       19       3.751354    flagellar basal body rod protein FlgF
YpAngola_A2003  2       18       3.145428    flagellar basal-body rod protein FlgG
YpAngola_A2005  1       17       2.861665    flagellar basal body P-ring protein
YpAngola_A2006  -1      17       3.055657    flagellar rod assembly protein/muramidase FlgJ
YpAngola_A2007  -1      17       3.009571    flagellar hook-associated protein FlgK
YpAngola_A2009  0       15       3.651040    flagellar hook-associated protein 3
YpAngola_A2008  -1      16       4.227620    flagellar hook-length control protein FliK
YpAngola_A2010  -1      17       4.135288    flagellar biosynthesis chaperone
YpAngola_A2011  -1      17       4.110604    flagellum-specific ATP synthase
YpAngola_A2012  0       15       2.852669    flagellar assembly protein H
YpAngola_A2013  0       14       0.835683    flagellar motor switch protein G
YpAngola_A2014  0       17       -0.401061   flagellar MS-ring protein
YpAngola_A2015  3       21       -3.758062   flagellar hook-basal body protein FliE
YpAngola_A2016  2       20       -3.600558   base excision DNA repair protein
YpAngola_A2018  0       17       -3.893935   hypothetical protein
YpAngola_A2019  0       17       -2.465789   AraC family transcriptional regulator
YpAngola_A2020  2       17       -0.371117   flagellar biosynthesis protein FliT
YpAngola_A2021  3       19       -0.281263   flagellar protein FliS
YpAngola_A2022  4       17       0.477870    flagellar capping protein
YpAngola_A2023  5       22       0.514706    flagellin
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1994  INTIMIN       432    e-142    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 432 bits (1113), Expect = e-142
Identities = 128/415 (30%), Positives = 198/415 (47%), Gaps = 31/415 (7%)

Query: 7 FGKDNLQRNPYAVTAGINYTPVPLLTVGVDQRMGKSSKHETQWNLQMNYRLGESFQSQLS 66
F D LQ NP A T G+NYTP+PL+T+G+D R G ++++ +++Q Y+ + + Q+
Sbjct: 361 FNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNENDLLYSMQFRYQFDKPWSQQIE 420

Query: 67 PSAVAGTRLLAESRYNLVDRNNNIVLEYQKQQVVKLTLSPATISGLPGQVYQVNAQVQGA 126
P V R L+ SRY+LV RNNNI+LEY+KQ ++ L + P I+G ++ V+
Sbjct: 421 PQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNI-PHDINGTERSTQKIQLIVKSK 479

Query: 127 SAVREIVWSDAELIAAGGTLT---PLSTTQFNLVLPPYKRTAQVSRVTDDLTANFYSLSA 183
+ IVW D+ L + GG + S + +LP Y + +N Y ++A
Sbjct: 480 YGLDRIVWDDSALRSQGGQIQHSGSQSAQDYQAILPAYVQGG----------SNVYKVTA 529

Query: 184 LAVDHQGNRSNSFTLSVTVQQPQLTLTAAVIGD------GAPASGKTAITVEFTVADFEG 237
A D GN SN+ L++TV + + D A A G AIT TV G
Sbjct: 530 RAYDRNGNSSNNVLLTITVLSNGQVVDQVGVTDFTADKTSAKADGTEAITYTATVKK-NG 588

Query: 238 KPLAGQEVVITTNNG-ALPNKITEKTDANGVARIALTNTTDGVTVVTAEVEGQRQSVDTH 296
A V +G A+ + + T+ +G A + L + G VV+A+ +++ +
Sbjct: 589 VAQANVPVSFNIVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNAN 648

Query: 297 ---FVKGTIAADKSTLAAVPTSIIADGLMASTITLELKDTYGD-PQAGANVAFDTTLGNM 352
FV T A + + A T+ +A+G IT +K GD P + V F TTLG +
Sbjct: 649 AVIFVDQT-KASITEIKADKTTAVANG--QDAITYTVKVMKGDKPVSNQEVTFTTTLGKL 705

Query: 353 GVITDHN--DGTYSAPLTSTTLGVATVTVKVDGAAFSVPSVTVNFTADPIPDAGR 405
T+ +G LTSTT G + V+ +V A V + V F D G
Sbjct: 706 SNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGN 760



Score = 150 bits (381), Expect = 6e-40
Identities = 70/363 (19%), Positives = 128/363 (35%), Gaps = 27/363 (7%)

Query: 309 LAAVPTSIIADGLMASTITLELKDTYGDPQAGANVAFDTTLG----NMGVITDHNDGTYS 364
A TS ADG A T T +K G QA V+F+ G + + G +
Sbjct: 563 FTADKTSAKADGTEAITYTATVKKN-GVAQANVPVSFNIVSGTAVLSANSANTNGSGKAT 621

Query: 365 APLTSTTLGVATVTVKVDGAAFSVPSVTVNFTADPIPDAGRSSFTVSTPDILADGTMSST 424
L S G V+ K ++ + V F + S + A
Sbjct: 622 VTLKSDKPGQVVVSAKTAEMTSALNANAVIF----VDQTKASITEIKADKTTAVANGQDA 677

Query: 425 LSFVPVDKNGHFISGMQGLSFTQNGVPVSISPITEQPDSY-TATVVGNTAGDVTITPQVD 483
+++ G Q ++FT +S S + Y T+ T G ++ +V
Sbjct: 678 ITYTVKVMKGDKPVSNQEVTFTTTLGKLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVS 737

Query: 484 TLILSTLQKKISLFPVPTLTGILVNGQNFATDKGFPKTIFKNATFQLQMDNDVANNTQYE 543
+ + ++ F T+ + P + L+ N +Y
Sbjct: 738 DVAVDVKAPEVEFFTTLTIDDGNIEIVGTGVKGKLPTVWLQYGQVNLKAS---GGNGKYT 794

Query: 544 WSSSFTPNVSVN-DQGQVTITYQTYSEVAVTAKSKKFPSYSVSYRFYPNRWIYDGGTSLV 602
W S+ SV+ GQVT+ + + ++V + + +Y+++ PN I + V
Sbjct: 795 WRSANPAIASVDASSGQVTLKEKGTTTISVISSDNQTATYTIA---TPNSLIVPNMSKRV 851

Query: 603 SSIEASRQCQGSDMSAVLESSRATNGTRAPDGTLWGEWGSLTAYS--SDWQSGEYWVKRT 660
+ +A C+ + L SS+ ++ WG+ Y Q+ WV++T
Sbjct: 852 TYNDAVNTCK--NFGGKLPSSQNEL------ENVFKAWGAANKYEYYKSSQTIISWVQQT 903

Query: 661 STD 663
+ D
Sbjct: 904 AQD 906


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2000  SYCECHAPRONE  29     0.008    Gram-negative bacterial type III secretion SycE cha...
		>SYCECHAPRONE#Gram-negative bacterial type III secretion SycE chaperone signature.

Length = 130

Score = 28.9 bits (64), Expect = 0.008
Identities = 15/34 (44%), Positives = 21/34 (61%), Gaps = 2/34 (5%)

Query: 40 LKNQDPTNPMENNELTTQLAQINTVSGIEKLNTT 73
L N+ P N ++NN L TQL + V G E+L T+
Sbjct: 89 LWNRQPLNSLDNNSLYTQLEML--VQGAERLQTS 120


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2001  FLGHOOKAP1    45     3e-07    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 45.3 bits (107), Expect = 3e-07
Identities = 22/87 (25%), Positives = 42/87 (48%), Gaps = 8/87 (9%)

Query: 6 AVSGMNAASSNLDVIGNNIANSATSGFKAGSVSFAD----MFAGSQTGMGVKVAGITQDF 61
A+SG+NAA + L+ NNI++ +G+ + A + AG G GV V+G+ +++
Sbjct: 7 AMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGVQREY 66

Query: 62 NDGTATTTNRRLDLAISQNGFFRMQDS 88
+ +L A +Q+ +
Sbjct: 67 DA----FITNQLRAAQTQSSGLTARYE 89



Score = 40.7 bits (95), Expect = 9e-06
Identities = 15/49 (30%), Positives = 28/49 (57%)

Query: 380 TLTSGALESSNVDLSKELVNMIVAQRNYQSNAQTIKTQDQILQTLVSLR 428
L++ S V+L +E N+ Q+ Y +NAQ ++T + I L+++R
Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2003  FLGHOOKAP1    41     2e-06    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 41.5 bits (97), Expect = 2e-06
Identities = 11/41 (26%), Positives = 22/41 (53%)

Query: 192 ETSNVNVAEELVNMIQTQRAYEINSKAVSTSDQMLQKLAQL 232
S VN+ EE N+ + Q+ Y N++ + T++ + L +
Sbjct: 505 SISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2005  FLGPRINGFLGI  391    e-138    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 391 bits (1007), Expect = e-138
Identities = 155/366 (42%), Positives = 217/366 (59%), Gaps = 9/366 (2%)

Query: 5 SLVTLLMVLLSLVWLPASAERIRDLVTVQGVRDNALIGYGLVVGLDGSGDQTMQTPFTTQ 64
+LV + LS A RI+D+ ++Q RDN LIGYGLVVGL G+GD +PFT Q
Sbjct: 10 ALVFSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQ 69

Query: 65 SLSNMLSQLGITVPPGTNMQLKNVAAVMVTAKLPAFSRAGQTIDVVVSSMGNAKSIRGGT 124
S+ ML LGIT G + KN+AAVMVTA LP F+ G +DV VSS+G+A S+RGG
Sbjct: 70 SMRAMLQNLGITTQGGQS-NAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGN 128

Query: 125 LLMTPLKGVDNQVYALAQGNVLVGGAGAAAGGSSVQVNQLAGGRISNGATIERELPTTFG 184
L+MT L G D Q+YA+AQG ++V G A +++ R+ NGA IERELP+ F
Sbjct: 129 LIMTSLSGADGQIYAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAIIERELPSKFK 188

Query: 185 TDGIINLQLNSEDFTLAQQVSDAINR----QRGFGSATAIDARTIQVLVPRGGSSQVRFL 240
+ LQL + DF+ A +V+D +N + G A D++ I V PR + R +
Sbjct: 189 DSVNLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPR-VADLTRLM 247

Query: 241 ADIQNIPINVDPGDAKVIINSRTGSVVMNRNVVLDSCAVAQGNLSVVVDKQNIVSQPDTP 300
A+I+N+ + D AKV+IN RTG++V+ +V + AV+ G L+V V + V QP P
Sbjct: 248 AEIENLTVETD-TPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQP-AP 305

Query: 301 FGGGQTVVTPNTQISVQQQGGVLQRVNASPNLNNVVRALNSLGATPIDLMSILQAMESAG 360
F GQT V P T I Q+G + V P+L +V LNS+G +++ILQ ++SAG
Sbjct: 306 FSRGQTAVQPQTDIMAMQEGSKVAIVE-GPDLRTLVAGLNSIGLKADGIIAILQGIKSAG 364

Query: 361 CLRAKL 366
L+A+L
Sbjct: 365 ALQAEL 370


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2006  FLGFLGJ       310    e-108    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 310 bits (796), Expect = e-108
Identities = 179/316 (56%), Positives = 232/316 (73%), Gaps = 6/316 (1%)

Query: 1 MSDLLAMSGAAYDARSLEALKRDAARDPEGNLKQVAQQVEGMFVQMMLKSMRAALPQDGV 60
+SD ++ AA+DA+SL LK A DP N++ VA+QVEGMFVQMMLKSMR ALP+DG+
Sbjct: 2 ISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDGL 61

Query: 61 MNSEQTKLYTSLYDQQIAQQMSA-KGLGLADMMVEQLS-GSTSASETAGTVPMMLDNEVL 118
+SE T+LYTS+YDQQIAQQM+A KGLGLA+MMV+Q++ E+ PM E +
Sbjct: 62 FSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLETV 121

Query: 119 QSMPAQALAQVMRRAIPTPPSSSMAAISPGNGNFVARMSIPAQIASQQSGIPHQLIMAQA 178
QAL+Q++++A+P S+ S F+A++S+PAQ+ASQQSG+PH LI+AQA
Sbjct: 122 VRYQNQALSQLVQKAVPRNYDDSLPGDSK---AFLAQLSLPAQLASQQSGVPHHLILAQA 178

Query: 179 ALESGWGQREIPTADGKSSYNVFGIKAGSSWNGPVSEITATEYEQGVAKKTKARFRVYGS 238
ALESGWGQR+I +G+ SYN+FG+KA +W GPV+EIT TEYE G AKK KA+FRVY S
Sbjct: 179 ALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSS 238

Query: 239 YVEAVSDYVKLLTQNPRYAHVAAAQSPEQGAHALQKAGYATDPQYAQKLVSVIQQMRSTG 298
Y+EA+SDYV LLT+NPRYA V A S EQGA ALQ AGYATDP YA+KL ++IQQM+S
Sbjct: 239 YLEALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSIS 298

Query: 299 EQAVKAYGGSDLSQLF 314
++ K Y ++ LF
Sbjct: 299 DKVSKTY-SMNIDNLF 313


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2007  FLGHOOKAP1    436    e-150    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 436 bits (1123), Expect = e-150
Identities = 315/552 (57%), Positives = 398/552 (72%), Gaps = 9/552 (1%)

Query: 3 NSLMNTAMSGLNAAQYALSTVSNNITNFQVAGYNRQNTVFAQNGGTITSAGFIGNGVTVT 62
+SL+N AMSGLNAAQ AL+T SNNI+++ VAGY RQ T+ AQ T+ + G++GNGV V+
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 60

Query: 63 GVNREYNAFITNQLRASQTQSSGLATYYQQISQIDNLLSNASNNLSTTMQDFFSNLQNLV 122
GV REY+AFITNQLRA+QTQSSGL Y+Q+S+IDN+LS ++++L+T MQDFF++LQ LV
Sbjct: 61 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 120

Query: 123 SNADDDAARKTVLGKAEGLVNQFQNADKYLRDMDDGVNQKITDSATQINNYAEQIAKLND 182
SNA+D AAR+ ++GK+EGLVNQF+ D+YLRD D VN I S QINNYA+QIA LND
Sbjct: 121 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 180

Query: 183 QITRLRG-SSGSEPNALLDQRDQLVTELNQIMAVTVTQQDGDAYNVSFAGGLSLVQGPNA 241
QI+RL G +G+ PN LLDQRDQLV+ELNQI+ V V+ QDG YN++ A G SLVQG A
Sbjct: 181 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA 240

Query: 242 YKVEAIPSSADATRLTLGYKRGNGEATEVDESRITTGSLGGTLKFRSEALDSARNQLGQL 301
++ A+PSSAD +R T+ Y G E+ E + TGSLGG L FRS+ LD RN LGQL
Sbjct: 241 RQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL 300

Query: 302 ALVMADSFNTQHNAGFDINGDEGEDFFSFADPTVLKNAKNQGNASITVEYKDTSKVKASD 361
AL A++FNTQH AGFD NGD GEDFF+ P VL+N KN+G+ +I D S V A+D
Sbjct: 301 ALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASAVLATD 360

Query: 362 YTVEFDGTDWQVTRLSDNTKVQTTPGVNADGDPTLEFEGVAIKIDNGTPGPQAKDKFTIK 421
Y + FD WQVTRL+ NT TP D + + F+G+ + P D FT+K
Sbjct: 361 YKISFDNNQWQVTRLASNTTFTVTP----DANGKVAFDGLELTFTG---TPAVNDSFTLK 413

Query: 422 TVSNVAANLQVAITDSSKIAAAGSADGGISDNTNAQALLDLQSKKLVEGK-TTLSGAYAG 480
VS+ N+ V ITD +KIA A D G SDN N QALLDLQS G + + AYA
Sbjct: 414 PVSDAIVNMDVLITDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYAS 473

Query: 481 LVSNVGNQTATAKTNSTAQANIVTQLTTEQQSISGVNLDEEYGDLQRFQQYYLANAQVLQ 540
LVS++GN+TAT KT+S Q N+VTQL+ +QQSISGVNLDEEYG+LQRFQQYYLANAQVLQ
Sbjct: 474 LVSDIGNKTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQ 533

Query: 541 AASTLFNALLSI 552
A+ +F+AL++I
Sbjct: 534 TANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2009  FLAGELLIN     40     4e-06    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 40.4 bits (94), Expect = 4e-06
Identities = 35/137 (25%), Positives = 63/137 (45%), Gaps = 7/137 (5%)

Query: 4 STSMLYQQNMQGITNAQSLWMQTGQQLSTGKRVVNPSDDPMAASQAVMVSQAESENSQYT 63
S S+L Q N+ ++ S ++LS+G R+ + DD AA QA+ +
Sbjct: 8 SLSLLTQNNLNKSQSSLS---SAIERLSSGLRINSAKDD--AAGQAIANRFTSNIKGLTQ 62

Query: 64 LARSFARQSSSLETT--VLAQTTSTIQSIQSLVISAKNDTLSDDDRASYATQLQGLKDQL 121
+R+ S +TT L + + +Q ++ L + A N T SD D S ++Q +++
Sbjct: 63 ASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEI 122

Query: 122 LNQANTTDGNGRYIFAG 138
+N T NG + +
Sbjct: 123 DRVSNQTQFNGVKVLSQ 139


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2010  FLGFLIJ       112    9e-35    Flagellar FliJ protein signature.
		>FLGFLIJ#Flagellar FliJ protein signature.

Length = 147

Score = 112 bits (281), Expect = 9e-35
Identities = 82/144 (56%), Positives = 102/144 (70%)

Query: 1 MKSQSPLVTLCDLAQKAVEQASTQLGHVRQSYQNAEQQLTMLLTYQDEYRERLNDTLCNG 60
M L TL DLA+K VE A+ LG +R+ Q AE+QL ML+ YQ+EYR LN + G
Sbjct: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60

Query: 61 MASSSWQNYQQFIQTLEQAIDQHRKQLAQWSIKVEQAVKYWQEKQQRLNAFETLQERAET 120
+ S+ W NYQQFIQTLE+AI QHR+QL QW+ KV+ A+ W+EK+QRL A++TLQER T
Sbjct: 61 ITSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120

Query: 121 TQRQQENRLDQKLMDEFAQRASQR 144
ENRLDQK MDEFAQRA+ R
Sbjct: 121 AALLAENRLDQKKMDEFAQRAAMR 144


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2012  FLGFLIH       221    5e-75    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 221 bits (563), Expect = 5e-75
Identities = 128/233 (54%), Positives = 167/233 (71%), Gaps = 7/233 (3%)

Query: 6 NALPWQPWSLKDFASQSEAPLSESMPDISLLFPNEPMEATAAVDEQQVLVNLQLEAEKQG 65
+ LPW+ W+ D A P +E +P + P E + A +Q L LQ++A +QG
Sbjct: 3 DNLPWKTWTPDDLAP----PQAEFVPIVE---PEETIIEEAEPSLEQQLAQLQMQAHEQG 55

Query: 66 RQQGFAKGLQEGLDKGYQTGLEEGHQQALADAQQQLAPMTAHWQVMVTDFQNTLDTLDSV 125
Q G A+G Q+G +GYQ GL +G +Q LA+A+ Q AP+ A Q +V++FQ TLD LDSV
Sbjct: 56 YQAGIAEGRQQGHKQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSV 115

Query: 126 IASRLVQIALAAAKQIIGQPAICDGTALLAQIQQMIQQEPMFAGKTQLRVNPDDLAIVEQ 185
IASRL+Q+AL AA+Q+IGQ D +AL+ QIQQ++QQEP+F+GK QLRV+PDDL V+
Sbjct: 116 IASRLMQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDD 175

Query: 186 RLGSTLSLHGWRLLGDSQIHAGGCKVSAEEGDLDASLATRWHELCRLAAPGEL 238
LG+TLSLHGWRL GD +H GGCKVSA+EGDLDAS+ATRW ELCRLAAPG +
Sbjct: 176 MLGATLSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQELCRLAAPGVV 228


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2013  FLGMOTORFLIG  314    e-108    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 314 bits (806), Expect = e-108
Identities = 113/327 (34%), Positives = 192/327 (58%), Gaps = 2/327 (0%)

Query: 2 SLTGTEKSAIMLMTLGEDHAAEVFKHLSSREVQQLSTTMASMRQVSHQQLVDVLAEFEDD 61
+LTG +K+AI+L+++G + +++VFK+LS E++ L+ +A + ++ + +VL EF++
Sbjct: 14 ALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFKEL 73

Query: 62 AEQYAALSVNASDYLRSVLIKALGEERASSLLEDILESRETTSGMETLNFMEPQMAADLI 121
+ DY R +L K+LG ++A ++ + L S + E + +P + I
Sbjct: 74 MMAQEFIQKGGIDYARELLEKSLGTQKAVDIINN-LGSALQSRPFEFVRRADPANILNFI 132

Query: 122 RDEHPQIIATILVHLKRAQAADILALFDERLRNDVMLRIATFGGVQPAALAELTEVLNNL 181
+ EHPQ IA IL +L +A+ IL+ ++ +V RIA P + E+ VL
Sbjct: 133 QQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLEKK 192

Query: 182 LDGQ-NLKRSKMGGIRTAAEIINLMKTQQEETVMDAVREYDGELAQKIIDEMFLFENLVS 240
L + + GG+ EIIN+ + E+ +++++ E D ELA++I +MF+FE++V
Sbjct: 193 LASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDIVL 252

Query: 241 VDDRSIQRLLQEIDNESLLIALKGADQALRERFLSNMSLRAAEILRDDLATRGPVRMSLV 300
+DDRSIQR+L+EID + L ALK D ++E+ NMS RAA +L++D+ GP R V
Sbjct: 253 LDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRKDV 312

Query: 301 ENEQKSILLIVRRLAESGEIVIGGGED 327
E Q+ I+ ++R+L E GEIVI G +
Sbjct: 313 EESQQKIVSLIRKLEEQGEIVISRGGE 339


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2014  FLGMRINGFLIF  577    0.0      Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 577 bits (1488), Expect = 0.0
Identities = 354/552 (64%), Positives = 443/552 (80%), Gaps = 9/552 (1%)

Query: 19 LARLRANPKIPLLIAAAAAIAIIVALMLWAKSPDYRVLYSNLSDRDGGDIVTQLTQLNIP 78
L RLRANP+IPL++A +AA+AI+VA++LWAK+PDYR L+SNLSD+DGG IV QLTQ+NIP
Sbjct: 16 LNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNIP 75

Query: 79 YRFADNGGALLIPAEKVHETRLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQINYQRAL 138
YRFA+ GA+ +PA+KVHE RLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQ+NYQRAL
Sbjct: 76 YRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQVNYQRAL 135

Query: 139 EGELSRTIGTLGPVLNVRVHLAMPKPSLFVREQKSPTASVTLALQPGRALDDGQINAIVY 198
EGEL+RTI TLGPV + RVHLAMPKPSLFVREQKSP+ASVT+ L+PGRALD+GQI+A+V+
Sbjct: 136 EGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQISAVVH 195

Query: 199 MVSSSVAGLPPGNVTVVDQTGRLLTQSDSAGRDLNASQLKFTSEVENRYQRRIENILAPM 258
+VSS+VAGLPPGNVT+VDQ+G LLTQS+++GRDLN +QLKF ++VE+R QRRIE IL+P+
Sbjct: 196 LVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLNDAQLKFANDVESRIQRRIEAILSPI 255

Query: 259 VGNGNVHAQVTAQVDFASREQTDEEYKPNQAANQGAVRSQQVSTSEQLGGTNVGGVPGAL 318
VGNGNVHAQVTAQ+DFA++EQT+E Y PN A++ +RS+Q++ SEQ+G GGVPGAL
Sbjct: 256 VGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPGGVPGAL 315

Query: 319 SNQPPVAPIAPIEIPQPAGAAANNAAPANTAATANANTTATAAKASSSNSRHDQTTNFEV 378
SNQP API P A N +T+ +N+ A +++ ++T+N+EV
Sbjct: 316 SNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNS--------AGPRSTQRNETSNYEV 367

Query: 379 DRTIRHTQQQAGMVQRLSVAVVVNYTSDKAGKPIALSKDQLAQVESLTREAMGFSTVRGD 438
DRTIRHT+ G ++RLSVAVVVNY + GKP+ L+ DQ+ Q+E LTREAMGFS RGD
Sbjct: 368 DRTIRHTKMNVGDIERLSVAVVVNYKTLADGKPLPLTADQMKQIEDLTREAMGFSDKRGD 427

Query: 439 TLNVVNTPFTASDDTRGSSLPFWQQQSFFDQLLNAGRYLLILLVAWILWRKLLRPMLAKK 498
TLNVVN+PF+A D+T G LPFWQQQSF DQLL AGR+LL+L+VAWILWRK +RP L ++
Sbjct: 428 TLNVVNSPFSAVDNT-GGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVRPQLTRR 486

Query: 499 QVADKAAASVNNIVQTAQAAETVKQSKEELALRKKNQQRVSAEVQAQRIRELADKDPRVV 558
KAA + Q + A V+ SK+E +++ QR+ AEV +QRIRE++D DPRVV
Sbjct: 487 VEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPRVV 546

Query: 559 ALVIRQWMSNDQ 570
ALVIRQWMSND
Sbjct: 547 ALVIRQWMSNDH 558


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2015  FLGHOOKFLIE   80     2e-23    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE

signature.
Length = 103

Score = 80.5 bits (198), Expect = 2e-23
Identities = 59/102 (57%), Positives = 73/102 (71%)

Query: 2 SVQGIEGVLQQLQVTALQASGSAKTLPAEAGFASELKAAIGKISENQQVARTSAQNFELG 61
++QGIEGV+ QLQ TA+ A FA +L AA+ +IS+ Q ART A+ F LG
Sbjct: 2 AIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTLG 61

Query: 62 VPGVGLNDVMVNAQKSSVSLQLGIQVRNKLVAAYQEVMNMGV 103
PGV LNDVM + QK+SVS+Q+GIQVRNKLVAAYQEVM+M V
Sbjct: 62 EPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2022  ACRIFLAVINRP  29     0.049    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 29.0 bits (65), Expect = 0.049
Identities = 20/121 (16%), Positives = 38/121 (31%), Gaps = 11/121 (9%)

Query: 32 PLTTQQTSYKSKLTAYGVLQSALAKLETASTALKKADTLNSTAVSGSNSAFSATTDSAAS 91
P + +Y A V + +E + ++ST+ S + + T S
Sbjct: 41 PAVSVSANY-PGADAQTVQDTVTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSG-- 97

Query: 92 AGTYSIEVTNLAKAQSLLSADVPSATDKLGSSDATRTITITQPGQKEPMKISLTSEQTSL 151
T+ AQ + + AT L + I++ + M S+
Sbjct: 98 --------TDPDIAQVQVQNKLQLATPLLPQEVQQQGISVEKSSSSYLMVAGFVSDNPGT 149

Query: 152 T 152
T
Sbjct: 150 T 150


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2023  FLAGELLIN     142    6e-43    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 142 bits (359), Expect = 6e-43
Identities = 133/156 (85%), Positives = 144/156 (92%)

Query: 3 VINTNSLSLLTQNNLNKSQSSLGTAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQ 62
VINTNSLSLLTQNNLNKSQSSL +AIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQ
Sbjct: 3 VINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQ 62

Query: 63 AARNANDGISIAQTTEGSLNEINNNLQRVRELTVQAQNGSNSSSDLDSIQDEISLRLAEI 122
A+RNANDGISIAQTTEG+LNEINNNLQRVREL+VQA NG+NS SDL SIQDEI RL EI
Sbjct: 63 ASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEI 122

Query: 123 DRVSDQTQFNGKKVLAENTTMSIQVGANDGETIDVN 158
DRVS+QTQFNG KVL+++ M IQVGANDGETI ++
Sbjct: 123 DRVSNQTQFNGVKVLSQDNQMKIQVGANDGETITID 158


29  YpAngola_A2044  YpAngola_A2109  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A2044  1       15       -3.892651  N-acetylneuraminic acid mutarotase
YpAngola_A2045  -1      19       -6.329886  D-alanine/D-serine/glycine permease
YpAngola_A2047  1       30       -8.102848  *hypothetical protein
YpAngola_A2048  -1      27       -5.926936  attachment invasion locus protein
YpAngola_A2049  -2      25       -4.735487  hypothetical protein
YpAngola_A2050  -2      26       -4.821294  GlpM protein
YpAngola_A2051  -1      26       -5.114188  hypothetical protein
YpAngola_A2052  -1      26       -5.382956  response regulator
YpAngola_A2053  -1      20       -2.347680  excinuclease ABC subunit C
YpAngola_A2054  0       19       0.108258   phosphatidylglycerophosphate synthetase
YpAngola_A2060  0       22       1.170511   ***hypothetical protein
YpAngola_A2061  0       24       2.283597   hypothetical protein
YpAngola_A2062  -1      25       4.068035   hypothetical protein
YpAngola_A2063  -1      26       5.057220   transposase
YpAngola_A2064  0       23       1.497996   hypothetical protein
YpAngola_A2065  2       24       1.039499   hypothetical protein
YpAngola_A2066  -3      19       -4.028183  hypothetical protein
YpAngola_A2068  0       23       -5.910948  hypothetical protein
YpAngola_A2069  1       18       -3.060622  hypothetical protein
YpAngola_A2070  1       19       -3.576466  hypothetical protein
YpAngola_A2071  1       23       -3.953229  hypothetical protein
YpAngola_A2074  2       24       -5.672073  hypothetical protein
YpAngola_A2075  1       18       0.715749   hypothetical protein
YpAngola_A2077  2       22       2.937918   GntR family transcriptional regulator
YpAngola_A2078  0       23       2.389473   hypothetical protein
YpAngola_A2079  0       23       2.417394   hypothetical protein
YpAngola_A2080  0       24       2.858842   hypothetical protein
YpAngola_A2081  2       24       3.588469   oxidoreductase, NAD-binding
YpAngola_A2082  2       25       2.866011   carbohydrate ABC transporter periplasmic-binding
YpAngola_A2083  3       25       2.580810   carbohydrate ABC transporter permease
YpAngola_A2084  4       27       1.986689   carbohydrate ABC transporter permease
YpAngola_A2087  3       28       0.606364   hypothetical protein
YpAngola_A2088  1       27       3.486706   insertion sequence transposase
YpAngola_A2089  0       29       5.256071   transposase/IS protein
YpAngola_A2090  1       30       6.394528   hypothetical protein
YpAngola_A2091  1       33       7.447857   hypothetical protein
YpAngola_A2092  4       38       10.477369  yp42; ORF 74, len
YpAngola_A2093  4       39       11.070298  yersiniabactin/pesticin receptor FyuA
YpAngola_A2094  5       39       11.241903  yersiniabactin synthetase, YbtE component
YpAngola_A2095  5       39       11.087889  yersiniabactin biosynthesis thioesterase YbtT
YpAngola_A2096  5       39       11.041722  yersiniabactin synthetase, YbtU component
YpAngola_A2097  5       38       10.986646  yersiniabactin synthetase, HMWP1 component
YpAngola_A2098  3       35       9.626968   yersiniabactin synthetase, HMWP2 component
YpAngola_A2099  2       28       6.797039   yersiniabactin transcriptional regulator YbtA
YpAngola_A2100  -1      21       5.133900   yersiniabactin ABC transporter ATP-binding
YpAngola_A2101  -1      17       3.630546   yersiniabactin ABC transporter ATP-binding
YpAngola_A2102  -2      14       1.879945   yersinabactin region putative transporter YbtX
YpAngola_A2104  0       13       -0.167022  phage integrase family site specific
YpAngola_A2107  1       15       -1.946053  *hypothetical protein
YpAngola_A2108  0       15       -2.277372  fimbrial usher protein
YpAngola_A2109  -1      23       -5.281496  pili assembly chaperone
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2048  ENTEROVIROMP  137    1e-43    Enterobacterial virulence outer membrane protein si...
		>ENTEROVIROMP#Enterobacterial virulence outer membrane protein

signature.
Length = 171

Score = 137 bits (346), Expect = 1e-43
Identities = 58/185 (31%), Positives = 97/185 (52%), Gaps = 16/185 (8%)

Query: 1 MKNKTTLAAFITAILLSSSAAYAAGDRTISLGYAQGDVRLGDGNRKDIRLDDDLKGINVK 60
MK L+A + ++ + AA T++ GYAQ D + G N + G N+K
Sbjct: 1 MKKIACLSALAAVLAFTAGTSVAATS-TVTGGYAQSDAQ-GQMN--------KMGGFNLK 50

Query: 61 YLHKLSE-MFGAIGSFTYTDLNYDYLNNNVKIGDASFDYYSLMVGPSVHFNEFFSMYALL 119
Y ++ G IGSFTYT+ ++ YY + GP+ N++ S+Y ++
Sbjct: 51 YRYEEDNSPLGVIGSFTYTE--KSRTASSGDYNKN--QYYGITAGPAYRINDWASIYGVV 106

Query: 120 GIGHGNAKASVL-GYGKKEEQDSLAYGVGMQFNPLNNIAIDASYEYTKLKDANIGTWVLG 178
G+G+G + + Y +YG G+QFNP+ N+A+D SYE ++++ ++GTW+ G
Sbjct: 107 GVGYGKFQTTEYPTYKHDTSDYGFSYGAGLQFNPMENVALDFSYEQSRIRSVDVGTWIAG 166

Query: 179 IGYRF 183
+GYRF
Sbjct: 167 VGYRF 171


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2052  HTHFIS        72     7e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 71.8 bits (176), Expect = 7e-17
Identities = 25/115 (21%), Positives = 46/115 (40%), Gaps = 2/115 (1%)

Query: 2 ISVLLVDDHELVRAGIRRILDDIKGIKVAGEMQCGEDAVKWCRSHVVDIVLMDMNMPGIG 61
++L+ DD +R + + L G V +W + D+V+ D+ MP
Sbjct: 4 ATILVADDDAAIRTVLNQALSR-AGYDVRIT-SNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 62 GLEATRKILRFSPDTKVIMLTIHTENPLPAKVMQAGAGGYLSKGAAPQDVITAIR 116
+ +I + PD V++++ K + GA YL K ++I I
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2064  ENTEROTOXINB  26     0.038    Heat labile enterotoxin B chain signature.
		>ENTEROTOXINB#Heat labile enterotoxin B chain signature.

Length = 124

Score = 26.2 bits (57), Expect = 0.038
Identities = 18/68 (26%), Positives = 31/68 (45%), Gaps = 1/68 (1%)

Query: 18 MEGISEATLYNWRNQAKSEGEPVPGAEKNSEQWPAEARLAVIVETATLSETEIAEYCRKK 77
+ G E + ++N A + E VPG++ Q A R+ + A L+E ++ + C
Sbjct: 52 LAGKREMAIITFKNGAIFQVE-VPGSQHIDSQKKAIERMKDTLRIAYLTEAKVEKLCVWN 110

Query: 78 GLYPAQIA 85
P IA
Sbjct: 111 NKTPHAIA 118


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2082  MALTOSEBP     49     3e-08    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 48.6 bits (115), Expect = 3e-08
Identities = 101/420 (24%), Positives = 170/420 (40%), Gaps = 55/420 (13%)

Query: 14 TLLMAGNASA---QETLRVLLEGHSTSDSIKALLPEFEKQTGIKVQAEIVPYSDLTSKAL 70
T++ + +A A + L + + G + + + +FEK TGIKV E + D +
Sbjct: 17 TMMFSASALAKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVTVE---HPDKLEEKF 73

Query: 71 LAFSSHSGRYDVVMDDWVHAV--GYASAGYITPVDQWMESDTAFYDGADFVKSYA---DT 125
++ D++ W H GYA +G + + D AF D K Y D
Sbjct: 74 PQVAATGDGPDIIF--WAHDRFGGYAQSGLLAEI----TPDKAFQD-----KLYPFTWDA 122

Query: 126 LRYKDGYYGLPVYGESTFLMYRKDLFEQYGIAVPKTFDELTAAAKTIKEKTEGKVAGITL 185
+RY P+ E+ L+Y KDL PKT++E+ A K +K K GK A +
Sbjct: 123 VRYNGKLIAYPIAVEALSLIYNKDLLPN----PPKTWEEIPALDKELKAK--GKSA--LM 174

Query: 186 RGAQGIQNTFAWASFLWGYGGQWIDDNGK-----SAITSPQAVEATKSFVNILKNYGPIG 240
Q + F W G + +NGK + + A V+++KN
Sbjct: 175 FNLQ--EPYFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNA 232

Query: 241 AANFGWQENRLVFQQGKAAMTIDSTVNGGFNEDPKESTVVGKVGYAPVPVQPGDHPGNSG 300
++ E F +G+ AMTI NG + +++ KV Y + +
Sbjct: 233 DTDYSIAE--AAFNKGETAMTI----NGPWAWSNIDTS---KVNYGVTVLPTFKGQPSKP 283

Query: 301 ALQVHGLYISSDSKKQDAAWKFISWATDKQTQMKSVELNPNAGVSSLSAINSDAFTKRYG 360
+ V I++ S ++ A +F+ +++V + L A+ ++ +
Sbjct: 284 FVGVLSAGINAASPNKELAKEFLENYLLTDEGLEAVNKD-----KPLGAVALKSYEEELA 338

Query: 361 AFKDGMLAALQNGNAK--YLPTIPQSTQIINITGIALSEALAGTQTVENALQQANTRNDK 418
KD +AA K +P IPQ + A+ A +G QTV+ AL+ A TR K
Sbjct: 339 --KDPRIAATMENAQKGEIMPNIPQMSAFWYAVRTAVINAASGRQTVDEALKDAQTRITK 396


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2088  HTHTETR       28     0.047    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.1 bits (62), Expect = 0.047
Identities = 12/106 (11%), Positives = 31/106 (29%), Gaps = 14/106 (13%)

Query: 16 QGMSSRAIARELGISRNTVKRYLQAKSEPPKYTPRPAVASLLDEYRDYIRQRIAD----- 70
S IA+ G++R + + + KS+ + + + I + +
Sbjct: 30 SSTSLGEIAKAAGVTRGAIYWHFKDKSD--------LFSEIWELSESNIGELELEYQAKF 81

Query: 71 -AHPYKIPATVIAREIRDQGYRGGMTILRAFIRSLSVPQEQEPAVR 115
P + ++ + +L I + V+
Sbjct: 82 PGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2097  DHBDHDRGNASE  46     1e-06    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 45.8 bits (108), Expect = 1e-06
Identities = 32/156 (20%), Positives = 55/156 (35%), Gaps = 19/156 (12%)

Query: 1561 LVTGAFGGLGRLAVNWLREKGARRIALLAPRVDESWLRDVEGGQTRVCR------CDVGD 1614
+TGA G+G L +GA + A + L V R DV D
Sbjct: 12 FITGAAQGIGEAVARTLASQGAH---IAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 1615 AGQLATVLDDLAAN-GGIAGAIHAAGVLADAPLQELDDHQLAAVFAVKAQAASQLLQTLR 1673
+ + + + G I ++ AGVL + L D + A F+V + +++
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 1674 NH-----DGRYLILYSSAAAT----LGAPGQSAHAL 1700
+ G + + S+ A + A S A
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAA 164


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2098  ISCHRISMTASE  51     2e-08    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 51.2 bits (122), Expect = 2e-08
Identities = 22/70 (31%), Positives = 44/70 (62%)

Query: 22 QQLRERLIQELNLTPQQLHEESNLIQAGLDSIRLMRWLHWFRKNGYRLTLRELYAAPTLA 81
+ +R+++ + L TP+ + ++ +L+ GLDS+R+M + +R+ G +T EL PT+
Sbjct: 233 ENIRKQIAELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIE 292

Query: 82 AWNQLMLSRS 91
W +L+ +RS
Sbjct: 293 EWQKLLTTRS 302


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2107  PF00577       30     0.022    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 29.8 bits (67), Expect = 0.022
Identities = 13/116 (11%), Positives = 26/116 (22%), Gaps = 15/116 (12%)

Query: 287 ESGTSSGQTAIGIQTSLPGYLKALGLGLVNTAGGVSYLLSDSYG--TDSRIATGVGISLS 344
+ + + ++P L + + S SY D +
Sbjct: 584 NAWQKGRDQMLALNVNIP-----FSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVY 638

Query: 345 DSNGSTMNFVGWG-------GCAQTQDCLTTADAGWYPILTGASGNGSHSAGYNNY 393
+ N + + G A + A+ SHS
Sbjct: 639 GTLLED-NNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDDIKQL 693


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2108  PF00577       742    0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 742 bits (1917), Expect = 0.0
Identities = 244/882 (27%), Positives = 392/882 (44%), Gaps = 72/882 (8%)

Query: 6 LLVTHISSAADNNNQDDYIFDDALVRGSSLGLGSIARFNKKNSYDAGQYQVDMYMNNKFV 65
L V +A + + F+ + + ++RF G Y+VD+Y+NN ++
Sbjct: 30 LFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYLNNGYM 89

Query: 66 DRLKMLFVDKDNS--VEPCLSVAQLLQAGVKEEALKTAD--PKTPCLAFQSILPASDFRF 121
+ F D+ + PCL+ AQL G+ ++ + C+ S++ + +
Sbjct: 90 ATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTSMIHDATAQL 149

Query: 122 DHAKLRFDLSIPQKFVKNVPRGYVDPKNLTAGNTIGFSNYNLNQYHVDYNKEGIKRTTNS 181
D + R +L+IPQ F+ N RGY+ P+ G G NYN + V G ++
Sbjct: 150 DVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGG---NSHY 206

Query: 182 TYLSLNSGINIGMWRFRQQGSLRYDASRG-----TNWTSNRLYSQRALPTIGSEITLGET 236
YL+L SG+NIG WR R + Y++S W + +R + + S +TLG+
Sbjct: 207 AYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLTLGDG 266

Query: 237 FSSGQFFSSLGFLGVALSTDDRMLPESQRGYAPVVRGIARTNARVMVYQNNRSIYQTTVS 296
++ G F + F G L++DD MLP+SQRG+APV+ GIAR A+V + QN IY +TV
Sbjct: 267 YTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNSTVP 326

Query: 297 PGAFEFNDLSVTHFGGDLTVEINEADGSVSTFQVPFASVPESLRPGYSRYSFAAGQVRDV 356
PG F ND+ GDL V I EADGS F VP++SVP R G++RYS AG+ R
Sbjct: 327 PGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYRSG 386

Query: 357 GN---NETFSELTYQQGISNAITANTGIRLASGYQAIMLGGVF-THYIGALGLNTTYSHA 412
F + T G+ T G +LA Y+A G +GAL ++ T +++
Sbjct: 387 NAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMTQANS 446

Query: 413 RLPDGEQQQGWMAKASFSRTFQPTNTTLSVAGYRYSTDGYRDLSDVLGVR---------- 462
LPD Q G + ++++ + T + + GYRYST GY + +D R
Sbjct: 447 TLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIETQD 506

Query: 463 ----ATSNDSSWNSSTYRQRSRAEISLNQNFHRYGSLYLTASSQDYRDDRSRDSQLQLGY 518
+ + + Y +R + ++++ Q R +LYL+ S Q Y + D Q Q G
Sbjct: 507 GVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDEQFQAGL 566

Query: 519 SNTFWRNTSFNLAISQQKTGGANKIYFVDPGSGMPASNGANTLATRETVAQMSISFPLGG 578
+ + + ++ L+ S K R+ + ++++ P
Sbjct: 567 NTA-FEDINWTLSYSLTKNAWQKG---------------------RDQMLALNVNIPFSH 604

Query: 579 SSSAP--------YVSAGAVNSRTSGASYQTSLSGTMGSDQTAGYSVDVARNEP---TNE 627
+ S + + + GT+ D YSV +
Sbjct: 605 WLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSG 664

Query: 628 NTLSGSLQKQLPTTSLSGSASRSPGYWQGSASARGSVAFHRGGVTLGPYLSDTFALIEAK 687
+T +L + + + S S Q G V H GVTLG L+DT L++A
Sbjct: 665 STGYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAP 724

Query: 688 GASGAKVMYGQGARIDRFGYALVPTLTPYRYNTLSLDPDGMDFNTELQDGERQIAPYAGS 747
GA AKV G R D GYA++P T YR N ++LD + + N +L + + P G+
Sbjct: 725 GAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGA 784

Query: 748 TVKVTFRTLNGYPALITIKMPDGSQLPMGTVVYNYNGKGTNDKNDIVGMVGQSSQAYLRA 807
V+ F+ G L+T+ + LP G +V T++ + G+V + Q YL
Sbjct: 785 IVRAEFKARVGIKLLMTLT-HNNKPLPFGAMV-------TSESSQSSGIVADNGQVYLSG 836

Query: 808 EELSGTLTLVWGESSKERCQLDYDLGKPTDNDKQLYKLDALC 849
L+G + + WGE C +Y L P + L +L A C
Sbjct: 837 MPLAGKVQVKWGEEENAHCVANYQL-PPESQQQLLTQLSAEC 877


30  YpAngola_A2147  YpAngola_A2206  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A2147  3       27       -2.628401  transporter
YpAngola_A2148  2       19       -2.177520  YciI-like protein
YpAngola_A2150  1       23       -0.055900  hypothetical protein
YpAngola_A2151  -1      17       -0.594087  hypothetical protein
YpAngola_A2152  -1      17       -0.619973  attachment invasion locus protein
YpAngola_A2153  0       17       -0.688869  hypothetical protein
YpAngola_A2154  0       15       -1.392385  transposase/IS protein
YpAngola_A2155  -1      15       -2.036071  insertion sequence transposase
YpAngola_A2156  -1      18       -3.221637  cardiolipin synthetase
YpAngola_A2157  0       18       -3.875625  dsDNA-mimic protein
YpAngola_A2158  1       19       -4.427224  hypothetical protein
YpAngola_A2159  0       18       -3.974472  oligopeptide transport ATP-binding protein
YpAngola_A2160  -1      17       -3.344057  oligopeptide ABC transporter ATP-binding
YpAngola_A2161  -1      17       -3.452770  oligopeptide transport system permease
YpAngola_A2162  0       20       -3.759302  oligopeptide transporter permease
YpAngola_A2163  1       19       -3.535539  periplasmic oligopeptide-binding protein
YpAngola_A2164  1       21       -2.024946  hypothetical protein
YpAngola_A2165  2       22       -1.941648  bifunctional acetaldehyde-CoA/alcohol
YpAngola_A2166  2       21       -0.385423  thymidine kinase
YpAngola_A2167  3       24       0.360640   global DNA-binding transcriptional dual
YpAngola_A2168  2       26       0.951378   IS1541 transposase
YpAngola_A2169  2       27       1.494784   IS285 transposase
YpAngola_A2170  2       29       1.687737   hypothetical protein
YpAngola_A2171  2       28       1.463425   insertion sequence transposase
YpAngola_A2172  0       29       0.192543   transposase/IS protein
YpAngola_A2173  -1      26       -1.859355  DNA adenine methyltransferase
YpAngola_A2174  -1      25       -3.394020  NinB protein
YpAngola_A2175  1       27       -3.055809  hypothetical protein
YpAngola_A2176  3       28       -2.053509  antitermination protein Q
YpAngola_A2177  2       28       -2.187224  hypothetical protein
YpAngola_A2178  2       26       -1.684787  hypothetical protein
YpAngola_A2179  2       25       -1.196233  hypothetical protein
YpAngola_A2180  3       26       -0.927062  lysozyme
YpAngola_A2181  4       28       -1.934460  bacteriophage lysis protein
YpAngola_A2182  3       22       -1.504714  hypothetical protein
YpAngola_A2183  4       24       -0.764873  phage regulatory protein
YpAngola_A2184  3       23       -0.528425  hypothetical protein
YpAngola_A2186  5       23       0.127699   phage protein
YpAngola_A2187  3       23       0.161303   terminase large subunit
YpAngola_A2188  3       23       0.096319   IS285 transposase
YpAngola_A2189  3       22       -0.145226  hypothetical protein
YpAngola_A2192  0       20       -0.625807  hypothetical protein
YpAngola_A2193  -1      20       -0.412442  hypothetical protein
YpAngola_A2194  -1      19       -1.193355  major capsid protein
YpAngola_A2195  4       24       -1.873459  hypothetical protein
YpAngola_A2196  4       24       -1.306094  hypothetical protein
YpAngola_A2197  4       25       -1.477798  hypothetical protein
YpAngola_A2198  4       25       -1.272398  phage tail protein
YpAngola_A2199  4       25       -3.218335  hypothetical protein
YpAngola_A2200  3       24       -3.868218  prophage tail length tape measure protein
YpAngola_A2201  1       17       -5.601203  phage minor tail protein
YpAngola_A2202  1       19       -6.250819  IS1541 transposase
YpAngola_A2203  1       22       -6.585049  hypothetical protein
YpAngola_A2204  0       21       -6.382370  MerR family transcriptional regulator
YpAngola_A2205  1       21       -5.030281  hypothetical protein
YpAngola_A2206  -1      18       -3.874299  C32 tRNA thiolase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2147  TONBPROTEIN   160    8e-51    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 160 bits (405), Expect = 8e-51
Identities = 87/248 (35%), Positives = 120/248 (48%), Gaps = 20/248 (8%)

Query: 10 RRLTWSLIFSIGLHGSVVAALLYVSVEQMKIQPEIEDTPLAVTMVNIAEFAAPQPAAAAP 69
RR W + S+ +HG+VVA LLY SV Q+ P P++VTMV A
Sbjct: 7 RRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQ-PISVTMVT-----------PAD 54

Query: 70 EPVQETPAVPEETPPVLEETPPEPEELPEPVPVPVPEPVKPKPKPVKKEVKKPEVKKTQ- 128
+ P E E P E P+ PV + +P K K E K
Sbjct: 55 LEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDV 114

Query: 129 ---APPDDKPFKSDEAALVANNAPVKSAPVASTPGLSTSAGPKALSKAKPSYPARALALG 185
PF++ A + ++ + P S ++GP+ALS+ +P YPARA AL
Sbjct: 115 KPVESRPASPFENTAPARLTSSTATAATS---KPVTSVASGPRALSRNQPQYPARAQALR 171

Query: 186 IEGQVKVQYDIDESGRVTNVRVLEATPRNTFEREVKQVMRKWRFEA-VAAKNYVTTIVFK 244
IEGQVKV++D+ GRV NV++L A P N FEREVK MR+WR+E V I+FK
Sbjct: 172 IEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFK 231

Query: 245 LDGKMEMN 252
++G E+
Sbjct: 232 INGTTEIQ 239


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2152  ENTEROVIROMP  158    3e-52    Enterobacterial virulence outer membrane protein si...
		>ENTEROVIROMP#Enterobacterial virulence outer membrane protein

signature.
Length = 171

Score = 158 bits (402), Expect = 3e-52
Identities = 71/180 (39%), Positives = 105/180 (58%), Gaps = 10/180 (5%)

Query: 1 MKWITTLAPLSLALSLGISVANAASDASNTVSFGYAQSTLKIDGEKIGKDNKGFNLKYRH 60
MK I L+ L+ L+ + AA+ +TV+ GYAQS + K+ GFNLKYR+
Sbjct: 1 MKKIACLSALAAVLAFTAGTSVAAT---STVTGGYAQSDAQGQMNKM----GGFNLKYRY 53

Query: 61 ELD-SVLGIVASFTHTKQNYGMPGDSDGKRKVEYYSLMVGPSWRFNEFVSAYALIGATQG 119
E D S LG++ SFT+T+++ K +YY + GP++R N++ S Y ++G G
Sbjct: 54 EEDNSPLGVIGSFTYTEKSRTASSGDYNK--NQYYGITAGPAYRINDWASIYGVVGVGYG 111

Query: 120 KSTHTKPRMVSNTVSKTSMGYGAGLQFNPVKHVAIDTAYEYAKIEDVKIGTWIVGGGYRF 179
K T+ + S YGAGLQFNP+++VA+D +YE ++I V +GTWI G GYRF
Sbjct: 112 KFQTTEYPTYKHDTSDYGFSYGAGLQFNPMENVALDFSYEQSRIRSVDVGTWIAGVGYRF 171


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2155  HTHTETR       28     0.047    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.1 bits (62), Expect = 0.047
Identities = 12/106 (11%), Positives = 31/106 (29%), Gaps = 14/106 (13%)

Query: 16 QGMSSRAIARELGISRNTVKRYLQAKSEPPKYTPRPAVASLLDEYRDYIRQRIAD----- 70
S IA+ G++R + + + KS+ + + + I + +
Sbjct: 30 SSTSLGEIAKAAGVTRGAIYWHFKDKSD--------LFSEIWELSESNIGELELEYQAKF 81

Query: 71 -AHPYKIPATVIAREIRDQGYRGGMTILRAFIRSLSVPQEQEPAVR 115
P + ++ + +L I + V+
Sbjct: 82 PGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2159  HTHFIS        31     0.007    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 31.0 bits (70), Expect = 0.007
Identities = 9/16 (56%), Positives = 11/16 (68%)

Query: 54 VVGESGCGKSTFARAI 69
+ GESG GK ARA+
Sbjct: 165 ITGESGTGKELVARAL 180


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2171  HTHTETR       28     0.047    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.1 bits (62), Expect = 0.047
Identities = 12/106 (11%), Positives = 31/106 (29%), Gaps = 14/106 (13%)

Query: 16 QGMSSRAIARELGISRNTVKRYLQAKSEPPKYTPRPAVASLLDEYRDYIRQRIAD----- 70
S IA+ G++R + + + KS+ + + + I + +
Sbjct: 30 SSTSLGEIAKAAGVTRGAIYWHFKDKSD--------LFSEIWELSESNIGELELEYQAKF 81

Query: 71 -AHPYKIPATVIAREIRDQGYRGGMTILRAFIRSLSVPQEQEPAVR 115
P + ++ + +L I + V+
Sbjct: 82 PGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
YpAngola_A2200  RTXTOXIND  30     0.048    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 30.2 bits (68), Expect = 0.048
Identities = 21/90 (23%), Positives = 33/90 (36%), Gaps = 5/90 (5%)

Query: 251 RAAAQATKAQENADLSAATAKENFIQRLKAQADLQGKTASEIQAYKAAQLGVTEQAAPFI 310
A A K Q + L A ++ Q L +L ++ Q E+
Sbjct: 131 GAEADTLKTQ--SSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLT 188

Query: 311 AKLKEQESAWQNGALSAKQYRLALRQLPSQ 340
+ +KEQ S WQN Q L L + ++
Sbjct: 189 SLIKEQFSTWQN---QKYQKELNLDKKRAE 215


31  YpAngola_A2218  YpAngola_A2224  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A2218  2       21       -3.476533  azoreductase
YpAngola_A2220  2       20       -4.139235  DNA-binding protein
YpAngola_A2221  3       18       -3.766876  putative protease
YpAngola_A2222  3       20       -6.441750  acid shock protein
YpAngola_A2223  2       16       -3.813241  cytochrome b561 family protein
YpAngola_A2224  1       14       -3.613491  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits        Score  E-value  Comments
YpAngola_A2221  V8PROTEASE  104    3e-28    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 104 bits (260), Expect = 3e-28
Identities = 37/249 (14%), Positives = 83/249 (33%), Gaps = 41/249 (16%)

Query: 33 QTALFFGKDDRTAVTNSRQWPWEAIGQVET---ASGNLCTATLISPRLVLTAGHCVLTP- 88
+ +DR +T++ + + ++ + + ++ +LT H V
Sbjct: 66 HANVILPNNDRHQITDTTNGHYAPVTYIQVEAPTGTFIASGVVVGKDTLLTNKHVVDATH 125

Query: 89 --PGNIDQAVALRFISDKGHWKYQITDLKTRVDAKLGQKLKADGDGWIVPPAAAAYDFAL 146
P + + + + + + +GD A F+
Sbjct: 126 GDPHALKAFPSAINQDNYPNGGFTAEQITKY---------SGEGD-------LAIVKFSP 169

Query: 147 IQLTNAAPIPIKPLPLWEGTANELTKALKLVNRKVTQAGYPLD-NLNTLYKHEDCLVTGW 205
+ +KP + A VN+ +T GYP D + T+++ + +
Sbjct: 170 NEQNKHIGEVVKPATM-------SNNAETQVNQNITVTGYPGDKPVATMWESKG--KITY 220

Query: 206 AQQGVLAHQCDTLPGDSGSPLLLKNGNSWSLIAIQSSAPAAKERYLADNRALSVT-AINN 264
+ + + T G+SGSP+ + +I I + N A+ + + N
Sbjct: 221 LKGEAMQYDLSTTGGNSGSPVFNEKNE---VIGIHWGGVPNE-----FNGAVFINENVRN 272

Query: 265 RLKKLVNKI 273
LK+ + I
Sbjct: 273 FLKQNIEDI 281


32  YpAngola_A2319  YpAngola_A2331  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A2319  -2      15       -3.102257  phospholipid-binding domain-containing protein
YpAngola_A2321  -1      16       -2.912205  tryptophan synthase subunit beta
YpAngola_A2322  -2      14       -2.866963  outer membrane protein W
YpAngola_A2323  0       17       -3.212865  hypothetical protein
YpAngola_A2324  -1      17       -3.689698  hypothetical protein
YpAngola_A2326  -2      19       -3.947668  intracellular septation protein A
YpAngola_A2327  -2      17       -3.414956  acyl-CoA thioester hydrolase
YpAngola_A2328  -2      17       -3.411016  IS285 transposase
YpAngola_A2329  -1      24       -5.066223  hypothetical protein
YpAngola_A2330  -2      17       -3.643589  UDP-glucose/GDP-mannose dehydrogenase family
YpAngola_A2331  -1      17       -3.579902  response regulator of RpoS
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2330  NUCEPIMERASE  29     0.032    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 29.4 bits (66), Expect = 0.032
Identities = 20/88 (22%), Positives = 33/88 (37%), Gaps = 14/88 (15%)

Query: 1 MKVTVFGI-GYVGLVQATVLAEVGHDVLCID-IDANKVADLKKGRIAIFEPGLAPLVK-- 56
MK V G G++G + L E GH V+ ID ++ LK+ R+ + K
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 57 -ENYEAGRLQFSTD---------AQAGV 74
+ E F++ + V
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAV 88


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
YpAngola_A2331  HTHFIS  84     4e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 84.1 bits (208), Expect = 4e-20
Identities = 31/115 (26%), Positives = 48/115 (41%), Gaps = 1/115 (0%)

Query: 10 ILVVEDEVVFRTVLAEYLGSLGATIHQAENGLAALYQLKGHSPDLILCDLAMPKMGGIEF 69
ILV +D+ RTVL + L G + N + DL++ D+ MP +
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 70 VEQLLLKGIKIPVLVISATDKMADIAQVLRLGVKDVLLKPIVDLNRLREAVLACL 124
+ ++ +PVLV+SA + + G D L KP DL L + L
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKP-FDLTELIGIIGRAL 119


33  YpAngola_A2432  YpAngola_A2446  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A2432  0       18       -4.773903  hypothetical protein
YpAngola_A2433  0       15       -3.663650  hypothetical protein
YpAngola_A2434  0       14       -3.466613  arginyl-tRNA synthetase
YpAngola_A2435  1       17       -3.373881  putative hemolysin
YpAngola_A2437  0       16       -3.371707  hypothetical protein
YpAngola_A2439  0       16       -2.239563  ShlB/FhaC/HecB family hemolysin
YpAngola_A2440  0       12       1.415859   integral membrane protein MviN
YpAngola_A2442  -1      13       0.898906   ribosomal-protein-S5-alanine
YpAngola_A2443  -1      14       2.247210   multidrug resistance protein MdtH
YpAngola_A2444  -1      17       2.417643   hypothetical protein
YpAngola_A2445  0       21       3.512521   TorD family cytoplasmic chaperone
YpAngola_A2446  -2      24       3.414752   putative hydrolase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
YpAngola_A2443  TCRTETA  63     7e-13    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 62.5 bits (152), Expect = 7e-13
Identities = 66/356 (18%), Positives = 127/356 (35%), Gaps = 15/356 (4%)

Query: 14 FLLFDNLLVVLGFFVVFPLISIRFVDQLGWAALVV---GLALGLRQLVQQGLGIFGGAIA 70
+L L +G ++ P++ + L + V G+ L L L+Q GA++
Sbjct: 9 VILSTVALDAVGIGLIMPVLPG-LLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALS 67

Query: 71 DRFGAKPMIVTGMLMRAAGFALMAMADEPWILWLACALSGLGGTLFDPPRTALVIKLTRP 130
DRFG +P+++ + A +A+MA A W+L++ ++G+ G A + +T
Sbjct: 68 DRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVA-GAYIADITDG 126

Query: 131 HERGRFYSLLMMQDSAGAVIGALIGSWLLQYDFHFVCWTGAAIFVLAAGWNAWLLPAYRI 190
ER R + + G V G ++G + + H + AA+ L +LLP
Sbjct: 127 DERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHK 186

Query: 191 STVRAPMKEGLMRVLRDRRFVTYVLTLTGYYMLAVQVMLMLPI--------VVNELAGSP 242
R P++ + L R+ + + + + L+ + +
Sbjct: 187 GE-RRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDA 245

Query: 243 AAVKWMYAIEAALSLTLLYPLARWSEKRFSLEQRLMAGLLIMTLSLFPIGMITHLQTLFM 302
+ A L + R + LM G++ + T F
Sbjct: 246 TTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFP 305

Query: 303 FICFFYMGSILAEPARETLGASLADSRARGSYMGFSRLGLALGGALGYTGGGWMYD 358
+ G I PA + + + D +G G +L +G +Y
Sbjct: 306 IMVLLASGGI-GMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYA 360


34  YpAngola_A2482  YpAngola_A2496  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A2482  1       21       -4.990923  carboxymuconolactone decarboxylase family
YpAngola_A2483  1       23       -6.050861  cupin domain-containing protein
YpAngola_A2484  2       23       -6.855812  hypothetical protein
YpAngola_A2485  1       25       -7.826637  hypothetical protein
YpAngola_A2486  -1      25       -7.797883  hypothetical protein
YpAngola_A2487  -2      17       -4.513349  hypothetical protein
YpAngola_A2488  -1      17       -3.026353  short chain dehydrogenase/reductase family
YpAngola_A2489  -1      16       -5.220351  hypothetical protein
YpAngola_A2490  -1      16       -5.304894  hypothetical protein
YpAngola_A2491  1       19       -6.687700  hypothetical protein
YpAngola_A2492  1       19       -6.762209  hypothetical protein
YpAngola_A2493  2       21       -7.521252  hypothetical protein
YpAngola_A2494  3       20       -6.961999  hypothetical protein
YpAngola_A2496  3       18       -3.897751  transposase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2488  DHBDHDRGNASE  67     3e-15    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 66.6 bits (162), Expect = 3e-15
Identities = 56/231 (24%), Positives = 90/231 (38%), Gaps = 25/231 (10%)

Query: 7 IILTGASGLIGSAIADALYKSGMNLVLACKRSQKLQDRYLSDDKSKRAYFWY-GDLTNEK 65
+TGA+ IG A+A L G ++ +KL+ S R + D+ +
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSA 70

Query: 66 ACRELVEYAVQQMGGVDVLINCAGVFNFSALEEMTYSRITDTISTNLLAPIYLTHLVLPY 125
A E+ ++MG +D+L+N AGV + ++ T S N + V Y
Sbjct: 71 AIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSKY 130

Query: 126 IKTSACPIIVNISSIAGFSSLPEGACYAASKWGLNGFIHSIREELRKKSIHICNI-SPCQ 184
+ IV + S A YA+SK F + EL + +I CNI SP
Sbjct: 131 MMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIR-CNIVSPGS 189

Query: 185 VKT-----LSHHSDTAIRTIA-----------------PENIANAVILVLS 213
+T L + A + I P +IA+AV+ ++S
Sbjct: 190 TETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVS 240


35  YpAngola_A2604  YpAngola_A2633  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A2604  2       19       -2.704901  lipoate-protein ligase A
YpAngola_A2605  1       18       -2.678599  NlpC/P60 family lipoprotein
YpAngola_A2606  0       17       -2.708604  hypothetical protein
YpAngola_A2607  -1      15       -2.119226  SMR family multidrug efflux pump
YpAngola_A2608  -1      14       -2.314578  4-amino-4-deoxy-L-arabinose transferase
YpAngola_A2609  -2      15       -1.844530  hypothetical protein
YpAngola_A2610  -2      15       -2.415719  bifunctional UDP-glucuronic acid
YpAngola_A2611  -1      16       -1.698927  undecaprenyl phosphate
YpAngola_A2612  2       17       -1.550799  UDP-4-amino-4-deoxy-L-arabinose--oxoglutarate
YpAngola_A2613  0       18       -2.076289  IS1541 transposase
YpAngola_A2614  0       16       -1.862567  vitamin B12-transporter ATPase
YpAngola_A2615  -1      19       -1.240320  putative glutathione peroxidase
YpAngola_A2616  -1      19       -0.943866  vitamin B12-transporter permease
YpAngola_A2617  0       18       -0.905943  hypothetical protein
YpAngola_A2618  1       20       -0.881774  IS1541 transposase
YpAngola_A2619  2       23       -0.679028  integration host factor subunit alpha
YpAngola_A2620  1       20       -0.940245  phenylalanyl-tRNA synthetase subunit beta
YpAngola_A2621  3       21       -1.486796  phenylalanyl-tRNA synthetase subunit alpha
YpAngola_A2622  4       20       -1.919025  50S ribosomal protein L20
YpAngola_A2623  1       16       -2.829028  50S ribosomal protein L35
YpAngola_A2624  0       14       -2.623122  translation initiation factor IF-3
YpAngola_A2625  -1      13       -3.214227  threonyl-tRNA synthetase
YpAngola_A2626  -2      16       -3.911362  hypothetical protein
YpAngola_A2627  -1      16       -3.397189  hypothetical protein
YpAngola_A2628  -1      15       -2.985290  multiple drug resistance protein MarC
YpAngola_A2629  -1      15       -2.917574  transglycosylase slt family protein
YpAngola_A2630  -2      17       -2.724449  periplasmic-binding protein
YpAngola_A2631  0       17       -2.311063  ATP-binding transport protein
YpAngola_A2632  0       18       -2.949936  chelated iron transport system membrane protein
YpAngola_A2633  -1      19       -3.255045  chelated iron transport system membrane protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2610  NUCEPIMERASE  102    7e-26    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 102 bits (256), Expect = 7e-26
Identities = 74/361 (20%), Positives = 138/361 (38%), Gaps = 60/361 (16%)

Query: 317 RVLILGVNGFIGNHLTERLLQDDRYEVYGLDIGSD--------AISRFLGNPAFHFVEGD 368
+ L+ G GFIG H+++RLL+ ++V G+D +D A L P F F + D
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAG-HQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 369 ISIHSEWIE--YHIKKCDVILPLVAIATPIEYT-RNPLRVFELDFEENLKIVRDCVKYN- 424
++ E + + + + + Y+ NP + + L I+ C
Sbjct: 61 LADR-EGMTDLFASGHFERVFISPHRLA-VRYSLENPHAYADSNLTGFLNILEGCRHNKI 118

Query: 425 KRIVFPSTSEVYGMCDDKEFDEDTSRLIVGPINKQRWIYSVSKQLLDRVIWAYGVKEGLK 484
+ +++ S+S VYG+ F D ++ +Y+ +K+ + + Y GL
Sbjct: 119 QHLLYASSSSVYGLNRKMPFSTD------DSVDHPVSLYAATKKANELMAHTYSHLYGLP 172

Query: 485 FTLFRPFNWMGPRLDNLDAARIGSSRAITQLILNLVEGSPIKLVDGGAQKRCFTDIHDGI 544
T R F GP D A ++A+ +EG I + + G KR FT I D
Sbjct: 173 ATGLRFFTVYGPWGRP-DMALFKFTKAM-------LEGKSIDVYNYGKMKRDFTYIDDIA 224

Query: 545 EALFRIIEN---------------RDGCCDGRIINIGNPTNEASIRELAEMLLTSFENHE 589
EA+ R+ + R+ NIGN ++ + + + L +
Sbjct: 225 EAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGN-SSPVELMDYIQALEDALGIEA 283

Query: 590 LRDHFPPFAGFKDIESSAYYGKGYQDVEYRTPSIKNARRILHWQPEIAMQQTVTETLDFF 649
++ P G DV + K ++ + PE ++ V ++++
Sbjct: 284 KKNMLPLQPG---------------DVLETSADTKALYEVIGFTPETTVKDGVKNFVNWY 328

Query: 650 L 650

Sbjct: 329 R 329


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2611  PREPILNPTASE  32     0.002    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 32.5 bits (74), Expect = 0.002
Identities = 25/90 (27%), Positives = 36/90 (40%), Gaps = 4/90 (4%)

Query: 220 LMYDLITCLTTTPLRLLSLVGSAIALLGF-TFSVLLVALRLIFGPEWAGGGVFTLFAVLF 278
L L+ L + L V A+A G+ L A +L+ G E G G F L A L
Sbjct: 166 LWGGLLFNLLGGFVSLGDAVIGAMA--GYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALG 223

Query: 279 MFIGAQFV-GMGLLGEYIGRIYNDVRARPR 307
++G Q + + LL +G R
Sbjct: 224 AWLGWQALPIVLLLSSLVGAFMGIGLILLR 253


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
YpAngola_A2614  PF05272  32     0.003    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 31.6 bits (71), Expect = 0.003
Identities = 17/66 (25%), Positives = 27/66 (40%), Gaps = 10/66 (15%)

Query: 29 LIGPNGAGKSTLLASLAGL------LPASGEIVLAGKSLQHYEGHELAR----QRAYLSQ 78
L G G GKSTL+ +L GL G + + + +EL+ +RA
Sbjct: 601 LEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELSEMTAFRRADAEA 660

Query: 79 QQSALS 84
++ S
Sbjct: 661 VKAFFS 666


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
YpAngola_A2617  OUTRSURFACE  30     0.004    Outer surface protein signature.
		>OUTRSURFACE#Outer surface protein signature.

Length = 273

Score = 29.5 bits (66), Expect = 0.004
Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 8/52 (15%)

Query: 1 MKKYLLLFGVLSFMPLIAQSDVSLD------INMPGIN--LHLGDQDKRGYY 44
MKKYLL G++ + Q+ SLD +++PG L ++DK G Y
Sbjct: 1 MKKYLLGIGLILALIACKQNVSSLDEKNSASVDLPGEMKVLVSKEKDKDGKY 52


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2619  DNABINDINGHU  117    2e-38    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 117 bits (296), Expect = 2e-38
Identities = 36/89 (40%), Positives = 55/89 (61%)

Query: 4 TKAEMSEHLFEKLGLSKRDAKDLVELFFEEVRRALENGEQVKLSGFGNFDLRDKNQRPGR 63
K ++ + E L+K+D+ V+ F V L GE+V+L GFGNF++R++ R GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 64 NPKTGEDIPITARRVVTFRPGQKLKSRVE 92
NP+TGE+I I A +V F+ G+ LK V+
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAVK 91


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2630  ADHESNFAMILY  388    e-138    Adhesin family signature.
		>ADHESNFAMILY#Adhesin family signature.

Length = 309

Score = 388 bits (997), Expect = e-138
Identities = 105/309 (33%), Positives = 179/309 (57%), Gaps = 7/309 (2%)

Query: 21 LRAAALFTIVAFSSLISTAALAENNPSDTAKKFKVVTTFTIIQDIAQNIAGDVAVVESIT 80
++ ++ S++I A + + + +K KVV T +II DI +NIAGD + SI
Sbjct: 1 MKKLGTLLVLFLSAIILVACASGKKDTTSGQKLKVVATNSIIADITKNIAGDKIDLHSIV 60

Query: 81 KPGAEIHDYQPTPRDIVKAQSADLILWNGMNLER----WFEKFFESIK---DVPSAVVTA 133
G + H+Y+P P D+ K ADLI +NG+NLE WF K E+ K + V+
Sbjct: 61 PIGQDPHEYEPLPEDVKKTSEADLIFYNGINLETGGNAWFTKLVENAKKTENKDYFAVSD 120

Query: 134 GITPLPIREGPYSGIANPHAWMSPSNALIYIENIRKALVEHDPAHAETYNRNAQAYAEKI 193
G+ + + G +PHAW++ N +I+ +NI K L DP + E Y +N + Y +K+
Sbjct: 121 GVDVIYLEGQNEKGKEDPHAWLNLENGIIFAKNIAKQLSAKDPNNKEFYEKNLKEYTDKL 180

Query: 194 KALDAPLRERLSRIPAEQRWLVTSEGAFSYLAKDYGFKEVYLWPINAEQQGIPQQVRHVI 253
LD +++ ++IPAE++ +VTSEGAF Y +K YG Y+W IN E++G P+Q++ ++
Sbjct: 181 DKLDKESKDKFNKIPAEKKLIVTSEGAFKYFSKAYGVPSAYIWEINTEEEGTPEQIKTLV 240

Query: 254 DIIRENKIPVVFSESTISDKPAKQVSKETGAQYGGVLYVDSLSGEKGPVPTYISLINMTV 313
+ +R+ K+P +F ES++ D+P K VS++T ++ DS++ + +Y S++ +
Sbjct: 241 EKLRQTKVPSLFVESSVDDRPMKTVSQDTNIPIYAQIFTDSIAEQGKEGDSYYSMMKYNL 300

Query: 314 DTIAKGFGQ 322
D IA+G +
Sbjct: 301 DKIAEGLAK 309


36  YpAngola_A2643  YpAngola_A2648  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A2643  -1      16       -3.769094  2-deoxy-D-gluconate 3-dehydrogenase
YpAngola_A2644  -1      17       -4.661514  periplasmic pectate lyase
YpAngola_A2645  -1      19       -5.216534  oligogalacturonide ABC transporter, permease
YpAngola_A2646  0       19       -4.275980  oligogalacturonide ABC transporter, permease
YpAngola_A2647  0       18       -3.841233  oligogalacturonide ABC transporter ATP-binding
YpAngola_A2648  0       19       -4.105278  oligogalacturonide ABC transporter periplasmic
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2643  DHBDHDRGNASE  120    5e-35    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 120 bits (301), Expect = 5e-35
Identities = 75/252 (29%), Positives = 127/252 (50%), Gaps = 11/252 (4%)

Query: 8 LKGKVALVTGCDTGLGQGMAIGLAEAGCDIIGVN-IVEPRETIEQ-VTALGRRFFSLTAD 65
++GK+A +TG G+G+ +A LA G I V+ E E + + A R + AD
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 66 LSNIECIPSLLERAVAEFGHIDILVNNAGIIRREDAINFSEKDWDDVMNVNIKSVFFMSQ 125
+ + I + R E G IDILVN AG++R + S+++W+ +VN VF S+
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 126 AVAKQFIKQGNGGKIINVASMLSYQGGIRVPSYTASKSAVMGVTRLLANEWAKHGINVNA 185
+V+K + + G I+ V S + + +Y +SK+A + T+ L E A++ I N
Sbjct: 126 SVSKYMMDR-RSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184

Query: 186 VAPGYMATNNTQQLRKDEERSKEILD--------RIPAGRWGLPDDLKGPVVFLASKASD 237
V+PG T+ L DE +++++ IP + P D+ V+FL S +
Sbjct: 185 VSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAG 244

Query: 238 YISGYTIAVDGG 249
+I+ + + VDGG
Sbjct: 245 HITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
YpAngola_A2644  PF06917  993    0.0      Periplasmic pectate lyase
		>PF06917#Periplasmic pectate lyase

Length = 555

Score = 993 bits (2568), Expect = 0.0
Identities = 553/555 (99%), Positives = 555/555 (100%)

Query: 1 MGFTGNIKGSDAMMINWLSAIRSYVDLVQSVGHSQLNPSPLLADGFDVLTHQPVVWEFPD 60
MGFTGNIKGSDAMMINWLSAIRSYVDLVQSVGHSQLNPSPLLADGFDVLTHQPVVWEFPD
Sbjct: 1 MGFTGNIKGSDAMMINWLSAIRSYVDLVQSVGHSQLNPSPLLADGFDVLTHQPVVWEFPD 60

Query: 61 GHHTPISNFASQQNWLRTLDALSLVTQDPQYHQQARIQSGYFMQHGVHNESGLFYWGGHR 120
GHHTPISNFASQQNWLRTLDALSLVTQDPQYHQQARIQSGYFMQHGVHNESGLFYWGGHR
Sbjct: 61 GHHTPISNFASQQNWLRTLDALSLVTQDPQYHQQARIQSGYFMQHGVHNESGLFYWGGHR 120

Query: 121 FLNLDTLKTEGPASKDQVYELKHHLPYYDLLITIDRERTLNFLQGFWHAHVEDWKTLDLG 180
FLNLDTLKTEGPASKDQV+ELKHHLPYYDLL+TIDRERTLNFLQGFWHAHVEDWKTLDLG
Sbjct: 121 FLNLDTLKTEGPASKDQVHELKHHLPYYDLLVTIDRERTLNFLQGFWHAHVEDWKTLDLG 180

Query: 181 RHGNYSKQRDPQVFTHPRYDVVNPAELPKLPETKGLTFVNAGTDLIYAAYKYAEYTGDAA 240
RHGNYSKQRDPQVFTHPRYDVVNPAELPKLPETKGLTFVNAGTDLIYAAYKYAEYTGDAA
Sbjct: 181 RHGNYSKQRDPQVFTHPRYDVVNPAELPKLPETKGLTFVNAGTDLIYAAYKYAEYTGDAA 240

Query: 241 AAAWGKHLYRQYVLARNPETGLPVYQFSSPQQRQPIPADDNQTQSWYGDRAKRQFGPEFG 300
AAAWGKHLYRQYVLARNPETGLPVYQFSSPQQRQPIPADDNQTQSWYGDRAKRQFGPEFG
Sbjct: 241 AAAWGKHLYRQYVLARNPETGLPVYQFSSPQQRQPIPADDNQTQSWYGDRAKRQFGPEFG 300

Query: 301 EIAREANVLFRDMRPLLIDNPLAMLDILRQQPDAEVLQWVIDGLKNYYRFAYDVESNTLR 360
EIAREANVLFRDMRPLLIDNPLAMLDILRQQPDAEVLQWVIDGLKNYYRFAYDVESNTLR
Sbjct: 301 EIAREANVLFRDMRPLLIDNPLAMLDILRQQPDAEVLQWVIDGLKNYYRFAYDVESNTLR 360

Query: 361 PLWNDGQDMSGYVLPRDGYYGVKGTVISPFPLDVDYLLPLVRAWRLSEDEELLDLIGVLL 420
PLWNDGQDMSGYVLPRDGYYGVKGTVISPFPLDVDYLLPLVRAWRLSEDEELLDLIGVLL
Sbjct: 361 PLWNDGQDMSGYVLPRDGYYGVKGTVISPFPLDVDYLLPLVRAWRLSEDEELLDLIGVLL 420

Query: 421 LRWQLAELNKTQRRATLMAAQRPIASPYLLLALVELAEHCQCPTLFTLAWQIGDDLFKRH 480
LRWQLAELNKTQRRATLMAAQRPIASPYLLLALVELAEHCQCPTLFTLAWQIGDDLFKRH
Sbjct: 421 LRWQLAELNKTQRRATLMAAQRPIASPYLLLALVELAEHCQCPTLFTLAWQIGDDLFKRH 480

Query: 481 YHRGLFVESAQHRYFRIDNPIALALLTLIAAKQDKLAAIPQFITNGGYIHGDYRVNGESR 540
YHRGLFVESAQHRYFRIDNPIALALLTLIAAKQDKLAAIPQFITNGGYIHGDYRVNGESR
Sbjct: 481 YHRGLFVESAQHRYFRIDNPIALALLTLIAAKQDKLAAIPQFITNGGYIHGDYRVNGESR 540

Query: 541 TLYDIDFIYPTLLNQ 555
TLYDIDFIYPTLLNQ
Sbjct: 541 TLYDIDFIYPTLLNQ 555


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits         Score  E-value  Comments
YpAngola_A2647  BACINVASINB  37     1e-04    Salmonella/Shigella invasin protein B signature.
		>BACINVASINB#Salmonella/Shigella invasin protein B signature.

Length = 593

Score = 37.0 bits (85), Expect = 1e-04
Identities = 24/95 (25%), Positives = 42/95 (44%), Gaps = 10/95 (10%)

Query: 60 EVRIGDRVVNNLAPKSRGIAM-VFQNYALYPHMTVKENLAFGLKLSKLPKDQIEAQVAEA 118
+V +G V N A + G+A VF A E LA L++ DQI+ + ++
Sbjct: 504 KVALGMEVTNTAAQSAGGVAEGVFIKNA-------SEALA-DFMLARFAMDQIQQWLKQS 555

Query: 119 AKIL-ELEDLLDRLPRQLSGGQAQRVAVGRAIVKK 152
+I E + + L + +S Q R I+++
Sbjct: 556 VEIFGENQKVTAELQKAMSSAVQQNADASRFILRQ 590


37  YpAngola_A2666  YpAngola_A2674  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A2666  2       16       -0.752625  PqiA family integral membrane protein
YpAngola_A2667  3       17       -1.870786  hypothetical protein
YpAngola_A2668  4       23       -3.590316  hypothetical protein
YpAngola_A2670  4       24       -4.217139  pilus assembly chaperone
YpAngola_A2671  2       20       -2.550861  fimbrial usher protein
YpAngola_A2672  2       22       -1.413306  hypothetical protein
YpAngola_A2673  1       20       -1.138005  spore coat U domain-containing protein
YpAngola_A2674  2       18       0.631706   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
YpAngola_A2671  PF00577  459    e-151    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 459 bits (1182), Expect = e-151
Identities = 141/825 (17%), Positives = 294/825 (35%), Gaps = 78/825 (9%)

Query: 46 TLYLELVVNDRNFGST-VPISYRNNRYY----LSQSQLRTIGLPISEPLAPEIAIDN--- 97
T +++ +N+ + V + ++ L+++QL ++GL ++ + +
Sbjct: 77 TYRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNT-ASVSGMNLLADDAC 135

Query: 98 ------MAGVNVKYDGENQRLLINVPSEWLPKQQIEVTEQDDFNLAQSSLGALFNYDIYA 151
+ + D QRL + +P ++ + + ++ L NY+
Sbjct: 136 VPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWD--PGINAGLLNYNFSG 193

Query: 152 TQGYPYSSLTHFSAWTEQRIFDRFGLLSNTGVYRTHFPSNNNTDDAKGYIRFDTQWQKND 211
A+ + G + S++++ +K + W + D
Sbjct: 194 NSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERD 253

Query: 212 EEHLL-RYSTGDLITGALPWSSAIRLGGIQIARHFAIRPDLITYPLPQFSGQAAVPSTVD 270
L R + GD T + I G Q+A + PD P G A + V
Sbjct: 254 IIPLRSRLTLGDGYTQGDIFDG-INFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVT 312

Query: 271 LYIDNFRTQSANINPGPFVINNAPRINGAGQATIVTTDALGRQISTSVPFYVASTLLKPG 330
+ + + ++ + PGPF IN+ +G + +A G +VP+ L + G
Sbjct: 313 IKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREG 372

Query: 331 VWDFSLSGGALRRNYAIRSADYGEMVASGVVRYGTTPWLTLEGRGDIAKEMHVIGGGVNF 390
+S++ G R A + + +G T+ G +A G+
Sbjct: 373 HTRYSITAGEYRSGNAQQEKPR---FFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGK 429

Query: 391 RMGLLGVLNSAYSISNTSNGAFNNVAEPLNTNNATPNRLPSPAASRRGRGNQRSLGYSYS 450
MG LG L+ + +N++ + + + + L + + + G N + +GY YS
Sbjct: 430 NMGALGALSVDMTQANST------LPDDSQHDGQSVRFLYNKSLNESGT-NIQLVGYRYS 482

Query: 451 NA-FFNL--------NAQHIISSDEYSD----LANYKTPSLLSRRMTQLTGSLSLGSYGT 497
+ +FN N +I + D +Y + R QLT + LG T
Sbjct: 483 TSGYFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTST 542

Query: 498 V----------GSGYFDVRDALGEQTRLINISYSTSLLHNSNFYSALNRELGRKGYNVQL 547
+ G+ D + G T +I+++ S N + + + L
Sbjct: 543 LYLSGSHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQ------KGRDQMLAL 596

Query: 548 VWSIPLGPR-----------GSSSISATRTNDNQWIQQLNYSRSAPSNGGLGWNL--AYA 594
+IP S+S S + + + + + L +++ YA
Sbjct: 597 NVNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYA 656

Query: 595 NSTNNNNQ-YQQADIVWRTSMMESRMGLYGNSNNYNYWGGLTGSLVVMNRSVYASNMIND 653
+ N+ A + +R + +G + + + G++G ++ V +ND
Sbjct: 657 GGGDGNSGSTGYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLND 716

Query: 654 AFALVSTNGFSNIPVSYENQLIGTTNAKGYLLIPTVASYYQAKFQIDPMNLPADVMLPNV 713
LV G + V ENQ T+ +GY ++P Y + + +D L +V L N
Sbjct: 717 TVVLVKAPGAKDAKV--ENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNA 774

Query: 714 ERRLAIGERSGYLINFPIKRISAVNIRITDASGQDLPKGSAIYTTGNIPISYVGWDGMVY 773
+ + F + + + +T + + LP G+ + + + V +G VY
Sbjct: 775 VANVVPTRGAIVRAEFKARVGIKLLMTLTH-NNKPLPFGAMVTSESSQSSGIVADNGQVY 833

Query: 774 IEQVAQLNNLRI-IRADNGTQCYSQFKLKTTEGIQDAG--TTVCR 815
+ + +++ + C + ++L Q + CR
Sbjct: 834 LSGMPLAGKVQVKWGEEENAHCVANYQLPPESQQQLLTQLSAECR 878


38  YpAngola_A2700  YpAngola_A2713  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A2700  -1      13       -4.304789  hypothetical protein
YpAngola_A2701  0       14       -5.279965  hypothetical protein
YpAngola_A2702  0       18       -6.529330  hypothetical protein
YpAngola_A2703  0       15       -4.557925  OmpA domain-containing protein
YpAngola_A2704  1       19       -5.540760  hypothetical protein
YpAngola_A2705  2       18       -3.387612  LuxR family transcriptional regulator
YpAngola_A2706  2       14       0.913437   hypothetical protein
YpAngola_A2707  0       13       2.368064   hypothetical protein
YpAngola_A2708  0       13       1.471086   murein hydrolase B
YpAngola_A2709  0       16       -1.237623  hypothetical protein
YpAngola_A2710  0       16       -1.245956  iron(III)-binding periplasmic protein
YpAngola_A2711  0       16       -1.564228  iron(III)-transport system permease
YpAngola_A2712  0       16       -2.689016  iron(III)-transport ATP-binding protein
YpAngola_A2713  1       18       -3.329112  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2700  FIMBRIALPAPE  31     0.003    Escherichia coli: P pili tip fibrillum papE protein...
		>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature.

Length = 173

Score = 31.2 bits (70), Expect = 0.003
Identities = 21/83 (25%), Positives = 39/83 (46%), Gaps = 7/83 (8%)

Query: 200 PSCTFDGPQKVDFGIVTSSNL-NNGGIERDLDFNITCKTDYGHYSATAAIFTQTSSADNN 258
P+CT + V++G + NL +GG ++D ++ C G T + ++ N
Sbjct: 37 PACTVQNAE-VNWGDIEIQNLVQSGGNQKDFTVDMNCPYSLG----TMKVTITSNGQTGN 91

Query: 259 YIKVKDSQN-QEDRLLIKISDTN 280
I V ++ D LLI + ++N
Sbjct: 92 SILVPNTSTASGDGLLIYLYNSN 114


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits        Score  E-value  Comments
YpAngola_A2703  OMPADOMAIN  88     5e-21    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 88.1 bits (218), Expect = 5e-21
Identities = 38/129 (29%), Positives = 67/129 (51%), Gaps = 16/129 (12%)

Query: 441 LFDSNSTKLKLNSQT--NEMMMELLSLVERNKEKKILIVGHSDNTGSSSMNMALSEQRAL 498
LF+ N LK Q +++ +L +L K+ ++++G++D GS + N LSE+RA
Sbjct: 222 LFNFNKATLKPEGQAALDQLYSQLSNL--DPKDGSVVVLGYTDRIGSDAYNQGLSERRAQ 279

Query: 499 ALRDWLIKRSDITVDNFITKGMGASEPVATNHTEAGR---------EQNRRVEVLILPTQ 549
++ D+LI + I D +GMG S PV N + + +RRVE+ + +
Sbjct: 280 SVVDYLISKG-IPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEVKGIK 338

Query: 550 DRTRMTEPK 558
D +T+P+
Sbjct: 339 D--VVTQPQ 345


39  YpAngola_A2743  YpAngola_A2766  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A2743  2       13       -0.108734  hypothetical protein
YpAngola_A2744  2       15       -0.450745  NAD-dependent DNA ligase LigA
YpAngola_A2745  3       17       -1.857275  cell division protein ZipA
YpAngola_A2746  2       16       -1.711876  putative sulfate transport protein CysZ
YpAngola_A2748  1       14       -1.770558  PTS system phosphohistidinoprotein-hexose
YpAngola_A2749  0       11       -1.158500  phosphoenolpyruvate-protein phosphotransferase
YpAngola_A2750  -1      10       -0.195800  PTS system glucose-specific transporter
YpAngola_A2751  0       13       -1.221392  IS1541 transposase
YpAngola_A2752  0       15       -2.556932  sensor histidine kinase CpxA
YpAngola_A2753  1       16       -4.020010  transcriptional regulatory protein CpxR
YpAngola_A2754  0       14       -3.869530  RND efflux transporter
YpAngola_A2755  1       13       -4.023042  efflux ABC transporter ATP-binding
YpAngola_A2756  3       19       -7.272192  pyridine nucleotide-disulfide oxidoreductase
YpAngola_A2757  3       16       -5.828608  solute/sodium symporter (SSS) family protein
YpAngola_A2760  0       14       -3.467740  aminotransferase, classes I and II
YpAngola_A2761  0       13       -1.246351  von Willebrand factor type A domain-containing
YpAngola_A2763  1       16       -0.205723  DNA-binding response regulator
YpAngola_A2764  1       14       -0.567601  hypothetical protein
YpAngola_A2765  2       17       2.985520   cysteine synthase B
YpAngola_A2766  2       15       2.233761   sulfate/thiosulfate transporter subunit
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2749  PHPHTRNFRASE  750    0.0      Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 750 bits (1939), Expect = 0.0
Identities = 278/571 (48%), Positives = 392/571 (68%), Gaps = 2/571 (0%)

Query: 1 MISGILVSPGIAFGKALLLKEDEIVINRKKISADQVEQEVERFKAGRAKAAEQLEAIKTK 60
I+GI S G+A KA + E + I + I V E+E+ A K+ E+L AIK +
Sbjct: 4 KITGIAASSGVAIAKAFIHLEPNVDIEKTSI--TDVSTEIEKLTAALEKSKEELRAIKDQ 61

Query: 61 AGVSLGEEKAAIFEGHIMLLEDEELEQEIIALIKDEHASADAAAYSVIEGQAKALEELDD 120
S+G +KA IF H+++L+D EL I I++E +A+ A V + E +D+
Sbjct: 62 TEASMGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDN 121

Query: 121 EYLKERAADVRDIGKRLLKNILGLNIVDLSAIQDEVILVATDLTPSETAQLNLDKVLGFI 180
EY+KERAAD+RD+ KR+L +++G+ L+ I +E +++A DLTPS+TAQLN V GF
Sbjct: 122 EYMKERAADIRDVSKRVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFA 181

Query: 181 TDIGGRTSHTSIMARSLELPAIVGTSNVTKQVKNDDYLILDAVNNKVYLNPTADVIEQLK 240
TDIGGRTSH++IM+RSLE+PA+VGT VT+++++ D +I+D + V +NPT + ++ +
Sbjct: 182 TDIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYE 241

Query: 241 AVKNQYITEKNELAKLKDLPAITLDGHQVEVVANIGTVRDIAGAERNGAEGVGLYRTEFL 300
+ + +K E AKL P+ T DG VE+ ANIGT +D+ G NG EG+GLYRTEFL
Sbjct: 242 EKRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFL 301

Query: 301 FMDRDSLPTEEEQFQAYKAVAEAMGSQAVIVRTMDIGGDKDLPYMNLPKEENPFLGWRAI 360
+MDRD LPTEEEQF+AYK V + M + V++RT+DIGGDK+L Y+ LPKE NPFLG+RAI
Sbjct: 302 YMDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTLDIGGDKELSYLQLPKELNPFLGFRAI 361

Query: 361 RIAMDRKEILHAQLRAILRASAFGKLRIMFPMIISVEEVRELKAELELLKSQLREENKAF 420
R+ +++++I QLRA+LRAS +G L++MFPMI ++EE+R+ KA ++ K +L E
Sbjct: 362 RLCLEKQDIFRTQLRALLRASTYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGVDV 421

Query: 421 DETIEVGVMVETPAAAVIARHLAKEVDFFSIGTNDLTQYTLAVDRGNELISHLYNPMSPS 480
++IEVG+MVE P+ AV A AKEVDFFSIGTNDL QYT+A DR NE +S+LY P P+
Sbjct: 422 SDSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPYHPA 481

Query: 481 VLGLIKQVIDASHAEGKWTGMCGELAGDERATLLLLGMGLDEFSMSAISIPRIKKIIRNT 540
+L L+ VI A+H+EGKW GMCGE+AGDE A LLLG+GLDEFSMSA SI + +
Sbjct: 482 ILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQLLKL 541

Query: 541 NFEDVKVLAEQALAQPTAKELMDLVTTFIEE 571
+ E++K A++AL TA+E+ LV +
Sbjct: 542 SKEELKPFAQKALMLDTAEEVEQLVKKTYLK 572


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
YpAngola_A2752  PF06580  38     5e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 38.3 bits (89), Expect = 5e-05
Identities = 41/231 (17%), Positives = 83/231 (35%), Gaps = 59/231 (25%)

Query: 239 ELRSPLARLQLAIGLAHQNPDNVDNAL----QRIEHESERLDKMIGEL-------LALSR 287
++ S QL A NP + NAL I + + +M+ L L S
Sbjct: 153 KMASMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKAREMLTSLSELMRYSLRYSN 212

Query: 288 AENHSLADD----DEYFDLQEL-------VKVVVNDARYEAQLPGVEIQLEVAAQSEYTV 336
A SLAD+ D Y L + + +N A + Q+P + +Q V
Sbjct: 213 ARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLV-------- 264

Query: 337 KGNAELMRRAIENIVRNALRFSASGQQVKVTLSALDKRYQIQVADQGPGVEENKLSSIFD 396
EN +++ + G ++ + + + ++V + G +N
Sbjct: 265 -----------ENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN------- 306

Query: 397 PFVRVKSAMSGKGYGLGLAITHK-VILAHGGQVEAR-NGEQGGLVITLRVP 445
+ + G GL + + + +G + + + + +QG + + +P
Sbjct: 307 ---------TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
YpAngola_A2753  HTHFIS  89     1e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.1 bits (221), Expect = 1e-22
Identities = 32/122 (26%), Positives = 63/122 (51%), Gaps = 1/122 (0%)

Query: 2 KILLVDDDLELGTMLKEYLGGEGFTAKHVLTGKAGIDGALSGDYTALILDIMLPDMSGID 61
IL+ DDD + T+L + L G+ + +GD ++ D+++PD + D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VLRQVRK-KSRLPIIMLTAKGDNIDRVIGLEMGADDYMPKPCYPRELVARLRAVLRRFEE 120
+L +++K + LP+++++A+ + + E GA DY+PKP EL+ + L +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 121 QP 122
+P
Sbjct: 125 RP 126


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A2754  RTXTOXIND  60  6e-12  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 60.2 bits (146), Expect = 6e-12
Identities = 40/259 (15%), Positives = 87/259 (33%), Gaps = 25/259 (9%)

Query: 27 FRWISPPDKPSYITAVAEIRDLEQTVLADGTIKAQKQVTVGAQVSGQIKALHVTLGQQVE 86
F+ +S + + + E Q + K+ V +I +
Sbjct: 176 FQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKS 235

Query: 87 KNQLVAEI--DDLAQQNALKDAEEALKNVQAQRAAKIA--TQKNNQLTYQRQQQILAKGV 142
+ + + ++A+ + E + + Q +++ +++ L
Sbjct: 236 RLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQL---- 291

Query: 143 GVRADFDS-IKATLEATQAEISALDAQIAQAEIAVSTAKLNLGYTKISSPIAGTVVAIPV 201
V F + I L T I L ++A+ E + I +P++ V + V
Sbjct: 292 -VTQLFKNEILDKLRQTTDNIGLLTLELAKNE-------ERQQASVIRAPVSVKVQQLKV 343

Query: 202 -EEGQTVNAVQSAPTIIKVAQLDTMTVEAQISEADVVKVKTGMPVYFTILGEPEKRF--- 257
EG V + ++ V + DT+ V A + D+ + G + P R+
Sbjct: 344 HTEGGVVTTAE--TLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYL 401

Query: 258 SATLRAIEPAPDSINDETT 276
++ I D+I D+
Sbjct: 402 VGKVKNI--NLDAIEDQRL 418



Score = 48.7 bits (116), Expect = 3e-08
Identities = 17/167 (10%), Positives = 57/167 (34%), Gaps = 17/167 (10%)

Query: 10 RLIGWVVLLLIIGGLLFFRWISPPDKPSYITAVAEIRDLEQTVLADGTIKAQKQV-TVGA 68
RL+ + ++ ++ + + + +E A+G + + +
Sbjct: 58 RLVAYFIMGFLVIAFI---L-------------SVLGQVEIVATANGKLTHSGRSKEIKP 101

Query: 69 QVSGQIKALHVTLGQQVEKNQLVAEIDDLAQQNALKDAEEALKNVQAQRAAKIATQKNNQ 128
+ +K + V G+ V K ++ ++ L + + +L + ++ ++ +
Sbjct: 102 IENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIE 161

Query: 129 LTYQRQQQILAKGVGVRADFDSIKATLEATQAEISALDAQIAQAEIA 175
L + ++ + + + + + S Q Q E+
Sbjct: 162 LNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELN 208


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A2763  HTHFIS  93  8e-24  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 92.6 bits (230), Expect = 8e-24
Identities = 33/134 (24%), Positives = 62/134 (46%)

Query: 2 KILIAEDNAHIRNGLMEVLAHEGYRPIAAENGVQALALYRQQQPDFIILDIMMPELDGYK 61
IL+A+D+A IR L + L+ GY N D ++ D++MP+ + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VCREIRKHDWQTPIIFLSAKDEEIDRVIGLELGADDYISKPFGIHEMRARIKTIVRRCLR 121
+ I+K P++ +SA++ + + E GA DY+ KPF + E+ I + R
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 KVPESAEDAGFPFG 135
+ + +D+
Sbjct: 125 RPSKLEDDSQDGMP 138


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A2766  PF05272  29  0.041  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.9 bits (64), Expect = 0.041
Identities = 10/23 (43%), Positives = 14/23 (60%)

Query: 30 MVALLGPSGSGKTTLLRIIAGLE 52
V L G G GK+TL+ + GL+
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLD 620


40  YpAngola_A2884  YpAngola_A2924  Y        Y        Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
YpAngola_A2884  3       14       0.212323    hypothetical protein
YpAngola_A2885  3       14       0.189494    primosomal replication protein n''
YpAngola_A2886  1       12       1.437114    hypothetical protein
YpAngola_A2887  0       13       1.040121    hypothetical protein
YpAngola_A2888  0       10       0.130512    adenine phosphoribosyltransferase
YpAngola_A2889  1       12       -0.149691   DNA polymerase III subunits gamma and tau
YpAngola_A2890  1       17       -2.270176   hypothetical protein
YpAngola_A2891  0       19       -3.970348   recombination protein RecR
YpAngola_A2892  0       19       -4.865730   hypothetical protein
YpAngola_A2893  1       22       -5.889108   heat shock protein 90
YpAngola_A2894  2       31       -8.828501   adenylate kinase
YpAngola_A2895  1       23       -7.557412   ferrochelatase
YpAngola_A2896  0       23       -7.751393   CDP-6-deoxy-delta-3,4-glucoseen reductase
YpAngola_A2897  0       19       -5.088903   glucose-1-phosphate cytidylyltransferase
YpAngola_A2900  1       20       -4.226923   paratose synthase
YpAngola_A2901  1       20       -2.746692   hypothetical protein
YpAngola_A2902  2       21       0.422503    IS285 transposase
YpAngola_A2903  2       17       0.053643    hypothetical protein
YpAngola_A2905  3       16       0.544132    insertion sequence transposase
YpAngola_A2906  3       16       -0.225511   transposase/IS protein
YpAngola_A2907  2       16       -0.988679   hypothetical protein
YpAngola_A2908  1       17       -0.030079   nucleoside triphosphate pyrophosphohydrolase
YpAngola_A2909  1       16       0.211989    preprotein translocase subunit SecA
YpAngola_A2910  0       15       0.667236    SecA regulator SecM
YpAngola_A2911  0       14       1.090685    hypothetical protein
YpAngola_A2912  0       13       2.100780    UDP-3-O-[3-hydroxymyristoyl] N-acetylglucosamine
YpAngola_A2913  1       14       2.703348    cell division protein FtsZ
YpAngola_A2914  1       11       2.935496    cell division protein FtsA
YpAngola_A2916  2       14       3.119701    D-alanine--D-alanine ligase
YpAngola_A2917  2       14       3.357330    UDP-N-acetylmuramate--L-alanine ligase
YpAngola_A2918  2       14       3.805613    undecaprenyldiphospho-muramoylpentapeptide
YpAngola_A2919  1       14       3.514914    cell division protein FtsW
YpAngola_A2920  1       14       3.616505    UDP-N-acetylmuramoyl-L-alanyl-D-glutamate
YpAngola_A2921  0       13       3.298261    phospho-N-acetylmuramoyl-pentapeptide-
YpAngola_A2922  0       13       3.587623    UDP-N-acetylmuramoyl-tripeptide--D-alanyl-D-
YpAngola_A2923  -1      13       3.345630    UDP-N-acetylmuramoylalanyl-D-glutamate--2,
YpAngola_A2924  -2      12       3.062977    penicillin-binding protein 3
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A2900  NUCEPIMERASE  66  1e-14  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 66.0 bits (161), Expect = 1e-14
Identities = 59/321 (18%), Positives = 115/321 (35%), Gaps = 70/321 (21%)

Query: 1 MKILITGVSGYLGSQLANALMLE-HEVVGTVRAGSVCNRITDIGNVNL------------ 47
MK L+TG +G++G ++ L+ H+VVG + + D +V+L
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGI-------DNLNDYYDVSLKQARLELLAQPG 53

Query: 48 -----INVTDSGWIDKVL-SFSPDVVINTAALYGRKGELLS--ELVDANIQFPLRILE-- 97
I++ D + + S + V + + L + D+N+ L ILE
Sbjct: 54 FQFHKIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGC 113

Query: 98 --------MLVST----GKGLFFQCGTSLPAD--VSQYALTKNQFTELAREYCNKFSGKF 143
+ S+ G T D VS YA TK +A Y + +
Sbjct: 114 RHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPA 173

Query: 144 IELKLEHFFGPFDDST----KFTTYVINSCRSHSDLKL-TAGLQRRDFIYINDLINA--- 195
L+ +GP+ KFT + + + G +RDF YI+D+ A
Sbjct: 174 TGLRFFTVYGPWGRPDMALFKFT----KAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIR 229

Query: 196 ------------FKIMISKSESLISGESISIGSGHAVTIKEFVETVAKMTSYQGNLQFGA 243
+ + S+ +IG+ V + ++++ + +
Sbjct: 230 LQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNM-- 287

Query: 244 IPTRENELMYSCASLARIQEL 264
+P + +++ + A + E+
Sbjct: 288 LPLQPGDVLETSADTKALYEV 308


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A2905  HTHTETR  28  0.047  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.1 bits (62), Expect = 0.047
Identities = 12/106 (11%), Positives = 31/106 (29%), Gaps = 14/106 (13%)

Query: 16 QGMSSRAIARELGISRNTVKRYLQAKSEPPKYTPRPAVASLLDEYRDYIRQRIAD----- 70
S IA+ G++R + + + KS+ + + + I + +
Sbjct: 30 SSTSLGEIAKAAGVTRGAIYWHFKDKSD--------LFSEIWELSESNIGELELEYQAKF 81

Query: 71 -AHPYKIPATVIAREIRDQGYRGGMTILRAFIRSLSVPQEQEPAVR 115
P + ++ + +L I + V+
Sbjct: 82 PGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A2909  SECA  1373  0.0  SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 1373 bits (3556), Expect = 0.0
Identities = 805/904 (89%), Positives = 852/904 (94%), Gaps = 3/904 (0%)

Query: 1 MLIKLLTKVFGSRNDRTLRRMQKVVDVINRMEPDIEKLTDTELRAKTDEFRERLAKGEVL 60
MLIKLLTKVFGSRNDRTLRRM+KVV++IN MEP++EKL+D EL+ KT EFR RL KGEVL
Sbjct: 1 MLIKLLTKVFGSRNDRTLRRMRKVVNIINAMEPEMEKLSDEELKGKTAEFRARLEKGEVL 60

Query: 61 ENLIPEAFAVVREASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNA 120
ENLIPEAFAVVREASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNA
Sbjct: 61 ENLIPEAFAVVREASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNA 120

Query: 121 LSGRGVHVVTVNDYLAQRDAENNRPLFEFLGLSIGINLPNMTAPAKRAAYAADITYGTNN 180
L+G+GVHVVTVNDYLAQRDAENNRPLFEFLGL++GINLP M APAKR AYAADITYGTNN
Sbjct: 121 LTGKGVHVVTVNDYLAQRDAENNRPLFEFLGLTVGINLPGMPAPAKREAYAADITYGTNN 180

Query: 181 EFGFDYLRDNMAFSPEERVQRQLHYALVDEVDSILIDEARTPLIISGPAEDSSEMYIRVN 240
E+GFDYLRDNMAFSPEERVQR+LHYALVDEVDSILIDEARTPLIISGPAEDSSEMY RVN
Sbjct: 181 EYGFDYLRDNMAFSPEERVQRKLHYALVDEVDSILIDEARTPLIISGPAEDSSEMYKRVN 240

Query: 241 KLIPKLIRQEKEDSDSFQGEGHFSVDEKSRQVHLTERGLILIEQMLVEAGIMDEGESLYS 300
K+IP LIRQEKEDS++FQGEGHFSVDEKSRQV+LTERGL+LIE++LV+ GIMDEGESLYS
Sbjct: 241 KIIPHLIRQEKEDSETFQGEGHFSVDEKSRQVNLTERGLVLIEELLVKEGIMDEGESLYS 300

Query: 301 PANIMLMHHVTAALRAHVLFTRDVDYIVKDGEVIIVDEHTGRTMQGRRWSDGLHQAVEAK 360
PANIMLMHHVTAALRAH LFTRDVDYIVKDGEVIIVDEHTGRTMQGRRWSDGLHQAVEAK
Sbjct: 301 PANIMLMHHVTAALRAHALFTRDVDYIVKDGEVIIVDEHTGRTMQGRRWSDGLHQAVEAK 360

Query: 361 EGVEIQNENQTLASITFQNYFRLYEKLAGMTGTADTEAFEFSSIYKLDTIVVPTNRPMIR 420
EGV+IQNENQTLASITFQNYFRLYEKLAGMTGTADTEAFEFSSIYKLDT+VVPTNRPMIR
Sbjct: 361 EGVQIQNENQTLASITFQNYFRLYEKLAGMTGTADTEAFEFSSIYKLDTVVVPTNRPMIR 420

Query: 421 KDLADLVYMTEQEKIGAIIEDIRERTANGQPVLVGTISIEKSEVVSAELTKAGIEHKVLN 480
KDL DLVYMTE EKI AIIEDI+ERTA GQPVLVGTISIEKSE+VS ELTKAGI+H VLN
Sbjct: 421 KDLPDLVYMTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSELVSNELTKAGIKHNVLN 480

Query: 481 AKFHAMEAEIVSQAGQPGAVTIATNMAGRGTDIVLGGSWQSEIAALEDPTEEQIAAIKAA 540
AKFHA EA IV+QAG P AVTIATNMAGRGTDIVLGGSWQ+E+AALE+PT EQI IKA
Sbjct: 481 AKFHANEAAIVAQAGYPAAVTIATNMAGRGTDIVLGGSWQAEVAALENPTAEQIEKIKAD 540

Query: 541 WQIRHDAVLASGGLHIIGTERHESRRIDNQLRGRAGRQGDAGSSRFYLSMEDALMRIFAS 600
WQ+RHDAVL +GGLHIIGTERHESRRIDNQLRGR+GRQGDAGSSRFYLSMEDALMRIFAS
Sbjct: 541 WQVRHDAVLEAGGLHIIGTERHESRRIDNQLRGRSGRQGDAGSSRFYLSMEDALMRIFAS 600

Query: 601 DRVSGMMRKLGMKPGEAIEHPWVTKAIANAQRKVESRNFDIRKQLLEYDDVANDQRRAIY 660
DRVSGMMRKLGMKPGEAIEHPWVTKAIANAQRKVESRNFDIRKQLLEYDDVANDQRRAIY
Sbjct: 601 DRVSGMMRKLGMKPGEAIEHPWVTKAIANAQRKVESRNFDIRKQLLEYDDVANDQRRAIY 660

Query: 661 SQRNELLDVSDVSETINSIREDVFKTTIDSYIPTQSLEEMWDIEGLEQRLKNDFDLDMPI 720
SQRNELLDVSDVSETINSIREDVFK TID+YIP QSLEEMWDI GL++RLKNDFDLD+PI
Sbjct: 661 SQRNELLDVSDVSETINSIREDVFKATIDAYIPPQSLEEMWDIPGLQERLKNDFDLDLPI 720

Query: 721 AKWLEDEPQLHEETLRERILQQAIETYQRKEEVVGIEMMRNFEKGVMLQTLDSLWKEHLA 780
A+WL+ EP+LHEETLRERIL Q+IE YQRKEEVVG EMMR+FEKGVMLQTLDSLWKEHLA
Sbjct: 721 AEWLDKEPELHEETLRERILAQSIEVYQRKEEVVGAEMMRHFEKGVMLQTLDSLWKEHLA 780

Query: 781 AMDYLRQGIHLRGYAQKDPKQEYKRESFAMFAAMLESLKYEVISVLSKVQVRMPEEVEAL 840
AMDYLRQGIHLRGYAQKDPKQEYKRESF+MFAAMLESLKYEVIS LSKVQVRMPEEVE L
Sbjct: 781 AMDYLRQGIHLRGYAQKDPKQEYKRESFSMFAAMLESLKYEVISTLSKVQVRMPEEVEEL 840

Query: 841 EVQRREEAERLARQQQLSHQTDNSALMSEEEVKVANSLERKVGRNDPCPCGSGKKYKQCH 900
E QRR EAERLA+ QQLSHQ D+SA + + ERKVGRNDPCPCGSGKKYKQCH
Sbjct: 841 EQQRRMEAERLAQMQQLSHQDDDSAAAAALAAQTG---ERKVGRNDPCPCGSGKKYKQCH 897

Query: 901 GRLQ 904
GRLQ
Sbjct: 898 GRLQ 901


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A2914  SHAPEPROTEIN  53  7e-10  Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 53.2 bits (128), Expect = 7e-10
Identities = 47/201 (23%), Positives = 72/201 (35%), Gaps = 18/201 (8%)

Query: 171 IVKAVERCGLKVDQLIFAGLAASYAVLTEDERELGVCVVDIGGGTMDMAVYTGGALRHTK 230
I ++ + G + LI +AA+ G VVDIGGGT ++AV + + ++
Sbjct: 126 IRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNGVVYSS 185

Query: 231 VIPYAGNVVTSDI------AYAFGTPPTDAEAIKVRHGCALGSIVSKDESVEVPSVGGRP 284
+ G+ I Y AE IK G A + V V GR
Sbjct: 186 SVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAY-----PGDEVREIEVRGRN 240

Query: 285 -----PRSLQRQTLAEVIEPRYTELLNLVNDEILQLQEQLRQQGVKHHLAAGIVLTGGAA 339
PR E++E E L + ++ EQ + G+VLTGG A
Sbjct: 241 LAEGVPRGF-TLNSNEILEA-LQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGA 298

Query: 340 QIDGLAECAQRVFHAQVRIGQ 360
+ L V + +
Sbjct: 299 LLRNLDRLLMEETGIPVVVAE 319


41  YpAngola_A2941  YpAngola_A3018  Y        Y        Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
YpAngola_A2941  1       17       3.039280    hypothetical protein
YpAngola_A2942  0       18       3.472830    transcriptional regulator SgrR
YpAngola_A2943  0       19       3.584745    thiamine transporter substrate binding subunit
YpAngola_A2944  -1      18       3.557648    thiamine transporter membrane protein
YpAngola_A2945  -1      16       2.668419    thiamine transporter ATP-binding subunit
YpAngola_A2946  -2      16       2.455255    hypothetical protein
YpAngola_A2947  -1      15       2.620445    DNA polymerase II
YpAngola_A2948  0       16       2.061860    ATP-dependent helicase HepA
YpAngola_A2949  0       18       1.267174    IS1541 transposase
YpAngola_A2950  -1      15       1.486997    hypothetical protein
YpAngola_A2952  1       19       2.744283    hypothetical protein
YpAngola_A2954  3       19       2.944334    putative lipoprotein
YpAngola_A2956  3       19       2.703254    hypothetical protein
YpAngola_A2955  3       18       2.191250    hypothetical protein
YpAngola_A2958  3       17       1.458506    pentapeptide repeat-containing protein
YpAngola_A2959  4       18       1.314601    hypothetical protein
YpAngola_A2960  1       18       -1.056024   insertion sequence transposase
YpAngola_A2961  0       17       -2.410309   transposase/IS protein
YpAngola_A2963  1       18       -3.156921   FAD binding domain-containing protein
YpAngola_A2964  3       22       -6.136633   short chain dehydrogenase/reductase family
YpAngola_A2965  2       21       -6.514853   hypothetical protein
YpAngola_A2967  1       21       -6.794745   beta-ketoacyl synthase family protein
YpAngola_A2968  1       23       -7.337897   short-chain dehydrogenase/reductase family
YpAngola_A2969  2       22       -8.127362   putative hydroxymethylglutaryl-coenzyme A
YpAngola_A2970  1       24       -10.311124  enoyl-CoA hydratase/isomerase family protein
YpAngola_A2971  1       27       -11.090178  polyketide biosynthesis enoyl-CoA hydratase
YpAngola_A2972  2       30       -10.918568  hypothetical protein
YpAngola_A2973  1       25       -8.458411   hypothetical protein
YpAngola_A2975  0       18       -2.427705   hypothetical protein
YpAngola_A2974  -1      21       -0.677551   hypothetical protein
YpAngola_A2976  0       31       4.867449    putative acyl carrier protein
YpAngola_A2977  -1      35       6.332724    putative acyltransferase
YpAngola_A2978  1       40       7.466983    hypothetical protein
YpAngola_A2979  0       40       7.522344    hypothetical protein
YpAngola_A2981  0       36       7.337398    hypothetical protein
YpAngola_A2982  0       35       8.492699    AAA ATPase
YpAngola_A2983  -1      32       8.391511    Rhs element Vgr protein
YpAngola_A2985  -1      25       6.312645    hypothetical protein
YpAngola_A2986  -1      23       5.379114    hypothetical protein
YpAngola_A2988  -2      26       6.236808    hypothetical protein
YpAngola_A2989  -2      28       6.438974    hypothetical protein
YpAngola_A2990  -1      25       4.436688    ImpA domain-containing protein
YpAngola_A2991  0       26       2.701665    PAAR domain-containing protein
YpAngola_A2992  -1      30       6.059404    hypothetical protein
YpAngola_A2993  -1      31       6.713599    hypothetical protein
YpAngola_A2994  -2      19       3.419650    hypothetical protein
YpAngola_A2995  -2      13       0.059574    hypothetical protein
YpAngola_A2996  -2      11       -0.285671   hypothetical protein
YpAngola_A2997  -2      9        -0.581385   ImpA domain-containing protein
YpAngola_A2998  0       15       -4.715057   hypothetical protein
YpAngola_A2999  -1      17       -4.688776   ABC transporter ATP-binding protein
YpAngola_A3000  1       29       -8.335634   hypothetical protein
YpAngola_A3001  1       22       -6.225390   molybdopterin biosynthesis protein MoeB
YpAngola_A3002  0       26       -8.456372   molybdopterin biosynthesis protein MoeA
YpAngola_A3003  2       28       -10.392353  radical SAM domain-containing protein
YpAngola_A3004  1       27       -10.844362  ABC transporter ATP-binding protein
YpAngola_A3005  1       24       -9.083304   RND family efflux transporter MFP subunit
YpAngola_A3007  -1      17       -4.869981   IS285 transposase
YpAngola_A3006  -1      18       -4.514795   hypothetical protein
YpAngola_A3008  -2      14       -0.223400   hypothetical protein
YpAngola_A3009  -1      10       0.794579    hypothetical protein
YpAngola_A3010  -2      10       1.508203    S-formylglutathione hydrolase
YpAngola_A3011  -1      11       0.212936    alcohol dehydrogenase
YpAngola_A3012  -1      12       -2.246881   LysR family substrate-binding transcriptional
YpAngola_A3014  -2      14       -3.785771   GTP cyclohydrolase I
YpAngola_A3015  -1      14       -3.730573   hypothetical protein
YpAngola_A3016  -1      16       -4.632888   hypothetical protein
YpAngola_A3017  -2      13       -3.643569   galactose-binding protein
YpAngola_A3018  -1      14       -3.041543   galactose/methyl galaxtoside transporter
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A2944  PF06580  35  5e-04  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 35.2 bits (81), Expect = 5e-04
Identities = 18/82 (21%), Positives = 28/82 (34%), Gaps = 5/82 (6%)

Query: 2 AIRRQPLIAPWLWPGLLAAGLIVAVALLAFAAIWHHAPTADWQSVWHDR-YLWHVIRFTF 60
A R WL + L V A + +W A T S+W ++
Sbjct: 58 AYRSFIKRQGWLKLNMGQIILRVLPACVVIGMVWFVANT----SIWRLLAFINTKPVAFT 113

Query: 61 WQAFLSALLSVIPAIFLARALY 82
LS + +V+ F+ LY
Sbjct: 114 LPLALSIIFNVVVVTFMWSLLY 135


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A2952  OMPADOMAIN  61  4e-12  OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 60.7 bits (147), Expect = 4e-12
Identities = 40/124 (32%), Positives = 52/124 (41%), Gaps = 18/124 (14%)

Query: 391 FTSDGAFRTGEATLSEEFINK-KNIERLGLALAPWPGDIEVIGHTDNKPFRSTSGNNNLK 449
SD F +ATL E + L P G + V+G+TD R S N
Sbjct: 217 LKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTD----RIGSDAYNQG 272

Query: 450 LSAARASVVADKLRESTQINETHQREISAIGRGESDPLADNATEEGRKR---------NR 500
LS RA V D L S I ISA G GES+P+ N + ++R +R
Sbjct: 273 LSERRAQSVVDYL-ISKGIPADK---ISARGMGESNPVTGNTCDNVKQRAALIDCLAPDR 328

Query: 501 RVDI 504
RV+I
Sbjct: 329 RVEI 332


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A2959  ICENUCLEATIN  32  0.008  Ice nucleation protein signature.
		>ICENUCLEATIN#Ice nucleation protein signature.

Length = 1258

Score = 32.0 bits (72), Expect = 0.008
Identities = 51/236 (21%), Positives = 88/236 (37%), Gaps = 8/236 (3%)

Query: 407 TGMSVSATGISVSTTGTSLSVTGMSTSVTGVSVGFTLIGTS--FTGVSTSFTGVGTSFTG 464
+G + I ++T G++LS T S + G T +S G ++ T S
Sbjct: 150 SGSTQPTQTIEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLV 209

Query: 465 ASNSLTGVSNSMTGCSSSFTGTSNSMTGSSHSMTGMSTSITGHSMSQ-TGSSSSITGDST 523
A T + + + + T M GS + ST G S G S+ T
Sbjct: 210 AGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGED 269

Query: 524 SFTGSSVSSTGSSVSTTGVSTSTTGSSTSTTGCSVSTTGSSTSTTGNSVSMTG----NST 579
S + ST ++ + ++ + T+ S+ ST T G + T T
Sbjct: 270 SSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQT 329

Query: 580 STTGCSISTTGSSIGTVGSSISTTGSSVSTTGSSISTTGLSVSYTGAQYSDVGVDL 635
+ G ++ S GT G S+ + +T ++ + L+ Y Q + G DL
Sbjct: 330 AQKGSDLTAGYGSTGTAGDD-SSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDL 384



Score = 30.9 bits (69), Expect = 0.022
Identities = 31/143 (21%), Positives = 63/143 (44%), Gaps = 6/143 (4%)

Query: 492 GSSHSMTGMSTSITGHSMSQTGSSSSITGDSTSFTGSSVSSTGSSVSTTGVSTSTTGSST 551
GS+ + S+ I G+ +QT +SI T+ GS+ ++ S T G +++T +
Sbjct: 629 GSTSTAGADSSLIAGYGSTQTAGYNSIL---TAGYGSTQTAQEGSDLTAGYGSTSTAGAD 685

Query: 552 STTGCSVSTTGSSTSTTGNSVSMTGNSTSTTGCSISTTGSSIGTVGSS---ISTTGSSVS 608
S+ +T ++ + + T+ G +++ S T G+ I+ GS+ +
Sbjct: 686 SSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQT 745

Query: 609 TTGSSISTTGLSVSYTGAQYSDV 631
+ S T G + T + S +
Sbjct: 746 ASYHSSLTAGYGSTQTAREQSVL 768



Score = 30.1 bits (67), Expect = 0.038
Identities = 32/143 (22%), Positives = 62/143 (43%), Gaps = 6/143 (4%)

Query: 492 GSSHSMTGMSTSITGHSMSQTGSSSSITGDSTSFTGSSVSSTGSSVSTTGVSTSTTGSST 551
GS+ + S+ I G+ +QT S T+ GS+ ++ S TG +++T +
Sbjct: 485 GSTSTAGYESSLIAGYGSTQTAGYGSTL---TAGYGSTQTAQNESDLITGYGSTSTAGAN 541

Query: 552 STTGCSVSTTGSSTSTTGNSVSMTGNSTSTTGCSISTTGSSIGTVGSS---ISTTGSSVS 608
S+ +T +++ + + T+ G ++ S GT GS I+ GS+ +
Sbjct: 542 SSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQT 601

Query: 609 TTGSSISTTGLSVSYTGAQYSDV 631
+ S T G + T + S +
Sbjct: 602 ASYHSSLTAGYGSTQTAREQSVL 624


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A2960  HTHTETR  28  0.047  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.1 bits (62), Expect = 0.047
Identities = 12/106 (11%), Positives = 31/106 (29%), Gaps = 14/106 (13%)

Query: 16 QGMSSRAIARELGISRNTVKRYLQAKSEPPKYTPRPAVASLLDEYRDYIRQRIAD----- 70
S IA+ G++R + + + KS+ + + + I + +
Sbjct: 30 SSTSLGEIAKAAGVTRGAIYWHFKDKSD--------LFSEIWELSESNIGELELEYQAKF 81

Query: 71 -AHPYKIPATVIAREIRDQGYRGGMTILRAFIRSLSVPQEQEPAVR 115
P + ++ + +L I + V+
Sbjct: 82 PGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A2964  DHBDHDRGNASE  104  3e-29  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 104 bits (261), Expect = 3e-29
Identities = 67/252 (26%), Positives = 106/252 (42%), Gaps = 10/252 (3%)

Query: 3 KTILITGALSGIGNTATKLFSEMGYNVVFSGRRPEEGRVILDDLKRINKDVLYVNADMNS 62
K ITGA GIG + + G ++ PE+ ++ LK + AD+
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 63 ESDIKHLIEMTLERFGSLDVAVNCAGTVGETAEIQAVTQDNFHLVFNTNVLGTLLAMKYQ 122
+ I + G +D+ VN AG + I +++ + + F+ N G A +
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVL-RPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 123 IPVMVERGKGSIINISSIAGLVGLPSTGIYVASKHAIEGLTKTAALEVATTGVRINSISP 182
M++R GSI+ + S V S Y +SK A TK LE+A +R N +SP
Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 183 GPVEGKMFDRFLGHDENNKKAFIE--------MMPNKRFTTQEEVAHTIVFLAEDNVTAI 234
G E M L DEN + I+ +P K+ ++A ++FL I
Sbjct: 188 GSTETDM-QWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246

Query: 235 TGQTITIDGGYT 246
T + +DGG T
Sbjct: 247 TMHNLCVDGGAT 258


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A2968  DHBDHDRGNASE  122  6e-36  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 122 bits (307), Expect = 6e-36
Identities = 76/251 (30%), Positives = 118/251 (47%), Gaps = 16/251 (6%)

Query: 2 NLFISGGASGIGRSVVIAALSKGWNV-GFSYHNNKEGAQQLLDIAVAEFPRQLCRAYQLD 60
FI+G A GIG +V S+G ++ Y+ K A A + D
Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEA----FPAD 65

Query: 61 VIDSGAVEYVGDRLLVDFSNIDAVVCNAGIDLPGNLVSMTDEDWALVLNTNLTGTFYLIR 120
V DS A++ + R+ + ID +V AG+ PG + S++DE+W + N TG F R
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 121 YFLPLFLANKYGRIVTL-SSLAKDGSSGQAAYAASKAGLVGLTKTTAKEYGHFGITANVV 179
+ + G IVT+ S+ A + AAYA+SKA V TK E + I N+V
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 180 VPGLINTEI-----IGDD-----IKGIKNFFAQYAPVGRLGSPSEVAEAILFLVAKESSY 229
PG T++ ++ IKG F P+ +L PS++A+A+LFLV+ ++ +
Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245

Query: 230 VNGAVFNVTGG 240
+ V GG
Sbjct: 246 ITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A2972  DHBDHDRGNASE  73  3e-17  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 72.8 bits (178), Expect = 3e-17
Identities = 54/255 (21%), Positives = 95/255 (37%), Gaps = 32/255 (12%)

Query: 10 VLVTGGTKGIGRATVESFVKAGAKVYGTYFWGDNLDELENHFSQYLNRPVFLQADISDEE 69
+TG +GIG A + GA + + + L+++ + AD+ D
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSA 70

Query: 70 ITTQLIEKIAQENKKIDILILNAAFAPQFKDTYKFRGLLDSIEHNSWPLITYIDC----- 124
++ +I +E IDIL+ N A + GL+ S+ W ++
Sbjct: 71 AIDEITARIEREMGPIDILV-NVAGVLRP-------GLIHSLSDEEWEATFSVNSTGVFN 122

Query: 125 -----IKQHFGQYPGYVVAITSEGHRSCHITGYDYVAASKAVLETLTKYIG---ARENII 176
K + G +V + S + Y A+SKA TK +G A NI
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAY-ASSKAAAVMFTKCLGLELAEYNIR 181

Query: 177 INCISPGVVDTEAFELVFGKK--AQAFIRKFDPDF--------IVSPEAVGNVSVALCSG 226
N +SPG +T+ ++ + A+ I+ F + P + + + L SG
Sbjct: 182 CNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSG 241

Query: 227 LMDAVRGQVITVDNG 241
+ + VD G
Sbjct: 242 QAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A2982  DPTHRIATOXIN  30  0.034  Diphtheria toxin signature.
		>DPTHRIATOXIN#Diphtheria toxin signature.

Length = 567

Score = 30.5 bits (68), Expect = 0.034
Identities = 19/54 (35%), Positives = 24/54 (44%), Gaps = 9/54 (16%)

Query: 622 GIGKTETALALADSLFGGEKSLITINLSEYQEAHTVSQLKGSPPGYVGYGQGGV 675
GIG +A A AD + KS + N S Y G+ PGYV Q G+
Sbjct: 23 GIGAPPSAHAGADDVVDSSKSFVMENFSSYH---------GTKPGYVDSIQKGI 67


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A2992  MICOLLPTASE  30  0.003  Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 30.1 bits (67), Expect = 0.003
Identities = 16/69 (23%), Positives = 26/69 (37%)

Query: 35 YVYSSESTYGVEPNEKEVEEIIKMKPDVIDPGETLKLAPSILSLLKKNIRKDTGWRIGGR 94
Y+S G + + VE + ++ + E + +LS K NI K G
Sbjct: 199 IQYNSNFRLGTKAQDGVVEALGRLIGNASADPEVINNCIYVLSDFKDNIDKYGSNYSKGN 258

Query: 95 YSFNSVGGG 103
FN + G
Sbjct: 259 AVFNLMKGI 267


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A2999  PF05272  32  0.009  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 32.0 bits (72), Expect = 0.009
Identities = 11/18 (61%), Positives = 13/18 (72%)

Query: 408 GPNGIGKSTLLKTLLGEY 425
G GIGKSTL+ TL+G
Sbjct: 603 GTGGIGKSTLINTLVGLD 620


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A3002  DPTHRIATOXIN  35  5e-04  Diphtheria toxin signature.
		>DPTHRIATOXIN#Diphtheria toxin signature.

Length = 567

Score = 35.1 bits (80), Expect = 5e-04
Identities = 40/135 (29%), Positives = 55/135 (40%), Gaps = 41/135 (30%)

Query: 3 HCNTSDLLSLEQALTK-MLSQATPLPATEVIPLSEAAGRITASAIT----------SPIA 51
H NT ++++ AL+ M++QA PL E++ + AA S I P
Sbjct: 354 HHNTEEIVAQSIALSSLMVAQAIPL-VGELVDIGFAAYNFVESIINLFQVVHNSYNRPAY 412

Query: 52 VP-----PFANSAMDGYAVRWHELSDEI--------------------PLPVAGVAFAGA 86
P PF + DGYAV W+ + D I PLP+AGV
Sbjct: 413 SPGHKTQPFLH---DGYAVSWNTVEDSIIRTGFQGESGHDIKITAENTPLPIAGVLLPTI 469

Query: 87 PFK-DVWPEKTCIRI 100
P K DV KT I +
Sbjct: 470 PGKLDVNKSKTHISV 484


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A3005  RTXTOXIND  32  0.004  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 32.1 bits (73), Expect = 0.004
Identities = 30/177 (16%), Positives = 61/177 (34%), Gaps = 26/177 (14%)

Query: 114 EATSRMADIMEQINSLRNMRMRLEQDSRDTQFSLQEAQ-------HQIDIISKDLRRYKI 166
E + I EQ ++ +N + + E + + + + L +
Sbjct: 183 EVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSS 242

Query: 167 LDKKFLIAKSEL---ERQADRLIN---------WKVKSDILQK------HNSRNQKSFPS 208
L K IAK + E + +N +++S+IL +
Sbjct: 243 LLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILD 302

Query: 209 QFKNIDESIILLEKMMKMIEVGIEQLVIIAPIDGTLSVLDI-ELGQQIKSGEKISVI 264
+ + ++I LL + E + VI AP+ + L + G + + E + VI
Sbjct: 303 KLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVI 359


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A3018  PF05272  32  0.007  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 32.0 bits (72), Expect = 0.007
Identities = 21/74 (28%), Positives = 29/74 (39%), Gaps = 17/74 (22%)

Query: 24 PGVKALDNVNLKVRPYSIHALMGENGAGKSTLLKCLFGIYKKDSGSIIFQGQEIEFKSSK 83
PG K D + L G G GKSTL+ L G+ F + + K
Sbjct: 591 PGCKF-DYSVV---------LEGTGGIGKSTLINTLVGLD-------FFSDTHFDIGTGK 633

Query: 84 EALEQGVSMVHQEL 97
++ EQ +V EL
Sbjct: 634 DSYEQIAGIVAYEL 647


42  YpAngola_A3096  YpAngola_A3106  Y        Y        N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
YpAngola_A3096  -1      20       3.545965    U32 family peptidase
YpAngola_A3097  -1      23       4.346466    hypothetical protein
YpAngola_A3098  -1      22       4.820578    DNA-binding transcriptional regulator BaeR
YpAngola_A3099  -1      20       4.395143    signal transduction histidine-protein kinase
YpAngola_A3100  0       20       4.787054    multidrug efflux system protein MdtE
YpAngola_A3101  -1      20       4.733828    multidrug efflux system subunit MdtC
YpAngola_A3102  -1      18       3.882906    multidrug efflux system subunit MdtB
YpAngola_A3103  -1      15       3.391116    multidrug efflux system subunit MdtA
YpAngola_A3104  0       16       3.110242    ABC transporter ATP-binding protein
YpAngola_A3105  0       18       4.032438    GntR family transcriptional regulator
YpAngola_A3106  1       17       3.300866    4-aminobutyrate aminotransferase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A3098  HTHFIS  78  8e-19  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 78.3 bits (193), Expect = 8e-19
Identities = 33/150 (22%), Positives = 70/150 (46%), Gaps = 5/150 (3%)

Query: 10 QSGSVLIVEDEPKLGQLLVDYLQAAGYRTQWLTNGAEVVATVRQTPPAIILLDLMLPGSD 69
++L+ +D+ + +L L AGY + +N A + + +++ D+++P +
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 70 GITLCREIR-RFSDIPIVMVTAKTEEIDRLLGLEIGADDYICKPYSPREVVARVKTIL-- 126
L I+ D+P+++++A+ + + E GA DY+ KP+ E++ + L
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 127 --RRCSQQRHQPTDDAPLLINESRFQASYQ 154
RR S+ D PL+ + Q Y+
Sbjct: 122 PKRRPSKLEDDSQDGMPLVGRSAAMQEIYR 151


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A3099  BCTERIALGSPF  34  0.001  Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 34.0 bits (78), Expect = 0.001
Identities = 24/90 (26%), Positives = 38/90 (42%), Gaps = 21/90 (23%)

Query: 170 LSTLLAAAVTWVLS-------------RGMLAPVKRLVEGTHRLAA------GDFST--R 208
L+TL+AA++ + ++A V+ V H LA G F
Sbjct: 77 LATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLADAMKCFPGSFERLYC 136

Query: 209 VAVSSRDELGHLAQDFNQLASSLEKNEQMR 238
V++ + GHL N+LA E+ +QMR
Sbjct: 137 AMVAAGETSGHLDAVLNRLADYTEQRQQMR 166


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A3100  TCRTETB  126  5e-34  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 126 bits (318), Expect = 5e-34
Identities = 97/435 (22%), Positives = 182/435 (41%), Gaps = 17/435 (3%)

Query: 20 FMQTLDTTIVNTALPSIAASLGENPLRMQSVIVSYVLTVAVMLPASGWLADRIGVKWVFF 79
F L+ ++N +LP IA + P V +++LT ++ G L+D++G+K +
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 80 SAIILFTFGSLMCAQSATLNE-LILSRVLQGVGGAMMVPVGRLTVMKIVPREQYMAAMAF 138
II+ FGS++ + LI++R +QG G A + + V + +P+E A
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 139 VTLPGQIGPLVGPALGGFLVEFASWHWIFLINLP-VGVIGALATLLLMPNHKMSTRRFDI 197
+ +G VGPA+GG + + HW +L+ +P + +I + L+ FDI
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201

Query: 198 SGFIMLAIGMATLTLALDGHTGLGLSPLAIAGLILCGVIALGSYWWHALGNRFALFSLHL 257
G I++++G+ L + ++ V++ + H L
Sbjct: 202 KGIILMSVGIVFFMLF---------TTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGL 252

Query: 258 FKNKIYTLGLVGSMSARIGSGMLPFMTPIFLQIGLGFSPFHAG-LMMIPMIIGSMGMKRI 316
KN + +G++ M P ++ S G +++ P + + I
Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312

Query: 317 IVQVVNRFGYRRVLVNATLLLAVVSLSLPLVAIMGWTLLMPVVLFFQGMLNALRFSTMNT 376
+V+R G VL L+V L+ + + +++F G L+ + ++T
Sbjct: 313 GGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTV-IST 371

Query: 377 LTLKTLPDRLASSGNSLLSMAMQLSMSIGVSTAGILLGTFAHHQVATNTPATHSAFLYS- 435
+ +L + A +G SLL+ LS G++ G LL Q S +LYS
Sbjct: 372 IVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQRLLPMEVDQSTYLYSN 431

Query: 436 -YLCMAIIIALPALI 449
L + II + L+
Sbjct: 432 LLLLFSGIIVISWLV 446


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A3101  ACRIFLAVINRP  864  0.0  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 864 bits (2235), Expect = 0.0
Identities = 286/1035 (27%), Positives = 504/1035 (48%), Gaps = 36/1035 (3%)

Query: 6 LFIQRPVATTLLTLAITLSGIIGFSLLPVSPLPQVDYPVIMVSASMPGADPETMASSVAT 65
FI+RP+ +L + + ++G + LPV+ P + P + VSA+ PGAD +T+ +V
Sbjct: 4 FFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQ 63

Query: 66 PLERALGRIAGVNEMTSTS-SLGSTRIILQFDLNRDINGAARDVQAALNAAQSLLPSGMP 124
+E+ + I + M+STS S GS I L F D + A VQ L A LLP +
Sbjct: 64 VIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQ 123

Query: 125 SRPTYRKMNPSDAPIMIMTLTSDT--FSQGQLYDYASTKLAQKIAQTEGVSDVTVGGSSL 182
+ S + +M+ SD +Q + DY ++ + +++ GV DV + G+
Sbjct: 124 -QQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY 182

Query: 183 PAVRVELNPSALFNQGVSLDAVRQAISAANVRRPQGSVDAAET------HWQVQANDEIK 236
A+R+ L+ L ++ V + N + G + + + A K
Sbjct: 183 -AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241

Query: 237 TAEGYRPLIVHYN-NGSPVRLQDVANVIDSVQDVRNAGMSAGQPAVLLVISREPGANIIA 295
E + + + N +GS VRL+DVA V ++ G+PA L I GAN +
Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301

Query: 296 TVDRIRAELPALRASIPASIQLNIAQDRSPTIRASLDEVERSLVIAVALVILVVFIFLRS 355
T I+A+L L+ P +++ D +P ++ S+ EV ++L A+ LV LV+++FL++
Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361

Query: 356 GRATLIPAVAVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVVDDAIVVLENISRHL- 414
RATLIP +AVPV L+GTFA + G+S+N L++ + +A G +VDDAIVV+EN+ R +
Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421

Query: 415 EAGVKPKVAALRGVREVGFTVLSMSISLVAVFIPLLLMAGLPGRLFREFAVTLSVAIGIS 474
E + PK A + + ++ ++ +++ L AVFIP+ G G ++R+F++T+ A+ +S
Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481

Query: 475 LVISLTLTPMMCAWLLRSHPKGQQQRIRGFG----KVLLAIQQGYGRSLNWALSHTRWVM 530
++++L LTP +CA LL+ + GF Y S+ L T +
Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541

Query: 531 VVLLSTIALNVWLYISIPKTFFPEQDTGRMMGFIQADQSISFQSMQQKLKDFMQIVGADP 590
++ +A V L++ +P +F PE+D G + IQ + + Q+ L +
Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601

Query: 591 -----AVDSVTGFT-GGSRTNSGSMFISLKPLSER---QETAQQVITRLRGKLAKEPGAN 641
+V +V GF+ G N+G F+SLKP ER + +A+ VI R + +L K
Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661

Query: 642 LFLSSVQDIRVGGRHSNAAYQFTLLADDLAALREWEPKVRAALAKL-----PQLADVNSD 696
+ ++ I G + ++ L D + + R L + L V +
Sbjct: 662 VIPFNMPAIVELGTATGFDFE---LIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPN 718

Query: 697 QQDKGAEMALTYDRETMARLGIDVSEANALLNNAFGQRQISTIYQPLNQYKVVMEVAPEY 756
+ A+ L D+E LG+ +S+ N ++ A G ++ K+ ++ ++
Sbjct: 719 GLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKF 778

Query: 757 TQDVSSLDKMFVINSNGQSIPLSYFAKWQPANAPLAVNHQGLSAASTISFNLPDGGSLSE 816
+DK++V ++NG+ +P S F + + I G S +
Sbjct: 779 RMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGD 838

Query: 817 ATAAVERAMTELGVPSTVRGAFAGTAQVFQETLKSQLWLIMAAIATVYIVLGILYESYVH 876
A A +E ++L P+ + + G + + + L+ + V++ L LYES+
Sbjct: 839 AMALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSI 896

Query: 877 PLTILSTLPSAGVGALLALELFDAPFSLIALIGIMLLIGIVKKNAIMMVDFALDAQRNGN 936
P++++ +P VG LLA LF+ + ++G++ IG+ KNAI++V+FA D
Sbjct: 897 PVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEG 956

Query: 937 ISAREAIFQASLLRFRPIIMTTLAALFGALPLVLSSGDGAELRQPLGITIVGGLVVSQLL 996
EA A +R RPI+MT+LA + G LPL +S+G G+ + +GI ++GG+V + LL
Sbjct: 957 KGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLL 1016

Query: 997 TLYTTPVIYLYFDRL 1011
++ PV ++ R
Sbjct: 1017 AIFFVPVFFVVIRRC 1031



Score = 78.0 bits (192), Expect = 1e-16
Identities = 59/350 (16%), Positives = 130/350 (37%), Gaps = 12/350 (3%)

Query: 680 VRAALAKLPQLADVNSDQQDKGAEMALTYDRETMARLGI---DVSEANALLNNAFGQRQI 736
V+ L++L + DV M + D + + + + DV + N+ Q+
Sbjct: 162 VKDTLSRLNGVGDVQLFGAQY--AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQL 219

Query: 737 STIYQPLNQYKVVMEVAPEYTQDVSSLDKMFV-INSNGQSIPLSYFAK--WQPANAPLAV 793
Q +A ++ K+ + +NS+G + L A+ N +
Sbjct: 220 GGTPALPGQQLNASIIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIA 279

Query: 794 NHQGLSAASTISFNLPDGGSLSEATAAVERAMTEL--GVPSTVRGAFA-GTAQVFQETLK 850
G AA +L + A++ + EL P ++ + T Q ++
Sbjct: 280 RINGKPAAGLGIKLATGANAL-DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIH 338

Query: 851 SQLWLIMAAIATVYIVLGILYESYVHPLTILSTLPSAGVGALLALELFDAPFSLIALIGI 910
+ + AI V++V+ + ++ L +P +G L F + + + G+
Sbjct: 339 EVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGM 398

Query: 911 MLLIGIVKKNAIMMVDFALDAQRNGNISAREAIFQASLLRFRPIIMTTLAALFGALPLVL 970
+L IG++ +AI++V+ + +EA ++ ++ + +P+
Sbjct: 399 VLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAF 458

Query: 971 SSGDGAELRQPLGITIVGGLVVSQLLTLYTTPVIYLYFDRLRNRFSKQPL 1020
G + + ITIV + +S L+ L TP + + + +
Sbjct: 459 FGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENK 508


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3102  ACRIFLAVINRP  872    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 872 bits (2254), Expect = 0.0
Identities = 289/1036 (27%), Positives = 501/1036 (48%), Gaps = 29/1036 (2%)

Query: 13 SRLFILRPVATTLFMIAILLAGIIGYRALPVSALPEVDYPTIQVVTLYPGASPDVVTSSI 72
+ FI RP+ + I +++AG + LPV+ P + P + V YPGA V ++
Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61

Query: 73 TAPLERQFGQMSGLKQMASQS-SGGASVITLQFQLTLPLDVAEQEVQAAINAATNLLPSD 131
T +E+ + L M+S S S G+ ITL FQ D+A+ +VQ + AT LLP +
Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121

Query: 132 LPYPPIYNKVNPADPPILTLAVTATAIPMTQVE--DMVETRIAQKISQVTGVGLVTLSGG 189
+ I + ++ + TQ + D V + + +S++ GVG V L G
Sbjct: 122 VQQQGIS-VEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 190 QRPAVRVKLNAPAVAALGLDSETIRTAISNANVNSAKGSLDGP------TRSVTLSANDQ 243
Q A+R+ L+A + L + + N A G L G + ++ A +
Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 244 MKSAEEYRDLII-AYQNGAPIRLQDVATIEQGAENNKLAAWANTQSAIVLNIQRQPGVNV 302
K+ EE+ + + +G+ +RL+DVA +E G EN + A N + A L I+ G N
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 303 IATADSIREMLPELIKSLPKSVDVKVLTDRTSTIRASVNDVQFELLLAIALVVMVIYLFL 362
+ TA +I+ L EL P+ + V D T ++ S+++V L AI LV +V+YLFL
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 363 RNAAATIIPSIAVPLSLVGTFAAMYFLGFSINNLTLMALTIATGFVVDDAIVVIENISRY 422
+N AT+IP+IAVP+ L+GTFA + G+SIN LT+ + +A G +VDDAIVV+EN+ R
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 423 I-EKGEKPLDAALKGAGEIGFTIISLTFSLIAVLIPLLFMEDIVGRLFREFAVTLAVAIL 481
+ E P +A K +I ++ + L AV IP+ F G ++R+F++T+ A+
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 482 ISAVVSLTLTPMMCARML---SYESLRKQNRLSRASEKFFDWVIAHYAVALKKVLNHPWL 538
+S +V+L LTP +CA +L S E + FD + HY ++ K+L
Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 539 TLSVAFSTLVLTVILYLLIPKGFFPLQDNGLIQGTLEAPQSVSFSNMAERQQQVAAIILK 598
L + + V+L+L +P F P +D G+ ++ P + + QV LK
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 599 DPA--VESLTSFVGVDGTNATLNNGRLQINLKPLSERDDRIP---QIITRLQESVSGVPG 653
+ VES+ + G + N G ++LKP ER+ +I R + + +
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659

Query: 654 IKLYLQPVQDLTIDTQLSRTQYQFTLQ---ATSLEELSTWVPKLVNELQQK-APFQDVTS 709
++ P I + T + F L + L+ +L+ Q A V
Sbjct: 660 --GFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRP 717

Query: 710 DWQDQGLVAFVNVDRDSASRLGITMAAIDNALYNAFGQRLISTIYTQSNQYRVVLEHDVQ 769
+ + + VD++ A LG++++ I+ + A G ++ + ++ ++ D +
Sbjct: 718 NGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAK 777

Query: 770 ATPGLAAFNDIRLTGIDGKGVPLSSIATIEERFGPLSINHLNQFPSATVSFNLAQGYSLG 829
+ + + +G+ VP S+ T +G + N PS + A G S G
Sbjct: 778 FRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSG 837

Query: 830 EAVAAVTLAEKEIQLPADITTRFQGSTLAFQAALGSTLWLIIAAIVAMYIVLGVLYESFI 889
+A+A + +LPA I + G + + + L+ + V +++ L LYES+
Sbjct: 838 DAMALMENLAS--KLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWS 895

Query: 890 HPITILSTLPTAGVGALLALMLTGNELDVIAIIGIILLIGIVKKNAIMMIDFALAAERDQ 949
P++++ +P VG LLA L + DV ++G++ IG+ KNAI++++FA +
Sbjct: 896 IPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKE 955

Query: 950 GMTPYDAIYQACLLRFRPILMTTLAALFGALPLMLSTGVGAELRQPLGVCMVGGLIVSQV 1009
G +A A +R RPILMT+LA + G LPL +S G G+ + +G+ ++GG++ + +
Sbjct: 956 GKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATL 1015

Query: 1010 LTLFTTPVIYLLFDKL 1025
L +F PV +++ +
Sbjct: 1016 LAIFFVPVFFVVIRRC 1031



Score = 84.1 bits (208), Expect = 2e-18
Identities = 77/517 (14%), Positives = 190/517 (36%), Gaps = 25/517 (4%)

Query: 533 LNHPWLTLSVAFSTLVLTVILYLLIPKGFFPLQDNGLIQGTLEAPQSVSFSNMAERQQQV 592
+ P +A ++ + L +P +P + + P + + Q V
Sbjct: 6 IRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGA----DAQTVQDTV 61

Query: 593 AAIILKDPAVESLTSFVGVDGTNAT-LNNGRLQINL--KPLSERDDRIPQIITRLQESVS 649
+I +++ + ++T + G + I L + ++ D Q+ +LQ +
Sbjct: 62 TQVI-----EQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATP 116

Query: 650 GVP-GIKLYLQPVQDLTIDTQLSRTQYQFTLQATSLEELSTWVPK-LVNELQQKAPFQDV 707
+P ++ V+ + + L + T+ +++S +V + + L + DV
Sbjct: 117 LLPQEVQQQGISVEKSS-SSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDV 175

Query: 708 TSDWQDQGLVAFVNVDRDSASRLGITMAAIDNALYNAFGQRLISTIYTQSNQYRVVLEHD 767
+ + +D D ++ +T + N L Q + L
Sbjct: 176 QLFGAQYAMR--IWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNAS 233

Query: 768 VQATPGLAAFNDIR----LTGIDGKGVPLSSIATIEERFGPLSIN-HLNQFPSATVSFNL 822
+ A + DG V L +A +E ++ +N P+A + L
Sbjct: 234 IIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKL 293

Query: 823 AQGYSLGEAVAAV--TLAEKEIQLPADI-TTRFQGSTLAFQAALGSTLWLIIAAIVAMYI 879
A G + + A+ LAE + P + +T Q ++ + + AI+ +++
Sbjct: 294 ATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFL 353

Query: 880 VLGVLYESFIHPITILSTLPTAGVGALLALMLTGNELDVIAIIGIILLIGIVKKNAIMMI 939
V+ + ++ + +P +G L G ++ + + G++L IG++ +AI+++
Sbjct: 354 VMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVV 413

Query: 940 DFALAAERDQGMTPYDAIYQACLLRFRPILMTTLAALFGALPLMLSTGVGAELRQPLGVC 999
+ + + P +A ++ ++ + +P+ G + + +
Sbjct: 414 ENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSIT 473

Query: 1000 MVGGLIVSQVLTLFTTPVIYLLFDKLARNTRGKNRHR 1036
+V + +S ++ L TP + K +N+
Sbjct: 474 IVSAMALSVLVALILTPALCATLLKPVSAEHHENKGG 510


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
YpAngola_A3103  RTXTOXIND  43     1e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 43.3 bits (102), Expect = 1e-06
Identities = 22/115 (19%), Positives = 42/115 (36%), Gaps = 10/115 (8%)

Query: 67 VIAANTVTVTSRVDGELMALHFTEGQQVKAGDLLAEIDPRPYEVQLTQAQGQLAKDQATL 126
+ + + + + + EG+ V+ GD+L ++ A+ K Q++L
Sbjct: 91 THSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTA-------LGAEADTLKTQSSL 143

Query: 127 DNARRDLARYQKLSK---TGLISQQELDTQSSLVRQSEGSVKADQGAIDSAKLQL 178
AR + RYQ LS+ + + +L + SE V I
Sbjct: 144 LQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTW 198



Score = 42.1 bits (99), Expect = 3e-06
Identities = 23/124 (18%), Positives = 54/124 (43%), Gaps = 4/124 (3%)

Query: 108 YEVQLTQAQGQLAKDQATLDNARRDLARYQKLSKTGLISQQELDTQSSLVRQSEGSVKAD 167
E + +A +L ++ L+ ++ + + L++Q + +RQ+ ++
Sbjct: 257 QENKYVEAVNELRVYKSQLEQIESEILSAK--EEYQLVTQLFKNEILDKLRQTTDNIGLL 314

Query: 168 QGAIDSAKLQLTYSRITAPISGRV-GLKQVDVGNYITSGTATPIVVITQTHPVDVVFTLP 226
+ + + S I AP+S +V LK G +T+ T +V++ + ++V +
Sbjct: 315 TLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE-TLMVIVPEDDTLEVTALVQ 373

Query: 227 ESDI 230
DI
Sbjct: 374 NKDI 377


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
YpAngola_A3104  PF05272  31     0.007    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 31.2 bits (70), Expect = 0.007
Identities = 8/31 (25%), Positives = 14/31 (45%)

Query: 34 LTLLGPSGSGKTTSLMMLAGFETPTQGEITL 64
+ L G G GK+T + L G + + +
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDI 629


43  YpAngola_A3135  YpAngola_A3149  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
YpAngola_A3135  -1      15       3.031570    phosphoribosylaminoimidazole-succinocarboxamide
YpAngola_A3136  1       15       3.984374    hypothetical protein
YpAngola_A3137  2       17       4.153245    hypothetical protein
YpAngola_A3138  3       17       4.133634    acetyltransferase
YpAngola_A3139  2       17       1.798065    hypothetical protein
YpAngola_A3140  1       14       1.416379    D-alanyl-D-alanine carboxypeptidase
YpAngola_A3141  0       13       2.060314    succinyl-diaminopimelate desuccinylase
YpAngola_A3142  0       13       1.796516    hypothetical protein
YpAngola_A3144  1       12       -0.535249   hypothetical protein
YpAngola_A3143  1       12       -2.048045   hypothetical protein
YpAngola_A3145  1       12       -2.090806   hypothetical protein
YpAngola_A3146  1       14       -2.865140   ABC transporter permease
YpAngola_A3147  2       19       -5.177076   ABC transporter ATP-binding protein
YpAngola_A3148  2       21       -5.754367   sulfatase
YpAngola_A3149  0       20       -3.076427   putative sulfatase regulator
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3138  SACTRNSFRASE  30     0.026    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 29.5 bits (66), Expect = 0.026
Identities = 10/57 (17%), Positives = 24/57 (42%), Gaps = 1/57 (1%)

Query: 468 ISRVAVTAAWRQQGIARRMIAAEQAHARQQQ-CDFLSVSFGYTAELAHFWHRCGFRL 523
I +AV +R++G+ ++ A++ C + + HF+ + F +
Sbjct: 92 IEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFII 148


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3143  SYCDCHAPRONE  28     0.016    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD

chaperone signature.
Length = 168

Score = 27.6 bits (61), Expect = 0.016
Identities = 16/74 (21%), Positives = 29/74 (39%), Gaps = 3/74 (4%)

Query: 22 QDLLSRSPDNASLLYKIASLYDVQGLELQAVPFYRAAIEHNLVGTELQAAYLGLGSTYRT 81
L S D LY +A G A ++A + + +LGLG+ +
Sbjct: 26 AMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRF---FLGLGACRQA 82

Query: 82 LGLYQAALETFDHA 95
+G Y A+ ++ +
Sbjct: 83 MGQYDLAIHSYSYG 96


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
YpAngola_A3147  PF05272  31     0.007    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 31.2 bits (70), Expect = 0.007
Identities = 14/35 (40%), Positives = 18/35 (51%)

Query: 31 VVSLLGPSGSGKTTLLRAVAGLEKPSQGHIIIGEK 65
V L G G GK+TL+ + GL+ S H IG
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTG 632


44  YpAngola_A3252  YpAngola_A3272  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
YpAngola_A3252  -1      17       -3.767840   LysR family substrate-binding transcriptional
YpAngola_A3253  0       22       -6.683032   hypothetical protein
YpAngola_A3255  2       28       -8.671599   hypothetical protein
YpAngola_A3254  3       33       -11.867590  methyl-accepting chemotaxis protein
YpAngola_A3256  6       45       -17.370470  hypothetical protein
YpAngola_A3258  6       46       -17.296514  hypothetical protein
YpAngola_A3259  8       45       -15.735810  prepilin peptidase
YpAngola_A3261  6       46       -16.140451  hypothetical protein
YpAngola_A3263  5       42       -13.371379  general secretion pathway protein K
YpAngola_A3264  7       43       -13.419717  general secretion pathway protein J
YpAngola_A3265  6       43       -13.481931  general secretion pathway protein I
YpAngola_A3266  5       41       -12.651569  general secretion pathway protein G
YpAngola_A3267  4       41       -13.037172  general secretion pathway protein F
YpAngola_A3268  4       35       -10.928534  general secretion pathway protein E
YpAngola_A3269  3       30       -9.837444   general secretion pathway protein D
YpAngola_A3270  -1      18       -3.961528   general secretion pathway protein C
YpAngola_A3272  -1      17       -3.340801   putative carbonic anhydrase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3253  FERRIBNDNGPP  29     0.018    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 29.1 bits (65), Expect = 0.018
Identities = 23/104 (22%), Positives = 46/104 (44%), Gaps = 13/104 (12%)

Query: 186 GVAVSGNIHLWVADTQTPESRENWLT----TLEKIKALKPAIVVPGHFLDNAPQTLESVT 241
GVA + N LWV++ P+S + LE + +KP+ +V +P+ L +
Sbjct: 58 GVADTINYRLWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEMLARIA 117

Query: 242 FTQNYLTTLNAEIPKAKDSAELIAVMKKHYPELKDESSLELSAK 285
+ + D + +A+ +K E+ D +L+ +A+
Sbjct: 118 PGRGF---------NFSDGKQPLAMARKSLTEMADLLNLQSAAE 152


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3259  PREPILNPTASE  236    3e-79    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family

signature.
Length = 290

Score = 236 bits (604), Expect = 3e-79
Identities = 116/275 (42%), Positives = 152/275 (55%), Gaps = 4/275 (1%)

Query: 27 VFFVSYLIFGAMVGSFLNVLIYRLPIMLANLSSR-SESHGEEIKMRSHLRNINLFQPGSF 85
++F +F M+GSFLNV+I+RLPIML S+ NL P S
Sbjct: 14 LYFSLVFLFSLMIGSFLNVVIHRLPIMLEREWQAEYRSYFNPDDEGVDEPPYNLMVPRSC 73

Query: 86 CHHCNESIPIKYNIPILGWIFLRGASRCCNKKISTRYLFIEVLAVIQTLLVLMIFKEDLL 145
C HCN I NIP+L W++LRG R C IS RY +E+L + ++ V M
Sbjct: 74 CPHCNHPITALENIPLLSWLWLRGRCRGCQAPISARYPLVELLTALLSVAVAMTLAPGWG 133

Query: 146 ICTSLVLIWSLTALAFIDFDTYLLPDCMTIPLLWLGLLINIDTVFAPLTSAVLGAVSGYL 205
+L+L W L AL FID D LLPD +T+PLLW GLL N+ F L AV+GA++GYL
Sbjct: 134 TLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGDAVIGAMAGYL 193

Query: 206 FLWLSYWLFKIVRGVDGMGYGDFKLMAALGAWFGVSAVPFLILFSSFFGLVAYAIFYFFD 265
LW YW FK++ G +GMGYGDFKL+AALGAW G A+P ++L SS G
Sbjct: 194 VLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGIGLILLR 253

Query: 266 KKDNGKEINYIAFGPYISLAGVLYLFLGSHVTNLF 300
K I FGPY+++AG + L G +T +
Sbjct: 254 NHHQSKP---IPFGPYLAIAGWIALLWGDSITRWY 285


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3263  TYPE3IMPPROT  30     0.012    Type III secretion system inner membrane P protein ...
		>TYPE3IMPPROT#Type III secretion system inner membrane P protein

family signature.
Length = 224

Score = 29.8 bits (67), Expect = 0.012
Identities = 14/65 (21%), Positives = 29/65 (44%), Gaps = 6/65 (9%)

Query: 4 NGIALLMVLCALFLMSTMVMASYNYWFDIYYLAKNSQQRQKEKWILLGAEEKFVSKLIKN 63
NG+ALL+ ++F+M ++ +Y Y+ D + K + + + LIK
Sbjct: 53 NGVALLL---SMFVMWPIMHDAYVYFEDEDVTFNDISSLSKH---VDEGLDGYRDYLIKY 106

Query: 64 TSEDR 68
+ +
Sbjct: 107 SDREL 111


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3266  BCTERIALGSPG  207    2e-72    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 207 bits (529), Expect = 2e-72
Identities = 87/136 (63%), Positives = 103/136 (75%)

Query: 2 ANKKTKGFTLLEIMVVIVILGLLASLTIPSLMSNKNRADQQKAVSDISALENALDMYRLD 61
A K +GFTLLEIMVVIVI+G+LASL +P+LM NK +AD+QKAVSDI ALENALDMY+LD
Sbjct: 3 ATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLD 62

Query: 62 NGDYPTEQQGIAALVTKPNVPPLPQRYPSDGYIRRLPTDPWGNSYQMNNPGKHGQIDIFS 121
N YPT QG+ +LV P +PPL Y +GYI+RLP DPWGN Y + NPG+HG D+ S
Sbjct: 63 NHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLLS 122

Query: 122 IGPDRLPETEDDIGNW 137
GPD TEDDI NW
Sbjct: 123 AGPDGEMGTEDDITNW 138


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3267  BCTERIALGSPF  358    e-124    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F

signature.
Length = 408

Score = 358 bits (921), Expect = e-124
Identities = 172/406 (42%), Positives = 264/406 (65%), Gaps = 7/406 (1%)

Query: 1 MAVFKYVAISRSGTKITGDIDAENIRIARYLLYKKNMHVLSI-------KKRILLFNKYV 53
MA + Y A+ G K G +A++ R AR LL ++ + LS+ +K
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 54 VKKNSNKTDLVLITRQIATLVNASMPLDEVLDIVGKQNSKSKMIEIIQRIRVNIQEGHSF 113
K + +DL L+TRQ+ATLV ASMPL+E LD V KQ+ K + +++ +R + EGHS
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 114 ADALSPFPAVFSPLYKTMVTAGEVSGHLGLVLVRLADHIEQTQKIQRKIIQALIYPCVLV 173
ADA+ FP F LY MV AGE SGHL VL RLAD+ EQ Q+++ +I QA+IYPCVL
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 174 LISLSVIIILLTAVVPNIVEQFSFSETALPLSTKVLMILSYSIKENVIFIMAIGVSAVIF 233
+++++V+ ILL+ VVP +VEQF + ALPLST+VLM +S +++ +++ ++ +
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240

Query: 234 LNRLLKINKINVFFHRHYLSLPMLGNMFVRINTSRYLRTLTTLHSNGVTIVQAMSISNAV 293
+L+ K V FHR L LP++G + +NT+RY RTL+ L+++ V ++QAM IS V
Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 294 LTNVYIKNKLNISVKLVSEGCSLSSSLVDSGVFPPIILHMIISGERSGKLDHMLETVAGV 353
++N Y +++L+++ V EG SL +L + +FPP++ HMI SGERSG+LD MLE A
Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360

Query: 354 QEEELMNQISIVMSLLEPTIIIVMAAFISFIILSILQPILEINSLV 399
Q+ E +Q+++ + L EP +++ MAA + FI+L+ILQPIL++N+L+
Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3269  BCTERIALGSPD  543    0.0      Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D

signature.
Length = 660

Score = 543 bits (1401), Expect = 0.0
Identities = 310/610 (50%), Positives = 432/610 (70%), Gaps = 15/610 (2%)

Query: 3 ISGKGIKSIHGMIFLFTLIMPLDIISANFSVSFKDVDIKEFINSVSKNINKTIIIDPTVQ 62
I I+S + +F ++ + FS SFK DI+EFIN+VSKN+NKT+IIDP+V+
Sbjct: 2 IIANVIRSFSLTLLIFAALLFRPAAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVR 61

Query: 63 GLISIRSYENLDKDTYYQLFLNVLDVYGYAAIEMPHNVLKVISSKRAKGVVAPLPKEGVT 122
G I++RSY+ L+++ YYQ FL+VLDVYG+A I M + VLKV+ SK AK P+ +
Sbjct: 62 GTITVRSYDMLNEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAP 121

Query: 123 FDGDELINRVIPLRYISAKKITPLLRQLNDNTESGSIINYDPSNILLITGRAAVVNRLHS 182
GDE++ RV+PL ++A+ + PLLRQLNDN GS+++Y+PSN+LL+TGRAAV+ RL +
Sbjct: 122 GIGDEVVTRVVPLTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLT 181

Query: 183 IVTDLDQAGDNEIELYKLNYAIAADVVKIVNEAINPINNLKQEVSIVGKVIADERTNSIL 242
IV +D AGD + L++A AADVVK+V E + S+V V+ADERTN++L
Sbjct: 182 IVERVDNAGDRSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVL 241

Query: 243 ISGDTYIRKKSILMIKKLDKRQSSDGNTKVVYMKYAQASKLLDVLNGISEGFHNEKKTKQ 302
+SG+ R++ I MIK+LD++Q++ GNTKV+Y+KYA+AS L++VL GIS +EK+ +
Sbjct: 242 VSGEPNSRQRIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAK 301

Query: 303 SNQWNQRPVAIKAYDQTNALVITADPDMMLALGEVIEKLDIRRAQVLVEAIIVETQNGEG 362
+ + IKA+ QTNAL++TA PD+M L VI +LDIRR QVLVEAII E Q+ +G
Sbjct: 302 PVAALDKNIIIKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADG 361

Query: 363 INLGVKWENKRSDDINF----IKNSDGLLNNNGWGIATTIT-----------GLTAGFYK 407
+NLG++W NK + F + S + N + T++ G+ AGFY+
Sbjct: 362 LNLGIQWANKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQ 421

Query: 408 GNWDVLLSALSTNTNNNILATPSIVTLDNMEAEFNVGQEVPVLISTQTTTTDKVYNSISR 467
GNW +LL+ALS++T N+ILATPSIVTLDNMEA FNVGQEVPVL +QTT+ D ++N++ R
Sbjct: 422 GNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGSQTTSGDNIFNTVER 481

Query: 468 QSIGVMLKVKPQINKGDSVLLEIRQEVSSIADSSTVNTHNLGSVFNKRVVNNAVLVKSGE 527
+++G+ LKVKPQIN+GDSVLLEI QEVSS+AD+++ + +LG+ FN R VNNAVLV SGE
Sbjct: 482 KTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGATFNTRTVNNAVLVGSGE 541

Query: 528 TVVVGGLLDKKSSTIVNKVPFLGDLPLIGWLFRQTKEKVEKSNLILFIKPTILRESDDYS 587
TVVVGGLLDK S +KVP LGD+P+IG LFR T +KV K NL+LFI+PT++R+ D+Y
Sbjct: 542 TVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYR 601

Query: 588 VVTSKEYNKY 597
+S +Y +
Sbjct: 602 QASSGQYTAF 611


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3270  BCTERIALGSPC  45     5e-09    Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C

signature.
Length = 272

Score = 44.6 bits (105), Expect = 5e-09
Identities = 19/62 (30%), Positives = 31/62 (50%)

Query: 29 IKLVGVIEHSAPSESIAILEVKGKQTTHLTRENINYEDIVIVKIFTDRVIIKRNGKYYSL 88
+ L GV+ S SIAI+ +Q + E + + IV I DRV+++ G+Y L
Sbjct: 95 LSLTGVMAGDDDSRSIAIISKDNEQFSRGVNEEVPGYNAKIVSIRPDRVVLQYQGRYEVL 154

Query: 89 II 90
+
Sbjct: 155 GL 156


45  YpAngola_A3284  YpAngola_A3295  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
YpAngola_A3284  4       15       -0.649522   phosphate binding protein
YpAngola_A3285  4       15       -0.519929   phosphate regulon sensor protein
YpAngola_A3286  5       14       -0.947161   phosphate regulon transcriptional regulatory
YpAngola_A3287  6       14       -1.184844   hypothetical protein
YpAngola_A3288  6       14       -0.827924   exonuclease subunit SbcD
YpAngola_A3289  6       16       -1.484703   nuclease SbcCD, C subunit
YpAngola_A3290  0       21       -2.873765   fructokinase
YpAngola_A3291  0       24       -4.880164   recombination associated protein
YpAngola_A3292  0       27       -5.410182   hypothetical protein
YpAngola_A3293  0       28       -5.753533   shikimate kinase
YpAngola_A3294  -1      24       -7.207500   putative methyltransferase
YpAngola_A3295  -2      18       -3.651871   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
YpAngola_A3286  HTHFIS  92     9e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 92.2 bits (229), Expect = 9e-24
Identities = 32/123 (26%), Positives = 59/123 (47%), Gaps = 2/123 (1%)

Query: 1 MMARRILVVEDEAPIREMVCFVLEQNGYQPLEAEDYDSAVARLSEPFPDLVLLDWMLPGG 60
M ILV +D+A IR ++ L + GY + + ++ DLV+ D ++P
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 SGIQFIKHMKREALTRDIPVMMLTARGEEEDRVRGLEVGADDYITKPFSPKELVARIKAV 120
+ + +K+ D+PV++++A+ ++ E GA DY+ KPF EL+ I
Sbjct: 61 NAFDLLPRIKKARP--DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 121 MRR 123
+
Sbjct: 119 LAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
YpAngola_A3289  RTXTOXIND  42     2e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 41.7 bits (98), Expect = 2e-05
Identities = 32/222 (14%), Positives = 74/222 (33%), Gaps = 8/222 (3%)

Query: 312 QYLAQLTPLT--QAVEQATAARQQQQLNQHEQETLIEQRIVPLDNLITQQQQTLSQLAGQ 369
L +LT L + ++ Q +L Q + L + + + Q +
Sbjct: 122 DVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSE 181

Query: 370 IQQLRAKEQQNSQQLALNEQKLLQTHQRLQQLADYANLHAHHQHWEKHLPLWHEQFRQLQ 429
+ LR Q QK + ++ A+ + A +E + +
Sbjct: 182 EEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFS 241

Query: 430 LQQQQSAQSEQQLHQQTTLLATLQQQATTLSAQEKQQQVALAEARAQASYLQQKL--LVL 487
+ A ++ + +Q + +Q +Q + + A+ + + Q +L
Sbjct: 242 SLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEIL 301

Query: 488 EQ----QQPSAQLRQQLNEFNEQRQICQQLAALSPLAQQIQA 525
++ L +L + E++Q A +S QQ++
Sbjct: 302 DKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKV 343



Score = 39.0 bits (91), Expect = 1e-04
Identities = 36/222 (16%), Positives = 72/222 (32%), Gaps = 30/222 (13%)

Query: 449 LATLQQQATTLSAQEKQQQVALAEARAQASYLQQKLLVLEQQ----QPSAQLRQQLNEFN 504
L L +A TL Q Q L + R Q +L L + +P Q +
Sbjct: 127 LTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLR 186

Query: 505 EQRQICQQLAALSPLAQQIQALYDKQQQQFTAQQQQLKQLEQQ---LTEKRQLYQQ-QKQ 560
I +Q + Q + DK++ + ++ + E + + +
Sbjct: 187 LTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHK 246

Query: 561 HLVDLEALLEREKQIVTLEAERAKLQPGDACPLCGAVEHPAITAYQAVKPSETAVRVAKL 620
+ A+LE+E + V E + ++ E+ + AK
Sbjct: 247 QAIAKHAVLEQENKYVEAVNELRVYK-------------------SQLEQIESEILSAKE 287

Query: 621 RL-QVEQLYTEGTELRTQVASMQQHQQRIEQELQDHRQQLAA 661
V QL+ E+ ++ + + EL + ++ A
Sbjct: 288 EYQLVTQLFKN--EILDKLRQTTDNIGLLTLELAKNEERQQA 327



Score = 38.3 bits (89), Expect = 2e-04
Identities = 28/206 (13%), Positives = 71/206 (34%), Gaps = 19/206 (9%)

Query: 649 EQELQDHRQQLAAYQQRWQTLAQPLSL----AFTLNEPDALALWLEQHEQQEQACQLKLV 704
+ Q Q Q R+Q L++ + L L + E+ + +
Sbjct: 136 TLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSL----- 190

Query: 705 EYERLTQQYQQAKDILTQLEQRQQEHQQQLALITERQKNAQQTYQQLQSQYQHQQEALIA 764
+ +Q+ ++ Q E + + + + R + + +S+ +L+
Sbjct: 191 ----IKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFS-SLLH 245

Query: 765 QQQVLNHTLTELSLSVPDADQQQDWLAQREEECQRWQQHQQEQQRLTIEQKTLETRIENE 824
+Q + H + E +A + + E+ + +E+ +L + E +
Sbjct: 246 KQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDK-- 303

Query: 825 RRHLQECIDQLSALSQQRQQAETLLQ 850
L++ D + L+ + + E Q
Sbjct: 304 ---LRQTTDNIGLLTLELAKNEERQQ 326



Score = 34.0 bits (78), Expect = 0.004
Identities = 26/180 (14%), Positives = 72/180 (40%), Gaps = 13/180 (7%)

Query: 835 LSALSQQRQQAETLLQQQIQQRQALFGEDIVAE-------VRQRLRLQQQQAELAQQNAE 887
+ L Q R Q + + + + ++ + +R +++Q + Q +
Sbjct: 145 QARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQ 204

Query: 888 K--ALQQAQSQLNRLSGELTGLEQQCQQYQQRATTTQAEL-QQALSTSEFADETALTAAL 944
K L + +++ + + E + + R + L +QA++ ++
Sbjct: 205 KELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEA 264

Query: 945 LSE--EERQHLQQLQQQLNERRQQAQIRLQQAR-EILDQHLQLCPQGVDKSSELTLLQQQ 1001
++E + L+Q++ ++ +++ Q+ Q + EILD+ Q + EL +++
Sbjct: 265 VNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEER 324


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3290  BCTERIALGSPF  28     0.045    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F

signature.
Length = 408

Score = 28.3 bits (63), Expect = 0.045
Identities = 11/37 (29%), Positives = 21/37 (56%)

Query: 218 DVIAEQAMNNYERRFAKSLAHVINLFDPDVVVLGGGM 254
D + E+A +N +R F+ + + LF+P +VV +
Sbjct: 351 DSMLERAADNQDREFSSQMTLALGLFEPLLVVSMAAV 387


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
YpAngola_A3293  PF05272  28     0.015    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.5 bits (63), Expect = 0.015
Identities = 16/68 (23%), Positives = 27/68 (39%), Gaps = 12/68 (17%)

Query: 7 MVGARGAGKTTIGKALAQALGYRFVDTDL-------FMQQTSQMTVAEVVESEGWDGFRL 59
+ G G GK+T+ L F DT +Q + + E+ E FR
Sbjct: 601 LEGTGGIGKSTLINTLVGL--DFFSDTHFDIGTGKDSYEQIAGIVAYELSE---MTAFRR 655

Query: 60 RESMALQA 67
++ A++A
Sbjct: 656 ADAEAVKA 663


46  YpAngola_A3319  YpAngola_A3334  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
YpAngola_A3319  2       18       -2.282771   hypothetical protein
YpAngola_A3318  2       21       -2.470404   class II glutamine amidotransferases
YpAngola_A3320  2       21       -2.209383   phosphoheptose isomerase
YpAngola_A3322  1       20       -0.750010   hypothetical protein
YpAngola_A3323  2       19       -0.824072   hypothetical protein
YpAngola_A3325  1       17       0.249455    putative lipoprotein
YpAngola_A3326  0       16       2.300763    hypothetical protein
YpAngola_A3327  0       13       3.170856    methylthioribose kinase
YpAngola_A3328  0       14       3.740981    eIF-2B alpha/beta/delta family protein
YpAngola_A3329  -1      15       4.037864    ARD/ARD' family dioxygenase
YpAngola_A3330  0       15       4.056795    2,3-diketo-5-methylthio-1-phosphopentane
YpAngola_A3331  -1      12       4.265723    methylthioribulose-1-phosphate dehydratase
YpAngola_A3332  -1      13       3.760318    putative aminotransferase
YpAngola_A3333  0       13       3.649610    hypothetical protein
YpAngola_A3334  -1      12       3.360321    allantoate amidohydrolase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3322  FIMREGULATRY  58     3e-13    Escherichia coli: P pili regulatory PapB protein si...
		>FIMREGULATRY#Escherichia coli: P pili regulatory PapB protein

signature.
Length = 104

Score = 58.0 bits (140), Expect = 3e-13
Identities = 25/83 (30%), Positives = 45/83 (54%)

Query: 147 QGRISPGEVDEVQLTLLMDIAKVTKISLRAALHRHLVEGATEEWVCSVYKMNQEDFWQNM 206
+ + PG + E+ LL+ I+ + + A+ +LV G + + VC Y+MN F +
Sbjct: 20 ESVLLPGSMSEMHFFLLIGISSIHSDRVILAMKDYLVGGHSRKEVCEKYQMNNGYFSTTL 79

Query: 207 RKLHRLNERVVQLLPFYTRQTSS 229
+L RLN +L P+YT ++S+
Sbjct: 80 GRLIRLNALAARLAPYYTDESSA 102


47  YpAngola_A3392  YpAngola_A3403  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
YpAngola_A3392  1       24       -3.407337   maltodextrin glucosidase
YpAngola_A3393  1       25       -4.118580   transposase
YpAngola_A3394  1       31       -6.669334   acetyltransferase
YpAngola_A3395  0       26       -5.595105   pantothenate kinase
YpAngola_A3396  -1      24       -4.492818   biotin--protein ligase
YpAngola_A3397  -1      18       -3.602363   UDP-N-acetylenolpyruvoylglucosamine reductase
YpAngola_A3403  -2      16       -3.154966   *hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3394  SACTRNSFRASE  35     1e-04    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 34.5 bits (79), Expect = 1e-04
Identities = 28/116 (24%), Positives = 47/116 (40%), Gaps = 8/116 (6%)

Query: 66 IEREALLLWIARDEIGIIGTIQLVLCQKPNGLNRAEIQKLLVHSRSRRTGIGHKLIIAAE 125
+E E ++ E IG I++ + N A I+ + V R+ G+G L+ A
Sbjct: 60 VEEEGKAAFLYYLENNCIGRIKI----RSNWNGYALIEDIAVAKDYRKKGVGTALLHKAI 115

Query: 126 NTAVQLRRGLIYLDTQS-GSSAESFYRAQGYRYVG-EIPDYACTPNGNYHPTAIYF 179
A + + L+TQ SA FY + + Y+ P N AI++
Sbjct: 116 EWAKENHFCGLMLETQDINISACHFYAKHHFIIGAVDTMLYSNFPTAN--EIAIFW 169


48  YpAngola_A3429  YpAngola_A3460  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A3429  2       21       -1.633353  CDP-diglyceride synthase
YpAngola_A3430  3       26       -1.673128  undecaprenyl pyrophosphate synthase
YpAngola_A3431  2       19       -0.654691  1-deoxy-D-xylulose 5-phosphate reductoisomerase
YpAngola_A3432  2       22       -0.148009  ribosome recycling factor
YpAngola_A3433  2       22       -0.650250  uridylate kinase
YpAngola_A3434  2       20       -0.666402  elongation factor Ts
YpAngola_A3435  0       13       -0.572721  30S ribosomal protein S2
YpAngola_A3437  0       14       -0.485721  PII uridylyl-transferase
YpAngola_A3438  -1      13       -1.332843  2,3,4,5-tetrahydropyridine-2,6-carboxylate
YpAngola_A3439  2       15       -0.219530  hypothetical protein
YpAngola_A3440  3       18       1.132934   flavodoxin
YpAngola_A3441  2       16       2.317502   tRNA pseudouridine synthase C
YpAngola_A3442  1       19       2.916021   hypothetical protein
YpAngola_A3443  1       20       3.527061   transposase
YpAngola_A3444  1       19       3.647949   insertion sequence transposase
YpAngola_A3445  0       21       3.551509   transposase/IS protein
YpAngola_A3446  0       24       3.668748   purine catabolism protein PucG
YpAngola_A3447  0       23       3.148518   amino acid ABC transporter permease
YpAngola_A3448  0       19       4.085838   amino acid ABC transporter permease
YpAngola_A3449  0       18       3.400474   amino acid ABC transporter periplasmic amino
YpAngola_A3450  1       16       3.994508   RpiR family transcriptional regulator
YpAngola_A3451  0       16       3.875994   hypothetical protein
YpAngola_A3452  0       17       3.451684   hypothetical protein
YpAngola_A3453  -1      16       2.752793   amidase
YpAngola_A3454  -1      16       1.179870   hypothetical protein
YpAngola_A3455  -2      19       2.265647   major facilitator transporter
YpAngola_A3456  -3      15       2.354102   putative azaleucine resistance protein AzlC
YpAngola_A3457  -3      16       2.239164   hypothetical protein
YpAngola_A3458  -2      16       2.984015   transcriptional repressor MprA
YpAngola_A3460  -2      14       3.475161   multidrug resistance protein A
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3433  CARBMTKINASE  31     0.004    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 30.9 bits (70), Expect = 0.004
Identities = 17/66 (25%), Positives = 24/66 (36%), Gaps = 14/66 (21%)

Query: 120 AEAI-SLLRHNRVVIFAAGTGNPFFTT-------------DSAACLRGIEIEADVVLKAT 165
AE I L+ +VI + G G P D A E+ AD+ + T
Sbjct: 176 AETIKKLVERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILT 235

Query: 166 KVDGVY 171
V+G
Sbjct: 236 DVNGAA 241


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3438  RTXTOXINA     30     0.010    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.
Length = 1024

Score = 30.3 bits (68), Expect = 0.010
Identities = 9/44 (20%), Positives = 20/44 (45%), Gaps = 2/44 (4%)

Query: 206 VFIGQSTRIYDRETGE--VHYGRVPAGSVVVSGNLPSKDGSYSL 247
VF+ + G V+Y + G + + G ++ G+Y++
Sbjct: 623 VFLSAGSANIYAGKGHDVVYYDKTDTGYLTIDGTKATEAGNYTV 666


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3444  HTHTETR       28     0.047    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.1 bits (62), Expect = 0.047
Identities = 12/106 (11%), Positives = 31/106 (29%), Gaps = 14/106 (13%)

Query: 16 QGMSSRAIARELGISRNTVKRYLQAKSEPPKYTPRPAVASLLDEYRDYIRQRIAD----- 70
S IA+ G++R + + + KS+ + + + I + +
Sbjct: 30 SSTSLGEIAKAAGVTRGAIYWHFKDKSD--------LFSEIWELSESNIGELELEYQAKF 81

Query: 71 -AHPYKIPATVIAREIRDQGYRGGMTILRAFIRSLSVPQEQEPAVR 115
P + ++ + +L I + V+
Sbjct: 82 PGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3446  PF05272       30     0.028    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.0 bits (67), Expect = 0.028
Identities = 14/42 (33%), Positives = 22/42 (52%), Gaps = 4/42 (9%)

Query: 31 VISIIGRSGSGKSTLLRCMNGLEDYQDGSIKLGGMTVTNRDS 72
+ + G G GKSTL+ + GL+ + D +G T +DS
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIG----TGKDS 635


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3455  TCRTETB       47     7e-08    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 47.2 bits (112), Expect = 7e-08
Identities = 35/163 (21%), Positives = 65/163 (39%), Gaps = 5/163 (3%)

Query: 35 LETIATNFNLSVNQAGFIVTAAQLGYAVGLMFLVPLGDMFE-RRGLIVGMTLLAAGGMLI 93
L IA +FN ++ TA L +++G L D +R L+ G+ + G +I
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGS-VI 95

Query: 94 TAMSQNLTMMIIGTALTGLFSVVA--QLLVPLAATLAAPEKRGKVVGIIMSGLLLGILLA 151
+ + ++I A L++ + A E RGK G+I S + +G +
Sbjct: 96 GFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVG 155

Query: 152 RTVAGALATLGGWRTIYWVASALMFIMALVLWRCLPRYKQHTG 194
+ G +A W + + + I L + L + + G
Sbjct: 156 PAIGGMIAHYIHWSYLLLIPMITI-ITVPFLMKLLKKEVRIKG 197


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3458  PF05272       28     0.017    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.5 bits (63), Expect = 0.017
Identities = 23/105 (21%), Positives = 37/105 (35%), Gaps = 12/105 (11%)

Query: 12 LNSRAKRQKDFPYQEILLTRLSMHMHSKLLENRNKMLKAQGINETLFMALITLDAQESRS 71
+ + P QE+ L + L R A+G + + T
Sbjct: 745 PSPEDEEIYFRPEQELRLVETGVQGRLWALLTREGAPAAEGAAQKGYSVNTTF------- 797

Query: 72 IQPSELSAALG-----SSRTNATRIADELEKKGWIERRESHNDRR 111
+ ++L ALG SS ++ D L + GW RE+ RR
Sbjct: 798 VTIADLVQALGADPGKSSPMLEGQVRDWLNENGWEYLRETSGQRR 842


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3460  RTXTOXIND     68     1e-14    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 68.3 bits (167), Expect = 1e-14
Identities = 63/410 (15%), Positives = 118/410 (28%), Gaps = 99/410 (24%)

Query: 25 LLLTAIFIMIGVAYLIYWFLVLRHHQ---ETDNAYISGNQVQIMSQVPGSVVSVHFENTD 81
L A FIM + ++ + SG +I V + + +
Sbjct: 57 PRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGE 116

Query: 82 FVKSGDVLVTLDPTD-------AEQAFEQAK----------------------------- 105
V+ GDVL+ L + + QA+
Sbjct: 117 SVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYF 176

Query: 106 ----------------TALANSVRQTHQLIINSKQYQ-------ANIALKKTELSQAQND 142
+ Q +Q +N + + A I + ++
Sbjct: 177 QNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSR 236

Query: 143 LKRRVVLGAAAVIGREELQHARDAVEAAQASLDMAVQQYNANQALVLNTPLE-------- 194
L L I + + + A L + Q ++ +L+ E
Sbjct: 237 LDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLF 296

Query: 195 -KQPAIEQAAAKMRDAWLT---------LQRTKVVSPISGYVSRRSVQ-VGAEISSGTPL 243
+ + LT Q + + +P+S V + V G +++ L
Sbjct: 297 KNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETL 356

Query: 244 MAVVPADQ-LWIDANFKETQLANMRIGQPATI-VTDF----YGDDVVYQGKVVGLDMGTG 297
M +VP D L + A + + + +GQ A I V F YG GKV +
Sbjct: 357 MVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGY---LVGKVKNI----- 408

Query: 298 SAFSLLPAQNATGNWIKVVQRLPVRIALDEKQLKEHPLRIGLSSLVKVDT 347
+ ++ G V+ + K PL G++ ++ T
Sbjct: 409 NLDAIE--DQRLGLVFNVIISIEENCLSTG--NKNIPLSSGMAVTAEIKT 454


49  YpAngola_A3502  YpAngola_A3509  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A3502  3       16       1.686533   putative glycerol-3-phosphate acyltransferase
YpAngola_A3503  3       16       1.279667   50S ribosomal protein L32
YpAngola_A3504  2       14       0.592749   hypothetical protein
YpAngola_A3505  2       15       0.440748   Maf-like protein
YpAngola_A3506  2       15       -0.239056  23S rRNA pseudouridylate synthase C
YpAngola_A3507  1       16       -1.477102  ribonuclease E
YpAngola_A3508  0       13       -0.359602  IS285 transposase
YpAngola_A3509  2       11       -0.972829  major facilitator transporter
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3507  IGASERPTASE   41     3e-05    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 41.2 bits (96), Expect = 3e-05
Identities = 43/290 (14%), Positives = 81/290 (27%), Gaps = 27/290 (9%)

Query: 504 QLHEAEMAQPLEEATIERKRPEQPALATFSLPTEVPPEEAPTVAKAKPAVATPAAVSTDV 563
L+ E+ + T++ P +P+ P +A+ A P A +T
Sbjct: 979 DLYNPEVEK--RNQTVDTTNITTPNNIQADVPSV--PSNNEEIARVDEAPVPPPAPATPS 1034

Query: 564 EQPGFFSRLFSGLKNMFGASAEAEVQPAEVVKTDTSENRRNDRRNPR--RQNNGRKERND 621
E AE Q ++ V+ + + +N ++ + N
Sbjct: 1035 ETTE--------------TVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANT 1080

Query: 622 RTPREGRDNSSRYNTNRDNT--SRDNANRDGANRDNSNRDNSGRDNVSREGREDQRRNNR 679
+T + S T T + + A + + +++Q +
Sbjct: 1081 QTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQ 1140

Query: 680 RPAQPTTTSQGQTEVVEADKAQR----EEQPQRRGDRQRRRQDEKRQAPQEIKADVAEAP 735
A+P + + E EQP + Q V E P
Sbjct: 1141 PQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVE-QPVTESTTVNTGNSVVENP 1199

Query: 736 VIEEVQPEQEERQQVMQRRQRRQLNQKVRIQSANDELNTLESPVSAPVAQ 785
Q + + + + VR N E T S + VA
Sbjct: 1200 ENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVAL 1249



Score = 37.7 bits (87), Expect = 3e-04
Identities = 45/331 (13%), Positives = 91/331 (27%), Gaps = 40/331 (12%)

Query: 671 REDQRRNNRRPAQPTTTSQGQTEVVEADKAQREEQPQRRGDR-QRRRQDEKRQAPQEIKA 729
++R TT + Q + P + + R DE P
Sbjct: 984 EVEKRNQTVDTTNITTPNNIQAD-----------VPSVPSNNEEIARVDEAPVPPPAPAT 1032

Query: 730 DVAEAPVIEEVQPEQEERQQVMQRRQRRQLNQKVRIQSANDELNTLESPVSAPVAQVVVA 789
+ E ++ + + ++ Q R + + N + + VAQ
Sbjct: 1033 PSETTETVAENSKQESKTVEKNEQDATETTAQN-REVAKEAKSNVKANTQTNEVAQS--- 1088

Query: 790 EVQEEVKLLPQITAQTDDDSANERTTNNENGMPRRSRRSPRHLRVSGQRRRRYRDERYPA 849
E + T +T E+ +++ P+ ++ + + A
Sbjct: 1089 -GSETKETQTTETKETATVEKEEK----AKVETEKTQEVPKVTSQVSPKQEQSETVQPQA 1143

Query: 850 QSAMPLAGAFASPEMASGKVWVRYPVTPVVEQVVVEQIAIEQTTTVEQTAIVEQVSVANI 909
+ A E P + EQ A E ++ VEQ
Sbjct: 1144 EPARENDPTVNIKE----------PQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGN 1193

Query: 910 VTAQLPVEQVQNTVAEQESSATPSVMTTPTVAVATAAVTLAPQHKPGGSSSSAAAVPGRA 969
+ P T +S + + ++ P + + R+
Sbjct: 1194 SVVENPENTTPATTQPTVNSESSNKPKN------RHRRSVRSV--PHNVEPATTSSNDRS 1245

Query: 970 PIVAAVPVVAETTAAETVVAKTEAAIDAVAV 1000
VA + + T A A+ +A A+ V
Sbjct: 1246 T-VALCDLTSTNTNAVLSDARAKAQFVALNV 1275


50  YpAngola_A3535  YpAngola_A3588  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A3535  2       15       -1.283600  glycine betaine transporter membrane protein
YpAngola_A3536  1       14       -2.964504  glycine betaine/L-proline transport ATP-binding
YpAngola_A3537  1       15       -3.430629  ribonucleotide-diphosphate reductase subunit
YpAngola_A3538  0       17       -3.115033  ribonucleotide-diphosphate reductase subunit
YpAngola_A3539  2       25       -4.277368  ribonucleotide reductase stimulatory protein
YpAngola_A3540  7       30       -6.273603  glutaredoxin-like protein NrdH
YpAngola_A3541  8       33       -6.964408  acid shock protein
YpAngola_A3542  1       20       -3.723001  hypothetical protein
YpAngola_A3543  2       15       -1.652347  hypothetical protein
YpAngola_A3544  2       15       -0.831719  hypothetical protein
YpAngola_A3545  2       15       0.122026   cold shock protein
YpAngola_A3546  2       16       0.410815   hypothetical protein
YpAngola_A3547  2       16       0.269988   nickel ABC transporter substrate-binding
YpAngola_A3548  3       21       -0.412848  nickel ABC transporter permease
YpAngola_A3549  1       19       -1.258371  binding-protein-dependent transport protein
YpAngola_A3550  2       15       0.603591   nickel ABC transporter ATP-binding protein
YpAngola_A3551  2       15       0.081727   nickel ABC transporter ATP-binding protein
YpAngola_A3552  3       16       0.203687   hypothetical protein
YpAngola_A3553  3       15       0.601891   hypothetical protein
YpAngola_A3555  1       14       1.136558   urease subunit beta
YpAngola_A3556  1       11       -0.340912  urease subunit alpha
YpAngola_A3557  1       11       -2.376926  urease accessory protein UreE
YpAngola_A3558  0       12       -3.947616  urease accessory protein
YpAngola_A3559  0       15       -4.928253  urease accessory protein
YpAngola_A3561  1       16       -5.368624  urea transporter
YpAngola_A3562  0       19       -7.562472  high-affinity nickel transport protein
YpAngola_A3563  0       19       -7.240988  acid-resistance protein
YpAngola_A3564  1       15       -4.819354  voltage-gated potassium channel
YpAngola_A3565  1       15       -4.677877  camphor resistance protein CrcB
YpAngola_A3566  0       16       -3.105326  crcB-like protein
YpAngola_A3567  -1      16       -2.533871  hypothetical protein
YpAngola_A3568  -2      12       -0.773083  PTS system N,N'-diacetylchitobiose-specific
YpAngola_A3569  -1      13       -0.523420  PTS system N,N'-diacetylchitobiose-specific
YpAngola_A3571  -1      13       -1.255638  DNA-binding transcriptional regulator ChbR
YpAngola_A3572  -2      14       0.608475   hypothetical protein
YpAngola_A3573  -2      17       0.616566   replication initiation regulator SeqA
YpAngola_A3574  0       19       2.727239   phosphoglucomutase
YpAngola_A3575  1       21       3.009463   hypothetical protein
YpAngola_A3576  0       23       4.352655   hypothetical protein
YpAngola_A3577  0       22       4.992263   hypothetical protein
YpAngola_A3578  0       21       5.039372   DNA-binding transcriptional activator KdpE
YpAngola_A3579  1       22       5.194395   sensor protein KdpD
YpAngola_A3580  0       20       4.544167   potassium-transporting ATPase subunit C
YpAngola_A3581  0       18       4.263854   potassium-transporting ATPase subunit B
YpAngola_A3582  -1      14       1.755987   potassium-transporting ATPase subunit A
YpAngola_A3583  -1      16       0.246873   hypothetical protein
YpAngola_A3584  -1      16       1.156506   hypothetical protein
YpAngola_A3585  -1      16       1.764076   hypothetical protein
YpAngola_A3586  0       16       2.010307   deoxyribodipyrimidine photolyase
YpAngola_A3587  0       16       1.930648   3',5'-cyclic-nucleotide phosphodiesterase
YpAngola_A3588  2       17       2.634943   putative hydrolase-oxidase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3556  UREASE        977    0.0      Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 977 bits (2528), Expect = 0.0
Identities = 328/570 (57%), Positives = 417/570 (73%), Gaps = 5/570 (0%)

Query: 3 QISRQEYAGLFGPTTGDKIRLGDTNLFIEIEKDLRGYGEESVYGGGKSLRDGMGANNNLT 62
++SR YA +FGPT GDK+RL DT LFIE+EKD +GEE +GGGK +RDGMG + +T
Sbjct: 4 RMSRAAYANMFGPTVGDKVRLADTELFIEVEKDFTTHGEEVKFGGGKVIRDGMG-QSQVT 62

Query: 63 RDNGVLDLVITNVTIVDARLGVIKADVGIRDGKIAGIGKSGNPGVMDGVTQGMVVGVSTD 122
R+ G +D VITN I+D G++KAD+G++DG+IA IGK+GNP + GVT ++VG T+
Sbjct: 63 REGGAVDTVITNALILDH-WGIVKADIGLKDGRIAAIGKAGNPDMQPGVT--IIVGPGTE 119

Query: 123 AISGEHLILTAAGIDSHIHLISPQQAYHALSNGVATFFGGGIGPTDGTNGTTVTPGPWNI 182
I+GE I+TA G+DSHIH I PQQ AL +G+ GGG GP GT TT TPGPW+I
Sbjct: 120 VIAGEGKIVTAGGMDSHIHFICPQQIEEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHI 179

Query: 183 RQMLRSIEGLPVNVGILGKGNSYGRGPLLEQAIAGVVGYKVHEDWGATANALRHALRMAD 242
+M+ + + P+N+ GKGN+ G L+E + G K+HEDWG T A+ L +AD
Sbjct: 180 ARMIEAADAFPMNLAFAGKGNASLPGALVEMVLGGATSLKLHEDWGTTPAAIDCCLSVAD 239

Query: 243 EVDIQVSVHTDSLNECGYVEDTIDAFEGRTIHTFHTEGAGGGHAPDIIRVASQTNVLPSS 302
E D+QV +HTD+LNE G+VEDTI A +GRTIH +HTEGAGGGHAPDIIR+ Q NV+PSS
Sbjct: 240 EYDVQVMIHTDTLNESGFVEDTIAAIKGRTIHAYHTEGAGGGHAPDIIRICGQPNVIPSS 299

Query: 303 TNPTLPYGVNSQAELFDMIMVCHNLNPNVPADVSFAESRVRPETIAAENVLHDMGVISMF 362
TNPT PY VN+ AE DM+MVCH+L+P +P D++FAESR+R ETIAAE++LHD+G S+
Sbjct: 300 TNPTRPYTVNTLAEHLDMLMVCHHLSPTIPEDIAFAESRIRKETIAAEDILHDIGAFSII 359

Query: 363 SSDSQAMGRVGENWLRILQTADAMKAARGKLPEDAAGNDNFRVLRYVAKITINPAITQGV 422
SSDSQAMGRVGE +R QTAD MK RG+L E+ NDNFRV RY+AK TINPAI G+
Sbjct: 360 SSDSQAMGRVGEVAIRTWQTADKMKRQRGRLKEETGDNDNFRVKRYIAKYTINPAIAHGL 419

Query: 423 SHVIGSVEVGKMADLVLWDPRFFGAKPKMVIKGGMINWAAMGDPNASLPTPQPVFYRPMF 482
SH IGS+EVGK ADLVLW+P FFG KP MV+ GG I A MGDPNAS+PTPQPV YRPMF
Sbjct: 420 SHEIGSLEVGKRADLVLWNPAFFGVKPDMVLLGGTIAAAPMGDPNASIPTPQPVHYRPMF 479

Query: 483 GAMGKTLQDTCVTFVSQAALDDGVKEKAGLDRQVIAVKNCR-TISKRDLVRNDQTPNIEV 541
GA G++ ++ VTFVSQA+LD G+ + G+ ++++AV+N R I K ++ N TP+IEV
Sbjct: 480 GAYGRSRTNSSVTFVSQASLDAGLAGRLGVAKELVAVQNTRGGIGKASMIHNSLTPHIEV 539

Query: 542 DPETFAVKVDGVHATCEPIATASMNQRYFF 571
DPET+ V+ DG TCEP M QRYF
Sbjct: 540 DPETYEVRADGELLTCEPATVLPMAQRYFL 569


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3578  HTHFIS        74     9e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 74.1 bits (182), Expect = 9e-18
Identities = 27/112 (24%), Positives = 52/112 (46%), Gaps = 1/112 (0%)

Query: 1 MRIALESEGWRVFESETLQRGLIEAGTRKPDLIILDLGLPDGDGLNYIQDLRQWSA-IPI 59
+ AL G+ V + DL++ D+ +PD + + + +++ +P+
Sbjct: 19 LNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIKKARPDLPV 78

Query: 60 IVLSARNNEEDKVAALDAGADDYLSKPFGISELLARVRVALRRHSGASQESP 111
+V+SA+N + A + GA DYL KPF ++EL+ + AL +
Sbjct: 79 LVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRPSKLE 130


51  YpAngola_A3603  YpAngola_A3608  Y  N  N  Genomic Island
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A3603  -2      19       -3.841153  L-aspartate oxidase
YpAngola_A3604  -1      22       -4.711276  RNA polymerase sigma factor RpoE
YpAngola_A3605  -2      20       -3.972689  anti-RNA polymerase sigma factor SigE
YpAngola_A3606  -1      19       -3.551177  periplasmic negative regulator of sigmaE
YpAngola_A3607  -2      17       -3.807424  SoxR reducing system protein RseC
YpAngola_A3608  -2      14       -3.081247  hypothetical protein
52  YpAngola_A3824  YpAngola_A3835  Y  N  N  Genomic Island
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A3824  1       12       3.176760   5,10-methenyltetrahydrofolate synthetase
YpAngola_A3825  1       11       2.920274   Z-ring-associated protein
YpAngola_A3826  1       12       3.309487   hypothetical protein
YpAngola_A3827  1       13       3.429807   proline aminopeptidase P II
YpAngola_A3828  1       13       2.957854   2-octaprenyl-6-methoxyphenyl hydroxylase
YpAngola_A3829  1       13       1.894076   hypothetical protein
YpAngola_A3830  1       16       0.181922   glycine cleavage system aminomethyltransferase
YpAngola_A3831  1       15       0.015080   glycine cleavage system protein H
YpAngola_A3832  2       13       -0.386099  glycine dehydrogenase
YpAngola_A3833  2       13       -2.152860  hypothetical protein
YpAngola_A3834  2       17       -1.537352  hypothetical protein
YpAngola_A3835  2       16       -0.218906  hemagglutination repeat-containing protein
53  YpAngola_A3854  YpAngola_A3870  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A3854  2       17       2.568368   hypothetical protein
YpAngola_A3856  3       17       2.076465   hypothetical protein
YpAngola_A3857  4       17       1.844396   RelE/ParE family plasmid stabilization system
YpAngola_A3858  6       17       2.150125   helix-turn-helix DNA binding domain-containing
YpAngola_A3859  6       21       2.885007   hypothetical protein
YpAngola_A3860  5       17       1.837765   D5 family nucleoside triphosphatase
YpAngola_A3861  3       18       -1.441716  hypothetical protein
YpAngola_A3862  3       19       -1.129204  hypothetical protein
YpAngola_A3863  3       21       -1.475141  AlpA family protein
YpAngola_A3866  0       18       -1.749295  hypothetical protein
YpAngola_A3867  0       18       -1.502339  hypothetical protein
YpAngola_A3868  -1      18       -2.657640  hypothetical protein
YpAngola_A3869  1       19       -3.602092  hypothetical protein
YpAngola_A3870  1       19       -3.274727  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3862  PHPHTRNFRASE  23     0.048    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.
Length = 572

Score = 23.2 bits (50), Expect = 0.048
Identities = 11/34 (32%), Positives = 20/34 (58%), Gaps = 3/34 (8%)

Query: 11 SHEQVVARMLKKPAV---RAEYERLERQDFAIID 41
SH +++R L+ PAV + E+++ D I+D
Sbjct: 189 SHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVD 222


54  YpAngola_A3928  YpAngola_A3940  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A3928  1       22       4.787789   HrpO family type III secretion protein
YpAngola_A3929  1       19       5.453500   type III secretion system protein
YpAngola_A3930  1       21       6.767851   type III secretion system protein
YpAngola_A3931  1       23       5.919071   hypothetical protein
YpAngola_A3932  1       23       5.747336   type III secretion system apparatus protein
YpAngola_A3933  1       22       5.587902   type III secretion system ATPase
YpAngola_A3935  1       21       1.384886   HrpE/YscL family type III secretion apparatus
YpAngola_A3936  0       23       -0.649321  hypothetical protein
YpAngola_A3937  -2      26       -2.276113  YscJ/HrcJ family type III secretion apparatus
YpAngola_A3938  -1      17       -2.040580  type III secretion apparatus protein, YscI/HrpB
YpAngola_A3939  -1      18       -2.362663  type III secretion system protein SsaH family
YpAngola_A3940  -1      15       -3.301291  AraC family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3928  TYPE3IMQPROT  69     3e-19    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.
Length = 86

Score = 68.6 bits (168), Expect = 3e-19
Identities = 32/79 (40%), Positives = 47/79 (59%)

Query: 10 IVHLATELLWLVLLLSLPVVVVASTVGLVISLVQALTQIQDQTLQFLIKLLAVSATLLMT 69
+V + L+LVL+LS +VA+ +GL++ L Q +TQ+Q+QTL F IKLL V L +
Sbjct: 4 LVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLL 63

Query: 70 YHWMGATLLNYTQQSFLQI 88
W G LL+Y +Q
Sbjct: 64 SGWYGEVLLSYGRQVIFLA 82


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3929  TYPE3IMPPROT  224    1e-76    Type III secretion system inner membrane P protein ...
		>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature.
Length = 224

Score = 224 bits (572), Expect = 1e-76
Identities = 86/220 (39%), Positives = 143/220 (65%), Gaps = 7/220 (3%)

Query: 4 LNSSYQLIALLFMLSVLPLLVVMGTAFLKLSVVFSLLRNALGVQQVPPNIAIYGLALVLT 63
+ + LIALL ++LP ++ GT F+K S+VF ++RNALG+QQ+P N+ + G+AL+L+
Sbjct: 1 MGNDISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLS 60

Query: 64 IFIMAPVGLDVQARLQNEELSNDIGALAHQIDQNALVPYRDFLQRNTDIEQVTFFNDIVQ 123
+F+M P+ D ++E+++ + + + L YRD+L + +D E V FF +
Sbjct: 61 MFVMWPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQL 120

Query: 124 NKWPE-------RYRDSVKPDSLLILMPAFTLSQLNEAFKIGLLLFLPFVAIDLIVSNIL 176
+ R +D ++ S+ L+PA+ LS++ AFKIG L+LPFV +DL+VS++L
Sbjct: 121 KRQYGEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVL 180

Query: 177 LAMGMMMVSPMTLSLPFKLLVFVLVDGWSLVLGQLVGSYL 216
LA+GMMM+SP+T+S P KL++FV +DGW+L+ L+ Y+
Sbjct: 181 LALGMMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQYM 220


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3930  TYPE3OMOPROT  52     1e-09    Type III secretion system outer membrane O protein ...
		>TYPE3OMOPROT#Type III secretion system outer membrane O protein family signature.
Length = 303

Score = 51.5 bits (123), Expect = 1e-09
Identities = 22/81 (27%), Positives = 37/81 (45%)

Query: 209 PPLAAVQLEDLPQTLVMEIGRLTLPLGEIKQLAVGQTLACQTHCYGEVNICLNGQSVGRG 268
L LP L + R + L E++ + Q L+ T+ V I NG +G G
Sbjct: 220 TAETLPGLNQLPVKLEFVLYRKNVTLAELEAMGQQQLLSLPTNAELNVEIMANGVLLGNG 279

Query: 269 SLLRCDEKLVVRIAQWGLQNG 289
L++ ++ L V I +W ++G
Sbjct: 280 ELVQMNDTLGVEIHEWLSESG 300


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3932  RTXTOXIND     29     0.005    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 29.0 bits (65), Expect = 0.005
Identities = 17/114 (14%), Positives = 40/114 (35%), Gaps = 7/114 (6%)

Query: 5 QQRTLQRLLALRQRQERRLRQQLGQLRREQQQLENGRRRHQQLCQQLQQLAQWCGMLTPR 64
++ + Q Q+ + L + R E+ + R++ L + + L
Sbjct: 187 LTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSL--- 243

Query: 65 EADEQKVLRQAVYQAERQAKKQLNAWVAQGRQQVSAIERQ--QARLRRNQREQE 116
+Q + + AV + E + + + + Q+ IE + A+ Q
Sbjct: 244 -LHKQAIAKHAVLEQENKY-VEAVNELRVYKSQLEQIESEILSAKEEYQLVTQL 295


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3937  FLGMRINGFLIF  63     8e-14    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 63.4 bits (154), Expect = 8e-14
Identities = 44/188 (23%), Positives = 70/188 (37%), Gaps = 7/188 (3%)

Query: 7 MLAIVLMTLSLSGCDME-LYSGLSEGEANQMLALLMLHQINAEKQIEKSGMVGLTVDKRQ 65
+ +V M L D L+S LS+ + ++A L I + V +
Sbjct: 35 VAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNIPYR--FANGSGA-IEVPADK 91

Query: 66 FINAVELLRQNGFPRQRFITVDELFPANQLVTSPTQEQAKMVFLKEQQLENMLSHMDGVI 125
L Q G P+ + EL + S EQ E +L + + V
Sbjct: 92 VHELRLRLAQQGLPKGGAVGF-ELLDQEKFGISQFSEQVNYQRALEGELARTIETLGPVK 150

Query: 126 HADVTVAMPM-SVDGKNPLPHTASVFIKYSPEVNLQSYQ-SQIKGLVRDAVPGIDYAKIS 183
A V +AMP S+ + +ASV + P L Q S + LV AV G+ ++
Sbjct: 151 SARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQISAVVHLVSSAVAGLPPGNVT 210

Query: 184 VVMQPANY 191
+V Q +
Sbjct: 211 LVDQSGHL 218


55  YpAngola_A4017  YpAngola_A4029  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A4017  0       24       3.414354   carbohydrate ABC transporter permease
YpAngola_A4018  0       20       3.535740   carbohydrate ABC transporter permease
YpAngola_A4019  0       19       4.009357   carbohydrate ABC transporter ATP-binding
YpAngola_A4020  1       16       4.725101   hypothetical protein
YpAngola_A4021  0       17       5.137367   hypothetical protein
YpAngola_A4022  0       16       5.587004   hypothetical protein
YpAngola_A4023  1       16       5.095076   carbon-phosphorus lyase complex accessory
YpAngola_A4024  1       18       5.583096   ribose 1,5-bisphosphokinase
YpAngola_A4025  1       19       5.795767   PhnM protein
YpAngola_A4026  0       24       6.028062   phosphonate ABC transporter ATP-binding protein
YpAngola_A4027  0       22       5.603607   phosphonate C-P lyase system protein PhnK
YpAngola_A4028  0       22       5.212374   PhnJ protein
YpAngola_A4029  -2      19       3.464369   PhnI protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A4019  PF05272       37     1e-04    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 37.0 bits (85), Expect = 1e-04
Identities = 18/78 (23%), Positives = 25/78 (32%), Gaps = 19/78 (24%)

Query: 33 VLVGPSGCGKSTLLRMIAGLEEISGGTVGINDKDVTDVEPKMRDIAMVFQSYALYPQMTV 92
VL G G GKSTL+ + GL+ S D +D Y
Sbjct: 600 VLEGTGGIGKSTLINTLVGLDFFS-------DTHFDI--GTGKDSYEQIAGIVAY----- 645

Query: 93 RENMGFALKMAKMSKADI 110
+ +M +AD
Sbjct: 646 --ELS---EMTAFRRADA 658


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A4025  UREASE        30     0.017    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 30.1 bits (68), Expect = 0.017
Identities = 25/87 (28%), Positives = 38/87 (43%), Gaps = 18/87 (20%)

Query: 289 LASLGVLDILSSD--------------YYPASLMDAAF-RIAHDE--SNRFTLPQAVNLV 331
L +G I+SSD + A M R+ + ++ F + + +
Sbjct: 350 LHDIGAFSIISSDSQAMGRVGEVAIRTWQTADKMKRQRGRLKEETGDNDNFRVKRYIAKY 409

Query: 332 TRNPARALGLNDR-GVIAEGKRADLIL 357
T NPA A GL+ G + GKRADL+L
Sbjct: 410 TINPAIAHGLSHEIGSLEVGKRADLVL 436


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A4026  PF05272       28     0.045    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.1 bits (62), Expect = 0.045
Identities = 13/47 (27%), Positives = 17/47 (36%), Gaps = 8/47 (17%)

Query: 64 CVVLHGHSGSGKSTLLRSLYANYLPDSGHI--------WIKHQGEWI 102
VVL G G GKSTL+ +L H + + G
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVA 644


56  YpAngola_A4091  YpAngola_A4116  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A4091  -1      18       3.053011   IS285 transposase
YpAngola_A4092  -2      17       3.538878   IS1541 transposase
YpAngola_A4093  -1      17       3.764675   phosphate transporter
YpAngola_A4094  -1      21       3.997068   hypothetical protein
YpAngola_A4095  -2      22       4.479442   sensory box histidine kinase/response regulator
YpAngola_A4096  -2      21       3.699735   sugar transport ATP-binding protein
YpAngola_A4097  -1      17       2.691888   sugar transport system permease
YpAngola_A4098  -1      14       2.463201   sugar-binding transport protein
YpAngola_A4099  -1      13       2.483501   hypothetical protein
YpAngola_A4101  0       12       2.255850   PfkB family kinase
YpAngola_A4102  0       13       1.650465   DNA-binding response regulator
YpAngola_A4103  1       13       1.171909   hypothetical protein
YpAngola_A4104  2       15       0.799828   LacI family sugar-binding transcriptional
YpAngola_A4105  1       15       -1.376911  high-affinity gluconate transporter GntT
YpAngola_A4106  1       20       -4.124975  hypothetical protein
YpAngola_A4107  0       15       -1.701886  thermoresistant gluconokinase
YpAngola_A4108  -1      17       -2.697074  hypothetical protein
YpAngola_A4109  0       18       -2.902855  hypothetical protein
YpAngola_A4110  0       17       -2.993791  hypothetical protein
YpAngola_A4111  4       12       1.609815   putative dITP- and XTP- hydrolase
YpAngola_A4112  4       11       1.519546   aspartate-semialdehyde dehydrogenase
YpAngola_A4113  4       10       1.452125   hypothetical protein
YpAngola_A4114  3       10       1.955361   hypothetical protein
YpAngola_A4115  3       11       2.299110   hypothetical protein
YpAngola_A4116  2       11       1.981598   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A4095  HTHFIS        64     9e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 64.5 bits (157), Expect = 9e-13
Identities = 26/125 (20%), Positives = 58/125 (46%), Gaps = 17/125 (13%)

Query: 735 MADQLVLVLEDEPDVRQTLCEQLHQLGYLTLETGDSRQALALMADVPDISIVISDLMLPG 794
M +LV +D+ +R L + L + GY T ++ +A +V++D+++P
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD-GDLVVTDVVMPD 59

Query: 795 DMTGAEVLQQARSVYPHLKLLLISGQD---------LRRSKNFMPEVELLRKPFNQQQLV 845
++L + + P L +L++S Q+ + + +++P KPF+ +L+
Sbjct: 60 -ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLP------KPFDLTELI 112

Query: 846 QALQR 850
+ R
Sbjct: 113 GIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A4102  HTHFIS        99     2e-26    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 99 bits (249), Expect = 2e-26
Identities = 35/122 (28%), Positives = 61/122 (50%), Gaps = 1/122 (0%)

Query: 2 KPAILVVDDDTAICEVLRDVLNEHVFDVLLCHSGNEALQITATQPSIALILLDMMLPDIN 61
ILV DDD AI VL L+ +DV + + + A L++ D+++PD N
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDG-DLVVTDVVMPDEN 61

Query: 62 GLLVLQQVQKLRPSLPVVMLTGMGSESDMVVGLEMGADDYIAKPFNARVVVARVKAVLRR 121
+L +++K RP LPV++++ + + E GA DY+ KPF+ ++ + L
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 122 SE 123
+
Sbjct: 122 PK 123


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A4108  adhesinb      26     0.019    Adhesin B signature.
		>adhesinb#Adhesin B signature.

Length = 310

Score = 25.6 bits (56), Expect = 0.019
Identities = 6/27 (22%), Positives = 11/27 (40%)

Query: 20 PEEYERIVSAYAAWTRVCREYEFNDGY 46
P E + IV++ + + Y Y
Sbjct: 196 PGEKKMIVTSEGCFKYFSKAYNVPSAY 222


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A4109  SHAPEPROTEIN  23     0.047    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.
Length = 347

Score = 23.2 bits (50), Expect = 0.047
Identities = 10/35 (28%), Positives = 18/35 (51%)

Query: 7 DVVMSDIDMPESELKKGMKGVISITTRTIKPYILI 41
D V++D + E L+ +K V S + P +L+
Sbjct: 78 DGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLV 112


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A4116  INTIMIN       473    e-145    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 473 bits (1217), Expect = e-145
Identities = 267/884 (30%), Positives = 405/884 (45%), Gaps = 81/884 (9%)

Query: 11 YTLGPGDSIQSIAKKYNITVDELKKLNAYRTFSKP-FASLTTGDEIEVPRKESSF----- 64
YTL G+++ ++K +I + + LN + S+ G +I +P K+ F
Sbjct: 65 YTLKTGETVADLSKSQDINLSTIWSLNKHLYSSESEMMKAEPGQQIILPLKKLPFEYSAL 124

Query: 65 ---------------------FSNNPNENNKKDVDDLLARNAMGAG-----KLLSNDNTS 98
+P+ DD A +L S
Sbjct: 125 PLLGSAPLVAAGGVAGHTNKLTKMSPDVTKSNMTDDKALNYAAQQAASLGSQLQSRSLNG 184

Query: 99 DAASNMARSAVTNEINASSQQWLNQFGTARVQLNVDSDFKLDNSALDLLVPLKDSESSLL 158
D A + A N+ ++ Q WL +GTA V L ++F D S+LD L+P DSE L
Sbjct: 185 DYAKDTALGIAGNQASSQLQAWLQHYGTAEVNLQSGNNF--DGSSLDFLLPFYDSEKMLA 242

Query: 159 FTQLGVRNKDSRNTVNIGAGIRQYQGDWMYGANTFFDNDLTGKNRRVGVGAEVATDYLKF 218
F Q+G R DSR T N+GAG R + + M G N F D D +G N R+G+G E DY K
Sbjct: 243 FGQVGARYIDSRFTANLGAGQRFFLPENMLGYNVFIDQDFSGDNTRLGIGGEYWRDYFKS 302

Query: 219 SANTYFGLTGWHQSRDFSSYDERPADGFDIRTEAYLPAYPQLGGKLMYEKYRGDEVALFG 278
S N YF ++GWH+S + YDERPA+GFDIR YLP+YP LG KLMYE+Y GD VALF
Sbjct: 303 SVNGYFRMSGWHESYNKKDYDERPANGFDIRFNGYLPSYPALGAKLMYEQYYGDNVALFN 362

Query: 279 KDDRQKDPHAVTLGVNYTPVPLVTIGAEHREGKGNNNNTSVNVQLNYRMGQPWNDQIDQS 338
D Q +P A T+GVNYTP+PLVT+G ++R G GN N+ ++Q Y+ +PW+ QI+
Sbjct: 363 SDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNENDLLYSMQFRYQFDKPWSQQIEPQ 422

Query: 339 AVAANRTLAGSRYDLVERNNNIVLDYKKQELIHLVLPDRISGSGGGAITLTAQVRAKYGF 398
V RTL+GSRYDLV+RNNNI+L+YKKQ+++ L +P I+G+ + V++KYG
Sbjct: 423 YVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNIPHDINGTERSTQKIQLIVKSKYGL 482

Query: 399 SRIEWDATPLENAGG---STSPLTQSSLSVTLPFYQHILRTSNTHTISAVAYDAQGNASN 455
RI WD + L + GG + + LP Y SN + ++A AYD GN+SN
Sbjct: 483 DRIVWDDSALRSQGGQIQHSGSQSAQDYQAILPAYVQ--GGSNVYKVTARAYDRNGNSSN 540

Query: 456 RAVTSIEVTRPETMV----ISHLATTIDNATANGIATNTVQATVTDGDGQPIIGQLINFA 511
+ +I V +V ++ +A A+G T ATV +
Sbjct: 541 NVLLTITVLSNGQVVDQVGVTDFTADKTSAKADGTEAITYTATVKKNGVAQANVPVSFNI 600

Query: 512 VNTQATLSTTEARTGANGTASTTLTHTVSGVSRVSVTLGSSSRSVDTTFV--ADESTAEI 569
V+ A LS A T +G A+ TL G VS + +++ V D++ A I
Sbjct: 601 VSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTKASI 660

Query: 570 TAANLTVTTNDSVANGSDTNVVRAKVTDAYTNAVANQSVIFSASNGATVIDQTVITNAEG 629
T + +VANG D KV V+NQ V F+ + + + T T+ G
Sbjct: 661 T--EIKADKTTAVANGQDAITYTVKVMKG-DKPVSNQEVTFT-TTLGKLSNSTEKTDTNG 716

Query: 630 IADSTLTNTTAGVSVVTATLGGQS---QQVDTTFKPGSTAAISLVKLADRAVADGIDQNE 686
A TLT+TT G S+V+A + + + + F T +++ V +
Sbjct: 717 YAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIVGTGVKGKLPTVW 776

Query: 687 IQ-----VVLRDGTGN----AVPNVPMSIQADNGAIVVASTPNTGVDGTIN----ATFTN 733
+Q + G G + S+ A +G + + T + + AT+T
Sbjct: 777 LQYGQVNLKASGGNGKYTWRSANPAIASVDASSGQVTLKEKGTTTISVISSDNQTATYTI 836

Query: 734 LRAGESVVS------VTSPALVGMTMTMTFSADPRTAVVSTLAAIDNNAKADG-TDTNVV 786
+V + A+ + + + A K + + +
Sbjct: 837 ATPNSLIVPNMSKRVTYNDAVNTCKNFGGKLPSSQNELENVFKAWGAANKYEYYKSSQTI 896

Query: 787 RAWVVDANGNSVPGVSVTFDAGNGAVLAQNPV----VTDRNGYA 826
+WV ++ GV+ T+D ++ QNP+ ++ N YA
Sbjct: 897 ISWVQQTAQDAKSGVASTYD-----LVKQNPLNNIKASESNAYA 935



Score = 89.7 bits (222), Expect = 1e-19
Identities = 74/340 (21%), Positives = 120/340 (35%), Gaps = 29/340 (8%)

Query: 2552 NALADGVTRNQVRAHVVDSTGNSVADMAVTFTANRGAQLSKVTVLTDNNGDAVNTLTNSL 2611
+A ADG A V + + A LS + T+ +G A TL +
Sbjct: 569 SAKADGTEAITYTATVKKNGVAQANVPVSFNIVSGTAVLSANSANTNGSGKATVTLKSDK 628

Query: 2612 VGVTVVTAKLGTAGTPLTVDTVFTAGPLATLTLVTTV--NNAFADNSATNTVQATLKDV- 2668
G VV+AK TA ++ T +T + + A + + + T+K +
Sbjct: 629 PGQVVVSAK--TAEMTSALNANAVIFVDQTKASITEIKADKTTAVANGQDAITYTVKVMK 686

Query: 2669 SGNPIVGEVVAFAASNGATITATDGGVSNANGIVLATLTNGTAGVSTVTATIE----TLT 2724
P+ + V F + G +T+ ++ NG TLT+ T G S V+A + +
Sbjct: 687 GDKPVSNQEVTFTTTLGKLSNSTE--KTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVK 744

Query: 2725 ETTDTTFIAMKNLDVTVNGTTFNGDAGFPTTGFVGATFKVNSGGDNSLYDWSSSAPALVS 2784
F + D + PT + + G N Y W S+ PA+ S
Sbjct: 745 APEVEFFTTLTIDDGNIEIVGTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIAS 804

Query: 2785 VSGD-GVVTFNAVFPTGTPTITISATPKGGGSPLSYSFRVNQWFINNNGATLNRADAITH 2843
V G VT T TIS + N + N + DA+
Sbjct: 805 VDASSGQVTLK-----EKGTTTISVISSDNQTATYTIATPNSLIVPNMSKRVTYNDAVNT 859

Query: 2844 CENVGYTMPTSTQVTNAATWMSGKRAVGNLWSEWGDFSAY 2883
C+N G +P+S + N++ WG + Y
Sbjct: 860 CKNFGGKLPSSQNE------------LENVFKAWGAANKY 887



Score = 73.2 bits (179), Expect = 1e-14
Identities = 91/426 (21%), Positives = 137/426 (32%), Gaps = 45/426 (10%)

Query: 821 DRNGYAENTLTNLAIGTTTVKATTVTDPVGQTVNTHFVAGAVDTITLTVPVNGAVANGVN 880
DRNG N+ N+ + T + V D VG T T A A+G
Sbjct: 533 DRNG---NSSNNVLLTITVLSNGQVVDQVGVT-------------DFTADKTSAKADGTE 576

Query: 881 TNSVQAVVSDSGGNPVTGATVVFSSTNATAQVTTVIGTTGADGIATATLTNTVAGTSNVV 940
+ A V +G V F+ + TA ++ T G AT TL + G V
Sbjct: 577 AITYTATVKKNGVAQA-NVPVSFNIVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVS 635

Query: 941 ATI----DTVNANIDTAFVAGAVATITLTAPV-NGAVADGADTNQVDALVEDANGNPITG 995
A +NAN FV A+IT AVA+G D V P++
Sbjct: 636 AKTAEMTSALNANA-VIFVDQTKASITEIKADKTTAVANGQDAITYTVKV-MKGDKPVSN 693

Query: 996 AAVVFSSANGATILSSTMNTGVNGVASTLLTHTVAGTSNVVATVDTVNANI---DTTFVA 1052
V F++ + +ST T NG A LT T G S V A V V ++ + F
Sbjct: 694 QEVTFTT-TLGKLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFT 752

Query: 1053 GAVATITLTTPVNGAVADGANSNSVQAVVSDSDGNPVTGAAVVFSSANATAQITTVIGTT 1112
V V + +Q + + G S+ A A + G
Sbjct: 753 TLTIDDGNIEIVGTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIASVDASSGQV 812

Query: 1113 GADGIATATLTNTVAGTSNVVATIDTVNANIDTAFVAGAVATITLTAPVNGAVADGADTN 1172
T T++ + TI T N + I D +T
Sbjct: 813 TLKEKGTTTISVISSDNQTATYTIATPN------------SLIVPNMSKRVTYNDAVNTC 860

Query: 1173 QVDALVQDANGNAITGAAVVFSSANGADIIAPTMNTGVNGVASTLLTHTVAGTSNVVATI 1232
+ ++ N + + +AN + + S + S V +T
Sbjct: 861 KNFGGKLPSSQNELENVFKAWGAANKYEYY-----KSSQTIISWVQQTAQDAKSGVASTY 915

Query: 1233 DTISAN 1238
D + N
Sbjct: 916 DLVKQN 921



Score = 70.9 bits (173), Expect = 7e-14
Identities = 70/374 (18%), Positives = 125/374 (33%), Gaps = 34/374 (9%)

Query: 1921 NRVQSKDTTFIADRTTATIRASDLTITRNNALADGVATNAARVIVTDANGNPVPSMFVGY 1980
N V T + + +D T + +A ADG V
Sbjct: 540 NNVLLTITVLSNGQVVDQVGVTDFTADKTSAKADGTEAITYTATVKKNGVAQANVPVSFN 599

Query: 1981 TSDNGALLTPTSGMTDSSGTFSTTFTHTTAGISKVTAAIVTMGISQTKDAVFIADRSTAH 2040
A+L+ S T+ SG + T G V+A M + +AV D++ A
Sbjct: 600 IVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTKAS 659

Query: 2041 VSELIVVKNDSLANNSDRNIVQAHIKDAHGNVVTGMNVNFSATENVTLTANTVTTNSQGY 2100
++E+ K ++AN D + V+ V F+ T L+ +T T++ GY
Sbjct: 660 ITEIKADKTTAVANGQDAITYTVKVMK-GDKPVSNQEVTFT-TTLGKLSNSTEKTDTNGY 717

Query: 2101 AENTLRHNAPVTSAVTATVATDLVGLTEDVRFVAGAGARIELFRLNDGAVADGIQTNRVE 2160
A+ TL P S V+A V+ ++ + I +E
Sbjct: 718 AKVTLTSTTPGKSLVSAR--------------VSDVAVDVKAPEVEFFTTLT-IDDGNIE 762

Query: 2161 ARVYDVSDNLVPN------SNVVFSADNGG---QLVQNDVQTDALGSAYVTVSNINTGVT 2211
V L N+ S NG + + + S VT+ G T
Sbjct: 763 IVGTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIASVDASSGQVTLK--EKGTT 820

Query: 2212 KVTVTADGVSASTTTTFIADRDTATLVTDRFLITHDNAVANGVVENRVLLHLVDANDNSV 2271
++V + S + T T+ + +V + ++ + V + + ++ N +
Sbjct: 821 TISVIS---SDNQTATYTIATPNSLIVPN---MSKRVTYNDAVNTCKNFGGKLPSSQNEL 874

Query: 2272 SGVEVNFSATNGAS 2285
V + A N
Sbjct: 875 ENVFKAWGAANKYE 888



Score = 61.6 bits (149), Expect = 4e-11
Identities = 50/212 (23%), Positives = 73/212 (34%), Gaps = 9/212 (4%)

Query: 1340 VAGAVATITLTAPVNGAVADGVNTNSVQAVVSDSDGNAVTGATVVFSSANATAQITTVIG 1399
V V TA A ADG + A V +G A V F+ + TA ++
Sbjct: 554 VVDQVGVTDFTADKTSAKADGTEAITYTATVKK-NGVAQANVPVSFNIVSGTAVLSANSA 612

Query: 1400 TTGADGIATATLTNTVAGTSNVVATI----DTVNANIDTTFVAGELENIVVSIINNNALA 1455
T G AT TL + G V A +NAN + + A+A
Sbjct: 613 NTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTKASITEIKADKTTAVA 672

Query: 1456 NGADTNIVEAFVTDRFGNGVANQSLIFGTNGASIVGSSTVTTNLDGRVRASATHTVAGSS 1515
NG D V V+NQ + F T + +ST T+ +G + + T T G S
Sbjct: 673 NGQDAITYTVKVMKG-DKPVSNQEVTFTTTL-GKLSNSTEKTDTNGYAKVTLTSTTPGKS 730

Query: 1516 NTVIAISGAHQGYA--RVTFVADVSTAQLKLT 1545
+S V F ++ +
Sbjct: 731 LVSARVSDVAVDVKAPEVEFFTTLTIDDGNIE 762



Score = 59.7 bits (144), Expect = 2e-10
Identities = 66/358 (18%), Positives = 123/358 (34%), Gaps = 30/358 (8%)

Query: 1633 VAGKAASIEMTMTKDNAVANNIDTNEVQVLVTDVDGNAINGAVVNLTSNSGMNITPNSVT 1692
V + + T K +A A+ + V N V + ++ NS
Sbjct: 554 VVDQVGVTDFTADKTSAKADGTEAITYTATVKKNGVAQANVPVSFNIVSGTAVLSANSAN 613

Query: 1693 TGSDGTATATLTHTLAGSLPINARIDQVSKTINATF--IADASTAQI--IAGDMFIIVND 1748
T G AT TL G + ++A+ +++ +NA D + A I I D
Sbjct: 614 TNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTKASITEIKADKTTA--- 670

Query: 1749 QVANGQAVNAVQARVTDSYGNPIKDQTVEFVLSNNGTIQYELDVTSVEGGVMVTFTNTLA 1808
VANGQ +V P+ +Q V F + G + + T G VT T+T
Sbjct: 671 -VANGQDAITYTVKVMKG-DKPVSNQEVTFT-TTLGKLSNSTEKTDTNGYAKVTLTSTTP 727

Query: 1809 GITNVTATVVSSGSS-RNIDTTFIADVTTAHIAASDLMVIVDDAVADNLDKNEVHARVTD 1867
G + V+A V + + F +T IV V L + +
Sbjct: 728 GKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIE----IVGTGVKGKLPTVWLQYGQVN 783

Query: 1868 AKGNVLSGQTVIFTSGNGAAITTVNGISDGDGLTKATLTHTLAGTSVVTARVGNRVQSKD 1927
K + +G+ ++ A + +T GT+ ++ + ++
Sbjct: 784 LKASGGNGKYTWRSANPAIASVDAS---------SGQVTLKEKGTTTISVISSD---NQT 831

Query: 1928 TTFIADRTTATIRASDLTITRNNALADGVATNAARVIVTDANGNPVPSMFVGYTSDNG 1985
T+ + I + +++ D V T ++ N + ++F + + N
Sbjct: 832 ATYTIATPNSLIVPN---MSKRVTYNDAVNTCKNFGGKLPSSQNELENVFKAWGAANK 886



Score = 59.3 bits (143), Expect = 2e-10
Identities = 81/378 (21%), Positives = 131/378 (34%), Gaps = 31/378 (8%)

Query: 675 DRAVADGIDQNEIQVVLRDGTGNAVPNVPMSIQADNG-AIVVASTPNTGVDGTINATFTN 733
A ADG + ++ G A NVP+S +G A++ A++ NT G T +
Sbjct: 568 TSAKADGTEAITYTATVKK-NGVAQANVPVSFNIVSGTAVLSANSANTNGSGKATVTLKS 626

Query: 734 LRAGESVVSVTSPALVGMTMTMTFSA----DPRTAVVSTLAAIDNNAKADGTDTNVVRAW 789
+ G+ VVS + MT + +A D A ++ + A A A+G D +
Sbjct: 627 DKPGQVVVSAKT---AEMTSALNANAVIFVDQTKASITEIKADKTTAVANGQDA-ITYTV 682

Query: 790 VVDANGNSVPGVSVTFDAGNGAVLAQNPVVTDRNGYAENTLTNLAIG--TTTVKATTVTD 847
V V VTF L+ + TD NGYA+ TLT+ G + + + V
Sbjct: 683 KVMKGDKPVSNQEVTFTT-TLGKLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAV 741

Query: 848 PVGQTVNTHFVAGAVDTITLTVPVNGAVANGVNTNSVQAVVSDSGGNPVTGATVVFSSTN 907
V F +D + + G V + T +Q + + G S+
Sbjct: 742 DVKAPEVEFFTTLTIDDGNIEIVGTG-VKGKLPTVWLQYGQVNLKASGGNGKYTWRSANP 800

Query: 908 ATAQVTTVIGTTGADGIATATLTNTVAGTSNVVATIDTVNANIDTAFVAGAVATITLTAP 967
A A V G T T++ + TI T N + I
Sbjct: 801 AIASVDASSGQVTLKEKGTTTISVISSDNQTATYTIATPN------------SLIVPNMS 848

Query: 968 VNGAVADGADTNQVDALVEDANGNPITGAAVVFSSANGATILSSTMNTGVNGVASTLLTH 1027
D +T + ++ N + + +AN S+ + S +
Sbjct: 849 KRVTYNDAVNTCKNFGGKLPSSQNELENVFKAWGAANKYEYYKSS-----QTIISWVQQT 903

Query: 1028 TVAGTSNVVATVDTVNAN 1045
S V +T D V N
Sbjct: 904 AQDAKSGVASTYDLVKQN 921



Score = 54.3 bits (130), Expect = 9e-09
Identities = 85/438 (19%), Positives = 141/438 (32%), Gaps = 75/438 (17%)

Query: 2150 VADGIQTNRVEARVYDVSDNLVPNSNVVFSADNGGQLVQNDVQTDALGSAYVTVSNINTG 2209
V G +V AR YD + N N + + + GQ+V G
Sbjct: 518 VQGGSNVYKVTARAYDRNGNSSNNVLLTITVLSNGQVVDQV------------------G 559

Query: 2210 VTKVTVTADGVSASTTTTFIADRDTATLVTDRFLITHDNAVANGVVENRVLLHLVDANDN 2269
VT T A T IT+ V N
Sbjct: 560 VTDFTADKTSAKADGTEA----------------ITYTATVKK--------------NGV 589

Query: 2270 SVSGVEVNFSATNG-ASINA-SAITDINGFAIGVLTNTLSGPSDVTVTLVTPGGTESLTV 2327
+ + V V+F+ +G A ++A SA T+ +G A L + P V V+ T T +L
Sbjct: 590 AQANVPVSFNIVSGTAVLSANSANTNGSGKATVTLKSD--KPGQVVVSAKTAEMTSALNA 647

Query: 2328 TPQFIADINTANIATGDFVIIDDGAVANSVDANEVRARVTDNQGNAIAGYSVVFSSQNGA 2387
D A+I AVAN DA +V ++ V F++ G
Sbjct: 648 NAVIFVDQTKASITE--IKADKTTAVANGQDAITYTVKVMKG-DKPVSNQEVTFTTTLGK 704

Query: 2388 TITTSGITGVDGWASAKLTHIKAGESGILARLSRPMATVHTLMPYFIADVSTATLQLFNF 2447
++ T +G+A LT G+S + AR+S V F ++
Sbjct: 705 LSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDD------ 758

Query: 2448 NPIPIIADGVMQFFVLGRV-FDANQNPVGGQQVAFSATNEVTLTESNGSISTPEGSVLLS 2506
I I+ GV + + G + T +N +I++ + S
Sbjct: 759 GNIEIVGTGVKGKLPTVWLQYGQVNLKASGGNGKY------TWRSANPAIASVDASS-GQ 811

Query: 2507 VTSTQAGVHPITGTLVSNNYTDTFGAAFIANKNTAQLSTLMVVDNNALADGVTRNQVRAH 2566
VT + G I+ N A + + + + D V +
Sbjct: 812 VTLKEKGTTTISVISSDNQT-----ATYTIATPNSLI-VPNMSKRVTYNDAVNTCKNFGG 865

Query: 2567 VVDSTGNSVADMAVTFTA 2584
+ S+ N + ++ + A
Sbjct: 866 KLPSSQNELENVFKAWGA 883



Score = 50.5 bits (120), Expect = 1e-07
Identities = 93/491 (18%), Positives = 162/491 (32%), Gaps = 61/491 (12%)

Query: 1900 LTKATLTHTLAGTSVVTARVGNRVQSKDTTFIADRTTATIRASDLTITRNNALADGVATN 1959
+ + H + GT T ++ V+SK + +R+ G +
Sbjct: 453 ILSLNIPHDINGTERSTQKIQLIVKSKYGLDRIVWDDSALRS-----------QGGQIQH 501

Query: 1960 AARVIVTDANGNPVPSMFVGYTSDNGALLTPTSGMTDSSGTFSTTFTHTTAGISKVTAAI 2019
+ D ++ Y + T+ D +G S T I
Sbjct: 502 SGSQSAQDYQ-----AILPAYVQGGSNVYKVTARAYDRNGNSSNNVLLT----------I 546

Query: 2020 VTMGISQTKDAVFIADRSTAHVSELIVVKNDSLANNSDRNIVQAHIKDAHGNVVTGMNVN 2079
+ Q D V + D + K + A+ ++ A +K +G + V+
Sbjct: 547 TVLSNGQVVDQVGVTDFTAD--------KTSAKADGTEAITYTATVKK-NGVAQANVPVS 597

Query: 2080 FSATENV-TLTANTVTTNSQGYAENTLRHNAPVTSAVTATVATDLVGL-TEDVRFVAGAG 2137
F+ L+AN+ TN G A TL+ + P V+A A L V FV
Sbjct: 598 FNIVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTK 657

Query: 2138 ARI-ELFRLNDGAVADGIQTNRVEARVYDVSDNLVPNSNVVFSADNGGQLVQNDVQTDAL 2196
A I E+ AVA+G +V D V N V F+ G+L + +TD
Sbjct: 658 ASITEIKADKTTAVANGQDAITYTVKV-MKGDKPVSNQEVTFTTT-LGKLSNSTEKTDTN 715

Query: 2197 GSAYVTVSNINTGVTKVTVTADGVSASTTTTFIADRDTATLVTDRFLITHDNAVANGVVE 2256
G A VT+++ G + V+ V+ + T T+ V GV
Sbjct: 716 GYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNI-----EIVGTGVKG 770

Query: 2257 NRVLLHLVDANDN-SVSGVEVNFSATNGASINASAITDINGFAIGVLTNTLSGPSDVTVT 2315
+ L N SG ++ ++ A A D + + TL T++
Sbjct: 771 KLPTVWLQYGQVNLKASGGNGKYTWR--SANPAIASVDASSGQV-----TLKEKGTTTIS 823

Query: 2316 LVTPGGTESLTVTPQFIADINTANIATGDFVIIDDGAVANSVDANEVRARVTDNQGNAIA 2375
V ++ T T I T N + + ++V+ + + N +
Sbjct: 824 -VISSDNQTATYT------IATPN-SLIVPNMSKRVTYNDAVNTCKNFGGKLPSSQNELE 875

Query: 2376 GYSVVFSSQNG 2386
+ + N
Sbjct: 876 NVFKAWGAANK 886


57  YpAngola_A4128  YpAngola_A4180  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A4128   0      15       -3.006549  hypothetical protein
YpAngola_A4130  -1      12       -1.452060  *regulatory protein UhpC
YpAngola_A4131   1      13       -1.144917  sensory histidine kinase UhpB
YpAngola_A4132  -1      13       -0.756144  two-component system response regulator
YpAngola_A4133   0      12       -0.300432  phosphoethanolamine transferase
YpAngola_A4134   0      13       -0.818350  hypothetical protein
YpAngola_A4135   1      15       -1.819321  hypothetical protein
YpAngola_A4136   1      14       -2.578718  amino acid permease family protein
YpAngola_A4137   0      15       -3.264170  histidine ammonia-lyase
YpAngola_A4138  -1      17       -4.194156  urocanate hydratase
YpAngola_A4139   0      20       -5.272388  pyridoxal-phosphate dependent protein
YpAngola_A4140  -2      18       -3.797394  adenine phosphoribosyltransferase
YpAngola_A4141  -2      16       -3.054711  hypothetical protein
YpAngola_A4142  -1      15       -2.361170  hypothetical protein
YpAngola_A4143  -1      16       -2.443635  iron chelate ABC transporter periplasmic
YpAngola_A4144   2      24       -7.187224  iron chelate ABC transporter permease
YpAngola_A4145   3      26       -7.762982  iron chelate ABC transporter permease
YpAngola_A4147   4      25       -7.070402  iron chelate ABC transporter ATP-binding
YpAngola_A4149   3      21       -5.182550  helix-turn-helix DNA binding domain-containing
YpAngola_A4151   3      22       -5.154360  putative transcriptional regulator
YpAngola_A4152   2      17       -3.971878  hypothetical protein
YpAngola_A4153   1      13        0.790842  phage integrase family site specific
YpAngola_A4155   2      14        0.212648  *xylose operon regulatory protein
YpAngola_A4156   3      13       -0.387845  sugar transport system permease
YpAngola_A4159   2      14       -1.472248  hypothetical protein
YpAngola_A4161   2      14       -1.913146  xylose isomerase
YpAngola_A4162   1      14       -2.500368  xylulose kinase
YpAngola_A4163   2      18       -5.659447  hypothetical protein
YpAngola_A4164   1      19       -4.314208  pili assembly chaperone protein
YpAngola_A4167   1      19       -3.546750  fimbrial protein
YpAngola_A4168   0      20       -3.089219  hypothetical protein
YpAngola_A4169   0      15       -1.152742  hypothetical protein
YpAngola_A4170   0      15        0.023573  hypothetical protein
YpAngola_A4171   0      14        0.604164  transposase
YpAngola_A4172  -1      14        0.414595  integrase core subunit
YpAngola_A4173   0      17        0.771577  sugar phosphatase
YpAngola_A4174   1      17        1.122151  DNA gyrase subunit B
YpAngola_A4175   2      16        1.474392  recombination protein F
YpAngola_A4176   1      18        0.071912  DNA polymerase III subunit beta
YpAngola_A4177   1      19       -1.043570  chromosomal replication initiation protein
YpAngola_A4178   2      20        0.261526  50S ribosomal protein L34
YpAngola_A4179   2      19        0.128793  hypothetical protein
YpAngola_A4180   2      20       -0.270720  ribonuclease P protein component
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A4128  PF05860       59     4e-13    haemagglutination activity domain.
		>PF05860#haemagglutination activity domain.

Length = 117

Score = 59.0 bits (143), Expect = 4e-13
Identities = 20/128 (15%), Positives = 42/128 (32%), Gaps = 18/128 (14%)

Query: 53 TPPSTCRALTSYCIGMTETVVNIQAPDENGLSHNKYSKFDVVANGLFDVTTLNNRLAQEV 112
TP +T ++ ++ + L H+ + +F V +G
Sbjct: 4 TPDTTLPINSNITTEGNTRIIERGTQAGSNLFHS-FQEFSVPTSGTA------------- 49

Query: 113 NGNSFLQDKSATIILNEVNSSHASLLDGNLRVDGGNAHIIIANPAGINCRGCSFTNASHV 172
F + I++ V S +DG +R A++ + NP GI + +
Sbjct: 50 ---FFNNPTNIQNIISRVTGGSVSNIDGLIRA-NATANLFLINPNGIIFGQNARLDIGGS 105

Query: 173 TLTTGTPS 180
+ +
Sbjct: 106 FVGSTANR 113


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A4130  TCRTETB       45     4e-07    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 44.9 bits (106), Expect = 4e-07
Identities = 33/157 (21%), Positives = 68/157 (43%), Gaps = 7/157 (4%)

Query: 59 FNFIMPAMLTDLGLSMSDVGILGTLFYITYGCSKFVSGMISDRSNPRYFMGIGLVMTGII 118
N +P + D + + T F +T+ V G +SD+ + + G+++
Sbjct: 33 LNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFG 92

Query: 119 NILFGMSSSLLVLGALWILNAFFQGWG---WPPCSKILTSWY-SRSERGGWWAIWNTSHN 174
+++ + S +L I+ F QG G +P ++ + Y + RG + + +
Sbjct: 93 SVIGFVGHSFF---SLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVA 149

Query: 175 FGGALIPLLVGVITLHFSWRYGMIIPGIIGVVIGLLM 211
G + P + G+I + W Y ++IP I + + LM
Sbjct: 150 MGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLM 186


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A4131  PF06580       37     1e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 36.8 bits (85), Expect = 1e-04
Identities = 17/85 (20%), Positives = 33/85 (38%), Gaps = 10/85 (11%)

Query: 426 VTNAYRHGAASR-----IEINARQDNQQIYLTISDNGK-GIDLASITPGYGLRGIQSRVS 479
V N +HG A I + +DN + L + + G + + G GL+ ++ R+
Sbjct: 264 VENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQ 323

Query: 480 A-FGGNVSLSV---DNGTCLNVTLP 500
+G + + V +P
Sbjct: 324 MLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A4132  HTHFIS        61     3e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 60.6 bits (147), Expect = 3e-13
Identities = 34/173 (19%), Positives = 63/173 (36%), Gaps = 20/173 (11%)

Query: 4 RVVFIDDHDIVRSGFAQLLSLEEDIQVVGEFSSAKQARAGLPGLQANICICDISMPDENG 63
++ DD +R+ Q LS V S+A + ++ + D+ MPDEN
Sbjct: 5 TILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 64 LDLLKGLPS---GMGVIMLSMHDSPALVETALERGARGFLSKRCKPEDLISAVRTVGSGG 120
DLL + + V+++S ++ A E+GA +L K +LI +
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR----- 117

Query: 121 VYLMPEIAQQLARVAVDPLTRREREIAVLLAEG---MEVREIAESLGLSPKTV 170
A + L ++ L+ E+ + L + T+
Sbjct: 118 -------ALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTL 163


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A4143  FERRIBNDNGPP  58     1e-11    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 57.6 bits (139), Expect = 1e-11
Identities = 41/193 (21%), Positives = 83/193 (43%), Gaps = 7/193 (3%)

Query: 47 LNPQKVVILNPSVLDNADALHIKVAGVPQTSTHLPAFLSKYSGPE-YMNTGTLFEPDYEA 105
++P ++V L ++ AL I GV T + ++S+ P+ ++ G EP+ E
Sbjct: 33 IDPNRIVALEWLPVELLLALGIVPYGVADTINY-RLWVSEPPLPDSVIDVGLRTEPNLEL 91

Query: 106 LSQAKPDLIIAGGRAQDAYNKLSAIAPTIALDVDTQHFTQSLTQRT-EQLASIFGKEEEA 164
L++ KP ++ + L+ IAP + ++ +++ ++A + + A
Sbjct: 92 LTEMKPSFMVWSAGYGPSPEMLARIAPGRGFNFSDGKQPLAMARKSLTEMADLLNLQSAA 151

Query: 165 KTLLGNFSSQVNAIKQKSANAGS---AMVLMISGGKMSAYTPGSRFGFIFDELGFTPAAT 221
+T L + + ++K + G+ + +I M + P S F I DE G A
Sbjct: 152 ETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVFGPNSLFQEILDEYGIPNAWQ 211

Query: 222 FAESGRHGNVVTS 234
E+ G+ S
Sbjct: 212 -GETNFWGSTAVS 223


58  YpAngola_A0204  YpAngola_A0226  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A0204   2      14       -1.185022  flagellar hook-length control protein
YpAngola_A0205   3      16       -1.568742  putative flagellar protein lafD
YpAngola_A0206   3      16       -1.559115  flagellar protein FliS
YpAngola_A0207   4      18       -1.083088  flagellar hook-associated protein 2
YpAngola_A0209   5      25        0.551743  IS285 transposase
YpAngola_A0208   4      23       -0.183545  lateral flagellin
YpAngola_A0210   3      19       -1.012096  lateral flagellin
YpAngola_A0211   2      16       -0.933528  lateral flagellin
YpAngola_A0212   0      15       -2.541391  hypothetical protein
YpAngola_A0213  -1      15       -2.770558  lateral flagellin
YpAngola_A0214  -1      18       -3.393985  transcriptional regulator
YpAngola_A0215   1      21       -1.374497  hypothetical protein
YpAngola_A0216   0      19       -0.307165  hypothetical protein
YpAngola_A0217   1      17       -0.159575  flagellar hook-associated protein FlgL
YpAngola_A0218   2      21        2.422026  flagellar hook-associated protein FlgK
YpAngola_A0219   2      24        4.046436  peptidoglycan hydrolase
YpAngola_A0220   3      21        3.988855  flagellar basal body P-ring protein
YpAngola_A0221   1      20        3.618941  flagellar basal body L-ring protein
YpAngola_A0222   1      18        2.894361  flagellar basal body rod protein FlgG
YpAngola_A0223   1      18        2.329201  flagellar basal body rod protein FlgF
YpAngola_A0224   1      18        1.183173  flagellar hook protein FlgE
YpAngola_A0225   3      20        0.853170  flagellar basal body rod modification protein
YpAngola_A0226   0      21        3.222786  flagellar basal body rod protein FlgC
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0204  FLGHOOKFLIK   30     0.017    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 29.8 bits (66), Expect = 0.017
Identities = 27/114 (23%), Positives = 47/114 (41%), Gaps = 11/114 (9%)

Query: 253 QHATIRLDPPDMGKIDISIHFEGGKLQVNINANQGEVYRALQ-----------QSSAELR 301
Q A +RL P D+G++ IS+ + + Q+ + + V AL+ +S +L
Sbjct: 257 QSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAALPVLRTQLAESGIQLG 316

Query: 302 QTLIGQNSTEVNVQVSANSQQQQQQPRHSNHHGQADILAAQHFESQAEINADDG 355
Q+ I S Q ++ QQ Q+ H G+ D Q + + G
Sbjct: 317 QSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVSLQGRVTGNSG 370


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0208  FLAGELLIN     106    2e-27    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 106 bits (266), Expect = 2e-27
Identities = 66/329 (20%), Positives = 120/329 (36%), Gaps = 8/329 (2%)

Query: 5 IHTNASAKTAINSLSNAGLANAKSSQRLSTGFRINSPADNAAGLQITNRMEKFLNSAGQA 64
I+TN+ + N+L+ + + + + +RLS+G RINS D+AAG I NR + QA
Sbjct: 4 INTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQA 63

Query: 65 KQNIQESIAMLQIADGGLAESVKTLNAMKKLATQAANDTNSAADREAIQKEFSELGKELQ 124
+N + I++ Q +G L E L +++L+ QA N TNS +D ++IQ E + +E+
Sbjct: 64 SRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEID 123

Query: 125 NALNNTEYNSEKLFADGGKMRKELNFQSGTDAESSLKLDLNSVIAELTESVTKQAPKITG 184
N T++N K+ + +M Q G + ++ +DL + +
Sbjct: 124 RVSNQTQFNGVKVLSQDNQM----KIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 185 KSSSATGSLEKQAYDLDKAVTDTKSLVAGAEGVQKTLEHDFAASGNKAVAEIKIPEYKDA 244
+ S K D + +K
Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAAN----GQ 235

Query: 245 LGKTVPEVVIALGAVITSANSNQMKDAVAALKTTHDAAVKAEATFQAKNSTGGGVMNMQL 304
L E A+ T+ ++ +A A ++ T
Sbjct: 236 LTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDG 295

Query: 305 ADKDLAMKADKKLSDVIDAYGAFRATLGA 333
K +K++ + A A + A
Sbjct: 296 NGKVSTTINGEKVTLTVADITAGAANVDA 324



Score = 44.6 bits (105), Expect = 5e-07
Identities = 46/302 (15%), Positives = 88/302 (29%), Gaps = 10/302 (3%)

Query: 64 AKQNIQESIAMLQIADGGLAESVKTLNAMKKLATQAANDTNSAADREAIQKEFSELGKEL 123
+++ S + D + K + A + D+ + +L +
Sbjct: 181 TVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDD 240

Query: 124 QNALNNTEYNSEKLFADGGKMRKELNFQSGTDAESSLKLDLNSVIAELTESVTKQAPKIT 183
+ G K + E T++ K++
Sbjct: 241 AENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVS 300

Query: 184 GKSSSATGSLEKQAYDLDKAVTDTKSLVAGAEGVQKTLEHDFAASGNKAVAEIKIPEYKD 243
+ +L A D +L + + F K+ + +
Sbjct: 301 TTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDL-E 359

Query: 244 ALGKTVPEVVIALGAVITSANSNQMKDAVAALKTTHDAAVKAEATFQAKNSTGGGVMNMQ 303
A E I + +AN+ K +A D +T +++
Sbjct: 360 ANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAA-------- 411

Query: 304 LADKDLAMKADKKLSDVIDAYGAFRATLGANQNRLQSSSNNLDNMISNTAQALGSIKDTD 363
A K + + A R++LGA QNR S+ NL N ++N A I+D D
Sbjct: 412 -AAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDAD 470

Query: 364 FA 365
+A
Sbjct: 471 YA 472


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0210  FLAGELLIN     98     2e-24    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 98.2 bits (244), Expect = 2e-24
Identities = 67/327 (20%), Positives = 119/327 (36%), Gaps = 6/327 (1%)

Query: 5 IHTNASAKTAINSLSNAGLSNAKSSQRLSTGFRINSPADNAAGLQITNRMEKFLNSAGQA 64
I+TN+ + N+L+ + S + + +RLS+G RINS D+AAG I NR + QA
Sbjct: 4 INTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQA 63

Query: 65 KQNIQESIAMLQIADGGLAESVKTLNAMKKLATQAANDTNSAADREAIQKEFSELGKELQ 124
+N + I++ Q +G L E L +++L+ QA N TNS +D ++IQ E + +E+
Sbjct: 64 SRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEID 123

Query: 125 NALNNTEYNSEKLFADGGKMRKELNFQSGTDAESSLKLDLNSVIAELTESVTKKATPVKA 184
N T++N K+ + +M Q G + ++ +DL + + K
Sbjct: 124 RVSNQTQFNGVKVLSQDNQM----KIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 185 DVAGSTLEKEADVLDKATKAAKKAKEAAEGVQKTLETDFAVAGNKASAKITIPEYKDALG 244
G K + + K+ + L
Sbjct: 180 ATVGDL--KSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLT 237

Query: 245 KTVPEVVINSGTAITPANSTQMKDAVAALKATHDAAVKAEATFQAKNSTGGGVMNMQLAD 304
E T ++ +A A A ++ T
Sbjct: 238 TDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNG 297

Query: 305 KDLAMKADKKLSDVIDAYGAFRATLGA 331
K +K++ + A A + A
Sbjct: 298 KVSTTINGEKVTLTVADITAGAANVDA 324



Score = 61.2 bits (148), Expect = 3e-12
Identities = 57/336 (16%), Positives = 111/336 (33%), Gaps = 10/336 (2%)

Query: 64 AKQNIQESIAMLQIADGGLAESVKTLNAMKKLATQAANDTNSAADREAIQKEFSELGKEL 123
+++ S + D + K + A + D+ + +L +
Sbjct: 181 TVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDD 240

Query: 124 QNALNNTEYNSEKLFADGGKMRKELNFQSGTDAESSLKLDLNSVIAELTESVTKKATPVK 183
+ G K + E T++ V
Sbjct: 241 AENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVS 300

Query: 184 ADVAGSTLE-KEADVLDKATKAAKKAKEAAEGVQKTLETDFAVAGNKASAKITIPEYKDA 242
+ G + AD+ A ++++ V ++ +K + +A
Sbjct: 301 TTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEA 360

Query: 243 LGKTVPEVVINSGTAITPANSTQMKDAVAALKATHDAAVKAEATFQAKNSTGGGVMNMQL 302
E I A AN+ K +A D +T +++
Sbjct: 361 NNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAA--------- 411

Query: 303 ADKDLAMKADKKLSDVIDAYGAFRATLGANQNRLQSSSNNLDNMISNTAQALGSIKDTDF 362
A K + + A R++LGA QNR S+ NL N ++N A I+D D+
Sbjct: 412 AAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADY 471

Query: 363 ADEMKNHAQSEMLMQSSVMMLKKANAATQLISTLLQ 398
A E+ N +++++L Q+ +L +AN Q + +LL+
Sbjct: 472 ATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0211  FLAGELLIN     105    7e-27    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 105 bits (263), Expect = 7e-27
Identities = 56/204 (27%), Positives = 96/204 (47%), Gaps = 4/204 (1%)

Query: 5 IHTNGSAKTAINSLSKAGLANAKSSQRLSTGFRINSPADNAAGLQITNRMEKFLNSAGQA 64
I+TN + N+L+K+ + + + +RLS+G RINS D+AAG I NR + QA
Sbjct: 4 INTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQA 63

Query: 65 KQNIQESIAMLQIADGGLAESVKTLNTMKKLATQAANDTNSAADREAIQKEFTELGQELQ 124
+N + I++ Q +G L E L +++L+ QA N TNS +D ++IQ E + +E+
Sbjct: 64 SRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEID 123

Query: 125 NALNNTEYNAEKLFADGGKMRKELNFQSGTDANSSLKLDLNKVIEELTESVTEERKKVTG 184
N T++N K+ + +M Q G + ++ +DL K+ +
Sbjct: 124 RVSNQTQFNGVKVLSQDNQM----KIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 185 TSASATGSLEKQAFDLNEATTKAN 208
+ S K + AN
Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGAN 203



Score = 60.8 bits (147), Expect = 3e-12
Identities = 51/334 (15%), Positives = 102/334 (30%), Gaps = 7/334 (2%)

Query: 64 AKQNIQESIAMLQIADGGLAESVKTLNTMKKLATQAANDTNSAADREAIQKEFTELGQEL 123
+++ S + D + K + A + D+ + +L +
Sbjct: 181 TVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDD 240

Query: 124 QNALNNTEYNAEKLFADGGKMRKELNFQSGTDANSSLKLDLNKVIEELTESVTEERKKVT 183
+ G K + T++ + KV+
Sbjct: 241 AENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVS 300

Query: 184 GTSASATGSLEKQAFDLNEATTKANTALKEAEILQEKITTNLTKTFPASVDIPGYINAKG 243
T +L A A L+ ++ + + + N
Sbjct: 301 TTINGEKVTL-TVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTK------NESA 353

Query: 244 VPVAHEIIPSGTPINTGHIGKIQTAVAALRATHDTAAKTEDEFQAEHSTGGGVMNLLLRN 303
E + + + + A A KT + +
Sbjct: 354 KLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAA 413

Query: 304 KDRAMEADKKLSDVIDAYGAFRATLGANQNRLQSSSNNLDNMISNTTQALGSIKDTDFAD 363
K + + A R++LGA QNR S+ NL N ++N A I+D D+A
Sbjct: 414 KKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYAT 473

Query: 364 EMKNHAQSEMLMQSSVMMLKKANAATQLISTLLQ 397
E+ N +++++L Q+ +L +AN Q + +LL+
Sbjct: 474 EVSNMSKAQILQQAGTSVLAQANQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0213  FLAGELLIN     100    3e-25    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 100 bits (250), Expect = 3e-25
Identities = 64/328 (19%), Positives = 120/328 (36%), Gaps = 10/328 (3%)

Query: 5 IHTNASAKTAINSLSNEGLANAKSSQRLSTGFRINSPADNAAGLQITNRMEKFLNSAGQA 64
I+TN+ + N+L+ + + + +RLS+G RINS D+AAG I NR + QA
Sbjct: 4 INTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQA 63

Query: 65 KQNIQESIAMLQIADGGLAESVKTLNAMKKLATQAANDTNSAADREAIQKEFSELGKELQ 124
+N + I++ Q +G L E L +++L+ QA N TNS +D ++IQ E + +E+
Sbjct: 64 SRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEID 123

Query: 125 NALNNTEYNSEKLFADGGKMRKELNFQSGTDAESSLKLDLNSV------IAELTESVTKP 178
N T++N K+ + +M Q G + ++ +DL + + + K
Sbjct: 124 RVSNQTQFNGVKVLSQDNQM----KIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 179 GLKANSGGTAEEKELARLEGLAKDAKSTAATTKSAETTLLVDDATGKGGKGGNASIDIII 238
+ + + + + + + T K
Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTD 239

Query: 239 PAHKDTTGKDVAEKKIASGTAITPANITSMADAKAYWDKQEIETPKAVNEYVVKHSADSG 298
A +T K +GTA A ++ K ++
Sbjct: 240 DAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKV 299

Query: 299 VMNMQLADKDLAMKADKKLSDVIDAYGA 326
+ L + + +DA
Sbjct: 300 STTINGEKVTLTVADITAGAANVDAATL 327



Score = 63.9 bits (155), Expect = 4e-13
Identities = 56/338 (16%), Positives = 105/338 (31%), Gaps = 12/338 (3%)

Query: 64 AKQNIQESIAMLQIADGGLAESVKTLNAMKKLATQAANDTNSAADREAIQKEFSELGKEL 123
+++ S + D + K + A + D+ + +L +
Sbjct: 181 TVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDD 240

Query: 124 QNALNNTEYNSEKLFADGGKMRKELNFQSGTDAESSLKLDLNSVIAELTESVTKPGLKAN 183
+ G K + E T++ K +
Sbjct: 241 AENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVS 300

Query: 184 SGGTAEEKELARLEGLAKDAKSTAATTKSAETTLLVDDATGKGGKGGNASIDIIIPAHKD 243
+ E+ L + A A AAT +S++ +
Sbjct: 301 TTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESA------- 353

Query: 244 TTGKDVAEKKIASGTAITPANITSMADAKAYWDKQEIETPKAVNEYVVKHSADSGVMNMQ 303
A + + IT A+A +T + +
Sbjct: 354 KLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAA 413

Query: 304 LADKDLAM-KADKKLSDVIDAYGAFRATLGANQNRLQSSSNNLDNMISNTAQALGSIKDT 362
+ D LS V R++LGA QNR S+ NL N ++N A I+D
Sbjct: 414 KKSTANPLASIDSALSKVDAV----RSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDA 469

Query: 363 DFADEMKNHAQSEMLMQSSVMMLKKANAATQLISTLLQ 400
D+A E+ N +++++L Q+ +L +AN Q + +LL+
Sbjct: 470 DYATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0218  FLGHOOKAP1    158    4e-45    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 158 bits (402), Expect = 4e-45
Identities = 93/324 (28%), Positives = 157/324 (48%), Gaps = 8/324 (2%)

Query: 4 IRTAFSGMQATQAHLNATSMNIANMHTPGYSRQRAEQSAIGADGQGGVNAGNGVNVDGIR 63
I A SG+ A QA LN S NI++ + GY+RQ + + G GNGV V G++
Sbjct: 4 INNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGVQ 63

Query: 64 RLSQQYVVMQEWRANSQQQYYDAGEQYLNAVELMVSNESTSLATGLNNFFSSLSAATQLP 123
R ++ Q A +Q A + ++ ++ M+S ++SLAT + +FF+SL
Sbjct: 64 REYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLVSNA 123

Query: 124 DSPPMRQQIIESANAMALRFNNVNNFIVQQKKSIGQQRDITVKEINSLTRSIADYNQQIL 183
+ P RQ +I + + +F + ++ Q K + +V +IN+ + IA N QI
Sbjct: 124 EDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLNDQIS 183

Query: 184 K--NRSDGNNINDLLDKQELQIKKLSGLIETQVNQAEDGTYRISVKQGQPLVNGAVAAEL 241
+ G + N+LLD+++ + +L+ ++ +V+ + GTY I++ G LV G+ A +L
Sbjct: 184 RLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTARQL 243

Query: 242 AVDTSSVDTKITLHFSGATQGMNMSC------GGQLGGINDYELTTLKKLQDSTQEMAKT 295
A SS D T N+ G LGGI + L + +++ ++A
Sbjct: 244 AAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQLALA 303

Query: 296 VADKFNDQLGKGTDFTGAPGQDLF 319
A+ FN Q G D G G+D F
Sbjct: 304 FAEAFNTQHKAGFDANGDAGEDFF 327



Score = 61.9 bits (150), Expect = 2e-12
Identities = 47/182 (25%), Positives = 79/182 (43%), Gaps = 8/182 (4%)

Query: 275 NDYELTTLKKLQDSTQEMAKTVADKFNDQLGKGTDFTGAPG-QDLFVFNPSDPNGMLQLS 333
N +++T L +T A+ G FTG P D F P + ++ +
Sbjct: 368 NQWQVTRLA---SNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVS-DAIVNMD 423

Query: 334 AITAEQLALAAHGK-PAG--DNSNLFELLDIRKTPVTGMKNVPLDDAATALVGYIAITSN 390
+ ++ +A + AG DN N LLD++ T +DA +LV I +
Sbjct: 424 VLITDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGNKTA 483

Query: 391 RNHSELENAENTLNQATRYHESFSGVNNDEEAMNLMEYQRAYQSNMKVIATGDKLFSDLL 450
+ N + Q + +S SGVN DEE NL +Q+ Y +N +V+ T + +F L+
Sbjct: 484 TLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALI 543

Query: 451 AL 452
+
Sbjct: 544 NI 545


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0219  FLGFLGJ       45     4e-09    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 45.5 bits (107), Expect = 4e-09
Identities = 19/79 (24%), Positives = 41/79 (51%), Gaps = 4/79 (5%)

Query: 18 GDLQPQDLEQAAVQFEAVFMRTLLQQMRKAAEVLAADDDPFNSKQQRMMRDFYDDKLAST 77
G+ ++ A Q E +F++ +L+ MR A D F+S+ R+ YD ++A
Sbjct: 26 GEDPAANIRPVARQVEGMFVQMMLKSMRDAL----PKDGLFSSEHTRLYTSMYDQQIAQQ 81

Query: 78 LASQRSSGIANLLIQQLGS 96
+ + + G+A ++++Q+
Sbjct: 82 MTAGKGLGLAEMMVKQMTP 100


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0220  FLGPRINGFLGI  330    e-113    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 330 bits (848), Expect = e-113
Identities = 146/359 (40%), Positives = 210/359 (58%), Gaps = 12/359 (3%)

Query: 39 LVLPTASAQP--LGSLVDIQGVRGNQLVGYSLVVGLDGSGDK-NQVKFTGQSMANMLRQF 95
L P A A + + +Q R NQL+GY LVVGL G+GD FT QSM ML+
Sbjct: 19 LSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMRAMLQNL 78

Query: 96 GVQLPEKMDPKVKNVAAVAISATLPPGYGRGQSIDITVSSIGDAKSLRGGTLLLTQLRGA 155
G+ KN+AAV ++A LPP G +D+TVSS+GDA SLRGG L++T L GA
Sbjct: 79 GITTQGG-QSNAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLIMTSLSGA 137

Query: 156 DGEVYALAQGNVVVGGIKAEGDSGSSVTVNTPTVGRIPNGASIERQIPSDFQTNNQVVLN 215
DG++YA+AQG ++V G A+GD +++T T R+PNGA IER++PS F+ + +VL
Sbjct: 138 DGQIYAVAQGALIVNGFSAQGD-AATLTQGVTTSARVPNGAIIERELPSKFKDSVNLVLQ 196

Query: 216 LKRPSFKSANNVALALNR----AFGANTATAQSATNVMVNAPQDAGARVAFMSLLEDVQI 271
L+ P F +A VA +N +G A + + + V P+ A M+ +E++ +
Sbjct: 197 LRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPRVA-DLTRLMAEIENLTV 255

Query: 272 NAGEQSPRVVFNARTGTVVIGEGVMVRAAAVSHGNLTVNIREQKNVSQPNPLGGGKTVTT 331
+ +VV N RTGT+VIG V + AVS+G LTV + E V QP P G+T
Sbjct: 256 ET-DTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQPAPFSRGQTAVQ 314

Query: 332 PESDIEVTKGKNQMVMVPAGTRLRSIVNTINSLGASPDDIMAILQALYEAGALDAELVV 390
P++DI + +++ +V G LR++V +NS+G D I+AILQ + AGAL AELV+
Sbjct: 315 PQTDIMAMQEGSKVAIVE-GPDLRTLVAGLNSIGLKADGIIAILQGIKSAGALQAELVL 372


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0221  FLGLRINGFLGH  153    8e-49    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 153 bits (389), Expect = 8e-49
Identities = 74/221 (33%), Positives = 109/221 (49%), Gaps = 13/221 (5%)

Query: 4 FLILTPMVLALCGCESPALLVQKDDAEFAPPANLIQPATVTEGGGLFQPANS-----WSL 58
+ I + +VL+L GC A A P P G +FQ A L
Sbjct: 9 YAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVA---NGSIFQSAQPINYGYQPL 65

Query: 59 LQDRRAYRIGDILTVILDESTQSSKQAKTNFGKKNDMSLGVPEVLGKKLNKFGGSI---- 114
+DRR IGD LT++L E+ +SK + N + + G V FG +
Sbjct: 66 FEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNARADVE 125

Query: 115 -SGKRDFDGSATSAQQNMLRGSITVAVHQVLPNGVLVIRGEKWLTLNQGDEYMRVTGLVR 173
SG F+G + N G++TV V QVL NG L + GEK + +NQG E++R +G+V
Sbjct: 126 ASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRFSGVVN 185

Query: 174 ADDVARDNSVSSQRIANARISYAGRGALSDANSAGWLTRFF 214
++ N+V S ++A+ARI Y G G +++A + GWL RFF
Sbjct: 186 PRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFF 226


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0222  FLGHOOKAP1    42     2e-06    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 41.9 bits (98), Expect = 2e-06
Identities = 11/42 (26%), Positives = 20/42 (47%)

Query: 213 QLEQGALEGSNVQVVEEMVDMITVQRAYEMNAKMVSAADDML 254
QL S V + EE ++ Q+ Y NA+++ A+ +
Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIF 539



Score = 40.7 bits (95), Expect = 3e-06
Identities = 20/78 (25%), Positives = 35/78 (44%), Gaps = 14/78 (17%)

Query: 2 NSALWVSKTGLAAQDAKMGAISNNLANVNTDGFKRDRVVFADLFYQNQRTPGAPLDQNNT 61
+S + + +GL A A + SNN+++ N G+ R + A N+T
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMA--------------QANST 46

Query: 62 TPSGIQFGSGVQIVGTQK 79
+G G+GV + G Q+
Sbjct: 47 LGAGGWVGNGVYVSGVQR 64


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0224  FLGHOOKAP1    40     1e-05    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 39.9 bits (93), Expect = 1e-05
Identities = 20/60 (33%), Positives = 27/60 (45%), Gaps = 5/60 (8%)

Query: 2 SFSIANTALNAHTEQLNTISNNIANSATKGFKASR----TEFSSMYAQSQ-PLGVAVSGV 56
+ A + LNA LNT SNNI++ G+ S++ A GV VSGV
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGV 62



Score = 34.2 bits (78), Expect = 8e-04
Identities = 10/42 (23%), Positives = 22/42 (52%)

Query: 371 LENSNVDITAELVGLMTAQRNYQASTKIISTNDSMMNALFQV 412
S V++ E L Q+ Y A+ +++ T +++ +AL +
Sbjct: 504 QSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0226  FLGHOOKAP1    30     0.004    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 29.9 bits (67), Expect = 0.004
Identities = 6/37 (16%), Positives = 19/37 (51%)

Query: 102 VNVVSEMADMMSASRSFETNVEVLNSVKSMQQSVLKL 138
VN+ E ++ + + N +VL + ++ +++ +
Sbjct: 509 VNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


59  YpAngola_A0232  YpAngola_A0249  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A0232  1       19       0.361570   flagellar assembly protein H
YpAngola_A0233  1       17       0.066564   flagellar motor switch protein G
YpAngola_A0234  1       14       -0.483964  flagellar MS-ring protein
YpAngola_A0235  1       15       -1.458058  flagellar hook-basal body complex protein FliE
YpAngola_A0236  0       11       -2.523512  sigma-54 transcriptional regulator
YpAngola_A0238  0       12       -2.739428  flagellar switch protein
YpAngola_A0239  0       12       -2.454655  flagellar biosynthesis protein FliP
YpAngola_A0240  0       14       -3.959486  flagellar biosynthetic protein FliQ
YpAngola_A0241  0       14       -4.371909  flagellar biosynthetic protein FliR
YpAngola_A0242  0       14       -4.444608  flagellar biosynthesis protein FlhB
YpAngola_A0243  2       16       -3.468786  flagellar biosynthesis protein FlhA
YpAngola_A0244  2       16       -2.421037  putative lipoprotein
YpAngola_A0247  0       13       -0.796414  hypothetical protein
YpAngola_A0248  1       15       -0.449105  hypothetical protein
YpAngola_A0249  1       14       -0.326383  iron-enterobactin transporter periplasmic
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0232  FLGFLIH       59     9e-13    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 59.0 bits (142), Expect = 9e-13
Identities = 45/204 (22%), Positives = 100/204 (49%), Gaps = 11/204 (5%)

Query: 18 QFPPLRKVRQVAPSAADQTLDPAEYQKQLMAGFQEGISQGFDKGLAEGKEEGYQEGVRLG 77
+F P+ + + A+ +L+ Q Q+ A QG+ G+AEG+++G+++G + G
Sbjct: 21 EFVPIVEPEETIIEEAEPSLEQQLAQLQMQAH-----EQGYQAGIAEGRQQGHKQGYQEG 75

Query: 78 HDDGLKKGRIEGRQSELASFNDVIKPFSGYITQLHTYLETYEQRRRDELLQLVEKVTRQV 137
GL++G E +S+ A + ++ +++ T L+ + L+Q+ + RQV
Sbjct: 76 LAQGLEQGLAEA-KSQQAPIHARMQQL---VSEFQTTLDALDSVIASRLMQMALEAARQV 131

Query: 138 IRCELALQPAQLLTLVEEALAALPMVPQQLKVYLNPAEFGRINDV--APEKVQAWGLAAD 195
I + + L+ +++ L P+ + ++ ++P + R++D+ A + W L D
Sbjct: 132 IGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGATLSLHGWRLRGD 191

Query: 196 PDMVGGECRIVTETTEIDVGCQHR 219
P + G C++ + ++D R
Sbjct: 192 PTLHPGGCKVSADEGDLDASVATR 215


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0233  FLGMOTORFLIG  172    3e-53    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 172 bits (438), Expect = 3e-53
Identities = 85/334 (25%), Positives = 165/334 (49%), Gaps = 2/334 (0%)

Query: 15 KSDTKGRSRLEQASILLLSIGEEAAAMVMQQLSREEVVCVSQMMSRLHNIKLDQARQALD 74
D + ++A+ILL+SIG E ++ V + LS+EE+ ++ +++L I + L
Sbjct: 9 ILDVSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLL 68

Query: 75 DFFQDYREQSGINGASRSYLQAILNKALGNDIAKSVINGIYGDEIRHRMTRLQWVDTPQL 134
+F + Q I Y + +L K+LG A +IN + ++ D +
Sbjct: 69 EFKELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFVRRADPANI 128

Query: 135 VALIDQEHLQLQAVFLAFLPPDVAAAVLAYLDKDRQDDILYRIAKLDDVNRDVVDEL-DR 193
+ I QEH Q A+ L++L P A+ +L+ L + Q ++ RIA +D + +VV E+
Sbjct: 129 LNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERV 188

Query: 194 LIERGVAVLSEHGSKVIGIKQAANIVNRIPGNQQQ-LLDQLGERDEEVLNELKDEMYEFF 252
L ++ ++ SE + G+ I+N ++ +++ L E D E+ E+K +M+ F
Sbjct: 189 LEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFE 248

Query: 253 ILSRQSEATLQRLMDLIPMSDWAIALKGTEPALRQAIYDVLPKRQIQQLQNATQRTGAVP 312
+ + ++QR++ I + A ALK + +++ I+ + KR L+ + G
Sbjct: 249 DIVLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTR 308

Query: 313 VSRVEHIRKVIMAQVRELAEAGEIQVQLFAEQTM 346
VE ++ I++ +R+L E GEI + E+ +
Sbjct: 309 RKDVEESQQKIVSLIRKLEEQGEIVISRGGEEDV 342


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0234  FLGMRINGFLIF  283    1e-90    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 283 bits (724), Expect = 1e-90
Identities = 154/565 (27%), Positives = 258/565 (45%), Gaps = 62/565 (10%)

Query: 12 GQLGENTKTILMSAVALLVTAAIIFSLWRSSQGYTALFGSQENIPITQVVEVLEGEAIAY 71
+L N + L+ A + V + LW + Y LF + + +V L I Y
Sbjct: 17 NRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNIPY 76

Query: 72 RINPDNGQVLVAENQLGKARILLAAKGITATLPIGYELMDKESMLGSSQFIQNVRYKRSL 131
R +G + V +++ + R+ LA +G+ +G+EL+D+E G SQF + V Y+R+L
Sbjct: 77 RFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEK-FGISQFSEQVNYQRAL 135

Query: 132 EGELAQSMMALSAVEYARVHLGMSEASSFAISNHADNSASVVLRLRYGQTLSTEQVGAIV 191
EGELA+++ L V+ ARVHL M + S F + SASV + L G+ L Q+ A+V
Sbjct: 136 EGELARTIETLGPVKSARVHLAMPKPSLF-VREQKSPSASVTVTLEPGRALDEGQISAVV 194

Query: 192 QLVAGSIPGMKPANVRVVDQHGELLSQAYQANSEGVPSVKSGTELAHYLQSTTEKNIANL 251
LV+ ++ G+ P NV +VDQ G LL+ Q+N+ G + + A+ ++S ++ I +
Sbjct: 195 HLVSSAVAGLPPGNVTLVDQSGHLLT---QSNTSGRDLNDAQLKFANDVESRIQRRIEAI 251

Query: 252 LNSVIGANNYRISVSTQLDMSRIEETAEHYGPDPRIN------DENIQQENSNDDMAMGI 305
L+ ++G N V+ QLD + E+T EHY P+ + + E G+
Sbjct: 252 LSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPGGV 311

Query: 306 PGSLSNQPIPQSQAGQTPAAVSRSQAQ------------------------RKYIYDRNI 341
PG+LSNQP P ++A ++ AQ Y DR I
Sbjct: 312 PGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSNYEVDRTI 371

Query: 342 RHVRYPGYKLEKMTVAVVLN-KSLPVL--EQWTPEQQEELKRLIEDAAGIDVKRGDSLTI 398
RH + +E+++VAVV+N K+L T +Q ++++ L +A G KRGD+L +
Sbjct: 372 RHTKMNVGDIERLSVAVVVNYKTLADGKPLPLTADQMKQIEDLTREAMGFSDKRGDTLNV 431

Query: 399 NMMAFAVP-TLIDEPVMPWWQEPSTFRWAELLGIGLLSLLVLW----FGVRPLMKRYSRK 453
F+ E +P+WQ+ S G LL L+V W VRP + R +
Sbjct: 432 VNSPFSAVDNTGGE--LPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVRPQLTRRVEE 489

Query: 454 GSENLPLAISSASADEALDHVDTGVDGAESSPRTENAFSASSLWKSDDLPEQGSGLETKI 513
+ E + V+ + E Q G E
Sbjct: 490 AK---AAQEQAQVRQETEEAVEVRLSKDEQL--------------QQRRANQRLGAEVMS 532

Query: 514 AHLQQLAQSETERTAEVIKQWINSN 538
+++++ ++ A VI+QW++++
Sbjct: 533 QRIREMSDNDPRVVALVIRQWMSND 557


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0235  FLGHOOKFLIE   44     5e-09    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE

signature.
Length = 103

Score = 44.3 bits (104), Expect = 5e-09
Identities = 23/73 (31%), Positives = 35/73 (47%), Gaps = 1/73 (1%)

Query: 53 NNLSFSQVLNGAIKSVDQLQHVASEKQTAMDMGISD-DLTGTMLASQKASVAFSAMVQVR 111
+SF+ L+ A+ + Q A + +G L M QKASV+ +QVR
Sbjct: 29 PTISFAGQLHAALDRISDTQTAARTQAEKFTLGEPGVALNDVMTDMQKASVSMQMGIQVR 88

Query: 112 NKLTSALDDVMNT 124
NKL +A +VM+
Sbjct: 89 NKLVAAYQEVMSM 101


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0236  HTHFIS        375    e-130    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 375 bits (965), Expect = e-130
Identities = 127/345 (36%), Positives = 186/345 (53%), Gaps = 22/345 (6%)

Query: 14 HGFVANAPSSVSVFSLARRVAEFNVPVLVTGETGTGKECVAKYIHQKAMGDASPYIAVNC 73
V + + ++ + R+ + ++ +++TGE+GTGKE VA+ +H P++A+N
Sbjct: 137 MPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINM 196

Query: 74 AAIPESMLEAILFGYEKGAFTGAIASVAGKFEQANGGTLLLDEIGDMPLALQVKLLRVLQ 133
AAIP ++E+ LFG+EKGAFTGA G+FEQA GGTL LDEIGDMP+ Q +LLRVLQ
Sbjct: 197 AAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQ 256

Query: 134 EQEVERLGGHKPIPLDIRIIASTNKDLSVEIAEGRFRQDLYYRLSVVPIHILPLRERPED 193
+ E +GG PI D+RI+A+TNKDL I +G FR+DLYYRL+VVP+ + PLR+R ED
Sbjct: 257 QGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAED 316

Query: 194 ILPLVKAFINKYQSFLNVKIDITAEAQCELYKYTWPGNVRELENVIQRGIIMSNNGVI-- 251
I LV+ F+ + + EA + + WPGNVRELEN+++R + VI
Sbjct: 317 IPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVITR 376

Query: 252 ---------ELPSLGLPMAQGISSPVGETSLPF--------STIQPPDGENNIKLRGRLA 294
E+P + A S + + S
Sbjct: 377 EIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEM 436

Query: 295 QYQYIVDLLQRHQGNKSKTAAFLGITPRALRYRLANMREDGIDIE 339
+Y I+ L +GN+ K A LG+ LR + +RE G+ +
Sbjct: 437 EYPLILAALTATRGNQIKAADLLGLNRNTLRKK---IRELGVSVY 478


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0238  FLGMOTORFLIN  73     2e-19    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 72.6 bits (178), Expect = 2e-19
Identities = 35/77 (45%), Positives = 50/77 (64%)

Query: 54 RKMSLFSRIPVTLTLEVASVEIPLSELLTVNNDSVIELDKLAGEPLDIQVNGIMFGQAEV 113
+ + L IPV LT+E+ + + ELL + SV+ LD LAGEPLDI +NG + Q EV
Sbjct: 52 QDIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEV 111

Query: 114 VVINEKYGLRIININSQ 130
VV+ +KYG+RI +I +
Sbjct: 112 VVVADKYGVRITDIITP 128


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0239  FLGBIOSNFLIP  219    1e-73    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP

signature.
Length = 245

Score = 219 bits (559), Expect = 1e-73
Identities = 111/236 (47%), Positives = 155/236 (65%), Gaps = 4/236 (1%)

Query: 19 LVGGLLYSPLLLAQEGGITLFNTVQTATGQDYNVKIEILILMTLLGLLPIMMLMMTCFTR 78
V L +PL AQ GIT + GQ +++ ++ L+ +T L +P ++LMMT FTR
Sbjct: 9 PVLLWLITPLAFAQLPGIT--SQPLPGGGQSWSLPVQTLVFITSLTFIPAILLMMTSFTR 66

Query: 79 FIIVLAILRQALGLQQSPPNKVLTGIALALTLLVMRPVWTKIHQDAVIPFQQDEITLSQA 138
IIV +LR ALG +PPN+VL G+AL LT +M PV KI+ DA PF +++I++ +A
Sbjct: 67 IIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEKISMQEA 126

Query: 139 LGRAEAPLKNYMLAQTSTKSLDQMMAIA--QVSGEPQQQDLSVVTPAYVLSELKTAFQMG 196
L + PL+ +ML QT L +A P+ + ++ PAYV SELKTAFQ+G
Sbjct: 127 LEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELKTAFQIG 186

Query: 197 FMIYIPFLVIDLIVASILMAMGMMMLSPLIVSLPFKLMLFVLCDGWTLMVGTLTAS 252
F I+IPFL+IDL++AS+LMA+GMMM+ P ++LPFKLMLFVL DGW L+VG+L S
Sbjct: 187 FTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLAQS 242


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0240  TYPE3IMQPROT  46     3e-10    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein

family signature.
Length = 86

Score = 45.5 bits (108), Expect = 3e-10
Identities = 25/74 (33%), Positives = 37/74 (50%)

Query: 14 GLHLVLMISIVAIVPSLLIGLLVSIFQATTQINEQTLSFLPRLVMTMLVLIFAGKWMMIK 73
L+LVL++S + + +IGLLV +FQ TQ+ EQTL F +L+ L L W
Sbjct: 11 ALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLLSGWYGEV 70

Query: 74 LSDFTVSIFQQAAQ 87
L + + A
Sbjct: 71 LLSYGRQVIFLALA 84


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0241  TYPE3IMRPROT  105    3e-29    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein

family signature.
Length = 261

Score = 105 bits (263), Expect = 3e-29
Identities = 72/237 (30%), Positives = 128/237 (54%), Gaps = 3/237 (1%)

Query: 19 LPFVRILSFLHFCPVIRHKAFTRKAKIGTALLLAILITPMISQPVVSGELLSIENLLLAG 78
P +R+L+ + P++ ++ ++ K+G A+++ I P + V + S L LA
Sbjct: 18 WPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPANDV--PVFSFFALWLAV 75

Query: 79 EQILWGWLFGSMLHLVLAALEAAGQILSMNMGLGMAMMNDPTSGASTAVISQIIFTFSVL 138
+QIL G G + AA+ AG+I+ + MGL A DP S + V+++I+ ++L
Sbjct: 76 QQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLARIMDMLALL 135

Query: 139 IFFTLNGHLLFVTILLKSFSSWPIG-EAINDFSLRSLALSLGWILSSATLLALPTTFIML 197
+F T NGHL +++L+ +F + PIG E +N + +L + I + +LALP ++L
Sbjct: 136 LFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIFLNGLMLALPLITLLL 195

Query: 198 IVQGSFGLLNRISPTLNLFSLGFPIGMLFGLLCLLLLAINIPDHYLHLTNEILTQFE 254
+ + GLLNR++P L++F +GFP+ + G+ + L I HL +EI
Sbjct: 196 TLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFCEHLFSEIFNLLA 252


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0242  TYPE3IMSPROT  298    e-101    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein

family signature.
Length = 354

Score = 298 bits (764), Expect = e-101
Identities = 97/344 (28%), Positives = 173/344 (50%)

Query: 5 SGEKSEKPTAGKLSKARKKGDIPRSKDVTMAAGLVTSFILLSLFLPYYKALVSQSFVSVA 64
SGEK+E+PT K+ ARKKG + +SK+V A +V +L YY S+ + A
Sbjct: 2 SGEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPA 61

Query: 65 QLASQLDDQGALEQFLLANLFIFAKFLATLIPIPLFSMLATLIPGGWNFTPVKLIPDLKK 124
+ + Q L F L L ++ + ++ G+ + + PD+KK
Sbjct: 62 EQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKK 121

Query: 125 LSPLAGIKRIFSASNGTEVLKMLAKCSIVLYTLYLVVHSSLDDLLHLQTLPLEEAITQGF 184
++P+ G KRIFS + E LK + K ++ +++++ +L LL L T +E
Sbjct: 122 INPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLG 181

Query: 185 AQYHHILLYFIAIVVVFAAIDIPLSHHLFTKKMKMTKQEVKQEHKNNDGNPEIKSRVRQL 244
+++ VV + D ++ + K++KM+K E+K+E+K +G+PEIKS+ RQ
Sbjct: 182 QILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQF 241

Query: 245 QRQYAIGQINKTVPSADVIITNPTHFSVALKYAPEKASAPYIVAKGKDDIALYIRSIAQK 304
++ + + V + V++ NPTH ++ + Y + P + K D +R IA++
Sbjct: 242 HQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEE 301

Query: 305 HKIEIVEFPPLARAIYHTTKVNQQIPAQLYRAIAQVLTYVMQIK 348
+ I++ PLARA+Y V+ IPA+ A A+VL ++ +
Sbjct: 302 EGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQN 345


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0249  FERRIBNDNGPP  50     7e-09    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 49.6 bits (118), Expect = 7e-09
Identities = 21/97 (21%), Positives = 40/97 (41%), Gaps = 12/97 (12%)

Query: 129 QTEPNIKAVAKMRPDLIIISATGDDSTLELYDQLSAIAPTLVINYDDKS-----WQELTL 183
+TEPN++ + +M+P ++ SA S + L+ IAP N+ D ++
Sbjct: 84 RTEPNLELLTEMKPSFMVWSAGYGPS----PEMLARIAPGRGFNFSDGKQPLAMARKSLT 139

Query: 184 QLGQATGHEGDAEQVI---DKFARRLNEVKQKITLPP 217
++ + AE + + F R + K P
Sbjct: 140 EMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARP 176


60  YpAngola_A0837  YpAngola_A0844  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A0837  -2      11       0.058260   peptide chain release factor 3
YpAngola_A0838  -1      12       -0.616712  ribosomal-protein-alanine acetyltransferase
YpAngola_A0839  -1      13       -1.034020  DNA polymerase III subunit psi
YpAngola_A0840  -2      13       -2.428842  16S ribosomal RNA m2G1207 methyltransferase
YpAngola_A0842  0       19       -4.191767  *hypothetical protein
YpAngola_A0843  -1      12       -1.387352  diguanylate cyclase
YpAngola_A0844  -1      12       0.195110   pectinesterase A
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0837  TCRTETOQM     219    4e-66    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 219 bits (560), Expect = 4e-66
Identities = 115/462 (24%), Positives = 215/462 (46%), Gaps = 48/462 (10%)

Query: 12 KRRTFAIISHPDAGKTTITEKVLLFGHAIQTAGTVKGRGSSHHAKSDWMEMEKQRGISIT 71
K +++H DAGKTT+TE +L AI G+V ++D +E+QRGI+I
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGT----TRTDNTLLERQRGITIQ 57

Query: 72 TSVMQFPYGGCLVNLLDTPGHEDFSEDTYRTLTAVDCCLMVIDAAKGVEDRTRKLMEVTR 131
T + F + VN++DTPGH DF + YR+L+ +D +++I A GV+ +TR L R
Sbjct: 58 TGITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALR 117

Query: 132 LRDTPILTFMNKLDREIRDPMEVLDEVERELNIACSPITWPIGCGKSFKGVYHLHKDETY 191
P + F+NK+D+ D V +++ +L+ K +
Sbjct: 118 KMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVI------------------KQKVE 159

Query: 192 LYQSGKGHTIQEVRIVKGLNNPDLDVAVGEDLAKQFRQELELVQGASHEFDHEAFLSGDL 251
LY + E + + +DL +++ L + + F + L
Sbjct: 160 LYPNMCVTNFTESEQWDTVIEGN------DDLLEKYMSGKSLEALELEQEESIRFHNCSL 213

Query: 252 TPVFFGTALGNFGVDHMLDGLVEWAPAPMPRKTDTRVVVASEEKFTGFVFKIQANMDPKH 311
PV+ G+A N G+D++++ + + R + + G VFKI+ K
Sbjct: 214 FPVYHGSAKNNIGIDNLIEVITNKFYSSTHR---------GQSELCGKVFKIE--YSEK- 261

Query: 312 RDRVAFMRVVSGRFEKGMKLRQVRTKKDVVISDALTFMAGDRSHVEEAYAGDIIGLHNHG 371
R R+A++R+ SG +R + K+ + I++ T + G+ +++AY+G+I+ L N
Sbjct: 262 RQRLAYIRLYSGVLHLRDSVR-ISEKEKIKITEMYTSINGELCKIDKAYSGEIVILQNEF 320

Query: 372 ---TIQIGDTFTQGEDMKFTGIPNFAPELFRRIRLRDPLKQKQLLKGLVQLSEEG-AVQV 427
+GDT + + I N P L + P +++ LL L+++S+ ++
Sbjct: 321 LKLNSVLGDTKLLPQRER---IENPLPLLQTTVEPSKPQQREMLLDALLEISDSDPLLRY 377

Query: 428 FRPLSNNDLIVGAVGVLQFEVVSSRLKSEYNVEAVYESVNVS 469
+ + +++I+ +G +Q EV + L+ +Y+VE + V
Sbjct: 378 YVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIKEPTVI 419


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0838  SACTRNSFRASE  47     5e-10    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 47.2 bits (112), Expect = 5e-10
Identities = 22/80 (27%), Positives = 33/80 (41%), Gaps = 1/80 (1%)

Query: 18 DEATLFNIAIDPQYQRQGYGRLLLEHLIEQLEARNIVTLWLEVRASNARAIALYESLGFN 77
A + +IA+ Y+++G G LL IE + + L LE + N A Y F
Sbjct: 88 GYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147

Query: 78 EVSVRRNYYPS-ANGREDAI 96
+V Y + E AI
Sbjct: 148 IGAVDTMLYSNFPTANEIAI 167


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0839  PF04183       28     0.017    IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 27.9 bits (62), Expect = 0.017
Identities = 8/38 (21%), Positives = 14/38 (36%), Gaps = 2/38 (5%)

Query: 32 HLPEDTRLLIVA--QQLPEHGDPLLCDVLRSLGLTPHQ 67
L D +++A + E+ PL + GL
Sbjct: 358 WLKPDESPVLMATLMECDENNQPLAGAYIDRSGLDAET 395


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A0844  ANTHRAXTOXNA  31     0.006    Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 31.3 bits (70), Expect = 0.006
Identities = 25/105 (23%), Positives = 40/105 (38%), Gaps = 18/105 (17%)

Query: 106 WGTSGSSTVLVNAANFTAENLTIRNDFDFPANQAKAEGDPTKLKDTQAVALLLAEKSDKA 165
+ S S + VNA N I+ + N+ + E K KD+ + ++
Sbjct: 21 FAISSSQAIEVNAMNEHYTESDIKRNHKTEKNKTEKE----KFKDSINNLVKTEFTNETL 76

Query: 166 RFRQVKLEGYQDTL----------YSKTGSRSYFTDCDISGHVDF 200
K++ QD L YS+ G YFTD D+ H +
Sbjct: 77 ----DKIQQTQDLLKKIPKDVLEIYSELGGEIYFTDIDLVEHKEL 117


61  YpAngola_A1265  YpAngola_A1270  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A1265  0       17       2.336585   copper exporting ATPase
YpAngola_A1266  1       20       0.913437   Cu(I)-responsive transcriptional regulator
YpAngola_A1267  0       19       1.582891   hypothetical protein
YpAngola_A1268  -1      14       3.215619   hypothetical protein
YpAngola_A1269  -1      13       4.001047   putative thioredoxin
YpAngola_A1270  -1      13       4.544642   short chain dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1265  IGASERPTASE   36     7e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 36.2 bits (83), Expect = 7e-04
Identities = 61/300 (20%), Positives = 99/300 (33%), Gaps = 41/300 (13%)

Query: 18 AQRVKAALESREDVHHAEVNVHYAKVTGEADTHALIETIKQTGYQATEAQTPDVELHLSG 77
A S+++ E N A ET Q A EA+ +V+ +
Sbjct: 1035 ETTETVAENSKQESKTVEKNEQDAT-----------ETTAQNREVAKEAK-SNVKANTQT 1082

Query: 78 LSCGHCTETVRKALEAVSGVISADVTLESANVYGKADIQTLIAAVEQAGYHATQQGIDSP 137
++ + +A V E KA ++T ++ +Q SP
Sbjct: 1083 NEVAQSGSETKETQTTETK-ETATVEKE-----EKAKVET--EKTQEVPKVTSQV---SP 1131

Query: 138 KTEPLTHSAQSQPESLAAAPNTVPATNVALATS-TVSDTNTVLPTNTTSTTS----TADT 192
K E S QP++ A N P N+ S T + +T P TS+ T T
Sbjct: 1132 KQE---QSETVQPQAEPAREN-DPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTEST 1187

Query: 193 ASATSTAPVINPLPVTESVAQPAA-SEGESVQLLLTGMSCASCVSKVQNALQRVDGVQVA 251
T + V NP T + QP SE + S S V+ A +
Sbjct: 1188 TVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPA--TTSSNDRS 1245

Query: 252 RVNLAERSALVTGTQNNEALIAAVKNAGYGAEIIEDEGERRERQQQ------MSQASMKR 305
V L + ++ T ++A A A + + + E + +S SM +
Sbjct: 1246 TVALCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQHISQLEMNNEGQYNVWVSNTSMNK 1305


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1268  CHANLCOLICIN  29     0.021    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 29.3 bits (65), Expect = 0.021
Identities = 33/153 (21%), Positives = 61/153 (39%), Gaps = 20/153 (13%)

Query: 130 SQRDNINSRLLHIVDEATNPWGIKITRIEIRDVRPP--TELISAMNAQMKAERTKRADIL 187
+ RD + RL IV+EA + R P TEL A NA M+AE +
Sbjct: 85 ANRDALTQRLKDIVNEA----------LRHNASRTPSATELAHANNAAMQAEDERLRLAK 134

Query: 188 EAEGVRQAAILRAEGEKQSQILKAEGERQSA-------FLQAEARERAAEAEAQATKMVS 240
E R+ A + ++++ + E ER+ A +AE + AA +E ++
Sbjct: 135 AEEKARKEAEAAEKAFQEAEQRRKEIEREKAETERQLKLAEAEEKRLAALSEEAKAVEIA 194

Query: 241 EAIAAGDIQAINYFVAQKYTDALQHIGSANNSK 273
+ + + + T + S+ +++
Sbjct: 195 QKKLSAAQSEVVKMDGEIKT-LNSRLSSSIHAR 226


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1269  PF06057       29     0.013    Type IV secretory pathway VirJ component
		>PF06057#Type IV secretory pathway VirJ component

Length = 243

Score = 29.4 bits (66), Expect = 0.013
Identities = 15/68 (22%), Positives = 25/68 (36%), Gaps = 12/68 (17%)

Query: 20 QSMSVPV-----LFYFWSERSQHCLQLTPTLDKLAAEYAGQFILARVDCDAQPMVASQFG 74
Q PV L Y+W ++ +T + +Y +F +V ++ FG
Sbjct: 75 QQQGWPVVGWSSLKYYWKQKDPK--DVTQDTLAIIDKYQAEFGTQKV-----ILIGYSFG 127

Query: 75 LRSIPAVY 82
IP V
Sbjct: 128 AEVIPFVL 135


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1270  DHBDHDRGNASE  81     3e-20    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 80.9 bits (199), Expect = 3e-20
Identities = 51/191 (26%), Positives = 85/191 (44%), Gaps = 7/191 (3%)

Query: 3 KAVLITGCSSGIGLVAAQDLKNRGYRVLAACRKPDDVAKMVQ-LGLEG-----IELDLDD 56
K ITG + GIG A+ L ++G + A P+ + K+V L E D+ D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 57 SASVERAAAQVIELTGGRLYGLFNNGGFGLYGSLHTISRQQLEKQFSTNLFGTHQLTQLL 116
SA+++ A++ G + L N G G +H++S ++ E FS N G ++ +
Sbjct: 69 SAAIDEITARIEREMG-PIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 117 LPAMLPHGEGRIIQTSSVMGLVSTAGRGAYAASKYALEAWSDALRMELQSSGIHVSLIEP 176
M+ G I+ S V AYA+SK A ++ L +EL I +++ P
Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 177 GPISTHFTQNV 187
G T ++
Sbjct: 188 GSTETDMQWSL 198


62  YpAngola_A1393  YpAngola_A1399  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A1393  6       20       0.331084   hypothetical protein
YpAngola_A1394  6       18       0.093585   colicin uptake protein TolQ
YpAngola_A1395  6       14       0.481720   colicin uptake protein TolR
YpAngola_A1396  5       13       0.256621   cell envelope integrity inner membrane protein
YpAngola_A1397  1       12       -0.418440  translocation protein TolB
YpAngola_A1398  0       10       -0.550968  peptidoglycan-associated outer membrane
YpAngola_A1399  0       11       -0.697425  tol-pal system protein YbgF
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1393  BINARYTOXINA  25     0.013    Clostridial binary toxin A signature.
		>BINARYTOXINA#Clostridial binary toxin A signature.

Length = 454

Score = 25.0 bits (54), Expect = 0.013
Identities = 13/45 (28%), Positives = 22/45 (48%)

Query: 5 QLLLMKISLAQHFSSRPFIKGNVARMVNHATSIGIFKVDSYRPSK 49
++ + K S + S+ P G ++NH + I KVDSY+
Sbjct: 398 RINIPKDSPGAYLSAIPGYAGEYEVLLNHGSKFKINKVDSYKDGT 442


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1396  IGASERPTASE   60     7e-12    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 60.1 bits (145), Expect = 7e-12
Identities = 29/193 (15%), Positives = 63/193 (32%), Gaps = 4/193 (2%)

Query: 64 YNRQQQQQTDAKRAEQQRQKKAEQQAEELQQKQAAEQQRLKELEKERLQAQEDAK---LA 120
YN + +++ Q E R+ E ++
Sbjct: 981 YNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETV 1040

Query: 121 AEEQKKQVAEQQKQIAEQQKQAAEQQKIAAAAVAKAKEEQKQAETAAAQAKAEADKIVKA 180
AE K++ +K + + A+ +++A A + K + E A + ++ + + +
Sbjct: 1041 AENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTET 1100

Query: 181 QAEAQKKAEAEAKKEAA-VAAAAKKQADADAKKAVEVAEKAAADAAEKKAAADAEKKAAA 239
+ A + E +AK E K + K+ + A+ A + K+ +
Sbjct: 1101 KETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQS 1160

Query: 240 AKKVAAAAEAKKK 252
A E K
Sbjct: 1161 QTNTTADTEQPAK 1173



Score = 52.4 bits (125), Expect = 2e-09
Identities = 22/199 (11%), Positives = 68/199 (34%), Gaps = 5/199 (2%)

Query: 67 QQQQQTDAKRAEQQRQKKAEQQAEELQQKQAAEQQRLKELEKERLQAQ-EDAKLAAEEQK 125
+Q+ ++ EQ + Q E ++ ++ + + E + ++ ++ + ++
Sbjct: 1044 SKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKET 1103

Query: 126 KQVAEQQKQIAEQQKQAAEQQKIAAAAVAKAKEEQKQAETAAAQAKAEADKIVKAQAEAQ 185
V +++K E +K + + + + + E Q + A+ I + Q++
Sbjct: 1104 ATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTN 1163

Query: 186 KKAEAEAKKE----AAVAAAAKKQADADAKKAVEVAEKAAADAAEKKAAADAEKKAAAAK 241
A+ E + + VE E + +++ K
Sbjct: 1164 TTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRH 1223

Query: 242 KVAAAAEAKKKAAAEAAAS 260
+ + + A +++
Sbjct: 1224 RRSVRSVPHNVEPATTSSN 1242



Score = 44.7 bits (105), Expect = 5e-07
Identities = 32/218 (14%), Positives = 65/218 (29%), Gaps = 10/218 (4%)

Query: 47 GEVIDAVMVDPGAVTEQYNRQQQQQTDAKRAEQQRQKKAEQQAEELQQKQAAEQQRLKEL 106
EV + A T+ Q + + ++ A + EE + + + Q
Sbjct: 1066 REVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQ----- 1120

Query: 107 EKERLQAQEDAKLAAEEQKKQVAEQQKQ--IAEQQKQAAEQQKIAAAAVAKAKEEQKQAE 164
E ++ +Q K E + AE ++ K+ Q A AKE E
Sbjct: 1121 EVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVE 1180

Query: 165 TAAAQAKAEADKIVKAQAEAQKKAEAEAKKEAAVAAAAKKQADADAKKAVEVAEKA-AAD 223
++ + E + + + ++ K + + V A
Sbjct: 1181 QPVTESTTV--NTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPAT 1238

Query: 224 AAEKKAAADAEKKAAAAKKVAAAAEAKKKAAAEAAAST 261
+ + A + A ++A+ KA A
Sbjct: 1239 TSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVG 1276


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1398  OMPADOMAIN    115    9e-34    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 115 bits (290), Expect = 9e-34
Identities = 37/119 (31%), Positives = 54/119 (45%), Gaps = 4/119 (3%)

Query: 50 EEQARLQMQELQKNNIVYFGFDKYDIGSDFAQMLDAHAAFLRSN--PSDKVVVEGHADER 107
+Q + + V F F+K + + LD + L + VVV G+ D
Sbjct: 205 APAPEVQTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRI 264

Query: 108 GTPEYNIALGERRASAVKMYLQGKGVSADQISIVSYGKEKPAVLGHDEAAFAKNRRAVL 166
G+ YN L ERRA +V YL KG+ AD+IS G+ P V G+ K R A++
Sbjct: 265 GSDAYNQGLSERRAQSVVDYLISKGIPADKISARGMGESNP-VTGNTCDN-VKQRAALI 321


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1399  SYCDCHAPRONE  30     0.005    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 30.3 bits (68), Expect = 0.005
Identities = 21/112 (18%), Positives = 40/112 (35%), Gaps = 10/112 (8%)

Query: 152 YNVAVSLALEKKQYDQAITAFQSFVKQYPKSTYQPNANYWLGQLYYNKGKKDDAAYYYAV 211
Y++A + + +Y+ A FQ+ Y LG G+ D A + Y+
Sbjct: 40 YSLAFN-QYQSGKYEDAHKVFQALCVLDH---YDSRFFLGLGACRQAMGQYDLAIHSYSY 95

Query: 212 VVKNYPKSPKSSEAMFKVGVIMQDKGQSDKAKA---VYQQVIKQYPNTDAAK 260
K P+ F + KG+ +A++ + Q++I
Sbjct: 96 GAIMDIKEPRFP---FHAAECLLQKGELAEAESGLFLAQELIADKTEFKELS 144


63  YpAngola_A1583  YpAngola_A1587  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A1583  0       17       2.341841   arginine transporter ATP-binding subunit
YpAngola_A1584  0       16       2.572889   chorismate mutase
YpAngola_A1585  0       17       3.298523   putative lipoprotein
YpAngola_A1586  0       16       3.189363   NAD-dependent epimerase/dehydratase family
YpAngola_A1587  0       16       2.842105   NAD dependent epimerase/dehydratase family
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1583  PF05272       30     0.015    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.7 bits (66), Expect = 0.015
Identities = 9/18 (50%), Positives = 12/18 (66%)

Query: 48 LVLLGPSGAGKSSLLRVL 65
+VL G G GKS+L+ L
Sbjct: 599 VVLEGTGGIGKSTLINTL 616


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1585  PF04183       30     0.007    IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 29.8 bits (67), Expect = 0.007
Identities = 14/65 (21%), Positives = 23/65 (35%), Gaps = 9/65 (13%)

Query: 54 IQQIGGQQGLPDDNLSAQFRPYLSQSLYNDIQA--ARKQASNRTPAQVNKTQMISGDIFT 111
+ Q+ + D + A+ L +L D+Q AR+ S +N D
Sbjct: 78 LMQLKQVLSMSDATV-AEHMQDLYATLLGDLQLLKARRGLSASDLINLN------ADRLQ 130

Query: 112 SLREG 116
L G
Sbjct: 131 CLLSG 135


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1586  NUCEPIMERASE  76     9e-18    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 76.4 bits (188), Expect = 9e-18
Identities = 71/366 (19%), Positives = 126/366 (34%), Gaps = 73/366 (19%)

Query: 1 MKVLVTGATSGLGRNAVEYLRRQEISVIA---------TGRNQAMGALLTKLGAKFIHAD 51
MK LVTGA +G + + L V+ QA LL + G +F D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 52 LTDLVSSQAKAMLADVDTLWHCS-------SFTSPWGTEQAFALANVRATRRLGEWAAAY 104
L D + ++ S +P A+A +N+ + E
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPH----AYADSNLTGFLNILEGCRHN 116

Query: 105 GVENFIHISSPAIYFDYHHHRNIQEDFRPVRFANEFARSKAAGEEVIKLLALSNPQTH-- 162
+++ ++ SS ++Y + D + +A +K A E L+A + +
Sbjct: 117 KIQHLLYASSSSVYGL-NRKMPFSTDDSVDHPVSLYAATKKANE----LMAHTYSHLYGL 171

Query: 163 -FTILRPQGLFGPHDK--VMLPRLLHMIKHYGTLLLPRGGDALVDMTYLENAVHAM---- 215
T LR ++GP + + L + + ++ + G D TY+++ A+
Sbjct: 172 PATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQ 231

Query: 216 ---------WLATQSQKTLS---GRAYNITNQQPRPLRTIVQQLLDALDMKCRIRSVPYP 263
W S R YNI N P L +Q L DAL ++ + +P
Sbjct: 232 DVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQ 291

Query: 264 MMDIMARAMEKMSNKAEKEPVLTHYAVAKLNFDLTLDTLRAEQELGYRPIISLDEGILRT 323
D VL A DT + +G+ P ++ +G+
Sbjct: 292 PGD-----------------VLETSA----------DTKALYEVIGFTPETTVKDGVKNF 324

Query: 324 ARWLKE 329
W ++
Sbjct: 325 VNWYRD 330


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A1587  NUCEPIMERASE  36     2e-04    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 36.3 bits (84), Expect = 2e-04
Identities = 13/26 (50%), Positives = 18/26 (69%)

Query: 5 RILVLGASGYIGQHLVPLLSQQGHQV 30
+ LV GA+G+IG H+ L + GHQV
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQV 27


64  YpAngola_A2000  YpAngola_A2015  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A2000  3       19       4.557962   flagellar basal body rod modification protein
YpAngola_A2001  1       19       3.750276   flagellar hook protein FlgE
YpAngola_A2002  2       19       3.751354   flagellar basal body rod protein FlgF
YpAngola_A2003  2       18       3.145428   flagellar basal-body rod protein FlgG
YpAngola_A2005  1       17       2.861665   flagellar basal body P-ring protein
YpAngola_A2006  -1      17       3.055657   flagellar rod assembly protein/muramidase FlgJ
YpAngola_A2007  -1      17       3.009571   flagellar hook-associated protein FlgK
YpAngola_A2009  0       15       3.651040   flagellar hook-associated protein 3
YpAngola_A2008  -1      16       4.227620   flagellar hook-length control protein FliK
YpAngola_A2010  -1      17       4.135288   flagellar biosynthesis chaperone
YpAngola_A2011  -1      17       4.110604   flagellum-specific ATP synthase
YpAngola_A2012  0       15       2.852669   flagellar assembly protein H
YpAngola_A2013  0       14       0.835683   flagellar motor switch protein G
YpAngola_A2014  0       17       -0.401061  flagellar MS-ring protein
YpAngola_A2015  3       21       -3.758062  flagellar hook-basal body protein FliE
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2000  SYCECHAPRONE  29     0.008    Gram-negative bacterial type III secretion SycE cha...
		>SYCECHAPRONE#Gram-negative bacterial type III secretion SycE chaperone signature.

Length = 130

Score = 28.9 bits (64), Expect = 0.008
Identities = 15/34 (44%), Positives = 21/34 (61%), Gaps = 2/34 (5%)

Query: 40 LKNQDPTNPMENNELTTQLAQINTVSGIEKLNTT 73
L N+ P N ++NN L TQL + V G E+L T+
Sbjct: 89 LWNRQPLNSLDNNSLYTQLEML--VQGAERLQTS 120


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2001  FLGHOOKAP1    45     3e-07    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 45.3 bits (107), Expect = 3e-07
Identities = 22/87 (25%), Positives = 42/87 (48%), Gaps = 8/87 (9%)

Query: 6 AVSGMNAASSNLDVIGNNIANSATSGFKAGSVSFAD----MFAGSQTGMGVKVAGITQDF 61
A+SG+NAA + L+ NNI++ +G+ + A + AG G GV V+G+ +++
Sbjct: 7 AMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGVQREY 66

Query: 62 NDGTATTTNRRLDLAISQNGFFRMQDS 88
+ +L A +Q+ +
Sbjct: 67 DA----FITNQLRAAQTQSSGLTARYE 89



Score = 40.7 bits (95), Expect = 9e-06
Identities = 15/49 (30%), Positives = 28/49 (57%)

Query: 380 TLTSGALESSNVDLSKELVNMIVAQRNYQSNAQTIKTQDQILQTLVSLR 428
L++ S V+L +E N+ Q+ Y +NAQ ++T + I L+++R
Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2003  FLGHOOKAP1    41     2e-06    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 41.5 bits (97), Expect = 2e-06
Identities = 11/41 (26%), Positives = 22/41 (53%)

Query: 192 ETSNVNVAEELVNMIQTQRAYEINSKAVSTSDQMLQKLAQL 232
S VN+ EE N+ + Q+ Y N++ + T++ + L +
Sbjct: 505 SISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2005  FLGPRINGFLGI  391    e-138    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 391 bits (1007), Expect = e-138
Identities = 155/366 (42%), Positives = 217/366 (59%), Gaps = 9/366 (2%)

Query: 5 SLVTLLMVLLSLVWLPASAERIRDLVTVQGVRDNALIGYGLVVGLDGSGDQTMQTPFTTQ 64
+LV + LS A RI+D+ ++Q RDN LIGYGLVVGL G+GD +PFT Q
Sbjct: 10 ALVFSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQ 69

Query: 65 SLSNMLSQLGITVPPGTNMQLKNVAAVMVTAKLPAFSRAGQTIDVVVSSMGNAKSIRGGT 124
S+ ML LGIT G + KN+AAVMVTA LP F+ G +DV VSS+G+A S+RGG
Sbjct: 70 SMRAMLQNLGITTQGGQS-NAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGN 128

Query: 125 LLMTPLKGVDNQVYALAQGNVLVGGAGAAAGGSSVQVNQLAGGRISNGATIERELPTTFG 184
L+MT L G D Q+YA+AQG ++V G A +++ R+ NGA IERELP+ F
Sbjct: 129 LIMTSLSGADGQIYAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAIIERELPSKFK 188

Query: 185 TDGIINLQLNSEDFTLAQQVSDAINR----QRGFGSATAIDARTIQVLVPRGGSSQVRFL 240
+ LQL + DF+ A +V+D +N + G A D++ I V PR + R +
Sbjct: 189 DSVNLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPR-VADLTRLM 247

Query: 241 ADIQNIPINVDPGDAKVIINSRTGSVVMNRNVVLDSCAVAQGNLSVVVDKQNIVSQPDTP 300
A+I+N+ + D AKV+IN RTG++V+ +V + AV+ G L+V V + V QP P
Sbjct: 248 AEIENLTVETD-TPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQP-AP 305

Query: 301 FGGGQTVVTPNTQISVQQQGGVLQRVNASPNLNNVVRALNSLGATPIDLMSILQAMESAG 360
F GQT V P T I Q+G + V P+L +V LNS+G +++ILQ ++SAG
Sbjct: 306 FSRGQTAVQPQTDIMAMQEGSKVAIVE-GPDLRTLVAGLNSIGLKADGIIAILQGIKSAG 364

Query: 361 CLRAKL 366
L+A+L
Sbjct: 365 ALQAEL 370


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2006  FLGFLGJ       310    e-108    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 310 bits (796), Expect = e-108
Identities = 179/316 (56%), Positives = 232/316 (73%), Gaps = 6/316 (1%)

Query: 1 MSDLLAMSGAAYDARSLEALKRDAARDPEGNLKQVAQQVEGMFVQMMLKSMRAALPQDGV 60
+SD ++ AA+DA+SL LK A DP N++ VA+QVEGMFVQMMLKSMR ALP+DG+
Sbjct: 2 ISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDGL 61

Query: 61 MNSEQTKLYTSLYDQQIAQQMSA-KGLGLADMMVEQLS-GSTSASETAGTVPMMLDNEVL 118
+SE T+LYTS+YDQQIAQQM+A KGLGLA+MMV+Q++ E+ PM E +
Sbjct: 62 FSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLETV 121

Query: 119 QSMPAQALAQVMRRAIPTPPSSSMAAISPGNGNFVARMSIPAQIASQQSGIPHQLIMAQA 178
QAL+Q++++A+P S+ S F+A++S+PAQ+ASQQSG+PH LI+AQA
Sbjct: 122 VRYQNQALSQLVQKAVPRNYDDSLPGDSK---AFLAQLSLPAQLASQQSGVPHHLILAQA 178

Query: 179 ALESGWGQREIPTADGKSSYNVFGIKAGSSWNGPVSEITATEYEQGVAKKTKARFRVYGS 238
ALESGWGQR+I +G+ SYN+FG+KA +W GPV+EIT TEYE G AKK KA+FRVY S
Sbjct: 179 ALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSS 238

Query: 239 YVEAVSDYVKLLTQNPRYAHVAAAQSPEQGAHALQKAGYATDPQYAQKLVSVIQQMRSTG 298
Y+EA+SDYV LLT+NPRYA V A S EQGA ALQ AGYATDP YA+KL ++IQQM+S
Sbjct: 239 YLEALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSIS 298

Query: 299 EQAVKAYGGSDLSQLF 314
++ K Y ++ LF
Sbjct: 299 DKVSKTY-SMNIDNLF 313


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2007  FLGHOOKAP1    436    e-150    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 436 bits (1123), Expect = e-150
Identities = 315/552 (57%), Positives = 398/552 (72%), Gaps = 9/552 (1%)

Query: 3 NSLMNTAMSGLNAAQYALSTVSNNITNFQVAGYNRQNTVFAQNGGTITSAGFIGNGVTVT 62
+SL+N AMSGLNAAQ AL+T SNNI+++ VAGY RQ T+ AQ T+ + G++GNGV V+
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 60

Query: 63 GVNREYNAFITNQLRASQTQSSGLATYYQQISQIDNLLSNASNNLSTTMQDFFSNLQNLV 122
GV REY+AFITNQLRA+QTQSSGL Y+Q+S+IDN+LS ++++L+T MQDFF++LQ LV
Sbjct: 61 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 120

Query: 123 SNADDDAARKTVLGKAEGLVNQFQNADKYLRDMDDGVNQKITDSATQINNYAEQIAKLND 182
SNA+D AAR+ ++GK+EGLVNQF+ D+YLRD D VN I S QINNYA+QIA LND
Sbjct: 121 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 180

Query: 183 QITRLRG-SSGSEPNALLDQRDQLVTELNQIMAVTVTQQDGDAYNVSFAGGLSLVQGPNA 241
QI+RL G +G+ PN LLDQRDQLV+ELNQI+ V V+ QDG YN++ A G SLVQG A
Sbjct: 181 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA 240

Query: 242 YKVEAIPSSADATRLTLGYKRGNGEATEVDESRITTGSLGGTLKFRSEALDSARNQLGQL 301
++ A+PSSAD +R T+ Y G E+ E + TGSLGG L FRS+ LD RN LGQL
Sbjct: 241 RQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL 300

Query: 302 ALVMADSFNTQHNAGFDINGDEGEDFFSFADPTVLKNAKNQGNASITVEYKDTSKVKASD 361
AL A++FNTQH AGFD NGD GEDFF+ P VL+N KN+G+ +I D S V A+D
Sbjct: 301 ALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASAVLATD 360

Query: 362 YTVEFDGTDWQVTRLSDNTKVQTTPGVNADGDPTLEFEGVAIKIDNGTPGPQAKDKFTIK 421
Y + FD WQVTRL+ NT TP D + + F+G+ + P D FT+K
Sbjct: 361 YKISFDNNQWQVTRLASNTTFTVTP----DANGKVAFDGLELTFTG---TPAVNDSFTLK 413

Query: 422 TVSNVAANLQVAITDSSKIAAAGSADGGISDNTNAQALLDLQSKKLVEGK-TTLSGAYAG 480
VS+ N+ V ITD +KIA A D G SDN N QALLDLQS G + + AYA
Sbjct: 414 PVSDAIVNMDVLITDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYAS 473

Query: 481 LVSNVGNQTATAKTNSTAQANIVTQLTTEQQSISGVNLDEEYGDLQRFQQYYLANAQVLQ 540
LVS++GN+TAT KT+S Q N+VTQL+ +QQSISGVNLDEEYG+LQRFQQYYLANAQVLQ
Sbjct: 474 LVSDIGNKTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQ 533

Query: 541 AASTLFNALLSI 552
A+ +F+AL++I
Sbjct: 534 TANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2009  FLAGELLIN     40     4e-06    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 40.4 bits (94), Expect = 4e-06
Identities = 35/137 (25%), Positives = 63/137 (45%), Gaps = 7/137 (5%)

Query: 4 STSMLYQQNMQGITNAQSLWMQTGQQLSTGKRVVNPSDDPMAASQAVMVSQAESENSQYT 63
S S+L Q N+ ++ S ++LS+G R+ + DD AA QA+ +
Sbjct: 8 SLSLLTQNNLNKSQSSLS---SAIERLSSGLRINSAKDD--AAGQAIANRFTSNIKGLTQ 62

Query: 64 LARSFARQSSSLETT--VLAQTTSTIQSIQSLVISAKNDTLSDDDRASYATQLQGLKDQL 121
+R+ S +TT L + + +Q ++ L + A N T SD D S ++Q +++
Sbjct: 63 ASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEI 122

Query: 122 LNQANTTDGNGRYIFAG 138
+N T NG + +
Sbjct: 123 DRVSNQTQFNGVKVLSQ 139


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2010  FLGFLIJ       112    9e-35    Flagellar FliJ protein signature.
		>FLGFLIJ#Flagellar FliJ protein signature.

Length = 147

Score = 112 bits (281), Expect = 9e-35
Identities = 82/144 (56%), Positives = 102/144 (70%)

Query: 1 MKSQSPLVTLCDLAQKAVEQASTQLGHVRQSYQNAEQQLTMLLTYQDEYRERLNDTLCNG 60
M L TL DLA+K VE A+ LG +R+ Q AE+QL ML+ YQ+EYR LN + G
Sbjct: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60

Query: 61 MASSSWQNYQQFIQTLEQAIDQHRKQLAQWSIKVEQAVKYWQEKQQRLNAFETLQERAET 120
+ S+ W NYQQFIQTLE+AI QHR+QL QW+ KV+ A+ W+EK+QRL A++TLQER T
Sbjct: 61 ITSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120

Query: 121 TQRQQENRLDQKLMDEFAQRASQR 144
ENRLDQK MDEFAQRA+ R
Sbjct: 121 AALLAENRLDQKKMDEFAQRAAMR 144


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2012  FLGFLIH       221    5e-75    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 221 bits (563), Expect = 5e-75
Identities = 128/233 (54%), Positives = 167/233 (71%), Gaps = 7/233 (3%)

Query: 6 NALPWQPWSLKDFASQSEAPLSESMPDISLLFPNEPMEATAAVDEQQVLVNLQLEAEKQG 65
+ LPW+ W+ D A P +E +P + P E + A +Q L LQ++A +QG
Sbjct: 3 DNLPWKTWTPDDLAP----PQAEFVPIVE---PEETIIEEAEPSLEQQLAQLQMQAHEQG 55

Query: 66 RQQGFAKGLQEGLDKGYQTGLEEGHQQALADAQQQLAPMTAHWQVMVTDFQNTLDTLDSV 125
Q G A+G Q+G +GYQ GL +G +Q LA+A+ Q AP+ A Q +V++FQ TLD LDSV
Sbjct: 56 YQAGIAEGRQQGHKQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSV 115

Query: 126 IASRLVQIALAAAKQIIGQPAICDGTALLAQIQQMIQQEPMFAGKTQLRVNPDDLAIVEQ 185
IASRL+Q+AL AA+Q+IGQ D +AL+ QIQQ++QQEP+F+GK QLRV+PDDL V+
Sbjct: 116 IASRLMQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDD 175

Query: 186 RLGSTLSLHGWRLLGDSQIHAGGCKVSAEEGDLDASLATRWHELCRLAAPGEL 238
LG+TLSLHGWRL GD +H GGCKVSA+EGDLDAS+ATRW ELCRLAAPG +
Sbjct: 176 MLGATLSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQELCRLAAPGVV 228


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2013  FLGMOTORFLIG  314    e-108    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 314 bits (806), Expect = e-108
Identities = 113/327 (34%), Positives = 192/327 (58%), Gaps = 2/327 (0%)

Query: 2 SLTGTEKSAIMLMTLGEDHAAEVFKHLSSREVQQLSTTMASMRQVSHQQLVDVLAEFEDD 61
+LTG +K+AI+L+++G + +++VFK+LS E++ L+ +A + ++ + +VL EF++
Sbjct: 14 ALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFKEL 73

Query: 62 AEQYAALSVNASDYLRSVLIKALGEERASSLLEDILESRETTSGMETLNFMEPQMAADLI 121
+ DY R +L K+LG ++A ++ + L S + E + +P + I
Sbjct: 74 MMAQEFIQKGGIDYARELLEKSLGTQKAVDIINN-LGSALQSRPFEFVRRADPANILNFI 132

Query: 122 RDEHPQIIATILVHLKRAQAADILALFDERLRNDVMLRIATFGGVQPAALAELTEVLNNL 181
+ EHPQ IA IL +L +A+ IL+ ++ +V RIA P + E+ VL
Sbjct: 133 QQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLEKK 192

Query: 182 LDGQ-NLKRSKMGGIRTAAEIINLMKTQQEETVMDAVREYDGELAQKIIDEMFLFENLVS 240
L + + GG+ EIIN+ + E+ +++++ E D ELA++I +MF+FE++V
Sbjct: 193 LASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDIVL 252

Query: 241 VDDRSIQRLLQEIDNESLLIALKGADQALRERFLSNMSLRAAEILRDDLATRGPVRMSLV 300
+DDRSIQR+L+EID + L ALK D ++E+ NMS RAA +L++D+ GP R V
Sbjct: 253 LDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRKDV 312

Query: 301 ENEQKSILLIVRRLAESGEIVIGGGED 327
E Q+ I+ ++R+L E GEIVI G +
Sbjct: 313 EESQQKIVSLIRKLEEQGEIVISRGGE 339


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2014  FLGMRINGFLIF  577    0.0      Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 577 bits (1488), Expect = 0.0
Identities = 354/552 (64%), Positives = 443/552 (80%), Gaps = 9/552 (1%)

Query: 19 LARLRANPKIPLLIAAAAAIAIIVALMLWAKSPDYRVLYSNLSDRDGGDIVTQLTQLNIP 78
L RLRANP+IPL++A +AA+AI+VA++LWAK+PDYR L+SNLSD+DGG IV QLTQ+NIP
Sbjct: 16 LNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNIP 75

Query: 79 YRFADNGGALLIPAEKVHETRLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQINYQRAL 138
YRFA+ GA+ +PA+KVHE RLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQ+NYQRAL
Sbjct: 76 YRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQVNYQRAL 135

Query: 139 EGELSRTIGTLGPVLNVRVHLAMPKPSLFVREQKSPTASVTLALQPGRALDDGQINAIVY 198
EGEL+RTI TLGPV + RVHLAMPKPSLFVREQKSP+ASVT+ L+PGRALD+GQI+A+V+
Sbjct: 136 EGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQISAVVH 195

Query: 199 MVSSSVAGLPPGNVTVVDQTGRLLTQSDSAGRDLNASQLKFTSEVENRYQRRIENILAPM 258
+VSS+VAGLPPGNVT+VDQ+G LLTQS+++GRDLN +QLKF ++VE+R QRRIE IL+P+
Sbjct: 196 LVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLNDAQLKFANDVESRIQRRIEAILSPI 255

Query: 259 VGNGNVHAQVTAQVDFASREQTDEEYKPNQAANQGAVRSQQVSTSEQLGGTNVGGVPGAL 318
VGNGNVHAQVTAQ+DFA++EQT+E Y PN A++ +RS+Q++ SEQ+G GGVPGAL
Sbjct: 256 VGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPGGVPGAL 315

Query: 319 SNQPPVAPIAPIEIPQPAGAAANNAAPANTAATANANTTATAAKASSSNSRHDQTTNFEV 378
SNQP API P A N +T+ +N+ A +++ ++T+N+EV
Sbjct: 316 SNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNS--------AGPRSTQRNETSNYEV 367

Query: 379 DRTIRHTQQQAGMVQRLSVAVVVNYTSDKAGKPIALSKDQLAQVESLTREAMGFSTVRGD 438
DRTIRHT+ G ++RLSVAVVVNY + GKP+ L+ DQ+ Q+E LTREAMGFS RGD
Sbjct: 368 DRTIRHTKMNVGDIERLSVAVVVNYKTLADGKPLPLTADQMKQIEDLTREAMGFSDKRGD 427

Query: 439 TLNVVNTPFTASDDTRGSSLPFWQQQSFFDQLLNAGRYLLILLVAWILWRKLLRPMLAKK 498
TLNVVN+PF+A D+T G LPFWQQQSF DQLL AGR+LL+L+VAWILWRK +RP L ++
Sbjct: 428 TLNVVNSPFSAVDNT-GGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVRPQLTRR 486

Query: 499 QVADKAAASVNNIVQTAQAAETVKQSKEELALRKKNQQRVSAEVQAQRIRELADKDPRVV 558
KAA + Q + A V+ SK+E +++ QR+ AEV +QRIRE++D DPRVV
Sbjct: 487 VEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPRVV 546

Query: 559 ALVIRQWMSNDQ 570
ALVIRQWMSND
Sbjct: 547 ALVIRQWMSNDH 558


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2015  FLGHOOKFLIE   80     2e-23    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.

Length = 103

Score = 80.5 bits (198), Expect = 2e-23
Identities = 59/102 (57%), Positives = 73/102 (71%)

Query: 2 SVQGIEGVLQQLQVTALQASGSAKTLPAEAGFASELKAAIGKISENQQVARTSAQNFELG 61
++QGIEGV+ QLQ TA+ A FA +L AA+ +IS+ Q ART A+ F LG
Sbjct: 2 AIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTLG 61

Query: 62 VPGVGLNDVMVNAQKSSVSLQLGIQVRNKLVAAYQEVMNMGV 103
PGV LNDVM + QK+SVS+Q+GIQVRNKLVAAYQEVM+M V
Sbjct: 62 EPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103


65  YpAngola_A2022  YpAngola_A2026  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A2022  4       17       0.477870   flagellar capping protein
YpAngola_A2023  5       22       0.514706   flagellin
YpAngola_A2024  1       17       1.061614   transposase/IS protein
YpAngola_A2025  0       15       0.275371   insertion sequence transposase
YpAngola_A2026  1       16       -0.838327  phase-1 flagellin
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2022  ACRIFLAVINRP  29     0.049    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 29.0 bits (65), Expect = 0.049
Identities = 20/121 (16%), Positives = 38/121 (31%), Gaps = 11/121 (9%)

Query: 32 PLTTQQTSYKSKLTAYGVLQSALAKLETASTALKKADTLNSTAVSGSNSAFSATTDSAAS 91
P + +Y A V + +E + ++ST+ S + + T S
Sbjct: 41 PAVSVSANY-PGADAQTVQDTVTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSG-- 97

Query: 92 AGTYSIEVTNLAKAQSLLSADVPSATDKLGSSDATRTITITQPGQKEPMKISLTSEQTSL 151
T+ AQ + + AT L + I++ + M S+
Sbjct: 98 --------TDPDIAQVQVQNKLQLATPLLPQEVQQQGISVEKSSSSYLMVAGFVSDNPGT 149

Query: 152 T 152
T
Sbjct: 150 T 150


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2023  FLAGELLIN     142    6e-43    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 142 bits (359), Expect = 6e-43
Identities = 133/156 (85%), Positives = 144/156 (92%)

Query: 3 VINTNSLSLLTQNNLNKSQSSLGTAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQ 62
VINTNSLSLLTQNNLNKSQSSL +AIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQ
Sbjct: 3 VINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQ 62

Query: 63 AARNANDGISIAQTTEGSLNEINNNLQRVRELTVQAQNGSNSSSDLDSIQDEISLRLAEI 122
A+RNANDGISIAQTTEG+LNEINNNLQRVREL+VQA NG+NS SDL SIQDEI RL EI
Sbjct: 63 ASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEI 122

Query: 123 DRVSDQTQFNGKKVLAENTTMSIQVGANDGETIDVN 158
DRVS+QTQFNG KVL+++ M IQVGANDGETI ++
Sbjct: 123 DRVSNQTQFNGVKVLSQDNQMKIQVGANDGETITID 158


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2025  HTHTETR       28     0.047    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.1 bits (62), Expect = 0.047
Identities = 12/106 (11%), Positives = 31/106 (29%), Gaps = 14/106 (13%)

Query: 16 QGMSSRAIARELGISRNTVKRYLQAKSEPPKYTPRPAVASLLDEYRDYIRQRIAD----- 70
S IA+ G++R + + + KS+ + + + I + +
Sbjct: 30 SSTSLGEIAKAAGVTRGAIYWHFKDKSD--------LFSEIWELSESNIGELELEYQAKF 81

Query: 71 -AHPYKIPATVIAREIRDQGYRGGMTILRAFIRSLSVPQEQEPAVR 115
P + ++ + +L I + V+
Sbjct: 82 PGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2026  FLAGELLIN     106    6e-29    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 106 bits (266), Expect = 6e-29
Identities = 85/216 (39%), Positives = 113/216 (52%), Gaps = 8/216 (3%)

Query: 9 VTIDINLQKIDSKSLGLGSYSVSGVSGALTSLTDTSVTGVTTTTALDFSDISTFAKGATV 68
V+ IN +K+ + + + + + L S + + V D + AK + +
Sbjct: 299 VSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDL 358

Query: 69 HGIGDVGTDGAYADGYVIRTTDGKQYKGEVDATNGKVTFADDANGDPIDDATKLEAAAQF 128
G T +G +Y + + L
Sbjct: 359 -------EANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAA 411

Query: 129 SPAGKATASPLETLDDAIKQVDGLRSSLGAVQNRFESAVTNLNNTVTNLTSARSRIEDAD 188
+ PL ++D A+ +VD +RSSLGA+QNRF+SA+TNL NTVTNL SARSRIEDAD
Sbjct: 412 AAKKSTAN-PLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDAD 470

Query: 189 YATEVSNMSRAQILQQAGTSVLSQANQVPQTVLSLL 224
YATEVSNMS+AQILQQAGTSVL+QANQVPQ VLSLL
Sbjct: 471 YATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLL 506


66  YpAngola_A2537  YpAngola_A2543  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A2537  -3      14       0.874608   phage shock protein PspA
YpAngola_A2538  -2      14       0.963869   phage shock protein operon transcriptional
YpAngola_A2539  -1      13       0.656671   IS1541 transposase
YpAngola_A2540  -1      13       0.869866   peptide transport periplasmic protein
YpAngola_A2541  0       13       1.323834   peptide transport system permease
YpAngola_A2542  -1      13       0.789984   peptide transport system permease
YpAngola_A2543  -1      13       -0.002596  peptide transport system ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2537  cloacin       30     0.006    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 30.5 bits (68), Expect = 0.006
Identities = 33/146 (22%), Positives = 54/146 (36%), Gaps = 29/146 (19%)

Query: 56 QLLRRIDHSESQQQEWQ------------EKAELALRKDKEDLARAALIEKQ-KVMTLVE 102
Q+ +R D +QQEW E+A L + ED+AR E+Q K + +
Sbjct: 295 QVKQRQDEENRRQQEWDATHPVEAAERNYERARAELNQANEDVAR--NQERQAKAVQVYN 352

Query: 103 TLKREVATVDETLSRMKHEITELENKLTETRA--------------RQQALTLRHQAASS 148
+ K E+ ++TL+ EI + + A R Q QAA
Sbjct: 353 SRKSELDAANKTLADAIAEIKQFNRFAHDPMAGGHRMWQMAGLKAQRAQTDVNNKQAAFD 412

Query: 149 SRDVRRQLDSGKLDEAMARFEQFERR 174
+ + L AM ++ E +
Sbjct: 413 AAAKEKSDADAALSSAMESRKKKEDK 438


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2538  HTHFIS        346    e-119    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 346 bits (890), Expect = e-119
Identities = 124/344 (36%), Positives = 176/344 (51%), Gaps = 19/344 (5%)

Query: 3 EQLDNLLGEANAFVDVLEQVSGLAKLNKPVLVIGERGTGKELIAHRLHYLSERWQGPFIS 62
+ L+G + A ++ ++ L + + +++ GE GTGKEL+A LH +R GPF++
Sbjct: 134 QDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVA 193

Query: 63 LNCAALNENLLDSELFGHEAGAFTGAQKRHLGRFERADGGTLFLDELATAPMLVQEKLLR 122
+N AA+ +L++SELFGHE GAFTGAQ R GRFE+A+GGTLFLDE+ PM Q +LLR
Sbjct: 194 INMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLR 253

Query: 123 VIEYGHLERVGGSQPLQVDVRLVCATNDNLPALAAAGKFRADLLDRLAFDVVQLPPLRER 182
V++ G VGG P++ DVR+V ATN +L G FR DL RL ++LPPLR+R
Sbjct: 254 VLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDR 313

Query: 183 QQDIMLLAEHFAILMCRELGLPLFSGFTATAKEQLLEYRWPGNVRELKNVVERSV----- 237
+DI L HF +E F A E + + WPGNVREL+N+V R
Sbjct: 314 AEDIPDLVRHFVQQAEKEGLDVK--RFDQEALELMKAHPWPGNVRELENLVRRLTALYPQ 371

Query: 238 -----------YRHSDSSLPLNNIIINPFASNQKGEIEGVDTPNEGGAVLPALPVD-LKH 285
R P+ + + +E P
Sbjct: 372 DVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDR 431

Query: 286 WLHTSEHQMLTRALKQARFNQRKAAHLLGLTYHQLRGLLKKHTI 329
L E+ ++ AL R NQ KAA LLGL + LR +++ +
Sbjct: 432 VLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2542  TATBPROTEIN   32     0.002    Bacterial sec-independent translocation TatB protein...
		>TATBPROTEIN#Bacterial sec-independent translocation TatB protein signature.

Length = 171

Score = 31.5 bits (71), Expect = 0.002
Identities = 15/46 (32%), Positives = 25/46 (54%), Gaps = 3/46 (6%)

Query: 144 LLLAIIVVAFVGPS-LEHAMFAVWLALLPRMVRTIYSAVHDELDKE 188
LL+ II + +GP L A +A R +R++ + V +EL +E
Sbjct: 10 LLVFIIGLVVLGPQRLPVA--VKTVAGWIRALRSLATTVQNELTQE 53


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2543  HTHFIS        31     0.006    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 31.3 bits (71), Expect = 0.006
Identities = 9/16 (56%), Positives = 14/16 (87%)

Query: 38 LVGESGSGKSLIAKAI 53
+ GESG+GK L+A+A+
Sbjct: 165 ITGESGTGKELVARAL 180


67  YpAngola_A2610  YpAngola_A2617  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A2610  -2      15       -2.415719  bifunctional UDP-glucuronic acid
YpAngola_A2611  -1      16       -1.698927  undecaprenyl phosphate
YpAngola_A2612  2       17       -1.550799  UDP-4-amino-4-deoxy-L-arabinose--oxoglutarate
YpAngola_A2613  0       18       -2.076289  IS1541 transposase
YpAngola_A2614  0       16       -1.862567  vitamin B12-transporter ATPase
YpAngola_A2615  -1      19       -1.240320  putative glutathione peroxidase
YpAngola_A2616  -1      19       -0.943866  vitamin B12-transporter permease
YpAngola_A2617  0       18       -0.905943  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2610  NUCEPIMERASE  102    7e-26    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 102 bits (256), Expect = 7e-26
Identities = 74/361 (20%), Positives = 138/361 (38%), Gaps = 60/361 (16%)

Query: 317 RVLILGVNGFIGNHLTERLLQDDRYEVYGLDIGSD--------AISRFLGNPAFHFVEGD 368
+ L+ G GFIG H+++RLL+ ++V G+D +D A L P F F + D
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAG-HQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 369 ISIHSEWIE--YHIKKCDVILPLVAIATPIEYT-RNPLRVFELDFEENLKIVRDCVKYN- 424
++ E + + + + + Y+ NP + + L I+ C
Sbjct: 61 LADR-EGMTDLFASGHFERVFISPHRLA-VRYSLENPHAYADSNLTGFLNILEGCRHNKI 118

Query: 425 KRIVFPSTSEVYGMCDDKEFDEDTSRLIVGPINKQRWIYSVSKQLLDRVIWAYGVKEGLK 484
+ +++ S+S VYG+ F D ++ +Y+ +K+ + + Y GL
Sbjct: 119 QHLLYASSSSVYGLNRKMPFSTD------DSVDHPVSLYAATKKANELMAHTYSHLYGLP 172

Query: 485 FTLFRPFNWMGPRLDNLDAARIGSSRAITQLILNLVEGSPIKLVDGGAQKRCFTDIHDGI 544
T R F GP D A ++A+ +EG I + + G KR FT I D
Sbjct: 173 ATGLRFFTVYGPWGRP-DMALFKFTKAM-------LEGKSIDVYNYGKMKRDFTYIDDIA 224

Query: 545 EALFRIIEN---------------RDGCCDGRIINIGNPTNEASIRELAEMLLTSFENHE 589
EA+ R+ + R+ NIGN ++ + + + L +
Sbjct: 225 EAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGN-SSPVELMDYIQALEDALGIEA 283

Query: 590 LRDHFPPFAGFKDIESSAYYGKGYQDVEYRTPSIKNARRILHWQPEIAMQQTVTETLDFF 649
++ P G DV + K ++ + PE ++ V ++++
Sbjct: 284 KKNMLPLQPG---------------DVLETSADTKALYEVIGFTPETTVKDGVKNFVNWY 328

Query: 650 L 650

Sbjct: 329 R 329


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2611  PREPILNPTASE  32     0.002    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 32.5 bits (74), Expect = 0.002
Identities = 25/90 (27%), Positives = 36/90 (40%), Gaps = 4/90 (4%)

Query: 220 LMYDLITCLTTTPLRLLSLVGSAIALLGF-TFSVLLVALRLIFGPEWAGGGVFTLFAVLF 278
L L+ L + L V A+A G+ L A +L+ G E G G F L A L
Sbjct: 166 LWGGLLFNLLGGFVSLGDAVIGAMA--GYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALG 223

Query: 279 MFIGAQFV-GMGLLGEYIGRIYNDVRARPR 307
++G Q + + LL +G R
Sbjct: 224 AWLGWQALPIVLLLSSLVGAFMGIGLILLR 253


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2614  PF05272       32     0.003    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 31.6 bits (71), Expect = 0.003
Identities = 17/66 (25%), Positives = 27/66 (40%), Gaps = 10/66 (15%)

Query: 29 LIGPNGAGKSTLLASLAGL------LPASGEIVLAGKSLQHYEGHELAR----QRAYLSQ 78
L G G GKSTL+ +L GL G + + + +EL+ +RA
Sbjct: 601 LEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELSEMTAFRRADAEA 660

Query: 79 QQSALS 84
++ S
Sbjct: 661 VKAFFS 666


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2617  OUTRSURFACE   30     0.004    Outer surface protein signature.
		>OUTRSURFACE#Outer surface protein signature.

Length = 273

Score = 29.5 bits (66), Expect = 0.004
Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 8/52 (15%)

Query: 1 MKKYLLLFGVLSFMPLIAQSDVSLD------INMPGIN--LHLGDQDKRGYY 44
MKKYLL G++ + Q+ SLD +++PG L ++DK G Y
Sbjct: 1 MKKYLLGIGLILALIACKQNVSSLDEKNSASVDLPGEMKVLVSKEKDKDGKY 52


68YpAngola_A2691YpAngola_A2700N        Y        YPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
YpAngola_A2691012-2.263154outer membrane usher protein
YpAngola_A2692-110-1.580037IS285 transposase
YpAngola_A2693-210-1.091751fimbrial usher protein
YpAngola_A2694-110-0.358908pili assembly chaperone
YpAngola_A2695-110-0.031493fimbrial protein
YpAngola_A2696-2110.015980Clp ATPase
YpAngola_A26970150.081683hypothetical protein
YpAngola_A2698014-0.957031hypothetical protein
YpAngola_A2699-113-2.355158ImpA domain-containing protein
YpAngola_A2700-113-4.304789hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
YpAngola_A2691  PF00577  118    3e-34    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 118 bits (298), Expect = 3e-34
Identities = 44/129 (34%), Positives = 72/129 (55%), Gaps = 2/129 (1%)

Query: 2 LLYQGVSRFDFSAGQL-NDSSINHNPAIVQGAYHYGLGNTYTLYGGAQVAENYRSVAIGN 60
L +G +R+ +AG+ + ++ P Q +GL +T+YGG Q+A+ YR+ G
Sbjct: 368 LQREGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGI 427

Query: 61 AFNT-PLGGVSMDITHAKSELAGDRRSSGNSYKIDYSKYVGETDTNLTLAAYRYSSGGYY 119
N LG +S+D+T A S L D + G S + Y+K + E+ TN+ L YRYS+ GY+
Sbjct: 428 GKNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYF 487

Query: 120 SFREASLDR 128
+F + + R
Sbjct: 488 NFADTTYSR 496


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
YpAngola_A2693  PF00577  294    5e-95    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 294 bits (754), Expect = 5e-95
Identities = 106/323 (32%), Positives = 160/323 (49%), Gaps = 21/323 (6%)

Query: 4 VEFNADFIHGGG---VDVMRFMHENPVAPGVYDVTVIINGKNRGKHRIRFELSEGESTAE 60
+ FN F+ D+ RF + + PG Y V + +N + F + E
Sbjct: 47 LYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYLNNGYMATRDVTFNTGDSEQGIV 106

Query: 61 PCFTLEQLDSIGLKIETSDTDLLVNGKAAPKDQCYNLRALIKDSHVNYNSGDLELSLTVP 120
PC T QL S+GL +T + D C L ++I D+ + G L+LT+P
Sbjct: 107 PCLTRAQLASMGL-----NTASVSGMNLLADDACVPLTSMIHDATAQLDVGQQRLNLTIP 161

Query: 121 QFNLVHHPRGYIDSSLWDAGGTVGFLDYNSNVYSIFNGRSNSDVGSDNSNSYNSNIGLSA 180
Q + + RGYI LWD G G L+YN + + +G ++ +Y + L +
Sbjct: 162 QAFMSNRARGYIPPELWDPGINAGLLNYNFSGN-----SVQNRIGGNSHYAY---LNLQS 213

Query: 181 GINLGEWRFRKRLNTTWSNSSG-----MHTQNLYGYAATDITALKSQLTIGDTNTQGSLF 235
G+N+G WR R ++++S Q++ + DI L+S+LT+GD TQG +F
Sbjct: 214 GLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLTLGDGYTQGDIF 273

Query: 236 DSYALRGVLLASDTRMLPEGIRNYSPIVRGIAETNARVTVTQRGQIIYETVVTPGAFELT 295
D RG LASD MLP+ R ++P++ GIA A+VT+ Q G IY + V PG F +
Sbjct: 274 DGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNSTVPPGPFTIN 333

Query: 296 DIGTMSYGGDLQMTITESDGRTR 318
DI GDLQ+TI E+DG T+
Sbjct: 334 DIYAAGNSGDLQVTIKEADGSTQ 356


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2695  FIMBRIALPAPE  33     4e-04    Escherichia coli: P pili tip fibrillum papE protein...
		>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature.

Length = 173

Score = 32.7 bits (74), Expect = 4e-04
Identities = 32/113 (28%), Positives = 49/113 (43%), Gaps = 18/113 (15%)

Query: 1 MRKLNLAVCAVALSVISSTSYAAAGGTVTFNGKLIADTCQVDTASENITVTLPTLSIQSL 60
M+K+ V L + + + A +TF GKLI C V +N V + IQ+L
Sbjct: 1 MKKIRGLCLPVMLGAVLMSQHVHAADNLTFKGKLIIPACTV----QNAEVNWGDIEIQNL 56

Query: 61 AVAEAQDGS--KDFEIKVLDCP-------ATLTQVGAHFNAIDSSGVNPATGN 104
Q G KDF + ++CP T+T G N+I + A+G+
Sbjct: 57 ----VQSGGNQKDFTVD-MNCPYSLGTMKVTITSNGQTGNSILVPNTSTASGD 104


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
YpAngola_A2696  HTHFIS  33     0.007    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 32.9 bits (75), Expect = 0.007
Identities = 35/164 (21%), Positives = 57/164 (34%), Gaps = 32/164 (19%)

Query: 576 DDIRAVMELPQRLEAR----------VIGQPHALMQLGENIMTARAGLSDPRKPLGVFML 625
I + P+R ++ ++G+ A+ ++ + AR +D L + M+
Sbjct: 113 GIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVL--ARLMQTD----LTL-MI 165

Query: 626 VGPSGVGKTETALAIAESMYGGEQNMITINMSEYQESHTVSSLKGSPPGYVGYGEGGVLT 685
G SG GK A A+ + + INM+ S L G E G T
Sbjct: 166 TGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGH--------EKGAFT 217

Query: 686 EAVRRKPYSV-------VLLDEIEKAHSDVHELFFQVFDKGQME 722
A R + LDEI D +V +G+
Sbjct: 218 GAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYT 261


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2700  FIMBRIALPAPE  31     0.003    Escherichia coli: P pili tip fibrillum papE protein...
		>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature.

Length = 173

Score = 31.2 bits (70), Expect = 0.003
Identities = 21/83 (25%), Positives = 39/83 (46%), Gaps = 7/83 (8%)

Query: 200 PSCTFDGPQKVDFGIVTSSNL-NNGGIERDLDFNITCKTDYGHYSATAAIFTQTSSADNN 258
P+CT + V++G + NL +GG ++D ++ C G T + ++ N
Sbjct: 37 PACTVQNAE-VNWGDIEIQNLVQSGGNQKDFTVDMNCPYSLG----TMKVTITSNGQTGN 91

Query: 259 YIKVKDSQN-QEDRLLIKISDTN 280
I V ++ D LLI + ++N
Sbjct: 92 SILVPNTSTASGDGLLIYLYNSN 114


69  YpAngola_A2749  YpAngola_A2754  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
YpAngola_A2749011-1.158500phosphoenolpyruvate-protein phosphotransferase
YpAngola_A2750-110-0.195800PTS system glucose-specific transporter
YpAngola_A2751013-1.221392IS1541 transposase
YpAngola_A2752015-2.556932sensor histidine kinase CpxA
YpAngola_A2753116-4.020010transcriptional regulatory protein CpxR
YpAngola_A2754014-3.869530RND efflux transporter
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2749  PHPHTRNFRASE  750    0.0      Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 750 bits (1939), Expect = 0.0
Identities = 278/571 (48%), Positives = 392/571 (68%), Gaps = 2/571 (0%)

Query: 1 MISGILVSPGIAFGKALLLKEDEIVINRKKISADQVEQEVERFKAGRAKAAEQLEAIKTK 60
I+GI S G+A KA + E + I + I V E+E+ A K+ E+L AIK +
Sbjct: 4 KITGIAASSGVAIAKAFIHLEPNVDIEKTSI--TDVSTEIEKLTAALEKSKEELRAIKDQ 61

Query: 61 AGVSLGEEKAAIFEGHIMLLEDEELEQEIIALIKDEHASADAAAYSVIEGQAKALEELDD 120
S+G +KA IF H+++L+D EL I I++E +A+ A V + E +D+
Sbjct: 62 TEASMGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDN 121

Query: 121 EYLKERAADVRDIGKRLLKNILGLNIVDLSAIQDEVILVATDLTPSETAQLNLDKVLGFI 180
EY+KERAAD+RD+ KR+L +++G+ L+ I +E +++A DLTPS+TAQLN V GF
Sbjct: 122 EYMKERAADIRDVSKRVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFA 181

Query: 181 TDIGGRTSHTSIMARSLELPAIVGTSNVTKQVKNDDYLILDAVNNKVYLNPTADVIEQLK 240
TDIGGRTSH++IM+RSLE+PA+VGT VT+++++ D +I+D + V +NPT + ++ +
Sbjct: 182 TDIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYE 241

Query: 241 AVKNQYITEKNELAKLKDLPAITLDGHQVEVVANIGTVRDIAGAERNGAEGVGLYRTEFL 300
+ + +K E AKL P+ T DG VE+ ANIGT +D+ G NG EG+GLYRTEFL
Sbjct: 242 EKRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFL 301

Query: 301 FMDRDSLPTEEEQFQAYKAVAEAMGSQAVIVRTMDIGGDKDLPYMNLPKEENPFLGWRAI 360
+MDRD LPTEEEQF+AYK V + M + V++RT+DIGGDK+L Y+ LPKE NPFLG+RAI
Sbjct: 302 YMDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTLDIGGDKELSYLQLPKELNPFLGFRAI 361

Query: 361 RIAMDRKEILHAQLRAILRASAFGKLRIMFPMIISVEEVRELKAELELLKSQLREENKAF 420
R+ +++++I QLRA+LRAS +G L++MFPMI ++EE+R+ KA ++ K +L E
Sbjct: 362 RLCLEKQDIFRTQLRALLRASTYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGVDV 421

Query: 421 DETIEVGVMVETPAAAVIARHLAKEVDFFSIGTNDLTQYTLAVDRGNELISHLYNPMSPS 480
++IEVG+MVE P+ AV A AKEVDFFSIGTNDL QYT+A DR NE +S+LY P P+
Sbjct: 422 SDSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPYHPA 481

Query: 481 VLGLIKQVIDASHAEGKWTGMCGELAGDERATLLLLGMGLDEFSMSAISIPRIKKIIRNT 540
+L L+ VI A+H+EGKW GMCGE+AGDE A LLLG+GLDEFSMSA SI + +
Sbjct: 482 ILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQLLKL 541

Query: 541 NFEDVKVLAEQALAQPTAKELMDLVTTFIEE 571
+ E++K A++AL TA+E+ LV +
Sbjct: 542 SKEELKPFAQKALMLDTAEEVEQLVKKTYLK 572


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
YpAngola_A2752  PF06580  38     5e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 38.3 bits (89), Expect = 5e-05
Identities = 41/231 (17%), Positives = 83/231 (35%), Gaps = 59/231 (25%)

Query: 239 ELRSPLARLQLAIGLAHQNPDNVDNAL----QRIEHESERLDKMIGEL-------LALSR 287
++ S QL A NP + NAL I + + +M+ L L S
Sbjct: 153 KMASMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKAREMLTSLSELMRYSLRYSN 212

Query: 288 AENHSLADD----DEYFDLQEL-------VKVVVNDARYEAQLPGVEIQLEVAAQSEYTV 336
A SLAD+ D Y L + + +N A + Q+P + +Q V
Sbjct: 213 ARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLV-------- 264

Query: 337 KGNAELMRRAIENIVRNALRFSASGQQVKVTLSALDKRYQIQVADQGPGVEENKLSSIFD 396
EN +++ + G ++ + + + ++V + G +N
Sbjct: 265 -----------ENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN------- 306

Query: 397 PFVRVKSAMSGKGYGLGLAITHK-VILAHGGQVEAR-NGEQGGLVITLRVP 445
+ + G GL + + + +G + + + + +QG + + +P
Sbjct: 307 ---------TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
YpAngola_A2753  HTHFIS  89     1e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.1 bits (221), Expect = 1e-22
Identities = 32/122 (26%), Positives = 63/122 (51%), Gaps = 1/122 (0%)

Query: 2 KILLVDDDLELGTMLKEYLGGEGFTAKHVLTGKAGIDGALSGDYTALILDIMLPDMSGID 61
IL+ DDD + T+L + L G+ + +GD ++ D+++PD + D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VLRQVRK-KSRLPIIMLTAKGDNIDRVIGLEMGADDYMPKPCYPRELVARLRAVLRRFEE 120
+L +++K + LP+++++A+ + + E GA DY+PKP EL+ + L +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 121 QP 122
+P
Sbjct: 125 RP 126


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
YpAngola_A2754  RTXTOXIND  60     6e-12    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 60.2 bits (146), Expect = 6e-12
Identities = 40/259 (15%), Positives = 87/259 (33%), Gaps = 25/259 (9%)

Query: 27 FRWISPPDKPSYITAVAEIRDLEQTVLADGTIKAQKQVTVGAQVSGQIKALHVTLGQQVE 86
F+ +S + + + E Q + K+ V +I +
Sbjct: 176 FQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKS 235

Query: 87 KNQLVAEI--DDLAQQNALKDAEEALKNVQAQRAAKIA--TQKNNQLTYQRQQQILAKGV 142
+ + + ++A+ + E + + Q +++ +++ L
Sbjct: 236 RLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQL---- 291

Query: 143 GVRADFDS-IKATLEATQAEISALDAQIAQAEIAVSTAKLNLGYTKISSPIAGTVVAIPV 201
V F + I L T I L ++A+ E + I +P++ V + V
Sbjct: 292 -VTQLFKNEILDKLRQTTDNIGLLTLELAKNE-------ERQQASVIRAPVSVKVQQLKV 343

Query: 202 -EEGQTVNAVQSAPTIIKVAQLDTMTVEAQISEADVVKVKTGMPVYFTILGEPEKRF--- 257
EG V + ++ V + DT+ V A + D+ + G + P R+
Sbjct: 344 HTEGGVVTTAE--TLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYL 401

Query: 258 SATLRAIEPAPDSINDETT 276
++ I D+I D+
Sbjct: 402 VGKVKNI--NLDAIEDQRL 418



Score = 48.7 bits (116), Expect = 3e-08
Identities = 17/167 (10%), Positives = 57/167 (34%), Gaps = 17/167 (10%)

Query: 10 RLIGWVVLLLIIGGLLFFRWISPPDKPSYITAVAEIRDLEQTVLADGTIKAQKQV-TVGA 68
RL+ + ++ ++ + + + +E A+G + + +
Sbjct: 58 RLVAYFIMGFLVIAFI---L-------------SVLGQVEIVATANGKLTHSGRSKEIKP 101

Query: 69 QVSGQIKALHVTLGQQVEKNQLVAEIDDLAQQNALKDAEEALKNVQAQRAAKIATQKNNQ 128
+ +K + V G+ V K ++ ++ L + + +L + ++ ++ +
Sbjct: 102 IENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIE 161

Query: 129 LTYQRQQQILAKGVGVRADFDSIKATLEATQAEISALDAQIAQAEIA 175
L + ++ + + + + + S Q Q E+
Sbjct: 162 LNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELN 208


70  YpAngola_A2794  YpAngola_A2806  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
YpAngola_A27940131.362741nitrate/nitrite response regulator protein NarP
YpAngola_A27950151.717827hypothetical protein
YpAngola_A27961151.661717aminoglycoside/multidrug efflux system
YpAngola_A27975300.881559insertion sequence transposase
YpAngola_A2798535-0.305864transposase/IS protein
YpAngola_A2803540-0.575268****elongation factor Tu
YpAngola_A2804842-0.495177preprotein translocase subunit SecE
YpAngola_A28057400.115806transcription antitermination protein NusG
YpAngola_A28067431.02445450S ribosomal protein L11
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
YpAngola_A2794  HTHFIS  73     2e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 73.3 bits (180), Expect = 2e-17
Identities = 29/122 (23%), Positives = 56/122 (45%), Gaps = 5/122 (4%)

Query: 1 MTKSHTILIVDDHPLMRRGIKQLLGLDSRFDVVAEANNGSDAITEAAKFQPDVILLDLNM 60
MT + TIL+ DD +R + Q L + +DV +N + A D+++ D+ M
Sbjct: 1 MTGA-TILVADDDAAIRTVLNQALS-RAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVM 57

Query: 61 KGMSGLDTLKALRHNGSDARIIILTV-SDARSDVYAMIDAGADGYLLKDCEPEILLENIR 119
+ D L ++ D +++++ + + + A + GA YL K + L+ I
Sbjct: 58 PDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKAS-EKGAYDYLPKPFDLTELIGIIG 116

Query: 120 QA 121
+A
Sbjct: 117 RA 118


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2796  ACRIFLAVINRP  1304   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1304 bits (3377), Expect = 0.0
Identities = 672/1034 (64%), Positives = 819/1034 (79%), Gaps = 2/1034 (0%)

Query: 43 MANFFIDRPIFAWVLAIILCLTGALAISTLPVEQYPNLAPPNVRISASYPGASAQTLENT 102
MANFFI RPIFAWVLAIIL + GALAI LPV QYP +APP V +SA+YPGA AQT+++T
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 103 VTQVIEQSMTGLDNLLYMSSQSSNSGSASVTLTFQAGTNPNEAMQQVQNQLQSAIKRLPQ 162
VTQVIEQ+M G+DNL+YMSS S ++GS ++TLTFQ+GT+P+ A QVQN+LQ A LPQ
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 163 DVQQQGVSVSKSGDNTLMMVAFVSTDGSMDKQDISDYVASNLQDPLSRIEGVGSVDAFGS 222
+VQQQG+SV KS + LM+ FVS + + DISDYVASN++D LSR+ GVG V FG+
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 223 QYAMRIWLDPNKLTNYQLTTSDIVSAIQSQNTQVAVGQLGGTPAVDNQALNATINAQSQL 282
QYAMRIWLD + L Y+LT D+++ ++ QN Q+A GQLGGTPA+ Q LNA+I AQ++
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 283 QTPEEFREITLRVNQDGSLVTLGDVAKIELGSEKYDYLSRFNGQAASGMGIKLASGANEL 342
+ PEEF ++TLRVN DGS+V L DVA++ELG E Y+ ++R NG+ A+G+GIKLA+GAN L
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 343 QTDKRVKARLAELAPFFPHGLEAKIAYETTPFVQASIKDVVKTLLEAILLVFLVMYLFLQ 402
T K +KA+LAEL PFFP G++ Y+TTPFVQ SI +VVKTL EAI+LVFLVMYLFLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 403 NFRATLIPTVAVPVVLLGTFAVLSAFGFSINTLTMFAIVLAIGLLVDDAIVVVENVERVM 462
N RATLIPT+AVPVVLLGTFA+L+AFG+SINTLTMF +VLAIGLLVDDAIVVVENVERVM
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 463 SEEGLDPREATRKSMGQIQGALIGIALVLSAVFIPMAFFGGTTGAIYRQFSITIVSAMVL 522
E+ L P+EAT KSM QIQGAL+GIA+VLSAVFIPMAFFGG+TGAIYRQFSITIVSAM L
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 523 SVLVALILTPAMCATLLKPIAPGHHHAKRGFFGWFNRMFDRNSHRYERGVARVLHHSLRY 582
SVLVALILTPA+CATLLKP++ HH K GFFGWFN FD + + Y V ++L + RY
Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540

Query: 583 MLLYLLLLGGLALLFLKLPTSFLPLEDRGVFMAQVQLPVGSTQQQTLKVVEKVENYFLTE 642
+L+Y L++ G+ +LFL+LP+SFLP ED+GVF+ +QLP G+TQ++T KV+++V +Y+L
Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600

Query: 643 EKNNVLSVFATVGSGPGGNGQNVARLFIRLADWDQRTASTDSSFAIIERATKELSKIVEA 702
EK NV SVF G G QN F+ L W++R +S+ A+I RA EL KI +
Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660

Query: 703 KVSVSSPPAISGLGGSSGFDMELQDHGGHGHDKLMVARNQLLQMASQEPA-LTRVRHNGL 761
V + PAI LG ++GFD EL D G GHD L ARNQLL MA+Q PA L VR NGL
Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720

Query: 762 DDSPQLQIDIDQRKAQALGVSLNDINSTLKTAWGSTYVNDFVDRGRVKKVYVQSEATARM 821
+D+ Q ++++DQ KAQALGVSL+DIN T+ TA G TYVNDF+DRGRVKK+YVQ++A RM
Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780

Query: 822 LPEDVNKWYVRNKNGGMVPFSAFSTTRWEYGSPRLERYNGYSALEIVGEAASGVSTGTAM 881
LPEDV+K YVR+ NG MVPFSAF+T+ W YGSPRLERYNG ++EI GEAA G S+G AM
Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840

Query: 882 DVMEKLVSQLPNGFGLEWTGMSYQERLSGSQAPALYAISLLVVFLCLAALYESWSIPFSV 941
+ME L S+LP G G +WTGMSYQERLSG+QAPAL AIS +VVFLCLAALYESWSIP SV
Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900

Query: 942 MLVVPLGVIGAVAATWMRGLENDVYFQVGLLTIIGLSAKNAILIVEFANEL-NNRGKDLV 1000
MLVVPLG++G + A + +NDVYF VGLLT IGLSAKNAILIVEFA +L GK +V
Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960

Query: 1001 EATLEASRQRLRPILMTSLAFIFGVLPMAISQGAGSGSQHAVGTGVMGGMISATVLAIFF 1060
EATL A R RLRPILMTSLAFI GVLP+AIS GAGSG+Q+AVG GVMGGM+SAT+LAIFF
Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020

Query: 1061 VPLFFVLVRRRFPG 1074
VP+FFV++RR F G
Sbjct: 1021 VPVFFVVIRRCFKG 1034


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
YpAngola_A2797  HTHTETR  28     0.047    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.1 bits (62), Expect = 0.047
Identities = 12/106 (11%), Positives = 31/106 (29%), Gaps = 14/106 (13%)

Query: 16 QGMSSRAIARELGISRNTVKRYLQAKSEPPKYTPRPAVASLLDEYRDYIRQRIAD----- 70
S IA+ G++R + + + KS+ + + + I + +
Sbjct: 30 SSTSLGEIAKAAGVTRGAIYWHFKDKSD--------LFSEIWELSESNIGELELEYQAKF 81

Query: 71 -AHPYKIPATVIAREIRDQGYRGGMTILRAFIRSLSVPQEQEPAVR 115
P + ++ + +L I + V+
Sbjct: 82 PGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
YpAngola_A2803  TCRTETOQM  80     3e-18    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 79.5 bits (196), Expect = 3e-18
Identities = 53/154 (34%), Positives = 78/154 (50%), Gaps = 13/154 (8%)

Query: 13 VNVGTIGHVDHGKTTLTAAI------TTVLAKTYGGSARAFDQIDNAPEEKARGITINTS 66
+N+G + HVD GKTTLT ++ T L G+ R DN E+ RGITI T
Sbjct: 4 INIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRT----DNTLLERQRGITIQTG 59

Query: 67 HVEYDTPARHYAHVDCPGHADYVKNMITGAAQMDGAILVVAATDGPMPQTREHILLGRQV 126
+ +D PGH D++ + + +DGAIL+++A DG QTR R++
Sbjct: 60 ITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKM 119

Query: 127 GVPYIIVFLNKCDMVDDEELLELVEMEVRELLSQ 160
G+P I F+NK D + L V +++E LS
Sbjct: 120 GIP-TIFFINKIDQNGID--LSTVYQDIKEKLSA 150


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2804  SECETRNLCASE  161    7e-55    Bacterial translocase SecE signature.
		>SECETRNLCASE#Bacterial translocase SecE signature.

Length = 127

Score = 161 bits (410), Expect = 7e-55
Identities = 109/127 (85%), Positives = 116/127 (91%)

Query: 1 MSANTEAPGSGRGLETAKWLIVAVLLVVAIVGNYYYREYSLPLRALAVVVIIAVAGAVAL 60
MSANTEA GSGRGLE KW++V LL+VAIVGNY YR+ LPLRALAVV++IA AG VAL
Sbjct: 1 MSANTEAQGSGRGLEAMKWVVVVALLLVAIVGNYLYRDIMLPLRALAVVILIAAAGGVAL 60

Query: 61 MTAKGKATVAFAREARTEVRKVIWPTRQETLHTTLIVAAVTAVMSLILWGLDGILVRLVS 120
+T KGKATVAFAREARTEVRKVIWPTRQETLHTTLIVAAVTAVMSLILWGLDGILVRLVS
Sbjct: 61 LTTKGKATVAFAREARTEVRKVIWPTRQETLHTTLIVAAVTAVMSLILWGLDGILVRLVS 120

Query: 121 FITGLRF 127
FITGLRF
Sbjct: 121 FITGLRF 127


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2806  ACRIFLAVINRP  26     0.048    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 26.3 bits (58), Expect = 0.048
Identities = 14/71 (19%), Positives = 26/71 (36%), Gaps = 2/71 (2%)

Query: 4 KVQAYVKLQVAAGMANPSPPVGPALGQQ-GVNIMEFCKAFNAKTESIEKGLPIPVVITVY 62
+V+ + N P G + G N ++ KA AK ++ P + +
Sbjct: 267 RVELGGENYNVIARINGKPAAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYP 326

Query: 63 SDRSFTFVTKT 73
D + FV +
Sbjct: 327 YDTT-PFVQLS 336


71  YpAngola_A2823  YpAngola_A2827  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
YpAngola_A2823-113-0.334264major facilitator transporter
YpAngola_A2824-212-0.510666purine-binding chemotaxis protein
YpAngola_A2825-213-0.488588chemotaxis protein CheA
YpAngola_A2826-217-1.042922flagellar motor protein MotB
YpAngola_A2827-112-0.460903flagellar motor protein MotA
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
YpAngola_A2823  TCRTETB  31     0.002    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 31.4 bits (71), Expect = 0.002
Identities = 29/152 (19%), Positives = 62/152 (40%), Gaps = 8/152 (5%)

Query: 18 FCIFFVYSAYCGLTYFIPF-LKDIYGLPVALIGAYGIINQYGLKMVGGPVGGFLADKVAK 76
C ++ G +P+ +KD++ L A IG+ I ++ G +GG L D+
Sbjct: 263 LCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGP 322

Query: 77 SPTVYLKWTFLISAIAMILFIQLPHDSMNVYLGMMATLGFGAIIFSQRAI-FFAPMDEIG 135
+ + TFL + F+ ++ + ++ ++ G + F++ I
Sbjct: 323 LYVLNIGVTFLSVSFLTASFLL---ETTSWFMTIIIVFVLGGLSFTKTVISTIVSS---S 376

Query: 136 TSREHAGSAMAFGCIIGYMPSMFAYALYGSLL 167
++ AG+ M+ ++ A+ G LL
Sbjct: 377 LKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLL 408


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
YpAngola_A2825  PF06580  38     1e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.5 bits (87), Expect = 1e-04
Identities = 13/70 (18%), Positives = 30/70 (42%), Gaps = 10/70 (14%)

Query: 421 ELDKSLIERIIDPLT--HLVRNSLDHGIEEPATRIAAGKSPVGNLTLSAEHQGGNICIEV 478
+++ ++++ + P+ LV N + HGI + G + L G + +EV
Sbjct: 245 QINPAIMDVQVPPMLVQTLVENGIKHGIAQ--------LPQGGKILLKGTKDNGTVTLEV 296

Query: 479 IDDGAGLNRQ 488
+ G+ +
Sbjct: 297 ENTGSLALKN 306


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
YpAngola_A2826  PF05272  32     0.004    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 32.4 bits (73), Expect = 0.004
Identities = 23/88 (26%), Positives = 31/88 (35%), Gaps = 3/88 (3%)

Query: 47 LLAVSSPQELTQIAEYFRTPLKVALTSGDKSSSSTSPIPGGGDDPTQQVGEVRKQINSEE 106
L VSSP A P K ++G + + PGGGDD GE +
Sbjct: 384 LADVSSPTAAAGGAGGGEPPKKRDPSAG---AGTDPGGPGGGDDGEDPFGEWLDDEVARL 440

Query: 107 SRQEIHRLNKLREKLDQLIESDPRLKAL 134
+ L R L + + S P L
Sbjct: 441 RLRGRWLLKPRRAALIEALRSAPALAGC 468


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
YpAngola_A2827  PF05844  32     0.002    YopD protein
		>PF05844#YopD protein

Length = 295

Score = 32.3 bits (73), Expect = 0.002
Identities = 11/28 (39%), Positives = 21/28 (75%), Gaps = 2/28 (7%)

Query: 76 MDLMALLYRLLAKSRQQGMLSLERDIEN 103
++L+ +L+R+ K+R+ G+ L+RD EN
Sbjct: 74 VELLLILFRIAQKARELGV--LQRDNEN 99


72  YpAngola_A2879  YpAngola_A2883  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
YpAngola_A2879-112-1.125077multidrug efflux protein
YpAngola_A2880114-2.302720multidrug efflux protein
YpAngola_A2881117-2.848539DNA-binding transcriptional repressor AcrR
YpAngola_A2882116-2.696778hypothetical protein
YpAngola_A2883015-2.614054potassium efflux protein KefA
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2879  ACRIFLAVINRP  1342   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1342 bits (3475), Expect = 0.0
Identities = 806/1032 (78%), Positives = 918/1032 (88%)

Query: 1 MAKFFIDRPIFAWVIAIIIMLAGALAIMKLPVAQYPTIAPPAITIAANYPGADATTVQNT 60
MA FFI RPIFAWV+AII+M+AGALAI++LPVAQYPTIAPPA++++ANYPGADA TVQ+T
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTQVIEQNMNGIDNLLYMSSSSDSSGNVQLTLTFNSGTDPDIAQVQVQNKLQLAMPLLPQ 120
VTQVIEQNMNGIDNL+YMSS+SDS+G+V +TLTF SGTDPDIAQVQVQNKLQLA PLLPQ
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 EVQQQGVSVEKSSSSFLMVAGFISEDGTMQQEDIADYVGSNIKDPISRTPGVGDVQLFGS 180
EVQQQG+SVEKSSSS+LMVAGF+S++ Q+DI+DYV SN+KD +SR GVGDVQLFG+
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 QYAMRIWMDPHKLNNYKLTPVDVINAIKIQNNQVAAGQLGGTPPVPGQELNSSIIAQTRL 240
QYAMRIW+D LN YKLTPVDVIN +K+QN+Q+AAGQLGGTP +PGQ+LN+SIIAQTR
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 241 TNAEEFSQILLKVNTDGSQVRLKDVAIVKLGAESYNIIARYNGKPAAGIGIKLATGANAL 300
N EEF ++ L+VN+DGS VRLKDVA V+LG E+YN+IAR NGKPAAG+GIKLATGANAL
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 301 NTSAAVKAELAKLQPFFPSGLTVVYPYDTTPFVKISINEVVKTLIEAIILVFLVMYLFLQ 360
+T+ A+KA+LA+LQPFFP G+ V+YPYDTTPFV++SI+EVVKTL EAI+LVFLVMYLFLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 361 NFRATLIPTIAVPVVLLGTFAILSAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420
N RATLIPTIAVPVVLLGTFAIL+AFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 421 QEEGLPPKEATKKSMEQIQGALVGIALVLSAVFVPMAFFGGATGAIYRQFSITIVSAMVL 480
E+ LPPKEAT+KSM QIQGALVGIA+VLSAVF+PMAFFGG+TGAIYRQFSITIVSAM L
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 481 SVLVALILTPALCATMLKPIKKGDHGPKTGFFGWFNNMFEKSTHHYTDSVANILRSTGRY 540
SVLVALILTPALCAT+LKP+ H K GFFGWFN F+ S +HYT+SV IL STGRY
Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540

Query: 541 LVIYLAIVIGMAVLFMRLPSSFLPEEDQGVFLTMVQLPAGATQERTQKVLNHVTDYYLDK 600
L+IY IV GM VLF+RLPSSFLPEEDQGVFLTM+QLPAGATQERTQKVL+ VTDYYL
Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600

Query: 601 EKNVVNSVFTVNGFGFSGQGQNTGLAFVSLKNWDERKGEQNKVPAIVSRASAAFSKIKDG 660
EK V SVFTVNGF FSGQ QN G+AFVSLK W+ER G++N A++ RA KI+DG
Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660

Query: 661 MVFAFNLPAIVELGTATGFDFQLIDQGNLGHQQLTDARNQLLGMAAQHPDMLVGVRPNGL 720
V FN+PAIVELGTATGFDF+LIDQ LGH LT ARNQLLGMAAQHP LV VRPNGL
Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720

Query: 721 EDTPQFKVEVDQEKAQALGVAISDINTTLGSAMGGSYVNDFIDRGRVKKVYVQADAPFRM 780
EDT QFK+EVDQEKAQALGV++SDIN T+ +A+GG+YVNDFIDRGRVKK+YVQADA FRM
Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780

Query: 781 LPDDIDKWYVRNNMGQMVSFATFSTAKWEYGSPRLERYNGLPSMEILGQAAPGKSTGEAM 840
LP+D+DK YVR+ G+MV F+ F+T+ W YGSPRLERYNGLPSMEI G+AAPG S+G+AM
Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840

Query: 841 DLMQELAAKLPSGVGYDWTGMSYQERLSGNQAPALYAISLIVVFLCLAALYESWSIPFSV 900
LM+ LA+KLP+G+GYDWTGMSYQERLSGNQAPAL AIS +VVFLCLAALYESWSIP SV
Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900

Query: 901 MLVVPLGVVGALLAATLRGLENDVYFQVGLLTTIGLSAKNAILIVEFAKDLMDKEGKGLV 960
MLVVPLG+VG LLAATL +NDVYF VGLLTTIGLSAKNAILIVEFAKDLM+KEGKG+V
Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960

Query: 961 ESTLESVRMRLRPILMTSLAFILGVMPLVISSGAGSGAQNAVGTGVMGGMITATVLAIFF 1020
E+TL +VRMRLRPILMTSLAFILGV+PL IS+GAGSGAQNAVG GVMGGM++AT+LAIFF
Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020

Query: 1021 VPLFFVVVRRRF 1032
VP+FFVV+RR F
Sbjct: 1021 VPVFFVVIRRCF 1032


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
YpAngola_A2880  RTXTOXIND  39     2e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 39.4 bits (92), Expect = 2e-05
Identities = 22/166 (13%), Positives = 52/166 (31%), Gaps = 45/166 (27%)

Query: 89 QIDPATYQAAYDSAKGDLAKAQASAQIAHLTVNRYKPLLGTNYISKQ---EYDQALSDAQ 145
+++ +A + + + + +++ ++ + LL I+K E + +A
Sbjct: 206 ELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAV 265

Query: 146 QADATVLAAKAALES----------------------------------------ARINL 165
+ +ES
Sbjct: 266 NELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQ 325

Query: 166 AYTQVRSPISGRTGKSAV-TEGALVTSGQASAMTTVQQLDPMYVDV 210
+ +R+P+S + + V TEG +VT+ + M V + D + V
Sbjct: 326 QASVIRAPVSVKVQQLKVHTEGGVVTTAET-LMVIVPEDDTLEVTA 370


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
YpAngola_A2881  HTHTETR  165    7e-54    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 165 bits (420), Expect = 7e-54
Identities = 135/210 (64%), Positives = 164/210 (78%)

Query: 1 MARKTKQKAEETRQQILDAAVREFSAHGVSRTSLTDIAIAAGVTRGAIYWHFKNKVDLFN 60
MARKTKQ+A+ETRQ ILD A+R FS GVS TSL +IA AAGVTRGAIYWHFK+K DLF+
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 61 EVWELSESKIDQLEIEYQAKYPDNPLRILRELLIYILVSTREDRRRRALMEIVFHKCEFV 120
E+WELSES I +LE+EYQAK+P +PL +LRE+LI++L ST + RRR LMEI+FHKCEFV
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120

Query: 121 GEMTSVHDARKVLDLASYERIESVLQGCIDANQLPVNLNTHRAAIIMRAYITGLMENWLF 180
GEM V A++ L L SY+RIE L+ CI+A LP +L T RAAIIMR YI+GLMENWLF
Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180

Query: 181 MPESFDIKQEAPVLIDAYLEMLGQSFSLRN 210
P+SFD+K+EA + LEM +LRN
Sbjct: 181 APQSFDLKKEARDYVAILLEMYLLCPTLRN 210


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A2882  ADHESNFAMILY  26     0.034    Adhesin family signature.
		>ADHESNFAMILY#Adhesin family signature.

Length = 309

Score = 26.0 bits (57), Expect = 0.034
Identities = 9/71 (12%), Positives = 27/71 (38%)

Query: 47 IAGLNGQQPREGYNLQQMLEILTAQNVPIKLCKTCADARGIAGLTLVDGVEIGTLVELAQ 106
I +N ++ ++ ++E L VP ++ D R + ++ + I +
Sbjct: 222 IWEINTEEEGTPEQIKTLVEKLRQTKVPSLFVESSVDDRPMKTVSQDTNIPIYAQIFTDS 281

Query: 107 WTLAAEKVLTF 117
++ ++
Sbjct: 282 IAEQGKEGDSY 292


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits        Score  E-value  Comments
YpAngola_A2883  GPOSANCHOR  40     4e-05    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 40.4 bits (94), Expect = 4e-05
Identities = 28/235 (11%), Positives = 58/235 (24%), Gaps = 21/235 (8%)

Query: 35 SEVQSQLDLLSKQKILSPAEKLAQQDLTQTLE-YLDTIERTKQEANQLKQQLAQAPAKLR 93
S + +L K ++ + LE L+ + + L A L
Sbjct: 95 SNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALA 154

Query: 94 QATEGLE-ALKSSSADTMTKESLANYSLRQLESRLNETLDNLQSAQEDLSAYNSQLIALQ 152
LE AL+ + + +++ + + + L
Sbjct: 155 ARKADLEKALEGAMNFS-----------TADSAKIKTLEAEKAALEARQAELEKALEGAM 203

Query: 153 TQPERVQSAMYSASMRLMQIRNQLNGLTPNQESLRPTQQ--QELLAEQVMLNGQLDLERK 210
+ + + + + L E + L+ +
Sbjct: 204 NFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQA 263

Query: 211 NLEANTTLQDLLQKQRDYTTAHINQLERYVQLLQEVVSGKRLILSEKTVKEAQAQ 265
LE + +A I LE L+ K + + V A Q
Sbjct: 264 ELEK---ALEGAMNFSTADSAKIKTLEAEKAALEAE---KADLEHQSQVLNANRQ 312



Score = 32.0 bits (72), Expect = 0.016
Identities = 36/201 (17%), Positives = 72/201 (35%), Gaps = 33/201 (16%)

Query: 37 VQSQLDLLSKQKILSPAEKLAQQDLTQTLEYLDTIERTKQEANQLKQQ------------ 84
+ L ++Q L A + A T + T+E K K
Sbjct: 252 EAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANR 311

Query: 85 --LAQAPAKLRQATEGLEA-----------LKSSSADTMTKESLANYSLRQLESRLNETL 131
L + R+A + LEA ++S + + +QLE+ +
Sbjct: 312 QSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLE 371

Query: 132 DNLQSAQEDLSAYNSQLIALQTQPERVQSAMYSASMRLMQIRNQLNGLTPNQESLRPTQQ 191
+ + ++ + L A + ++V+ A+ A+ +L + L +ES + T++
Sbjct: 372 EQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKEL---EESKKLTEK 428

Query: 192 QELLAEQVMLNGQLDLERKNL 212
E+ L +L+ E K L
Sbjct: 429 -----EKAELQAKLEAEAKAL 444


73  YpAngola_A2959  YpAngola_A2968  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
YpAngola_A29594181.314601hypothetical protein
YpAngola_A2960118-1.056024insertion sequence transposase
YpAngola_A2961017-2.410309transposase/IS protein
YpAngola_A2963118-3.156921FAD binding domain-containing protein
YpAngola_A2964322-6.136633short chain dehydrogenase/reductase family
YpAngola_A2965221-6.514853hypothetical protein
YpAngola_A2967121-6.794745beta-ketoacyl synthase family protein
YpAngola_A2968123-7.337897short-chain dehydrogenase/reductase family
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A2959  ICENUCLEATIN  32  0.008  Ice nucleation protein signature.
		>ICENUCLEATIN#Ice nucleation protein signature.

Length = 1258

Score = 32.0 bits (72), Expect = 0.008
Identities = 51/236 (21%), Positives = 88/236 (37%), Gaps = 8/236 (3%)

Query: 407 TGMSVSATGISVSTTGTSLSVTGMSTSVTGVSVGFTLIGTS--FTGVSTSFTGVGTSFTG 464
+G + I ++T G++LS T S + G T +S G ++ T S
Sbjct: 150 SGSTQPTQTIEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLV 209

Query: 465 ASNSLTGVSNSMTGCSSSFTGTSNSMTGSSHSMTGMSTSITGHSMSQ-TGSSSSITGDST 523
A T + + + + T M GS + ST G S G S+ T
Sbjct: 210 AGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGED 269

Query: 524 SFTGSSVSSTGSSVSTTGVSTSTTGSSTSTTGCSVSTTGSSTSTTGNSVSMTG----NST 579
S + ST ++ + ++ + T+ S+ ST T G + T T
Sbjct: 270 SSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQT 329

Query: 580 STTGCSISTTGSSIGTVGSSISTTGSSVSTTGSSISTTGLSVSYTGAQYSDVGVDL 635
+ G ++ S GT G S+ + +T ++ + L+ Y Q + G DL
Sbjct: 330 AQKGSDLTAGYGSTGTAGDD-SSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDL 384



Score = 30.9 bits (69), Expect = 0.022
Identities = 31/143 (21%), Positives = 63/143 (44%), Gaps = 6/143 (4%)

Query: 492 GSSHSMTGMSTSITGHSMSQTGSSSSITGDSTSFTGSSVSSTGSSVSTTGVSTSTTGSST 551
GS+ + S+ I G+ +QT +SI T+ GS+ ++ S T G +++T +
Sbjct: 629 GSTSTAGADSSLIAGYGSTQTAGYNSIL---TAGYGSTQTAQEGSDLTAGYGSTSTAGAD 685

Query: 552 STTGCSVSTTGSSTSTTGNSVSMTGNSTSTTGCSISTTGSSIGTVGSS---ISTTGSSVS 608
S+ +T ++ + + T+ G +++ S T G+ I+ GS+ +
Sbjct: 686 SSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQT 745

Query: 609 TTGSSISTTGLSVSYTGAQYSDV 631
+ S T G + T + S +
Sbjct: 746 ASYHSSLTAGYGSTQTAREQSVL 768



Score = 30.1 bits (67), Expect = 0.038
Identities = 32/143 (22%), Positives = 62/143 (43%), Gaps = 6/143 (4%)

Query: 492 GSSHSMTGMSTSITGHSMSQTGSSSSITGDSTSFTGSSVSSTGSSVSTTGVSTSTTGSST 551
GS+ + S+ I G+ +QT S T+ GS+ ++ S TG +++T +
Sbjct: 485 GSTSTAGYESSLIAGYGSTQTAGYGSTL---TAGYGSTQTAQNESDLITGYGSTSTAGAN 541

Query: 552 STTGCSVSTTGSSTSTTGNSVSMTGNSTSTTGCSISTTGSSIGTVGSS---ISTTGSSVS 608
S+ +T +++ + + T+ G ++ S GT GS I+ GS+ +
Sbjct: 542 SSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQT 601

Query: 609 TTGSSISTTGLSVSYTGAQYSDV 631
+ S T G + T + S +
Sbjct: 602 ASYHSSLTAGYGSTQTAREQSVL 624


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A2960  HTHTETR  28  0.047  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.1 bits (62), Expect = 0.047
Identities = 12/106 (11%), Positives = 31/106 (29%), Gaps = 14/106 (13%)

Query: 16 QGMSSRAIARELGISRNTVKRYLQAKSEPPKYTPRPAVASLLDEYRDYIRQRIAD----- 70
S IA+ G++R + + + KS+ + + + I + +
Sbjct: 30 SSTSLGEIAKAAGVTRGAIYWHFKDKSD--------LFSEIWELSESNIGELELEYQAKF 81

Query: 71 -AHPYKIPATVIAREIRDQGYRGGMTILRAFIRSLSVPQEQEPAVR 115
P + ++ + +L I + V+
Sbjct: 82 PGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A2964  DHBDHDRGNASE  104  3e-29  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 104 bits (261), Expect = 3e-29
Identities = 67/252 (26%), Positives = 106/252 (42%), Gaps = 10/252 (3%)

Query: 3 KTILITGALSGIGNTATKLFSEMGYNVVFSGRRPEEGRVILDDLKRINKDVLYVNADMNS 62
K ITGA GIG + + G ++ PE+ ++ LK + AD+
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 63 ESDIKHLIEMTLERFGSLDVAVNCAGTVGETAEIQAVTQDNFHLVFNTNVLGTLLAMKYQ 122
+ I + G +D+ VN AG + I +++ + + F+ N G A +
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVL-RPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 123 IPVMVERGKGSIINISSIAGLVGLPSTGIYVASKHAIEGLTKTAALEVATTGVRINSISP 182
M++R GSI+ + S V S Y +SK A TK LE+A +R N +SP
Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 183 GPVEGKMFDRFLGHDENNKKAFIE--------MMPNKRFTTQEEVAHTIVFLAEDNVTAI 234
G E M L DEN + I+ +P K+ ++A ++FL I
Sbjct: 188 GSTETDM-QWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246

Query: 235 TGQTITIDGGYT 246
T + +DGG T
Sbjct: 247 TMHNLCVDGGAT 258


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A2968  DHBDHDRGNASE  122  6e-36  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 122 bits (307), Expect = 6e-36
Identities = 76/251 (30%), Positives = 118/251 (47%), Gaps = 16/251 (6%)

Query: 2 NLFISGGASGIGRSVVIAALSKGWNV-GFSYHNNKEGAQQLLDIAVAEFPRQLCRAYQLD 60
FI+G A GIG +V S+G ++ Y+ K A A + D
Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEA----FPAD 65

Query: 61 VIDSGAVEYVGDRLLVDFSNIDAVVCNAGIDLPGNLVSMTDEDWALVLNTNLTGTFYLIR 120
V DS A++ + R+ + ID +V AG+ PG + S++DE+W + N TG F R
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 121 YFLPLFLANKYGRIVTL-SSLAKDGSSGQAAYAASKAGLVGLTKTTAKEYGHFGITANVV 179
+ + G IVT+ S+ A + AAYA+SKA V TK E + I N+V
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 180 VPGLINTEI-----IGDD-----IKGIKNFFAQYAPVGRLGSPSEVAEAILFLVAKESSY 229
PG T++ ++ IKG F P+ +L PS++A+A+LFLV+ ++ +
Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245

Query: 230 VNGAVFNVTGG 240
+ V GG
Sbjct: 246 ITMHNLCVDGG 256


74  YpAngola_A3051  YpAngola_A3058  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A3051  3       22       -1.341696  transcriptional regulator HU subunit beta
YpAngola_A3052  3       21       -1.409653  DNA-binding ATP-dependent protease La
YpAngola_A3053  1       16       -1.237404  ATP-dependent protease ATP-binding subunit ClpX
YpAngola_A3054  0       18       -1.946878  ATP-dependent Clp protease proteolytic subunit
YpAngola_A3055  0       19       -2.246385  trigger factor
YpAngola_A3056  -2      17       -1.560584  transcriptional regulator BolA
YpAngola_A3057  -1      18       -1.342154  hypothetical protein
YpAngola_A3058  0       19       -1.425506  muropeptide transporter
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A3051  DNABINDINGHU  121  6e-40  Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 121 bits (305), Expect = 6e-40
Identities = 48/88 (54%), Positives = 65/88 (73%)

Query: 2 NKSQLIDKIAAGADISKAAAGRALDAIITSVTESLKEGDDVALVGFGTFAVRERSARTGR 61
NK LI K+A +++K + A+DA+ ++V+ L +G+ V L+GFG F VRER+AR GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 62 NPQTGKEISIPAAKVPGFRAGKGLKDAV 89
NPQTG+EI I A+KVP F+AGK LKDAV
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A3052  PF05272  34  0.004  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 33.5 bits (76), Expect = 0.004
Identities = 15/76 (19%), Positives = 32/76 (42%), Gaps = 6/76 (7%)

Query: 296 DWMLQVPWNSHSKVKKDLVKAQEVLDTDHYGLERVKDRILEYLAVQSRVSKIKGP----- 350
DW+ W+ +++K LV D+ +++ + V+++ P
Sbjct: 537 DWVKAQQWDEVPRLEKWLVHVLGKTPDDYKPRRLRYLQLVGKYILMGHVARVMEPGCKFD 596

Query: 351 -ILCLVGPPGVGKTSL 365
+ L G G+GK++L
Sbjct: 597 YSVVLEGTGGIGKSTL 612


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A3053  HTHFIS  29  0.032  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.4 bits (66), Expect = 0.032
Identities = 15/72 (20%), Positives = 27/72 (37%), Gaps = 13/72 (18%)

Query: 61 RSSLPTPHEIRHHLDDYVIGQEPAKKVLAVAVYNHYKRLRNGDTSNGIELGKSNILLIGP 120
P+ E ++G+ A + +Y RL D +++ G
Sbjct: 122 PKRRPSKLEDDSQDGMPLVGRSAAMQ----EIYRVLARLMQTD---------LTLMITGE 168

Query: 121 TGSGKTLLAETL 132
+G+GK L+A L
Sbjct: 169 SGTGKELVARAL 180


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A3057  PF06291  28  0.012  Lambda prophage Bor protein
		>PF06291#Lambda prophage Bor protein

Length = 102

Score = 27.7 bits (61), Expect = 0.012
Identities = 19/73 (26%), Positives = 34/73 (46%), Gaps = 3/73 (4%)

Query: 2 LKKILFPLLAIFILAGCATTSNTLNVTPKVVLPTQDPTLMGVTISINGADQRRDAALAKV 61
+KK+LF ++ GCA + T+ P V P + T ++G Q++ AK+
Sbjct: 6 MKKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKETITH---HFFVSGIGQKKTVDAAKI 62

Query: 62 NRDGQLVVLTPSR 74
+ VV T ++
Sbjct: 63 CGGAENVVKTETQ 75


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A3058  TCRTETB  47  1e-07  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 46.8 bits (111), Expect = 1e-07
Identities = 44/199 (22%), Positives = 77/199 (38%), Gaps = 15/199 (7%)

Query: 221 RNNAWLI-LLLIVFYKMGDAFAASLSTTFLIRGVGFDAGEVGLVNKTLGLIATIIGALYG 279
R+N LI L ++ F+ + + ++S + VN L +I A+YG
Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70

Query: 280 GLLMQRLSLFRALMIFGILQAVSNMGYWLLAITDKNIFSMGSAIFLENLCGGMGTAAFVA 339
L +L + R L+ I+ + ++ + FS+ + + G G AAF A
Sbjct: 71 KL-SDQLGIKRLLLFGIIINCFGS----VIGFVGHSFFSL---LIMARFIQGAGAAAFPA 122

Query: 340 LLM----TLCNKSFSATQFALLSALSAVGRVYVGP-IAGWFVEAHGWPLFYLFSIAAAIP 394
L+M K F L+ ++ A+G VGP I G W L + I
Sbjct: 123 LVMVVVARYIPKENRGKAFGLIGSIVAMG-EGVGPAIGGMIAHYIHWSYLLLIPMITIIT 181

Query: 395 GLLLLYVCRQTLDHTQKTD 413
L+ + ++ + D
Sbjct: 182 VPFLMKLLKKEVRIKGHFD 200


75  YpAngola_A3098  YpAngola_A3104  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias   Product
YpAngola_A3098  -1      22       4.820578  DNA-binding transcriptional regulator BaeR
YpAngola_A3099  -1      20       4.395143  signal transduction histidine-protein kinase
YpAngola_A3100  0       20       4.787054  multidrug efflux system protein MdtE
YpAngola_A3101  -1      20       4.733828  multidrug efflux system subunit MdtC
YpAngola_A3102  -1      18       3.882906  multidrug efflux system subunit MdtB
YpAngola_A3103  -1      15       3.391116  multidrug efflux system subunit MdtA
YpAngola_A3104  0       16       3.110242  ABC transporter ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A3098  HTHFIS  78  8e-19  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 78.3 bits (193), Expect = 8e-19
Identities = 33/150 (22%), Positives = 70/150 (46%), Gaps = 5/150 (3%)

Query: 10 QSGSVLIVEDEPKLGQLLVDYLQAAGYRTQWLTNGAEVVATVRQTPPAIILLDLMLPGSD 69
++L+ +D+ + +L L AGY + +N A + + +++ D+++P +
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 70 GITLCREIR-RFSDIPIVMVTAKTEEIDRLLGLEIGADDYICKPYSPREVVARVKTIL-- 126
L I+ D+P+++++A+ + + E GA DY+ KP+ E++ + L
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 127 --RRCSQQRHQPTDDAPLLINESRFQASYQ 154
RR S+ D PL+ + Q Y+
Sbjct: 122 PKRRPSKLEDDSQDGMPLVGRSAAMQEIYR 151


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A3099  BCTERIALGSPF  34  0.001  Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 34.0 bits (78), Expect = 0.001
Identities = 24/90 (26%), Positives = 38/90 (42%), Gaps = 21/90 (23%)

Query: 170 LSTLLAAAVTWVLS-------------RGMLAPVKRLVEGTHRLAA------GDFST--R 208
L+TL+AA++ + ++A V+ V H LA G F
Sbjct: 77 LATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLADAMKCFPGSFERLYC 136

Query: 209 VAVSSRDELGHLAQDFNQLASSLEKNEQMR 238
V++ + GHL N+LA E+ +QMR
Sbjct: 137 AMVAAGETSGHLDAVLNRLADYTEQRQQMR 166


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A3100  TCRTETB  126  5e-34  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 126 bits (318), Expect = 5e-34
Identities = 97/435 (22%), Positives = 182/435 (41%), Gaps = 17/435 (3%)

Query: 20 FMQTLDTTIVNTALPSIAASLGENPLRMQSVIVSYVLTVAVMLPASGWLADRIGVKWVFF 79
F L+ ++N +LP IA + P V +++LT ++ G L+D++G+K +
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 80 SAIILFTFGSLMCAQSATLNE-LILSRVLQGVGGAMMVPVGRLTVMKIVPREQYMAAMAF 138
II+ FGS++ + LI++R +QG G A + + V + +P+E A
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 139 VTLPGQIGPLVGPALGGFLVEFASWHWIFLINLP-VGVIGALATLLLMPNHKMSTRRFDI 197
+ +G VGPA+GG + + HW +L+ +P + +I + L+ FDI
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201

Query: 198 SGFIMLAIGMATLTLALDGHTGLGLSPLAIAGLILCGVIALGSYWWHALGNRFALFSLHL 257
G I++++G+ L + ++ V++ + H L
Sbjct: 202 KGIILMSVGIVFFMLF---------TTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGL 252

Query: 258 FKNKIYTLGLVGSMSARIGSGMLPFMTPIFLQIGLGFSPFHAG-LMMIPMIIGSMGMKRI 316
KN + +G++ M P ++ S G +++ P + + I
Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312

Query: 317 IVQVVNRFGYRRVLVNATLLLAVVSLSLPLVAIMGWTLLMPVVLFFQGMLNALRFSTMNT 376
+V+R G VL L+V L+ + + +++F G L+ + ++T
Sbjct: 313 GGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTV-IST 371

Query: 377 LTLKTLPDRLASSGNSLLSMAMQLSMSIGVSTAGILLGTFAHHQVATNTPATHSAFLYS- 435
+ +L + A +G SLL+ LS G++ G LL Q S +LYS
Sbjct: 372 IVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQRLLPMEVDQSTYLYSN 431

Query: 436 -YLCMAIIIALPALI 449
L + II + L+
Sbjct: 432 LLLLFSGIIVISWLV 446


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A3101  ACRIFLAVINRP  864  0.0  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 864 bits (2235), Expect = 0.0
Identities = 286/1035 (27%), Positives = 504/1035 (48%), Gaps = 36/1035 (3%)

Query: 6 LFIQRPVATTLLTLAITLSGIIGFSLLPVSPLPQVDYPVIMVSASMPGADPETMASSVAT 65
FI+RP+ +L + + ++G + LPV+ P + P + VSA+ PGAD +T+ +V
Sbjct: 4 FFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQ 63

Query: 66 PLERALGRIAGVNEMTSTS-SLGSTRIILQFDLNRDINGAARDVQAALNAAQSLLPSGMP 124
+E+ + I + M+STS S GS I L F D + A VQ L A LLP +
Sbjct: 64 VIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQ 123

Query: 125 SRPTYRKMNPSDAPIMIMTLTSDT--FSQGQLYDYASTKLAQKIAQTEGVSDVTVGGSSL 182
+ S + +M+ SD +Q + DY ++ + +++ GV DV + G+
Sbjct: 124 -QQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY 182

Query: 183 PAVRVELNPSALFNQGVSLDAVRQAISAANVRRPQGSVDAAET------HWQVQANDEIK 236
A+R+ L+ L ++ V + N + G + + + A K
Sbjct: 183 -AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241

Query: 237 TAEGYRPLIVHYN-NGSPVRLQDVANVIDSVQDVRNAGMSAGQPAVLLVISREPGANIIA 295
E + + + N +GS VRL+DVA V ++ G+PA L I GAN +
Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301

Query: 296 TVDRIRAELPALRASIPASIQLNIAQDRSPTIRASLDEVERSLVIAVALVILVVFIFLRS 355
T I+A+L L+ P +++ D +P ++ S+ EV ++L A+ LV LV+++FL++
Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361

Query: 356 GRATLIPAVAVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVVDDAIVVLENISRHL- 414
RATLIP +AVPV L+GTFA + G+S+N L++ + +A G +VDDAIVV+EN+ R +
Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421

Query: 415 EAGVKPKVAALRGVREVGFTVLSMSISLVAVFIPLLLMAGLPGRLFREFAVTLSVAIGIS 474
E + PK A + + ++ ++ +++ L AVFIP+ G G ++R+F++T+ A+ +S
Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481

Query: 475 LVISLTLTPMMCAWLLRSHPKGQQQRIRGFG----KVLLAIQQGYGRSLNWALSHTRWVM 530
++++L LTP +CA LL+ + GF Y S+ L T +
Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541

Query: 531 VVLLSTIALNVWLYISIPKTFFPEQDTGRMMGFIQADQSISFQSMQQKLKDFMQIVGADP 590
++ +A V L++ +P +F PE+D G + IQ + + Q+ L +
Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601

Query: 591 -----AVDSVTGFT-GGSRTNSGSMFISLKPLSER---QETAQQVITRLRGKLAKEPGAN 641
+V +V GF+ G N+G F+SLKP ER + +A+ VI R + +L K
Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661

Query: 642 LFLSSVQDIRVGGRHSNAAYQFTLLADDLAALREWEPKVRAALAKL-----PQLADVNSD 696
+ ++ I G + ++ L D + + R L + L V +
Sbjct: 662 VIPFNMPAIVELGTATGFDFE---LIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPN 718

Query: 697 QQDKGAEMALTYDRETMARLGIDVSEANALLNNAFGQRQISTIYQPLNQYKVVMEVAPEY 756
+ A+ L D+E LG+ +S+ N ++ A G ++ K+ ++ ++
Sbjct: 719 GLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKF 778

Query: 757 TQDVSSLDKMFVINSNGQSIPLSYFAKWQPANAPLAVNHQGLSAASTISFNLPDGGSLSE 816
+DK++V ++NG+ +P S F + + I G S +
Sbjct: 779 RMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGD 838

Query: 817 ATAAVERAMTELGVPSTVRGAFAGTAQVFQETLKSQLWLIMAAIATVYIVLGILYESYVH 876
A A +E ++L P+ + + G + + + L+ + V++ L LYES+
Sbjct: 839 AMALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSI 896

Query: 877 PLTILSTLPSAGVGALLALELFDAPFSLIALIGIMLLIGIVKKNAIMMVDFALDAQRNGN 936
P++++ +P VG LLA LF+ + ++G++ IG+ KNAI++V+FA D
Sbjct: 897 PVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEG 956

Query: 937 ISAREAIFQASLLRFRPIIMTTLAALFGALPLVLSSGDGAELRQPLGITIVGGLVVSQLL 996
EA A +R RPI+MT+LA + G LPL +S+G G+ + +GI ++GG+V + LL
Sbjct: 957 KGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLL 1016

Query: 997 TLYTTPVIYLYFDRL 1011
++ PV ++ R
Sbjct: 1017 AIFFVPVFFVVIRRC 1031



Score = 78.0 bits (192), Expect = 1e-16
Identities = 59/350 (16%), Positives = 130/350 (37%), Gaps = 12/350 (3%)

Query: 680 VRAALAKLPQLADVNSDQQDKGAEMALTYDRETMARLGI---DVSEANALLNNAFGQRQI 736
V+ L++L + DV M + D + + + + DV + N+ Q+
Sbjct: 162 VKDTLSRLNGVGDVQLFGAQY--AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQL 219

Query: 737 STIYQPLNQYKVVMEVAPEYTQDVSSLDKMFV-INSNGQSIPLSYFAK--WQPANAPLAV 793
Q +A ++ K+ + +NS+G + L A+ N +
Sbjct: 220 GGTPALPGQQLNASIIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIA 279

Query: 794 NHQGLSAASTISFNLPDGGSLSEATAAVERAMTEL--GVPSTVRGAFA-GTAQVFQETLK 850
G AA +L + A++ + EL P ++ + T Q ++
Sbjct: 280 RINGKPAAGLGIKLATGANAL-DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIH 338

Query: 851 SQLWLIMAAIATVYIVLGILYESYVHPLTILSTLPSAGVGALLALELFDAPFSLIALIGI 910
+ + AI V++V+ + ++ L +P +G L F + + + G+
Sbjct: 339 EVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGM 398

Query: 911 MLLIGIVKKNAIMMVDFALDAQRNGNISAREAIFQASLLRFRPIIMTTLAALFGALPLVL 970
+L IG++ +AI++V+ + +EA ++ ++ + +P+
Sbjct: 399 VLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAF 458

Query: 971 SSGDGAELRQPLGITIVGGLVVSQLLTLYTTPVIYLYFDRLRNRFSKQPL 1020
G + + ITIV + +S L+ L TP + + + +
Sbjct: 459 FGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENK 508


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A3102  ACRIFLAVINRP  872  0.0  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 872 bits (2254), Expect = 0.0
Identities = 289/1036 (27%), Positives = 501/1036 (48%), Gaps = 29/1036 (2%)

Query: 13 SRLFILRPVATTLFMIAILLAGIIGYRALPVSALPEVDYPTIQVVTLYPGASPDVVTSSI 72
+ FI RP+ + I +++AG + LPV+ P + P + V YPGA V ++
Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61

Query: 73 TAPLERQFGQMSGLKQMASQS-SGGASVITLQFQLTLPLDVAEQEVQAAINAATNLLPSD 131
T +E+ + L M+S S S G+ ITL FQ D+A+ +VQ + AT LLP +
Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121

Query: 132 LPYPPIYNKVNPADPPILTLAVTATAIPMTQVE--DMVETRIAQKISQVTGVGLVTLSGG 189
+ I + ++ + TQ + D V + + +S++ GVG V L G
Sbjct: 122 VQQQGIS-VEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 190 QRPAVRVKLNAPAVAALGLDSETIRTAISNANVNSAKGSLDGP------TRSVTLSANDQ 243
Q A+R+ L+A + L + + N A G L G + ++ A +
Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 244 MKSAEEYRDLII-AYQNGAPIRLQDVATIEQGAENNKLAAWANTQSAIVLNIQRQPGVNV 302
K+ EE+ + + +G+ +RL+DVA +E G EN + A N + A L I+ G N
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 303 IATADSIREMLPELIKSLPKSVDVKVLTDRTSTIRASVNDVQFELLLAIALVVMVIYLFL 362
+ TA +I+ L EL P+ + V D T ++ S+++V L AI LV +V+YLFL
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 363 RNAAATIIPSIAVPLSLVGTFAAMYFLGFSINNLTLMALTIATGFVVDDAIVVIENISRY 422
+N AT+IP+IAVP+ L+GTFA + G+SIN LT+ + +A G +VDDAIVV+EN+ R
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 423 I-EKGEKPLDAALKGAGEIGFTIISLTFSLIAVLIPLLFMEDIVGRLFREFAVTLAVAIL 481
+ E P +A K +I ++ + L AV IP+ F G ++R+F++T+ A+
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 482 ISAVVSLTLTPMMCARML---SYESLRKQNRLSRASEKFFDWVIAHYAVALKKVLNHPWL 538
+S +V+L LTP +CA +L S E + FD + HY ++ K+L
Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 539 TLSVAFSTLVLTVILYLLIPKGFFPLQDNGLIQGTLEAPQSVSFSNMAERQQQVAAIILK 598
L + + V+L+L +P F P +D G+ ++ P + + QV LK
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 599 DPA--VESLTSFVGVDGTNATLNNGRLQINLKPLSERDDRIP---QIITRLQESVSGVPG 653
+ VES+ + G + N G ++LKP ER+ +I R + + +
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659

Query: 654 IKLYLQPVQDLTIDTQLSRTQYQFTLQ---ATSLEELSTWVPKLVNELQQK-APFQDVTS 709
++ P I + T + F L + L+ +L+ Q A V
Sbjct: 660 --GFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRP 717

Query: 710 DWQDQGLVAFVNVDRDSASRLGITMAAIDNALYNAFGQRLISTIYTQSNQYRVVLEHDVQ 769
+ + + VD++ A LG++++ I+ + A G ++ + ++ ++ D +
Sbjct: 718 NGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAK 777

Query: 770 ATPGLAAFNDIRLTGIDGKGVPLSSIATIEERFGPLSINHLNQFPSATVSFNLAQGYSLG 829
+ + + +G+ VP S+ T +G + N PS + A G S G
Sbjct: 778 FRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSG 837

Query: 830 EAVAAVTLAEKEIQLPADITTRFQGSTLAFQAALGSTLWLIIAAIVAMYIVLGVLYESFI 889
+A+A + +LPA I + G + + + L+ + V +++ L LYES+
Sbjct: 838 DAMALMENLAS--KLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWS 895

Query: 890 HPITILSTLPTAGVGALLALMLTGNELDVIAIIGIILLIGIVKKNAIMMIDFALAAERDQ 949
P++++ +P VG LLA L + DV ++G++ IG+ KNAI++++FA +
Sbjct: 896 IPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKE 955

Query: 950 GMTPYDAIYQACLLRFRPILMTTLAALFGALPLMLSTGVGAELRQPLGVCMVGGLIVSQV 1009
G +A A +R RPILMT+LA + G LPL +S G G+ + +G+ ++GG++ + +
Sbjct: 956 GKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATL 1015

Query: 1010 LTLFTTPVIYLLFDKL 1025
L +F PV +++ +
Sbjct: 1016 LAIFFVPVFFVVIRRC 1031



Score = 84.1 bits (208), Expect = 2e-18
Identities = 77/517 (14%), Positives = 190/517 (36%), Gaps = 25/517 (4%)

Query: 533 LNHPWLTLSVAFSTLVLTVILYLLIPKGFFPLQDNGLIQGTLEAPQSVSFSNMAERQQQV 592
+ P +A ++ + L +P +P + + P + + Q V
Sbjct: 6 IRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGA----DAQTVQDTV 61

Query: 593 AAIILKDPAVESLTSFVGVDGTNAT-LNNGRLQINL--KPLSERDDRIPQIITRLQESVS 649
+I +++ + ++T + G + I L + ++ D Q+ +LQ +
Sbjct: 62 TQVI-----EQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATP 116

Query: 650 GVP-GIKLYLQPVQDLTIDTQLSRTQYQFTLQATSLEELSTWVPK-LVNELQQKAPFQDV 707
+P ++ V+ + + L + T+ +++S +V + + L + DV
Sbjct: 117 LLPQEVQQQGISVEKSS-SSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDV 175

Query: 708 TSDWQDQGLVAFVNVDRDSASRLGITMAAIDNALYNAFGQRLISTIYTQSNQYRVVLEHD 767
+ + +D D ++ +T + N L Q + L
Sbjct: 176 QLFGAQYAMR--IWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNAS 233

Query: 768 VQATPGLAAFNDIR----LTGIDGKGVPLSSIATIEERFGPLSIN-HLNQFPSATVSFNL 822
+ A + DG V L +A +E ++ +N P+A + L
Sbjct: 234 IIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKL 293

Query: 823 AQGYSLGEAVAAV--TLAEKEIQLPADI-TTRFQGSTLAFQAALGSTLWLIIAAIVAMYI 879
A G + + A+ LAE + P + +T Q ++ + + AI+ +++
Sbjct: 294 ATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFL 353

Query: 880 VLGVLYESFIHPITILSTLPTAGVGALLALMLTGNELDVIAIIGIILLIGIVKKNAIMMI 939
V+ + ++ + +P +G L G ++ + + G++L IG++ +AI+++
Sbjct: 354 VMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVV 413

Query: 940 DFALAAERDQGMTPYDAIYQACLLRFRPILMTTLAALFGALPLMLSTGVGAELRQPLGVC 999
+ + + P +A ++ ++ + +P+ G + + +
Sbjct: 414 ENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSIT 473

Query: 1000 MVGGLIVSQVLTLFTTPVIYLLFDKLARNTRGKNRHR 1036
+V + +S ++ L TP + K +N+
Sbjct: 474 IVSAMALSVLVALILTPALCATLLKPVSAEHHENKGG 510


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A3103  RTXTOXIND  43  1e-06  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 43.3 bits (102), Expect = 1e-06
Identities = 22/115 (19%), Positives = 42/115 (36%), Gaps = 10/115 (8%)

Query: 67 VIAANTVTVTSRVDGELMALHFTEGQQVKAGDLLAEIDPRPYEVQLTQAQGQLAKDQATL 126
+ + + + + + EG+ V+ GD+L ++ A+ K Q++L
Sbjct: 91 THSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTA-------LGAEADTLKTQSSL 143

Query: 127 DNARRDLARYQKLSK---TGLISQQELDTQSSLVRQSEGSVKADQGAIDSAKLQL 178
AR + RYQ LS+ + + +L + SE V I
Sbjct: 144 LQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTW 198



Score = 42.1 bits (99), Expect = 3e-06
Identities = 23/124 (18%), Positives = 54/124 (43%), Gaps = 4/124 (3%)

Query: 108 YEVQLTQAQGQLAKDQATLDNARRDLARYQKLSKTGLISQQELDTQSSLVRQSEGSVKAD 167
E + +A +L ++ L+ ++ + + L++Q + +RQ+ ++
Sbjct: 257 QENKYVEAVNELRVYKSQLEQIESEILSAK--EEYQLVTQLFKNEILDKLRQTTDNIGLL 314

Query: 168 QGAIDSAKLQLTYSRITAPISGRV-GLKQVDVGNYITSGTATPIVVITQTHPVDVVFTLP 226
+ + + S I AP+S +V LK G +T+ T +V++ + ++V +
Sbjct: 315 TLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE-TLMVIVPEDDTLEVTALVQ 373

Query: 227 ESDI 230
DI
Sbjct: 374 NKDI 377


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A3104  PF05272  31  0.007  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 31.2 bits (70), Expect = 0.007
Identities = 8/31 (25%), Positives = 14/31 (45%)

Query: 34 LTLLGPSGSGKTTSLMMLAGFETPTQGEITL 64
+ L G G GK+T + L G + + +
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDI 629


76  YpAngola_A3259  YpAngola_A3276  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias     Product
YpAngola_A3259  8       45       -15.735810  prepilin peptidase
YpAngola_A3261  6       46       -16.140451  hypothetical protein
YpAngola_A3263  5       42       -13.371379  general secretion pathway protein K
YpAngola_A3264  7       43       -13.419717  general secretion pathway protein J
YpAngola_A3265  6       43       -13.481931  general secretion pathway protein I
YpAngola_A3266  5       41       -12.651569  general secretion pathway protein G
YpAngola_A3267  4       41       -13.037172  general secretion pathway protein F
YpAngola_A3268  4       35       -10.928534  general secretion pathway protein E
YpAngola_A3269  3       30       -9.837444   general secretion pathway protein D
YpAngola_A3270  -1      18       -3.961528   general secretion pathway protein C
YpAngola_A3272  -1      17       -3.340801   putative carbonic anhydrase
YpAngola_A3273  -1      17       -2.518487   hypothetical protein
YpAngola_A3275  -1      15       -1.469956   hypothetical protein
YpAngola_A3276  -1      14       -0.869064   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A3259  PREPILNPTASE  236  3e-79  Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 236 bits (604), Expect = 3e-79
Identities = 116/275 (42%), Positives = 152/275 (55%), Gaps = 4/275 (1%)

Query: 27 VFFVSYLIFGAMVGSFLNVLIYRLPIMLANLSSR-SESHGEEIKMRSHLRNINLFQPGSF 85
++F +F M+GSFLNV+I+RLPIML S+ NL P S
Sbjct: 14 LYFSLVFLFSLMIGSFLNVVIHRLPIMLEREWQAEYRSYFNPDDEGVDEPPYNLMVPRSC 73

Query: 86 CHHCNESIPIKYNIPILGWIFLRGASRCCNKKISTRYLFIEVLAVIQTLLVLMIFKEDLL 145
C HCN I NIP+L W++LRG R C IS RY +E+L + ++ V M
Sbjct: 74 CPHCNHPITALENIPLLSWLWLRGRCRGCQAPISARYPLVELLTALLSVAVAMTLAPGWG 133

Query: 146 ICTSLVLIWSLTALAFIDFDTYLLPDCMTIPLLWLGLLINIDTVFAPLTSAVLGAVSGYL 205
+L+L W L AL FID D LLPD +T+PLLW GLL N+ F L AV+GA++GYL
Sbjct: 134 TLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGDAVIGAMAGYL 193

Query: 206 FLWLSYWLFKIVRGVDGMGYGDFKLMAALGAWFGVSAVPFLILFSSFFGLVAYAIFYFFD 265
LW YW FK++ G +GMGYGDFKL+AALGAW G A+P ++L SS G
Sbjct: 194 VLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGIGLILLR 253

Query: 266 KKDNGKEINYIAFGPYISLAGVLYLFLGSHVTNLF 300
K I FGPY+++AG + L G +T +
Sbjct: 254 NHHQSKP---IPFGPYLAIAGWIALLWGDSITRWY 285


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A3263  TYPE3IMPPROT  30  0.012  Type III secretion system inner membrane P protein ...
		>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature.

Length = 224

Score = 29.8 bits (67), Expect = 0.012
Identities = 14/65 (21%), Positives = 29/65 (44%), Gaps = 6/65 (9%)

Query: 4 NGIALLMVLCALFLMSTMVMASYNYWFDIYYLAKNSQQRQKEKWILLGAEEKFVSKLIKN 63
NG+ALL+ ++F+M ++ +Y Y+ D + K + + + LIK
Sbjct: 53 NGVALLL---SMFVMWPIMHDAYVYFEDEDVTFNDISSLSKH---VDEGLDGYRDYLIKY 106

Query: 64 TSEDR 68
+ +
Sbjct: 107 SDREL 111


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A3266  BCTERIALGSPG  207  2e-72  Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 207 bits (529), Expect = 2e-72
Identities = 87/136 (63%), Positives = 103/136 (75%)

Query: 2 ANKKTKGFTLLEIMVVIVILGLLASLTIPSLMSNKNRADQQKAVSDISALENALDMYRLD 61
A K +GFTLLEIMVVIVI+G+LASL +P+LM NK +AD+QKAVSDI ALENALDMY+LD
Sbjct: 3 ATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLD 62

Query: 62 NGDYPTEQQGIAALVTKPNVPPLPQRYPSDGYIRRLPTDPWGNSYQMNNPGKHGQIDIFS 121
N YPT QG+ +LV P +PPL Y +GYI+RLP DPWGN Y + NPG+HG D+ S
Sbjct: 63 NHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLLS 122

Query: 122 IGPDRLPETEDDIGNW 137
GPD TEDDI NW
Sbjct: 123 AGPDGEMGTEDDITNW 138


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A3267  BCTERIALGSPF  358  e-124  Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 358 bits (921), Expect = e-124
Identities = 172/406 (42%), Positives = 264/406 (65%), Gaps = 7/406 (1%)

Query: 1 MAVFKYVAISRSGTKITGDIDAENIRIARYLLYKKNMHVLSI-------KKRILLFNKYV 53
MA + Y A+ G K G +A++ R AR LL ++ + LS+ +K
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 54 VKKNSNKTDLVLITRQIATLVNASMPLDEVLDIVGKQNSKSKMIEIIQRIRVNIQEGHSF 113
K + +DL L+TRQ+ATLV ASMPL+E LD V KQ+ K + +++ +R + EGHS
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 114 ADALSPFPAVFSPLYKTMVTAGEVSGHLGLVLVRLADHIEQTQKIQRKIIQALIYPCVLV 173
ADA+ FP F LY MV AGE SGHL VL RLAD+ EQ Q+++ +I QA+IYPCVL
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 174 LISLSVIIILLTAVVPNIVEQFSFSETALPLSTKVLMILSYSIKENVIFIMAIGVSAVIF 233
+++++V+ ILL+ VVP +VEQF + ALPLST+VLM +S +++ +++ ++ +
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240

Query: 234 LNRLLKINKINVFFHRHYLSLPMLGNMFVRINTSRYLRTLTTLHSNGVTIVQAMSISNAV 293
+L+ K V FHR L LP++G + +NT+RY RTL+ L+++ V ++QAM IS V
Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 294 LTNVYIKNKLNISVKLVSEGCSLSSSLVDSGVFPPIILHMIISGERSGKLDHMLETVAGV 353
++N Y +++L+++ V EG SL +L + +FPP++ HMI SGERSG+LD MLE A
Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360

Query: 354 QEEELMNQISIVMSLLEPTIIIVMAAFISFIILSILQPILEINSLV 399
Q+ E +Q+++ + L EP +++ MAA + FI+L+ILQPIL++N+L+
Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
YpAngola_A3269  BCTERIALGSPD  543  0.0  Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 543 bits (1401), Expect = 0.0
Identities = 310/610 (50%), Positives = 432/610 (70%), Gaps = 15/610 (2%)

Query: 3 ISGKGIKSIHGMIFLFTLIMPLDIISANFSVSFKDVDIKEFINSVSKNINKTIIIDPTVQ 62
I I+S + +F ++ + FS SFK DI+EFIN+VSKN+NKT+IIDP+V+
Sbjct: 2 IIANVIRSFSLTLLIFAALLFRPAAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVR 61

Query: 63 GLISIRSYENLDKDTYYQLFLNVLDVYGYAAIEMPHNVLKVISSKRAKGVVAPLPKEGVT 122
G I++RSY+ L+++ YYQ FL+VLDVYG+A I M + VLKV+ SK AK P+ +
Sbjct: 62 GTITVRSYDMLNEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAP 121

Query: 123 FDGDELINRVIPLRYISAKKITPLLRQLNDNTESGSIINYDPSNILLITGRAAVVNRLHS 182
GDE++ RV+PL ++A+ + PLLRQLNDN GS+++Y+PSN+LL+TGRAAV+ RL +
Sbjct: 122 GIGDEVVTRVVPLTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLT 181

Query: 183 IVTDLDQAGDNEIELYKLNYAIAADVVKIVNEAINPINNLKQEVSIVGKVIADERTNSIL 242
IV +D AGD + L++A AADVVK+V E + S+V V+ADERTN++L
Sbjct: 182 IVERVDNAGDRSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVL 241

Query: 243 ISGDTYIRKKSILMIKKLDKRQSSDGNTKVVYMKYAQASKLLDVLNGISEGFHNEKKTKQ 302
+SG+ R++ I MIK+LD++Q++ GNTKV+Y+KYA+AS L++VL GIS +EK+ +
Sbjct: 242 VSGEPNSRQRIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAK 301

Query: 303 SNQWNQRPVAIKAYDQTNALVITADPDMMLALGEVIEKLDIRRAQVLVEAIIVETQNGEG 362
+ + IKA+ QTNAL++TA PD+M L VI +LDIRR QVLVEAII E Q+ +G
Sbjct: 302 PVAALDKNIIIKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADG 361

Query: 363 INLGVKWENKRSDDINF----IKNSDGLLNNNGWGIATTIT-----------GLTAGFYK 407
+NLG++W NK + F + S + N + T++ G+ AGFY+
Sbjct: 362 LNLGIQWANKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQ 421

Query: 408 GNWDVLLSALSTNTNNNILATPSIVTLDNMEAEFNVGQEVPVLISTQTTTTDKVYNSISR 467
GNW +LL+ALS++T N+ILATPSIVTLDNMEA FNVGQEVPVL +QTT+ D ++N++ R
Sbjct: 422 GNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGSQTTSGDNIFNTVER 481

Query: 468 QSIGVMLKVKPQINKGDSVLLEIRQEVSSIADSSTVNTHNLGSVFNKRVVNNAVLVKSGE 527
+++G+ LKVKPQIN+GDSVLLEI QEVSS+AD+++ + +LG+ FN R VNNAVLV SGE
Sbjct: 482 KTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGATFNTRTVNNAVLVGSGE 541

Query: 528 TVVVGGLLDKKSSTIVNKVPFLGDLPLIGWLFRQTKEKVEKSNLILFIKPTILRESDDYS 587
TVVVGGLLDK S +KVP LGD+P+IG LFR T +KV K NL+LFI+PT++R+ D+Y
Sbjct: 542 TVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYR 601

Query: 588 VVTSKEYNKY 597
+S +Y +
Sbjct: 602 QASSGQYTAF 611


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3270  BCTERIALGSPC  45     5e-09    Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C signature.

Length = 272

Score = 44.6 bits (105), Expect = 5e-09
Identities = 19/62 (30%), Positives = 31/62 (50%)

Query: 29 IKLVGVIEHSAPSESIAILEVKGKQTTHLTRENINYEDIVIVKIFTDRVIIKRNGKYYSL 88
+ L GV+ S SIAI+ +Q + E + + IV I DRV+++ G+Y L
Sbjct: 95 LSLTGVMAGDDDSRSIAIISKDNEQFSRGVNEEVPGYNAKIVSIRPDRVVLQYQGRYEVL 154

Query: 89 II 90
+
Sbjct: 155 GL 156


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3276  VACCYTOTOXIN  33     0.005    Helicobacter pylori vacuolating cytotoxin signature.
		>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature.

Length = 1291

Score = 33.1 bits (75), Expect = 0.005
Identities = 40/172 (23%), Positives = 63/172 (36%), Gaps = 26/172 (15%)

Query: 437 LYLRNQSAATPWNFWAQTLYAHSRQSSGTYTPGYQTNGYGINVGVDRRFND--ESLFG-- 492
LY P N WA + S S G + YG + GVD N E++ G
Sbjct: 1012 LYQFAPKYEKPTNVWANAIGGTSLNSGG------NASLYGTSAGVDAYLNGEVEAIVGGF 1065

Query: 493 VSLGYQNANIN---IHSYGNEKDVDSYELMAYTGWFDDRYFFNGNVNMGYNSNSSTRNIG 549
S GY + + ++S N + Y + F +++ F+ S+ S+ N
Sbjct: 1066 GSYGYSSFSNQANSLNSGANNTNFGVYSRI-----FANQHEFDFEAQGALGSDQSSLNFK 1120

Query: 550 ENTGYQGNTKATADYNSLQMGYQVKAGMTFDL----DVVKLQPSVAYNYQWL 597
N YN L +A +D + + L+PSV +Y L
Sbjct: 1121 SALLRDLNQS----YNYLAYSAATRASYGYDFAFFRNALVLKPSVGVSYNHL 1168


77  YpAngola_A3286  YpAngola_A3293  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A3286  5       14       -0.947161  phosphate regulon transcriptional regulatory
YpAngola_A3287  6       14       -1.184844  hypothetical protein
YpAngola_A3288  6       14       -0.827924  exonuclease subunit SbcD
YpAngola_A3289  6       16       -1.484703  nuclease SbcCD, C subunit
YpAngola_A3290  0       21       -2.873765  fructokinase
YpAngola_A3291  0       24       -4.880164  recombination associated protein
YpAngola_A3292  0       27       -5.410182  hypothetical protein
YpAngola_A3293  0       28       -5.753533  shikimate kinase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3286  HTHFIS        92     9e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 92.2 bits (229), Expect = 9e-24
Identities = 32/123 (26%), Positives = 59/123 (47%), Gaps = 2/123 (1%)

Query: 1 MMARRILVVEDEAPIREMVCFVLEQNGYQPLEAEDYDSAVARLSEPFPDLVLLDWMLPGG 60
M ILV +D+A IR ++ L + GY + + ++ DLV+ D ++P
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 SGIQFIKHMKREALTRDIPVMMLTARGEEEDRVRGLEVGADDYITKPFSPKELVARIKAV 120
+ + +K+ D+PV++++A+ ++ E GA DY+ KPF EL+ I
Sbjct: 61 NAFDLLPRIKKARP--DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 121 MRR 123
+
Sbjct: 119 LAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3289  RTXTOXIND     42     2e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 41.7 bits (98), Expect = 2e-05
Identities = 32/222 (14%), Positives = 74/222 (33%), Gaps = 8/222 (3%)

Query: 312 QYLAQLTPLT--QAVEQATAARQQQQLNQHEQETLIEQRIVPLDNLITQQQQTLSQLAGQ 369
L +LT L + ++ Q +L Q + L + + + Q +
Sbjct: 122 DVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSE 181

Query: 370 IQQLRAKEQQNSQQLALNEQKLLQTHQRLQQLADYANLHAHHQHWEKHLPLWHEQFRQLQ 429
+ LR Q QK + ++ A+ + A +E + +
Sbjct: 182 EEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFS 241

Query: 430 LQQQQSAQSEQQLHQQTTLLATLQQQATTLSAQEKQQQVALAEARAQASYLQQKL--LVL 487
+ A ++ + +Q + +Q +Q + + A+ + + Q +L
Sbjct: 242 SLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEIL 301

Query: 488 EQ----QQPSAQLRQQLNEFNEQRQICQQLAALSPLAQQIQA 525
++ L +L + E++Q A +S QQ++
Sbjct: 302 DKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKV 343



Score = 39.0 bits (91), Expect = 1e-04
Identities = 36/222 (16%), Positives = 72/222 (32%), Gaps = 30/222 (13%)

Query: 449 LATLQQQATTLSAQEKQQQVALAEARAQASYLQQKLLVLEQQ----QPSAQLRQQLNEFN 504
L L +A TL Q Q L + R Q +L L + +P Q +
Sbjct: 127 LTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLR 186

Query: 505 EQRQICQQLAALSPLAQQIQALYDKQQQQFTAQQQQLKQLEQQ---LTEKRQLYQQ-QKQ 560
I +Q + Q + DK++ + ++ + E + + +
Sbjct: 187 LTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHK 246

Query: 561 HLVDLEALLEREKQIVTLEAERAKLQPGDACPLCGAVEHPAITAYQAVKPSETAVRVAKL 620
+ A+LE+E + V E + ++ E+ + AK
Sbjct: 247 QAIAKHAVLEQENKYVEAVNELRVYK-------------------SQLEQIESEILSAKE 287

Query: 621 RL-QVEQLYTEGTELRTQVASMQQHQQRIEQELQDHRQQLAA 661
V QL+ E+ ++ + + EL + ++ A
Sbjct: 288 EYQLVTQLFKN--EILDKLRQTTDNIGLLTLELAKNEERQQA 327



Score = 38.3 bits (89), Expect = 2e-04
Identities = 28/206 (13%), Positives = 71/206 (34%), Gaps = 19/206 (9%)

Query: 649 EQELQDHRQQLAAYQQRWQTLAQPLSL----AFTLNEPDALALWLEQHEQQEQACQLKLV 704
+ Q Q Q R+Q L++ + L L + E+ + +
Sbjct: 136 TLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSL----- 190

Query: 705 EYERLTQQYQQAKDILTQLEQRQQEHQQQLALITERQKNAQQTYQQLQSQYQHQQEALIA 764
+ +Q+ ++ Q E + + + + R + + +S+ +L+
Sbjct: 191 ----IKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFS-SLLH 245

Query: 765 QQQVLNHTLTELSLSVPDADQQQDWLAQREEECQRWQQHQQEQQRLTIEQKTLETRIENE 824
+Q + H + E +A + + E+ + +E+ +L + E +
Sbjct: 246 KQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDK-- 303

Query: 825 RRHLQECIDQLSALSQQRQQAETLLQ 850
L++ D + L+ + + E Q
Sbjct: 304 ---LRQTTDNIGLLTLELAKNEERQQ 326



Score = 34.0 bits (78), Expect = 0.004
Identities = 26/180 (14%), Positives = 72/180 (40%), Gaps = 13/180 (7%)

Query: 835 LSALSQQRQQAETLLQQQIQQRQALFGEDIVAE-------VRQRLRLQQQQAELAQQNAE 887
+ L Q R Q + + + + ++ + +R +++Q + Q +
Sbjct: 145 QARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQ 204

Query: 888 K--ALQQAQSQLNRLSGELTGLEQQCQQYQQRATTTQAEL-QQALSTSEFADETALTAAL 944
K L + +++ + + E + + R + L +QA++ ++
Sbjct: 205 KELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEA 264

Query: 945 LSE--EERQHLQQLQQQLNERRQQAQIRLQQAR-EILDQHLQLCPQGVDKSSELTLLQQQ 1001
++E + L+Q++ ++ +++ Q+ Q + EILD+ Q + EL +++
Sbjct: 265 VNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEER 324


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3290  BCTERIALGSPF  28     0.045    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 28.3 bits (63), Expect = 0.045
Identities = 11/37 (29%), Positives = 21/37 (56%)

Query: 218 DVIAEQAMNNYERRFAKSLAHVINLFDPDVVVLGGGM 254
D + E+A +N +R F+ + + LF+P +VV +
Sbjct: 351 DSMLERAADNQDREFSSQMTLALGLFEPLLVVSMAAV 387


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3293  PF05272       28     0.015    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.5 bits (63), Expect = 0.015
Identities = 16/68 (23%), Positives = 27/68 (39%), Gaps = 12/68 (17%)

Query: 7 MVGARGAGKTTIGKALAQALGYRFVDTDL-------FMQQTSQMTVAEVVESEGWDGFRL 59
+ G G GK+T+ L F DT +Q + + E+ E FR
Sbjct: 601 LEGTGGIGKSTLINTLVGL--DFFSDTHFDIGTGKDSYEQIAGIVAYELSE---MTAFRR 655

Query: 60 RESMALQA 67
++ A++A
Sbjct: 656 ADAEAVKA 663


78  YpAngola_A3455  YpAngola_A3465  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A3455  -2      19       2.265647   major facilitator transporter
YpAngola_A3456  -3      15       2.354102   putative azaleucine resistance protein AzlC
YpAngola_A3457  -3      16       2.239164   hypothetical protein
YpAngola_A3458  -2      16       2.984015   transcriptional repressor MprA
YpAngola_A3460  -2      14       3.475161   multidrug resistance protein A
YpAngola_A3461  -2      12       2.869810   multidrug resistance protein B
YpAngola_A3462  0       13       2.161641   putative methyltransferase
YpAngola_A3463  0       16       1.480852   thioredoxin 2
YpAngola_A3464  -1      11       0.672479   DTW domain-containing protein
YpAngola_A3465  -2      11       0.491760   putative acyl-CoA synthetase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3455  TCRTETB       47     7e-08    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 47.2 bits (112), Expect = 7e-08
Identities = 35/163 (21%), Positives = 65/163 (39%), Gaps = 5/163 (3%)

Query: 35 LETIATNFNLSVNQAGFIVTAAQLGYAVGLMFLVPLGDMFE-RRGLIVGMTLLAAGGMLI 93
L IA +FN ++ TA L +++G L D +R L+ G+ + G +I
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGS-VI 95

Query: 94 TAMSQNLTMMIIGTALTGLFSVVA--QLLVPLAATLAAPEKRGKVVGIIMSGLLLGILLA 151
+ + ++I A L++ + A E RGK G+I S + +G +
Sbjct: 96 GFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVG 155

Query: 152 RTVAGALATLGGWRTIYWVASALMFIMALVLWRCLPRYKQHTG 194
+ G +A W + + + I L + L + + G
Sbjct: 156 PAIGGMIAHYIHWSYLLLIPMITI-ITVPFLMKLLKKEVRIKG 197


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3458  PF05272       28     0.017    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.5 bits (63), Expect = 0.017
Identities = 23/105 (21%), Positives = 37/105 (35%), Gaps = 12/105 (11%)

Query: 12 LNSRAKRQKDFPYQEILLTRLSMHMHSKLLENRNKMLKAQGINETLFMALITLDAQESRS 71
+ + P QE+ L + L R A+G + + T
Sbjct: 745 PSPEDEEIYFRPEQELRLVETGVQGRLWALLTREGAPAAEGAAQKGYSVNTTF------- 797

Query: 72 IQPSELSAALG-----SSRTNATRIADELEKKGWIERRESHNDRR 111
+ ++L ALG SS ++ D L + GW RE+ RR
Sbjct: 798 VTIADLVQALGADPGKSSPMLEGQVRDWLNENGWEYLRETSGQRR 842


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3460  RTXTOXIND     68     1e-14    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 68.3 bits (167), Expect = 1e-14
Identities = 63/410 (15%), Positives = 118/410 (28%), Gaps = 99/410 (24%)

Query: 25 LLLTAIFIMIGVAYLIYWFLVLRHHQ---ETDNAYISGNQVQIMSQVPGSVVSVHFENTD 81
L A FIM + ++ + SG +I V + + +
Sbjct: 57 PRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGE 116

Query: 82 FVKSGDVLVTLDPTD-------AEQAFEQAK----------------------------- 105
V+ GDVL+ L + + QA+
Sbjct: 117 SVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYF 176

Query: 106 ----------------TALANSVRQTHQLIINSKQYQ-------ANIALKKTELSQAQND 142
+ Q +Q +N + + A I + ++
Sbjct: 177 QNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSR 236

Query: 143 LKRRVVLGAAAVIGREELQHARDAVEAAQASLDMAVQQYNANQALVLNTPLE-------- 194
L L I + + + A L + Q ++ +L+ E
Sbjct: 237 LDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLF 296

Query: 195 -KQPAIEQAAAKMRDAWLT---------LQRTKVVSPISGYVSRRSVQ-VGAEISSGTPL 243
+ + LT Q + + +P+S V + V G +++ L
Sbjct: 297 KNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETL 356

Query: 244 MAVVPADQ-LWIDANFKETQLANMRIGQPATI-VTDF----YGDDVVYQGKVVGLDMGTG 297
M +VP D L + A + + + +GQ A I V F YG GKV +
Sbjct: 357 MVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGY---LVGKVKNI----- 408

Query: 298 SAFSLLPAQNATGNWIKVVQRLPVRIALDEKQLKEHPLRIGLSSLVKVDT 347
+ ++ G V+ + K PL G++ ++ T
Sbjct: 409 NLDAIE--DQRLGLVFNVIISIEENCLSTG--NKNIPLSSGMAVTAEIKT 454


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3461  TCRTETB       140    1e-38    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 140 bits (355), Expect = 1e-38
Identities = 94/404 (23%), Positives = 167/404 (41%), Gaps = 17/404 (4%)

Query: 18 LSLATFMQVLDSTIANVAIPTIAGDLGSSNSQGTWVITSFGVANAISIPVTGWLAKRVGE 77
L + +F VL+ + NV++P IA D + WV T+F + +I V G L+ ++G
Sbjct: 19 LCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGI 78

Query: 78 VRLFLWSTGLFVLASWLCGMSNS-LGMLIFFRVIQGLVAGPLIPLSQSLLLNNYPPAKRS 136
RL L+ + S + + +S +LI R IQG A L ++ P R
Sbjct: 79 KRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRG 138

Query: 137 MALALWSMTIVVAPIFGPILGGYISDNYHWGWIFFINIPIGLVVVLLAGSTLKGRETKTE 196
A L + + GP +GG I+ HW + + IP+ ++ + L +E + +
Sbjct: 139 KAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKEVRIK 196

Query: 197 IRPIDTIGLVLLVVGIGALQIMLDQGKELDWFNSTEIIVLTVVAVVAITFLIVWELTDDH 256
D G++L+ VGI + ML F ++ I +V+V++ +
Sbjct: 197 -GHFDIKGIILMSVGI--VFFML--------FTTSYSISFLIVSVLSFLIFVKHIRKVTD 245

Query: 257 PVIDLSLFKSRNFTIGCLCLSLAYMLYFGAIVLLPQLLQEVYGYTATWAGLASAPVGILP 316
P +D L K+ F IG LC + + G + ++P ++++V+ + G G +
Sbjct: 246 PFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMS 305

Query: 317 VLLS-PLIGRFAHRIDMRQLVTFSFIMYAVCFYWRAYTFEPGMDFGASAWPQFFQGFAIA 375
V++ + G R ++ +V F ++ E F G +
Sbjct: 306 VIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFT 365

Query: 376 CFFMPLTTITLSGLPPERMAAASSLSNFMRTLAGSIGTSITTTL 419
++TI S L + A SL NF L+ G +I L
Sbjct: 366 K--TVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGL 407


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3465  SACTRNSFRASE  37     1e-04    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 36.8 bits (85), Expect = 1e-04
Identities = 16/54 (29%), Positives = 22/54 (40%)

Query: 812 VLVRSDLKGLGLGRALLEKMIRYARSHGLSRLTAVTMPNNRGMIGLAQKLGFTI 865
+ V D + G+G ALL K I +A+ + L T N K F I
Sbjct: 95 IAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFII 148


79  YpAngola_A3652  YpAngola_A3657  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A3652  -1      9        -0.535586  glycerol-3-phosphate transporter ATP-binding
YpAngola_A3653  0       10       -0.884194  glycerol-3-phosphate transporter membrane
YpAngola_A3654  1       10       -0.790786  glycerol-3-phosphate transporter permease
YpAngola_A3655  1       13       -0.983843  glycerol-3-phosphate transporter periplasmic
YpAngola_A3656  3       19       -0.969659  hypothetical protein
YpAngola_A3657  3       19       -1.350354  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3652  PF05272       32     0.006    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 31.6 bits (71), Expect = 0.006
Identities = 15/56 (26%), Positives = 21/56 (37%), Gaps = 9/56 (16%)

Query: 33 IVMVGPSGCGKSTLLRMVAGLERTTTGDIYIGDQRVTDLEPKDRGIAMVFQNYVLY 88
+V+ G G GKSTL+ + GL+ + D KD V Y
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLD-------FFSDTHFDIGTGKDS--YEQIAGIVAY 645


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3655  MALTOSEBP     34     0.001    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 33.9 bits (77), Expect = 0.001
Identities = 41/173 (23%), Positives = 70/173 (40%), Gaps = 13/173 (7%)

Query: 136 GRLLSQPFNSSTPVLYYNKEAFKKAGLDPEQPPKTWQELAADTAKLRAAGSSCGYASGWQ 195
G+L++ P L YNK+ L P PPKTW+E+ A +L+A G S + +
Sbjct: 127 GKLIAYPIAVEALSLIYNKD------LLP-NPPKTWEEIPALDKELKAKGKSALMFNLQE 179

Query: 196 GWIQIENFSAWHGQPIASRNNGFDGTDAVLEFNKPLQVKHIQLLSDMNKKGDFTYFGRKD 255
+ +A G N +D D + + + L D+ K
Sbjct: 180 PYFTWPLIAADGGYAFKYENGKYDIKD--VGVDNAGAKAGLTFLVDLIKNKHMNADTDYS 237

Query: 256 ESTSKFYNGDCAITTASSGSLASIRHYAKFNFGVGMMPYDADAKNAPQNAIIG 308
+ + F G+ A+T + ++I +K N+GV ++P K P +G
Sbjct: 238 IAEAAFNKGETAMTINGPWAWSNIDT-SKVNYGVTVLP---TFKGQPSKPFVG 286


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3656  ECOLNEIPORIN  27     0.031    E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 27.5 bits (61), Expect = 0.031
Identities = 19/83 (22%), Positives = 33/83 (39%), Gaps = 10/83 (12%)

Query: 13 MKKTVIAIITMATLTSTAAYANTIEKDIRVEAEIISLMDVKRADDSNINKIKLTYDTVTN 72
MKK++IA+ A + A D+ + I + ++ R+ N + T
Sbjct: 1 MKKSLIALTLAALPVAAMA-------DVTLYGTIKAGVETSRSVAHNGAQ---AASVETG 50

Query: 73 DGTYSHSEAIKVKARKQLGDKLK 95
G I K ++ LG+ LK
Sbjct: 51 TGIVDLGSKIGFKGQEDLGNGLK 73


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3657  PF00577       72     4e-15    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 72.2 bits (177), Expect = 4e-15
Identities = 40/265 (15%), Positives = 80/265 (30%), Gaps = 27/265 (10%)

Query: 442 SLARYQSPYVS----RYAPDSGST---SGSYTRRIGPTQLSYQFNQYRNNRQHRIQSGWD 494
L R + Y+S Y S + ++ +N Q G D
Sbjct: 536 QLGRTSTLYLSGSHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQK----GRD 591

Query: 495 WQLPQFNLALSLGLQNGGQWNSHNNYGVFLNTTLSFGQSNASINTAYTQQQLNTSASYQK 554
L +++ W ++ + + + S+ S + +
Sbjct: 592 ---QMLALNVNIPF---SHWLRSDSKSQWRHASASYSMS--HDLNGRMTNLAGVYGTLLE 643

Query: 555 EFIDNYGASTLGVSGSASGKLNSVGGFAKRSGSRGDISGRVGIDNQITNGGISYNGMLAL 614
+ +Y T G ++ G G+ + + I +G +
Sbjct: 644 DNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLA 703

Query: 615 SSQGVALGRSSYSGAALLIKAPALGGTPYSFHVEDSPI--TGGGTYAIPVPRYQDRFFVR 672
+ GV LG+ + +L+KAP VE+ T YA+ +P + R
Sbjct: 704 HANGVTLGQPL-NDTVVLVKAPGAKDAK----VENQTGVRTDWRGYAV-LPYATEYRENR 757

Query: 673 THTDRSDMDMNIQLPVNIVRAHPGQ 697
D + + N+ L + P +
Sbjct: 758 VALDTNTLADNVDLDNAVANVVPTR 782


80  YpAngola_A3890  YpAngola_A3897  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A3890  2       19       -2.705402  TonB domain-containing protein
YpAngola_A3891  1       15       -2.022042  HlyD family hemolysin secretion protein
YpAngola_A3892  1       13       -1.917668  ABC transporter
YpAngola_A3893  1       14       -1.927565  hemophore HasA
YpAngola_A3894  1       12       -0.921912  TonB-dependent heme receptor HasR
YpAngola_A3895  -2      11       2.021092   hypothetical protein
YpAngola_A3896  -2      10       2.214740   argininosuccinate lyase
YpAngola_A3897  -2      8        1.772459   acetylglutamate kinase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3890  PF03544       66     7e-15    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 65.8 bits (160), Expect = 7e-15
Identities = 31/195 (15%), Positives = 67/195 (34%), Gaps = 8/195 (4%)

Query: 70 ITQNIIEPAVEQRVNQPDDIVDLPTLPEQPEGQREITRKEPIKVKRPAENRATSRKPVNK 129
I+ ++ PA + + PE KE V + KP
Sbjct: 50 ISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPK-PKPKPKPKPV 108

Query: 130 ETQESDSKQSSPAAAASAMLSGTSQQVAAAVNSDSSHRQQAQVSWKSRLQGHLMGFKRYP 189
+ E + P + A + ++ ++ + S S + +YP
Sbjct: 109 KKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQPQYP 168

Query: 190 SSARKQQQQGTAMIRFVVDKNGYVSSVQLSHSSGTSALDREALAIIKRAQPLPKPPAELL 249
+ A+ + +G ++F V +G V +VQ+ + + +RE ++R + P P
Sbjct: 169 ARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGS-- 226

Query: 250 SQGQITLSLPVDFNL 264
+ + + F +
Sbjct: 227 -----GIVVNILFKI 236


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3891  RTXTOXIND     348    e-118    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 348 bits (895), Expect = e-118
Identities = 91/424 (21%), Positives = 172/424 (40%), Gaps = 8/424 (1%)

Query: 25 RYLNIGGGLVVIGFIGFLLWAGLAPLDKGVAVTGLLVVAENRKVIQPLQGGRIQQLHVTE 84
R + ++ + + + L ++ G L + K I+P++ ++++ V E
Sbjct: 55 RRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKE 114

Query: 85 GDEIVSGQLLVTLDDTAIRNQRDNLQHQYLSALAQEARLTAEQNDLDVITFPQALLEH-- 142
G+ + G +L+ L Q L A ++ R +++ P+ L
Sbjct: 115 GESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEP 174

Query: 143 ATQPAVERNIILQQQLLHHRRQAHLSEIARLSTQLTRHQARLDGLQAMRSNHQRQSNLFQ 202
Q E ++ L+ + ++ + L + +A + A + ++ S + +
Sbjct: 175 YFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEK 234

Query: 203 QQLDSVQLLAKDGHIAKNKLLEMESQSTSLQARVEQSTSDIAEAHKLIDETEQHVLQRRE 262
+LD L IAK+ +LE E++ + S + + I ++ +
Sbjct: 235 SRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQ 294

Query: 263 QYQSENSEQLAKAQQNTQELVQRLNIAEYELSHTRIFAPVSGSVIALAQHTVGGVVSSGQ 322
+++E ++L + N L L E + I APVS V L HT GGVV++ +
Sbjct: 295 LFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE 354

Query: 323 ALMEIVPSGQPLFVEAQLPVELIDKVAVGLPVDLNFSAFNQSNTPRLQGSVWRIGADRIQ 382
LM IVP L V A + + I + VG + AF + L G V I D I+
Sbjct: 355 TLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIE 414

Query: 383 PPPTSPPYYPLTVAIDL-----DPTELAIRPGMAVDVFIRTGERSLLSYLFKPFTDRLHL 437
+ + ++I+ + + GMAV I+TG RS++SYL P + +
Sbjct: 415 DQRLGLVFNVI-ISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISYLLSPLEESVTE 473

Query: 438 ALAE 441
+L E
Sbjct: 474 SLRE 477


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3893  PF06438       232    2e-80    Heme acquisition protein HasAp
		>PF06438#Heme acquisition protein HasAp

Length = 205

Score = 232 bits (594), Expect = 2e-80
Identities = 66/214 (30%), Positives = 104/214 (48%), Gaps = 18/214 (8%)

Query: 1 MSTTIQYNSNYADYSISSYLREWANNFGDIDQAPAETKDRGSFSG-SSTLFSGTQYAIGS 59
MS +I Y++ Y+ ++++ YL +W+ FGD++ P + D + G + F G+QYA+ S
Sbjct: 1 MSISISYSTTYSGWTVADYLADWSAYFGDVNHRPGQVVDGSNTGGFNPGPFDGSQYALKS 60

Query: 60 SHSNPEGMIAEGDLKYSFM--PQHTFHGQIDTLQFGKDLATNAGGPSAGKHLEKIDITFN 117
+ S+ IA GDL Y+ P HT G++D++ G L G S G L+ +++F+
Sbjct: 61 TASDA-AFIAGGDLHYTLFSNPSHTLWGKLDSIALGDTL--TGGASSGGYALDSQEVSFS 117

Query: 118 ELDLSGEFDSGKSMTENHQGDMHKSVRGLMKGNPDPMLEVMKAKGINVDTAFKDLSIASQ 177
L L G+ G +HK V GLM G+ + + A VD + S Q
Sbjct: 118 NLGLDSPIAQGRD------GTVHKVVYGLMSGDSSALQGQIDALLKAVDPSLSINSTFDQ 171

Query: 178 YPDSGYMSDAPM-----VDTVGVVDC-HDMLLAA 205
+G P V VGV + HD+ LAA
Sbjct: 172 LAAAGVAHATPAAAAAEVGVVGVQELPHDLALAA 205


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3897  CARBMTKINASE  42     1e-06    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 42.1 bits (99), Expect = 1e-06
Identities = 36/138 (26%), Positives = 58/138 (42%), Gaps = 18/138 (13%)

Query: 133 VQTLLAAGYMPIISSIG----ITVEGQLMNVNA----DQAATALAATLGAD-LILLSDVS 183
++ L+ G + I S G I +G++ V A D A LA + AD ++L+DV+
Sbjct: 179 IKKLVERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVN 238

Query: 184 GILDGKG----QRIAEMTAQKAEQLIAQGIITDG-MVVKVNAALDAARSLGRPVDIASWR 238
G G Q + E+ ++ + +G G M KV AA+ G IA
Sbjct: 239 GAALYYGTEKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGERAIIAHL- 297

Query: 239 HSEQLPALFNGVPIGTRI 256
E+ G GT++
Sbjct: 298 --EKAVEALEG-KTGTQV 312


81  YpAngola_A3912  YpAngola_A3937  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A3912  -1      20       0.952203   maltose/maltodextrin transporter ATP-binding
YpAngola_A3913  0       21       -0.046767  hypothetical protein
YpAngola_A3914  0       21       0.486182   maltose ABC transporter periplasmic protein
YpAngola_A3915  0       15       -1.590071  maltose transporter membrane protein
YpAngola_A3916  0       14       -0.404189  maltose transporter permease
YpAngola_A3917  1       13       -0.417725  phosphate-starvation-inducible protein PsiE
YpAngola_A3918  0       11       -0.392082  glucose-6-phosphate isomerase
YpAngola_A3919  -1      11       -0.380384  aspartate kinase III
YpAngola_A3922  -1      10       -0.645222  hemagglutination activity domain-containing
YpAngola_A3923  -1      15       1.473140   B12-dependent methionine synthase
YpAngola_A3924  -1      18       -0.074312  IclR family transcriptional regulator
YpAngola_A3925  -1      18       0.138529   bifunctional isocitrate dehydrogenase
YpAngola_A3926  -1      18       1.441208   secretion system apparatus protein SsaU
YpAngola_A3927  -1      20       2.652788   type III secretion apparatus protein
YpAngola_A3928  1       22       4.787789   HrpO family type III secretion protein
YpAngola_A3929  1       19       5.453500   type III secretion system protein
YpAngola_A3930  1       21       6.767851   type III secretion system protein
YpAngola_A3931  1       23       5.919071   hypothetical protein
YpAngola_A3932  1       23       5.747336   type III secretion system apparatus protein
YpAngola_A3933  1       22       5.587902   type III secretion system ATPase
YpAngola_A3935  1       21       1.384886   HrpE/YscL family type III secretion apparatus
YpAngola_A3936  0       23       -0.649321  hypothetical protein
YpAngola_A3937  -2      26       -2.276113  YscJ/HrcJ family type III secretion apparatus
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3912  PF05272       34     0.001    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 33.5 bits (76), Expect = 0.001
Identities = 13/32 (40%), Positives = 17/32 (53%)

Query: 32 VVFVGPSGCGKSTLLRMIAGLEDITSGELLIG 63
VV G G GKSTL+ + GL+ + IG
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIG 630


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3914  MALTOSEBP     679    0.0      Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 679 bits (1752), Expect = 0.0
Identities = 331/394 (84%), Positives = 367/394 (93%)

Query: 10 IGKTARVLALSALTTLVLSSSAFAKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVT 69
I AR+LALSALTT++ S+SA AKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVT
Sbjct: 3 IKTGARILALSALTTMMFSASALAKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVT 62

Query: 70 IEHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAELTPSKAFQEKLFPFTWDA 129
+EHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAE+TP KAFQ+KL+PFTWDA
Sbjct: 63 VEHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTWDA 122

Query: 130 VRFNGKLIGYPVAVEALSLIYNKDLVKEAPKTWEEIPALDKTLRANGKSAIMWNLQEPYF 189
VR+NGKLI YP+AVEALSLIYNKDL+ PKTWEEIPALDK L+A GKSA+M+NLQEPYF
Sbjct: 123 VRYNGKLIAYPIAVEALSLIYNKDLLPNPPKTWEEIPALDKELKAKGKSALMFNLQEPYF 182

Query: 190 TWPVIAADGGYAFKFENGVYDAKNVGVNNAGAQAGLQFIVDLVKNKHINADTDYSIAEAA 249
TWP+IAADGGYAFK+ENG YD K+VGV+NAGA+AGL F+VDL+KNKH+NADTDYSIAEAA
Sbjct: 183 TWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDYSIAEAA 242

Query: 250 FNKGETAMTINGPWAWSNIDKSKINYGVTLLPTFHGQPSKPFVGVLTAGINAASPNKELA 309
FNKGETAMTINGPWAWSNID SK+NYGVT+LPTF GQPSKPFVGVL+AGINAASPNKELA
Sbjct: 243 FNKGETAMTINGPWAWSNIDTSKVNYGVTVLPTFKGQPSKPFVGVLSAGINAASPNKELA 302

Query: 310 TEFLENYLITDQGLAEVNKDKPLGAVALKSFQEQLAKDPRIAATMDNATNGEIMPNIPQM 369
EFLENYL+TD+GL VNKDKPLGAVALKS++E+LAKDPRIAATM+NA GEIMPNIPQM
Sbjct: 303 KEFLENYLLTDEGLEAVNKDKPLGAVALKSYEEELAKDPRIAATMENAQKGEIMPNIPQM 362

Query: 370 AAFWYATRSAVLNAITGRQTVEAALNDAATRITK 403
+AFWYA R+AV+NA +GRQTV+ AL DA TRITK
Sbjct: 363 SAFWYAVRTAVINAASGRQTVDEALKDAQTRITK 396


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3918  BCTERIALGSPD  33     0.005    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 32.6 bits (74), Expect = 0.005
Identities = 15/66 (22%), Positives = 30/66 (45%), Gaps = 8/66 (12%)

Query: 69 LAKETDLAGAIKSMFSGEKINR-------TEDRAVLHIALRNRSNTPIVVDGKDVMPEVN 121
AK +DL + + S + + D+ ++ I ++N IV DVM ++
Sbjct: 276 YAKASDLVEVLTGISSTMQSEKQAAKPVAALDKNII-IKAHGQTNALIVTAAPDVMNDLE 334

Query: 122 AVLAKM 127
V+A++
Sbjct: 335 RVIAQL 340


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3919  CARBMTKINASE  29     0.032    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 29.4 bits (66), Expect = 0.032
Identities = 18/89 (20%), Positives = 29/89 (32%), Gaps = 5/89 (5%)

Query: 214 DYTAALLGEALNVSRIDIWTDVPGIYTTDPRVVPAAKRIDKIAFEEAAEMATFGAKILHP 273
D L E +N I TDV G + + ++ EE + G
Sbjct: 216 DLAGEKLAEEVNADIFMILTDVNGAALYYGT--EKEQWLREVKVEELRKYYEEGH--FKA 271

Query: 274 ATLLPAVRSDIPMFVGSSKDPAAGGTLVC 302
++ P V + I F+ + A L
Sbjct: 272 GSMGPKVLAAI-RFIEWGGERAIIAHLEK 299


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3922  PF05860       79     2e-19    haemagglutination activity domain.
		>PF05860#haemagglutination activity domain.

Length = 117

Score = 79.1 bits (195), Expect = 2e-19
Identities = 24/124 (19%), Positives = 42/124 (33%), Gaps = 21/124 (16%)

Query: 45 VSSVNGTSVINIVQPSASGLSHNQFQDFNVGEKGAVLNNATSAGNSILAGQLAANQNLNG 104
+++ T +I + S L H+ FQ+F+V G N N
Sbjct: 15 ITTEGNTRIIERGTQAGSNLFHS-FQEFSVPTSGTAFFN-------------------NP 54

Query: 105 QAASIILNEVISRNPSLLLGQQEIFGMTADYILANPNGITCNGCGFMNTNRESLVVGNPL 164
I++ V + S + G TA+ L NPNGI ++ +
Sbjct: 55 TNIQNIISRVTGGSVSNIDGLIRANA-TANLFLINPNGIIFGQNARLDIGGSFVGSTANR 113

Query: 165 IEQG 168
++
Sbjct: 114 LKFA 117


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3923  BCTERIALGSPD  31     0.041    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 30.7 bits (69), Expect = 0.041
Identities = 18/83 (21%), Positives = 33/83 (39%), Gaps = 9/83 (10%)

Query: 347 AGLEPLTIDANTLFVNVGERTN---VTGSARFKRLIKEEKYGEALDVARQQVESGAQIID 403
+P+ + + +TN VT + + E+ LD+ R QV A I +
Sbjct: 298 QAAKPVAALDKNIIIKAHGQTNALIVTAAP--DVMNDLERVIAQLDIRRPQVLVEAIIAE 355

Query: 404 INMDEGMLDAEAAMVRFLNLIAG 426
+ D L+ +++ N AG
Sbjct: 356 VQ-DADGLNLG---IQWANKNAG 374


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3926  TYPE3IMSPROT  347    e-121    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 347 bits (892), Expect = e-121
Identities = 125/351 (35%), Positives = 199/351 (56%), Gaps = 2/351 (0%)

Query: 1 MSTEKNEKPTPKRLKEAKEKGQVVKSVEITSGVQLVALVIYFLLTGYSLVEQAKALIRSS 60
MS EK E+PTPK++++A++KGQV KS E+ S +VAL + E L+
Sbjct: 1 MSGEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIP 60

Query: 61 IIQLQQPLTLALARIGAECMTVLMHIVVVLGGALIVVTIIAGIAQVGPLLATKAVSFKGE 120
Q P + AL+ + + ++ L ++ I + + Q G L++ +A+ +
Sbjct: 61 AEQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIK 120

Query: 121 RINPIQNAKQLFSLRSVFELMKSLLKVGVLTLIFGYLLMQYAPSFGYLTHCGSRCALPVF 180
+INPI+ AK++FS++S+ E +KS+LKV +L+++ ++ + L CG C P+
Sbjct: 121 KINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLL 180

Query: 181 STLMGWLLGSLIACYLVFSLMDYAFQRYTIMKQLKMSHDEVKREYKDSNGDPHIKQKRRQ 240
++ L+ ++V S+ DYAF+ Y +K+LKMS DE+KREYK+ G P IK KRRQ
Sbjct: 181 GQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQ 240

Query: 241 LQHEVQSGSFATNVRRSTAVVRNPTHFAVCLIYHPEETPLPIVIEKGHDEQAALIVSLAE 300
E+QS + NV+RS+ VV NPTH A+ ++Y ETPLP+V K D Q + +AE
Sbjct: 241 FHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAE 300

Query: 301 QSGIPVVENIALARALHRDVACGDTIPEQFFEPVAALLRM--ALELDYQPS 349
+ G+P+++ I LARAL+ D IP + E A +LR ++ Q S
Sbjct: 301 EEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNIEKQHS 351


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3927  TYPE3IMRPROT  141    5e-43    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein

family signature.
Length = 261

Score = 141 bits (356), Expect = 5e-43
Identities = 52/230 (22%), Positives = 105/230 (45%), Gaps = 4/230 (1%)

Query: 5 LPGLTALALAMMRPYGILLILPLFTARSLGSSLLRNGLIVAIALPVTPLFLSAPIITNSS 64
L L ++R ++ P+ + RS+ + + GL + I + P + + S
Sbjct: 10 LSWLNLYFWPLLRVLALISTAPILSERSVPKRV-KLGLAMMITFAIAPSLPANDVPVFS- 67

Query: 65 PVTWIGVLCTELLIGVVMGFVAALPFWAMNMAGFLIDTLRGATMSTLFNPGMGVESSLFG 124
+ + ++LIG+ +GF F A+ AG +I G + +T +P + +
Sbjct: 68 -FFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLA 126

Query: 125 VLFTQILTVLFLISGGFNQVLAALYGSYDSLPIGQGIQPAADLLLFLQTEWQMMFELCLC 184
+ + +LFL G +++ L ++ +LPIG + L + +F L
Sbjct: 127 RIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSL-IFLNGLM 185

Query: 185 FALPALLVMVLADLSLGLINRSARQLNVFFLAMPIKSALALFLLLISLPY 234
ALP + +++ +L+LGL+NR A QL++F + P+ + + L+ +P
Sbjct: 186 LALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPL 235


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3928  TYPE3IMQPROT  69     3e-19    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein

family signature.
Length = 86

Score = 68.6 bits (168), Expect = 3e-19
Identities = 32/79 (40%), Positives = 47/79 (59%)

Query: 10 IVHLATELLWLVLLLSLPVVVVASTVGLVISLVQALTQIQDQTLQFLIKLLAVSATLLMT 69
+V + L+LVL+LS +VA+ +GL++ L Q +TQ+Q+QTL F IKLL V L +
Sbjct: 4 LVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLL 63

Query: 70 YHWMGATLLNYTQQSFLQI 88
W G LL+Y +Q
Sbjct: 64 SGWYGEVLLSYGRQVIFLA 82


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3929  TYPE3IMPPROT  224    1e-76    Type III secretion system inner membrane P protein ...
		>TYPE3IMPPROT#Type III secretion system inner membrane P protein

family signature.
Length = 224

Score = 224 bits (572), Expect = 1e-76
Identities = 86/220 (39%), Positives = 143/220 (65%), Gaps = 7/220 (3%)

Query: 4 LNSSYQLIALLFMLSVLPLLVVMGTAFLKLSVVFSLLRNALGVQQVPPNIAIYGLALVLT 63
+ + LIALL ++LP ++ GT F+K S+VF ++RNALG+QQ+P N+ + G+AL+L+
Sbjct: 1 MGNDISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLS 60

Query: 64 IFIMAPVGLDVQARLQNEELSNDIGALAHQIDQNALVPYRDFLQRNTDIEQVTFFNDIVQ 123
+F+M P+ D ++E+++ + + + L YRD+L + +D E V FF +
Sbjct: 61 MFVMWPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQL 120

Query: 124 NKWPE-------RYRDSVKPDSLLILMPAFTLSQLNEAFKIGLLLFLPFVAIDLIVSNIL 176
+ R +D ++ S+ L+PA+ LS++ AFKIG L+LPFV +DL+VS++L
Sbjct: 121 KRQYGEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVL 180

Query: 177 LAMGMMMVSPMTLSLPFKLLVFVLVDGWSLVLGQLVGSYL 216
LA+GMMM+SP+T+S P KL++FV +DGW+L+ L+ Y+
Sbjct: 181 LALGMMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQYM 220


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3930  TYPE3OMOPROT  52     1e-09    Type III secretion system outer membrane O protein ...
		>TYPE3OMOPROT#Type III secretion system outer membrane O protein

family signature.
Length = 303

Score = 51.5 bits (123), Expect = 1e-09
Identities = 22/81 (27%), Positives = 37/81 (45%)

Query: 209 PPLAAVQLEDLPQTLVMEIGRLTLPLGEIKQLAVGQTLACQTHCYGEVNICLNGQSVGRG 268
L LP L + R + L E++ + Q L+ T+ V I NG +G G
Sbjct: 220 TAETLPGLNQLPVKLEFVLYRKNVTLAELEAMGQQQLLSLPTNAELNVEIMANGVLLGNG 279

Query: 269 SLLRCDEKLVVRIAQWGLQNG 289
L++ ++ L V I +W ++G
Sbjct: 280 ELVQMNDTLGVEIHEWLSESG 300


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits       Score  E-value  Comments
YpAngola_A3932  RTXTOXIND  29     0.005    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 29.0 bits (65), Expect = 0.005
Identities = 17/114 (14%), Positives = 40/114 (35%), Gaps = 7/114 (6%)

Query: 5 QQRTLQRLLALRQRQERRLRQQLGQLRREQQQLENGRRRHQQLCQQLQQLAQWCGMLTPR 64
++ + Q Q+ + L + R E+ + R++ L + + L
Sbjct: 187 LTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSL--- 243

Query: 65 EADEQKVLRQAVYQAERQAKKQLNAWVAQGRQQVSAIERQ--QARLRRNQREQE 116
+Q + + AV + E + + + + Q+ IE + A+ Q
Sbjct: 244 -LHKQAIAKHAVLEQENKY-VEAVNELRVYKSQLEQIESEILSAKEEYQLVTQL 295


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits          Score  E-value  Comments
YpAngola_A3937  FLGMRINGFLIF  63     8e-14    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 63.4 bits (154), Expect = 8e-14
Identities = 44/188 (23%), Positives = 70/188 (37%), Gaps = 7/188 (3%)

Query: 7 MLAIVLMTLSLSGCDME-LYSGLSEGEANQMLALLMLHQINAEKQIEKSGMVGLTVDKRQ 65
+ +V M L D L+S LS+ + ++A L I + V +
Sbjct: 35 VAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNIPYR--FANGSGA-IEVPADK 91

Query: 66 FINAVELLRQNGFPRQRFITVDELFPANQLVTSPTQEQAKMVFLKEQQLENMLSHMDGVI 125
L Q G P+ + EL + S EQ E +L + + V
Sbjct: 92 VHELRLRLAQQGLPKGGAVGF-ELLDQEKFGISQFSEQVNYQRALEGELARTIETLGPVK 150

Query: 126 HADVTVAMPM-SVDGKNPLPHTASVFIKYSPEVNLQSYQ-SQIKGLVRDAVPGIDYAKIS 183
A V +AMP S+ + +ASV + P L Q S + LV AV G+ ++
Sbjct: 151 SARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQISAVVHLVSSAVAGLPPGNVT 210

Query: 184 VVMQPANY 191
+V Q +
Sbjct: 211 LVDQSGHL 218


82  YpAngola_A3944  YpAngola_A3951  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A3944  -1      13       -2.332203  two-component sensor/regulator
YpAngola_A3945  -1      14       -2.947714  putative DNA-binding response regulator EsrB
YpAngola_A3946  -2      12       -1.900486  glutamate/aspartate:proton symporter
YpAngola_A3947  -2      14       -0.796723  acetyl-CoA synthetase
YpAngola_A3948  -2      16       -0.416102  hypothetical protein
YpAngola_A3949  -2      16       -0.007804  acetate permease
YpAngola_A3950  -1      15       0.323470   hypothetical protein
YpAngola_A3951  0       16       1.523338   insertion sequence transposase
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
YpAngola_A3944  HTHFIS  79     3e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 78.7 bits (194), Expect = 3e-17
Identities = 36/173 (20%), Positives = 64/173 (36%), Gaps = 14/173 (8%)

Query: 647 HILLVDDSETNRDITGMMLQQLGHQVTRADSGTTALAIGRQHRFDLVLMDIRMPVLDGLA 706
IL+ DD R + L + G+ V + T DLV+ D+ MP +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 707 TTARWRHDPANIDSHCMITALSANASPDEQIKTSQAGMNHYLSKPVTLGQLAEMLDLTAQ 766
R + + +SA + IK S+ G YL KP L E++ + +
Sbjct: 65 LLPRIK----KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF---DLTELIGIIGR 117

Query: 767 FQLERGVDLSPQLSEPQPLLDL-ADSALSLKLYQSLQVLIQQAKDAIENLPVL 818
E S + Q + L SA ++Y+ ++ + +L ++
Sbjct: 118 ALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYR----VLARLMQT--DLTLM 164


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
YpAngola_A3945  HTHFIS  59     2e-12    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 59.1 bits (143), Expect = 2e-12
Identities = 26/127 (20%), Positives = 53/127 (41%), Gaps = 3/127 (2%)

Query: 3 TKLLIVDDHELIIHGIKNMLAAYPRYLIVGQADNGLEVYNLCRQTEPDMVILDLGLPGMD 62
+L+ DD I + L+ V N ++ + D+V+ D+ +P +
Sbjct: 4 ATILVADDDAAIRTVLNQALS--RAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 63 GLDVIIQLLRRWPALKILTLTARNEEHYASRTFNSGALGYVLKKSPQQILMAAIQTVAIG 122
D++ ++ + P L +L ++A+N A + GA Y+ K L+ I A+
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR-ALA 120

Query: 123 KRYIDPA 129
+ P+
Sbjct: 121 EPKRRPS 127


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits        Score  E-value  Comments
YpAngola_A3946  V8PROTEASE  31     0.008    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 31.1 bits (70), Expect = 0.008
Identities = 7/43 (16%), Positives = 18/43 (41%)

Query: 293 AYGAPKAITSFVVPTGYSFNLDGSTLYQSIAAIFIAQLYGIEL 335
+ A + + TGY + +T+++S I + ++
Sbjct: 186 SNNAETQVNQNITVTGYPGDKPVATMWESKGKITYLKGEAMQY 228


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
YpAngola_A3951  HTHTETR  28     0.047    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.1 bits (62), Expect = 0.047
Identities = 12/106 (11%), Positives = 31/106 (29%), Gaps = 14/106 (13%)

Query: 16 QGMSSRAIARELGISRNTVKRYLQAKSEPPKYTPRPAVASLLDEYRDYIRQRIAD----- 70
S IA+ G++R + + + KS+ + + + I + +
Sbjct: 30 SSTSLGEIAKAAGVTRGAIYWHFKDKSD--------LFSEIWELSESNIGELELEYQAKF 81

Query: 71 -AHPYKIPATVIAREIRDQGYRGGMTILRAFIRSLSVPQEQEPAVR 115
P + ++ + +L I + V+
Sbjct: 82 PGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127


83  YpAngola_A4125  YpAngola_A4132  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag        DNBias  CDNBias  %GCBias    Product
YpAngola_A4125  -1      16       -1.504385  insertion sequence transposase
YpAngola_A4126  0       15       -2.889552  transposase/IS protein
YpAngola_A4128  0       15       -3.006549  hypothetical protein
YpAngola_A4130  -1      12       -1.452060  *regulatory protein UhpC
YpAngola_A4131  1       13       -1.144917  sensory histidine kinase UhpB
YpAngola_A4132  -1      13       -0.756144  two-component system response regulator
ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
YpAngola_A4125  HTHTETR  28     0.047    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.1 bits (62), Expect = 0.047
Identities = 12/106 (11%), Positives = 31/106 (29%), Gaps = 14/106 (13%)

Query: 16 QGMSSRAIARELGISRNTVKRYLQAKSEPPKYTPRPAVASLLDEYRDYIRQRIAD----- 70
S IA+ G++R + + + KS+ + + + I + +
Sbjct: 30 SSTSLGEIAKAAGVTRGAIYWHFKDKSD--------LFSEIWELSESNIGELELEYQAKF 81

Query: 71 -AHPYKIPATVIAREIRDQGYRGGMTILRAFIRSLSVPQEQEPAVR 115
P + ++ + +L I + V+
Sbjct: 82 PGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
YpAngola_A4128  PF05860  59     4e-13    haemagglutination activity domain.
		>PF05860#haemagglutination activity domain.

Length = 117

Score = 59.0 bits (143), Expect = 4e-13
Identities = 20/128 (15%), Positives = 42/128 (32%), Gaps = 18/128 (14%)

Query: 53 TPPSTCRALTSYCIGMTETVVNIQAPDENGLSHNKYSKFDVVANGLFDVTTLNNRLAQEV 112
TP +T ++ ++ + L H+ + +F V +G
Sbjct: 4 TPDTTLPINSNITTEGNTRIIERGTQAGSNLFHS-FQEFSVPTSGTA------------- 49

Query: 113 NGNSFLQDKSATIILNEVNSSHASLLDGNLRVDGGNAHIIIANPAGINCRGCSFTNASHV 172
F + I++ V S +DG +R A++ + NP GI + +
Sbjct: 50 ---FFNNPTNIQNIISRVTGGSVSNIDGLIRA-NATANLFLINPNGIIFGQNARLDIGGS 105

Query: 173 TLTTGTPS 180
+ +
Sbjct: 106 FVGSTANR 113


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
YpAngola_A4130  TCRTETB  45     4e-07    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 44.9 bits (106), Expect = 4e-07
Identities = 33/157 (21%), Positives = 68/157 (43%), Gaps = 7/157 (4%)

Query: 59 FNFIMPAMLTDLGLSMSDVGILGTLFYITYGCSKFVSGMISDRSNPRYFMGIGLVMTGII 118
N +P + D + + T F +T+ V G +SD+ + + G+++
Sbjct: 33 LNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFG 92

Query: 119 NILFGMSSSLLVLGALWILNAFFQGWG---WPPCSKILTSWY-SRSERGGWWAIWNTSHN 174
+++ + S +L I+ F QG G +P ++ + Y + RG + + +
Sbjct: 93 SVIGFVGHSFF---SLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVA 149

Query: 175 FGGALIPLLVGVITLHFSWRYGMIIPGIIGVVIGLLM 211
G + P + G+I + W Y ++IP I + + LM
Sbjct: 150 MGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLM 186


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits     Score  E-value  Comments
YpAngola_A4131  PF06580  37     1e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 36.8 bits (85), Expect = 1e-04
Identities = 17/85 (20%), Positives = 33/85 (38%), Gaps = 10/85 (11%)

Query: 426 VTNAYRHGAASR-----IEINARQDNQQIYLTISDNGK-GIDLASITPGYGLRGIQSRVS 479
V N +HG A I + +DN + L + + G + + G GL+ ++ R+
Sbjct: 264 VENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQ 323

Query: 480 A-FGGNVSLSV---DNGTCLNVTLP 500
+G + + V +P
Sbjct: 324 MLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag        Hits    Score  E-value  Comments
YpAngola_A4132  HTHFIS  61     3e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 60.6 bits (147), Expect = 3e-13
Identities = 34/173 (19%), Positives = 63/173 (36%), Gaps = 20/173 (11%)

Query: 4 RVVFIDDHDIVRSGFAQLLSLEEDIQVVGEFSSAKQARAGLPGLQANICICDISMPDENG 63
++ DD +R+ Q LS V S+A + ++ + D+ MPDEN
Sbjct: 5 TILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 64 LDLLKGLPS---GMGVIMLSMHDSPALVETALERGARGFLSKRCKPEDLISAVRTVGSGG 120
DLL + + V+++S ++ A E+GA +L K +LI +
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR----- 117

Query: 121 VYLMPEIAQQLARVAVDPLTRREREIAVLLAEG---MEVREIAESLGLSPKTV 170
A + L ++ L+ E+ + L + T+
Sbjct: 118 -------ALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTL 163



 
Contact Sachin Pundhir for Bugs/Comments.