PredictBias

identification of genomic and pathogenicity islands in prokaryotic genome
 
A) Input parameters
Genome: PAK.gb
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic): (not specified)
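
The E-value (RPSBlast) setting above presumably acts as a cutoff on the "Expect" value printed in each alignment block of this report (for example "Score = 39.2 bits (91), Expect = 2e-05"). As a minimal sketch, and not PredictBias's own code, the following Python shows one way such statistics lines could be parsed and compared against the 0.05 cutoff. The function names, the regular expression, and the less-than-or-equal comparison are all assumptions made for illustration.

```python
import re

# Hypothetical helpers for illustration; they are not part of PredictBias.
STAT_LINE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([\d.eE+-]+)"
)


def parse_stat_line(line):
    """Return (bit_score, raw_score, e_value) from a BLAST statistics line, or None."""
    match = STAT_LINE.search(line)
    if match is None:
        return None
    bits, raw, expect = match.groups()
    # Some BLAST versions print very small values as "e-100";
    # prefix a "1" so that float() accepts the string.
    if expect[0] in "eE":
        expect = "1" + expect
    return float(bits), int(raw), float(expect)


def passes_cutoff(line, cutoff=0.05):
    """True if the line parses and its E-value is at or below the cutoff (assumed rule)."""
    parsed = parse_stat_line(line)
    return parsed is not None and parsed[2] <= cutoff


if __name__ == "__main__":
    # Two statistics lines copied from alignment blocks in this report.
    for line in [
        "Score = 39.2 bits (91), Expect = 2e-05",
        "Score = 29.1 bits (65), Expect = 0.037",
    ]:
        print(parse_stat_line(line), passes_cutoff(line))
```

Lines that are not statistics lines (such as "Length = 243") return None from `parse_stat_line`, so the same function can be run over a whole alignment block without pre-filtering.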
 
C) Potential GIs and PAIs in LR657304
S.No  Start        End          Bias  Virulence  Insertion elements  Prediction
1     PAKAF_00001  PAKAF_00011  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_00001  0       13       -4.339775   chromosomal replication initiator protein DnaA
PAKAF_00002  -1      10       -3.442456   DNA polymerase III, beta chain
PAKAF_00003  -2      8        -2.370125   RecF protein
PAKAF_00004  -1      10       -1.913183   DNA gyrase subunit B
PAKAF_00005  -1      9        -0.847486   lysophosphatidic acid acyltransferase, LptA
PAKAF_00006  -1      8        -1.941521   D-glycero-beta-D-manno-heptose 1,7- bisphosphate
PAKAF_00007  -1      6        -2.027518   hypothetical protein
PAKAF_00008  -1      9        -3.016707   glycyl-tRNA synthetase beta chain
PAKAF_00009  0       14       -5.098199   glycyl-tRNA synthetase alpha chain
PAKAF_00010  -1      22       -5.414300   DNA-3-methyladenine glycosidase I
PAKAF_00011  -2      14       -4.364936   probable 2-OH-lauroyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00001  PF03544       39     2e-05    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 39.2 bits (91), Expect = 2e-05
Identities = 21/119 (17%), Positives = 36/119 (30%), Gaps = 3/119 (2%)

Query: 85 TPRAAIVPSQTHVAPPPPVAPPPAPVQPVSAAPVVVPREELPPVTTAPSVSSDPYEPEEP 144
P+A P + V P P P P P + APVV+ + + P V +
Sbjct: 62 PPQAVQPPPEPVVEPEPEPEPIPEPPKE---APVVIEKPKPKPKPKPKPVKKVEQPKRDV 118

Query: 145 SIDPLAAAMPAGAAPAVRTERNVQVEGALKHTSYLNRTFTFENFVEGKSNQLARAAAWQ 203
A P R + K + + + + + A+A +
Sbjct: 119 KPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIE 177



Score = 34.6 bits (79), Expect = 6e-04
Identities = 17/91 (18%), Positives = 22/91 (24%), Gaps = 5/91 (5%)

Query: 82 RSRTPRAAIVPSQTHVAPPPPVAPPPAPVQPVSAAPVVVPREELPPVTTAPSVSSDPYEP 141
P A P + P + PP A P P V E P P
Sbjct: 40 VIELPAPA-QPISVTMVAPADLEPPQAVQPP----PEPVVEPEPEPEPIPEPPKEAPVVI 94

Query: 142 EEPSIDPLAAAMPAGAAPAVRTERNVQVEGA 172
E+P P P + +
Sbjct: 95 EKPKPKPKPKPKPVKKVEQPKRDVKPVESRP 125


2     PAKAF_00073  PAKAF_00081  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_00073  0       11       3.040910    TagQ1
PAKAF_00074  -1      11       3.421186    TagR1
PAKAF_00075  0       14       3.817835    TagS1
PAKAF_00076  0       15       3.693752    TagT1
PAKAF_00077  -1      15       3.427725    serine/threonine protein kinase PpkA
PAKAF_00078  0       16       3.514301    PppA
PAKAF_00079  0       15       3.617050    TagF1
PAKAF_00080  0       15       3.549443    TssM1
PAKAF_00081  -1      12       3.470860    TssL1
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00077  PF03544       38     1e-04    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 38.0 bits (88), Expect = 1e-04
Identities = 21/104 (20%), Positives = 32/104 (30%), Gaps = 4/104 (3%)

Query: 260 DRLAPSALEATQIRPLATPQGSPRASNPPPAEPAPMPPADLGGLQPVSIQLPPVTPSAGG 319
+AP+ LE Q P+ P P EP P PP + + P P
Sbjct: 53 TMVAPADLEPPQ-AVQPPPE--PVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVK 109

Query: 320 ATPPPPPPSQAA-KPPSPPPPPLPPAKPRAGGSRTPLIAAAAAA 362
P + P+ P PA+P + + +
Sbjct: 110 KVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSV 153



Score = 30.3 bits (68), Expect = 0.031
Identities = 22/111 (19%), Positives = 30/111 (27%), Gaps = 9/111 (8%)

Query: 261 RLAPSALEATQIRPLATPQGSP---------RASNPPPAEPAPMPPADLGGLQPVSIQLP 311
A + P P+ P P +P P P + + +
Sbjct: 63 PQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVE 122

Query: 312 PVTPSAGGATPPPPPPSQAAKPPSPPPPPLPPAKPRAGGSRTPLIAAAAAA 362
S T P P S A + P + PRA P A A A
Sbjct: 123 SRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQA 173


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00081  OMPADOMAIN    74     1e-16    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 74.2 bits (182), Expect = 1e-16
Identities = 40/138 (28%), Positives = 60/138 (43%), Gaps = 16/138 (11%)

Query: 318 AQRVAVEDAVDRSVVTIRGDELFASASASVRDEFQPLLLRIADALRKVK---GQVLVTGH 374
A A V T++ D LF A+++ E Q L ++ L + G V+V G+
Sbjct: 201 APAPAPAPEVQTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGY 260

Query: 375 SDNRPIATLRYPSNWKLSQARAQEVADLLGATTGDAGRFTAEGRSDTEPVATNASAEGRA 434
+D + Y N LS+ RAQ V D L + A + +A G ++ PV N +
Sbjct: 261 TDRI--GSDAY--NQGLSERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQ 316

Query: 435 R---------NRRVEITV 443
R +RRVEI V
Sbjct: 317 RAALIDCLAPDRRVEIEV 334


3     PAKAF_00103  PAKAF_00115  Y     N          N                   Genomic Island
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_00103  2       12       0.189296    Tsi7 immunity protein
PAKAF_00104  3       11       0.003348    TIGR02270 family protein
PAKAF_00105  4       11       -0.945042   probable carbonic anhydrase
PAKAF_00106  3       12       -1.167356   probable sulfate transporter
PAKAF_00107  4       14       -2.656354   hypothetical protein
PAKAF_00108  3       15       -1.484394   cytochrome c oxidase, subunit II
PAKAF_00109  2       16       -0.751279   cytochrome c oxidase, subunit I
PAKAF_00110  3       13       1.442770    cytochrome c oxidase assembly protein
PAKAF_00111  4       12       1.498901    cytochrome c oxidase, subunit III
PAKAF_00112  6       11       1.919158    twin transmembrane helix small protein
PAKAF_00113  6       11       1.951899    hypothetical protein
PAKAF_00114  5       12       1.018143    hypothetical protein
PAKAF_00115  2       11       1.039873    heme A synthase
4     PAKAF_00195  PAKAF_00206  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_00195  2       16       1.416988    putative NAD(P) transhydrogenase, subunit alpha
PAKAF_00196  3       13       0.793956    putative NAD(P) transhydrogenase, subunit alpha
PAKAF_00197  1       14       1.606174    pyridine nucleotide transhydrogenase, beta
PAKAF_00198  0       12       2.550732    TonB2
PAKAF_00199  2       12       2.841737    transport protein ExbB
PAKAF_00200  3       12       2.976611    transport protein ExbD
PAKAF_00201  2       9        3.499850    DUF3079 domain-containing protein
PAKAF_00202  2       10       4.324606    alpha/beta hydrolase
PAKAF_00210  3       10       4.705309    malonate decarboxylase alpha subunit
PAKAF_00203  2       11       6.223532    triphosphoribosyl-dephospho-CoA synthase
PAKAF_00204  1       12       5.422198    malonate decarboxylase delta subunit
PAKAF_00205  1       12       4.630390    malonate decarboxylase beta subunit
PAKAF_00206  0       11       3.595407    malonate decarboxylase gamma subunit
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00198  PF03544       88     4e-23    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 88.5 bits (219), Expect = 4e-23
Identities = 66/231 (28%), Positives = 98/231 (42%), Gaps = 14/231 (6%)

Query: 49 RETILLVLFALTLHGAVIHWLSQQRTPALPEVPPQVPPMTIEFTAPA----PPVVEPPP- 103
R L ++ +HGAV+ L + E+P P+++ APA P V+PPP
Sbjct: 12 RRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMVAPADLEPPQAVQPPPE 71

Query: 104 ----PEPLPPVVEEPPPPVVDENAVKPPPPKPVPKPKPKPKPQPRPKPAPKAVEPAPPAP 159
PEP P + EPP P PKP PKP K + R ++ +P
Sbjct: 72 PVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFEN 131

Query: 160 PQPAAPPAPPAPAAAPAPLTPPSANAGYLHNPAPEYPALAMCRGWEGTVLLRVHVLASGS 219
PA P + A AA P+T ++ L P+YPA A EG V ++ V G
Sbjct: 132 TAPARPTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKFDVTPDGR 191

Query: 220 PSEIQVQKSSGREALDQAAVKAVKRWSFVPAKRGDKAEDGWVSVPIDFKLN 270
+Q+ + ++ A++RW + P K G + V I FK+N
Sbjct: 192 VDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSG-----IVVNILFKIN 237


5     PAKAF_00248  PAKAF_00270  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_00248  2       26       -4.327437   probable transcriptional regulator
PAKAF_00249  5       43       -9.317427   UbiD family decarboxylase
PAKAF_00250  6       69       -16.534129  hypothetical protein
PAKAF_00251  8       75       -18.150879  DUF1275 domain-containing protein
PAKAF_00252  8       82       -20.225817  hypothetical protein
PAKAF_00253  8       89       -22.251665  D12 class N6 adenine-specific DNA
PAKAF_00254  7       65       -16.896713  Uncharacterized protein conserved in
PAKAF_00255  5       41       -11.784153  hypothetical protein
PAKAF_00256  2       20       -4.740966   hypothetical protein
PAKAF_00257  0       11       0.658569    hypothetical protein
PAKAF_00258  0       13       1.144553    shufflon-specific recombinase
PAKAF_00260  -1      14       1.508998    *succinate-semialdehyde dehydrogenase
PAKAF_00261  0       9        2.041987    4-aminobutyrate aminotransferase
PAKAF_00262  1       8        2.327192    response regulator
PAKAF_00263  2       12       3.041863    probable transcriptional regulator
PAKAF_00264  4       13       2.675369    carboxymuconolactone decarboxylase family
PAKAF_00265  6       12       2.871642    cupin domain-containing protein
PAKAF_00267  5       12       2.996625    antibiotic biosynthesis monooxygenase
PAKAF_00268  4       11       2.916348    probable transcriptional regulator
PAKAF_00269  4       11       3.594937    probable major facilitator superfamily (MFS)
PAKAF_00270  3       10       3.413720    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00262  HTHFIS        54     5e-10    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 54.1 bits (130), Expect = 5e-10
Identities = 26/124 (20%), Positives = 49/124 (39%), Gaps = 8/124 (6%)

Query: 1 MSAAKKAPVILIADPDPWSRDLLGQLVLGVRCDARLVLCGDGGEALAHCRRRRFALILAE 60
M+ A IL+AD D R +L Q + R + + + L++ +
Sbjct: 1 MTGAT----ILVADDDAAIRTVLNQALS--RAGYDVRITSNAATLWRWIAAGDGDLVVTD 54

Query: 61 LNLPQVDGFELLREARLRRSVAEQPFILISDRADQASVRAAVALAPTAYLVKPFQAENLM 120
+ +P + F+LL + R + P +++S + + A YL KPF L+
Sbjct: 55 VVMPDENAFDLLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELI 112

Query: 121 QRLR 124
+
Sbjct: 113 GIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00269  TCRTETB       29     0.037    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 29.1 bits (65), Expect = 0.037
Identities = 28/131 (21%), Positives = 47/131 (35%), Gaps = 6/131 (4%)

Query: 228 LAPYYL--EQGWSAQESGLLLGFLTAMEV-LSGLLAPALASRSRDRRPVLVGLTALMLAG 284
+ PY + S E G ++ F M V + G + L R R VL +
Sbjct: 278 MVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDR-RGPLYVLNIGVTFLSVS 336

Query: 285 FLGLAWAPASLPLLWALCLGLGIGGLFPMGLIVC--LDHFDAPQRAGQLAALVQGAGYLI 342
FL ++ + + + +GGL ++ + Q AG +L+ +L
Sbjct: 337 FLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLS 396

Query: 343 AGVSPWIAGLL 353
G I G L
Sbjct: 397 EGTGIAIVGGL 407


6     PAKAF_00567  PAKAF_00593  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_00567  2       16       1.228856    probable hydrolase
PAKAF_00568  1       14       0.258996    DUF805 domain-containing protein
PAKAF_00569  0       13       1.354018    probable transcriptional regulator
PAKAF_00570  0       14       0.729082    carboxymuconolactone decarboxylase family
PAKAF_00571  0       13       1.507491    hypothetical protein
PAKAF_00572  0       9        0.684576    YqaE/Pmp3 family membrane protein
PAKAF_00573  1       9        0.370039    hypothetical protein
PAKAF_00574  2       13       -0.791545   hypothetical protein
PAKAF_00575  0       11       -0.723250   SMI1/KNR4 family protein
PAKAF_00576  -2      10       -1.433511   DUF4375 domain-containing protein
PAKAF_00577  -2      9        -1.656829   hypothetical protein
PAKAF_00578  0       12       -2.158898   NERD domain-containing protein
PAKAF_00579  0       13       -2.121668   Fic family protein
PAKAF_00581  0       12       -1.256161   *bifunctional diguanylate cyclase/
PAKAF_00582  2       10       -1.425324   sigma factor RpoD
PAKAF_00583  1       9        -0.211904   DNA primase
PAKAF_00584  0       11       0.290101    GatB/YqeY domain-containing protein
PAKAF_00585  1       11       0.687021    30S ribosomal protein S21
PAKAF_00586  0       10       1.151367    O-sialoglycoprotein endopeptidase
PAKAF_00587  0       8        -1.107937   glycerol-3-phosphate acyltransferase PlsY
PAKAF_00588  0       9        -2.008429   dihydroneopterin aldolase
PAKAF_00589  0       11       -3.476003   2-amino-4-hydroxy-6-
PAKAF_00590  0       13       -3.395969   tRNA nucleotidyl transferase
PAKAF_00591  0       14       -3.808338   DUF4136 domain-containing protein
PAKAF_00592  0       14       -3.940449   SpoVR family protein
PAKAF_00593  0       13       -3.073594   YeaH/YhbH family protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00573  DNABINDINGHU  27     0.009    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 27.3 bits (61), Expect = 0.009
Identities = 17/55 (30%), Positives = 23/55 (41%), Gaps = 12/55 (21%)

Query: 41 ARALDELDDAVIERLCAA-----------SLRYRAARLGRLGESAEEVA-PARGV 83
A A+D + AV L +R RAAR GR ++ EE+ A V
Sbjct: 23 AAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGRNPQTGEEIKIKASKV 77


7     PAKAF_00612  PAKAF_00635  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_00612  2       11       -1.997660   probable permease of ABC transporter
PAKAF_00613  2       11       -2.040155   ribulose-phosphate 3-epimerase
PAKAF_00614  0       11       -1.453255   probable phosphoglycolate phosphatase
PAKAF_00615  0       12       -1.765820   anthranilate synthetase component I
PAKAF_00616  1       16       -0.036238   transcriptional regulator PrtN
PAKAF_00617  0       17       0.233570    transcriptional regulator PrtR
PAKAF_00618  0       22       0.947355    hypothetical protein
PAKAF_00619  1       21       0.858375    repressor, PtrB
PAKAF_00620  2       24       0.089251    hypothetical protein
PAKAF_00621  0       24       0.028195    holin
PAKAF_00622  2       33       -3.322331   hypothetical protein
PAKAF_00623  2       28       -3.411897   phage baseplate assembly protein V
PAKAF_00624  1       28       -3.958791   probable bacteriophage protein
PAKAF_00625  1       28       -3.928500   probable bacteriophage protein
PAKAF_00626  -1      30       -3.178161   probable bacteriophage protein
PAKAF_00627  -1      28       -2.866165   probable bacteriophage protein
PAKAF_00628  -1      22       -1.356413   probable bacteriophage protein
PAKAF_00629  0       25       -1.013713   probable bacteriophage protein
PAKAF_00630  1       25       -0.245995   phage tail assembly protein
PAKAF_00631  1       26       0.137851    phage tail length determinator protein
PAKAF_00632  2       23       0.816626    phage tail protein
PAKAF_00633  1       30       0.451340    conserved hypothetical protein
PAKAF_00634  1       26       0.091136    phage late control D family protein
PAKAF_00635  3       25       0.025428    glycoside hydrolase family 19 protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00631  PF07132       32     0.011    Harpin protein (HrpN)
		>PF07132#Harpin protein (HrpN)

Length = 356

Score = 31.6 bits (71), Expect = 0.011
Identities = 23/57 (40%), Positives = 30/57 (52%)

Query: 621 GSLAGAALGASIGSVVPVVGTLIGGLVGGAIGAWGGSELGGRLGRSLAGDPPAASDN 677
GS+ G LG +G + +G L GGL+GG +G GS LG LG +L G A
Sbjct: 62 GSMMGGGLGGGLGGLGSSLGGLGGGLLGGGLGGGLGSSLGSGLGSALGGGLGGALGA 118


8     PAKAF_00644  PAKAF_00653  Y     N          N                   Genomic Island
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_00644  2       20       -2.855002   probable bacteriophage protein
PAKAF_00645  2       23       -3.404899   peptidase P60
PAKAF_00646  1       22       -3.973144   probable bacteriophage protein
PAKAF_00647  1       20       -3.766098   probable bacteriophage protein
PAKAF_00648  0       26       -5.909797   hypothetical protein
PAKAF_00649  0       16       -4.189741   tail fiber protein
PAKAF_00650  1       9        -2.710573   hypothetical protein
PAKAF_00651  3       8        -3.414928   anthranilate synthase component II,Anthranilate
PAKAF_00652  2       6        -3.044706   anthranilate
PAKAF_00653  2       9        -3.290879   indole-3-glycerol-phosphate
9     PAKAF_00791  PAKAF_00800  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_00791  2       10       0.705257    NAD(P)/FAD-dependent oxidoreductase
PAKAF_00792  2       11       0.498547    Rrf2 family transcriptional regulator
PAKAF_00793  2       10       1.591400    second ferric pyoverdine receptor FpvB
PAKAF_00794  2       11       2.985207    probable oxidoreductase
PAKAF_00795  0       12       4.400236    probable acetyltransferase
PAKAF_00796  0       11       5.455942    probable transcriptional regulator
PAKAF_00797  0       10       5.283625    hypothetical protein
PAKAF_00798  1       9        5.289847    hypothetical protein
PAKAF_00799  3       11       5.413996    probable short-chain dehydrogenase
PAKAF_00800  -2      9        3.704405    ferric enterobactin transport protein FepG
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00795  SACTRNSFRASE  41     5e-07    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 40.7 bits (95), Expect = 5e-07
Identities = 18/63 (28%), Positives = 27/63 (42%), Gaps = 2/63 (3%)

Query: 76 RSTWAAQDVCYLEDLYVSPDVRGQQIGKQLIEYVRRQAEERRCARLYWHTQESNHRAQRL 135
RS W +ED+ V+ D R + +G L+ A+E L TQ+ N A
Sbjct: 83 RSNW--NGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHF 140

Query: 136 YDR 138
Y +
Sbjct: 141 YAK 143


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00799  DHBDHDRGNASE  119    6e-35    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 119 bits (299), Expect = 6e-35
Identities = 75/258 (29%), Positives = 117/258 (45%), Gaps = 32/258 (12%)

Query: 5 RTALVTGATRGIGLALARRLAASGWSVVGI-----------------ARHASDDFPGRLL 47
+ A +TGA +GIG A+AR LA+ G + + ARHA + FP
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHA-EAFP---- 63

Query: 48 CCDLADPAQTAETLRGLLSESA-VDALVNNAGIALPQSLENLDLAALQQVFDLNVRVAVQ 106
D+ D A E + E +D LVN AG+ P + +L + F +N
Sbjct: 64 -ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 107 LAQACLPGLKRSPAGRIVNLCSRAIHGAR-ERTAYAAAKSALVGVTRTWALELAPLGITV 165
+++ + +G IV + S R AYA++K+A V T+ LELA I
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 166 NAVAPGPIETELFRQTRPVGGEEERRILST-------IPMQRLGRPDEVAALIEFLLSEG 218
N V+PG ET++ E+ I + IP+++L +P ++A + FL+S
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 219 ASFVTGQVIGVDGGGSLG 236
A +T + VDGG +LG
Sbjct: 243 AGHITMHNLCVDGGATLG 260


10    PAKAF_01074  PAKAF_01084  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_01074  2       13       0.061612    HlyD family secretion protein
PAKAF_01075  2       13       0.117661    probable ATP-binding component of ABC
PAKAF_01076  1       10       0.945521    probable permease of ABC transporter
PAKAF_01077  0       10       1.348275    probable binding protein component of ABC
PAKAF_01078  2       10       1.882840    probable permease of ABC transporter
PAKAF_01079  1       9        2.710957    Na+/H+ antiporter NhaP
PAKAF_01081  1       14       3.669093    amino acid transporter
PAKAF_01082  0       15       3.873921    TIGR01459 family HAD-type hydrolase
PAKAF_01083  1       14       3.797827    protein tyrosine phosphatase TpbA
PAKAF_01084  0       14       3.192359    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01074  RTXTOXIND     50     6e-09    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 49.8 bits (119), Expect = 6e-09
Identities = 24/160 (15%), Positives = 60/160 (37%), Gaps = 13/160 (8%)

Query: 82 RIAVKQAESLVASRKATL-----EMRQLNAR-RRAEMDEMVVSRESRDDAHNTAAAAMAD 135
+ AV + E+ L ++ Q+ + A+ + +V++ +++ + +
Sbjct: 251 KHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDN 310

Query: 136 YEQAKAQLDAARLNLERTRVVAQVDGYVTNLNVHR-GDYARVGEAKMAVI-DKNSYWVYG 193
+L + + + A V V L VH G E M ++ + ++ V
Sbjct: 311 IGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTA 370

Query: 194 YFEETKLPYIREGDPVDMQLMS-----GEHLKGHVESIAR 228
+ + +I G +++ + +L G V++I
Sbjct: 371 LVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINL 410



Score = 49.4 bits (118), Expect = 7e-09
Identities = 21/179 (11%), Positives = 63/179 (35%), Gaps = 16/179 (8%)

Query: 13 LLILLVAVFIGRTLW--VNYMDTPWTRDGRVRAD--VINVAADVSGIVVDVPVRDNQLVK 68
+ ++ + + + ++ T +G++ + + IV ++ V++ + V+
Sbjct: 60 VAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVR 119

Query: 69 KGDLLMQIDPDHYRIAVKQAESLVASRKATLEMRQLNARRRAEMDEMVVSRESRDDAHNT 128
KGD+L+++ + +S + + R R E++++ + +
Sbjct: 120 KGDVLLKLTALGAEADTLKTQSSLLQARLEQT-RYQILSRSIELNKLPELKLPDEPYFQN 178

Query: 129 AAAAM---------ADYEQAKAQLDAARLNLERTRVVAQVDGYVTNLNVHRGDYARVGE 178
+ + + Q LNL++ R A+ + +N +
Sbjct: 179 VSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKR--AERLTVLARINRYENLSRVEKS 235


11    PAKAF_01150  PAKAF_01159  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_01150  1       17       -3.452491   probable methyltransferase
PAKAF_01151  2       19       -3.076364   O-acetylserine synthase
PAKAF_01152  1       14       -1.887920   IscR
PAKAF_01153  2       13       -1.999980   L-cysteine desulfurase (pyridoxal
PAKAF_01154  2       14       -1.936261   probable iron-binding protein IscU
PAKAF_01155  3       16       -1.996689   probable iron-binding protein IscA
PAKAF_01156  2       16       -2.553509   heat shock protein HscB
PAKAF_01157  1       16       -2.792444   heat shock protein HscA
PAKAF_01158  1       17       -3.248003   ferredoxin (2Fe-2S)
PAKAF_01159  2       16       -2.833962   Fe-S cluster assembly protein IscX
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01150  PHAGEIV       30     0.018    Gene IV protein signature.
		>PHAGEIV#Gene IV protein signature.

Length = 426

Score = 29.5 bits (66), Expect = 0.018
Identities = 12/57 (21%), Positives = 30/57 (52%), Gaps = 10/57 (17%)

Query: 124 DAVARASGATDILDAARVVDTLEEALSGCSVVLG------TSARDRRIPW----PLL 170
D+++ ++ A+D++ R + T G +++LG +++D +P+ PL+
Sbjct: 342 DSLSSSTQASDVITNQRSIATTVNLRDGQTLLLGGLTDYKNTSQDSGVPFLSKIPLI 398


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01157  SHAPEPROTEIN  107    1e-27    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein

signature.
Length = 347

Score = 107 bits (269), Expect = 1e-27
Identities = 77/363 (21%), Positives = 136/363 (37%), Gaps = 56/363 (15%)

Query: 22 VGIDLGTTNSLVAAVRSGVAEPLPDAQGRLILPSAVRYHAERAEVGESARAAAAEDPFNT 81
+ IDLGT N+L+ G+ L PS V +RA +S A +
Sbjct: 13 LSIDLGTANTLIYVKGQGIV---------LNEPSVVAIRQDRAGSPKSVAAVGHD----- 58

Query: 82 VISVKRLMGRGLEDVKQLGEQLPYRFRQGESHMPFIETVQGLKSPV----EVSADILRE- 136
K+++GR + I ++ +K V V+ +L+
Sbjct: 59 ---AKQMLGRTPGN---------------------IAAIRPMKDGVIADFFVTEKMLQHF 94

Query: 137 LRQRAETTLGGELVGAVITVPAYFDDAQRQATKDAARLAGLNVLRLLNEPTAAAVAYGLD 196
++Q + ++ VP +R+A +++A+ AG + L+ EP AAA+ GL
Sbjct: 95 IKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLP 154

Query: 197 KGAEGLVAIYDLGGGTFDISILRLTRGVFEVLATGGDTALGGDDFDHAIAGWVIEEAGLS 256
+ D+GGGT +++++ L V +GGD FD AI +V G
Sbjct: 155 VSEATGSMVVDIGGGTTEVAVISLNGVV-----YSSSVRIGGDRFDEAIINYVRRNYGSL 209

Query: 257 ADLDPGSQRQLLQIACAAKERLTDEASVR----VAYGDWSGELSRATLDELIEPFVARSL 312
+ ++R +I A E VR L+ + E ++ + +
Sbjct: 210 IG-EATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEILEALQEPLTGIV 268

Query: 313 KSCRRAVRDSGVDLEEI---RSVVMVGGSTRVPRVRTAVGELFGCEPLTDIDPDQVVAIG 369
+ A+ +L R +V+ GG + + + E G + DP VA G
Sbjct: 269 SAVMVALEQCPPELASDISERGMVLTGGGALLRNLDRLLMEETGIPVVVAEDPLTCVARG 328

Query: 370 AAI 372

Sbjct: 329 GGK 331


12    PAKAF_01172  PAKAF_01200  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_01172  2       9        -1.522938   probable oxidoreductase
PAKAF_01173  0       9        -0.696066   hypothetical protein
PAKAF_01174  -1      9        -0.284969   hypothetical protein
PAKAF_01175  -1      9        -0.077376   2-isopropylmalate synthase
PAKAF_01176  -3      8        1.167703    hypothetical protein
PAKAF_01177  -1      8        1.287454    Putative copper transport outer membrane porin
PAKAF_01178  2       9        2.052377    PepSY domain-containing protein
PAKAF_01179  2       13       1.749946    DUF4345 domain-containing protein
PAKAF_01180  2       11       2.039571    M23 family metallopeptidase
PAKAF_01181  2       13       1.426950    DUF2946 domain-containing protein
PAKAF_01182  2       11       1.157606    copper chaperone PCu(A)C
PAKAF_01183  2       11       1.286904    putative natural product biosynthesis protein
PAKAF_01184  1       12       1.418358    cysteine hydrolase
PAKAF_01185  0       10       2.168314    probable transcriptional regulator
PAKAF_01187  0       11       2.120953    probable transporter
PAKAF_01188  1       10       2.758822    TRAP transporter small permease
PAKAF_01189  1       9        2.868306    TRAP transporter substrate-binding protein DctP
PAKAF_01190  1       10       3.474122    probable transcriptional regulator
PAKAF_01191  1       10       3.099257    exodeoxyribonuclease VII large subunit
PAKAF_01192  0       10       2.416109    probable transcriptional regulator
PAKAF_01193  -2      11       1.707083    sulfite exporter TauE/SafE family protein
PAKAF_01194  -1      14       1.119506    probable acetylpolyamine aminohydrolase
PAKAF_01195  0       12       -2.777127   MFS transporter
PAKAF_01196  1       12       -3.577102   transporter
PAKAF_01197  2       16       -4.025580   probable transcriptional regulator
PAKAF_01198  3       18       -4.505887   inosine-5'-monophosphate dehydrogenase
PAKAF_01199  3       24       -5.032770   GMP synthase,GMP synthase
PAKAF_01200  3       28       -4.921632   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01184  ISCHRISMTASE  31     0.004    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 30.8 bits (69), Expect = 0.004
Identities = 25/124 (20%), Positives = 40/124 (32%), Gaps = 11/124 (8%)

Query: 5 QPKRALLVIDVQNEYVSGNLRIEFPSIQSSLERIGAAMDAAHAAGIPIVVVQHLA---PA 61
P RA+L+I Y + I + GIP+V P
Sbjct: 27 DPNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCVQLGIPVVYTAQPGSQNPD 86

Query: 62 D--------SPLFARGSRQAELHEVVASRPYQHKVEKQLASSFVGTGLADWLRERDIDTL 113
D P G + ++ +A + K S+F T L + +R+ D L
Sbjct: 87 DRALLTDFWGPGLNSGPYEEKIITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEGRDQL 146

Query: 114 AVVG 117
+ G
Sbjct: 147 IITG 150


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01187  RTXTOXINA     33     0.004    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family

signature.
Length = 1024

Score = 32.6 bits (74), Expect = 0.004
Identities = 25/94 (26%), Positives = 38/94 (40%), Gaps = 12/94 (12%)

Query: 161 SSLTNTSVGELFLAGVIPGLL--LAAAFMLLNAVYAYRNGLQARHAAPAWGEILAALSGA 218
+L N G + G+L ++A+F+L N + AA A E+ + G
Sbjct: 233 PNLDNIGAG----LDTVSGILSAISASFILSN-----ADADTRTKAA-AGVELTTKVLGN 282

Query: 219 LTALIAPVIIVAGIVLGLVTPTESGALIALYVAL 252
+ I+ II GL T + LIA V L
Sbjct: 283 VGKGISQYIIAQRAAQGLSTSAAAAGLIASAVTL 316


13    PAKAF_01304  PAKAF_01331  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_01304  2       7        1.600078    ABC transporter auxiliary component
PAKAF_01305  3       11       1.890877    DUF4340 domain-containing protein
PAKAF_01306  3       13       1.718532    SufE family protein
PAKAF_01307  1       12       1.270033    probable pyridoxal-phosphate dependent enzyme
PAKAF_01308  1       13       0.036342    tetrahydrodipicolinate succinylase
PAKAF_01309  2       9        0.228777    LysE family translocator
PAKAF_01310  -2      9        0.195047    ArsC family reductase
PAKAF_01311  -1      9        -0.721704   YkgJ family cysteine cluster protein
PAKAF_01312  0       10       -1.263331   hypothetical protein
PAKAF_01313  -1      10       -1.660542   hypothetical protein
PAKAF_01314  -2      11       -2.112134   probable sodium/hydrogen antiporter
PAKAF_01315  0       13       -2.774042   probable aminotransferase
PAKAF_01316  0       13       -3.754414   protein-PII uridylyltransferase
PAKAF_01317  2       15       -4.025413   methionine aminopeptidase
PAKAF_01319  1       14       -2.862226   30S ribosomal protein S2
PAKAF_01320  1       9        -1.436937   elongation factor Ts
PAKAF_01321  0       12       -0.842082   uridylate kinase
PAKAF_01322  -2      9        -1.778300   ribosome recycling factor
PAKAF_01323  -1      8        -1.983833   undecaprenyl pyrophosphate synthetase
PAKAF_01324  -1      9        -2.080590   phosphatidate cytidylyltransferase
PAKAF_01325  0       9        -2.942501   1-deoxy-d-xylulose 5-phosphate reductoisomerase
PAKAF_01326  1       11       -3.678034   MucP
PAKAF_01327  1       11       -3.313639   outer membrane protein Opr86
PAKAF_01328  3       13       -2.389298   probable outer membrane protein precursor
PAKAF_01329  3       11       -1.906070   UDP-3-O-[3-hydroxylauroyl] glucosamine
PAKAF_01330  1       10       -2.146704   (3R)-hydroxymyristoyl-[acyl carrier protein]
PAKAF_01331  2       10       -1.943307   UDP-N-acetylglucosamine acyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01308  FERRIBNDNGPP  31     0.006    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 31.1 bits (70), Expect = 0.006
Identities = 27/106 (25%), Positives = 42/106 (39%), Gaps = 23/106 (21%)

Query: 12 VGTQNRQEAWLEVFYAL--------PRLKPSSEIVAAVAPILGY--AAGNQALTFTSQQA 61
VG R E LE+ + PS E++A +AP G+ + G Q L +
Sbjct: 81 VGL--RTEPNLELLTEMKPSFMVWSAGYGPSPEMLARIAPGRGFNFSDGKQPLAMARKSL 138

Query: 62 YQLADALKGIDAAQSALL----------SRLA-ESQKPLVATLLAE 96
++AD L AA++ L R +PL+ T L +
Sbjct: 139 TEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLID 184


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01316  YERSSTKINASE  32     0.014    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 32.0 bits (72), Expect = 0.014
Identities = 22/89 (24%), Positives = 43/89 (48%), Gaps = 1/89 (1%)

Query: 63 ILQQAWQRFDWGDDADIALVAVGGYGRGELHPYSDVDLLILLDSEDQESFREPIEGFLTL 122
I++ + QR D + +G R H + +++L+ L + Q E GFL
Sbjct: 538 IVEPSLQRIQKHLDQTHSFSDIGSLVRAHKHLETLLEVLVTLSQQGQPVSSETY-GFLNR 596

Query: 123 LWDIGLEVGQSVRSVQQCAEEARADLTVI 151
L + + + Q + ++QQ E A+A L+++
Sbjct: 597 LTEAKITLSQQLNTLQQQQESAKAQLSIL 625


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01321  CARBMTKINASE  37     3e-05    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 37.5 bits (87), Expect = 3e-05
Identities = 17/79 (21%), Positives = 28/79 (35%), Gaps = 15/79 (18%)

Query: 132 GEVVIFSAGTGNPFFTT-------------DSAACLRAIEIDADVVLKATKVDGVYTADP 178
G +VI S G G P D A A E++AD+ + T V+G
Sbjct: 186 GVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNGAALY-- 243

Query: 179 FKDPNAEKFERLTYDEVLD 197
+ + + +E+
Sbjct: 244 YGTEKEQWLREVKVEELRK 262


14    PAKAF_01407  PAKAF_01417  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_01407  -1      11       3.315183    3-hydroxyisobutyrate dehydrogenase
PAKAF_01408  -1      11       3.172141    probable acetyl-coa synthetase
PAKAF_01409  0       11       4.310821    probable oxidoreductase
PAKAF_01410  0       11       4.696411    antibiotic biosynthesis monooxygenase
PAKAF_01411  -1      11       4.949886    probable transcriptional regulator
PAKAF_01413  -1      11       4.123971    TatD family deoxyribonuclease
PAKAF_01414  0       10       4.045354    fructose transport system repressor FruR
PAKAF_01415  0       10       4.204365    phosphotransferase system transporter enzyme I,
PAKAF_01416  2       11       3.232802    1-phosphofructokinase
PAKAF_01417  1       14       3.123991    phosphotransferase system transporter
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01415  PHPHTRNFRASE  610    0.0      Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase

signature.
Length = 572

Score = 610 bits (1574), Expect = 0.0
Identities = 219/565 (38%), Positives = 340/565 (60%), Gaps = 13/565 (2%)

Query: 401 ERLQAIAASPGIASGPAHVQVAQRFEFQPR-GESPAHERERLLRAKRAVDEEIVGLVERS 459
++ IAAS G+A A + + + + + E E+L A EE+ + +++
Sbjct: 3 HKITGIAASSGVAIAKAFIHLEPNVDIEKTSITDVSTEIEKLTAALEKSKEELRAIKDQT 62

Query: 460 TVKA---IREIFVTHREMLDDPELAEQVQLRL-NRGESAEAAWSRVVEDSAAQQEALHDA 515
EIF H +LDDPEL + ++ ++ N +AE A V + + E++ +
Sbjct: 63 EASMGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDNE 122

Query: 516 LLAERAADLRDLGRRVLARLCGVEAPREPE--QPYILVMDEVGPSDVARLDAQRVAGILT 573
+ ERAAD+RD+ +RVL L GVE + +++ +++ PSD A+L+ Q V G T
Sbjct: 123 YMKERAADIRDVSKRVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFAT 182

Query: 574 ARGGATSHSAIIARALGIPALVGAGAAVLGLEPGTALLLDGEHGWLQVAPSTEQLQQAAA 633
GG TSHSAI++R+L IPA+VG ++ G +++DG G + V P+ E+++
Sbjct: 183 DIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYEE 242

Query: 634 ERDARQQRQARADAQRLEPARTRDGHAVEVCANLGDTAGAARAVELGAEGVGLLRTEFVF 693
+R A ++++ EP+ T+DG VE+ AN+G + G EG+GL RTEF++
Sbjct: 243 KRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFLY 302

Query: 694 MNNARAPDLATQEAEYRRVLDALDGRPLVARTLDVGGDKPLPYWPIPHEENPYLGLRGIR 753
M+ + P Q Y+ V+ +DG+P+V RTLD+GGDK L Y +P E NP+LG R IR
Sbjct: 303 MDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTLDIGGDKELSYLQLPKELNPFLGFRAIR 362

Query: 754 LTLQRPQILETQLRALFRAAGERPLRVMFPMVGSLDEWRQARDLALRLREEI------PL 807
L L++ I TQLRAL RA+ L+VMFPM+ +L+E RQA+ + ++++
Sbjct: 363 LCLEKQDIFRTQLRALLRASTYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGVDVS 422

Query: 808 ADLQLGIMVEVPSAALLAPVLAREVDFFSVGTNDLTQYTLAIDRGHPSLSAQADGLHPAV 867
+++GIMVE+PS A+ A + A+EVDFFS+GTNDL QYT+A DR + +S HPA+
Sbjct: 423 DSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPYHPAI 482

Query: 868 LQLIDMTVRAAHAEGKWVGVCGELAADPLALPLLVGLGVDELSVSARSIALVKAGVRELQ 927
L+L+DM ++AAH+EGKWVG+CGE+A D +A+PLL+GLG+DE S+SA SI ++ + +L
Sbjct: 483 LRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQLLKLS 542

Query: 928 LVAARGLARKALGLASAAEVRALVE 952
+ A+KAL L +A EV LV+
Sbjct: 543 KEELKPFAQKALMLDTAEEVEQLVK 567


15    PAKAF_01553  PAKAF_01577  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_01553  -1      12       3.538768    DUF1329 domain-containing protein
PAKAF_01554  0       11       4.428671    probable transcriptional regulator
PAKAF_01555  -1      10       4.096229    hypothetical protein
PAKAF_01556  -1      11       4.309976    leucine dehydrogenase
PAKAF_01557  -2      10       3.782597    probable pyruvate dehydrogenase E1 component,
PAKAF_01558  -2      10       3.516219    probable pyruvate dehydrogenase E1 component,
PAKAF_01559  -1      12       3.493922    probable dihydrolipoamide acetyltransferase
PAKAF_01560  2       14       2.031949    Protein of unknown function (DUF2809)
PAKAF_01561  0       12       3.365477    hypothetical protein
PAKAF_01562  0       11       1.725464    conserved hypothetical protein
PAKAF_01563  1       11       2.040905    DUF2790 domain-containing protein
PAKAF_01564  0       12       2.987825    hypothetical protein
PAKAF_01565  1       11       3.224190    HasI
PAKAF_01566  2       11       3.553857    HasS
PAKAF_01567  2       12       2.834845    heme uptake outer membrane receptor HasR
PAKAF_01568  1       14       3.497981    heme acquisition protein HasAp
PAKAF_01569  1       14       3.098247    transport protein HasD
PAKAF_01570  2       14       1.637402    metalloprotease secretion protein
PAKAF_01571  2       15       0.992899    probable outer membrane protein precursor
PAKAF_01572  1       15       0.333604    phosphate-starvation-inducible protein PsiE
PAKAF_01573  1       16       0.051477    biotin/lipoyl-binding protein
PAKAF_01574  2       16       0.149579    ABC transporter permease
PAKAF_01575  3       17       1.068475    putative ABC-type multidrug transport system,
PAKAF_01576  2       15       2.689649    YceK/YidQ family lipoprotein
PAKAF_01577  2       18       2.414316    probable transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_01556  DHBDHDRGNASE  28  0.032  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 28.5 bits (63), Expect = 0.032
Identities = 17/62 (27%), Positives = 29/62 (46%), Gaps = 1/62 (1%)

Query: 160 LGSDDLEGLRVAVQGLGH-VGYALAEQLAAVGAELLVCDLDPGRVQLAVEQLGAHPLAPE 218
+ + +EG + G +G A+A LA+ GA + D +P +++ V L A E
Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE 60

Query: 219 AL 220
A
Sbjct: 61 AF 62


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_01568  PF06438  276  1e-97  Heme acquisition protein HasAp
		>PF06438#Heme acquisition protein HasAp

Length = 205

Score = 276 bits (706), Expect = 1e-97
Identities = 204/205 (99%), Positives = 205/205 (100%)

Query: 1 MSISISYSTTYSGWTVADYLADWSAYFGDVNHRPGQVVDGSNTGGFNPGPFDGSQYALKS 60
MSISISYSTTYSGWTVADYLADWSAYFGDVNHRPGQVVDGSNTGGFNPGPFDGSQYALKS
Sbjct: 1 MSISISYSTTYSGWTVADYLADWSAYFGDVNHRPGQVVDGSNTGGFNPGPFDGSQYALKS 60

Query: 61 TASDAAFIAGGDLHYTLFSNPSHTLWGKLDSIALGDTLTGGASSGGYALDSQEVSFSNLG 120
TASDAAFIAGGDLHYTLFSNPSHTLWGKLDSIALGDTLTGGASSGGYALDSQEVSFSNLG
Sbjct: 61 TASDAAFIAGGDLHYTLFSNPSHTLWGKLDSIALGDTLTGGASSGGYALDSQEVSFSNLG 120

Query: 121 LDSPIAQGRDGTVHKVVYGLMSGDSSALQGQIDALLKAVDPSLSINSTFDQLAAAGVAHA 180
LDSPIAQGRDGTVHKVVYGLMSGDSSALQGQIDALLKAVDPSLSINSTFDQLAAAGVAHA
Sbjct: 121 LDSPIAQGRDGTVHKVVYGLMSGDSSALQGQIDALLKAVDPSLSINSTFDQLAAAGVAHA 180

Query: 181 TPAAAAAEIGVVGVQELPHDLALAA 205
TPAAAAAE+GVVGVQELPHDLALAA
Sbjct: 181 TPAAAAAEVGVVGVQELPHDLALAA 205


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_01570  RTXTOXIND  417  e-145  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 417 bits (1073), Expect = e-145
Identities = 96/435 (22%), Positives = 170/435 (39%), Gaps = 8/435 (1%)

Query: 15 AALELDEK---RFSRLGWGLVLLGFVGFLLWAGLAPLDKGVGVSGTVMVAGSRKAVQHPT 71
A LEL E R RL ++ V + + L ++ +G + +G K ++
Sbjct: 44 AHLELIETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIE 103

Query: 72 GGLVRHIRVHEGERVEAGQVLLEMDATQARAQADGLFAQYLAALASLARLSAERDEKARI 131
+V+ I V EGE V G VLL++ A A A + L A R
Sbjct: 104 NSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELN 163

Query: 132 EFPAELLALDDPRLPTLLEQQ----RQLHDSRRRALRLELDGLAETVAGSQAQLDGLQAA 187
+ P EL D+P + E++ L + + + + +A+ + A
Sbjct: 164 KLP-ELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLAR 222

Query: 188 LRSKEQQRAALEEQLRGLRQLASEGYVPRNRLLDSERLLAQVNGEIAGDLGSLGSTRRQI 247
+ E + +L L + + ++ +L+ E + E+ L +I
Sbjct: 223 INRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEI 282

Query: 248 LELRLRMAQRREKFQEEVRASLADAQVRAEELRNRLASARFDLANSEVRAPVAGLVVGQE 307
L + + F+ E+ L L LA S +RAPV+ V +
Sbjct: 283 LSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLK 342

Query: 308 VFTEGGVIAPGQQLMEILPERQPLLVDARLPVEMVDKVRVGLPVELMFSAFNQSTTPRVE 367
V TEGGV+ + LM I+PE L V A + + + + VG + AF + +
Sbjct: 343 VHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLV 402

Query: 368 GEVTLVSADRLLDERSEAPYYRVRIRVGEEGVRRLAGLEIRPGMPVEAFVRSGERSLLNY 427
G+V ++ D + D+R + + + + GM V A +++G RS+++Y
Sbjct: 403 GKVKNINLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISY 462

Query: 428 LFKPLADRTHLALGE 442
L PL + +L E
Sbjct: 463 LLSPLEESVTESLRE 477


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_01571  RTXTOXIND  32  0.007  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 31.7 bits (72), Expect = 0.007
Identities = 20/171 (11%), Positives = 49/171 (28%), Gaps = 11/171 (6%)

Query: 60 LPSLRYDYNKARNDSTVSQGDARVERDYRSYASTLSLEQPLFDYEAYARYRQ-GEAQAL- 117
L +L + + + S++ Q R S + P ++ E + L
Sbjct: 127 LTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLR 186

Query: 118 ---FADEQFRGRSQELA---VRLFAAYSETLFAREQVVLAEAQRRALETQLAFNQRAFEE 171
EQF + + L +E L ++ E R +++L +
Sbjct: 187 LTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHK 246

Query: 172 GEGTRTDLLE---TRARLSLTRAEEIAASDRAAAARRTLEAMLGQALEDRE 219
+ +LE + ++ + + + + +
Sbjct: 247 QAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFK 297


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_01573  RTXTOXIND  56  6e-11  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 56.0 bits (135), Expect = 6e-11
Identities = 25/161 (15%), Positives = 59/161 (36%), Gaps = 17/161 (10%)

Query: 41 IVSSKAKGRVQVLHVRRGDEVKQGDLLISLDSPELEAQLDALHAARNQAQAQLDESLHGT 100
+ V+ + V+ G+ V++GD+L+ L + EA ++ QA+ + +
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILS 157

Query: 101 REESIRALKASLAQAEAELRNAESDFQRNQQMVERGFLSRTQFDLSRRERDVARDRVAEA 160
R + L E +N ++++ L + QF + ++ + +
Sbjct: 158 RSIELNKLPELKLPDEPYFQNV-----SEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKK 212

Query: 161 RANLDEGLKGDREERRQALQAAVRRADAQIAELQAQIDDLQ 201
RA + A + R + ++++DD
Sbjct: 213 RAER------------LTVLARINRYENLSRVEKSRLDDFS 241



Score = 52.9 bits (127), Expect = 7e-10
Identities = 29/205 (14%), Positives = 77/205 (37%), Gaps = 24/205 (11%)

Query: 75 LEAQLDALHAARNQAQAQLDESLHGTREESIRALKASLAQAEAELRNAESDFQRNQQMVE 134
++ Q + Q + LD+ + + A + + E R +S ++
Sbjct: 191 IKEQFSTWQNQKYQKELNLDK-----KRAERLTVLARINRYENLSRVEKSRLDDFSSLLH 245

Query: 135 RGFLSRTQFDLSRRERDVARDRVAEARANLDE------GLKGDREERRQALQAAV----R 184
+ +++ + A + + ++ L++ K + + Q + + R
Sbjct: 246 KQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLR 305

Query: 185 RADAQIAELQAQI----DDLQ---VRAPVNGEVGPIPA-EQGELINAYSPLLTLVRLDDS 236
+ I L ++ + Q +RAPV+ +V + +G ++ L+ +V DD+
Sbjct: 306 QTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDT 365

Query: 237 YFV-FNLREDILAKVRKGDRIVMQV 260
V ++ + + G +++V
Sbjct: 366 LEVTALVQNKDIGFINVGQNAIIKV 390


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_01575  ABC2TRNSPORT  28  0.039  ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 28.4 bits (63), Expect = 0.039
Identities = 27/122 (22%), Positives = 50/122 (40%), Gaps = 1/122 (0%)

Query: 246 LGYRQSASFFMLLGIVLPFLIAVIALSEFIAELLPTEESVYLTMTFITLPLFYMAGYSWP 305
LGY Q S L ++ +A +L + L P+ + T + P+ +++G +P
Sbjct: 139 LGYTQWLSLLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFP 198

Query: 306 EQAMPDWVRWLADAIPSTWAIRAIAEMNQMDLPLREVSDHALVLLGMAATYALLGTLLYQ 365
+P + A +P + +I I + + P+ +V H L L T L +
Sbjct: 199 VDQLPIVFQTAARFLPLSHSIDLIRPI-MLGHPVVDVCQHVGALCIYIVIPFFLSTALLR 257

Query: 366 YR 367
R
Sbjct: 258 RR 259


16  PAKAF_01604  PAKAF_01616  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
PAKAF_01604  4  27  -4.542364  phosphonate metabolism protein PhnP
PAKAF_01605  5  26  -3.792232  hypothetical protein
PAKAF_01606  5  35  -6.865733  hypothetical protein
PAKAF_01607  4  29  -6.418595  hypothetical protein
PAKAF_01608  4  27  -5.501062  probable acetyltransferase
PAKAF_01610  4  24  -5.154622  *hypothetical protein
PAKAF_01611  2  18  -3.482671  sugar-binding protein
PAKAF_01612  -1  23  -4.851895  hypothetical protein
PAKAF_01613  -1  11  -0.948301  aliphatic amidase
PAKAF_01614  -1  9  0.556720  probable chaperone
PAKAF_01615  1  10  -0.383008  aliphatic amidase expression-regulating protein
PAKAF_01616  3  10  0.477905  aliphatic amidase regulator
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_01608  SACTRNSFRASE  56  5e-12  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 56.1 bits (135), Expect = 5e-12
Identities = 21/71 (29%), Positives = 33/71 (46%)

Query: 151 RSAILEDMVVDRHARGQGVGRELIGRAVERARSWGCYKLALSSHQDRETAQRFYAALGFT 210
A++ED+ V + R +GVG L+ +A+E A+ L L + +A FYA F
Sbjct: 88 GYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147

Query: 211 SHGVSLALHLG 221
V L+
Sbjct: 148 IGAVDTMLYSN 158


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_01614  HTHFIS  39  3e-05  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 38.7 bits (90), Expect = 3e-05
Identities = 36/177 (20%), Positives = 61/177 (34%), Gaps = 21/177 (11%)

Query: 31 LLQAHLSHRSALHSRFRFDPAAVMDCLRAEVLGQEPALQAVEDMLKVVRADIADPRRPLF 90
++ L+ S+ D M ++G+ A+Q + +L + L
Sbjct: 114 IIGRALAEPKRRPSKLEDDSQDGMP-----LVGRSAAMQEIYRVLARL----MQTDLTL- 163

Query: 91 SALFLGPTGVGKTEIVRALARALHGDAEGFCRVDMNTLSQEHYAAALTGAPPG-YVGA-K 148
+ G +G GK + RAL F ++M + ++ + L G G + GA
Sbjct: 164 --MITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQT 221

Query: 149 EGTTLLEQDKLDGSPGRPGIVLFDELEKASPEVVHALLNVLDNGLLRVASGERTYHF 205
T EQ +G G + DE+ + LL VL G G
Sbjct: 222 RSTGRFEQ--AEG-----GTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRS 271


17  PAKAF_01687  PAKAF_01701  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
PAKAF_01687  4  27  -4.287288  probable HIT family protein
PAKAF_01688  4  27  -4.037732  VgrG4a
PAKAF_01689  7  37  -5.579366  DUF4123 domain-containing protein
PAKAF_05919  6  38  -5.982364  Tli1
PAKAF_05920  3  24  -3.869548  Tli1
PAKAF_01690  1  18  -2.499073  Tle1
PAKAF_01691  -2  12  -0.305457  hypothetical protein
PAKAF_01692  -2  13  -0.486700  cysteine hydrolase family protein
PAKAF_01693  -1  14  -0.479872  ankyrin repeat domain-containing protein
PAKAF_01694  0  13  -0.988846  3-oxoacyl-(acyl carrier protein) synthase III
PAKAF_01695  0  13  -1.862130  probable sigma-70 factor, ECF subfamily
PAKAF_01697  2  14  -2.211357  hypothetical protein
PAKAF_01698  2  13  -2.099610  DUF692 family protein
PAKAF_01699  1  12  -2.350249  DUF2063 domain-containing protein
PAKAF_01700  2  11  -2.568331  DoxX family protein
PAKAF_01701  2  12  -2.473079  Pyrophosphate-specific outer membrane porin OprO
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_01692  ISCHRISMTASE  28  0.013  Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 28.4 bits (63), Expect = 0.013
Identities = 18/93 (19%), Positives = 35/93 (37%), Gaps = 3/93 (3%)

Query: 75 AADRVFVKHGY--LPTAELVDHLRALRAERVLVCGIQADTCVLAAGFALFDAGLQPTLIG 132
D V K Y L++ +R +++++ GI A L F ++ +G
Sbjct: 116 DDDLVLTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVG 175

Query: 133 DLVLGSSLDRSGELGVRLWKHHFGQVVSLAEVL 165
D V SL++ ++ + V +L
Sbjct: 176 DAVADFSLEKH-QMALEYAAGRCAFTVMTDSLL 207


18  PAKAF_01774  PAKAF_01785  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
PAKAF_01774  2  10  1.488107  probable permease of ABC transporter
PAKAF_01775  2  12  1.467469  potassium uptake protein TrkH
PAKAF_01776  2  13  3.100864  YkgJ family cysteine cluster protein
PAKAF_01777  3  14  3.057135  nitroreductase
PAKAF_01778  3  15  2.527221  hypothetical protein
PAKAF_01779  2  16  0.690112  probable two-component sensor
PAKAF_01780  1  13  0.282899  hypothetical protein
PAKAF_01781  0  14  0.247557  two-component response regulator CpxR
PAKAF_01782  1  14  -0.525603  hypothetical protein
PAKAF_01783  3  11  0.057326  YciI family protein
PAKAF_01784  2  11  -0.450027  putative intracellular septation protein A
PAKAF_01785  2  13  0.312705  PHP domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_01779  PF06580  29  0.026  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.4 bits (66), Expect = 0.026
Identities = 16/100 (16%), Positives = 35/100 (35%), Gaps = 17/100 (17%)

Query: 341 VDNLLRNAVRFNPVGQPLEVRASSAGDYLRLSVRDHGPGIAAELQEQLGEPFFRAPNQSS 400
V+N +++ + P G + ++ + + L V + G +E
Sbjct: 264 VENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE-------------- 309

Query: 401 PGHGLGLA-IARRAIERHGGHLRLG-NHPDGGFIATLSLP 438
G GL + R +G ++ + G A + +P
Sbjct: 310 -STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_01781  HTHFIS  103  9e-28  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 103 bits (258), Expect = 9e-28
Identities = 42/117 (35%), Positives = 63/117 (53%)

Query: 4 LLLIDDDRELCELLGTWLVQEGFSVRASHDGAQARRALAEQTPDAVVLDVMLPDGSGLEL 63
+L+ DDD + +L L + G+ VR + + A R +A D VV DV++PD + +L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 64 LKQLRGDHPDLPVLMLSARGEPLDRILGLELGADDYLAKPCDPRELTARLRAVLRRT 120
L +++ PDLPVL++SA+ + I E GA DYL KP D EL + L
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_01782  IGASERPTASE  28  0.010  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 28.1 bits (62), Expect = 0.010
Identities = 16/76 (21%), Positives = 31/76 (40%), Gaps = 8/76 (10%)

Query: 22 EEPAPAPIPAAQPSITQATAELERRLVETERQRDELVSRMRQENRQLREQ--------LQ 73
E P P P PA T+ AE ++ +T + ++ + +NR++ ++ Q
Sbjct: 1022 EAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQ 1081

Query: 74 AAQAQRQPPLLTEEQT 89
+ + E QT
Sbjct: 1082 TNEVAQSGSETKETQT 1097


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_01783  adhesinmafb  30  9e-04  Neisseria meningitidis: adhesin MafB signature.
		>adhesinmafb#Neisseria meningitidis: adhesin MafB signature.

Length = 467

Score = 30.4 bits (68), Expect = 9e-04
Identities = 13/45 (28%), Positives = 18/45 (40%)

Query: 53 AAGFTGSLIVAEFDSLAAAQSWAEADPYRAAGVYAEVVVKPFKKV 97
G GS+ E ++ A W + +P A V A V KV
Sbjct: 278 VIGGLGSVAGFEKNTREAVDRWIQENPNAAETVEAVFNVAAAAKV 322


19  PAKAF_01813  PAKAF_01836  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
PAKAF_01813  2  12  -0.046427  probable short-chain dehydrogenase
PAKAF_01814  2  10  -0.452636  probable hydrolase
PAKAF_01815  2  10  -1.026611  3-demethylubiquinone-9 3-methyltransferase
PAKAF_01816  1  9  -0.924256  TRZ/ATZ family hydrolase
PAKAF_01817  1  11  -1.071564  5-methylthioribose-1-phosphate isomerase MtnA
PAKAF_01819  1  12  -1.179258  DNA gyrase subunit A
PAKAF_01820  1  13  -2.497615  3-phosphoserine aminotransferase
PAKAF_01821  3  15  -2.529461  chorismate mutase
PAKAF_01822  2  23  -4.491212  histidinol-phosphate aminotransferase
PAKAF_01823  3  33  -7.512429  bifunctional prephenate
PAKAF_01824  4  51  -12.420883  cytidylate kinase
PAKAF_01825  4  62  -15.183070  30S ribosomal protein S1
PAKAF_01826  5  78  -16.995623  integration host factor beta subunit
PAKAF_01827  4  80  -16.853306  O-antigen chain length regulator,Polysaccharide
PAKAF_01828  4  79  -16.541156  UDP-glucose/GDP-mannose dehydrogenase-like
PAKAF_01829  4  78  -15.331650  wbpP,dTDP-glucose 4,6-dehydratase,Vi
PAKAF_01830  4  74  -13.700263  colanic acid exporter,Polysaccharide
PAKAF_01831  3  69  -11.832990  WbpR,sugar transferase, PEP-CTERM/EpsH1 system
PAKAF_01832  3  55  -9.701857  asparagine synthase-like protein,Asparagine
PAKAF_01833  2  49  -8.056302  glycosyl transferase-like protein,Spore coat
PAKAF_01834  1  35  -5.797104  glycosyl transferase-like protein,Capsular
PAKAF_01835  0  22  -4.206802  NAD dependent epimerase/dehydratase family-like
PAKAF_01836  -1  16  -3.464148  putative group 4 glycosyl
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_01813  DHBDHDRGNASE  91  4e-24  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 90.9 bits (225), Expect = 4e-24
Identities = 56/195 (28%), Positives = 91/195 (46%), Gaps = 5/195 (2%)

Query: 11 LKDRVILVTGAGRGIGAAAAKTFAAHGATVLLLGKTEEYLNEVYDAIEAAGHPQAAVIPF 70
++ ++ +TGA +GIG A A+T A+ GA + + E L +V +++A A F
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEA---F 62

Query: 71 NLETAQPHQFEELAATLENEFGRIDGLLHNASILGPRSPMQQISGENFMRVMQVNVNAMF 130
+ +E+ A +E E G ID L++ A +L P + +S E + VN +F
Sbjct: 63 PADVRDSAAIDEITARIEREMGPIDILVNVAGVLRP-GLIHSLSDEEWEATFSVNSTGVF 121

Query: 131 MLTTAMLPLMKLSSDASIIFTSSSVGRKGRAYWGAYSVSKFATEGLMQTLADELDGTSAI 190
+ ++ M SI+ S+ R AY+ SK A + L EL I
Sbjct: 122 NASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAE-YNI 180

Query: 191 RANSVNPGATRTSMR 205
R N V+PG+T T M+
Sbjct: 181 RCNIVSPGSTETDMQ 195


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_01816  UREASE  37  2e-04  Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 36.6 bits (85), Expect = 2e-04
Identities = 20/41 (48%), Positives = 23/41 (56%), Gaps = 3/41 (7%)

Query: 341 DAHRALRMA---TLNGARALGLERLIGSLETGKAADLVAFD 378
D R R T+N A A GL IGSLE GK ADLV ++
Sbjct: 398 DNFRVKRYIAKYTINPAIAHGLSHEIGSLEVGKRADLVLWN 438


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_01826  DNABINDINGHU  118  1e-38  Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 118 bits (297), Expect = 1e-38
Identities = 35/89 (39%), Positives = 54/89 (60%), Gaps = 1/89 (1%)

Query: 2 TKSELIERIVTHQGQLSAKDVELAIKTMLEQMSQALATGDRIEIRGFGSFSLHYRAPRVG 61
K +LI + V +L+ KD A+ + +S LA G+++++ GFG+F + RA R G
Sbjct: 3 NKQDLIAK-VAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKG 61

Query: 62 RNPKTGESVRLDGKFVPHFKPGKELRDRV 90
RNP+TGE +++ VP FK GK L+D V
Sbjct: 62 RNPQTGEEIKIKASKVPAFKAGKALKDAV 90


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_01829  NUCEPIMERASE  257  2e-86  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 257 bits (659), Expect = 2e-86
Identities = 101/341 (29%), Positives = 159/341 (46%), Gaps = 30/341 (8%)

Query: 19 LITGVAGFIGSNLLETLLKLDQKVVGLDNFATGHQRNLDEVRSLVSEKQWSNFKFIQGDI 78
L+TG AGFIG ++ + LL+ +VVG+DN + +L + R + + F+F + D+
Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQ--PGFQFHKIDL 61

Query: 79 RNLDDCNNACA--GVDYVLHQAALGSVPRSINDPITSNATNIDGFLNMLIAARDAKVQSF 136
+ + + A + V +V S+ +P +N+ GFLN+L R K+Q
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121

Query: 137 TYAASSSTYGDHPGLP-KVEDTIGKPLSPYAVTKYVNELYADVFSRCYGFSTIGLRYFNV 195
YA+SSS YG + +P +D++ P+S YA TK NEL A +S YG GLR+F V
Sbjct: 122 LYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFTV 181

Query: 196 FGRRQDPNGAYAAVIPKWTSSMIQGDDVYINGDGETSRDFCYIENTVQANLLAATAGLDA 255
+G P+ A K+T +M++G + + G+ RDF YI++ +A + A
Sbjct: 182 YGPWGRPDMAL----FKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHA 237

Query: 256 RNQ----------------VYNIAVGGRTSLNQLFFALRDGLAENGVSYHREPVYRDFRE 299
Q VYNI L AL D L G+ + +
Sbjct: 238 DTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDAL---GIEAKKN--MLPLQP 292

Query: 300 GDVRHSLADISKAAKLLGYAPKYDVSAGVALAMPWYIMFLK 340
GDV + AD +++G+ P+ V GV + WY F K
Sbjct: 293 GDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDFYK 333


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_01831  PF07520  29  0.042  Virulence protein SrfB
		>PF07520#Virulence protein SrfB

Length = 1041

Score = 28.8 bits (64), Expect = 0.042
Identities = 24/119 (20%), Positives = 46/119 (38%), Gaps = 10/119 (8%)

Query: 40 MHMSEALVSAYPPARRLDRPLWLLRELLHRLPQVIGSYGSDVVILQRELLSTIPTLEFLT 99
+ ++EA++SA A DR + ++L +P +G G + T L++L
Sbjct: 696 VPLAEAILSACEDAEEADRIDIPVADVLGLVPTPVGEEGDEEGHEDASPQVTDEILDYLE 755

Query: 100 K--------APRILDVDDAIWLHRRGIAANSIARRVDHIVCG--NQYLADYFGQFGRPT 148
K R+ D+ + A + ++V +C + D GRP+
Sbjct: 756 KPATQLGAEGWRLADMVLSASREDLDAIAREVFQKVLGNMCEVIDHLGCDVVLLTGRPS 814


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_01835  NUCEPIMERASE  66  3e-14  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 66.0 bits (161), Expect = 3e-14
Identities = 67/356 (18%), Positives = 120/356 (33%), Gaps = 69/356 (19%)

Query: 5 NVLVTGATGFIGAALVNSLCSSGQ-----------YKVWAGCRRRGGAWPRGVTP----L 49
LVTGA GFIG + L +G Y V R G L
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 50 LLGELGSSVVWDAESAIDTVVHCAARVHV-MSETASDPLVEFRKANVQGT---LDLAREA 105
E + + + V R+ V S + +N+ G L+ R
Sbjct: 62 ADREGMTDLFASGH--FERVFISPHRLAVRYSLENPHAYAD---SNLTGFLNILEGCRHN 116

Query: 106 VSRGVRRFIFISSIKVNGEGTEPGRPY-TADSPPNPVDPYGVSKREAEQALLDLAEETGL 164
++ ++ SS V G + P+ T DS +PV Y +K+ E + GL
Sbjct: 117 ---KIQHLLYASSSSVYGLNRKM--PFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGL 171

Query: 165 EVVIIRPVLVYGPGVKAN--VQTMMRWLKRGVPLPL-GAIHNRRSLVSLDNLVDLIITCI 221
+R VYGP + + + + + G + + +R +D++ + II
Sbjct: 172 PATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQ 231

Query: 222 EHPA-----------------AVGQVFLVSDGEDLSTTELLRRMGRALGAPAR--LLPVP 262
+ A +V+ + + + + ++ + ALG A+ +LP+
Sbjct: 232 DVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQ 291

Query: 263 ASWIGAAAKVLNRQAFARRLCGSLQVDIMKTRQVLGWTPPVGVDQALEKTARSFLD 318
V + A D +V+G+TP V ++ + D
Sbjct: 292 ------PGDV--LETSA---------DTKALYEVIGFTPETTVKDGVKNFVNWYRD 330


20  PAKAF_01962  PAKAF_02010  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
PAKAF_01962  2  8  0.090003  lipid kinase YegS
PAKAF_01963  1  8  -0.394078  MOSC domain-containing protein
PAKAF_01964  2  9  -0.534112  lysozyme inhibitor LprI family protein
PAKAF_01965  2  8  -0.811034  probable soluble lytic transglycosylase
PAKAF_01966  1  8  -2.067095  probable ATP-binding component of ABC
PAKAF_01967  0  10  -2.641170  hypothetical protein
PAKAF_01968  0  11  -2.810194  universal stress protein
PAKAF_01969  0  10  -3.033626  hypothetical protein
PAKAF_01970  1  10  -3.409811  hypothetical protein
PAKAF_01971  2  11  -3.605754  fatty-acid oxidation complex alpha-subunit
PAKAF_01972  0  12  -3.085404  fatty-acid oxidation complex beta-subunit
PAKAF_01973  2  11  -3.049840  DUF1653 domain-containing protein
PAKAF_01974  2  8  -2.705267  DNA topoisomerase I
PAKAF_01975  4  15  -1.289938  hypothetical protein
PAKAF_01976  4  15  -0.573645  hypothetical protein
PAKAF_01977  3  14  -0.049918  cell division inhibitor SulA
PAKAF_01978  1  8  -0.004047  repressor protein LexA
PAKAF_01979  0  8  -0.522053  transcriptional regulator PsrA
PAKAF_01980  0  8  -1.033219  beta-N-acetyl-D-glucosaminidase
PAKAF_01981  -1  8  -1.381718  5-methylthioadenosine phosphorylase MtnP
PAKAF_01982  0  9  -1.661235  hypothetical protein
PAKAF_01983  0  8  -1.759201  transcription-repair coupling protein Mfd
PAKAF_01984  2  10  -2.878733  probable glyceraldehyde-3-phosphate
PAKAF_01985  1  10  -2.855673  aromatic amino acid transport protein AroP1
PAKAF_01986  0  9  -2.465711  Na+-translocating NADH:ubiquinone oxidoreductase
PAKAF_01987  0  10  -2.256405  Na+-translocating NADH:ubiquinone oxidoreductase
PAKAF_01988  0  12  -2.439701  Na+-translocating NADH:ubiquinone oxidoreductase
PAKAF_01989  1  12  -3.122232  Na+-translocating NADH:uniquinone oxidoreductase
PAKAF_01990  1  11  -2.381870  Na+-translocating NADH:quinone oxidoreductase
PAKAF_01991  2  10  -1.888071  Na+-translocating NADH:quinone oxidoreductase,
PAKAF_01992  2  10  -1.808977  FAD:protein FMN transferase
PAKAF_01993  1  11  -2.566764  (Na+)-NQR maturation NqrM
PAKAF_01994  0  11  -1.922035  soluble pyridine nucleotide transhydrogenase
PAKAF_01995  0  12  -1.567214  probable phosphodiesterase
PAKAF_01996  2  13  -2.032512  phosphodiesterase
PAKAF_01997  2  15  0.070845  hypothetical protein
PAKAF_01998  3  14  0.142456  lipoprotein localisation protein, LolE
PAKAF_01999  3  15  0.666923  lipoprotein localisation protein, LolD
PAKAF_02000  3  16  0.811691  lipoprotein localisation protein, LolC
PAKAF_02001  3  17  1.561409  DUF2062 domain-containing protein
PAKAF_02002  2  17  1.897570  DNA internalization-related competence protein
PAKAF_02003  2  13  -0.246128  probable tolQ-type transport protein
PAKAF_02004  1  17  -0.061614  biopolymer transporter ExbD
PAKAF_02005  4  17  0.435331  tetraacyldisaccharide 4*-kinase
PAKAF_02006  4  18  0.000138  Trm112 family protein
PAKAF_02007  4  17  -0.041156  3-deoxy-manno-octulosonate cytidylyltransferase
PAKAF_02008  3  15  -0.658279  phosphotyrosine protein phosphatase
PAKAF_02009  4  14  -0.570175  UDP-N-acetylpyruvoylglucosamine reductase
PAKAF_02010  3  15  -0.954068  ribonuclease E
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_01966  RTXTOXIND  33  0.003  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 33.3 bits (76), Expect = 0.003
Identities = 16/62 (25%), Positives = 26/62 (41%), Gaps = 2/62 (3%)

Query: 575 KLQRELEALPGQIDAVEAELAGVQETIAQ--QDFYLRPQDEQRETLARLDALQQELDALL 632
+ EL Q++ +E+E+ +E Q F D+ R+T + L EL
Sbjct: 263 EAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNE 322

Query: 633 ER 634
ER
Sbjct: 323 ER 324


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_01968  SHAPEPROTEIN  27  0.019  Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 27.4 bits (61), Expect = 0.019
Identities = 12/39 (30%), Positives = 18/39 (46%), Gaps = 2/39 (5%)

Query: 17 DPVMKRAAALATSNQARLSVVHVV-EPMAMAFGGDVPMD 54
V +RA + A V ++ EPMA A G +P+
Sbjct: 119 TQVERRAIRESAQG-AGAREVFLIEEPMAAAIGAGLPVS 156


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_01979  HTHTETR  68  3e-16  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 68.1 bits (166), Expect = 3e-16
Identities = 25/93 (26%), Positives = 40/93 (43%), Gaps = 1/93 (1%)

Query: 4 SETVERILDAAEQLFAEKGFAETSLRLITSKAGVNLAAVNYHFGSKKALIQAVFSRFLGP 63
ET + ILD A +LF+++G + TSL I AGV A+ +HF K L ++
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 64 FCVSLEKELDRRQAKPEAQ-HATLEDLLHLLVS 95
+ + P + L +L V+
Sbjct: 70 IGELELEYQAKFPGDPLSVLREILIHVLESTVT 102


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_02005  ENTSNTHTASED  29  0.022  Enterobactin synthetase component D signature.
		>ENTSNTHTASED#Enterobactin synthetase component D signature.

Length = 234

Score = 28.8 bits (64), Expect = 0.022
Identities = 29/128 (22%), Positives = 45/128 (35%), Gaps = 22/128 (17%)

Query: 15 HPALALLRPLEALYRRVANGRRADFLSGRKPAYRAPLPVLVVGNITVGGTGKTPM----I 70
L P R R+A+ L+GR A A L + V + G + P+ +
Sbjct: 26 REHDLLWLPHHDRLRSAGRKRKAEHLAGRIAAVHA-LREVGVRTVPGMGDKRQPLWPDGL 84

Query: 71 LWMIEHCRARGLRVGVISRGYGARPPTTPWRVRAEQDAAEAGDEPLMIVRRSGVPLMIDP 130
I HC L V + + G + E+ ++ L P +ID
Sbjct: 85 FGSISHCATTALAV-ISRQRIG---------IDIEKIMSQHTATEL-------APSIIDS 127

Query: 131 DRPRALQA 138
D + LQA
Sbjct: 128 DERQILQA 135


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_02010  IGASERPTASE  59  9e-11  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 58.9 bits (142), Expect = 9e-11
Identities = 55/316 (17%), Positives = 95/316 (30%), Gaps = 41/316 (12%)

Query: 760 RRSRGQRRRSNRRERQREVSGELEGSEATDNA-----AAPLNTVAAAAAAGIAVA--SEA 812
R G+ N +R + + +N + P N A V + A
Sbjct: 972 RNVNGRYDLYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPA 1031

Query: 813 VEANVEQAPATTSEAASETTASDETDASTSEAVETQDA-----DSEANT---------GE 858
+ + A S+ S+T +E DA+ + A + A + +ANT E
Sbjct: 1032 TPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSE 1091

Query: 859 TADIEAPVTVSVVRDEAGQSTLLVAQATEEAPFASESVESREDAESAVQPATEAAEEVAA 918
T + + T E + + + T+E P + V +++ VQP E A E
Sbjct: 1092 TKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARE--- 1148

Query: 919 PVPVEAAAPSEPATTEEPTPAIAAVPANATGRALNDPREKRRLQREAERLAR--EAAAAA 976
P + ++ T A PA T + P + + E A
Sbjct: 1149 NDPTVNI---KEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPA 1205

Query: 977 EAAAQAAPAVEEVPAVASEEA----SAQEEPAA---PQAEEIAQADVPSQ-----ADEAQ 1024
P + EPA +A D+ S +A+
Sbjct: 1206 TTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDAR 1265

Query: 1025 EAVQAEPEASGEDATD 1040
Q G+ +
Sbjct: 1266 AKAQFVALNVGKAVSQ 1281



Score = 57.4 bits (138), Expect = 3e-10
Identities = 51/349 (14%), Positives = 104/349 (29%), Gaps = 36/349 (10%)

Query: 508 EAQPVSSTRTLVRQEAAVKTVAPQQPAPQHTEAPVEPAKPMPEPSLFQGLVKSLVGLFAG 567
+ +++ + +V + + EAPV P P PS V A
Sbjct: 993 DTTNITTPNNIQADVPSVPSNNEEIARVD--EAPVPPPAP-ATPSETTETV-------AE 1042

Query: 568 KDQPAAKPAETSKPAAERQTRQDERRNGRQQNRRRDGRDGNRRDEERKPREERAERQPRE 627
+ +K E ++ A T Q+ ++ + N + +E + +E
Sbjct: 1043 NSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKE 1102

Query: 628 ERAERPNREERSERRREERAERPAREERQPREGREERAERTPREERQPREGREGREERSE 687
+ + E + + + + P++ + E + R+ +E +S+
Sbjct: 1103 TATVEKEEKAKVETEKTQEVPKVTSQV-SPKQEQSETVQPQAEPARENDPTVNIKEPQSQ 1161

Query: 688 RRREERAERPAREERQPREGREERAERPAREERQPREDRQARDAAALEAEALPNDESLEQ 747
E+PA+E E + +E
Sbjct: 1162 TNTTADTEQPAKETS--------SNVEQPVTESTTVN---------------TGNSVVEN 1198

Query: 748 DEQDDTDGERPRRRSRGQRRRSNRRERQ-REVSGELEGSEATDNAAAPLNTVAAAAAAGI 806
E +P S + NR R R V +E + + N + + +
Sbjct: 1199 PENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTN 1258

Query: 807 AVASEAVEANVEQAPATTSEAASETTASDETDASTSEAVETQDADSEAN 855
AV S+A A + +A S+ + E + V + N
Sbjct: 1259 AVLSDAR-AKAQFVALNVGKAVSQHISQLEMNNEGQYNVWVSNTSMNKN 1306



Score = 53.5 bits (128), Expect = 4e-09
Identities = 36/187 (19%), Positives = 60/187 (32%), Gaps = 27/187 (14%)

Query: 896 VESREDAESAVQPATEAAEEVAAP-VPVEAAAPSEPATTEEPTPAIAAVPANATGRALND 954
VE R T + P VP + P PA A A N
Sbjct: 985 VEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENS 1044

Query: 955 PREKRRLQR------EAERLAREAAAAAEAAAQAAPAVEEVPAVASEEASAQ----EEPA 1004
+E + +++ E RE A A++ +A EV SE Q +E A
Sbjct: 1045 KQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETA 1104

Query: 1005 APQAEEIAQADVPSQADEAQEA--------------VQAEPEASGEDATDTEH--AKKTE 1048
+ EE A+ + + + QAEP + + + ++
Sbjct: 1105 TVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNT 1164

Query: 1049 ESETSRP 1055
++T +P
Sbjct: 1165 TADTEQP 1171



Score = 50.1 bits (119), Expect = 5e-08
Identities = 53/338 (15%), Positives = 95/338 (28%), Gaps = 62/338 (18%)

Query: 616 PREERAERQPREERAERPNREERSERRREERAERPAREERQPREGREERAERTPREERQP 675
P E+ + PN + E AR + P A TP E +
Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVP---PPAPATPSETTET 1039

Query: 676 REGREGREERSERRREERAERPAREERQPREGREERAERPAREERQPREDRQARDAAALE 735
+E ++ + E+ A + R+ + E ++ A + Q + A
Sbjct: 1040 VAENSKQESKTVEKNEQDATETTAQNREVAK--EAKSNVKA--------NTQTNEVAQSG 1089

Query: 736 AEALPNDESLEQDEQDDTDGERPRRRSRGQRRRSNRRERQREVSGELEGSEATDNAAAPL 795
+E E+ + ++ E+ + + + +VS + E SE A P
Sbjct: 1090 SET---KETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPA 1146

Query: 796 NTVAAAAAAGIAVASEAVEANVEQAPATTSEAASETTASDETDASTSEAVETQDADSEAN 855
+ A+ EQ TS + T + + VE + + A
Sbjct: 1147 RENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPAT 1206

Query: 856 TGETADIEAPVTVSVVRDEAGQSTLLVAQATEEAPFASESVESREDAESAVQPATEAAEE 915
T T + E+ S ++R P
Sbjct: 1207 TQPTVNSES----------------------------SNKPKNRHRRSVRSVPHNV---- 1234

Query: 916 VAAPVPVEAAAPSEPATTEEPTPAIAAVPANATGRALN 953
EPATT + A+ + T N
Sbjct: 1235 -------------EPATTSSNDRSTVAL-CDLTSTNTN 1258



Score = 43.9 bits (103), Expect = 3e-06
Identities = 47/237 (19%), Positives = 74/237 (31%), Gaps = 25/237 (10%)

Query: 427 EALKDRTAEVRARVPFQVAAFLLNEKRNAITKIELRTRARIFILPDDHLETPHFEVQRLR 486
A T E A Q + + +++A + T EV +
Sbjct: 1030 PATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSG 1089

Query: 487 DDSPELVAGQTSYEMATVEHEEAQPVSSTRTLVRQEAAVKT--VAPQQPAPQHTEAPVEP 544
++ E +T E ATVE EE V + +T QE T V+P+Q + + EP
Sbjct: 1090 SETKETQTTETK-ETATVEKEEKAKVETEKT---QEVPKVTSQVSPKQEQSETVQPQAEP 1145

Query: 545 AKPMPEPSLFQGLVKSLVGLFAGKDQPAAKPAETSKPAAERQTRQDERRNGRQQNRRRDG 604
A +P+ K+ ++T+ A Q ++ N Q
Sbjct: 1146 A-RENDPT------------VNIKE----PQSQTNTTADTEQPAKETSSNVEQPVTESTT 1188

Query: 605 RDGNRRDEERKPREERAERQP--REERAERPNREERSERRREERAERPAREERQPRE 659
+ E A QP E + +P R R PA R
Sbjct: 1189 VNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRS 1245


21  PAKAF_02031  PAKAF_02045  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_02031  2       15       -1.874304   B12-binding domain-containing radical SAM
PAKAF_02032  3       16       -1.987258   DUF4823 domain-containing protein
PAKAF_02033  3       16       -1.786453   DUF1285 domain-containing protein
PAKAF_02034  2       16       -1.455092   electron transfer flavoprotein-ubiquinone
PAKAF_02035  2       15       -0.421584   electron transfer flavoprotein beta-subunit
PAKAF_02036  0       13       1.058362    electron transfer flavoprotein alpha-subunit
PAKAF_02037  0       10       1.117784    FabV
PAKAF_02038  -2      5        2.815902    probable lipase
PAKAF_02039  -2      5        3.199271    precorrin-3 methylase
PAKAF_02040  -2      4        3.479014    CobE
PAKAF_02041  -2      5        3.521827    CbtA family protein
PAKAF_02045  -2      6        3.202779    cobalamin biosynthesis protein CobW
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
PAKAF_02035  ALARACEMASE  28     0.045    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 27.8 bits (62), Expect = 0.045
Identities = 22/85 (25%), Positives = 39/85 (45%), Gaps = 7/85 (8%)

Query: 19 VKADNSGVDLANVKM---SMNPFCEIAVEEAVRLKEKGVATEIVAVSVGPTAAQEQLRTA 75
VKA+ G + + + + F + +EEA+ L+E+G I+ + G AQ+
Sbjct: 34 VKANAYGHGIERIWSAIGATDGFALLNLEEAITLRERGWKGPIL-MLEGFFHAQD---LE 89

Query: 76 LALGADRAILVESNDELNSLAVAKL 100
+ V SN +L +L A+L
Sbjct: 90 IYDQHRLTTCVHSNWQLKALQNARL 114


22  PAKAF_02075  PAKAF_02099  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_02075  1       12       3.854250    probable transcriptional regulator
PAKAF_02076  1       14       2.890364    LysE family translocator
PAKAF_02077  0       14       2.532122    MBL fold metallo-hydrolase
PAKAF_02078  0       14       3.315390    probable permease of ABC transporter
PAKAF_02079  -2      15       3.196206    ABC transporter substrate-binding protein
PAKAF_02080  -1      15       3.812875    probable ATP-binding component of ABC
PAKAF_02081  -1      12       4.048542    probable TonB-dependent receptor
PAKAF_02083  2       10       5.609654    UPF0059 membrane protein
PAKAF_02084  0       10       5.558758    cobalt-precorrin-6A reductase
PAKAF_02085  1       10       5.334302    cobalamin biosynthetic protein CbiD
PAKAF_02086  -1      10       4.611450    precorrin-6y-dependent methyltransferase CobL
PAKAF_02088  -1      12       3.698020    probable oxidoreductase
PAKAF_02089  -1      10       2.531850    precorrin isomerase CobH
PAKAF_02090  1       11       1.785146    precorrin-2 methyltransferase CobI
PAKAF_02091  2       11       2.044613    precorrin-3 methylase CobJ
PAKAF_02092  1       12       0.379572    transporter substrate-binding domain-containing
PAKAF_02093  1       12       0.497931    DUF4398 domain-containing protein
PAKAF_02094  0       12       1.591457    probable outer membrane protein precursor
PAKAF_02095  2       12       1.461070    probable transcriptional regulator
PAKAF_02096  1       10       1.090725    SbrI
PAKAF_02097  0       9        1.130127    SbrR
PAKAF_02098  1       9        2.374310    probable sigma-70 factor, ECF subfamily
PAKAF_02099  2       10       2.834679    anti-sigma factor SbrR
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_02075  PF05272  28     0.034    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.5 bits (63), Expect = 0.034
Identities = 6/16 (37%), Positives = 8/16 (50%)

Query: 258 FRRAYGMTPAAYRRQC 273
+R AYG + RQ
Sbjct: 672 YRGAYGRYVQDHPRQV 687


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_02077  PF05932  27     0.049    Tir chaperone protein (CesT)
		>PF05932#Tir chaperone protein (CesT)

Length = 127

Score = 27.1 bits (60), Expect = 0.049
Identities = 9/53 (16%), Positives = 18/53 (33%), Gaps = 8/53 (15%)

Query: 74 ADHLSAAIFLQRELGGCLAIGARITQVQAKFSGLFNLGEAFPVDGRQFEHLFE 126
L+ A+ G L + + SGL++ ++ P + L
Sbjct: 66 QCLLAGALNPLLNAGPGLGLDEK--------SGLYHAYQSIPREKLSVPTLKR 110


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_02079  FERRIBNDNGPP  40     8e-06    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 39.9 bits (93), Expect = 8e-06
Identities = 49/266 (18%), Positives = 96/266 (36%), Gaps = 43/266 (16%)

Query: 43 PSRAVSHDINLTEMMVALGLQTRMVGYTGISGW--WKNADPGLIAALKPLPELV-----A 95
P+R V+ + E+++ALG+ G + W + +P PLP+ V
Sbjct: 35 PNRIVALEWLPVELLLALGIVP--YGVADTINYRLWVS-EP-------PLPDSVIDVGLR 84

Query: 96 RYPTAETLLDVDADFFFAGWGYGMRVGGDLTPASLEPLG-VKVYELSESCAQIGEPRRAS 154
P E L ++ F GYG +P L + + + S+ + R++
Sbjct: 85 TEPNLELLTEMKPSFMVWSAGYGP------SPEMLARIAPGRGFNFSDGKQPLAMARKS- 137

Query: 155 LDELYRDLRNLGRIFDVEPRAERLVASLQARIERARAGIPANTEAPRVF--LYDSGEDRP 212
L + + +++ AE +A + I + P + L D
Sbjct: 138 -------LTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLV 190

Query: 213 FTSGRLGMPQALIEAAGGRSVTDDVAASW--TQVNWESVVA-RDPQVIVIVDYGETSAAQ 269
F + Q +++ G + W T V+ + + A +D V+ D+ +
Sbjct: 191 FGPN--SLFQEILDEYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCF-DHDNSKDMD 247

Query: 270 KQRFLEENPALRSLTAIRERRFIVLP 295
L P +++ +R RF +P
Sbjct: 248 A---LMATPLWQAMPFVRAGRFQRVP 270


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
PAKAF_02094  OMPADOMAIN  102    2e-27    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 102 bits (255), Expect = 2e-27
Identities = 44/126 (34%), Positives = 63/126 (50%), Gaps = 11/126 (8%)

Query: 155 DVLFDFNRAELKPAANRTALKLVQFL-QLNPRRV-IRIEGYTDSVGDRQANLDLSRERAQ 212
DVLF+FN+A LKP +L L L+P+ + + GYTD +G N LS RAQ
Sbjct: 220 DVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQ 279

Query: 213 AVADVLADLGVDPARMQVVGYGEAFPVTDNASNRGR---------AQNRRVEIVFSNDKG 263
+V D L G+ ++ G GE+ PVT N + + A +RRVEI K
Sbjct: 280 SVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEVKGIKD 339

Query: 264 QLSAPR 269
++ P+
Sbjct: 340 VVTQPQ 345


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
PAKAF_02095  HTHFIS  50     1e-09    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 50.2 bits (120), Expect = 1e-09
Identities = 35/157 (22%), Positives = 61/157 (38%), Gaps = 11/157 (7%)

Query: 10 LVIADSFPVMQWALQRYLSEECGRQVLAVVGDSDSLVERLADLPPESILITELGLPGQRS 69
+++AD ++ L + LS G V + +A +++T++ +P
Sbjct: 6 ILVADDDAAIRTVLNQALSRA-GYDVRITSNAATLW-RWIAAGDG-DLVVTDVVMPD--- 59

Query: 70 RDGIHLVEWLTRHCPQMKVMVYSVFSAPLLAKAVLRSGASAYISKRSPLETLKAALECMA 129
+ L+ + + P + V+V S + + A GA Y+ K L L + A
Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG-RA 118

Query: 130 LGQTFLDPG-LHPQRHTGKPL---SPTEVDILRRLAR 162
L + P L G PL S +I R LAR
Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLAR 155


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
PAKAF_02099  IGASERPTASE  36     1e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 36.2 bits (83), Expect = 1e-04
Identities = 23/120 (19%), Positives = 41/120 (34%), Gaps = 8/120 (6%)

Query: 112 AAAAKRAMRAPAAPAPLSSEMSEP--PALLASYASSGEAPQLMAEAAPAAPAALADRPPA 169
+ + P + + P P+ A EAP + APA P+ +
Sbjct: 985 VEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAP--VPPPAPATPSETTETVAE 1042

Query: 170 QAAQQAK---VQAALAGDFVAQARGKAVAVKPEVLDEALGAVLALREQGKTEQAATQLAE 226
+ Q++K A + AQ R A K V +A + +T++ T +
Sbjct: 1043 NSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVA-QSGSETKETQTTETK 1101


23  PAKAF_02108  PAKAF_02121  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_02108  2       11       1.429042    expressed protein with apparent function in
PAKAF_02109  3       13       0.571076    putative repressor of atu genes
PAKAF_02110  1       12       1.286683    CPBP family intramembrane metalloprotease
PAKAF_02111  1       12       1.405116    DUF2897 family protein
PAKAF_02112  -1      10       1.690233    probable two-component sensor
PAKAF_02113  -1      10       1.922038    probable two-component response regulator
PAKAF_02114  -1      8        1.831314    multidrug/biocide efflux PACE transporter
PAKAF_02115  0       10       2.939652    probable transcriptional regulator
PAKAF_02116  1       12       2.647818    hypothetical protein
PAKAF_02117  3       14       3.194314    probable transcriptional regulator
PAKAF_02118  1       14       3.233581    orotidine 5'-phosphate decarboxylase
PAKAF_02119  2       12       2.968818    AAA family ATPase
PAKAF_02120  2       12       2.760963    DUF58 domain-containing protein
PAKAF_02121  2       11       2.170399    transglutaminase protein A, TgpA
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_02109  HTHTETR  70     4e-17    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 70.0 bits (171), Expect = 4e-17
Identities = 32/190 (16%), Positives = 65/190 (34%), Gaps = 8/190 (4%)

Query: 14 ESARGKLLQTAAHLFRSKGYERTTVRDLASAVGIQSGSIFHHFKSKDEILRSVMEETILY 73
+ R +L A LF +G T++ ++A A G+ G+I+ HFK K ++ + E +
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 74 NTALMRAALAD-AEDLRERVLGLIRCELQSIMGGTGEAMAVLVYEWRSLSAEGQAYILGL 132
L A D + ++ L+S + + + + + A +
Sbjct: 70 IGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQA 129

Query: 133 RDIYEQMWLD----VLGEARLAGYCQG--DPFILRRFLTGALSWT-TTWFRPEGPMSLDQ 185
+ D L A + G +S W L +
Sbjct: 130 QRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKK 189

Query: 186 LAEEALALVI 195
A + +A+++
Sbjct: 190 EARDYVAILL 199


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_02112  PF06580  45     2e-07    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 45.2 bits (107), Expect = 2e-07
Identities = 35/172 (20%), Positives = 72/172 (41%), Gaps = 24/172 (13%)

Query: 198 QIGELVSGLKDFAR--LDRAFSEEVDLND---CVRNAVLIARTAIKDKAEISSQLGELPL 252
+ E+++ L + R L + + +V L D V + + +A +D+ + +Q+ +
Sbjct: 192 KAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIM 251

Query: 253 IACAPSQINQVLL-NLLTNAAQAMERFGRILLKSWADERQVFLSVQDNGKGMPAEVLGRI 311
P + Q L+ N + + + + G+ILLK D V L V++ G
Sbjct: 252 DVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLA-------- 303

Query: 312 FDPFFTTKPVGQGTGLGLSISYKIIQQHGG---TIRVASEPGRGTRFLISLP 360
K + TG GL + +Q G I+++ + G+ ++ +P
Sbjct: 304 ------LKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVN-AMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
PAKAF_02113  HTHFIS  98     5e-25    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 97.6 bits (243), Expect = 5e-25
Identities = 29/136 (21%), Positives = 57/136 (41%), Gaps = 2/136 (1%)

Query: 7 RILFVDDEERILRSLAMQF-RRHYEVLTESDPRRALERLKTERIQVLVSDQRMPQMSGAE 65
IL DD+ I L R Y+V S+ + ++V+D MP + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 66 LLAQARERYPETLRILLTGYSDLDAAVDALNDGGIFRYLTKPWNPQEMAFTLRQAAEIAS 125
LL + ++ P+ ++++ + A+ A + G + YL KP++ E+ + +A
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKAS-EKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 126 RQGLPAPPAATLAAPL 141
R+ + PL
Sbjct: 124 RRPSKLEDDSQDGMPL 139



Score = 54.8 bits (132), Expect = 1e-10
Identities = 27/139 (19%), Positives = 55/139 (39%), Gaps = 5/139 (3%)

Query: 142 SVLLLDDDPETLDCVGAFCHAGGHRLLRARNLAEALVWLNTEPVEVLVSDLKLAGEHTAP 201
++L+ DDD + G+ + N A W+ +++V+D+ + E+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 202 LLKSLAQAHPRLLSLVVTPFRDTQALLELINQAQIFRYLPKPIRRGLFEKGLKAAAEQAL 261
LL + +A P L LV++ ++ + + YLPKP L +AL
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKAS-EKGAYDYLPKP----FDLTELIGIIGRAL 119

Query: 262 LWRGRSLPEVDRLAEVPRD 280
R +++ ++
Sbjct: 120 AEPKRRPSKLEDDSQDGMP 138


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
PAKAF_02119  HTHFIS  32     0.003    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 31.7 bits (72), Expect = 0.003
Identities = 12/43 (27%), Positives = 21/43 (48%)

Query: 103 DEINRATPKSQSALLEAMEEGQVTIEGATRPLPEPFFVIATQN 145
DEI +Q+ LL +++G+ T G P+ ++A N
Sbjct: 238 DEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATN 280


24  PAKAF_02172  PAKAF_02237  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_02172  2       21       -3.653176   conserved hypothetical protein
PAKAF_02173  1       30       -5.502723   GAF domain-containing protein
PAKAF_02174  1       36       -6.842672   probable glutathione S-transferase
PAKAF_02175  0       43       -8.556674   hypothetical protein
PAKAF_02179  0       48       -9.168253   ***integrase,Putative prophage CPS-53
PAKAF_02180  2       56       -9.591765   Cro/Ci family transcriptional
PAKAF_02181  3       57       -9.728305   transcriptional regulator, LysR family,Hca
PAKAF_02182  2       58       -10.309083  LysR family transcriptional regulator,D-malate
PAKAF_02183  3       60       -10.561699  drug efflux transporter,Multidrug resistance
PAKAF_02184  3       61       -10.533881  multidrug resistance protein A,Inner membrane
PAKAF_02185  3       57       -11.007954  outer membrane channel lipoprotein,Multidrug
PAKAF_02186  2       54       -10.950368  TetR family transcriptional regulator,HTH-type
PAKAF_02187  1       47       -9.071714   TetR family transcriptional regulator
PAKAF_02188  1       44       -8.398861   TetR family transcriptional regulator
PAKAF_02189  1       41       -7.610810   glycoside hydrolase family
PAKAF_02190  2       35       -6.277584   cytochrome o ubiquinol oxidase subunit
PAKAF_02191  0       32       -4.684390   putative methyl-accepting chemotaxis
PAKAF_02192  0       27       -3.562293   relaxase,Predicted HD-superfamily
PAKAF_05923  1       24       -3.849045   hypothetical protein
PAKAF_02193  2       21       -3.193912   integrase family protein
PAKAF_02194  3       19       -2.670962   RES domain
PAKAF_02195  3       21       -1.961371   TraG-like family protein,TraG-like protein,
PAKAF_02196  2       19       -0.703625   hypothetical protein
PAKAF_02197  2       18       -0.684438   putative integral membrane protein,integrating
PAKAF_02198  2       17       -0.507420   integrating conjugative element
PAKAF_02199  2       18       0.106784    integrating conjugative element
PAKAF_02200  2       18       -0.152113   dsba oxidoreductase,Protein-disulfide
PAKAF_02201  2       18       -0.128364   type IV secretory pathway, VirB4 component,type
PAKAF_02202  2       17       0.351467    lipoprotein,conjugative transfer region
PAKAF_02203  1       17       0.193485    integrating conjugative element
PAKAF_02204  3       19       1.077651    integrating conjugative element
PAKAF_02205  4       18       -0.708169   integrating conjugative element
PAKAF_02206  2       15       -1.403335   conjugative transfer region protein,conjugative
PAKAF_02207  2       15       -1.288222   integrating conjugative element membrane
PAKAF_02208  2       16       -0.383146   conjugative transfer protein,integrating
PAKAF_02209  3       16       0.357057    putative transmembrane protein,integrative
PAKAF_02210  3       16       0.412668    integrating conjugative element membrane
PAKAF_02211  3       18       0.796266    conjugative coupling factor TraD,Type IV
PAKAF_02212  1       23       2.108570    integrating conjugative element
PAKAF_02213  1       19       -0.087642   lytic transglycosylase, catalytic
PAKAF_02214  1       20       -0.984721   integrating conjugative element
PAKAF_02215  2       23       -2.269025   putative secreted protein,Uncharacterized
PAKAF_02216  2       22       -2.663353   pilL lipoprotein (type IV pili),integrating
PAKAF_02217  1       18       -2.880458   hypothetical protein
PAKAF_02218  1       17       -2.917968   Helicase conserved C-terminal domain
PAKAF_02219  3       22       -2.815054   O-methyl transferase family protein
PAKAF_02220  1       25       -2.670611   O-methyl transferase family domain-containing
PAKAF_02221  0       24       -1.958649   O-methyl transferase family protein,DNA
PAKAF_02222  0       25       -1.530180   hypothetical protein
PAKAF_02223  0       27       -1.816829   hypothetical protein
PAKAF_02224  1       25       -1.619530   hypothetical protein
PAKAF_02225  2       21       -1.964518   hypothetical protein
PAKAF_02226  3       20       -2.291354   Protein of unknown function (DUF3275)
PAKAF_02227  3       18       -2.828679   Protein of unknown function (DUF3577)
PAKAF_02229  2       19       -3.597914   Protein of unknown function (DUF3085)
PAKAF_02230  1       18       -3.644750   GTPase SAR1 and related small G protein
PAKAF_02232  2       21       -1.651821   uridylate kinase
PAKAF_02233  0       21       -1.229347   amino acid ABC transporter/signal transduction
PAKAF_02234  1       24       -1.030245   hypothetical protein
PAKAF_02235  2       23       -0.957571   hypothetical protein
PAKAF_02237  2       22       -0.664699   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_02183  TCRTETB  121    4e-32    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 121 bits (306), Expect = 4e-32
Identities = 77/391 (19%), Positives = 164/391 (41%), Gaps = 16/391 (4%)

Query: 26 FVVVLDTTITNVAMPTISGFLGVSTTEGTWIITAYAVAEAITVPLTGWLSRQFGQVKVFI 85
F VL+ + NV++P I+ W+ TA+ + +I + G LS Q G ++ +
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 86 VSVALFVLFSVCCGLSWSLPSLVLF-RVLQGFAGGPLIPLSATLLLSVFPEKKSNVALAL 144
+ + SV + S SL++ R +QG L ++ P++ A L
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 145 WGMMTVVAPIIGPILGGLISDNWRWQWVFYINIGFGLLVGLGSWWVLKGRETKIQRSRLD 204
G + + +GP +GG+I+ W ++ I + + + + + ++ + D
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM---ITIITVPFLMKLLKKEVRIKGHFD 200

Query: 205 GVGLTLLVVFVTAFQVMLDKGRELDWFSSNVILTCAIVSAISLILFIIWELTDEQPVIDL 264
G+ L+ V + F + F+++ ++ IVS +S ++F+ P +D
Sbjct: 201 IKGIILMSVGIVFFML----------FTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDP 250

Query: 265 SVLKSRNWVVSTITLCLMYGIFFGNIVLTPLWLQQWMGYTATWAGLATAPMGILAV-VTS 323
+ K+ +++ + +++G G + + P ++ + G G ++V +
Sbjct: 251 GLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFG 310

Query: 324 PLVGRLLPKVDPRLLVTYGMGVLAASFVMRALMTSQVDFMSVAIPMFVLGAGIPACVITL 383
+ G L+ + P ++ G+ L+ SF+ + + + I +FVLG G+ +
Sbjct: 311 YIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLG-GLSFTKTVI 369

Query: 384 TSLGVSDLPPDRIAGGSGLQNFLRVLCMAIG 414
+++ S L G L NF L G
Sbjct: 370 STIVSSSLKQQEAGAGMSLLNFTSFLSEGTG 400


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
PAKAF_02184  RTXTOXIND  97     4e-24    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 97.2 bits (242), Expect = 4e-24
Identities = 75/413 (18%), Positives = 134/413 (32%), Gaps = 74/413 (17%)

Query: 25 KKMLIAFGAALLVVLACYFIWLIF--------FAGKTVTTDNAYTAVEVAQVTPLVSGPV 76
+ ++ L FI + GK + + + P+ + V
Sbjct: 54 SRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKE------IKPIENSIV 107

Query: 77 KEVKVVDTQAVHAGDVLVVLDDTDAKIALEQAEADLGRAR-------------------- 116
KE+ V + ++V GDVL+ L A+ + ++ L +AR
Sbjct: 108 KEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPE 167

Query: 117 ------------------RQVRQIVANDTTLAGQMDQRAAAIQSAQHE---VTRARSRYD 155
R I +T Q Q+ + + E V +RY+
Sbjct: 168 LKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYE 227

Query: 156 KAVLDEKRR----RNLVEGGAVSAQDFTDSQAELREATAALGQAEANLKAAGAASVSAKG 211
EK R +L+ A++ + + + EA L ++ L+ + +SAK
Sbjct: 228 NLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKE 287

Query: 212 SRQANEALF----LDSTVETNPIVVTAKAHADQARVNLDRTVLRAPVDGIVTQRSV-DIG 266
Q LF LD +T + + +V+RAPV V Q V G
Sbjct: 288 EYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEG 347

Query: 267 QQVQAGMRLMNIVPIQQIY-VDANFKEGQLRNVKPGQKARLTADIYGDDVEYDGRVEGFA 325
V LM IVP V A + + + GQ A + + + Y G + G
Sbjct: 348 GVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAF-PYTRY-GYLVG-- 403

Query: 326 GGSGSALSVIPAQNATGNWIKVVQRLPVRIRLDPEQLKKHPLRIGLSMEVSVD 378
+ I + +V + + I + + + M V+ +
Sbjct: 404 -----KVKNINLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAE 451


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_02186  HTHTETR  62     1e-13    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 61.6 bits (149), Expect = 1e-13
Identities = 33/171 (19%), Positives = 62/171 (36%), Gaps = 5/171 (2%)

Query: 14 APKEGAAVSEKARVVLAGARAVFLANGFAAATTDMIQQAAGVSKSTVYAYYPNKEALFVA 73
A K E + +L A +F G ++ + I +AAGV++ +Y ++ +K LF
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 74 VVEAECARFLKTINETQFSGKRLGETLLAMARAYLEIVLSSDAL--ALFRMVMAEAPRLP 131
+ E + + E Q + L++ R L VL S ++ +
Sbjct: 62 IWELSESNIGELELEYQ---AKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCE 118

Query: 132 ELGRRFYVAGPARINAIVAQRLEIGISNGELELGGIGLDSAASLFANMVRG 182
+G V R + + +E + D A ++RG
Sbjct: 119 FVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRG 169


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_02192  PF03544  29     0.038    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 29.2 bits (65), Expect = 0.038
Identities = 27/125 (21%), Positives = 35/125 (28%), Gaps = 8/125 (6%)

Query: 370 ERPAPFAGTVAIDAAPSD----QDADASASTPTIASKPPAENQEPPLWENMGGAAAPARP 425
E PAP AP+D Q P EPP + +P
Sbjct: 42 ELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKP 101

Query: 426 PAAQTSPDVVEDLLEMVGMADPSNASQDGEAASVDAPAPTSEAAMSATA----APPPPVS 481
VE V + AS A + T+ AA S + P +S
Sbjct: 102 KPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALS 161

Query: 482 MAAPQ 486
PQ
Sbjct: 162 RNQPQ 166


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
PAKAF_02213  OMPADOMAIN  32     0.001    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 31.8 bits (72), Expect = 0.001
Identities = 34/137 (24%), Positives = 54/137 (39%), Gaps = 24/137 (17%)

Query: 2 AAPIRRRSSRARATLGIAWLLGAGIPALAAQAQELPPPAYQIAAQQAGVPSPVLFAV--- 58
A I R +LG+++ G G A P PA ++ + + S VLF
Sbjct: 171 AHTIGTRPDNGMLSLGVSYRFGQGEAAPVVAPA--PAPAPEVQTKHFTLKSDVLFNFNKA 228

Query: 59 --------ALQESGTKLRGRLVPWPWTLNVAGQSERYATRAEACA-GLRRALARA----- 104
AL + ++L L P ++ V G ++R + A RRA +
Sbjct: 229 TLKPEGQAALDQLYSQLS-NLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQSVVDYLIS 287

Query: 105 ---PASRIDA-GLGQVN 117
PA +I A G+G+ N
Sbjct: 288 KGIPADKISARGMGESN 304


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_02220  56KDTSANTIGN  27     0.014    Rickettsia 56kDa type-specific antigen protein sign...
		>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature.

Length = 533

Score = 27.2 bits (60), Expect = 0.014
Identities = 10/29 (34%), Positives = 13/29 (44%)

Query: 52 GETDEATRQANDVAIGADDPMISHFRITP 80
GE D D G D P+ F++TP
Sbjct: 108 GEVDSKGEIKADSGGGTDAPIRKPFKLTP 136


25  PAKAF_02277  PAKAF_02296  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_02277  -1      14       -3.468567   probable transcriptional regulator
PAKAF_02278  0       14       -3.834932   acyl-CoA thioesterase
PAKAF_02279  -2      10       -3.037909   VacJ family lipoprotein
PAKAF_02280  0       30       -6.544464   PilZ domain-containing protein
PAKAF_02281  0       27       -6.329058   probable two-component response regulator
PAKAF_02282  0       29       -6.405317   STAS domain-containing protein
PAKAF_02283  0       30       -6.184845   transaldolase
PAKAF_02284  1       42       -8.278133   tRNA-dihydrouridine synthase A
PAKAF_02285  2       49       -9.648391   pseudaminidase
PAKAF_02286  0       30       -5.370206   hypothetical protein
PAKAF_02287  0       25       -3.897542   hypothetical protein
PAKAF_02288  -1      23       -3.291356   hypothetical protein
PAKAF_02289  0       22       -2.908734   structural protein
PAKAF_02290  -2      11       0.332776    hypothetical protein
PAKAF_02292  -2      9        1.091323    hypothetical protein
PAKAF_02293  -1      9        0.754242    probable chemotaxis transducer
PAKAF_02294  0       14       0.854548    carboxypeptidase G2 precursor
PAKAF_02295  0       15       0.788658    GAF domain-containing protein
PAKAF_02296  2       16       0.070530    helix-turn-helix transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_02279  VACJLIPOPROT  272    3e-95    VacJ lipoprotein signature.
		>VACJLIPOPROT#VacJ lipoprotein signature.

Length = 251

Score = 272 bits (698), Expect = 3e-95
Identities = 75/231 (32%), Positives = 113/231 (48%), Gaps = 15/231 (6%)

Query: 16 LACASLALAPTLSLAAS--------EEDPWESINRPIFTFN-DTLDTYALKPLAQGYQKV 66
L ++LAL TL + + DP E NR ++ FN + LD Y ++P+A ++
Sbjct: 3 LRLSALALGTTLLVGCASSGTDQQGRSDPLEGFNRTMYNFNFNVLDPYIVRPVAVAWRDY 62

Query: 67 TPNFVQDGVHNFFNNLGDVKNLANNLLQAKFHNAGVDTSRLLFNSTFGLAGLIDVATPMG 126
P ++G+ NF NL + + N LQ + V +R N+ G+ G IDVA
Sbjct: 63 VPQPARNGLSNFTGNLEEPAVMVNYFLQGDPYQGMVHFTRFFLNTILGMGGFIDVAGMAN 122

Query: 127 LQ---RNDEDFGQTLGYWGVGSGPYVMLPFLGPSTLRDAPAKIPDIYVSPYHYMDDVRAR 183
+ FG TLG++GVG GPYV LPF G TLRD + D ++ +
Sbjct: 123 PKLQRTEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRDDGGDMADALYPVLSWLTWPMSV 182

Query: 184 NVMFGINTVDTRANLLKSEKLI--SGDKYIFIRNAYLQNREFKVKDGEVED 232
+ + ++TRA LL S+ L+ S D YI +R AY Q +F GE++
Sbjct: 183 G-KWTLEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGELKP 232


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
PAKAF_02281  HTHFIS  110    7e-29    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 110 bits (277), Expect = 7e-29
Identities = 39/128 (30%), Positives = 60/128 (46%), Gaps = 1/128 (0%)

Query: 6 ATLLIIDDDEVVRESLAAYLEDSNFKVLQALNGLQGLQIFESEQPDLVICDLRMPQIDGL 65
AT+L+ DDD +R L L + + V N + + DLV+ D+ MP +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 66 ELIRRIRQTASETPIIVLSGAGVMSDAVEALRLGAADYLIKPLEDLAVLEHSVRRALDRA 125
+L+ RI++ + P++V+S A++A GA DYL KP DL L + RAL
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF-DLTELIGIIGRALAEP 122

Query: 126 YLRVENQR 133
R
Sbjct: 123 KRRPSKLE 130


26  PAKAF_02321  PAKAF_02331  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_02321  0       20       -3.080843   alpha/beta hydrolase
PAKAF_02322  2       14       -4.044910   hypothetical protein
PAKAF_02323  2       14       -3.304931   putative lipoprotein
PAKAF_02324  1       11       -2.691876   putative membrane protein
PAKAF_02325  1       10       -2.201133   cupin domain-containing protein
PAKAF_02326  0       13       -2.008092   hypothetical protein
PAKAF_02327  0       12       -2.254413   OprQ
PAKAF_02328  2       17       -1.232180   hypothetical protein
PAKAF_02329  2       17       -1.649547   probable transcriptional regulator
PAKAF_02330  2       17       -1.802591   PACE efflux transporter
PAKAF_02331  2       18       -1.426013   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
PAKAF_02325  MICOLLPTASE  28     0.012    Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 27.8 bits (61), Expect = 0.012
Identities = 12/24 (50%), Positives = 15/24 (62%)

Query: 33 NLNAYASADGSKLMGTWICTPGKW 56
N YA+ADG+KL T PGK+
Sbjct: 1059 NYVDYANADGNKLSNTCKLNPGKY 1082


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
PAKAF_02329  TCRTETOQM  29     0.028    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 29.1 bits (65), Expect = 0.028
Identities = 12/35 (34%), Positives = 16/35 (45%)

Query: 104 DSGRLFGALRTLSERYPLLDVEVLSAAQDDALALL 138
L AL +S+ PLL V SA + L+ L
Sbjct: 357 QREMLLDALLEISDSDPLLRYYVDSATHEIILSFL 391


27  PAKAF_02345  PAKAF_02374  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_02345  2       11       0.058942    probable methionine aminopeptidase
PAKAF_02346  0       15       1.352906    hypothetical protein
PAKAF_02347  -2      10       -1.314827   DUF3509 domain-containing protein
PAKAF_02348  -1      12       -2.633235   hypothetical protein
PAKAF_02349  -1      12       -2.897090   hypothetical protein
PAKAF_02350  0       14       -3.504525   hypothetical protein
PAKAF_02351  0       14       -3.797251   probable hydrolase
PAKAF_02353  0       12       -3.343714   threonyl-tRNA synthetase
PAKAF_02354  2       16       -3.559160   translation initiation factor IF-3
PAKAF_02355  1       14       -2.727120   50S ribosomal protein L35
PAKAF_02356  0       18       -5.296695   50S ribosomal protein L20
PAKAF_02357  0       20       -6.370615   phenylalanyl-tRNA synthetase, alpha-subunit
PAKAF_02358  2       31       -8.215639   phenylalanyl-tRNA synthetase, beta subunit
PAKAF_02359  2       40       -10.728939  integration host factor, alpha subunit
PAKAF_02360  2       45       -11.570819  MerR family transcriptional regulator
PAKAF_02362  3       49       -11.985548  *type I restriction-modification system subunit
PAKAF_02363  3       54       -12.491261  type I restriction-modification system subunit
PAKAF_02364  4       62       -13.033751  restriction modification system DNA specificity
PAKAF_02365  3       60       -12.865233  putative ABC transporter,Predicted ATP-binding
PAKAF_02366  5       67       -13.542080  conserved hypothetical protein
PAKAF_02367  5       61       -12.456789  Restriction endonuclease
PAKAF_05924  4       56       -11.550317  AlpA family transcriptional regulator
PAKAF_02368  5       56       -11.101145  Reverse transcriptase (RNA-dependent DNA
PAKAF_02369  1       51       -6.521089   insertion sequence IS407 OrfB
PAKAF_02370  5       34       -1.404421   insertion sequence IS407 OrfB
PAKAF_02371  4       15       2.143084    transposase A,Transposase
PAKAF_05925  4       16       2.963652    IS3 family transposase
PAKAF_05926  3       16       3.164415    transposase
PAKAF_05926  3       16       3.208909    transposase
PAKAF_02373  3       17       3.519087    hypothetical protein
PAKAF_02374  2       15       3.378292    DUF4011 domain-containing protein
PAKAF_02374  2       14       3.167040    DUF4011 domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_02359  DNABINDINGHU  113    1e-36    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 113 bits (284), Expect = 1e-36
Identities = 34/89 (38%), Positives = 54/89 (60%)

Query: 5 TKAEIAERLYEELGLNKREAKELVELFFEEIRQALEHNEQVKLSGFGNFDLRDKRQRPGR 64
K ++ ++ E L K+++ V+ F + L E+V+L GFGNF++R++ R GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 65 NPKTGEEIPITARRVVTFRPGQKLKARVE 93
NP+TGEEI I A +V F+ G+ LK V+
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAVK 91


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
PAKAF_02362  TONBPROTEIN  31     0.019    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 30.7 bits (69), Expect = 0.019
Identities = 12/45 (26%), Positives = 18/45 (40%)

Query: 557 EPVMIYEPKPEEPVVPPETIEALAIGEQELPPYVPDATGGAEVFP 601
EPV+ EP+PE PP+ + + P P + P
Sbjct: 66 EPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQP 110


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
PAKAF_02373  RTXTOXIND  34     0.001    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 34.0 bits (78), Expect = 0.001
Identities = 37/218 (16%), Positives = 69/218 (31%), Gaps = 17/218 (7%)

Query: 113 LLQALDARPDAASAALRQTLQALADGALRDDAEAL-LAQGFAALASAPVEERLSAAQHEL 171
LL+ +A + + +L R + + P E E
Sbjct: 124 LLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEE 183

Query: 172 AQRLKTDEAPITLEQWRARQQQDAPREQRLARIDRHIAELQLLQGEASAQAFLERLARAE 231
RL + EQ+ Q Q +E +D+ AE + + E L+R E
Sbjct: 184 VLRLTSLIK----EQFSTWQNQKYQKELN---LDKKRAERLTVLARINR---YENLSRVE 233

Query: 232 AEQRPERRNLLLDSLVLDLAQAAREHQQQRQRLEHLQDLASEVAALGAAEHAELLQRAAA 291
+ + +LL + +Q+ + +E + +L + L E L +
Sbjct: 234 KSRLDDFSSLLHKQAI----AKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEY 289

Query: 292 CQPDSDPQQ--LAELTERCSAILTAHLQQQAALARRQA 327
+ L +L + I L+ R+QA
Sbjct: 290 QLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQA 327



Score = 29.4 bits (66), Expect = 0.029
Identities = 20/122 (16%), Positives = 43/122 (35%), Gaps = 4/122 (3%)

Query: 12 REEAIATCERDLQRLDKALARWENQASRLAQLSDAERAAAHARRASLHALLEQERWLDVQ 71
+E + + + + R+EN + D + H + + HA+LEQE V+
Sbjct: 205 KELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKY-VE 263

Query: 72 LQVKIESEFLKRDLAEREERAIRQAAETRQQHRR---LQENASALLQALDARPDAASAAL 128
++ + + E E + ++ + Q + L + + A
Sbjct: 264 AVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEE 323

Query: 129 RQ 130
RQ
Sbjct: 324 RQ 325


28  PAKAF_02477  PAKAF_02482  Y  N  N  Genomic Island
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_02477  0       14       -3.139446   tRNA methyltransferase
PAKAF_02478  0       10       -3.169758   NUDIX hydrolase
PAKAF_02479  0       10       -3.865671   isocitrate dehydrogenase
PAKAF_02480  0       11       -4.006583   isocitrate dehydrogenase
PAKAF_02481  -2      12       -3.343776   cold-shock protein CspD
PAKAF_02482  -2      8        -3.147451   ClpS
29  PAKAF_02518  PAKAF_02537  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_02518  0       21       -3.455958   response regulator GacA
PAKAF_02519  0       23       -3.494550   excinuclease ABC subunit C
PAKAF_02520  0       25       -4.058610   CDP-diacylglycerol--glycerol-3-phosphate
PAKAF_02522  0       25       -3.723908   *probable sensor/response regulator hybrid
PAKAF_02523  0       19       -3.387462   hypothetical protein
PAKAF_02525  -1      16       -2.953756   *hypothetical protein
PAKAF_02526  -1      14       -2.205103   NAD(P)H-dependent oxidoreductase
PAKAF_02527  -1      15       -2.486502   L-Tryptophan:oxygen 2,3-oxidoreductase
PAKAF_02528  -1      13       -1.863855   probable acetyltransferase
PAKAF_02529  -1      11       -2.265012   probable transcriptional regulator
PAKAF_02530  -1      13       -2.342572   DMT family transporter
PAKAF_02531  -1      16       -3.229383   nitroreductase family protein
PAKAF_02532  0       18       -3.221106   alkane-1-monooxygenase
PAKAF_02533  1       19       -3.096898   probable chemotaxis transducer
PAKAF_02534  1       18       -3.145356   probable two-component response regulator
PAKAF_02535  2       23       -3.291096   probable two-component sensor
PAKAF_02537  1       22       -3.304906   *LecA
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_02518  HTHFIS        55     2e-11    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 55.2 bits (133), Expect = 2e-11
Identities = 19/93 (20%), Positives = 38/93 (40%), Gaps = 1/93 (1%)

Query: 6 EGLQVVGQADCGEDCLKLARELKSDVVLMDVKMPGIGGLEATRKLLRSQPDIKVVVVTVC 65
G V + D+V+ DV MP + ++ +++PD+ V+V++
Sbjct: 26 AGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIKKARPDLPVLVMSAQ 84

Query: 66 EEDPFPTRLMQAGAAGYMTKGAGLEEMVQAIRQ 98
+ + GA Y+ K L E++ I +
Sbjct: 85 NTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_02522  HTHFIS        66     5e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 65.6 bits (160), Expect = 5e-13
Identities = 30/120 (25%), Positives = 49/120 (40%), Gaps = 9/120 (7%)

Query: 863 SILLAEDHPFNRLTLTMQLESLGHRVTSTEDGEEAFERWQGEDFDVVITDGMMPRMDGYE 922
+IL+A+D R L L G+ V T + + D D+V+TD +MP + ++
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 923 LARRIRSQEALGGRRRCLVIALTASAEKDALERCLAAGMDRVLFKP----TTLDELARAL 978
L RI+ R V+ ++A + G L KP + + RAL
Sbjct: 65 LLPRIKKA-----RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_02528  SACTRNSFRASE  36     2e-05    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 36.5 bits (84), Expect = 2e-05
Identities = 20/82 (24%), Positives = 35/82 (42%), Gaps = 4/82 (4%)

Query: 74 DDQVIGHCQLLFDRRNGVVRLARIALAPSARGQGLGLPMLEALLAEAFA-DADIERVELN 132
++ IG ++ NG + IA+A R +G+G +L A +A + + L
Sbjct: 73 ENNCIGRIKIR-SNWNGYALIEDIAVAKDYRKKGVGTALLH--KAIEWAKENHFCGLMLE 129

Query: 133 VYDWNAAARHLYRRAGFREEGL 154
D N +A H Y + F +
Sbjct: 130 TQDINISACHFYAKHHFIIGAV 151


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_02534  HTHFIS        106    6e-27    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 106 bits (265), Expect = 6e-27
Identities = 32/154 (20%), Positives = 69/154 (44%), Gaps = 5/154 (3%)

Query: 14 RFSVLLVDDEPLILSSLRRLLRNQPYDLLLAESGEQALQLLESRPVDLVVSDARMPNMDG 73
++L+ DD+ I + L + L YD+ + + + + + DLVV+D MP+ +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 74 AALLAEIHRRSPETIRILLTGHADLPTIAKAINEGRIHHYLSKPWNDDELLLTLRQSLEY 133
LL I + P+ ++++ T KA +G + YL KP++ EL+ + ++L
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKG-AYDYLPKPFDLTELIGIIGRALAE 121

Query: 134 LHSERERRRLERLTQE----QNDRLQQLNATLEK 163
+ + ++ +Q++ L +
Sbjct: 122 PKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLAR 155


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_02535  PF06580       35     5e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 34.8 bits (80), Expect = 5e-04
Identities = 29/188 (15%), Positives = 63/188 (33%), Gaps = 44/188 (23%)

Query: 288 REGIGRVRKIVQDLKNFSR-VDAEDDWQWTDLHQGIESTLNIVASE-------LKYRADV 339
E + R+++ L R + + L + + + L++ +
Sbjct: 187 LEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQI 246

Query: 340 VREYGDLPEVKCLPSQINQVVMNLVMNAAQ-AMGPER--GRIVIRTGHTVEHAWIEVEDS 396
D+ +P + Q LV N + + G+I+++ +EVE++
Sbjct: 247 NPAIMDVQ----VPPMLVQT---LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENT 299

Query: 397 GQGISPEILPRIFDPFFTTKPVGKGTGLGLS-------LSYGIVQKHGGTIEVRSQPGVG 449
G K + TG GL + YG + I++ + G
Sbjct: 300 GSLA--------------LKNTKESTGTGLQNVRERLQMLYG--TEAQ--IKLSEKQGKV 341

Query: 450 SAFRIVLP 457
+A +++P
Sbjct: 342 NA-MVLIP 348


30  PAKAF_02623  PAKAF_02671  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_02623  2       14       2.492562   probable transcriptional regulator
PAKAF_02624  2       13       1.300204   probable transcriptional regulator
PAKAF_02625  1       14       0.532964   hypothetical protein
PAKAF_02626  0       13       0.522065   Pseudomonas type III repressor gene C, PtrC
PAKAF_02627  2       8        1.849323   hypothetical protein
PAKAF_02628  1       8        2.492146   TetR/AcrR family transcriptional regulator
PAKAF_02629  2       10       3.063641   LLM class flavin-dependent oxidoreductase
PAKAF_02630  3       9        3.399934   probable cytochrome c
PAKAF_02631  3       10       3.201770   c-type cytochrome
PAKAF_02632  3       10       3.523995   probable two-component sensor
PAKAF_02633  0       10       2.825772   probable two-component response regulator
PAKAF_02634  1       9        2.732431   probable thiol:disulfide interchange protein
PAKAF_02635  0       7        2.716833   probable thiol:disulfide interchange protein
PAKAF_02636  0       10       2.808764   thiol:disulfide interchange protein DsbG
PAKAF_02637  -1      11       3.121665   probable cytochrome P450
PAKAF_02638  0       13       3.258403   hypothetical protein
PAKAF_02639  0       13       3.532683   probable glutathione S-transferase
PAKAF_02640  1       12       3.951633   probable major facilitator superfamily (MFS)
PAKAF_02641  1       13       2.131947   fumarylacetoacetate hydrolase family protein
PAKAF_02642  1       11       1.903704   gentisate 1,2-dioxygenase
PAKAF_02643  1       11       1.738788   probable transcriptional regulator
PAKAF_02644  2       11       -0.395068  ECF sigma factor FoxI
PAKAF_02645  3       14       -0.718240  Anti-sigma factor FoxR
PAKAF_02646  4       16       -2.039909  Ferrioxamine receptor FoxA
PAKAF_02647  4       20       -2.097335  PepSY domain-containing protein
PAKAF_02648  4       23       -2.173908  hypothetical protein
PAKAF_02649  4       23       -2.617074  hypothetical protein
PAKAF_02650  3       17       -0.101150  Wall-associated protein,Cell wall-associated
PAKAF_02651  3       13       1.230688   yd repeat-containing protein,YD repeat (two
PAKAF_02653  2       14       3.034935   class I SAM-dependent methyltransferase
PAKAF_02653  2       13       3.091246   class I SAM-dependent methyltransferase
PAKAF_02654  1       13       3.160294   YgdI/YgdR family lipoprotein
PAKAF_02655  2       12       3.356791   esterase family protein
PAKAF_02655  0       11       3.109966   esterase family protein
PAKAF_02656  -2      14       2.548290   hypothetical protein
PAKAF_02657  -2      14       2.091335   probable transcriptional regulator
PAKAF_02658  -2      13       1.903364   amidohydrolase
PAKAF_02659  -2      13       1.967133   probable transcriptional regulator
PAKAF_02660  -2      13       1.930632   glycine cleavage system protein H2
PAKAF_02661  -2      10       2.139807   glycine cleavage system protein P2
PAKAF_02662  -2      7        3.416636   serine hydroxymethyltransferase
PAKAF_02663  -1      7        3.977903   L-serine dehydratase
PAKAF_02664  1       7        4.852881   glycine cleavage system protein T2
PAKAF_02665  1       7        4.645417   hypothetical protein
PAKAF_02666  2       9        5.053155   polysaccharide deacetylase family protein
PAKAF_02667  3       11       5.274070   protease modulator HflK
PAKAF_02668  3       12       4.580612   protease modulator HflC
PAKAF_02669  3       10       3.915618   hypothetical protein
PAKAF_02670  3       13       3.145498   hypothetical protein
PAKAF_02671  3       13       3.229802   probable cation-transporting P-type ATPase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_02628  HTHTETR       51     4e-10    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 50.8 bits (121), Expect = 4e-10
Identities = 37/201 (18%), Positives = 66/201 (32%), Gaps = 11/201 (5%)

Query: 1 MAKRGRPCGFD-REQALRRALDVFWEAGYEGATMAALKEAMGGICAPSMYAAYGSKEALF 59
MA++ + + R+ L AL +F + G ++ + +A G+ ++Y + K LF
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAA-GVTRGAIYWHFKDKSDLF 59

Query: 60 RSAVELYLSQECQLSKGAFA------LPTARESIAALLESAAVSYTTEGKPRGCLVDLST 113
EL S +L A L RE + +LES T E + +
Sbjct: 60 SEIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLEST---VTEERRRLLMEIIFHK 116

Query: 114 TNFSPANKGVEDYLRDHRRRAARLLRERFARGVADGDVPAGADLDALTSFYSSVLQGLSI 173
F V+ R+ + + + + +PA + GL
Sbjct: 117 CEFVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLME 176

Query: 174 QARDGASRQQLLAIGRCAMAA 194
L R +A
Sbjct: 177 NWLFAPQSFDLKKEARDYVAI 197


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_02633  HTHFIS        83     2e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 82.6 bits (204), Expect = 2e-20
Identities = 38/122 (31%), Positives = 62/122 (50%), Gaps = 1/122 (0%)

Query: 2 HVLLTEDDDLIASGIVAGLNAQGLTVDRVASAADTQALLQVARFDVLVLDLGLPDEDGLR 61
+L+ +DD I + + L+ G V ++AA + D++V D+ +PDE+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 LLQRLRQQGVDLPVLVLTARDAVTDRVAGLQAGADDYLLKPFDLRELGARLHT-LQRRSA 120
LL R+++ DLPVLV++A++ + + GA DYL KPFDL EL + L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 121 GR 122

Sbjct: 125 RP 126


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_02640  TCRTETB       49     3e-08    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 48.7 bits (116), Expect = 3e-08
Identities = 39/178 (21%), Positives = 72/178 (40%), Gaps = 3/178 (1%)

Query: 31 LCFLIVAMDGFDTAAIGFIAPALAHDWQLSPAQLSPILGAALAGLALGAFAAGPLADRFG 90
LC L + + P +A+D+ PA + + A + ++G G L+D+ G
Sbjct: 19 LCIL-SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLG 77

Query: 91 RKSVLLLSVLFFGGWSLASAYAGS-VETLALLRFFTGLGLGGAMPNAITLTSEYCPRRHR 149
K +LL ++ S+ S L + RF G G + + + Y P+ +R
Sbjct: 78 IKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENR 137

Query: 150 ALMVTAMFCGFTLGSALGGLLAARMVPALGWESVLLLGGGLPLASLPLLWACLPESVR 207
+ +G +G + + + W S LLL + + ++P L L + VR
Sbjct: 138 GKAFGLIGSIVAMGEGVGPAIGGMIAHYIHW-SYLLLIPMITIITVPFLMKLLKKEVR 194



Score = 30.6 bits (69), Expect = 0.011
Identities = 36/196 (18%), Positives = 71/196 (36%), Gaps = 13/196 (6%)

Query: 256 AELRGGTLLLWATF--FMGLLIIYLLTNWLPTLIGGTGFSLGEAATISAMFQLGGTLGAL 313
+ LR +L+W F +L +L LP + ++ F L ++G
Sbjct: 8 SNLRHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTA 67

Query: 314 LLGSAMDRFDAHRVLSLAYVGGALFILG--IASLYHSFA---LLALCVAGVGFCISGSQV 368
+ G D+ R+L G + G I + HSF ++A + G G + V
Sbjct: 68 VYGKLSDQLGIKRLLLF---GIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALV 124

Query: 369 GANALAADFYPTRSRATGVSWALGLGRIGSIVGSLSGGALLG-LGLGFSGILALLVIPAL 427
+ A + P +R + +G VG GG + + + ++ ++ I +
Sbjct: 125 M--VVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITV 182

Query: 428 LAAVAVHRLGRRRARP 443
+ + + R
Sbjct: 183 PFLMKLLKKEVRIKGH 198


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_02643  MPTASEINHBTR  28     0.030    Metalloprotease inhibitor signature.
		>MPTASEINHBTR#Metalloprotease inhibitor signature.

Length = 122

Score = 27.7 bits (61), Expect = 0.030
Identities = 13/43 (30%), Positives = 20/43 (46%)

Query: 59 RGLRPTPYGMTLFNHAQRVLTEMERARQNLEAMRSGSGSRVLL 101
PTP G+ L N +T + R ++ R+ SG+ V L
Sbjct: 76 VSWSPTPDGIWLMNAEGTGITHLNRQKEGEYTGRTPSGADVTL 118


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_02646  TYPE3OMGPROT  34     0.002    Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature.

Length = 607

Score = 34.1 bits (78), Expect = 0.002
Identities = 19/81 (23%), Positives = 26/81 (32%), Gaps = 10/81 (12%)

Query: 15 LDFPRASRLSRSVRAALLSLAMAAGAAPLCASAAEAAAEQARPYAIPAGQ--LGDVLNRF 72
+ FP S R + LL L+ + A L PY A L D+L F
Sbjct: 1 MAFPLHSFFKRVLTGTLLLLSSYSWAQEL--------DWLPIPYVYVAKGESLRDLLTDF 52

Query: 73 AREAGITLSATPAQTGGYSSQ 93
T+ + S Q
Sbjct: 53 GANYDATVVVSDKINDKVSGQ 73


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_02657  HTHFIS        331    e-111    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 331 bits (851), Expect = e-111
Identities = 120/355 (33%), Positives = 179/355 (50%), Gaps = 35/355 (9%)

Query: 186 ERLAALHHDHAEGFEMLLGDSQPIRTLKTRAQRVAALDAPLLIHGETGTGKELVARGCHA 245
+R + D ++ L+G S ++ + R+ D L+I GE+GTGKELVAR H
Sbjct: 123 KRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHD 182

Query: 246 LSARHNSPFLALNCAALPENLAESELFGYAPGAFTGAQRGGKPGLLELAHQGTVFLDEIG 305
R N PF+A+N AA+P +L ESELFG+ GAFTGAQ G E A GT+FLDEIG
Sbjct: 183 YGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRST-GRFEQAEGGTLFLDEIG 241

Query: 306 EMSPYLQAKLLRFLSDGSFRRVGGDREVRVDVRILSATHRNLEKMVAEGSFREDLFYRLN 365
+M Q +LLR L G + VGG +R DVRI++AT+++L++ + +G FREDL+YRLN
Sbjct: 242 DMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLN 301

Query: 366 VLSLEVPPLRERGHDILLLARHFMQQACAQIQRPVCRLAPGTYPALLNNRWPGNVRQLQN 425
V+ L +PPLR+R DI L RHF+QQA + V R + + WPGNVR+L+N
Sbjct: 302 VVPLRLPPLRDRAEDIPDLVRHFVQQAE-KEGLDVKRFDQEALELMKAHPWPGNVRELEN 360

Query: 426 VIFRAAAICESSLVDIGDLEIAGTAVARQND----------------------------- 456
++ R A+ ++ +E + +
Sbjct: 361 LVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFG 420

Query: 457 ---GEVGSLEEAVEGFEKALLEKLYVSYPSTRQLAAR-LQTSHTAIAHRLRKYGI 507
G + + E L+ + + AA L + + ++R+ G+
Sbjct: 421 DALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_02658  UREASE        34     0.002    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 33.9 bits (78), Expect = 0.002
Identities = 17/36 (47%), Positives = 21/36 (58%)

Query: 520 YTRNAARTIGLERRIGSLEPGKQADFIVLDRDVFEV 555
YT N A GL IGSLE GK+AD ++ + F V
Sbjct: 409 YTINPAIAHGLSHEIGSLEVGKRADLVLWNPAFFGV 444



Score = 30.9 bits (70), Expect = 0.018
Identities = 23/79 (29%), Positives = 37/79 (46%), Gaps = 10/79 (12%)

Query: 19 HALGAADLLVVNARIFTANPQQPFAEALAVEDGRILAVGDEAGLRALADGDSQVVDLG-- 76
GA D ++ NA I + + ++DGRI A+G +AG + G + +V G
Sbjct: 63 REGGAVDTVITNALIL--DHWGIVKADIGLKDGRIAAIG-KAGNPDMQPGVTIIVGPGTE 119

Query: 77 -----GKRLMPGLIDTHSH 90
GK + G +D+H H
Sbjct: 120 VIAGEGKIVTAGGMDSHIH 138


31  PAKAF_02742  PAKAF_02747  Y  N  N  Genomic Island
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_02742  2       10       1.373329   TssC3
PAKAF_02743  4       11       2.203392   TssB3
PAKAF_02744  4       12       2.286797   TssJ3
PAKAF_02745  4       11       2.455399   TssK3
PAKAF_02746  4       11       1.997984   TssL3
PAKAF_02747  2       10       1.934628   TssM3
32  PAKAF_02758  PAKAF_02763  Y  N  N  Genomic Island
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_02758  0       13       3.236898   probable ATP-binding component of ABC
PAKAF_02759  1       12       3.369785   MetQ/NlpA family ABC transporter
PAKAF_02760  0       12       4.451521   LLM class flavin-dependent oxidoreductase
PAKAF_02761  0       11       4.541066   SfnB family sulfur acquisition oxidoreductase
PAKAF_02762  -2      12       4.063228   SfnB family sulfur acquisition oxidoreductase
PAKAF_02763  -2      13       3.482861   NAD(P)/FAD-dependent oxidoreductase
33  PAKAF_02790  PAKAF_02817  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_02790  3       8        1.631277   cupin domain-containing protein
PAKAF_02791  3       9        1.760616   probable oxidoreductase
PAKAF_02792  5       10       2.619361   probable transcriptional regulator
PAKAF_02793  6       10       2.080411   serine hydrolase
PAKAF_02794  5       11       1.320091   probable major facilitator superfamily (MFS)
PAKAF_02795  4       11       0.134434   hypothetical protein
PAKAF_02796  -2      12       1.358178   probable transcriptional regulator
PAKAF_02797  -3      15       0.773663   hypothetical protein
PAKAF_02798  -1      12       3.102997   TauD/TfdA family dioxygenase
PAKAF_02799  -1      12       3.384061   ABC transporter substrate-binding protein
PAKAF_02800  0       11       3.531899   probable ATP-binding component of ABC
PAKAF_02801  -1      11       4.273225   probable permease of ABC transporter
PAKAF_02802  -1      10       4.459947   AmbA
PAKAF_02803  -1      10       4.360085   AmbB
PAKAF_02804  -1      11       3.618033   AmbC
PAKAF_02805  -1      10       3.084574   AmbD
PAKAF_02806  -1      10       3.054593   AmbE
PAKAF_02807  -1      11       0.471257   hypothetical protein
PAKAF_02808  0       10       0.484992   chitinase
PAKAF_02809  0       11       1.074069   probable transcriptional regulator
PAKAF_02810  0       9        2.206282   probable oxidoreductase
PAKAF_02811  1       11       3.305438   probable ferredoxin
PAKAF_02812  -2      12       2.413106   nitrate ABC transporter substrate-binding
PAKAF_02813  1       14       2.826737   probable permease of ABC transporter
PAKAF_02814  1       13       2.185860   probable ATP-binding component of ABC
PAKAF_02815  2       13       2.041718   HEAT repeat domain-containing protein
PAKAF_02816  2       13       1.382933   DUF971 domain-containing protein
PAKAF_02817  2       12       1.519947   probable glucose-sensitive porin
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_02794  TCRTETB       46     2e-07    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 46.0 bits (109), Expect = 2e-07
Identities = 36/170 (21%), Positives = 73/170 (42%), Gaps = 7/170 (4%)

Query: 35 FVAILSETLPAGLLPQIGAGLAVSEALAGQLVSVYALGSLLAALPAASLTQGWRRRRVLL 94
F ++L+E + LP I A + + + L + L+ +R+LL
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 95 LALLIFFVCNSLTAVS-SDYRLTLLARFGSGVAAGLAWGLLAGYARRLVPPEQQGRALAV 153
++I + + V S + L ++ARF G A L+ R +P E +G+A
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAF-- 141

Query: 154 AMLGAPLALSLGVPLGTWLGGLLG--WRWAFGLLSLTALLLVGWVLRSVP 201
++G+ +++G +G +GG++ W++ LL ++ L +
Sbjct: 142 GLIGS--IVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLL 189


34  PAKAF_02858  PAKAF_02868  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_02858  3       9        1.527161   lipoamide dehydrogenase-Val
PAKAF_02859  2       11       0.846385   branched-chain alpha-keto acid dehydrogenase
PAKAF_02860  1       11       0.904759   2-oxoisovalerate dehydrogenase (beta subunit)
PAKAF_02861  0       10       2.638743   2-oxoisovalerate dehydrogenase (alpha subunit)
PAKAF_02862  -1      8        2.330639   hypothetical protein
PAKAF_02863  3       7        3.077352   transcriptional regulator BkdR
PAKAF_02864  2       8        2.997617   hypothetical protein
PAKAF_02865  2       9        3.194060   DNA topoisomerase IB
PAKAF_02866  2       9        3.214707   FAD-dependent oxidoreductase
PAKAF_02867  3       9        2.106485   acyltransferase family protein
PAKAF_02868  2       8        2.920864   PslL
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_02859  IGASERPTASE   35     6e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 35.0 bits (80), Expect = 6e-04
Identities = 16/70 (22%), Positives = 24/70 (34%), Gaps = 9/70 (12%)

Query: 78 EVEGAGNLAESPAAATPAAPVAATPE---------KPKEAPVAAPKAAAEAPRALRDSEA 128
EVE ++ TP A P + EAPV P A + +E
Sbjct: 984 EVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAEN 1043

Query: 129 PRQRRQPGER 138
+Q + E+
Sbjct: 1044 SKQESKTVEK 1053


35  PAKAF_02881  PAKAF_02905  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_02881  2       24       -2.086144  short-chain
PAKAF_02882  2       21       -2.870153  proteobacterial sortase system OmpA family
PAKAF_02883  2       17       -3.261306  protein RomA,metal-dependent
PAKAF_02884  1       19       -3.743482  Sugar diacid utilization regulator
PAKAF_02885  2       19       -4.455007  phytanoyl-CoA dioxygenase,Phytanoyl-CoA
PAKAF_02886  2       22       -4.534453  BNR repeat-containing protein,Ycf48-like protein
PAKAF_02887  1       23       -5.301770  transporter,multidrug efflux protein,Predicted
PAKAF_02888  1       29       -6.140723  putative lipoprotein,Protein of unknown function
PAKAF_02889  0       35       -6.905833  hypothetical protein,Protein of unknown function
PAKAF_02890  0       37       -6.692349  hypothetical protein
PAKAF_02891  0       32       -5.768763  phytoene dehydrogenase,Gamma-glutamylputrescine
PAKAF_02892  0       25       -5.060710  phytanoyl-CoA dioxygenase,Phytanoyl-CoA
PAKAF_02893  0       18       -3.897900  TetR family transcriptional regulator,putative
PAKAF_02894  0       15       -3.135044  FAD-containing monooxygenase EthA,FAD-containing
PAKAF_02895  0       14       -2.203210  hydrolase,Putative aminoacrylate hydrolase
PAKAF_02896  1       12       -2.208214  1,3-propanediol dehydrogenase,1,3-propanediol
PAKAF_02897  1       13       -1.909656  amino acid permease,Putrescine importer
PAKAF_02898  1       14       -1.850846  gamma-aminobutyraldehyde
PAKAF_02899  2       12       -1.245002  diaminobutyrate--2-oxoglutarate
PAKAF_02900  4       15       -0.470482  putative
PAKAF_02901  6       13       0.144705   hypothetical protein
PAKAF_02902  6       14       0.363336   acetate permease,Acetate transporter
PAKAF_02903  5       12       1.327521   membrane protein,Inner membrane protein
PAKAF_02904  3       13       0.572642   acyl-CoA synthetase,Short-chain-fatty-acid--CoA
PAKAF_02905  2       15       -0.292833  dehydrogenase,Gluconate 5-dehydrogenase,short
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_02881  DHBDHDRGNASE  97     2e-26    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 97.4 bits (242), Expect = 2e-26
Identities = 73/251 (29%), Positives = 114/251 (45%), Gaps = 12/251 (4%)

Query: 7 RFAVVTGASSGIGLKLTETLLGHGATVLAM---ARREGPPESLHAHSGKRLHWLAGDVTR 63
+ A +TGA+ GIG + TL GA + A+ + S + DV
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 64 ERDLDALASR-AASIGPVDYLVPNAGIAQLA--DGLDSLAFEQQWRVNGAGALNTFSVLS 120
+D + +R +GP+D LV AG+ + L +E + VN G N +S
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 121 RQ--TSKPASVVFIGTFLSQVTFPGLAAYIASKAALIAQARTLAVEWAEKGVRINLVSPG 178
+ + S+V +G+ + V +AAY +SKAA + + L +E AE +R N+VSPG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 179 PTATPIWASLGLSDAQAESVTRSINQRLVDGSFLS----PGEIVDVVMFLLSSKSAGLYG 234
T T + SL + AE V + + G L P +I D V+FL+S ++ +
Sbjct: 189 STETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITM 248

Query: 235 QELIVDKGYGL 245
L VD G L
Sbjct: 249 HNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_02884  HTHFIS        32     0.006    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 31.7 bits (72), Expect = 0.006
Identities = 21/113 (18%), Positives = 36/113 (31%), Gaps = 9/113 (7%)

Query: 258 RAAIGSPGTGVNGFRRSHLEALTTQRLMGRLAGAPAVATIDQVRMVSLMTQDDRAARQFV 317
R P + R +E + + A A + + + ++ R
Sbjct: 364 RLTALYPQDVI---TREIIE-NELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASF 419

Query: 318 LSTLGRLATEPSVL-----QRSLHAFLANGCNVTQTAEALGTHRNTLLRRLER 365
L VL L A A N + A+ LG +RNTL +++
Sbjct: 420 GDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRE 472


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_02887  ACRIFLAVINRP  73     2e-15    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 73.3 bits (180), Expect = 2e-15
Identities = 33/209 (15%), Positives = 80/209 (38%), Gaps = 9/209 (4%)

Query: 590 TTINRVVDAAKAFRSEYPMSGISIRLASGNAGVLAAINEEVEKSETPMLLYVYAAIALLV 649
T + + +P G+ + + EV K+ L + L++
Sbjct: 301 DTAKAIKAKLAELQPFFP-QGMKVLYPYDTTPFVQLSIHEVVKT----LFEAIMLVFLVM 355

Query: 650 FVVYRDLRAVLVCCLPLTIGTFIGYWFMKELQIGLTIATLPVMVLAVGIGVDYAFYIYNR 709
++ +++RA L+ + + + + + + T+ MVLA+G+ VD A +
Sbjct: 356 YLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVEN 415

Query: 710 LQLHLAHGQTITK-AVEYALLEVGVATIFTAITLAVGVATWAF---SELKFQADMGKLLA 765
++ + + K A E ++ ++ A + A+ L+ AF S +
Sbjct: 416 VERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIV 475

Query: 766 FMFVVNMVMAMTVLPAFAVWLERAFPRKR 794
+++++A+ + PA L + +
Sbjct: 476 SAMALSVLVALILTPALCATLLKPVSAEH 504



Score = 41.0 bits (96), Expect = 2e-05
Identities = 37/216 (17%), Positives = 75/216 (34%), Gaps = 11/216 (5%)

Query: 233 IADGASAVLEFCLLALLLTAGAVYWYCHSLRFTLLALVCSLASLVWQFGSLRLLGYGLDP 292
+ V++ A++L +Y + ++R TL+ + L+ F L GY ++
Sbjct: 333 VQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINT 392

Query: 293 LAVLVPFLVFAIGVSHGVQQINFIVREIAIGKS----AEEAARSSFTGLLVPGTLALVTA 348
L + L + V + + + R + K A E + S G LV + L
Sbjct: 393 LTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAV 452

Query: 349 LVSFVTLLLIPIPMVRELAITASLGVAYKIVTNLVMLPLMASLLRVDDKYAAAQEVSRQR 408
+ + R+ +IT +A ++ L++ P + + L K +A+ +
Sbjct: 453 FIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLL---KPVSAEHHENKG 509

Query: 409 R-SRWL-RGLARLAE--PRKAQWVLGAALAVFLAAI 440
W +LG+ L
Sbjct: 510 GFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLIYA 545



Score = 35.6 bits (82), Expect = 9e-04
Identities = 30/175 (17%), Positives = 60/175 (34%), Gaps = 18/175 (10%)

Query: 629 EVEKSETPMLLYVYAAIALLVFVV----YRDLRAVLVCCL--PLT-IGTFIGYWFMKELQ 681
E+ + A ++VF+ Y + L PL +G +
Sbjct: 863 YQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLF---N 919

Query: 682 IGLTIATLPVMVLAVGIGVDYAFYIYNRL-QLHLAHGQTITKAVEYALLEVGVATIFTAI 740
+ + ++ +G+ A I L G+ + +A A+ + T++
Sbjct: 920 QKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSL 979

Query: 741 TLAVGV-----ATWAFSELKFQADMGKLLAFMFVVNMVMAMTVLPAFAVWLERAF 790
+GV + A S Q +G + V ++A+ +P F V + R F
Sbjct: 980 AFILGVLPLAISNGAGSGA--QNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRCF 1032


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_02893  HTHTETR       50     5e-10    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 50.4 bits (120), Expect = 5e-10
Identities = 30/190 (15%), Positives = 68/190 (35%), Gaps = 11/190 (5%)

Query: 3 RVGAEVRRQDFIEAAVKVIAEYGVANATTRRIAAAANSPLASLHYVFHTKDELFDAVYES 62
+ A+ RQ ++ A+++ ++ GV++ + IA AA ++++ F K +LF ++E
Sbjct: 6 KQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWEL 65

Query: 63 LIDKPQQSLLHVTA--GATAADSVAEILRQLVGWFTTHPE-----LATTQFELFFWNLRN 115
+ L A + EIL ++ T F +
Sbjct: 66 SESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAV 125

Query: 116 NPAMASKIYTDSVEATKQAIEQV--AGSVLDQEALATVSRLLINLFDGLLLAWSAHGDQE 173
+ +S + +Q ++ A + + ++ GL+ W +
Sbjct: 126 VQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF--APQ 183

Query: 174 RLNAETEAAC 183
+ + EA
Sbjct: 184 SFDLKKEARD 193


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_02903  RTXTOXINA     26     0.026    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 26.5 bits (58), Expect = 0.026
Identities = 14/46 (30%), Positives = 22/46 (47%), Gaps = 1/46 (2%)

Query: 50 LIARRLAQGSNMTFGVAAGVFLFVFFCALSALYVYRANGEFDRLTQ 95
+IA+R AQG + + AAG+ A+S L +F R +
Sbjct: 291 IIAQRAAQGLSTS-AAAAGLIASAVTLAISPLSFLSIADKFKRANK 335


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_02905  DHBDHDRGNASE  112    3e-32    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 112 bits (282), Expect = 3e-32
Identities = 71/253 (28%), Positives = 123/253 (48%), Gaps = 14/253 (5%)

Query: 10 ALDGRRALVTGASSGLGRHFAMTLAAAGAEVVVTARRQAPLQALVEAIEVAGGRAQAFAL 69
++G+ A +TGA+ G+G A TLA+ GA + L+ +V +++ A+AF
Sbjct: 5 GIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPA 64

Query: 70 DV----TCREDICRVLDAAGPLDVLVNNAGVSDSQPLLACDDQSWDRVLDTNLKGAWAVA 125
DV E R+ GP+D+LVN AGV + + D+ W+ N G + +
Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNAS 124

Query: 126 QESARRMVAAGKGGSLINVTSILASRVAGAVGPYLAAKAGLAHLTRAMALELARHGIRVN 185
+ ++ M+ + GS++ V S A ++ Y ++KA T+ + LELA + IR N
Sbjct: 125 RSVSKYMMDR-RSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183

Query: 186 ALAPGYVMTDLNEAFLASEAGDKLRSR---------IPSRRFSVPADLDGALLLLASDAG 236
++PG TD+ + A E G + + IP ++ + P+D+ A+L L S
Sbjct: 184 IVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQA 243

Query: 237 RAMSGAEIVVDGG 249
++ + VDGG
Sbjct: 244 GHITMHNLCVDGG 256


36  PAKAF_02986  PAKAF_03004  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_02986  2       14       2.138332   CinA family protein
PAKAF_02987  2       11       2.524014   probable metallothionein
PAKAF_02988  2       10       1.920959   hypothetical protein
PAKAF_02989  1       10       1.999096   probable ATP-dependent DNA ligase
PAKAF_02990  0       14       1.866944   histidine kinase
PAKAF_02991  0       11       2.339412   hypothetical protein
PAKAF_02993  1       8        2.148202   probable transporter
PAKAF_02994  1       10       1.615471   hypothetical protein
PAKAF_02995  3       10       1.962832   EAL domain-containing protein
PAKAF_02996  2       9        1.669495   chaperone CupA5
PAKAF_02997  3       9        1.318298   fimbrial subunit CupA4
PAKAF_02998  2       7        1.671527   usher CupA3
PAKAF_02999  1       8        2.566800   chaperone CupA2
PAKAF_03000  0       8        2.684175   fimbrial subunit CupA1
PAKAF_03001  -1      9        2.715255   phosphoadenosine phosphosulfate reductase
PAKAF_03002  -1      10       3.101535   conserved hypothetical protein
PAKAF_03003  -1      10       3.211961   probable aldehyde dehydrogenase
PAKAF_03004  -2      11       3.195837   probable dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_02986  ACRIFLAVINRP  28     0.024    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 27.9 bits (62), Expect = 0.024
Identities = 15/61 (24%), Positives = 24/61 (39%), Gaps = 6/61 (9%)

Query: 34 QISAELSAVAGSGKVMEGGFVVYSPAAKLALLGLDPNWLERFNLTSVEVAEAMARAALQR 93
+ LS + G G V G A + LD + L ++ LT V+V + Q
Sbjct: 161 NVKDTLSRLNGVGDVQLFG------AQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQI 214

Query: 94 S 94
+
Sbjct: 215 A 215


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_02990  HTHFIS        35     8e-05    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 35.2 bits (81), Expect = 8e-05
Identities = 15/80 (18%), Positives = 32/80 (40%), Gaps = 2/80 (2%)

Query: 58 VLVLEEHADQLWRIEEFLLDRGYAVLSAASRDEALDHLASDAVIDLFLLSEQLEGPLSGS 117
+LV ++ A + + L GY V ++ +A+ DL + + +
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDG-DLVVTDVVMPD-ENAF 63

Query: 118 MLIETSLPVRPRMRVILLSD 137
L+ RP + V+++S
Sbjct: 64 DLLPRIKKARPDLPVLVMSA 83


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_02998  PF00577       772    0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 772 bits (1995), Expect = 0.0
Identities = 272/881 (30%), Positives = 412/881 (46%), Gaps = 50/881 (5%)

Query: 7 RRCRTGTALMAGGMALAASAFGHAQPGYEFDDRLLLGSSLGGGDLSRFNQDGRIDPGRYH 66
R + A AA A + F+ R L DLSRF + PG Y
Sbjct: 21 HRLAGFFVRLFVACAFAAQA-PLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYR 79

Query: 67 VDVYLNERFASRSEVSFRANPASGAVEPCLDEDFLRQRLGAKPGDDPRKSGDGRHCAFLG 126
VD+YLN + + +V+F + + PCL L C L
Sbjct: 80 VDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLT 139

Query: 127 ARLPGSRFSLDVARLRLDLSVPQALLDLKPRGYVSPEEWDAGDSMGFVNYDTNLYRSEYR 186
+ + + LDV + RL+L++PQA + + RGY+ PE WD G + G +NY+ + + R
Sbjct: 140 SMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNR 199

Query: 187 GGESGRSDYAYVGLNSGINLGLWRLRHQSNYTYSRYNGQA--RRKWNSIRTYAQRALPAW 244
G G S YAY+ L SG+N+G WRLR + ++Y+ + + + KW I T+ +R +
Sbjct: 200 IG--GNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPL 257

Query: 245 RSELTAGESYTAGNLLGSIGYRGLSLATDDRMLPESLRRYAPQVRGTAATAARVVISQNG 304
RS LT G+ YT G++ I +RG LA+DD MLP+S R +AP + G A A+V I QNG
Sbjct: 258 RSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNG 317

Query: 305 RKIREVNVAPGPFVIDDLYDSAYAGDLDVQVFEADGSVSSFSVPFASVPESMRPGLSRYS 364
I V PGPF I+D+Y + +GDL V + EADGS F+VP++SVP R G +RYS
Sbjct: 318 YDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYS 377

Query: 365 FTLGQARQYGDGDD--LFADFTYQRGMSNALTANLGLRVADDYLA-MLGGGVLATRFGAF 421
T G+ R + F T G+ T G ++AD Y A G G GA
Sbjct: 378 ITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGAL 437

Query: 422 GLNSTYSSARVEDGARKQGWRIGLDYSRTFQPTGTTLTLAGYRYSTEGYRELGDVLGSRD 481
++ T +++ + D ++ G + Y+++ +GT + L GYRYST GY D SR
Sbjct: 438 SVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRM 497

Query: 482 ALRHGDTWD-------------SGSYKQRNQFNLLVSQALGGYGNLYLSGSSSDFYDGKS 528
+ +T D + +Y +R + L V+Q LG LYLSGS ++ +
Sbjct: 498 NGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSN 557

Query: 529 RDTQLQFGYSNTWGQLSYNLAWSRQTTTDYQEQGDQDPGVELLRRDRRSGQRNDTLTLSV 588
D Q Q G + + +++ L++S ++ R+ L L+V
Sbjct: 558 VDEQFQAGLNTAFEDINWTLSYSLT-------------------KNAWQKGRDQMLALNV 598

Query: 589 SMPLGSSSRAPTLSA-----MATRRSGDSRGG-SLQTGLNGTLGDERTWSYALSA---NR 639
++P R+ + S + S D G + G+ GTL ++ SY++
Sbjct: 599 NIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGG 658

Query: 640 DSEVADTTWNGTLQKQAALATVNAGYAQGDRYRQYSGGIRGALVAHRDGLTLGPSVGDTF 699
+ +T TL + N GY+ D +Q G+ G ++AH +G+TLG + DT
Sbjct: 659 GDGNSGSTGYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTV 718

Query: 700 ALVEAKGASGAAIRGGQGARIDGNGYALAPSLSPYRYNPISLDPVGIDPDAELLETERKV 759
LV+A GA A + G R D GYA+ P + YR N ++LD + + +L V
Sbjct: 719 VLVKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANV 778

Query: 760 APYAGASVRVTFRTLTGHPLLIQARREDGSVLPLGAVVVDDGGAAIGMVGQGGQVYARAE 819
P GA VR F+ G LL+ + LP GA+V + + G+V GQVY
Sbjct: 779 VPTRGAIVRAEFKARVGIKLLMTLTH-NNKPLPFGAMVTSESSQSSGIVADNGQVYLSGM 837

Query: 820 NQRGRLLVQWGTARKERCELPYDLAGVSRDQALIRLRGTCR 860
G++ V+WG C Y L S+ Q L +L CR
Sbjct: 838 PLAGKVQVKWGEEENAHCVANYQLPPESQQQLLTQLSAECR 878


37  PAKAF_03014  PAKAF_03040  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_03014  -1      14       3.053632   probable transcriptional regulator
PAKAF_03015  -2      16       3.375086   probable major facilitator superfamily (MFS)
PAKAF_03016  -1      10       3.617924   pyroglutamate porin OpdO
PAKAF_03017  1       12       2.593997   LamB/YcsF family protein
PAKAF_03018  1       14       2.510394   allophanate hydrolase subunit 1
PAKAF_03019  2       20       0.738384   biotin-dependent carboxyltransferase family
PAKAF_03020  2       28       -2.044993  DUF4142 domain-containing protein
PAKAF_03021  1       31       -3.437390  probable decarboxylase
PAKAF_05927  1       44       -5.841928  hypothetical protein
PAKAF_03022  0       40       -7.887846  four-helix bundle copper-binding protein
PAKAF_03024  1       43       -8.619365  probable acetyltransferase
PAKAF_03025  0       37       -6.923072  probable cysteine synthase
PAKAF_03026  0       30       -3.950604  probable molybdopterin biosynthesis protein
PAKAF_03027  0       22       -2.092406  hypothetical protein
PAKAF_03028  -1      17       -0.635017  EamA family transporter
PAKAF_03029  0       12       1.223435   probable transcriptional regulator
PAKAF_03030  -1      9        3.923334   probable short-chain dehydrogenase
PAKAF_03031  -1      8        3.479138   probable esterase/deacetylase
PAKAF_03032  0       8        3.194330   probable flavin-binding monooxygenase
PAKAF_03033  3       11       4.172077   probable transcriptional regulator
PAKAF_03034  1       12       4.358850   hypothetical protein
PAKAF_03035  1       9        3.573698   probable transmembrane sensor
PAKAF_03036  0       9        3.233362   probable sigma-70 factor, ECF subfamily
PAKAF_03037  0       9        3.618625   probable major facilitator superfamily (MFS)
PAKAF_03038  1       9        3.276165   MFS transporter
PAKAF_03039  1       9        2.900999   LLM class flavin-dependent oxidoreductase
PAKAF_03040  1       9        3.028391   TonB-dependent receptor
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_03015  TCRTETB  39     3e-05    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 39.1 bits (91), Expect = 3e-05
Identities = 37/180 (20%), Positives = 82/180 (45%), Gaps = 6/180 (3%)

Query: 29 FWSCKIGYGLDGMDTQMLSFVIPTLIALWGIGTGEAGFIHTMTLLASAAGGWIAGILSDR 88
W C + + ++ +L+ +P + + +++T +L + G + G LSD+
Sbjct: 17 IWLCILSF-FSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75

Query: 89 IGRVLTLQLTVLWFAFFTFLCGLAQNYEQLLV-ARTLMGFGFGGEWTAGAVLIGEVIKAR 147
+G L ++ F + + + ++ LL+ AR + G G V++ I
Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKE 135

Query: 148 DRGKAVGLVQSGWAIGWGLTAILYSLMFSLLPPEEAWRALFMLGLLPALFVLVVRRLVKE 207
+RGKA GL+ S A+G G+ + ++ + W L ++ ++ + V + +L+K+
Sbjct: 136 NRGKAFGLIGSIVAMGEGVGPAIGGMIAHYI----HWSYLLLIPMITIITVPFLMKLLKK 191


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03030  DHBDHDRGNASE  85     5e-22    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 85.5 bits (211), Expect = 5e-22
Identities = 57/196 (29%), Positives = 87/196 (44%), Gaps = 9/196 (4%)

Query: 1 MQRGGRQVQNILISGAASGIGAASARLFHRRGWRVGLLDIDAEALRGLAVQLPGAWHRA- 59
M G + + I+GAA GIG A AR +G + +D + E L + L A
Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE 60

Query: 60 ---VDVSEPDAVGEALAQFCAD-GRLRLLFNCAGVLRFGRFEEVALEDHARLLAINLHGV 115
DV + A+ E A+ + G + +L N AGVLR G ++ E+ ++N GV
Sbjct: 61 AFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGV 120

Query: 116 LNCCHAAFPFLRATPQAQVLNMGSASGLYGVPE--MAVYSASKFAVRGLTEALELEWRRH 173
N + ++ ++ +GS GVP MA Y++SK A T+ L LE +
Sbjct: 121 FNASRSVSKYMMDRRSGSIVTVGSNPA--GVPRTSMAAYASSKAAAVMFTKCLGLELAEY 178

Query: 174 GIRVADLMPPFVRTPM 189
IR + P T M
Sbjct: 179 NIRCNIVSPGSTETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_03037  TCRTETA  55     2e-10    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 54.8 bits (132), Expect = 2e-10
Identities = 84/365 (23%), Positives = 139/365 (38%), Gaps = 20/365 (5%)

Query: 23 VGTVELVVAGVLDELAASFAVSQGRAGLLMSLYALVYALLGPLLVYLSAGIERRRLLAGA 82
+G + V+ G+L +L S V G+L++LYAL+ P+L LS RR +L +
Sbjct: 21 IGLIMPVLPGLLRDLVHSNDV-TAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVS 79

Query: 83 LLAFIGANLASAAAPSFALLLASRLLVAASASVIVVVAITLAVAIVAPERRGRAIGLVFA 142
L A AP +L R++ + + V +A I + R R G + A
Sbjct: 80 LAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIA-DITDGDERARHFGFMSA 138

Query: 143 GIVASLVLGVPLGTLIGEFWGWRSLFLLLAGVALLGLPLLLRLL---------PAIPGAP 193
+V G LG L+G F + F A + L LL P A
Sbjct: 139 CFGFGMVAGPVLGGLMGGF-SPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREAL 197

Query: 194 GIAPAEQLRALARGRVPFAHLASLLQMTGQFTVYTYIVPFLVGSMALDKPTISLVLLVYG 253
+ + + ++Q+ GQ +++ F D TI + L +G
Sbjct: 198 NPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVI-FGEDRFHWDATTIGISLAAFG 256

Query: 254 GGGILG-ALLGGRAADRWPGPATFVAFLLLHALALVLLPFATGGLPLLLGAVVFWCVFNM 312
L A++ G A R + ++ +LL FAT G V+
Sbjct: 257 ILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLL--ASGG 314

Query: 313 APGPAIQKYLVELSPDTAAIQISLNTSAIQLGVALGAFIGAILVDQVAVRALPWW-GAAL 371
PA+Q L + Q+ + +A+ +L + +G +L + ++ W G A
Sbjct: 315 IGMPALQAMLSRQVDEERQGQLQGSLAALT---SLTSIVGPLLFTAIYAASITTWNGWAW 371

Query: 372 ILGAA 376
I GAA
Sbjct: 372 IAGAA 376


38  PAKAF_03087  PAKAF_03098  Y  N  N  Genomic Island
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_03087  2       21       -6.395754  probable transporter (membrane subunit)
PAKAF_03088  3       30       -8.458137  probable amino acid permease
PAKAF_03089  3       29       -6.188026  probable glutamine synthetase
PAKAF_03090  5       34       -5.555510  putative branched-chain amino acid transport
PAKAF_03091  5       32       -5.017868  branched-chain amino acid ABC transporter
PAKAF_03092  4       26       -3.699512  hypothetical protein
PAKAF_03093  1       11       0.850453   methyltransferase domain-containing protein
PAKAF_03094  1       9        2.572128   probable decarboxylase
PAKAF_03095  2       13       2.150106   class I SAM-dependent methyltransferase
PAKAF_03096  3       12       0.273515   siderophore-interacting protein
PAKAF_03097  1       12       -0.522699  probable transcriptional regulator
PAKAF_03098  2       14       -1.181952  DUF1127 domain-containing protein
39  PAKAF_03145  PAKAF_03161  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_03145  2       20       0.938772   pyrroloquinoline quinone biosynthesis protein A
PAKAF_03146  1       17       1.652794   NAD+ dependent aldehyde dehydrogenase ExaC
PAKAF_03147  2       15       2.167690   cytochrome c550
PAKAF_03148  1       15       1.614277   quinoprotein ethanol dehydrogenase
PAKAF_03149  4       10       3.832411   pentapeptide repeat-containing protein
PAKAF_03150  2       9        2.702448   response regulator EraR
PAKAF_03151  0       10       2.345442   sensor kinase, EraS
PAKAF_03152  -1      10       1.000403   hypothetical protein
PAKAF_03153  -1      8        2.637881   response regulator ErbR
PAKAF_03154  0       9        2.727369   DMT family transporter
PAKAF_03155  -1      10       2.110329   ErcS'
PAKAF_03156  -1      9        2.053426   hypothetical protein
PAKAF_03157  0       10       2.271010   porin
PAKAF_03158  2       11       3.356342   pyrroloquinoline quinone biosynthesis protein F
PAKAF_03159  1       11       0.732723   phosphoethanolamine--lipid A transferase
PAKAF_03160  -2      12       1.013577   branched chain amino acid transporter BraZ
PAKAF_03161  2       17       0.601723   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
PAKAF_03150  HTHFIS  78     1e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 77.6 bits (191), Expect = 1e-18
Identities = 40/160 (25%), Positives = 64/160 (40%), Gaps = 9/160 (5%)

Query: 2 KILLVDDHFVVREGLAALLRGLLPDVEVSEAGDGEEALQAVQREIPSLVIVDLGLPGISG 61
IL+ DD +R L L +V + + + LV+ D+ +P +
Sbjct: 5 TILVADDDAAIRTVLNQALSRA--GYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 62 LELTRRLRQRLPQLRVLFFSLHDELALVRQALDAGARGYVTKRAAPTVLLEAIRRVLAGQ 121
+L R+++ P L VL S + +A + GA Y+ K T L+ I R L
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL--- 119

Query: 122 LYLEQPLATRLACQSWEEQGGAALRGLTRREFEIFRLLAR 161
R + + Q G L G + EI+R+LAR
Sbjct: 120 ----AEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLAR 155


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
PAKAF_03153  HTHFIS  71     2e-16    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 70.6 bits (173), Expect = 2e-16
Identities = 35/173 (20%), Positives = 69/173 (39%), Gaps = 11/173 (6%)

Query: 3 KILIADDHPLFREAIHNVIADGFPGSEVMETADLDSALGLTQEHDDLDLILLDLNMPGMH 62
IL+ADD R ++ + G +V T++ + D DL++ D+ MP +
Sbjct: 5 TILVADDDAAIRTVLNQAL--SRAGYDVRITSNAATLWRWIAA-GDGDLVVTDVVMPDEN 61

Query: 63 GLNGLMNLRNEAPTIPVVIVSAEQDKQVVLQAITYGAVGFITKSSPRAQMTEAIEQILNG 122
+ L ++ P +PV+++SA+ ++A GA ++ K ++ I + L
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA- 120

Query: 123 NVYLPSDVIRTQKSSPRRSGHEEHGISPELLQALTRKQLLVLERMT---KGES 172
++ + G G S + + L+ +T GES
Sbjct: 121 ----EPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGES 169


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
PAKAF_03155  HTHFIS  47     3e-07    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 46.7 bits (111), Expect = 3e-07
Identities = 26/99 (26%), Positives = 42/99 (42%), Gaps = 4/99 (4%)

Query: 756 GSRVWVLDNDAAICAGMRTLLEAWGCRVVTALSEEDLARQVDNYHAEADLLIVDYHLDDQ 815
G+ + V D+DAAI + L G V + L R + + DL++ D + D
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA--AGDGDLVVTDVVMPD- 59

Query: 816 RNGVDAVAAINARRGSPLPALMITANYSNELKQQVRELG 854
N D + I R LP L+++A + + E G
Sbjct: 60 ENAFDLLPRIKKARPD-LPVLVMSAQNTFMTAIKASEKG 97


40  PAKAF_03187  PAKAF_03193  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_03187  2       16       -3.149547  hypothetical protein
PAKAF_03188  6       31       -8.058584  GNAT family N-acetyltransferase
PAKAF_03189  6       34       -9.487654  hypothetical protein
PAKAF_03190  5       26       -6.311382  hypothetical protein
PAKAF_03191  5       29       -6.871208  catalase,Catalase
PAKAF_03192  5       33       -7.252916  hypothetical protein
PAKAF_03193  2       22       -4.779511  Uncharacterized protein conserved in bacteria
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03188  SACTRNSFRASE  42     7e-07    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 42.2 bits (99), Expect = 7e-07
Identities = 16/59 (27%), Positives = 27/59 (45%)

Query: 68 LARLYSIAIDPRARGIGLGQKLLEAAEQAARDNDRAYMRLEVRPDNRGAIALYERNGYR 126
A + IA+ R G+G LL A + A++N + LE + N A Y ++ +
Sbjct: 89 YALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147


41  PAKAF_03216  PAKAF_03231  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_03216  1       12       3.165363   hypothetical protein
PAKAF_03217  0       11       3.446405   ECF sigma factor, FemI
PAKAF_03218  0       10       3.381377   sigma factor regulator, FemR
PAKAF_03219  -1      10       2.979558   ferric-mycobactin receptor, FemA
PAKAF_03220  -1      11       3.273790   PepSY domain-containing protein
PAKAF_03222  -2      11       3.871606   probable major facilitator superfamily (MFS)
PAKAF_03223  -2      13       3.302404   dienelactone hydrolase
PAKAF_03224  1       16       3.850690   HIT family protein
PAKAF_03225  0       16       3.096292   putative pyridoxamine 5'-phosphate
PAKAF_03226  -1      14       2.074715   phenazine biosynthesis
PAKAF_03227  0       11       0.044386   phenazine biosynthesis protein PhzE,Anthranilate
PAKAF_03228  -1      11       -2.267759  phenazine biosynthesis protein PhzD,Probable
PAKAF_03229  0       11       -2.671899  phospho-2-dehydro-3-deoxyheptonate
PAKAF_03230  -1      13       -4.602239  phenazine biosynthesis protein,Phenazine
PAKAF_03231  0       13       -4.052145  phenazine biosynthesis protein,Phenazine
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_03222  TCRTETA  97     3e-24    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 96.8 bits (241), Expect = 3e-24
Identities = 85/335 (25%), Positives = 124/335 (37%), Gaps = 37/335 (11%)

Query: 49 GAAVTVGGIAWMLAARPWGIASDRHGRRRILLGGLAGFALSYGSLCLFIVLALHWTLPTL 108
G + + + A G SDR GRR +LL LAG A+ Y +
Sbjct: 46 GILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDY----------------AI 89

Query: 109 FAFAG---IVLLRGLAGGFYAAVPACTAALVADHVEAQRRAAALAGLGAASAIGMVIGPG 165
A A ++ + + G A A A +AD + RA + A GMV GP
Sbjct: 90 MATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPV 149

Query: 166 LAGLLATHGLVLPLLVTGALPLVALLALWRWLP----------REERRQPNRGAALAIGD 215
L GL+ P AL + L LP R E P A G
Sbjct: 150 LGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGM 209

Query: 216 RRLRRPLAVGFVAMFSVTVAQITVGFFALDRLRLDSADAARVAGIALTAVGVALILAQLL 275
+ +AV F+ V F DR D+ GI+L A G+ LAQ +
Sbjct: 210 TVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATT----IGISLAAFGILHSLAQAM 265

Query: 276 VRRL---DWPPPRLIRVGGLVAAIGFAAVCLADSPPLLWLAFFVAAAGMGWVFPAVSALN 332
+ R + +G + G+ + A + + + A+G G PA+ A+
Sbjct: 266 ITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASG-GIGMPALQAML 324

Query: 333 ANAVRAEEQGAAAGTLVAVHGFGLISGPLLGTLLH 367
+ V E QG G+L A+ I GPLL T ++
Sbjct: 325 SRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIY 359



Score = 36.3 bits (84), Expect = 2e-04
Identities = 36/140 (25%), Positives = 56/140 (40%), Gaps = 7/140 (5%)

Query: 251 SADAARVAGIALTAVGVA-LILAQLLVRRLDWPPPRLIRVGGLV-AAIGFAAVCLADSPP 308
S D GI L + A +L D R + + L AA+ +A + A P
Sbjct: 38 SNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATA---P 94

Query: 309 LLWLAFF--VAAAGMGWVFPAVSALNANAVRAEEQGAAAGTLVAVHGFGLISGPLLGTLL 366
LW+ + + A G A A+ +E+ G + A GFG+++GP+LG L+
Sbjct: 95 FLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLM 154

Query: 367 HQLDSRAPYALVGLLLALAA 386
AP+ L L
Sbjct: 155 GGFSPHAPFFAAAALNGLNF 174


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03228  ISCHRISMTASE  352    e-126    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 352 bits (904), Expect = e-126
Identities = 103/207 (49%), Positives = 137/207 (66%), Gaps = 2/207 (0%)

Query: 3 GIPEITAYPLPTAQQLPANLARWSLEPRRAVLLVHDMQRYFLRPLPESLRA--ELVANAA 60
IP I Y +PTA +P N W +P RAVLL+HDMQ YF+ EL AN
Sbjct: 2 AIPAIQPYQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANIR 61

Query: 61 RLRRWCVEQGVQIAYTAQPGSMTEEQRGLLKDFWGPGMRASPADREVVEELAPGPDDWLL 120
+L+ CV+ G+ + YTAQPGS + R LL DFWGPG+ + P + +++ ELAP DD +L
Sbjct: 62 KLKNQCVQLGIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLVL 121

Query: 121 TKWRYSAFFHSDLLQRMRAAGRDQLVLCGVYAHVGVLISTVDAYSNDIQPFLVADAIADF 180
TKWRYSAF ++LL+ MR GRDQL++ G+YAH+G L++ +A+ DI+ F V DA+ADF
Sbjct: 122 TKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVADF 181

Query: 181 SEAHHRMALEYAASRCAMVVTTDEVLE 207
S H+MALEYAA RCA V TD +L+
Sbjct: 182 SLEKHQMALEYAAGRCAFTVMTDSLLD 208


42  PAKAF_03252  PAKAF_03280  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_03252  5       13       2.081387   transcriptional regulator,Helix-turn-helix
PAKAF_03253  4       12       1.814209   putative hydrolase
PAKAF_03254  4       12       1.782461   secretion protein,Type I secretion system
PAKAF_03255  4       11       1.779192   ABC transporter ATP-binding protein/permease,
PAKAF_03256  4       12       1.650560   putative outer membrane protein,outer membrane
PAKAF_03257  5       14       1.632778   glycoprotein,Cna protein B-type domain
PAKAF_03258  0       10       0.725825   cation transporter
PAKAF_03259  0       12       1.396495   hypothetical protein
PAKAF_03260  0       13       1.661569   LasA protease precursor
PAKAF_03261  -2      11       2.349576   hypothetical protein
PAKAF_03262  -1      9        1.964233   probable acyl carrier protein
PAKAF_03263  0       8        2.308646   secretion protein XqhA
PAKAF_03264  4       9        1.736005   XphA
PAKAF_03265  2       8        1.475099   hypothetical protein
PAKAF_03266  1       8        1.557906   ATP-dependent DNA helicase
PAKAF_03267  0       9        1.185448   VRR-NUC domain-containing protein
PAKAF_03268  -1      11       1.333593   probable transcriptional regulator
PAKAF_03269  0       10       1.071806   molybdate-binding periplasmic protein precursor
PAKAF_03270  0       10       1.200527   molybdenum transport protein ModB
PAKAF_03271  2       9        0.318951   molybdenum transport protein ModC
PAKAF_03272  3       10       0.176931   class I SAM-dependent methyltransferase
PAKAF_03273  3       11       0.368259   probable transcriptional regulator
PAKAF_03274  2       9        0.427954   streptomycin 3''-phosphotransferase
PAKAF_03276  3       10       0.088506   DUF808 domain-containing protein
PAKAF_03277  3       9        0.083292   probable cytochrome oxidase subunit
PAKAF_03278  1       8        1.667501   hypothetical protein
PAKAF_03279  2       8        1.658534   HPP family protein
PAKAF_03280  2       9        2.031279   probable transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
PAKAF_03254  RTXTOXIND  296    9e-99    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 296 bits (759), Expect = 9e-99
Identities = 83/416 (19%), Positives = 179/416 (43%), Gaps = 53/416 (12%)

Query: 24 PVYRPLLWTLLGCVLLFIGWAAWAQLDEVTRGDGRVVPFSRIQKIQSLEGGILDRLLVKE 83
R + + ++G +++ + Q++ V +G++ R ++I+ +E I+ ++VKE
Sbjct: 55 RRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKE 114

Query: 84 GDLVEVGQPLVRLDETRFLTNFQESANQASVLRAAIARLDAEVLGKKSIEFPPDVDPEGP 143
G+ V G L++L + ++ + R R + + P P+ P
Sbjct: 115 GESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEP 174

Query: 144 LARSERELFKSRRDKLVE-----------------------------GTQAIQRQIHLAQ 174
++ E R L++ + + +
Sbjct: 175 YFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEK 234

Query: 175 SQLDLVRPLVAKRAVSQMEALK-------LSQDIATLSGKLTELKS-------------- 213
S+LD L+ K+A+++ L+ ++ +L +++S
Sbjct: 235 SRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQ 294

Query: 214 TYFQDAYTERAQRKADLSALEPIVQQRQDQLRRTEILSPVRGRVNTVLINTRGGVIQPGE 273
+ + + Q ++ L + + +++ + + I +PV +V + ++T GGV+ E
Sbjct: 295 LFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE 354

Query: 274 PIMEVIPVEERLLVEAKIKPRDVAFLVPGMPAKVKITAYDYTIYGDLKGTLEQISADTIE 333
+M ++P ++ L V A ++ +D+ F+ G A +K+ A+ YT YG L G ++ I+ D IE
Sbjct: 355 TLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIE 414

Query: 334 EDTPHGKESYYQVLIKTDGSQLKRGEEVLPIIPGMVAEVDILSGKRSVLNYLLRPL 389
+ + V+I + + L G + +P+ GM +I +G RSV++YLL PL
Sbjct: 415 DQRLG---LVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISYLLSPL 467


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
PAKAF_03256  RTXTOXIND  31     0.009    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 31.0 bits (70), Expect = 0.009
Identities = 24/162 (14%), Positives = 49/162 (30%), Gaps = 18/162 (11%)

Query: 250 DANVAEAEVREAKASLLPQLNLEASALRREIGGHPESDSVVSLRFRMDTFQGLSNFRRPT 309
A AEA+ + ++SLL R +I S+ L +
Sbjct: 128 TALGAEADTLKTQSSLL---QARLEQTRYQI-------LSRSIELNKLPELKLPDEPYFQ 177

Query: 310 AAQQRLESAKWSADAMQRD-IRRQLQNLFDNGDTLRWREQSLTQQVTESEQVGELYREQ- 367
+ S Q + Q N D R ++ ++ E + + + +
Sbjct: 178 NVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRL 237

Query: 368 ------FEVGRRDVIDLLNVQRERFEAERQLINLRIERKRIE 403
+L + + EA +L + + ++IE
Sbjct: 238 DDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIE 279


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_03257  INTIMIN  49     2e-07    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 49.3 bits (117), Expect = 2e-07
Identities = 87/447 (19%), Positives = 141/447 (31%), Gaps = 59/447 (13%)

Query: 1230 TGPQAATTVDAVAPPAPVIDPSNGTTISGTAEAGAKVILTDGNGNPIGETTADGSGNWSF 1289
+G Q+A A+ P V SN ++ A D NGN S N
Sbjct: 502 SGSQSAQDYQAILPAY-VQGGSNVYKVTARAY--------DRNGN---------SSNNVL 543

Query: 1290 TPATPLANGTVVNAVA---------QDPAGNTGPQGSTTVDAVAPNTPVVNPSNGNLLNG 1340
T L+NG VV+ V A T T P + N+++G
Sbjct: 544 LTITVLSNGQVVDQVGVTDFTADKTSAKADGTEAITYTATVKKNGVAQANVPVSFNIVSG 603

Query: 1341 TAEPGSTVTLTDGNGNPIGQTTADGSGNWSFTPGSQLPNGTVVNVTASDAAGNTSLPATT 1400
TA + T+G+G S PG VV+ ++ + A
Sbjct: 604 TAVLSANSANTNGSGKATVTLK-------SDKPG-----QVVVSAKTAEMTSALNANAVI 651

Query: 1401 TVDSSLPSIPQVDPSNGSVISGTADAGNTIIITDGNGNPIG--QVTADGS-GNWSFTPGI 1457
VD + SI ++ + ++ DA + P+ +VT + G S +
Sbjct: 652 FVDQTKASITEIKADKTTAVANGQDAITYTVKVMKGDKPVSNQEVTFTTTLGKLSNSTEK 711

Query: 1458 PLPDG-TVVNVVARSPSNVDSAPAV--ITVDGVAP-----AAPVIDPSNGTEISGTAEAG 1509
+G V + + +P + V + VD AP ID N EI GT G
Sbjct: 712 TDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGN-IEIVGTGVKG 770

Query: 1510 ATVILTDGGGNPIGQATADGSGNWTFTPSTPLANGTVINAVAQDPAGNTSGPASVTVDAI 1569
+ G +A+ G+G +T+ + P ++ + SV
Sbjct: 771 KLPTVWLQYGQVNLKASG-GNGKYTWRSANPAIASVDASSGQVTLKEKGTTTISVISSDN 829

Query: 1570 APPAPVINPSNGVVISG------TAEAGATVILTDGNGNPIGQVTADGSGNWSFTPGTPL 1623
I N +++ +A T G + W
Sbjct: 830 QTATYTIATPNSLIVPNMSKRVTYNDAVNTCKNFGGKLPSSQNELENVFKAWGAANKYEY 889

Query: 1624 ANGS-VINALAQDAAGNNSSPTSATVD 1649
S I + Q A + S ++T D
Sbjct: 890 YKSSQTIISWVQQTAQDAKSGVASTYD 916



Score = 44.3 bits (104), Expect = 8e-06
Identities = 72/346 (20%), Positives = 116/346 (33%), Gaps = 51/346 (14%)

Query: 609 TVTLTDGNGNPIGQVTADGSGNWTFTPSTPLPNGTVVNA------TATDPSGNASSPASV 662
T D NGN S N T L NG VV+ TA S A ++
Sbjct: 528 TARAYDRNGN---------SSNNVLLTITVLSNGQVVDQVGVTDFTADKTSAKADGTEAI 578

Query: 663 TVDAVAP---ATPVVNPSNGTTLSGTAEPGATVTLTDGNGNPIGQVTADGSGNWSFTPTT 719
T A P + +SGTA A T+G+ G + T +
Sbjct: 579 TYTATVKKNGVAQANVPVSFNIVSGTAVLSANSANTNGS------------GKATVTLKS 626

Query: 720 PLPNGTVVNATATDASGNTSAGSSVTVDSVAPATPVINPSNGTTLSGTAEPGSSVTLTDG 779
P VV+A + + +A + + VD + I T ++ + +
Sbjct: 627 DKPGQVVVSAKTAEMTSALNANAVIFVDQTKASITEIKADKTTAVANGQDAITYTVKVMK 686

Query: 780 NGNPIGQVTADGSGNWSFTPSTPLADGTVVNATATDPAGNTSGQGSTTVDGVAPTTPTVN 839
P+ N T +T L + TD G ++T G + + V+
Sbjct: 687 GDKPV--------SNQEVTFTTTLGKLSNSTEK-TDTNGYAKVTLTSTTPGKSLVSARVS 737

Query: 840 LSNGSSLSGTAEPGSTVILTDGNGNPIAEVTADGSGNWTYTPSTPIANGTVVNVVAQDAA 899
+ E +T+ + DGN + G+G P+ + G VN+ A
Sbjct: 738 DVAVDVKAPEVEFFTTLTIDDGN------IEIVGTGVKGKLPTVWLQYGQ-VNLKASGGN 790

Query: 900 GN----SSPGASVTVDSQAPAAPVVNPSNGTTLSGTAEPGATVTLT 941
G S+ A +VD+ + + TT+S + T T T
Sbjct: 791 GKYTWRSANPAIASVDASS-GQVTLKEKGTTTISVISSDNQTATYT 835



Score = 42.0 bits (98), Expect = 4e-05
Identities = 69/386 (17%), Positives = 126/386 (32%), Gaps = 60/386 (15%)

Query: 407 PAGNSSTPVTAEAPDFPDAPQVNASNGSVLSGTAEAGVTIVITDGNGNPIGQ-TSADANG 465
G++ VTA A D N+SN +L+ T + +V G + TSA A+G
Sbjct: 519 QGGSNVYKVTARAYDR----NGNSSNNVLLTITVLSNGQVVDQVGVTDFTADKTSAKADG 574

Query: 466 SWSFTPGSQLPDGTVVNVVARDAAGNSSPATSITVDGVAPNAPVVEPSNGSELSGTAEPG 525
+ + T + + V A P + + +SGTA
Sbjct: 575 TEAITYTATVKKNGV--------------------------AQANVPVSFNIVSGTAVLS 608

Query: 526 SSVTLTDGNGNPIGQTTADANGNWSFTPSTPLPDGTVVNVVARDAAGNSSPPASVTVDAV 585
++ T+G+G + T + P VV+ + + A + VD
Sbjct: 609 ANSANTNGSGK------------ATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQT 656

Query: 586 APATPTVDPSNGTTLSGTAEPGATVTLTDGNGNPIGQVTADGSGNWTFTPSTPLPNGTVV 645
+ + T ++ + P+ TFT + + +
Sbjct: 657 KASITEIKADKTTAVANGQDAITYTVKVMKGDKPVSN------QEVTFTTTLGKLSNSTE 710

Query: 646 NATATDPSGNASSPASVTVDAVAPATPVVNPSNGTTLSGTAEPGATVTLTDGNGNPIGQV 705
TD +G A + T + + V+ + E T+T+ DGN +G
Sbjct: 711 K---TDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIVGTG 767

Query: 706 TADGSGNWSFTPTTPLPNGTVVNATATDASGNTSAGSSVT-VDSVAPATPVINPSNGTTL 764
PT L G VN A+ +G + S+ + SV ++ + T
Sbjct: 768 VKGK------LPTVWLQYGQ-VNLKASGGNGKYTWRSANPAIASVDASSGQVTLKEKGTT 820

Query: 765 SGTAEPGSSVTLTDGNGNPIGQVTAD 790
+ + + T T P + +
Sbjct: 821 TISVISSDNQTATYTIATPNSLIVPN 846



Score = 41.6 bits (97), Expect = 5e-05
Identities = 59/352 (16%), Positives = 104/352 (29%), Gaps = 60/352 (17%)

Query: 839 NLSNGSSLSGTAEPGSTVILTDGNGNPIAEVT---ADGSGNWTYTPSTPIANGTVVNVVA 895
N SN L+ T V+ G + A+ T ADG+ TYT
Sbjct: 537 NSSNNVLLTITVLSNGQVVDQVGVTDFTADKTSAKADGTEAITYT--------------- 581

Query: 896 QDAAGNSSPGASVTVDSQAPAAPVVNPSNGTTLSGTAEPGATVTLTDGNGNPIGQVTADG 955
A+V + A A P + +SGTA A T+G+G +
Sbjct: 582 ----------ATVKKNGVAQAN---VPVSFNIVSGTAVLSANSANTNGSGKATVTLK--- 625

Query: 956 SGNWSFTPGTPLANGTVVNATASDPTGNTSAPASTTVDSVAPAAPVVNPSNGAEISGTAE 1015
S PG VV+A ++ T +A A VD + + ++ +
Sbjct: 626 ----SDKPG-----QVVVSAKTAEMTSALNANAVIFVDQTKASITEIKADKTTAVANGQD 676

Query: 1016 PGATVTLTDGSGNPIG--QVTADGS-GNWSFTPSTPLADGTVVNATATDPAGNTGGQGST 1072
P+ +VT + G S + +G A T T G+
Sbjct: 677 AITYTVKVMKGDKPVSNQEVTFTTTLGKLSNSTEKTDTNGY---AKVT-LTSTTPGKSLV 732

Query: 1073 TV----DAIAPATPTVNLSNGSSLSGTAEPGSTVILTDG------NGNPIAEVTADGSGN 1122
+ A+ P V ++ + + + G+G
Sbjct: 733 SARVSDVAVDVKAPEVEFFTTLTIDDGNIEIVGTGVKGKLPTVWLQYGQVNLKASGGNGK 792

Query: 1123 WTYTPSTPIANGTVVNVVAQDAAGNSSPPATVTVDSSAPPAPVINPSNGVVI 1174
+T+ + P + + +V + I N +++
Sbjct: 793 YTWRSANPAIASVDASSGQVTLKEKGTTTISVISSDNQTATYTIATPNSLIV 844



Score = 37.7 bits (87), Expect = 8e-04
Identities = 72/381 (18%), Positives = 123/381 (32%), Gaps = 63/381 (16%)

Query: 250 TTAPAPATDVQV-----TPGGSSVIGKAEPGSTVGVDTDGDGQPDTTVVVGPGGSFEVPL 304
+ A D Q GGS+V + D +G+ + + + V
Sbjct: 501 HSGSQSAQDYQAILPAYVQGGSNVY----KVTARAYDRNGNSSNNVLL------TITVLS 550

Query: 305 NPPLTNGETVTVIVTDPAG---NNSTPVTVEAPDTTAPAPATDVQVAPDGSSVTGNAEPG 361
N + + VT D + + +T A +V V+ + V+G A
Sbjct: 551 NGQVVDQVGVTDFTADKTSAKADGTEAITYTATVKKNGVAQANVPVS--FNIVSGTAVLS 608

Query: 362 ATVGVDTDGDGQPDTTVVVGPGGSFEVPLNPPLTNGETVTVIVTDPAGNSSTPVTAEAPD 421
A +T+G G+ T L + + V+V+ ++ + A A
Sbjct: 609 A-NSANTNGSGKATVT----------------LKSDKPGQVVVSAKTAEMTSALNANAVI 651

Query: 422 FPDAPQVNASNGSVLSGTAEAGVTIVIT-----DGNGNPIGQTSADANGSWSFTPGSQLP 476
F D + + + TA A IT P+ + S
Sbjct: 652 FVDQTKASITEIKADKTTAVANGQDAITYTVKVMKGDKPVSNQEVTFTT--TLGKLSNST 709

Query: 477 DGTVVNVVARDAAGNSSPATSITVDGVAPNAPVVEPSNGSELSGTAEPGSSVTLTDGNGN 536
+ T D G + + T G + + V + E +++T+ DGN
Sbjct: 710 EKT-------DTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIE 762

Query: 537 PIGQTTADANGNWSFTPSTPLPDGTVVNVVARDAAGN----SSPPASVTVDAVAPATPTV 592
+G P+ L G VN+ A G S+ PA +VDA + T+
Sbjct: 763 IVGTGVKGK------LPTVWLQYGQ-VNLKASGGNGKYTWRSANPAIASVDA-SSGQVTL 814

Query: 593 DPSNGTTLSGTAEPGATVTLT 613
TT+S + T T T
Sbjct: 815 KEKGTTTISVISSDNQTATYT 835



Score = 33.9 bits (77), Expect = 0.012
Identities = 40/267 (14%), Positives = 79/267 (29%), Gaps = 21/267 (7%)

Query: 1655 APVIDPSNGSVIAGTAEAGATVILTDGNGNPIGQVTADGSGNWSFTPGTPLSNGTVVNAV 1714
A P + ++++GTA A T+G+G + S PG VV+A
Sbjct: 590 AQANVPVSFNIVSGTAVLSANSANTNGSGKATVTLK-------SDKPG-----QVVVSAK 637

Query: 1715 AQDAAGNTSGPASTTVDSVAPAAPVIDPSNGSVIAGTAEAGATVILTDGGGNPIG---QA 1771
+ + A VD + I + +A +A + G P+
Sbjct: 638 TAEMTSALNANAVIFVDQTKASITEIKADKTTAVANGQDAITYTVKVMKGDKPVSNQEVT 697

Query: 1772 TADGSGNWTFTPSTPLANGTVINAVAQDPAGNTSGPASVTVDAIAPPAPVIDPSNGVVIS 1831
G + + NG + G + A V+ A+ AP ++ + I
Sbjct: 698 FTTTLGKLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTID 757

Query: 1832 GTAEAGATVILTDG------NGNPIGQVTADGSGNWSFTPGTPLANGSVINALAQDAAGN 1885
+ + + G+G +++ P ++
Sbjct: 758 DGNIEIVGTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIASVDASSGQVTLKEK 817

Query: 1886 TSGPASTTVDSVAPATPVLDPSNGTVI 1912
+ S AT + N ++
Sbjct: 818 GTTTISVISSDNQTATYTIATPNSLIV 844


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03263  BCTERIALGSPD  592    0.0      Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 592 bits (1528), Expect = 0.0
Identities = 197/591 (33%), Positives = 324/591 (54%), Gaps = 26/591 (4%)

Query: 44 EQWTINMKDAEIGDFIEQVSSISGQTFVVDPRVKGRVTVVSQARLSLAEVYQLFLSVLAT 103
E+++ + K +I +FI VS +T ++DP V+G +TV S L+ + YQ FLSVL
Sbjct: 28 EEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDMLNEEQYYQFFLSVLDV 87

Query: 104 HGYAVLPQGDQA-RIVPNMEARQDAAQKTVRDGPG---SLETRVVQAQQTSVAELIPMIR 159
+G+AV+ + ++V + +A+ A PG + TRVV + +L P++R
Sbjct: 88 YGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVVTRVVPLTNVAARDLAPLLR 147

Query: 160 PLVPAHGHLAAV--PSANALIVSDRRANIERIEAIVRSLDRAGEHDYSIYDMRHAWVAEI 217
L G + V +N L+++ R A I+R+ IV +D AG+ + A A++
Sbjct: 148 QLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGDRSVVTVPLSWASAADV 207

Query: 218 AEV---LDRSVTPAAGKSAATVQVLADSRSNRLVLLGPPQARARLLRLAQSLDVPSSRSA 274
++ L++ + +A + V+AD R+N +++ G P +R R++ + + LD +
Sbjct: 208 VKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSRQRIIAMIKQLDRQQATQG 267

Query: 275 NSRVIRLRHGDAKTLAVTLGEIGESLHGER-GQDGRGSGKRGLLVRADESLNALVILADP 333
N++VI L++ A L L I ++ E+ + + ++++A NAL++ A P
Sbjct: 268 NTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPVAALDKNIIIKAHGQTNALIVTAAP 327

Query: 334 EDVGLLEDIVRQLDVPRAQLLVEAAIVELSGEIGDALGVQWALRSGHVAGGAGFADSGLS 393
+ + LE ++ QLD+ R Q+LVEA I E+ G LG+QWA AG F +SGL
Sbjct: 328 DVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWA---NKNAGMTQFTNSGLP 384

Query: 394 IGTLLGAL----QAGKPPAELP------DGAIVGLGSRDFGALVTALSRNSRSNLLSTPS 443
I T + + G + L +G G ++ L+TALS ++++++L+TPS
Sbjct: 385 ISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPS 444

Query: 444 LLTLDNQKAEILVGQNVPFQTGSYTTSASGSSNPFTTVERKDIGVTLKVTPHIGEDRMLR 503
++TLDN +A VGQ VP TGS TTS N F TVERK +G+ LKV P I E +
Sbjct: 445 IVTLDNMEATFNVGQEVPVLTGSQTTS---GDNIFNTVERKTVGIKLKVKPQINEGDSVL 501

Query: 504 LEIEQEISSIAPTATLAAKAVDLVTNKRSIKSTVLADDGQVIVLGGLIQDDLQRSDSRVP 563
LEIEQE+SS+A A+ + + N R++ + VL G+ +V+GGL+ + + +VP
Sbjct: 502 LEIEQEVSSVADAASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVP 561

Query: 564 LLGDIPGVGRLFRSSRETRVKRNLMVFLRPSIVRDAAGLERISHGRYRSIQ 614
LLGDIP +G LFRS+ + KRNLM+F+RP+++RD + S G+Y +
Sbjct: 562 LLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYRQASSGQYTAFN 612


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_03268  HTHTETR  68     5e-16    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 67.7 bits (165), Expect = 5e-16
Identities = 25/152 (16%), Positives = 56/152 (36%), Gaps = 8/152 (5%)

Query: 5 RQRNLQLILDAACEVFADCGFSAARLSDVAERAGVAKANVLYYYRSKAQLYEAVLDSIVE 64
Q Q ILD A +F+ G S+ L ++A+ AGV + + ++++ K+ L+ + +
Sbjct: 9 AQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSES 68

Query: 65 PLLEASRPFAGDQP--PAEALRAYVDNKMRIGAERPHAARVFSCEIMRGAPRMPAPLLER 122
+ E + P P LR + + + + + ++++
Sbjct: 69 NIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQ 128

Query: 123 LDAQAERN-----AERIRQWIDEG-LLAPLDP 148
+ ++ I+ L A L
Sbjct: 129 AQRNLCLESYDRIEQTLKHCIEAKMLPADLMT 160


43  PAKAF_03327  PAKAF_03342  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_03327  1       9        -3.249302  probable permease of ABC transporter
PAKAF_03328  1       11       -4.142364  probable ATP-binding component of ABC
PAKAF_03329  1       13       -4.977901  NADH-dependent enoyl-ACP reductase
PAKAF_03330  2       15       -5.779944  peptidyl-prolyl cis-trans isomerase D
PAKAF_03332  1       16       -6.048299  *DNA-binding protein HU
PAKAF_03333  1       13       -5.147926  Lon protease
PAKAF_03334  0       16       -3.700661  ATP-dependent Clp protease ATP-binding subunit
PAKAF_03335  1       16       -2.660519  ATP-dependent Clp protease proteolytic subunit
PAKAF_03336  2       16       -2.381319  trigger factor
PAKAF_03337  -2      16       -0.599764  two-component response regulator, ParR
PAKAF_03338  -2      11       -0.414832  two-component sensor, ParS
PAKAF_03339  -2      9        -1.008756  serine hydrolase
PAKAF_03341  1       11       -2.410324  hypothetical protein
PAKAF_03342  2       10       -2.066214  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03329  DHBDHDRGNASE  63     9e-14    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 62.8 bits (152), Expect = 9e-14
Identities = 61/262 (23%), Positives = 99/262 (37%), Gaps = 23/262 (8%)

Query: 4 LTGKRALIVGVASKLSIASGIAAAMHREGAELAFTYQNDKLRGRVEEFASGWGSRPELCF 63
+ GK A I G A I +A + +GA +A N + +V E F
Sbjct: 6 IEGKIAFITGAAQ--GIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE-AF 62

Query: 64 PCDVADDSQIEAVFAALGKHWDGLDIIVHSVGF---APGDQL-DGDFTAVTTREGFRIAH 119
P DV D + I+ + A + + +DI+V+ G L D ++ A + +
Sbjct: 63 PADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGV-- 120

Query: 120 DISAYSFIALAKAGREMMKGRNGSLLTLSYLGAERTMPNYNVMGMAKASLEAGVRYLAGS 179
F A + MM R+GS++T+ A + +KA+ + L
Sbjct: 121 ------FNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLE 174

Query: 180 LGAEGTRVNAVSAGPIRTLAASGI--------KSFRKMLAANERQTPLRRNVTIEEVGNA 231
L R N VS G T + + + L + PL++ ++ +A
Sbjct: 175 LAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADA 234

Query: 232 GAFLCSDLASGISGEILYVDGG 253
FL S A I+ L VDGG
Sbjct: 235 VLFLVSGQAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_03332  DNABINDINGHU  117  1e-38  Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 117 bits (296), Expect = 1e-38
Identities = 49/88 (55%), Positives = 64/88 (72%)

Query: 2 NKSELIDAIAASADIPKAVAGRALDAVIESVTGALKAGDSVVLVGFGTFAVKERAARTGR 61
NK +LI +A + ++ K + A+DAV +V+ L G+ V L+GFG F V+ERAAR GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 62 NPQTGKPIKIAAAKIPGFKAGKALKDAV 89
NPQTG+ IKI A+K+P FKAGKALKDAV
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_03337  HTHFIS  77  2e-18  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 77.2 bits (190), Expect = 2e-18
Identities = 31/132 (23%), Positives = 63/132 (47%), Gaps = 5/132 (3%)

Query: 7 SKVLLVEDDQKLARLIASFLSQHGFEVRQVHRGDAAFAAFLDFKPQVVVLDLMLPGQNGL 66
+ +L+ +DD + ++ LS+ G++VR + +VV D+++P +N
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 67 QVCREIRRV-ANLPILILTAQEDDLDHILGLESGADDYVIKPIEPPVLLARLRALM---- 121
+ I++ +LP+L+++AQ + I E GA DY+ KP + L+ + +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 122 RRHAPLPASPES 133
RR + L +
Sbjct: 124 RRPSKLEDDSQD 135


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_03338  PF06580  29  0.041  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 28.7 bits (64), Expect = 0.041
Identities = 20/123 (16%), Positives = 38/123 (30%), Gaps = 31/123 (25%)

Query: 315 QIRIEPRFMARAVINLL-----RNAIRHAHS------RVEIALLDQGDSCQIRVNDDGPG 363
+ +I P M V +L N I+H + ++ + + + V + G
Sbjct: 243 ENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSL 302

Query: 364 IPADARQKIFEPFSRLDDSRDRSTGGFGLGLAIVR-RVAQWHGG-YAEALETPQGGASFR 421
+ ++ G GL VR R+ +G L QG +
Sbjct: 303 ALKNTKE------------------STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAM 344

Query: 422 LTW 424
+
Sbjct: 345 VLI 347


44  PAKAF_03366  PAKAF_03378  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias     %GCBias  Product
PAKAF_03366       1       22   -4.073056  uroporphyrin-III C-methyltransferase
PAKAF_03367       2       21   -4.983825  Major porin and structural outer membrane porin
PAKAF_03368      -1       23   -2.807925  ECF sigma factor SigX
PAKAF_03369       0       13   -2.015547  conserved cytoplasmic membrane protein, CmpX
PAKAF_03370       2       12   -2.768222  CrfX protein
PAKAF_03371       2       13   -2.979555  CmaX protein
PAKAF_03372       2       13   -3.551246  probable methyltransferase
PAKAF_03373       1       11   -3.484490  EstX
PAKAF_03374       1       13   -3.640580  phosphoenolpyruvate synthase
PAKAF_03375       0       10   -3.966896  kinase/pyrophosphorylase
PAKAF_03376       0       10   -3.493953  ATP-dependent zinc protease
PAKAF_03377       0        9   -3.452270  hypothetical protein
PAKAF_03378       1       10   -3.129719  alpha-L-glutamate ligase-like protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_03367  OMPADOMAIN  163  1e-49  OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 163 bits (415), Expect = 1e-49
Identities = 89/381 (23%), Positives = 140/381 (36%), Gaps = 79/381 (20%)

Query: 1 MKLKNTLGVVIGSLVAASAMNAFAQGQNSVEIEAFGKRYFTD------SVRNMKNADLYG 54
MK K + + + A+ A + G + D + +N G
Sbjct: 1 MK-KTAIAIAVALAGFATVAQAAPKDNTWYTGAKLGWSQYHDTGFINNNGPTHENQLGAG 59

Query: 55 GSIGYFLTDDVELALSYGEY--HDVRGTYETGNKKVHGNLTSLDAIYHFGTPGVGLRPYV 112
GY + V + Y +G+ E G K G + Y T + + +
Sbjct: 60 AFGGYQVNPYVGFEMGYDWLGRMPYKGSVENGAYKAQGVQLTAKLGYPI-TDDLDIYTRL 118

Query: 113 SAGL----AHQNITNINSDSQ-------GRQQMTMANIGAGLKYYFTENFFAKASLDGQY 161
+ N+ N D+ G + I L+Y +T N ++
Sbjct: 119 GGMVWRADTKSNVYGKNHDTGVSPVFAGGVEYAITPEIATRLEYQWTNNIGDAHTIG--- 175

Query: 162 GLEKRDNGHQGEWMAGLGVGFNFGGSKAAP----APEPVADVCSDSDNDGVCDNVDKCPD 217
+ DNG M LGV + FG +AAP AP P +V +
Sbjct: 176 --TRPDNG-----MLSLGVSYRFGQGEAAPVVAPAPAPAPEVQTKH-------------- 214

Query: 218 TPANVTVDANGCPAVAEVVRVQLDVKFDFDKSKVKENSYADIKNLADFMKQY--PSTSTT 275
++ DV F+F+K+ +K A + L + S
Sbjct: 215 ------------------FTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVV 256

Query: 276 VEGHTDSVGTDAYNQKLSERRANAVRDVLVNEYGVEGGRVNAVGYGESRPVADNATAEGR 335
V G+TD +G+DAYNQ LSERRA +V D L+++ G+ +++A G GES PV N +
Sbjct: 257 VLGYTDRIGSDAYNQGLSERRAQSVVDYLISK-GIPADKISARGMGESNPVTGNTCDNVK 315

Query: 336 ---------AINRRVEAEVEA 347
A +RRVE EV+
Sbjct: 316 QRAALIDCLAPDRRVEIEVKG 336


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_03374  PHPHTRNFRASE  317  e-101  Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase

signature.
Length = 572

Score = 317 bits (815), Expect = e-101
Identities = 113/446 (25%), Positives = 191/446 (42%), Gaps = 68/446 (15%)

Query: 332 RAIGQRI-GAGPVKVINDVSEMDKVQPGDVLVSDMTDPDWEPVMK-RASAIVTNRGGRTC 389
R + +R+ G ++ + + ++ D+T D + K T+ GGRT
Sbjct: 132 RDVSKRVLGHLIGVETGSLATIA--EETVIIAEDLTPSDTAQLNKQFVKGFATDIGGRTS 189

Query: 390 HAAIIARELGIPAVVGCGNATQILQDGQGVTVSCAEG---------DTGFIFEGELGFDV 440
H+AI++R L IPAVVG T+ +Q G V V EG + E F+
Sbjct: 190 HSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYEEKRAAFEK 249

Query: 441 RKNSVDAMPDLP--------FKIMMNVGNPDRAFDFAQLPNEGVGLARLEFIINRMIGVH 492
+K + P ++ N+G P EG+GL R EF+
Sbjct: 250 QKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFLY------- 302

Query: 493 PKALLNFAGLPADIKESVEKRIAGYPDPVGFYVEKLVEGISTLAAAFWPKKVIVRLSDFK 552
++ LP + E++ Y + + K V++R D
Sbjct: 303 ----MDRDQLP-----TEEEQFEAYKE---------------VVQRMDGKPVVIRTLDIG 338

Query: 553 SNEYANLIGGKLYEPEEENPMLGFRGASRYISESFRDCFELECRALKKVRNEMGLTNVEI 612
++ + L P+E NP LGFR + + +D F + RAL + N+++
Sbjct: 339 GDKELSY----LQLPKELNPFLGFRAIRLCLEK--QDIFRTQLRALLRAS---TYGNLKV 389

Query: 613 MVPFVRTLGEASQVVELLAGNGLKRGENG------LKVIMMCELPSNALLADEFLEFFDG 666
M P + TL E Q ++ K G ++V +M E+PS A+ A+ F + D
Sbjct: 390 MFPMIATLEELRQAKAIMQEEKDKLLSEGVDVSDSIEVGIMVEIPSTAVAANLFAKEVDF 449

Query: 667 FSIGSNDLTQLTLGLDRDSGIVAHLFDERNPAVKKLLANAIAACNKAGKYIGICGQGPSD 726
FSIG+NDL Q T+ DR + V++L+ +PA+ +L+ I A + GK++G+CG+ D
Sbjct: 450 FSIGTNDLIQYTMAADRMNERVSYLYQPYHPAILRLVDMVIKAAHSEGKWVGMCGEMAGD 509

Query: 727 HPDLARWLMEQGIESVSLNPDSVLDT 752
L+ G++ S++ S+L
Sbjct: 510 -EVAIPLLLGLGLDEFSMSATSILPA 534


45  PAKAF_03393  PAKAF_03409  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias     %GCBias  Product
PAKAF_03393       2       12   -2.577374  thioredoxin family protein
PAKAF_03394       1       11   -2.342800  phospho-2-dehydro-3-deoxyheptonate aldolase
PAKAF_03395       0       15   -1.093796  N-acetyltransferase
PAKAF_03396       1       14   -0.747272  probable enoyl-CoA hydratase/isomerase
PAKAF_03397      -1       13   -0.572906  hypothetical protein
PAKAF_03398      -2       13   -0.596962  macro domain-containing protein
PAKAF_03399      -1       11    0.756017  hypothetical protein
PAKAF_03400       0       12    0.744715  hypothetical protein
PAKAF_03401       3       13    0.704430  hypothetical protein
PAKAF_03402       3        9    0.030104  probable amidotransferase
PAKAF_03403       3        8    0.288960  hypothetical protein
PAKAF_03404       4        9    0.222020  hypothetical protein
PAKAF_03405       2        9   -0.555092  probable oxidoreductase
PAKAF_03406       1        7   -0.246042  probable transcriptional regulator
PAKAF_03407      -1        9   -0.259471  probable 3-hydroxyacyl-CoA dehydrogenase
PAKAF_03408       1       11    0.809085  probable acyl-CoA thiolase
PAKAF_03409       2       13    0.831314  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_03398  CHLAMIDIAOM6  29  0.015  Chlamydia cysteine-rich outer membrane protein 6 si...
		>CHLAMIDIAOM6#Chlamydia cysteine-rich outer membrane protein 6

signature.
Length = 547

Score = 28.5 bits (63), Expect = 0.015
Identities = 16/40 (40%), Positives = 21/40 (52%), Gaps = 9/40 (22%)

Query: 122 PCVATGVGGLDWSEV-KP----LVVRHLGDLEIPVILYEV 156
PCV + G DWS V KP + V + GDL +L +V
Sbjct: 315 PCVQVSIAGADWSYVCKPVEYVISVSNPGDL----VLRDV 350


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_03402  PF05704  29  0.012  Capsular polysaccharide synthesis protein
		>PF05704#Capsular polysaccharide synthesis protein

Length = 307

Score = 29.5 bits (66), Expect = 0.012
Identities = 14/95 (14%), Positives = 34/95 (35%), Gaps = 20/95 (21%)

Query: 10 TDILRPELIERYEGY---------GRMFQQLFAKQPIAAEFVIYNVVEGRYPADDERFDA 60
+DILR L+ +Y G ++ + F+ + ++
Sbjct: 135 SDILRLFLLCKYGGLWIDATVYMFDKVPNYIVESN----RFMFQSS---FLESETTHISN 187

Query: 61 YLVTGSKADSFGPDPWIQTLKTFLLDRYERGDKLL 95
+L+ + DP++ LK ++ ++ +K
Sbjct: 188 WLIFVKSKN----DPFLVGLKNSMVTYLKKKEKPA 218


46  PAKAF_03432  PAKAF_03451  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias     %GCBias  Product
PAKAF_03432       2       24   -2.739046  transcriptional regulator ExsA
PAKAF_03433       2       21   -1.016333  exoenzyme S synthesis protein B
PAKAF_03434       2       18   -1.497757  ExsE
PAKAF_03435       2       17   -1.507800  ExsC, exoenzyme S synthesis protein C precursor
PAKAF_03436       2       15   -0.672302  Translocator outer membrane protein PopD
PAKAF_03437       3       13   -0.989206  translocator protein PopB
PAKAF_03438       3       14   -0.536532  regulatory protein PcrH
PAKAF_03439       4       15    0.044286  type III secretion protein PcrV
PAKAF_03440       5       16    1.066539  regulator in type III secretion
PAKAF_03441       5       15    0.560699  transcriptional regulator protein PcrR
PAKAF_03442       5       15    0.768666  type III secretory apparatus protein PcrD
PAKAF_03443       3       11    2.437314  conserved hypothetical protein in type III
PAKAF_03444       2       11    2.325628  YscX family type III secretion protein
PAKAF_03445       3       15    2.779575  type III secretion chaperone SycN
PAKAF_03446       2       16    2.849249  TyeA family type III secretion system gatekeeper
PAKAF_03447       1       15    2.400556  Type III secretion outer membrane protein PopN
PAKAF_03448       0       15    2.055314  ATP synthase in type III secretion system
PAKAF_03449       3       17    1.532271  translocation protein in type III secretion
PAKAF_03450       3       17    0.472186  translocation protein in type III secretion
PAKAF_03451       2       11   -1.138468  translocation protein in type III secretion
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_03435  PF05932  47  6e-10  Tir chaperone protein (CesT)
		>PF05932#Tir chaperone protein (CesT)

Length = 127

Score = 47.5 bits (113), Expect = 6e-10
Identities = 27/118 (22%), Positives = 49/118 (41%), Gaps = 4/118 (3%)

Query: 10 LLAEFAGRIGLPSLSLDEEGMASLLFDEQVGVTLLLLAERERLLLEADVAGIDVLGEGIF 69
LL +F+ + + L D+ G +++ D +TL RERLLL + +
Sbjct: 9 LLDDFSRSLEMQPLVFDDHGTCNMIIDNTFALTLSCDYARERLLLIGLLEP---HKDIPQ 65

Query: 70 RQLASFNRHWHRFDLH-FGFDELTGKVQLYAQILAAQLTLECFEATLANLLDHAEFWQ 126
+ L + + G DE +G Y I +L++ + +A LL+ W+
Sbjct: 66 QCLLAGALNPLLNAGPGLGLDEKSGLYHAYQSIPREKLSVPTLKREMAGLLEWMRGWR 123


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_03436  PF05844  385  e-137  YopD protein
		>PF05844#YopD protein

Length = 295

Score = 385 bits (989), Expect = e-137
Identities = 291/295 (98%), Positives = 293/295 (99%)

Query: 1 MIDTQYSLAATQAAIPSEPIAPGAAGRSVGTPQAAADLPQVPAARADRVELNAPRQVLDP 60
MIDTQYSLAATQAAIPSEPIAPGAAGRSVGTPQAAA+LPQVPAARADRVELNAPRQVLDP
Sbjct: 1 MIDTQYSLAATQAAIPSEPIAPGAAGRSVGTPQAAAELPQVPAARADRVELNAPRQVLDP 60

Query: 61 VRMEAAGSELDSSVELLLILFRIAQKARELGVLQRDNENQSIIHAQKAQVDEMRSGATLM 120
VRMEAAGSELDSSVELLLILFRIAQKARELGVLQRDNENQ+IIHAQKAQVDEMRSGATLM
Sbjct: 61 VRMEAAGSELDSSVELLLILFRIAQKARELGVLQRDNENQAIIHAQKAQVDEMRSGATLM 120

Query: 121 IAMAVIAGVGALASAVVGSLGALKNGKAISQEKTLQKNIDGRNELIDAKMQALGKTSDED 180
IAMAVIAGVGALASAVVGSLGALKNGKAISQEKTLQKNIDGRNELIDAKMQALGKTSDED
Sbjct: 121 IAMAVIAGVGALASAVVGSLGALKNGKAISQEKTLQKNIDGRNELIDAKMQALGKTSDED 180

Query: 181 RKIVGKVWAADQVQDSVALRAAGRAFESRNGALQVANTVIQSFVQMANASVQVRQGESQA 240
RKIVGKVWAADQ QDSVALRAAGRAFESRNGALQVANTVIQSFVQMANASVQVRQGESQA
Sbjct: 181 RKIVGKVWAADQAQDSVALRAAGRAFESRNGALQVANTVIQSFVQMANASVQVRQGESQA 240

Query: 241 SAREGEVNATIGQSQKQKVEDQMSFDAGFMKDVLQLIQQYTQSHNQAWRAAAGVV 295
SARE EVNATIGQSQKQKVEDQMSFDAGFMKDVLQLIQQYTQSHNQAWRAAAGVV
Sbjct: 241 SAREEEVNATIGQSQKQKVEDQMSFDAGFMKDVLQLIQQYTQSHNQAWRAAAGVV 295


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_03438  SYCDCHAPRONE  202  2e-69  Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD

chaperone signature.
Length = 168

Score = 202 bits (514), Expect = 2e-69
Identities = 95/166 (57%), Positives = 126/166 (75%)

Query: 3 QQATPSDTDQQQALEAFLRDGGTLAMLRGLSEDTLEQLYALGFNQYQAGKWDDAQKIFQA 62
QQ T + Q A+E+FL+ GGT+AML +S DTLEQLY+L FNQYQ+GK++DA K+FQA
Sbjct: 2 QQETTDTQEYQLAMESFLKGGGTIAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQA 61

Query: 63 LCMLDHYDARYFLGLGACRQSLGLYEQALQSYSYGALMDINEPRFPFHAAECHLQLGDLD 122
LC+LDHYD+R+FLGLGACRQ++G Y+ A+ SYSYGA+MDI EPRFPFHAAEC LQ G+L
Sbjct: 62 LCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELA 121

Query: 123 GAESGFYSARALAAAQPAHEALAARAGAMLEAVTARKDRTYESDNA 168
AESG + A+ L A + + L+ R +MLEA+ +K+ +E +
Sbjct: 122 EAESGLFLAQELIADKTEFKELSTRVSSMLEAIKLKKEMEHECVDN 167


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_03439  LCRVANTIGEN  344  e-121  Low calcium response V antigen signature.
		>LCRVANTIGEN#Low calcium response V antigen signature.

Length = 326

Score = 344 bits (884), Expect = e-121
Identities = 115/296 (38%), Positives = 171/296 (57%), Gaps = 32/296 (10%)

Query: 25 ASAEQEELLALLRSERIVLAHAGQPLSEAQVL-------------KALAWLLAANPSAPP 71
S+ EEL+ L++ + I ++ P +++V K LA+ L +
Sbjct: 28 GSSVLEELVQLVKDKNIDISIKYDPRKDSEVFANRVITDDIELLKKILAYFLPEDAILKG 87

Query: 72 GQ-------GLEVLREVLQARRQPGAQWDLREFLVSAYFSLHG-RLDEDVIGVYKDVLQT 123
G G++ ++E L++ P QW+LR F+ +FSL R+D+D++ V D +
Sbjct: 88 GHYDNQLQNGIKRVKEFLES--SPNTQWELRAFMAVMHFSLTADRIDDDILKVIVDSMNH 145

Query: 124 QDGKRKALLDELKALTAELKVYSVIQSQINAALSAKQGIRIDAGGIDLVDPTLYGYAVGD 183
R L +EL LTAELK+YSVIQ++IN LS+ I I I+L+D LYGY +
Sbjct: 146 HGDARSKLREELAELTAELKIYSVIQAEINKHLSSSGTINIHDKSINLMDKNLYGYT-DE 204

Query: 184 PRWKDSPEYALLSNLDTFSGKL--------SIKDFLSGSPKQSGELKGLSDEYPFEKDNN 235
+K S EY +L + + ++ SIKDFL K++G L L + Y + KDNN
Sbjct: 205 EIFKASAEYKILEKMPQTTIQVDGSEKKIVSIKDFLGSENKRTGALGNLKNSYSYNKDNN 264

Query: 236 PVGNFATTVSDRSRPLNDKVNEKTTLLNDTSSRYNSAVEALNRFIQKYDSVLRDIL 291
+ +FATT SD+SRPLND V++KTT L+D +SR+NSA+EALNRFIQKYDSV++ +L
Sbjct: 265 ELSHFATTCSDKSRPLNDLVSQKTTQLSDITSRFNSAIEALNRFIQKYDSVMQRLL 320


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_03447  PF07201  284  4e-98  Hypersensitivity response secretion protein HrpJ
		>PF07201#Hypersensitivity response secretion protein HrpJ

Length = 293

Score = 284 bits (727), Expect = 4e-98
Identities = 134/294 (45%), Positives = 181/294 (61%), Gaps = 7/294 (2%)

Query: 1 MDILQSSSAAPLA-----PREAANAPAQQAGGSFQGERVHYVSVS-QSLADAAEELTFAF 54
M L + S P A++ Q G F+GE V VS + QS+AD AEE+TF F
Sbjct: 1 MTTLHNLSYGNTPLHNERPEIASSQIVNQTLGQFRGESVQIVSGTLQSIADMAEEVTFVF 60

Query: 55 SERAEKSLAKRRLSDAHARLSEVQAMLQEYWKRIPDLESQQKLEALIAHLGSGQLSSLAQ 114
SER E SL KR+LSD+ AR+S+V+ + +Y ++P+LE +Q + L++ L + SL+Q
Sbjct: 61 SERKELSLDKRKLSDSQARVSDVEEQVNQYLSKVPELEQKQNVSELLSLLSNSPNISLSQ 120

Query: 115 LSAYLEGFSSEISQRFLALSRARDVLAGRPEARAMLALVDQALLRMADEQGLEIELGLRI 174
L AYLEG S E S++F L RD L GRPE + LV+QAL+ MA+EQG I LG RI
Sbjct: 121 LKAYLEGKSEEPSEQFKMLCGLRDALKGRPELAHLSHLVEQALVSMAEEQGETIVLGARI 180

Query: 175 EPLAAEASAAGVGDIQALRDTYRDAVLDYRGLSAAWQDIQARFAATPLERVVAFLQKALS 234
P A S +GV +Q LRDTYRDAV+ Y+G+ A W D+Q RF ++ V+ FLQKALS
Sbjct: 181 TPEAYRESQSGVNPLQPLRDTYRDAVMGYQGIYAIWSDLQKRFPNGDIDSVILFLQKALS 240

Query: 235 ADLDSQSSRLDPVKLERVMSDMHKLRVLGGLAEQVGALWQVLVTGERGHGIRAF 288
ADL SQ S KL V+SD+ KL+ G +++QV WQ G + +G+R F
Sbjct: 241 ADLQSQQSGSGREKLGIVISDLQKLKEFGSVSDQVKGFWQFFSEG-KTNGVRPF 293


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_03450  IGASERPTASE  43  2e-06  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 43.1 bits (101), Expect = 2e-06
Identities = 35/188 (18%), Positives = 58/188 (30%), Gaps = 18/188 (9%)

Query: 45 PPFDKGDETTEAEEPAATADAPTSTPLADQPAAPAADRPPTTRQAPVPVAADATPTPTPT 104
P +K ++T + + P A R +APVP A ATP+ T
Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIA-RV---DEAPVPPPAPATPSETTE 1038

Query: 105 PTPTPTPTPTPTVSPSGSVARQAPAVSARVAASTQAREPASVSAPPVDEPPLVPVSSHPQ 164
+ + TV + A + A + VA ++ A+ V + +
Sbjct: 1039 TVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTT 1098

Query: 165 IAGRTHERPQPGPGFPAKAAAEVAPTAQASAQASPPAPTAGGEGRGEERRQPGETDPSAL 224
T + KA E T + S +P ++ Q P A
Sbjct: 1099 ETKETATVEKEE-----KAKVETEKTQEVPKVTSQVSP---------KQEQSETVQPQAE 1144

Query: 225 PPDDQAPV 232
P + P
Sbjct: 1145 PARENDPT 1152


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_03451  TYPE3OMOPROT  84  2e-20  Type III secretion system outer membrane O protein ...
		>TYPE3OMOPROT#Type III secretion system outer membrane O protein

family signature.
Length = 303

Score = 83.5 bits (206), Expect = 2e-20
Identities = 46/177 (25%), Positives = 73/177 (41%), Gaps = 14/177 (7%)

Query: 130 RLALWLDGDPATLLARLPPRPSAQRLAIPLRLSLQWPGLPLDASELRTLEPGDLLLLPAG 189
R LW + P L A RP R + + L L + GD+LL+
Sbjct: 126 RGGLWFEHLPE-LPAVGGGRPKMLRWPLRFVIGSSDTQRSL----LGRIGIGDVLLIRTS 180

Query: 190 HRPDAALLGVLEGRPWARCQLHSTQL-ELLDMH----DTPSLADGEDLHELDQLPIPVSF 244
A + + ++ + E LD+ + + E L L+QLP+ + F
Sbjct: 181 R----AEVYCYAKKLGHFNRVEGGIIVETLDIQHIEEENNTTETAETLPGLNQLPVKLEF 236

Query: 245 EVGRRTLDLHTLSTLQPGSLLDLDSALDGEVRILANQRCLGIGELVRLQDRLGVRVT 301
+ R+ + L L + LL L + + V I+AN LG GELV++ D LGV +
Sbjct: 237 VLYRKNVTLAELEAMGQQQLLSLPTNAELNVEIMANGVLLGNGELVQMNDTLGVEIH 293


47  PAKAF_03560  PAKAF_03570  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias     %GCBias  Product
PAKAF_03560       2       24   -3.290741  succinyl-CoA synthetase alpha chain
PAKAF_03561       2       24   -3.083872  succinyl-CoA synthetase beta chain
PAKAF_03563       3       23   -3.116489  lipoamide dehydrogenase-glc
PAKAF_03564       2       22   -3.540186  dihydrolipoamide succinyltransferase (E2
PAKAF_03565       2       19   -4.276840  2-oxoglutarate dehydrogenase (E1 subunit)
PAKAF_03567       2       17   -4.550252  succinate dehydrogenase (B subunit)
PAKAF_03568       1       17   -4.079617  succinate dehydrogenase (A subunit)
PAKAF_03569      -1       13   -3.310071  succinate dehydrogenase (D subunit)
PAKAF_03570      -2       10   -3.111943  succinate dehydrogenase (C subunit)
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_03563  ABC2TRNSPORT  30  0.024  ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein

signature.
Length = 262

Score = 29.5 bits (66), Expect = 0.024
Identities = 15/51 (29%), Positives = 23/51 (45%), Gaps = 2/51 (3%)

Query: 317 IGDVVRGAMLAHKASEEGVMVAERIAGHKAQMNYDLIPSVIYTHPEIAWVG 367
+GD+V G M A+ + + I A + Y S++Y P IA G
Sbjct: 110 LGDIVLGEMAW--AATKAALAGAGIGVVAAALGYTQWLSLLYALPVIALTG 158


48  PAKAF_03593  PAKAF_03598  Y  N  N  Genomic Island
LocusTag     DNBias  CDNBias     %GCBias  Product
PAKAF_03593       4       15   -2.756528  hypothetical protein
PAKAF_03594       5       18   -3.887680  Cytochrome c oxidase, cbb3-type, CcoN subunit
PAKAF_03595       3       18   -4.566329  Cytochrome c oxidase, cbb3-type, CcoO subunit
PAKAF_03596       3       19   -4.039508  Cytochrome c oxidase, cbb3-type, CcoQ subunit
PAKAF_03597       2       17   -3.763768  Cytochrome c oxidase, cbb3-type, CcoP subunit
PAKAF_03598       2       16   -3.728082  Cytochrome c oxidase, cbb3-type, CcoN subunit
49  PAKAF_03619  PAKAF_03628  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias     %GCBias  Product
PAKAF_03619      -1       26   -4.050535  recombination protein RecR
PAKAF_03620       0       29   -5.551919  YbaB/EbfC family nucleoid-associated protein
PAKAF_03621       0       32   -5.843071  DNA polymerase subunits gamma and tau
PAKAF_03622       2       50  -11.289812  DNA polymerase subunits gamma and tau
PAKAF_03623       4       67  -15.368681  integrase family protein,Site-specific
PAKAF_03624       3       66  -15.586170  hypothetical protein
PAKAF_03625      -2       34   -8.210802  hypothetical protein
PAKAF_03626      -2       28   -6.527054  transposon resolvase,Putative DNA-invertase from
PAKAF_03627       0       18   -3.965892  hypothetical protein
PAKAF_03628       0       15   -3.175885  ATPase domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_03621  PF03544  40  1e-05  Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 40.0 bits (93), Expect = 1e-05
Identities = 27/126 (21%), Positives = 36/126 (28%), Gaps = 5/126 (3%)

Query: 366 APRTPLKDLGISKATTDP---ANSPVAGAASPAPVAAVAPAPVVAAPVEAPAAPPAAPSA 422
AP P+ ++ A +P P P P P P APV P
Sbjct: 45 APAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPK 104

Query: 423 PPAA--VEARVAEAVVEEPAAAAEVVDLPWEEPAPSLAAEPEPEPEPEPEPEPEPLAVEA 480
P VE + E A+ + P S A +P P L+
Sbjct: 105 PKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQ 164

Query: 481 PSVPPA 486
P P
Sbjct: 165 PQYPAR 170



Score = 34.6 bits (79), Expect = 0.001
Identities = 30/123 (24%), Positives = 40/123 (32%), Gaps = 23/123 (18%)

Query: 399 AVAPAPVVAAPVEAPAAPPAAPSAPPAAVEARVAEAVVEEPAAAAEVVDLPWEEPAPSLA 458
AV + + + P AP+ P + VA A +E P A
Sbjct: 27 AVVAGLLYTSVHQVIELP--APAQPISVT--MVAPADLEPP-------------QAVQPP 69

Query: 459 AEPEPEPEPEPEPEPEPLAVEAPSVPPAVAVEAVVETVLEALPAALPVAPDEQDEQDDEP 518
EP EPEPEPEP PEP + V ++D + E
Sbjct: 70 PEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPK------PKPKPVKKVEQPKRDVKPVES 123

Query: 519 PPA 521
PA
Sbjct: 124 RPA 126



Score = 33.8 bits (77), Expect = 0.002
Identities = 27/117 (23%), Positives = 38/117 (32%), Gaps = 5/117 (4%)

Query: 392 ASPAPVAAVAPAPVVAAP---VEAPAAPPAAPSAPPAAVEARVAEA-VVEEPAAAAEVVD 447
A P++ AP P V+ P P P P + EA VV E
Sbjct: 45 APAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPK 104

Query: 448 LPWEEPAPSLAAEPEPEPEPEPEPEPEPLAVEAPSVPPAVAVEAVVETVLEALPAAL 504
+ + +P E P E A P+ A A + T + + P AL
Sbjct: 105 PKPVKKVEQPKRDVKP-VESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRAL 160



Score = 32.3 bits (73), Expect = 0.004
Identities = 23/112 (20%), Positives = 31/112 (27%), Gaps = 13/112 (11%)

Query: 391 AASPAPVAAVAPAPVVAAPVEAPA-APPAAPSAPPAAV--EARVAEAVVEEPAAAAEVVD 447
PAP P+ V PP A PP V E + E P A V+
Sbjct: 41 IELPAP-----AQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVI- 94

Query: 448 LPWEEPAPSLAAEPEPEPEPEPEPEPEPLAVEAPSVPPAVAVEAVVETVLEA 499
E + +P+ + VE+ P T A
Sbjct: 95 ----EKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTA 142


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_03623  SURFACELAYER  30  0.020  Lactobacillus surface layer protein signature.
		>SURFACELAYER#Lactobacillus surface layer protein signature.

Length = 439

Score = 30.0 bits (67), Expect = 0.020
Identities = 15/46 (32%), Positives = 19/46 (41%), Gaps = 7/46 (15%)

Query: 81 SITAASRTAPVTTAKPKGEDPTLAMLSK-------LYIEEGKRGGT 119
+ S T PVT P DP + SK Y ++ KR GT
Sbjct: 286 NKNGKSATLPVTVTVPNVADPVVPSQSKTIMHNAYFYDKDAKRVGT 331


50  PAKAF_03646  PAKAF_03654  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias     %GCBias  Product
PAKAF_03646       1       14   -3.354320  allantoicase
PAKAF_03647       0       24   -5.670359  ureidoglycolate hydrolaseYbbT
PAKAF_03648       1       25   -5.646221  urate hydroxylase PuuD
PAKAF_03649       1       23   -5.235506  secreted protein Hcp
PAKAF_03650       0       25   -5.143033  VgrG2a
PAKAF_03651       2       28   -6.043487  Tle4
PAKAF_03652       0       16   -3.941369  Tli4
PAKAF_03653       3        8   -0.089256  PAAR domain-containing protein
PAKAF_03654       2       12    0.106547  probable transporter
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_03653  OMADHESIN  25  0.048  Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 24.9 bits (53), Expect = 0.048
Identities = 20/72 (27%), Positives = 26/72 (36%), Gaps = 9/72 (12%)

Query: 22 QTDLNGKPMAGVGHQVVCP---------LCKGTFPITEGSALLDVNGVPVALHGMKTACG 72
Q N P G+ + V P KG I G+ G VA+ A G
Sbjct: 38 QISPNADPALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATG 97

Query: 73 ASLIASGPLGAA 84
+ +A GPL A
Sbjct: 98 VNSVAIGPLSKA 109


51  PAKAF_03682  PAKAF_03688  Y  N  N  Genomic Island
LocusTag     DNBias  CDNBias     %GCBias  Product
PAKAF_03682       2       10    0.559605  cytochrome C-type biogenesis protein CcmF
PAKAF_03683       7       11    0.502616  cytochrome C-type biogenesis protein CcmE
PAKAF_03684       6       10    0.849572  heme exporter protein CcmD
PAKAF_03685       6       10    0.637000  heme exporter protein CcmC
PAKAF_03686       8       11    1.259038  heme exporter protein CcmB
PAKAF_03687       5       10    1.828808  heme exporter protein CcmA
PAKAF_03688       2        9    1.700921  flagellar hook-length control protein FliK
52  PAKAF_03709  PAKAF_03715  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias     %GCBias  Product
PAKAF_03709       2       17    0.709487  flagellar biosynthesis protein FlhF
PAKAF_03710       2       17    0.268645  flagellar biosynthesis protein FlhA
PAKAF_03711       3       20    0.340276  YgcG family protein
PAKAF_03712       4       19    0.195674  Beta-propeller domains of methanol dehydrogenase
PAKAF_03713       6       19   -0.542127  flagellar biosynthetic protein FlhB
PAKAF_03714       5       19   -1.250198  flagellar biosynthetic protein FliR
PAKAF_03715       3       17   -1.720443  flagellar biosynthetic protein FliQ
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_03711  cloacin  30  0.021  Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 30.1 bits (67), Expect = 0.021
Identities = 15/48 (31%), Positives = 20/48 (41%)

Query: 398 SAGGSGGGRRRGGDYASSSGSSSSSSSSSSSDSFSGGGGSSGGGGASG 445
+ G GGG G ++S + S S G G+ GG G SG
Sbjct: 25 TGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSG 72


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_03712  cloacin  36  2e-04  Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 36.2 bits (83), Expect = 2e-04
Identities = 22/55 (40%), Positives = 24/55 (43%), Gaps = 9/55 (16%)

Query: 373 GQVRLSGGGGGSSGSS--------GGGSSSSSSSSSGGFSGGGGSSG-GGGASGS 418
G L GGG S GS GGGS S G G GG +G GG SG+
Sbjct: 23 GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGT 77



Score = 35.8 bits (82), Expect = 3e-04
Identities = 15/37 (40%), Positives = 19/37 (51%)

Query: 380 GGGGSSGSSGGGSSSSSSSSSGGFSGGGGSSGGGGAS 416
GGG SG GG S + G SGGG +GG ++
Sbjct: 47 GGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSA 83



Score = 33.5 bits (76), Expect = 0.001
Identities = 15/40 (37%), Positives = 19/40 (47%)

Query: 379 GGGGGSSGSSGGGSSSSSSSSSGGFSGGGGSSGGGGASGS 418
GGG GS GGGS + +G GG G+ G A +
Sbjct: 47 GGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 32.4 bits (73), Expect = 0.004
Identities = 16/40 (40%), Positives = 16/40 (40%)

Query: 380 GGGGSSGSSGGGSSSSSSSSSGGFSGGGGSSGGGGASGSW 419
GG G GG S S SS GGG SG GS
Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61



Score = 30.1 bits (67), Expect = 0.017
Identities = 12/36 (33%), Positives = 17/36 (47%)

Query: 382 GGSSGSSGGGSSSSSSSSSGGFSGGGGSSGGGGASG 417
GG G+ S+S + +GG +G G G SG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSG 38



Score = 30.1 bits (67), Expect = 0.021
Identities = 13/40 (32%), Positives = 18/40 (45%)

Query: 378 SGGGGGSSGSSGGGSSSSSSSSSGGFSGGGGSSGGGGASG 417
GG G S G SG G + S+ ++ F S+ G G
Sbjct: 65 GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 29.3 bits (65), Expect = 0.029
Identities = 13/38 (34%), Positives = 17/38 (44%), Gaps = 3/38 (7%)

Query: 385 SGSSGGGSSSSSSSSSG---GFSGGGGSSGGGGASGSW 419
SG G G ++ + S+SG G G G GG W
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGW 39


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_03713  TYPE3IMSPROT  336  e-116  Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein

family signature.
Length = 354

Score = 336 bits (864), Expect = e-116
Identities = 98/345 (28%), Positives = 183/345 (53%), Gaps = 2/345 (0%)

Query: 9 DKTEEPTEKRRREAREKGQLPRSRELNTLAILMAGAGGLLIYGADLAGALLRLMRSNFEL 68
+KTE+PT K+ R+AR+KGQ+ +S+E+ + A+++A + L+ +LM E
Sbjct: 4 EKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAEQ 63

Query: 69 SRETAMNTESMLQLLGASAYLAAQGLWPILLMLLVAAIVGPIALGGWLFSMDALQPKFSR 128
S ++++ ++ +P+L + + AI + G+L S +A++P +
Sbjct: 64 SYLPF--SQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKK 121

Query: 129 LNPLSGLKRMFSAKSLLELSKALIKFLVVLAVALLVLSADRDALLALAHQPLEQAILHSV 188
+NP+ G KR+FS KSL+E K+++K +++ + +++ + LL L +E
Sbjct: 122 INPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLG 181

Query: 189 RVVGWSAFWMACSLLLIAAVDVPYQIWDNRQKLLMTKQEVRDEYKDSEGKPEVKSKIRQM 248
+++ ++I+ D ++ + ++L M+K E++ EYK+ EG PE+KSK RQ
Sbjct: 182 QILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQF 241

Query: 249 QREMAQRRMMAAVPEADVVITNPTHFAVALKYDPAGGGAPLLLAKGNDFLALKIREVAQE 308
+E+ R M V + VV+ NPTH A+ + Y PL+ K D +R++A+E
Sbjct: 242 HQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEE 301

Query: 309 HKVMVMESPALARAVYYSTELDQEIPAGLYLAVAQVLAYVYQLKQ 353
V +++ LARA+Y+ +D IPA A A+VL ++ +
Sbjct: 302 EGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNI 346


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_03714  TYPE3IMRPROT  135  7e-41  Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein

family signature.
Length = 261

Score = 135 bits (341), Expect = 7e-41
Identities = 96/232 (41%), Positives = 143/232 (61%), Gaps = 2/232 (0%)

Query: 1 MLELTNAQIGGWIASFVLPLFRVAALLMTMPVIGTQLVPVRVRLYLALGVCVVLVPNLPP 60
ML++T+ Q W+ + PL RV AL+ T P++ + VP RV+L LA+ + + P+LP
Sbjct: 1 MLQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60

Query: 61 MPQVDALSMKAMLLIGEQILVGALLGFSLQLLFHAFVIAGQIISMQMGLGFASMVDPANG 120
S A+ L +QIL+G LGF++Q F A AG+II +QMGL FA+ VDPA+
Sbjct: 61 NDVPV-FSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASH 119

Query: 121 VSVPVLGQFFTMLVTLLFLAMNGHLVVFEVIAESFVTLPVGEGLSGNHFWI-IAGKLGWV 179
+++PVL + ML LLFL NGHL + ++ ++F TLP+G ++ ++ + +
Sbjct: 120 LNMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLI 179

Query: 180 MGAALLLALPAITALLVVNLAFGAMTRAAPQLNIFSIGFPLTLVLGLVILWI 231
L+LALP IT LL +NLA G + R APQL+IF IGFPLTL +G+ ++
Sbjct: 180 FLNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAA 231


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_03715  TYPE3IMQPROT  55  9e-14  Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein

family signature.
Length = 86

Score = 54.8 bits (132), Expect = 9e-14
Identities = 24/75 (32%), Positives = 43/75 (57%)

Query: 7 LDLFREALWLTAMIVGVLVVPSLLVGLVVAMFQAATQINEQTLSFLPRLMVILLTLIVLG 66
+ +AL+L ++ G + + ++GL+V +FQ TQ+ EQTL F +L+ + L L +L
Sbjct: 5 VFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLLS 64

Query: 67 PWLLRQLMEYTQTLI 81
W L+ Y + +I
Sbjct: 65 GWYGEVLLSYGRQVI 79


53  PAKAF_03754  PAKAF_03784  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias     %GCBias  Product
PAKAF_03754       2       10    2.161511  mechanosensitive ion channel family protein
PAKAF_03755       3        9    2.002986  methyltransferase
PAKAF_03756       3       10    1.737702  nucleotidyltransferase domain-containing
PAKAF_03757       0       11    2.778428  probable helicase
PAKAF_03758       0       11    3.166557  hypothetical protein
PAKAF_03759       0       11    3.215648  probable transcriptional regulator
PAKAF_03760      -1       13    0.889798  O-methyltransferase
PAKAF_03761       0       18   -0.605352  hypothetical protein
PAKAF_03762       3       23   -2.075402  probable pyruvate carboxylase
PAKAF_03763       5       34   -5.163788  probable transcriptional regulator
PAKAF_03764       5       37   -6.491777  hypothetical protein
PAKAF_03765       4       36   -6.133211  lipase family protein,Predicted lipase,Lipase
PAKAF_03766       2       26   -4.626588  lipoprotein
PAKAF_03767       1       22   -3.702700  lipoprotein
PAKAF_03768       0       19   -1.856658  hypothetical protein
PAKAF_03769      -2       19   -2.935009  Rhs element Vgr protein,Uncharacterized protein
PAKAF_03770       0       24   -2.642581  probable two-component response regulator
PAKAF_03771      -2       29   -3.031694  probable two-component sensor
PAKAF_03772       0       41   -5.005691  hypothetical protein
PAKAF_05930       0       43   -5.059869  hypothetical protein
PAKAF_03773       1       43   -5.791949  adenosine 5'-phosphosulfate (APS) kinase
PAKAF_03774       0       43   -5.784065  sulfotransferase family protein
PAKAF_03775       1       45   -7.033558  probable glycosyl transferase
PAKAF_03776       1       52   -9.286273  probable glycosyl transferase
PAKAF_03777       2       54   -9.837785  probable glycosyl transferase
PAKAF_03778       2       51  -10.350570  glycosyltransferase family 2 protein
PAKAF_03779       2       49   -9.833718  hypothetical protein
PAKAF_03780       2       50  -10.059715  probable ATP-binding component of ABC
PAKAF_03781       1       44   -8.013351  probable glycosyl transferase
PAKAF_03782       0       33   -5.786452  UDP-glucose 4-epimerase
PAKAF_03783       1       27   -4.930723  hypothetical protein
PAKAF_03784      -2       24   -3.177540  probable type II secretion system protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03754  RTXTOXIND     41     2e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 40.6 bits (95), Expect = 2e-05
Identities = 26/201 (12%), Positives = 64/201 (31%), Gaps = 13/201 (6%)

Query: 3 RASLHMLLCQFAMALGLLLSLGSEAWAARPAPQAAVDLEAPPALAEDASLDQLNAQLDLI 62
A A + + + L P ++ S +++ LI
Sbjct: 133 EADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYF-QNVSEEEVLRLTSLI 191

Query: 63 RQRVTADASDDLLAELRQSALQVQRQ-ADALLALRVADIERLDDQLKVIGPPQPDEAESL 121
+++ + + EL + +R A + +L + SL
Sbjct: 192 KEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLD--------DFSSL 243

Query: 122 AAQRQALTRQKNALLDDERQATQLGQSSRDLAAQIVNLRRSLFNSQISSRAATPFSPSFW 181
++ K+A+L+ E + + R +Q+ + + +++ + T +
Sbjct: 244 LHKQAI---AKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEI 300

Query: 182 STLIRPTDDDLRRLDKLKAEA 202
+R T D++ L A+
Sbjct: 301 LDKLRQTTDNIGLLTLELAKN 321


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03759  HTHTETR       58     1e-12    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 57.7 bits (139), Expect = 1e-12
Identities = 24/148 (16%), Positives = 53/148 (35%)

Query: 14 QPQQARSSELVASILEAAVQVLASEGAQRFTTARVAERAGVSIGSLYQYFPNKAAILFRL 73
+ + + E IL+ A+++ + +G + +A+ AGV+ G++Y +F +K+ + +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 74 QSDEWRRTTRLLGEILEDTTRPPLERLRRLVLAFVRSECEEAAIRVALSDAAPLYRDADE 133
L E PL LR +++ + S E R+ + E
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 134 AREVKAEGARVFQAFLREALPEVAEAER 161
V+ + +
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIE 150


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03762  RTXTOXIND     36     8e-04    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 36.0 bits (83), Expect = 8e-04
Identities = 32/195 (16%), Positives = 58/195 (29%), Gaps = 23/195 (11%)

Query: 368 RNLLLHPAVQANRVDTRFVESHLETLLAPIPASHPRLRAECPLA---------------- 411
R L P + + F+ +HLE + P+ PRL A +
Sbjct: 26 RKQLDTPVREK--DENEFLPAHLELIETPVSR-RPRLVAYFIMGFLVIAFILSVLGQVEI 82

Query: 412 EDTAPARVEAPLGSLPLSAPSSGVLVALEVSEGERVRAGQRVAILEAMKMEFEVKAPGGG 471
TA ++ S + + ++ + V EGE VR G + L A+ E +
Sbjct: 83 VATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLK---- 138

Query: 472 IVRRLAASLGEPLEEGATLLFLEPTEDDDEQAPTEQALDLAHIRADLAEVLERQAALGDE 531
L + E +E + + + P E L +
Sbjct: 139 TQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTW 198

Query: 532 RRPQALARRRKTGQR 546
+ + +R
Sbjct: 199 QNQKYQKELNLDKKR 213


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03769  ICENUCLEATIN  30     0.040    Ice nucleation protein signature.
		>ICENUCLEATIN#Ice nucleation protein signature.

Length = 1258

Score = 30.1 bits (67), Expect = 0.040
Identities = 27/92 (29%), Positives = 41/92 (44%), Gaps = 2/92 (2%)

Query: 523 RISRDSRSLVENDRFEQVNMNSSSLIKGDELHTTQGERHTRIGGNELLSISGAGSIAVDG 582
+I+ SL+ Q+ N S LI G T G R T I G + + ++G + G
Sbjct: 1080 QIASHRSSLIAGPESTQITGNRSMLIAGKGSSQTAGYRSTLISGADSVQMAGERGKLIAG 1139

Query: 583 TWVVQ-AGSQARVTA-TNVLVDAGVNLTLKAG 612
Q AG ++++ A N + AG L AG
Sbjct: 1140 ADSTQTAGDRSKLLAGNNSYLTAGDRSKLTAG 1171


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03770  HTHFIS        58     5e-12    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 57.5 bits (139), Expect = 5e-12
Identities = 34/157 (21%), Positives = 58/157 (36%), Gaps = 5/157 (3%)

Query: 3 GRIIVADDHPLFREGMLSILQRLLPEARIEEAGDLAGVLRLAGEGEQPDSLILDLRFPGL 62
I+VADD R + L R + + A + R G D ++ D+ P
Sbjct: 4 ATILVADDDAAIRTVLNQALSRA--GYDVRITSNAATLWRWIAAG-DGDLVVTDVVMPDE 60

Query: 63 TRIEMLADLRRRFPRTTLIVVSMVDDPQLIGEVMNAGADGFLGKSIAPEELGQAILAIRA 122
++L +++ P ++V+S + + GA +L K EL I RA
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG--RA 118

Query: 123 GEVLVRYEPSGLLPLQPSPRLEGLTERQLDVLRLLAQ 159
R Q L G + ++ R+LA+
Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLAR 155


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03771  HTHFIS        50     2e-08    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 49.8 bits (119), Expect = 2e-08
Identities = 31/124 (25%), Positives = 51/124 (41%), Gaps = 9/124 (7%)

Query: 416 LTGLRVCLVEDDRNVLRATSALLERWGCTVQ-AETEADGWRTDC----DILVVDYDLGPH 470
+TG + + +DD + + L R G V+ A WR D++V D + P
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVM-PD 59

Query: 471 ASGVECIERVRRQRGEAIPALVISGH-DIERIQASVEDTDIALLSKPVRPTELRATL-RA 528
+ + + R+++ +P LV+S + E L KP TEL + RA
Sbjct: 60 ENAFDLLPRIKK-ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 529 LRER 532
L E
Sbjct: 119 LAEP 122


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03782  NUCEPIMERASE  181    1e-56    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 181 bits (460), Expect = 1e-56
Identities = 85/353 (24%), Positives = 142/353 (40%), Gaps = 51/353 (14%)

Query: 1 MRVLVTGGAGFIGSHVLVELLGQGAKVVVLDNLVNGSSESLK--RVERITGHPVGFVLGD 58
M+ LVTG AGFIG HV LL G +VV +DNL + SLK R+E + F D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 59 VRDNLLVERLLIGEKVDAVIHLAGLKAVGESVDDPLEYYESNVQGTISLLRAMQRVGVFK 118
+ D + L + V AV S+++P Y +SN+ G +++L + +
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 119 IVFSSSATIYQMPGTLPISESSKVGGVASPYGRTKLTAEHM------LDDLARSDARWSI 172
++++SS+++Y + +P S V S Y TK E M L L
Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGL-------PA 173

Query: 173 AVLRYFNPIGAHESGLIGEDPCGTPNNLLPYIAQVAVGRLSRLTVHGGDYPTI--DGTGV 230
LR+F G P G P+ +A+ + ++ + G + G
Sbjct: 174 TGLRFFTVYG----------PWGRPD--------MALFKFTKAMLEGKS-IDVYNYGKMK 214

Query: 231 RDYIHVCDLAAGHTRALEYLGQGHG---------------YHVWNLGTGTGYSVLQVIEA 275
RD+ ++ D+A R + + Y V+N+G + ++ I+A
Sbjct: 215 RDFTYIDDIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQA 274

Query: 276 FERVSGRRIPFTVSGRRPGDVAECWADVSKAERELGWKAGLGLECMIADAWRW 328
E G + +PGDV E AD +G+ ++ + + W
Sbjct: 275 LEDALGIEAKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNW 327


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03784  BCTERIALGSPD  198    1e-56    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 198 bits (506), Expect = 1e-56
Identities = 128/668 (19%), Positives = 250/668 (37%), Gaps = 109/668 (16%)

Query: 104 SLNVEDVQLAAFINEVFGNILGLPFEIESALKEKTDRVTVRLEQPQTAQMVYEVARQVLV 163
S + + + FIN V L I+ +++ +TVR + Y+ VL
Sbjct: 31 SASFKGTDIQEFINTV-SKNLNKTVIIDPSVRGT---ITVRSYDMLNEEQYYQFFLSVLD 86

Query: 164 NYGVEILHQGDIYRFQIKQVGLSPDEPPILISGEARPSVPIAYRPVFQFVALHSVDPKDV 223
YG +++ + ++ + ++ +A P I V + V L +V +D+
Sbjct: 87 VYGFAVINMNNGVLKVVRSKDAKTA--AVPVASDAAPG--IGDEVVTRVVPLTNVAARDL 142

Query: 224 IPWLN--SAYEKSGLSVMADGARSGLMLKGMSSIVNQATEAVRLLDQPFMRGRHSLRIDP 281
P L + G V + + L++ G ++++ + V +D G S+ P
Sbjct: 143 APLLRQLNDNAGVGSVVHYEPSNV-LLMTGRAAVIKRLLTIVERVDNA---GDRSVVTVP 198

Query: 282 -AFVSAADMASQLKSVIAAQGYSVGIGEAVGSIMLVPLESSNGLIVFANDGLLLDLVREW 340
++ SAAD+ + + S G V +++ E +N ++V ++
Sbjct: 199 LSWASAADVVKLVTELNKDTSKSALPGSMVANVV--ADERTNAVLVSGEPNSRQRIIAM- 255

Query: 341 AQQVDRAPMAVAAGIGEEKEGLFFYEARNTRVTELAKSLRALVSGFAGEGAYGITSGLQS 400
+Q+DR NT+V L + A + +T +
Sbjct: 256 IKQLDRQQATQG----------------NTKVIYLKYAK-------ASDLVEVLTGISST 292

Query: 401 SASKRSGGGRRAGEDGAAPAVAPLLQAAGAAALVGGDSANGLLGGLAAGISGSGTIVEDE 460
S++ A D + I
Sbjct: 293 MQSEKQAAKPVAALDK------------------------------------NIIIKAHG 316

Query: 461 NRNAILFRGAARTWQQMQGLLREMDKPARQVLIEVTVASVSLSDTQELGVEWEMLNGSFN 520
NA++ A ++ ++ ++D QVL+E +A V +D LG++W N
Sbjct: 317 QTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWANKNAGMT 376

Query: 521 SATSTGSK-GSAGKGGFNYVINT--------------------AGGNTAA-IQAMADNQR 558
T++G +A G Y + GN A + A++ + +
Sbjct: 377 QFTNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQGNWAMLLTALSSSTK 436

Query: 559 VRVLATPRILVKSGEQANINVGRDIPIPTAQVNDDSTTAGSTNLRNEIAYRSTGTILNVA 618
+LATP I+ +A NVG+++P+ T S T N+ N + ++ G L V
Sbjct: 437 NDILATPSIVTLDNMEATFNVGQEVPVLTG-----SQTTSGDNIFNTVERKTVGIKLKVK 491

Query: 619 PVVYSDSRVDLTVSQELSDSGGSSGGGGKASGGGISAPEISRTSLETSLTLKSGGSVLMG 678
P + V L + QE+S ++ G + ++ ++ + SG +V++G
Sbjct: 492 PQINEGDSVLLEIEQEVSSVADAASSTSSDLG-----ATFNTRTVNNAVLVGSGETVVVG 546

Query: 679 GLIRDNITDSNAGVPLLKDIPGIGFLFGRQKAVKTREEVIMLIQPYVLESDADAREVTEK 738
GL+ +++D+ VPLL DIP IG LF ++ +++ I+P V+ + R+ +
Sbjct: 547 GLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYRQASSG 606

Query: 739 LRAMLSKT 746
+
Sbjct: 607 QYTAFNDA 614


54  PAKAF_03844  PAKAF_03854  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_03844  2       11       -0.427285   SgcJ/EcaC family oxidoreductase
PAKAF_03846  2       10       -1.008310   hypothetical protein
PAKAF_03847  2       12       -1.943643   DUF883 domain-containing protein
PAKAF_03848  2       12       -2.543434   probable TonB-dependent receptor
PAKAF_03849  3       12       -1.910200   cytochrome o ubiquinol oxidase protein CyoE
PAKAF_03850  2       11       -1.920067   cytochrome o ubiquinol oxidase subunit IV
PAKAF_03851  1       11       -1.646848   cytochrome o ubiquinol oxidase subunit III
PAKAF_03852  2       10       -0.623448   cytochrome o ubiquinol oxidase subunit I
PAKAF_03853  2       11       1.152701    cytochrome o ubiquinol oxidase subunit II
PAKAF_03854  2       12       2.119282    probable major facilitator superfamily (MFS)
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03848  ENTEROVIROMP  29     0.034    Enterobacterial virulence outer membrane protein si...
		>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature.

Length = 171

Score = 29.5 bits (66), Expect = 0.034
Identities = 17/42 (40%), Positives = 23/42 (54%)

Query: 466 GTSRSTPSGKPTVRADSSDGKLSTRAGLVFKPLENGRVYFSY 507
G + + PT + D+SD S AGL F P+EN + FSY
Sbjct: 109 GYGKFQTTEYPTYKHDTSDYGFSYGAGLQFNPMENVALDFSY 150


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03854  TCRTETB       108    2e-27    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 108 bits (270), Expect = 2e-27
Identities = 82/400 (20%), Positives = 164/400 (41%), Gaps = 15/400 (3%)

Query: 17 TFIASLDISIVNLALPTLQYALDTDLAGLQWVVDAYALCLSAFMLSSGPLSDRYGRKLTW 76
+F + L+ ++N++LP + + A WV A+ L S G LSD+ G K
Sbjct: 23 SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLL 82

Query: 77 LLGVGLFSFGSLLCALATS-LPLLLFGRAVQGIAGALLIPGALSILTQAFHDPGQRAQVI 135
L G+ + FGS++ + S LL+ R +QG GA P + ++ + R +
Sbjct: 83 LFGIIINCFGSVIGFVGHSFFSLLIMARFIQGA-GAAAFPALVMVVVARYIPKENRGKAF 141

Query: 136 GGWTSFSALSLILGPLLGGLLVEHAGWQSIFLINLPLGLLALALGLWGIEETAHPEHAAF 195
G S A+ +GP +GG++ + W + LI + + L +E H F
Sbjct: 142 GLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGH--F 199

Query: 196 DPLGQLLSVVWLGALTYALIAAGEGGWLSPTAWPALLLAGVGLLGFLFVERRTARPLLPL 255
D G +L V + + + L+++ + L F+ R+ P +
Sbjct: 200 DIKGIILMSVGIVFFMLFTTSY---------SISFLIVSVLSFLIFVKHIRKVTDPFVDP 250

Query: 256 GLFRQAGFAVCNLASFVLGFSGYASLFFLSLFFQQVQGASAQQAGF-YLAPQFLAMGALS 314
GL + F + L ++ + + + + V S + G + P +++
Sbjct: 251 GLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFG 310

Query: 315 MLFGRLQRHVPLRRLLVLGYLVIGLAMLALAACGTGTAYPWVGLLLVALGLGMGLAVPGT 374
+ G L +L +G + ++ L + T++ ++ +++V + G+
Sbjct: 311 YIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSW-FMTIIIVFVLGGLSFTKTVI 369

Query: 375 GLAVMASVARERSGMASATMNTLRQAGMAVGIALLGALLS 414
V +S+ ++ +G + +N GIA++G LLS
Sbjct: 370 STIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


55  PAKAF_03871  PAKAF_03917  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_03871  3       9        0.028598    YcgN family cysteine cluster protein
PAKAF_03872  2       8        0.181810    Transcriptional repressor RcnR
PAKAF_03873  0       9        0.751329    probable metal transporter
PAKAF_03874  -2      11       0.625556    probable 2-hydroxyacid dehydrogenase
PAKAF_03875  -1      12       0.135535    conserved hypothetical protein
PAKAF_03876  -1      10       0.775375    ribonuclease D
PAKAF_03877  1       12       -0.283329   SMP-30/gluconolactonase/LRE family protein
PAKAF_03878  1       11       -0.470388   probable 3-mercaptopyruvate sulfurtransferase
PAKAF_03879  2       9        0.083315    alpha/beta fold hydrolase
PAKAF_03880  2       10       0.214516    probable transcriptional regulator
PAKAF_03881  1       12       0.297895    hypothetical protein
PAKAF_03882  0       10       -0.011318   probable outer membrane protein precursor
PAKAF_03883  -1      9        1.774878    probable glutathione peroxidase
PAKAF_03884  -1      10       2.447320    probable major facilitator superfamily (MFS)
PAKAF_03885  -1      8        2.279450    probable transcriptional regulator
PAKAF_03886  -1      9        3.180696    probable acyl-CoA dehydrogenase
PAKAF_03887  1       12       4.590022    probable transcriptional regulator
PAKAF_03888  2       11       4.910441    probable major facilitator superfamily (MFS)
PAKAF_03889  3       12       5.392522    cobalamin (5'-phosphate) synthase
PAKAF_03890  2       11       5.597764    alpha-ribazole phosphatase family protein
PAKAF_03891  3       12       5.663496    nicotinate-nucleotide--dimethylbenzimidazole
PAKAF_03892  1       10       4.998807    cobinamide kinase
PAKAF_03893  1       9        4.533904    cobyric acid synthase
PAKAF_03894  -1      8        3.212572    cobalamin biosynthetic protein CobC
PAKAF_03895  0       9        3.424674    cobalamin biosynthetic protein CobD
PAKAF_03896  -1      8        2.585700    5,6-dimethylbenzimidazole synthase
PAKAF_03897  -2      7        2.619581    cobyrinic acid a,c-diamide synthase
PAKAF_03898  -2      8        2.839530    cob(I)alamin adenosyltransferase
PAKAF_03899  -1      9        3.872329    probable tonB-dependent receptor
PAKAF_03901  2       8        5.271935    FUSC family protein
PAKAF_03902  1       11       5.341430    probable transcriptional regulator
PAKAF_03903  1       9        5.524511    4-hydroxyproline epimerase
PAKAF_03904  2       9        5.489716    hypothetical protein
PAKAF_03906  2       7        5.141864    probable oxidoreductase
PAKAF_03907  1       9        3.705529    DMT family transporter
PAKAF_03908  -1      11       3.856961    probable transcriptional regulator
PAKAF_03909  -1      13       3.260829    helix-turn-helix domain-containing protein
PAKAF_03910  0       14       3.238737    probable major facilitator superfamily (MFS)
PAKAF_03911  0       14       2.390449    probable transcriptional regulator
PAKAF_03912  1       16       2.518101    amino acid ABC transporter periplasmic binding
PAKAF_03913  2       16       2.539959    DUF521 domain-containing protein
PAKAF_03914  1       14       2.425217    probable permease of ABC transporter
PAKAF_03915  1       10       2.628022    amino acid ABC transporter membrane protein
PAKAF_03916  3       10       2.836471    amino acid ABC transporter ATP binding protein
PAKAF_03917  1       9        3.072398    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03875  SSPAMPROTEIN  27     0.007    Salmonella surface presentation of antigen gene typ...
		>SSPAMPROTEIN#Salmonella surface presentation of antigen gene type M signature.

Length = 147

Score = 27.3 bits (60), Expect = 0.007
Identities = 17/63 (26%), Positives = 33/63 (52%)

Query: 1 MKRICSVYKSPRKNEMYLYVDKREALSRVPEALLVPFGAPQHVFDLLLTPERQLAREDVA 60
++R C+V+ S ++ + Y D+ L EA++ + + D L RQL+RE++
Sbjct: 10 LQRRCTVFHSQCESILLRYQDEDRRLQVEEEAIVEQIAGLKLLLDTLRAENRQLSREEIY 69

Query: 61 KVL 63
+L
Sbjct: 70 ALL 72


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03880  HTHTETR       76     2e-19    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 76.2 bits (187), Expect = 2e-19
Identities = 37/202 (18%), Positives = 73/202 (36%), Gaps = 10/202 (4%)

Query: 5 SRQQENAEATREALLESALSAFIEHGYGGVSIDAIAREARVTKGAFYHHFGSKQELLAEC 64
+ ++ A+ TR+ +L+ AL F + G S+ IA+ A VT+GA Y HF K +L +E
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 65 YERQVRTIAEDLDRVPAQVDKWAEAAALA--EAFIDSVMARGKRQL----SLQEVITVVG 118
+E I E A+ + ++S + +R+L + V
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 119 WE---RWKRIDSRHTLRYVGRLVDELAASGELK-DYRRETLVGQLYGFLTQAAMSLRDAR 174
+ +R + + + + + L D + G+++ + A
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAP 182

Query: 175 NKRQAANEVKAIIRDFLYSLRR 196
E + + L
Sbjct: 183 QSFDLKKEARDYVAILLEMYLL 204


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03884  TCRTETB       44     4e-07    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 44.5 bits (105), Expect = 4e-07
Identities = 33/133 (24%), Positives = 62/133 (46%), Gaps = 3/133 (2%)

Query: 53 LVWGLAQPFTGALADRYGAARAVLVGGLLYALGLVLMGLSQSATGLSLSAGLLIGLGLSG 112
L + + G L+D+ G R +L G ++ G V+ + S L + A + G G +
Sbjct: 60 LTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAG-AA 118

Query: 113 TSFSVILGAVGRAVPAEQRSMAMGISSAAGSFGQFAMLPGTLGLIG-WLGWSSALLALGL 171
++++ V R +P E R A G+ + + G+ + P G+I ++ WS LL +
Sbjct: 119 AFPALVMVVVARYIPKENRGKAFGLIGSIVAMGE-GVGPAIGGMIAHYIHWSYLLLIPMI 177

Query: 172 LVALIVPLAGLMK 184
+ + L L+K
Sbjct: 178 TIITVPFLMKLLK 190


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03887  HTHTETR       54     2e-11    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 53.9 bits (129), Expect = 2e-11
Identities = 22/107 (20%), Positives = 40/107 (37%), Gaps = 1/107 (0%)

Query: 1 MGRRRTIDRDQLLDAAEAVIGREGAAGLTIDAVAKEMGITKGGVQYCFGTKDALIDAIFE 60
+ R +LD A + ++G + ++ +AK G+T+G + + F K L I+E
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 61 RWGKAYDSLFEAVAGKQP-TPLTRVRAHAEATQRSDELSSSKAAALM 106
L K P PL+ +R S + +
Sbjct: 65 LSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLME 111


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03888  TCRTETB       195    9e-59    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 195 bits (496), Expect = 9e-59
Identities = 90/398 (22%), Positives = 175/398 (43%), Gaps = 14/398 (3%)

Query: 18 FLIIIDMTVLYTALPRLTHDLGATAAEKLWIVNAYPLVVAGLLPGAGLLSDRLGHKRLFL 77
F +++ VL +LP + +D A W+ A+ L + G LSD+LG KRL L
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 78 AGLPLFGLASLCAAFAPSAAA-LIAARAGLAVGAAVMMPATLSIVRHVFQDERERALAIG 136
G+ + S+ S + LI AR GAA PA + +V + + R A G
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAA-AFPALVMVVVARYIPKENRGKAFG 142

Query: 137 IWASVASAGAALGPVVGGVLLEFFWWGSVFLINVPVVVVALLLALPAIPACGGQSRRPWD 196
+ S+ + G +GP +GG++ + W +L+ +P++ + + L + + + +D
Sbjct: 143 LIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKEVRIKGHFD 200

Query: 197 PLGSLQVMFGLVGVVYAIKELSTRAPDFGLAVLAALGGMLCLYLFVRRQRRAREPMIDFA 256
G + + G+V + L T + +++ +L +FV+ R+ +P +D
Sbjct: 201 IKGIILMSVGIVFFM-----LFTTSYSISFLIVS----VLSFLIFVKHIRKVTDPFVDPG 251

Query: 257 LFRNRRFARGVAVALVATMALVGMELVFSQHLQLVQGLTPLKAG-LFVLPIPLASLVVGP 315
L +N F GV + + G + ++ V L+ + G + + P ++ ++ G
Sbjct: 252 LGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGY 311

Query: 316 LAGWLVPRWGENRVMCASLLLGSAGLLGLALSYQAATGAQLASLVLLGVGFGGAMTAAST 375
+ G LV R G V+ + S L + + + +V + G T ST
Sbjct: 312 IGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVIST 371

Query: 376 AVMLNVDEQSSGMAAAIEDVSYELGGVIGVTLLGSLMS 413
V ++ +Q +G ++ + + L G+ ++G L+S
Sbjct: 372 IVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03899  PF00577       30     0.032    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 30.2 bits (68), Expect = 0.032
Identities = 29/154 (18%), Positives = 38/154 (24%), Gaps = 37/154 (24%)

Query: 153 RRGDGQGAKPFFSAGYGTHQ-----TLEGSAGVSG-------GAGNGWYSLGVSSFDTAG 200
R G+ Q KP F H T+ G ++ G G +LG S D
Sbjct: 384 RSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMTQ 443

Query: 201 INTKRAGT-------------------------AGYEPDRDGYRNLSGNLRGGYRFDNGL 235
N+ GY GY N + N
Sbjct: 444 ANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIE 503

Query: 236 ELDGTLLRAKSHNDYDQVFGNSGFNANADGEQNL 269
DG + DY + N Q L
Sbjct: 504 TQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQL 537


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03901  ACRIFLAVINRP  30     0.047    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 29.8 bits (67), Expect = 0.047
Identities = 25/147 (17%), Positives = 46/147 (31%), Gaps = 33/147 (22%)

Query: 58 VSQPYSGMV----LAKGMFRLIGTCAGALVSIGMVALYGQASLPFLLLMALWLAFCTAGA 113
+ ++GM L+ + + +V + + ALY S+P +++ + L G
Sbjct: 854 IGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIV--GV 911

Query: 114 SLLHNHASYGFVLAGYTAAIVALPASADPATVFDQAVARCSEIGLGILCAALV-NVLLWP 172
L L + + +GL N +L
Sbjct: 912 LLAAT-------LFNQKNDVYFM-------------------VGLLTTIGLSAKNAILIV 945

Query: 173 RRLERQLANQGKAAWEAGLQAAAAELR 199
+ + +GK EA L A LR
Sbjct: 946 EFAKDLMEKEGKGVVEATLMAVRMRLR 972


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03904  NUCEPIMERASE  29     0.024    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 29.4 bits (66), Expect = 0.024
Identities = 9/30 (30%), Positives = 15/30 (50%), Gaps = 1/30 (3%)

Query: 45 VIVVG-AGIVGSACAHELARRGLDVLVLDS 73
+V G AG +G + L G V+ +D+
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDN 32


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03910  TCRTETB       110    2e-28    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 110 bits (277), Expect = 2e-28
Identities = 80/418 (19%), Positives = 162/418 (38%), Gaps = 30/418 (7%)

Query: 15 LCILLAGQLLPMIDFSIVNVALDALAHSLGASETELELIVAVYGVAFAVCLAMGGRLGDN 74
LCIL +++ ++NV+L +A+ + + + F++ A+ G+L D
Sbjct: 19 LCIL---SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75

Query: 75 YGRRRLFDLGVALFAVASLLCGLAGS-VWLLLVARALQGVGAALIVPQILATLHVSLSGH 133
G +RL G+ + S++ + S LL++AR +QG GAA ++ + +
Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKE 135

Query: 134 AHSRALAAYGAIGGLAFVVGQVLGGFLVSADIGGLGWRSVFLINLPICLGILLCSRRWVP 193
+A G+I + VG +GG + + W +L+ +P+ I + +
Sbjct: 136 NRGKAFGLIGSIVAMGEGVGPAIGGMIAHY----IHWS--YLLLIPMITIITVPFLMKLL 189

Query: 194 ETRAEHAARVDAPGTLLLAALILCLLLPLALGPSLHWS-WPCALLLAAAVPLLAWLWRTE 252
+ D G +L++ I+ +L + L+
Sbjct: 190 KKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIF------------- 236

Query: 253 LRQERRQAWPLLPPSLLRLPSIRFGLLLAILFFACWSGFMFALALALQAGAGLSPVQAGN 312
++ R+ P + P L + G+L + F +GF+ + ++ LS + G+
Sbjct: 237 VKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGS 296

Query: 313 AFIALGA-SYFVSALLTARVAARIGPVRLLLLGCVIQMCGLLGLMLTLQRVWPQPGILNL 371
I G S + + + R GP+ +L +G L + +
Sbjct: 297 VIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFL-----LETTSWFM 351

Query: 372 APATLVIGFGQAFIVSSFFRIGLSEVPAAQAGAGSAMLATVQQASLGLGSALLGAVFA 429
+ + G +F + I S + +AGAG ++L S G G A++G + +
Sbjct: 352 TIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


56  PAKAF_03932  PAKAF_03948  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_03932  2       12       4.448262    probable transcriptional regulator
PAKAF_03933  1       11       5.538816    probable enoyl-CoA hydratase/isomerase
PAKAF_03934  1       11       4.680189    alpha/beta fold hydrolase
PAKAF_03935  1       9        4.721762    hypothetical protein
PAKAF_03936  1       11       4.418412    hypothetical protein
PAKAF_03937  1       11       4.215921    probable outer membrane component of multidrug
PAKAF_03938  1       13       3.238908    probable multidrug resistance efflux pump
PAKAF_03939  1       14       2.518785    probable major facilitator superfamily (MFS)
PAKAF_03940  0       17       2.542453    probable transcriptional regulator
PAKAF_03941  0       15       2.111252    thioredoxin family protein
PAKAF_03942  0       14       2.465336    DUF2790 domain-containing protein
PAKAF_03943  1       14       2.654013    FUSC family protein
PAKAF_03944  -1      13       2.404349    HlyD family secretion protein
PAKAF_03945  1       14       2.306225    DUF1656 domain-containing protein
PAKAF_03946  2       12       2.286307    probable transcriptional regulator
PAKAF_03947  3       12       2.862419    hypothetical protein
PAKAF_03948  3       10       2.529728    aminoglycoside phosphotransferase family
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03932  HTHTETR       45     3e-08    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 44.6 bits (105), Expect = 3e-08
Identities = 20/150 (13%), Positives = 45/150 (30%), Gaps = 11/150 (7%)

Query: 1 MEMLSSACGLTKASFYHHYPNKEALLRDVLEWTHQRLAETLFSIAYDPLLTPRERLEKLG 60
+ ++ A G+T+ + Y H+ +K L ++ E + + E P L ++
Sbjct: 34 LGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPGDPLSVLREIL 93

Query: 61 RKAARLFQDDSIGCLMGVVAVDASYGRSELMAPIRSFLDDWAQAFAQLYRPAFDEA--QA 118
+ L+ + + + + ++
Sbjct: 94 IHVLESTVTEERRRLLMEIIF-----HKCEFVGEMAVVQQAQRNLCLESYDRIEQTLKHC 148

Query: 119 LERGRQLVADFEGAILLARIYGEPGYIDGV 148
+E L AD + GYI G+
Sbjct: 149 IEAK-MLPADLMTRRAAIIMR---GYISGL 174


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03938  RTXTOXIND     121    1e-32    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 121 bits (305), Expect = 1e-32
Identities = 61/368 (16%), Positives = 110/368 (29%), Gaps = 68/368 (18%)

Query: 66 AVSAQVSGYVAEVLVADDADVQAGDLLLRLDPRDFR-------QRLRAAEAREAAAQAAL 118
+ + V E++V + V+ GD+LL+L L A + Q
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILS 157

Query: 119 EAQ-------------------------------RAKLETLDRQLLEQAQTISRARADGE 147
+ + + T Q ++ + + RA+
Sbjct: 158 RSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERL 217

Query: 148 AARAEWRRAETDWR-------RYRQLADEHATSRQRLENADAAHQRARAAARRANAEEGR 200
A R E R + L + A ++ + + + A R ++ +
Sbjct: 218 TVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQ 277

Query: 201 QRAARDVLKSR--------RREAEAALAQRQAELQEAAAARELARHALDDTEIRAPFAGR 252
+ K + E L Q + + IRAP + +
Sbjct: 278 IESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVK 337

Query: 253 VGQRKVRLR-QYVTPGLPLLAVVPLEQAYVV-ANYKETQLERIRPGQPVELEVDTFGRRW 310
V Q KV VT L+ +VP + V A + + I GQ ++V+ F
Sbjct: 338 VQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTR 397

Query: 311 RGRVDSVAPASGAVFALLPPDNATGNFTKIVQRFPVRIRLDADAAERG----RLLPGMSV 366
G + + D +V F V I ++ + G L GM+V
Sbjct: 398 YGYLV-------GKVKNINLDAIEDQRLGLV--FNVIISIEENCLSTGNKNIPLSSGMAV 448

Query: 367 IATVDTRE 374
A + T
Sbjct: 449 TAEIKTGM 456


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03939  TCRTETB       109    6e-28    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 109 bits (274), Expect = 6e-28
Identities = 79/402 (19%), Positives = 168/402 (41%), Gaps = 17/402 (4%)

Query: 23 FMAGMNVHVTSAALPEIEGALGATFEEGSWISTAYLVAEISMIPLTAWLVEVFSLRRVML 82
F + +N V + +LP+I +W++TA+++ + L + ++R++L
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 83 LGSLVFLLSSLSCALAPN-LSTLILIRVIQGASGAVLIPLSMQLILTELPSSRVPLGMAL 141
G ++ S+ + + S LI+ R IQGA A L M ++ +P L
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 142 FSLSNSVAQAAGPSIGGWLADAYSWRWIFLLQLLPGIALLAAVAWSIRPRDGDRERLRQA 201
++ + GP+IGG +A W ++ L+ ++ I + + + R +
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKL---LKKEVRIK-GHF 199

Query: 202 DWLGIGAMVAGLGALQIVLEEGGRRDWFESGFIRTFAVLAVLALLLFVQRQLWGARPFIN 261
D GI M G+ + F + + +F +++VL+ L+FV+ PF++
Sbjct: 200 DIKGIILMSVGIVFFML----------FTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVD 249

Query: 262 LRLLGSYNFGVSSLAMAVFGAATFGLVFLVPNYLSQLQGFNARQIGDSLILYGLVQLLL- 320
L + F + L + G V +VP + + + +IG +I G + +++
Sbjct: 250 PGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIF 309

Query: 321 APLLPRLMRWLNPKLLVAGGFAIMALGCWMGAHLNADAGRNVIIPSIVVRGIGQPLIMVA 380
+ L+ P ++ G +++ ++ A + + IV G
Sbjct: 310 GYIGGILVDRRGPLYVLNIGVTFLSVS-FLTASFLLETTSWFMTIIIVFVLGGLSFTKTV 368

Query: 381 LSVLAVKGLDKAQAGSASALISMLRNLGGAIGTALLTQLVSL 422
+S + L + +AG+ +L++ L G A++ L+S+
Sbjct: 369 ISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSI 410


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03940  HTHFIS        33     9e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 33.3 bits (76), Expect = 9e-04
Identities = 14/103 (13%), Positives = 31/103 (30%), Gaps = 6/103 (5%)

Query: 87 RHDLPRDCRVVDVPPLLRQLIVAAMRIAPDYPPGGRDERVMELILDELRVLPILALHVPQ 146
R + + R + + + ++ + D L + + +
Sbjct: 376 REIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAE 435

Query: 147 PVDPRLAALCRSLRAEPAADWSLGDAARRLGVSPRTLTRAFQR 189
P + L A A + AA LG++ TL + +
Sbjct: 436 MEYPLI------LAALTATRGNQIKAADLLGLNRNTLRKKIRE 472


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03944  RTXTOXIND     65     6e-14    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 65.2 bits (159), Expect = 6e-14
Identities = 43/214 (20%), Positives = 76/214 (35%), Gaps = 39/214 (18%)

Query: 79 RSYRLAVRQREAELEQARETLRQRDEQWRRRMQLAGAVSREEVANAGRALRIARARAEAA 138
R Y+ + Q E+E+ A+E + + ++ + LR
Sbjct: 269 RVYKSQLEQIESEILSAKEEYQLVTQLFKNEIL--------------DKLRQTTDNIGLL 314

Query: 139 GAALDQARLDLRRSELRSPVDGYVTQLRVQ-PGDYAAAGRTNIFIV-DRRSFWVTGYFEE 196
L + + S +R+PV V QL+V G T + IV + + VT +
Sbjct: 315 TLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQN 374

Query: 197 TKLRNVQVGAPATIKLMGFD----PLLDGHVASIGRGVADLNESRADSGLPQVSPNFSWI 252
+ + VG A IK+ F L G V +I D+ Q
Sbjct: 375 KDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNIN----------LDAIEDQRLGLV--- 421

Query: 253 RLAQRVPVRIELDRVPS---GVVLAAGMTGSVEV 283
V + IE + + + + L++GM + E+
Sbjct: 422 ---FNVIISIEENCLSTGNKNIPLSSGMAVTAEI 452



Score = 47.5 bits (113), Expect = 3e-08
Identities = 18/114 (15%), Positives = 41/114 (35%), Gaps = 3/114 (2%)

Query: 41 VSAQVIRIAPEVSGSVEAVFVADNQRVARGDPLYRIDPRSYRLAVRQREAELEQARETLR 100
S + I P + V+ + V + + V +GD L ++ + ++ L QAR +
Sbjct: 92 HSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLE-Q 150

Query: 101 QRDEQWRRRMQLAGAVSREEVANAGRALRIARARAEAAGAALDQARLDLRRSEL 154
R + R ++L E + +L + + +++
Sbjct: 151 TRYQILSRSIELNKL--PELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQK 202


57  PAKAF_04079  PAKAF_04091  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_04079  -1      12       -3.244304   transcriptional regulator FleQ
PAKAF_04080  1       18       -4.206880   flagellar assembly protein FliT
PAKAF_04081  -1      15       -1.854410   flagellar protein FliS,Flagellar protein
PAKAF_04082  -1      18       -1.959464   flagellar protein,Flagellar protein
PAKAF_04083  -2      20       -3.236747   flagellar cap protein,Flagellar cap
PAKAF_04084  -1      25       -3.192797   FlaG,flagellar protein FlaG,FlaG protein
PAKAF_04085  -1      26       -3.627525   flagellin type B,A-type
PAKAF_04086  -1      31       -4.215002   O-antigen biosynthesis protein,Chondroitin
PAKAF_04087  1       42       -7.909942   hypothetical protein
PAKAF_04088  1       37       -7.276070   3-demethylubiquinone-9
PAKAF_04089  0       30       -5.880432   hypothetical protein
PAKAF_04090  0       25       -4.941410   CMP-2-keto-3-deoxyoctulosonic acid
PAKAF_04091  0       24       -4.330778   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04079  HTHFIS        510    0.0      FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 510 bits (1314), Expect = 0.0
Identities = 181/489 (37%), Positives = 256/489 (52%), Gaps = 14/489 (2%)

Query: 5 TKLLLIDDNLDRSRDLAVILNFLGEDQLTCNS--EDWREVAAGLSNSREALCVLLGSVES 62
+L+ DD+ L L+ G D ++ WR +AAG + V+ V
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGD-----LVVTDVVMP 58

Query: 63 KGGAVELLKQLASWDEYLPILLI-GEPAPADWPEELRRRVLASLEMPPSYNKLLDSLHRA 121
A +LL ++ LP+L++ + + + L P +L+ + RA
Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 122 QVYREMYDQARERGRSREPNLFRSLVGTSRAIQQVRQMMQQVADTDASVLILGESGTGKE 181
+ R + LVG S A+Q++ +++ ++ TD +++I GESGTGKE
Sbjct: 119 LAEP----KRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKE 174

Query: 182 VVARNLHYHSKRREGPFVPVNCGAIPAELLESELFGHEKGAFTGAITSRAGRFELANGGT 241
+VAR LH + KRR GPFV +N AIP +L+ESELFGHEKGAFTGA T GRFE A GGT
Sbjct: 175 LVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGT 234

Query: 242 LFLDEIGDMPLPMQVKLLRVLQERTFERVGSNKTQNVDVRIIAATHKNLEKMIEDGTFRE 301
LFLDEIGDMP+ Q +LLRVLQ+ + VG DVRI+AAT+K+L++ I G FRE
Sbjct: 235 LFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFRE 294

Query: 302 DLYYRLNVFPIEMAPLRERVEDIALLLNELISRMEHEKRGSIRFNSAAIMSLCRHDWPGN 361
DLYYRLNV P+ + PLR+R EDI L+ + + E E RF+ A+ + H WPGN
Sbjct: 295 DLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGN 354

Query: 362 VRELANLVERLAIMHPYGVIGVGELPKKFR-HVDDEDEQLASSLREELEERAAINAGLPG 420
VREL NLV RL ++P VI + + R + D + A++ L A+ +
Sbjct: 355 VRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQ 414

Query: 421 MDAPAM-LPAEGLDLKDYLANLEQGLIQQALDDAGGVVARAAERLRIRRTTLVEKMRKYG 479
A LA +E LI AL G +AA+ L + R TL +K+R+ G
Sbjct: 415 YFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELG 474

Query: 480 MSRRDDDLS 488
+S S
Sbjct: 475 VSVYRSSRS 483


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
PAKAF_04085  FLAGELLIN  159    5e-46    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 159 bits (402), Expect = 5e-46
Identities = 109/326 (33%), Positives = 154/326 (47%), Gaps = 2/326 (0%)

Query: 2 ALTVNTNIASLNTQRNLNNSSASLNTSLQRLSTGSRINSAKDDAAGLQIANRLTSQVNGL 61
A +NTN SL TQ NLN S +SL+++++RLS+G RINSAKDDAAG IANR TS + GL
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 NVATKNANDGISLAQTAEGALQQSTNILQRMRDLSLQSANGSNSDSERTALNGEVKQLQK 121
A++NANDGIS+AQT EGAL + N LQR+R+LS+Q+ NG+NSDS+ ++ E++Q +
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 ELDRISNTTTFGGRKLLDGSFGVASFQVGSAANEIISVGIDEMSAESLNGTYFKADGGGA 181
E+DR+SN T F G K+L + QVG+ E I++ + ++ +SL F +G
Sbjct: 121 EIDRVSNQTQFNGVKVLSQDNQM-KIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 182 VTAATASGTVDIAIGITGGSAVNVKVDMKGNETAEQAAAKIAAAVNDANVGIGAFSDGDT 241
T + G + K + N A A V D A T
Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAV-VTDTTAPTVPDKVYVNAANGQLTT 238

Query: 242 ISYVSKAGKDGSGAITSAVSGVVIADTGSTGVGTAAGVAPSATAFAKTNDTVAKIDISTA 301
+ D S G G T DT D +
Sbjct: 239 DDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGK 298

Query: 302 KGAQSAVLVIDEAIKQIDAQRADLGA 327
+ + I A A++ A
Sbjct: 299 VSTTINGEKVTLTVADITAGAANVDA 324



Score = 107 bits (269), Expect = 9e-28
Identities = 77/363 (21%), Positives = 132/363 (36%), Gaps = 6/363 (1%)

Query: 33 STGSRINSAKDDAAGLQIANRLTSQVNGLNVATKNANDGISLAQTAEGALQQSTNILQRM 92
+ G I + + + + + +
Sbjct: 150 NDGETITIDLQKIDVKSLGLDGFNVNGPKEATVGDLKSSFKNVTGYDTYAVGANKYRVDV 209

Query: 93 RDLSLQSANGSNSDSERTALNGEVKQLQKELDRISNTTTFGGRKLLDGSFGVASFQVGSA 152
++ + + + ++ +N QL + + A G+
Sbjct: 210 NSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAI 269

Query: 153 ANEIISVGIDEMSAESLNGTYFKADGGGAVTAATASGTVDIAI-GITGGSAVNVKVDMKG 211
D T DG G V+ V + + IT G+A ++
Sbjct: 270 KGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQS 329

Query: 212 NETAEQAAAKIAAAVNDANVGIGAFSDGDTISYVSKAGKDGSGAITSAVSGVVIADTGST 271
++ + +D + G IT A+
Sbjct: 330 SKNVYTSVVNGQFTFDDKTKN----ESAKLSDLEANNAVKGESKITVN-GAEYTANAAGD 384

Query: 272 GVGTAAGVAPSATAFAKTNDTVAKIDISTAKGAQSAVLVIDEAIKQIDAQRADLGAVQNR 331
V A + + + + + K + + ID A+ ++DA R+ LGA+QNR
Sbjct: 385 KVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLASIDSALSKVDAVRSSLGAIQNR 444

Query: 332 FDNTINNLKNIGENVSAARGRIEDTDFAAETANLTKNQVLQQAGTAILAQANQLPQSVLS 391
FD+ I NL N N+++AR RIED D+A E +N++K Q+LQQAGT++LAQANQ+PQ+VLS
Sbjct: 445 FDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAGTSVLAQANQVPQNVLS 504

Query: 392 LLR 394
LLR
Sbjct: 505 LLR 507


58  PAKAF_04103  PAKAF_04111  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_04103  3       14       -1.683982   flagellar P-ring protein precursor FlgI
PAKAF_04104  3       14       -2.497347   flagellar L-ring protein precursor FlgH
PAKAF_04105  3       14       -2.990902   flagellar basal-body rod protein FlgG
PAKAF_04106  0       13       -3.264497   flagellar basal-body rod protein FlgF
PAKAF_04107  1       12       -3.251842   flagellar hook protein FlgE
PAKAF_04108  0       14       -3.599933   flagellar basal-body rod modification protein
PAKAF_04109  0       13       -3.549588   flagellar basal-body rod protein FlgC
PAKAF_04110  1       13       -2.822366   flagellar basal-body rod protein FlgB
PAKAF_04111  2       13       -2.530546   DUF5064 family protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04103  FLGPRINGFLGI  436    e-155    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 436 bits (1122), Expect = e-155
Identities = 168/366 (45%), Positives = 224/366 (61%), Gaps = 10/366 (2%)

Query: 7 LLALAALLLAAGAAQAERLKDIASIQGVRTNQLIGYGLVVGLSGSGDQTTQTPFTLQTFN 66
AL L A R+KDIAS+Q R NQLIGYGLVVGL G+GD +PFT Q+
Sbjct: 13 FSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMR 72

Query: 67 NMLAQFGIKVPANVGNVQLKNVAAVSVHADLPPFAKPGQPIDVTVSSIGNAKSLRGGSLL 126
ML GI G KN+AAV V A+LPPFA PG +DVTVSS+G+A SLRGG+L+
Sbjct: 73 AMLQNLGITTQG--GQSNAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLI 130

Query: 127 MTPLKGIDGQVYAVAQGNLVVGGFDAEGRDGSKITVNVPSAGRIPAGATVERAVPSGFDQ 186
MT L G DGQ+YAVAQG L+V GF A+G D + +T V ++ R+P GA +ER +PS F
Sbjct: 131 MTSLSGADGQIYAVAQGALIVNGFSAQG-DAATLTQGVTTSARVPNGAIIERELPSKFKD 189

Query: 187 GNSLTLNLNRPDFTTAKRIVDRINEL----LGPGVAHAVDGGSVRVSAPLDPNQRVDYLS 242
+L L L PDF+TA R+ D +N G +A D + V P ++
Sbjct: 190 SVNLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKP-RVADLTRLMA 248

Query: 243 ILENLDVQPGEAVAKVIINSRTGTIVIGQNVKVSPAAVTHGSLTVSITEDPIVSQPGAFS 302
+ENL V+ + AKV+IN RTGTIVIG +V++S AV++G+LTV +TE P V QP FS
Sbjct: 249 EIENLTVET-DTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQPAPFS 307

Query: 303 NGQTAVVPRSRVNAEEETKPMFKFGPGTTLDDIVRAVNQVGAAPSDLMAILEALKQAGAL 362
GQTAV P++ + A +E + G L +V +N +G ++AIL+ +K AGAL
Sbjct: 308 RGQTAVQPQTDIMAMQEGSKVA-IVEGPDLRTLVAGLNSIGLKADGIIAILQGIKSAGAL 366

Query: 363 QADLIV 368
QA+L++
Sbjct: 367 QAELVL 372


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04104  FLGLRINGFLGH  180    3e-59    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 180 bits (459), Expect = 3e-59
Identities = 81/224 (36%), Positives = 112/224 (50%), Gaps = 13/224 (5%)

Query: 12 IATALGGCVNPPPKPNDPYYAPVLPRTPLPAAQNNGAIYQAGF-----EQNLYDDRKAFR 66
+ +L GC P P P P P NG+I+Q+ Q L++DR+
Sbjct: 15 LVLSLTGCAWIPSTPLVQGATSAQP-VPGPTPVANGSIFQSAQPINYGYQPLFEDRRPRN 73

Query: 67 VGDIITITLNEKTQASKKANSDIQKDSKTKMGLTSLFGSGMTTNNPIGGGDLSLSAEYGG 126
+GD +TI L E ASK ++++ +D KT G + G + E G
Sbjct: 74 IGDTLTIVLQENVSASKSSSANASRDGKTNFGFDT---VPRYLQGLFGNARADV--EASG 128

Query: 127 SRDAKGDSQAGQSNSLTGSITVTVAEVLPNGILSVRGEKWMTLNTGNELVRIAGLVRADD 186
G A SN+ +G++TVTV +VL NG L V GEK + +N G E +R +G+V
Sbjct: 129 GNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRFSGVVNPRT 188

Query: 187 IATDNTVSSTRVADARITYSGTGAFADASQPGWLDRFF--LSPL 228
I+ NTV ST+VADARI Y G G +A GWL RFF LSP+
Sbjct: 189 ISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 232


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
PAKAF_04105  FLGHOOKAP1  45     2e-07    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 44.6 bits (105), Expect = 2e-07
Identities = 13/51 (25%), Positives = 25/51 (49%)

Query: 209 NGLGTVAQNTLENSNVNVVEELVNMITTQRAYEMNSKVISTADQMLSFVTQ 259
N + ++ S VN+ EE N+ Q+ Y N++V+ TA+ + +
Sbjct: 494 NVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544



Score = 39.6 bits (92), Expect = 9e-06
Identities = 20/79 (25%), Positives = 35/79 (44%), Gaps = 14/79 (17%)

Query: 3 SALWVSKTGLSAQDMNLTTISNNLANVSTTGFKRDRAEFQDLLYQIRRQPGGQSTQDSEL 62
S + + +GL+A L T SNN+++ + G+ R + +S L
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTI--------------MAQANSTL 47

Query: 63 PSGLQLGTGVRVVGTQKIF 81
+G +G GV V G Q+ +
Sbjct: 48 GAGGWVGNGVYVSGVQREY 66


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
PAKAF_04107  FLGHOOKAP1  45     5e-07    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 44.9 bits (106), Expect = 5e-07
Identities = 17/49 (34%), Positives = 27/49 (55%)

Query: 414 ALQSGALEASNVDISNELVNLIVHQRNYQANAKTIQTEDAVTQTIINLR 462
L + S V++ E NL Q+ Y ANA+ +QT +A+ +IN+R
Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546



Score = 41.1 bits (96), Expect = 8e-06
Identities = 22/69 (31%), Positives = 34/69 (49%), Gaps = 3/69 (4%)

Query: 2 SFNIGLSGIQAASSGLNVTGNNIANAGTVGFKQSRAEFADVYAASVLGSGSNPQGSGVLL 61
N +SG+ AA + LN NNI++ G+ + A A S LG+G G+GV +
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQ--ANSTLGAGGW-VGNGVYV 59

Query: 62 SDVSQMFKQ 70
S V + +
Sbjct: 60 SGVQREYDA 68


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
PAKAF_04109  FLGHOOKAP1  36     3e-05    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 36.1 bits (83), Expect = 3e-05
Identities = 8/38 (21%), Positives = 20/38 (52%)

Query: 107 NVNVVEEMADMISASRAFQTNAEMMNTAKQMMQKVLTL 144
VN+ EE ++ + + NA+++ TA + ++ +
Sbjct: 508 GVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545



Score = 29.5 bits (66), Expect = 0.004
Identities = 15/54 (27%), Positives = 25/54 (46%), Gaps = 2/54 (3%)

Query: 4 ASVFNIAGSGMSAQSTRLNTVASNIANAETVSSSVDKTYRARHPVFSTMFQQAQ 57
+S+ N A SG++A LNT ++NI++ + T A ST+
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMA--QANSTLGAGGW 52


59  PAKAF_04126  PAKAF_04132  Y  N  N  Genomic Island
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_04126  2       10       0.377141    PLP-dependent cysteine synthase family protein
PAKAF_04127  3       9        0.392950    DMT family transporter
PAKAF_04128  3       13       0.105137    Na+/H+ antiporter subunit G
PAKAF_04129  3       12       0.417861    K+/H+ antiporter subunit F
PAKAF_04130  2       11       1.409027    Na+/H+ antiporter subunit E
PAKAF_04131  1       11       1.293830    probable NADH dehydrogenase
PAKAF_04132  2       11       1.520945    Na+/H+ antiporter subunit C
60  PAKAF_04193  PAKAF_04232  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_04193  0       22       -3.737363   3-oxoacyl-[acyl-carrier-protein] synthase III
PAKAF_04194  2       29       -5.352018   PqsC
PAKAF_04195  3       33       -6.017611   PqsB
PAKAF_04196  4       37       -7.135709   probable coenzyme A ligase
PAKAF_04197  5       43       -8.833429   methylated-DNA--protein-cysteine
PAKAF_04198  4       38       -8.754558   usher CupC3
PAKAF_04199  3       43       -9.066552   chaperone CupC2
PAKAF_04200  1       34       -6.987471   fimbrial subunit CupC1
PAKAF_04201  1       35       -6.385108   Hpt domain-containing protein
PAKAF_04202  1       31       -6.015677   NUDIX hydrolase
PAKAF_04203  0       35       -7.247524   hypothetical protein
PAKAF_04204  1       41       -7.528335   acyl-CoA thioesterase
PAKAF_04205  0       39       -7.172072   IS3 family transposase
PAKAF_04205  0       38       -7.239865   IS3 family transposase
PAKAF_04206  2       37       -6.786870   transposase A,Transposase
PAKAF_04207  1       35       -6.299189   relaxase,Predicted HD-superfamily
PAKAF_04208  0       23       -4.548841   PA0977 ortholog, hypothetical protein
PAKAF_04210  0       20       -4.403309   *7-cyano-7-deazaguanine synthase QueC
PAKAF_04211  1       20       -4.906149   probable radical activating enzyme
PAKAF_04212  1       20       -4.572427   tol-pal system protein YbgF
PAKAF_04213  0       21       -5.099523   Peptidoglycan associated lipoprotein OprL
PAKAF_04214  1       19       -4.841498   TolB protein
PAKAF_04215  1       23       -4.990618   TolA protein
PAKAF_04216  1       24       -4.387962   TolR protein
PAKAF_04217  1       22       -4.051332   TolQ protein
PAKAF_04218  1       17       -3.696975   tol-pal system-associated acyl-CoA thioesterase
PAKAF_04219  1       14       -3.543754   Holliday junction DNA helicase RuvB
PAKAF_04220  1       14       -3.331326   Holliday junction DNA helicase RuvA
PAKAF_04221  1       13       -3.239201   Holliday junction resolvase RuvC
PAKAF_04222  0       13       -3.283673   pqsR-mediated PQS regulator, PmpR
PAKAF_04223  -1      12       -3.205742   aspartyl-tRNA synthetase
PAKAF_04224  -1      16       -3.476422   probable dna-binding stress protein
PAKAF_04225  -2      13       -3.236286   probable cold-shock protein
PAKAF_04226  -2      11       -3.239503   protein SlyX
PAKAF_04227  -1      11       -2.940249   HIT domain-containing protein
PAKAF_04228  -1      11       -3.003471   Basic amino acid, basic peptide and imipenem
PAKAF_04229  -1      15       -2.834326   PaaI family thioesterase
PAKAF_04230  -1      14       -2.841007   prolyl-tRNA synthetase
PAKAF_04231  -1      24       -3.236704   hypothetical protein
PAKAF_04232  1       29       -3.351359   probable acylphosphatase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_04193  PF04183  30     0.016    IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 29.8 bits (67), Expect = 0.016
Identities = 13/67 (19%), Positives = 27/67 (40%), Gaps = 5/67 (7%)

Query: 214 EGGGEFLMRGRPMFEHASQTLVRIAGEMLAAHELTLD-DIDHVICHQPNLRILDAVQEQL 272
+G + P + Q + + + +A L D H + LR + + +L
Sbjct: 444 QGDMRLVKEEFPEMDSLPQEVRDVTSRL-SADYLIHDLQTGHFVTV---LRFISPLMVRL 499

Query: 273 GIPQHKF 279
G+P+ +F
Sbjct: 500 GVPERRF 506


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_04198  PF00577  792    0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 792 bits (2046), Expect = 0.0
Identities = 282/863 (32%), Positives = 443/863 (51%), Gaps = 47/863 (5%)

Query: 27 LSVYSRSSCLMALGLALPAVTFAVEFNAEFLNNEGGAPVELKYFENGNSVSPGTYSVDIH 86
+ R A P + + FN FL ++ A +L FENG + PGTY VDI+
Sbjct: 24 AGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIY 83

Query: 87 LNQIMIRREDVVFSADPETGSVRPVVRVGLLKEIGVDIARLTRDKLIPDNLENNTPLNVA 146
LN + DV F+ + P + L +G++ A ++ L+ D+ + +
Sbjct: 84 LNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADD----ACVPLT 139

Query: 147 ELIPGASIEFDVNSLSLLVSIPQLYVQRHSRGYVDPSLWDDGVTALFSNYQANFTRNTN- 205
+I A+ + DV L ++IPQ ++ +RGY+ P LWD G+ A NY + N
Sbjct: 140 SMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNR 199

Query: 206 FGQNSDYRYLGLRNGFNLFGWRLRNDSSLS-----GGTGMRNKFSSNRTYVERDIRALKG 260
G NS Y YL L++G N+ WRLR++++ S +G +NK+ T++ERDI L+
Sbjct: 200 IGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRS 259

Query: 261 TLSLGELYTSAQGDAFESVRMRGVQLQSDIGMLPDNEISYTPVVRGIAETNATVEVSQNG 320
L+LG+ YT GD F+ + RG QL SD MLPD++ + PV+ GIA A V + QNG
Sbjct: 260 RLTLGDGYTQ--GDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNG 317

Query: 321 FVIYSTNVPPGAFEITDIYPSGSNGDLEVKIIEADGRQRSFKQSYSYLPVMTRKGNLRYG 380
+ IY++ VPPG F I DIY +G++GDL+V I EADG + F YS +P++ R+G+ RY
Sbjct: 318 YDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYS 377

Query: 381 LAAGEYHNDG--QPSVNLLQGSAVYGLSDRVTGFGGLLAAEKYNATNLGLGFNT-PLGGF 437
+ AGEY + Q Q + ++GL T +GG A++Y A N G+G N LG
Sbjct: 378 ITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGAL 437

Query: 438 SADVTHSQSRTRRGGRNQGQSLRLLYSKTINATETSFTVVGYRYSTEGYRTLSQH----- 492
S D+T + S ++ GQS+R LY+K++N + T+ +VGYRYST GY +
Sbjct: 438 SVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRM 497

Query: 493 ----------IDDMSEESYLYGSSSSRQKSRIDLTVNQTLFRRSSLYLTAGETTYWNRPG 542
+ + + Y + + ++ ++ LTV Q L R S+LYL+ TYW
Sbjct: 498 NGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSN 557

Query: 543 SSRRVQFGFSSGIKRASYSLAVSRTQETGSFGRSDTQFTASVSIPLGG--------SARS 594
+ Q G ++ + +++L+ S T+ GR D +V+IP R
Sbjct: 558 VDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGR-DQMLALNVNIPFSHWLRSDSKSQWRH 616

Query: 595 SQVYANAVSSQHGDSSLNTGISGYLDEANAFNYSAQANYSKDG----GNSGSVGLGWDTS 650
+ + +G + G+ G L E N +YS Q Y+ G G++G L +
Sbjct: 617 ASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGG 676

Query: 651 KAKLSANYSQGRDNKQINLGASGSVVVHSGGVTFGQPVGETFGLVEVPEVGGVGLDGYSS 710
+ YS D KQ+ G SG V+ H+ GVT GQP+ +T LV+ P ++ +
Sbjct: 677 YGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTG 736

Query: 711 VRTDGRGYAVLPYMQPYRYNWVNLDTNTLGSDTEISDSTQMAVPTRGAVIAKRFSAESGR 770
VRTD RGYAVLPY YR N V LDTNTL + ++ ++ VPTRGA++ F A G
Sbjct: 737 VRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGI 796

Query: 771 RVQFDLSMDSGGKIPFGAQAYDKEERVVGMVDNLSRLLVFGIEDQGRLSIRWSDG---SC 827
++ L+ + +PFGA + + G+V + ++ + G+ G++ ++W + C
Sbjct: 797 KLLMTLTHN-NKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHC 855

Query: 828 SVDYQLPPRNKDLTYERVALSCR 850
+YQLPP ++ +++ CR
Sbjct: 856 VANYQLPPESQQQLLTQLSAECR 878


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
PAKAF_04212  RTXTOXIND  32     0.002    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 32.1 bits (73), Expect = 0.002
Identities = 10/53 (18%), Positives = 19/53 (35%)

Query: 69 QLQQMQDELARLRGTLEEQQNQIQQLKQESLERYQDLDRRISGGGAPAAQNSA 121
+ + +EL + LE+ +++I K+E Q I N
Sbjct: 260 KYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIG 312


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
PAKAF_04213  OMPADOMAIN  116    6e-34    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 116 bits (291), Expect = 6e-34
Identities = 34/112 (30%), Positives = 53/112 (47%), Gaps = 12/112 (10%)

Query: 68 YFEYDSSDLKPEAMRALDVHA---KDLKGSGQRVVLEGHTDERGTREYNMALGERRAKAV 124
F ++ + LKPE ALD +L VV+ G+TD G+ YN L ERRA++V
Sbjct: 222 LFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQSV 281

Query: 125 QRYLVLQGVSPAQLELVSYGKERPVATGHDEQS---------WAQNRRVELK 167
YL+ +G+ ++ G+ PV + A +RRVE++
Sbjct: 282 VDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIE 333


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
PAKAF_04215  IGASERPTASE  48     4e-08    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 47.8 bits (113), Expect = 4e-08
Identities = 36/204 (17%), Positives = 71/204 (34%), Gaps = 21/204 (10%)

Query: 25 QLKSKSQATTQTNQKIAGEAKKTASKQYE-----VEQLEQKKLEQQKLEQQKLEQQQVAA 79
Q + TT N + + + +++ + E +Q +
Sbjct: 990 QTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESK 1049

Query: 80 AKAAEQKKADEARKAEAQKAAEAKKADEAKKAAEAKAAEQKKQADIAKKRAEDEAKKKAA 139
++ A E + A EAK +A + ++A ++ E K+
Sbjct: 1050 TVEKNEQDATETTAQNREVAKEAKSNVKA----------NTQTNEVA--QSGSETKETQT 1097

Query: 140 EDAKKKAAEDAKKKAAEEAKKKAAAEAAKKKAAVEAAKKKAAAAAAAARKAAEDKKAQAL 199
+ K+ A + ++KA E +K E K + V ++++ A A E+ +
Sbjct: 1098 TETKETATVEKEEKAKVETEKT--QEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNI 1155

Query: 200 AELLS--DTTERQQALADEVGSEV 221
E S +TT + A E S V
Sbjct: 1156 KEPQSQTNTTADTEQPAKETSSNV 1179


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
PAKAF_04217  60KDINNERMP  29     0.017    60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 28.7 bits (64), Expect = 0.017
Identities = 17/72 (23%), Positives = 28/72 (38%), Gaps = 13/72 (18%)

Query: 2 WSLISNASIVVQLVMLTLVAASVTSWIMIFQRGNAMRAAKKALDAFEERFWS-----GID 56
+S+I + +V+ +M L A TS MR + + A ER +
Sbjct: 356 FSIII-ITFIVRGIMYPLTKAQYTSM-------AKMRMLQPKIQAMRERLGDDKQRISQE 407

Query: 57 LSKLYRQAGSNP 68
+ LY+ NP
Sbjct: 408 MMALYKAEKVNP 419


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04223  ANTHRAXTOXNA  32     0.009    Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 31.6 bits (71), Expect = 0.009
Identities = 31/117 (26%), Positives = 51/117 (43%), Gaps = 23/117 (19%)

Query: 212 YYQIAKCFRDEDLRADRQPEFTQIDIETSFLDESDIIGITEKMVRQLFKEVL-------D 264
YY+I K + + D+ + +++ S D+SD ++ + Q FKE L D
Sbjct: 170 YYEIGKGISLDIISKDKSLDPEFLNLIKSLSDDSD---SSDLLFSQKFKEKLELNNKSID 226

Query: 265 VEF-----DEFPHMPFEEAMRRYGSDKPDLRIPLEL-----VDVADQLKEVEFKVFS 311
+ F EF H F A Y + PD R LEL + ++L++ F+ S
Sbjct: 227 INFIKENLTEFQHA-FSLAFSYYFA--PDHRTVLELYAPDMFEYMNKLEKGGFEKIS 280


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
PAKAF_04224  HELNAPAPROT  157    3e-52    Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature.

Length = 153

Score = 157 bits (398), Expect = 3e-52
Identities = 50/145 (34%), Positives = 72/145 (49%)

Query: 11 DRAAIAEGLSRLLADTYTLYLKTHNFHWNVTGPMFNTLHLMFEGQYTELAVAVDDIAERI 70
++ + L+ L++ + LY K H FHW V GP F TLH FE Y A VD IAER+
Sbjct: 9 NQTLVENSLNTQLSNWFLLYSKLHRFHWYVKGPHFFTLHEKFEELYDHAAETVDTIAERL 68

Query: 71 RALGFPAPGTYAAYARLSSIKEEEGVPEAEEMIRQLVQGQEAVVRTARSIFPLLDKVSDE 130
A+G T Y +SI + A EM++ LV + + ++ + L ++ D
Sbjct: 69 LAIGGQPVATVKEYTEHASITDGGNETSASEMVQALVNDYKQISSESKFVIGLAEENQDN 128

Query: 131 PTADLLTQRMQVHEKTAWMLRSLLA 155
TADL ++ EK WML S L
Sbjct: 129 ATADLFVGLIEEVEKQVWMLSSYLG 153


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04230  ANTHRAXTOXNA  30     0.031    Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 30.1 bits (67), Expect = 0.031
Identities = 10/54 (18%), Positives = 22/54 (40%)

Query: 208 HEFHVLANSGEDDIVFSDSSDYAANIEKAEAVPRESARGSATEDMRLVDTPNTK 261
V E + + DYA N E+++ V E +G + + + + + +
Sbjct: 138 ASRFVFEKKRETPKLIINIKDYAINSEQSKEVYYEIGKGISLDIISKDKSLDPE 191


61  PAKAF_04242  PAKAF_04250  Y  N  N  Genomic Island
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_04242  -2      16       -3.620673   phosphoribosylaminoimidazole synthetase
PAKAF_04243  -1      23       -4.596196   phosphoribosylaminoimidazole synthetase
PAKAF_04244  0       22       -5.611012   DUF3108 domain-containing protein
PAKAF_04245  1       28       -5.089067   probable transcriptional regulator
PAKAF_04246  2       25       -5.818742   thioredoxin family protein
PAKAF_04247  3       19       -5.303042   DUF2024 family protein
PAKAF_04248  4       18       -4.786873   hypothetical protein
PAKAF_04249  1       14       -4.015989   hypothetical protein
PAKAF_04250  1       16       -3.097987   hypothetical protein
62  PAKAF_04367  PAKAF_04411  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_04367  1       31       -5.042173   probable transcriptional regulator
PAKAF_04368  1       34       -7.312304   hypothetical protein
PAKAF_04370  1       44       -9.831193   hypothetical protein
PAKAF_04371  1       37       -10.165590  hypothetical protein
PAKAF_04372  2       31       -9.761297   von Willebrand factor type A domain-containing
PAKAF_04373  2       26       -9.221202   phospholipase D/transphosphatidylase
PAKAF_04374  2       28       -9.313253   SMC domain-containing protein,recombination
PAKAF_04375  2       29       -9.362044   type I restriction modification enzyme
PAKAF_04376  1       27       -8.184155   type I restriction modification enzyme methylase
PAKAF_04377  1       32       -8.460296   RmuC domain-containing protein family,DNA
PAKAF_04378  -1      36       -8.840058   helicase, type I site-specific
PAKAF_04379  -1      51       -9.443353   putative two-component response
PAKAF_04380  0       48       -9.665698   hypothetical protein
PAKAF_04381  -1      46       -8.827277   lipopolysaccharide heptosyltransferase
PAKAF_04382  1       42       -9.009698   hypothetical protein
PAKAF_04383  1       42       -8.913937   hypothetical protein
PAKAF_04384  1       44       -8.979027   Histidinol-phosphatase,histidinol-phosphatase,
PAKAF_04385  2       43       -8.880331   hypothetical protein
PAKAF_04386  2       43       -8.534395   hypothetical protein
PAKAF_04387  1       43       -8.584639   helicase domain-containing protein,ATP-dependent
PAKAF_04388  1       47       -8.415088   hypothetical protein
PAKAF_04389  0       49       -9.086097   Uncharacterized protein conserved in
PAKAF_04390  0       54       -10.779891  Uncharacterized protein conserved in
PAKAF_04391  0       61       -12.607822  hypothetical protein
PAKAF_04392  -1      60       -12.626815  Protein of unknown function (DUF3375)
PAKAF_04393  2       61       -13.548183  Predicted transcriptional regulator
PAKAF_04394  3       58       -12.544272  hypothetical protein
PAKAF_04395  3       44       -10.659151  hypothetical protein
PAKAF_04396  -1      17       -0.437512   hypothetical protein
PAKAF_04397  0       13       0.973988    resolvase,Putative transposon Tn552
PAKAF_05934  1       15       2.944117    hypothetical protein
PAKAF_04398  2       15       3.997488    hypothetical protein
PAKAF_04399  1       14       3.375937    probable ring-cleaving dioxygenase
PAKAF_04400  1       14       3.731899    probable transcriptional regulator
PAKAF_04401  2       13       3.557746    probable transcriptional regulator
PAKAF_04402  3       13       3.828612    RidA family protein
PAKAF_04403  1       11       3.067196    aminotransferase class V-fold PLP-dependent
PAKAF_04404  0       11       2.109532    hypothetical protein
PAKAF_04405  0       12       2.628068    probable major facilitator superfamily (MFS)
PAKAF_04406  0       10       1.285693    probable haloacid dehalogenase
PAKAF_04407  -2      11       0.911740    probable transporter
PAKAF_04408  -1      13       -0.489505   activator of HSP90 ATPase
PAKAF_04409  -1      11       0.041151    AmpDh3
PAKAF_04410  2       11       1.299639    hypothetical protein
PAKAF_04411  2       11       1.292078    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_04367  HTHTETR  53     6e-11    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 53.5 bits (128), Expect = 6e-11
Identities = 28/193 (14%), Positives = 71/193 (36%), Gaps = 13/193 (6%)

Query: 39 RTNIQLAAIPVFTRKGIAETTVNDLLEAARVSRRTFYKYFAGKLEVLESIYHSAVQLLLA 98
R +I A+ +F+++G++ T++ ++ +AA V+R Y +F K ++ I+ + +
Sbjct: 13 RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGE 72

Query: 99 RFGGLRSEAGSD-EDWLRAMVSLFFDYHL---AVGPIIRMMQEEALHAGS--PLAAHRQR 152
+++ D LR ++ + + ++ ++ + G + ++
Sbjct: 73 LELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRN 132

Query: 153 AHLKIIELWAERLG---AQGAAHDALTYRVLIWAMEAASLELLN----ASDPLELPRVKR 205
L+ + + L L R M L+ A +L + R
Sbjct: 133 LCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKEAR 192

Query: 206 VLGDLLVGTLCPR 218
+L+
Sbjct: 193 DYVAILLEMYLLC 205


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
PAKAF_04377  GPOSANCHOR  55     5e-10    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 55.5 bits (133), Expect = 5e-10
Identities = 60/321 (18%), Positives = 119/321 (37%), Gaps = 21/321 (6%)

Query: 35 RQISQLSMQLEQAELARASTEAQQQSLQIQLKAAEARSHELQIEEGKLKAQLQASADTMA 94
+ S +++ E +A+ A++ L+ L+ A S + L+A+ A A
Sbjct: 134 NFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQA 193

Query: 95 RLSAELHERKAAGSDLQVKLEEANRQHHESARRLEAVQAGAQGLQEQVTELRGRLDTSTT 154
L L + K++ + A R ++ +G T ++ T
Sbjct: 194 ELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEA 253

Query: 155 TNAALQEERDRLKDALASEETRAKVAETAEREAR---EQLSETKHKLTEHVQALDTLQKR 211
AAL+ + L+ AL + + L K L Q L+ ++
Sbjct: 254 EKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQS 313

Query: 212 -----------YQSTSNEHAELKTSLDKSEKQVAELRERLAQAQATNEQLHTDRDRLKDG 260
+ EH +L+ SE LR L ++ +QL + +L+
Sbjct: 314 LRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLE-- 371

Query: 261 LADEGKRAKALETAERDVRSQLLETKEALSQQVRSFNELQERLSTLSSEHTELKTTLQKR 320
++ K E + + +R L ++EA Q ++ E +L+ L + EL+ + +
Sbjct: 372 -----EQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKKLT 426

Query: 321 EEHFQEQMAQLADTRKSLTQE 341
E+ E A+L K+L ++
Sbjct: 427 EKEKAELQAKLEAEAKALKEK 447



Score = 52.8 bits (126), Expect = 3e-09
Identities = 36/278 (12%), Positives = 93/278 (33%)

Query: 55 EAQQQSLQIQLKAAEARSHELQIEEGKLKAQLQASADTMARLSAELHERKAAGSDLQVKL 114
+ + + +L+ + E + +L+A+ + A L+ +
Sbjct: 91 TEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEK 150

Query: 115 EEANRQHHESARRLEAVQAGAQGLQEQVTELRGRLDTSTTTNAALQEERDRLKDALASEE 174
+ + + LE + ++ L A L++ + + ++
Sbjct: 151 AALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADS 210

Query: 175 TRAKVAETAEREAREQLSETKHKLTEHVQALDTLQKRYQSTSNEHAELKTSLDKSEKQVA 234
+ K E + + ++ + L + + ++ E A L+ + EK +
Sbjct: 211 AKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALE 270

Query: 235 ELRERLAQAQATNEQLHTDRDRLKDGLADEGKRAKALETAERDVRSQLLETKEALSQQVR 294
A + L ++ L+ AD +++ L + +R L ++EA Q
Sbjct: 271 GAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEA 330

Query: 295 SFNELQERLSTLSSEHTELKTTLQKREEHFQEQMAQLA 332
+L+E+ + L+ L E ++ A+
Sbjct: 331 EHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQ 368



Score = 37.4 bits (86), Expect = 2e-04
Identities = 30/214 (14%), Positives = 64/214 (29%), Gaps = 16/214 (7%)

Query: 133 AGAQGLQEQVTELRGRLDTSTTTNAALQEERDRLKDALASEETRAKVAETAEREAREQLS 192
AG +V+ + R T T +QE D+ + + + + + ++
Sbjct: 31 AGLVVNTNEVSAVATRSQTDTL--EKVQERADKFEIENNTLKLKNSDLSFNNKALKDHND 88

Query: 193 ETKHKLTEHVQALDTLQKRYQSTSNEHAELKTSLDKSEKQVAELRERLAQAQATNEQLHT 252
E +L+ + L K +++ EL+ EK + A + L
Sbjct: 89 ELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEA 148

Query: 253 DRDRLKDGLADEGKRAKALETAERDVRSQLLETKEALSQQVRSFNELQERLSTLSSEHTE 312
++ L AD K + +++ L+ + L + E
Sbjct: 149 EKAALAARKADLEKALEGAMNFSTADSAKI--------------KTLEAEKAALEARQAE 194

Query: 313 LKTTLQKREEHFQEQMAQLADTRKSLTQEFENLA 346
L+ L+ A++ A
Sbjct: 195 LEKALEGAMNFSTADSAKIKTLEAEKAALAARKA 228


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
PAKAF_04379  HTHFIS  346    e-116    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 346 bits (889), Expect = e-116
Identities = 130/344 (37%), Positives = 184/344 (53%), Gaps = 28/344 (8%)

Query: 175 FDSIVTRSERMMMLKAQAQVLAQKQVPVLIYGETGTGKELFARAIHNAGPRAGARFIPVN 234
+V RS M + L Q + ++I GE+GTGKEL ARA+H+ G R F+ +N
Sbjct: 136 GMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAIN 195

Query: 235 CGAIAPELIDSTLFGHRKGAFTGAVENRDGVFQQAHGGTLFLDEFGELKPDVQVRLLRVL 294
AI +LI+S LFGH KGAFTGA G F+QA GGTLFLDE G++ D Q RLLRVL
Sbjct: 196 MAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVL 255

Query: 295 QEGTFTPVGGTQELKVDVRLITATHRNLMQEVAHGCFREDLFYRVAVGVLHLPPLREREG 354
Q+G +T VGG ++ DVR++ AT+++L Q + G FREDL+YR+ V L LPPLR+R
Sbjct: 256 QQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAE 315

Query: 355 DLLLLADALLEVLAKQDASLTGKKLSAEAKKLILRHPWRGNVRELQSTLLRSALWCQGSA 414
D+ L ++ K+ L K+ EA +L+ HPW GNVREL++ + R
Sbjct: 316 DIPDLVRHFVQQAEKEG--LDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDV 373

Query: 415 IGAEDIEQ----ALFQLPLEQVDVMARDVS---------------------QGIDLQEIV 449
I E IE + P+E+ + +S ++
Sbjct: 374 ITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVL 433

Query: 450 AEVAGHYLHKALTLTGHNKTRAASLLGLKSQQTLSNWMDKYGIE 493
AE+ + ALT T N+ +AA LLGL ++ TL + + G+
Sbjct: 434 AEMEYPLILAALTATRGNQIKAADLLGL-NRNTLRKKIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
PAKAF_04390  GPOSANCHOR  37     6e-04    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 36.6 bits (84), Expect = 6e-04
Identities = 40/226 (17%), Positives = 84/226 (37%), Gaps = 31/226 (13%)

Query: 596 DKQDQKRLDEDWLTGFDNRDRLAFLAEQIREVNEQLEPAKLALDAAQGDVRQLETQASLL 655
D ++ R A L + + + + + LE + + L
Sbjct: 241 STADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADL 300

Query: 656 NRVKELQFEDIDLPGAQSQLESLRTQLATLTR----LDSDLAMIKVELDAAEALQESLDQ 711
++ + +SLR L L+++ ++ + +EA ++SL +
Sbjct: 301 EHQSQV---------LNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRR 351

Query: 712 QLRQLIEQYVQLKTQFD------QAASATRKA----YNSAEKGLNDTQRELAQAHFPTLT 761
L E QL+ + + + A+R++ +++ + ++ L +A+
Sbjct: 352 DLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKLAA 411

Query: 762 ADDLGDIVELERKHTRE--------LQGQLKALGEKLGDQKSELAK 799
+ L +E +K T + L+ + KAL EKL Q ELAK
Sbjct: 412 LEKLNKELEESKKLTEKEKAELQAKLEAEAKALKEKLAKQAEELAK 457


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_04396  PF04647  24     0.029    Accessory gene regulator B
		>PF04647#Accessory gene regulator B

Length = 212

Score = 24.0 bits (52), Expect = 0.029
Identities = 7/29 (24%), Positives = 12/29 (41%)

Query: 5 ELLTYSVFGVLCALGVTAAIGYKIAAYLD 33
++ + GVL A+G+K D
Sbjct: 165 QIALAILLGVLWQTFTLTALGHKFIVGWD 193


63  PAKAF_04476  PAKAF_04481  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_04476  2       12       1.835620    hypothetical protein
PAKAF_04477  3       11       1.807823    NAD(P)-dependent oxidoreductase
PAKAF_04478  4       10       1.205175    SDS hydrolase SdsA1
PAKAF_04479  4       11       0.914092    probable transcriptional regulator
PAKAF_04480  2       12       -0.023602   hypothetical protein
PAKAF_04481  3       12       0.262921    DUF1145 domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04477  NUCEPIMERASE  33     8e-04    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 32.8 bits (75), Expect = 8e-04
Identities = 29/124 (23%), Positives = 45/124 (36%), Gaps = 21/124 (16%)

Query: 1 MKIALIGATGHVGHYFLNEALQRGHAV-----------TALVRDPSKLAARDGLCVAQAD 49
MK + GA G +G + L+ GH V +L + +L A+ G + D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 50 VSDPAQVASAVAGHE---VVISAFNGGWGSADLRARHA------AGSQAILDGVKRSGVP 100
++D + A V IS L HA G IL+G + + +
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLA-VRYSLENPHAYADSNLTGFLNILEGCRHNKIQ 119

Query: 101 RLLV 104
LL
Sbjct: 120 HLLY 123


64  PAKAF_04521  PAKAF_04547  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias     %GCBias  Product
PAKAF_04521       1       11    3.608110  probable type II secretion system protein
PAKAF_04522       0       11    3.045487  probable type II secretion system protein
PAKAF_04523      -1       12    2.800255  probable type II secretion system protein
PAKAF_04524       2       18    3.038679  probable type II secretion system protein
PAKAF_04525       1       15    3.385404  probable type II secretion system protein
PAKAF_04526       1       16    2.876951  HxcX atypical pseudopilin
PAKAF_04527       1       14    3.158381  HxcT pseudopilin
PAKAF_04528      -1       14    2.671338  HxcV putative pseudopilin
PAKAF_04529      -1       16    2.877198  hypothetical protein
PAKAF_04530      -1       15    2.392923  HxcU putative pseudopilin
PAKAF_04531       0       13    1.673753  HxcW putative pseudopilin
PAKAF_04532       0       14    0.947031  sigma factor regulator, VreR
PAKAF_04533       2       14    1.430199  ECF sigma factor, VreI
PAKAF_04534      -1       13    1.091398  VreA
PAKAF_04535      -1       14    0.571395  hypothetical protein
PAKAF_04536       3       14    0.431982  heme oxygenase
PAKAF_04537       2       12    1.201977  hypothetical protein
PAKAF_04538       2        9    1.591022  DNA polymerase Y family protein
PAKAF_04539       1        7    1.671625  probable DNA polymerase alpha chain
PAKAF_04542       2        8    2.123302  **exonuclease SbcD,Nuclease sbcCD subunit
PAKAF_04543       3        8    2.290235  putative exonuclease,Nuclease sbcCD subunit
PAKAF_04544       0        9    2.867419  exodeoxyribonuclease V subunit
PAKAF_04545       0        7    2.821282  exodeoxyribonuclease V subunit
PAKAF_04546       2        8    2.895960  exodeoxyribonuclease V subunit
PAKAF_04547       2        8    2.516953  putative lipoate-protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04521  BCTERIALGSPF    378  e-131    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 378 bits (972), Expect = e-131
Identities = 187/407 (45%), Positives = 252/407 (61%), Gaps = 5/407 (1%)

Query: 1 MQTFRYEAADAQGRIETGTLEADSQRGALGQLRARGLTPLEVREQAGGGTGQGAGALFAP 60
M + Y+A DAQG+ GT EADS R A LR RGL PL V E G G+ L
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 61 R---LSDGDLAWATRQLASLLAASLPLEAALSATLDQAERKHIAQTLSAVRSDVRGGMRL 117
R LS DLA TRQLA+L+AAS+PLE AL A Q+E+ H++Q ++AVRS V G L
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 118 ADALAARPRDFPEIYRALVAAGEESGDLAQVMERLADYIEERNALRGKILTAFIYPAVVG 177
ADA+ P F +Y A+VAAGE SG L V+ RLADY E+R +R +I A IYP V+
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 178 VVSIGIVIFLLGYVVPQVVSAFSQARQDLPALTRAMLQASDFVRAWG-WLCAGAIGGAYW 236
VV+I +V LL VVP+VV F +Q LP TR ++ SD VR +G W+ + G
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFM- 239

Query: 237 GWCLYLRDPQARLGWHRRVLRLPLLGRFVLGVNTARFASTLAILGSAGVPLLRALDAARQ 296
+ + LR + R+ +HRR+L LPL+GR G+NTAR+A TL+IL ++ VPLL+A+ +
Sbjct: 240 AFRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGD 299

Query: 297 TLANDCLAQAVEEATAQVREGVSLASALRTRQVFPPILTHLIASGEKTGALPPMLDRAAQ 356
++ND + AT VREGVSL AL +FPP++ H+IASGE++G L ML+RAA
Sbjct: 300 VMSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAAD 359

Query: 357 TLSRDIERRAMGMTALLEPLMIVVMGGVVLTIVMAVLMPIIEMNQLV 403
R+ + L EPL++V M VVL IV+A+L PI+++N L+
Sbjct: 360 NQDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04523  BCTERIALGSPD    256  4e-77    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 256 bits (655), Expect = 4e-77
Identities = 150/562 (26%), Positives = 256/562 (45%), Gaps = 32/562 (5%)

Query: 230 PGNNTVVVTDYAENLDRVAGIIASIDIPSASD---TDVVPIQNGIAVDIASTVSELLDSQ 286
NN V+ +++ A +AS P D T VVP+ N A D+A + +L D+
Sbjct: 94 NMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVVTRVVPLTNVAARDLAPLLRQLNDNA 153

Query: 287 GSGGAEQGQKTVVLADPRSNSIVIRSPSPERTQLARDLIGKLDSVQSNPGNLHVVYLRNA 346
G G +VV +P SN +++ + +L ++ ++D+ ++ V L A
Sbjct: 154 GVG-------SVVHYEP-SNVLLMTGRAAVIKRL-LTIVERVDNAGDR--SVVTVPLSWA 202

Query: 347 QATRLAQALRGLITGDSG--GEGNEGDQ--QRARLSGGGMLGGGNSGTG----SQGLGTS 398
A + + + L S G+ R + + G NS + L
Sbjct: 203 SAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSRQRIIAMIKQLDRQ 262

Query: 399 GNTTGSGSSGLGGSNRSGGAYGAMGSGQGGAGPGAMGEENSAFSAGGVTVQADATTNTLL 458
T G+ ++ + + A + ++A TN L+
Sbjct: 263 QATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPVAALDKNIIIKAHGQTNALI 322

Query: 459 ISAPEPLYRNLREVIDLLDQRRAQVVIESLIVEVSEDDSSEFGIQWQAGNLGGNGVFG-G 517
++A + +L VI LD RR QV++E++I EV + D GIQW N G G
Sbjct: 323 VTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTNSG 382

Query: 518 VNFGQSALNTAGKNTIDVLPKGLNIGLVDGTVDIPGIGKILDLKVLARALKSRGGTNVLS 577
+ + N + L L G + + +L AL S ++L+
Sbjct: 383 LPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQG-NWAMLLTALSSSTKNDILA 441

Query: 578 TPNLLTLDNESASIMVGQTIPFVSGQYVTDGGGTSNNPFQTIQREDVGLKLNIRPQISEG 637
TP+++TLDN A+ VGQ +P ++G T G N F T++R+ VG+KL ++PQI+EG
Sbjct: 442 TPSIVTLDNMEATFNVGQEVPVLTGSQTTSGD----NIFNTVERKTVGIKLKVKPQINEG 497

Query: 638 GTVKLDVYQEVSSVDERASTAA---GVVTNKRAIDTSILLDDGQIMVLGGLLQDNVQDNT 694
+V L++ QEVSSV + AS+ + G N R ++ ++L+ G+ +V+GGLL +V D
Sbjct: 498 DSVLLEIEQEVSSVADAASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTA 557

Query: 695 DGVPGLSSLPGVGSLFRYQKRSRTKTNLMVFLRPYIVRDAAAGRSITLNRYDFIRRAQ-Q 753
D VP L +P +G+LFR + +K NLM+F+RP ++RD R + +Y AQ +
Sbjct: 558 DKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYRQASSGQYTAFNDAQSK 617

Query: 754 RVQPRHDWSVGDMQAPVLPPAQ 775
+ ++ ++ + + P Q
Sbjct: 618 QRGKENNDAMLNQDLLEIYPRQ 639



Score = 159 bits (404), Expect = 6e-43
Identities = 72/276 (26%), Positives = 127/276 (46%), Gaps = 7/276 (2%)

Query: 87 VAPVSATAAELGEQPVSLNFVDTEVEAVVRALSRATGRQFLVDPRVKGKLTLVSEGQVPA 146
A + A E +F T+++ + +S+ + ++DP V+G +T+ S +
Sbjct: 17 FAALLFRPAAAEEFSA--SFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDMLNE 74

Query: 147 RTAYRMLTSALRMQGFSVVDVD-GVSQVVPEADAKLLGGPVYGADRPA-ANGMVTRTFRL 204
Y+ S L + GF+V++++ GV +VV DAK PV P + +VTR L
Sbjct: 75 EQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVVTRVVPL 134

Query: 205 RYENAVNLIPVLRPIVAQNNPINA--YPGNNTVVVTDYAENLDRVAGIIASIDIPSASDT 262
A +L P+LR + + Y +N +++T A + R+ I+ +D
Sbjct: 135 TNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGDRSV 194

Query: 263 DVVPIQNGIAVDIASTVSELLDSQGSGGAEQGQKTVVLADPRSNSIVIRSPSPERTQLAR 322
VP+ A D+ V+EL V+AD R+N++++ P Q
Sbjct: 195 VTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGE-PNSRQRII 253

Query: 323 DLIGKLDSVQSNPGNLHVVYLRNAQATRLAQALRGL 358
+I +LD Q+ GN V+YL+ A+A+ L + L G+
Sbjct: 254 AMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGI 289



Score = 50.3 bits (120), Expect = 2e-08
Identities = 24/154 (15%), Positives = 63/154 (40%), Gaps = 19/154 (12%)

Query: 194 ANGMVTRTFRLRYENAVNLIPVLRPI----------VAQNNPINAYPGNNTVVVTDYAEN 243
A T L + +A +++ ++ + + + A N V+V+ +
Sbjct: 189 AGDRSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNS 248

Query: 244 LDRVAGIIASIDIPSAS--DTDVVPIQNGIAVDIASTVSELL-----DSQGSGGAEQGQK 296
R+ +I +D A+ +T V+ ++ A D+ ++ + + Q + K
Sbjct: 249 RQRIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPVAALDK 308

Query: 297 TV-VLADPRSNSIVIRSPSPERTQLARDLIGKLD 329
+ + A ++N++++ + P+ +I +LD
Sbjct: 309 NIIIKAHGQTNALIVTAA-PDVMNDLERVIAQLD 341



Score = 44.1 bits (104), Expect = 2e-06
Identities = 20/84 (23%), Positives = 35/84 (41%), Gaps = 16/84 (19%)

Query: 190 DRPAANGMVTRTFRLRYENAVNLIPVLR----------------PIVAQNNPINAYPGNN 233
DR A T+ L+Y A +L+ VL + +N I A+ N
Sbjct: 260 DRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPVAALDKNIIIKAHGQTN 319

Query: 234 TVVVTDYAENLDRVAGIIASIDIP 257
++VT + ++ + +IA +DI
Sbjct: 320 ALIVTAAPDVMNDLERVIAQLDIR 343


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04527  BCTERIALGSPG    167  1e-56    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 167 bits (425), Expect = 1e-56
Identities = 63/142 (44%), Positives = 87/142 (61%), Gaps = 6/142 (4%)

Query: 11 KGHRGQRGFTLIEIMVVVVILGILAAMVVPKVLDRPDQARATAARQDISGLMQALKLYRL 70
+ QRGFTL+EIMVV+VI+G+LA++VVP ++ ++A A DI L AL +Y+L
Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKL 61

Query: 71 DQGRYPSQAQGLKVLAERP-ADASASNWRS--YLERLPNDPWGKPYQYLNPGVNGEIDVF 127
D YP+ QGL+ L E P A+N+ Y++RLP DPWG Y +NPG +G D+
Sbjct: 62 DNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLL 121

Query: 128 SLGADGQPGGEGINADIGSWQL 149
S G DG+ G E DI +W L
Sbjct: 122 SAGPDGEMGTED---DITNWGL 140


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04528  BCTERIALGSPG     30  0.001    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 30.2 bits (68), Expect = 0.001
Identities = 19/62 (30%), Positives = 34/62 (54%), Gaps = 3/62 (4%)

Query: 8 RGFTLIEVLVALAIVAIALAAAIRAVGLMTDGNGLLRDKSLA-LLAAESRLAELRLGVGT 66
RGFTL+E++V IV I + A++ LM + + K+++ ++A E+ L +L
Sbjct: 8 RGFTLLEIMV--VIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDNHH 65

Query: 67 AP 68
P
Sbjct: 66 YP 67


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04530  BCTERIALGSPH     34  8e-05    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 33.8 bits (77), Expect = 8e-05
Identities = 23/119 (19%), Positives = 37/119 (31%), Gaps = 7/119 (5%)

Query: 1 MVVLVIVGIATAAISLSARPDPTGLLRQDAARLARLLEIAQGEARVRGTPILWQPSAKGY 60
M++L+++G++ + L+ Q AR L Q G +
Sbjct: 12 MLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQFFGVSVHPDRW 71

Query: 61 RFSPQAYRGKTDAFAADTELRARDWQAAPLRVSVRPPRPVLLDAEWIGAPLRITLSDGQ 119
+F R D AD W PLR V G L + + G+
Sbjct: 72 QFLVLEARDGADPAPADDGWSGYRWL--PLR-----AGRVATSGSIAGGKLNLAFAQGE 123


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04531  BCTERIALGSPG     32  6e-04    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 32.2 bits (73), Expect = 6e-04
Identities = 18/60 (30%), Positives = 32/60 (53%), Gaps = 3/60 (5%)

Query: 12 RRQAGFTLIEVMVAIMLMAIV-SLMAWRGLDSIARASAHLEDSTEQGAALLRALNQLERD 70
+Q GFTL+E+MV I+++ ++ SL+ + + +A S AL AL+ + D
Sbjct: 5 DKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVS--DIVALENALDMYKLD 62


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04539  DHBDHDRGNASE     31  0.023    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 30.8 bits (69), Expect = 0.023
Identities = 30/122 (24%), Positives = 48/122 (39%), Gaps = 14/122 (11%)

Query: 520 VLALGMLSALRRSFDLIHALRGGKRLSIASIPSEDPATYEMISRADTIGVFQIESRAQMA 579
V + G+ +A R + R G +++ S P+ P T ++ + S+A
Sbjct: 115 VNSTGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRT--------SMAAYA-SSKAAAV 165

Query: 580 MLPRLRPQKFYDLVIQVAIVRPGPIQGDMVHPYLRRRNGEEPVAYPSAELEKVFERTLGV 639
M + + + I+ IV PG + DM NG E V S E K G+
Sbjct: 166 MFTKCLGLELAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFK-----TGI 220

Query: 640 PL 641
PL
Sbjct: 221 PL 222


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04543  RTXTOXIND        45  2e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 44.8 bits (106), Expect = 2e-06
Identities = 39/231 (16%), Positives = 75/231 (32%), Gaps = 27/231 (11%)

Query: 628 DQEQVRAEQALERLRQTLVGLREGYSSQRERLNQSRQEQQELTGQLAALDR-QLDQWTLP 686
+ E VR L +L G + L Q+R EQ +++ +L + LP
Sbjct: 114 EGESVRKGDVLLKLTAL--GAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLP 171

Query: 687 EELRLLQPSAQLEWLAQRLDDLAGQRQQCQRDFDRLIARQRQTQQLQQELRAAETILQQR 746
+E S + L ++ Q Q Q + L +
Sbjct: 172 DEPYFQNVSEEEVLRLTSLIK------------EQFSTWQNQKYQKELNLDK----KRAE 215

Query: 747 QQALTEQRQRYEHLQQQVEEDSQQLRPLLSDEHWQRWQADPLRTFQALGESIEQRRQQQA 806
+ + + RYE+L + + LL + + L E++ + R ++
Sbjct: 216 RLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAV--LEQENKYVEAVNELRVYKS 273

Query: 807 RLQQIEQRLQELKQRCDESSWQLKQSDEQRNEARQAEERAQAELAELNGRL 857
+L+QIE + K+ + +NE + + L L
Sbjct: 274 QLEQIESEILSAKEE------YQLVTQLFKNEILDKLRQTTDNIGLLTLEL 318



Score = 37.1 bits (86), Expect = 4e-04
Identities = 24/178 (13%), Positives = 59/178 (33%), Gaps = 13/178 (7%)

Query: 878 AQAAQSAVETLQAPLDSLREEQLRLAEALEHLQQQRQRQQDEFQRLQADWQAWRERQDNL 937
Q ++E + P L +E + E + + +++F Q ++ Q L
Sbjct: 153 YQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQN-----QKYQKEL 207

Query: 938 DDSRLDALLGLSEEQATQWREQLQRLQEEITRQQTLEAER---QAQLLQHRRQRPETDRE 994
+ + A + ++ + + + +L ++ + +L+ + E E
Sbjct: 208 NLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNE 267

Query: 995 -----ALEDNLRQQRERLAASEQAYLDTYSQLQADNQRREQSQALLAELERARAEFRR 1047
+ + + + Q + D R+ L LE A+ E R+
Sbjct: 268 LRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQ 325



Score = 36.3 bits (84), Expect = 7e-04
Identities = 24/164 (14%), Positives = 63/164 (38%), Gaps = 11/164 (6%)

Query: 881 AQSAVETLQAPLDSLREEQLRLAEALEHLQQQRQRQQDEFQRLQADWQAWRERQDNLDDS 940
A++ Q+ L R EQ R R + ++ L+ + + + +
Sbjct: 132 AEADTLKTQSSLLQARLEQTRYQILS------RSIELNKLPELKLPDEPYFQNVSEEEVL 185

Query: 941 RLDALLGLSEEQATQWREQLQRLQEEITRQQTLEAERQAQLLQHRRQRPETDREALEDNL 1000
RL +L+ +EQ + W+ Q + + + +++ A++ ++ ++ L+D
Sbjct: 186 RLTSLI---KEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLS-RVEKSRLDD-F 240

Query: 1001 RQQRERLAASEQAYLDTYSQLQADNQRREQSQALLAELERARAE 1044
+ A ++ A L+ ++ ++ L ++E
Sbjct: 241 SSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILS 284



Score = 35.2 bits (81), Expect = 0.001
Identities = 19/166 (11%), Positives = 56/166 (33%), Gaps = 33/166 (19%)

Query: 656 RERLNQSRQEQQELTGQLAALDRQLDQWTLPEELRLLQPSAQLEWLAQRLDDLAGQRQQC 715
+E+ + + ++ + L + T+ + + RLDD + +
Sbjct: 192 KEQFSTWQNQKYQKELNLDKKRAERL--TVLARINRYE--NLSRVEKSRLDDFSSLLHKQ 247

Query: 716 QRDFDRLIARQRQTQQLQQELRAAETILQQRQQALTEQRQRYEHLQQQVEEDSQQLRPLL 775
++ ++ + + ELR ++ L+Q + + ++ Y+ + Q +
Sbjct: 248 AIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKN--------- 298

Query: 776 SDEHWQRWQADPLRTFQALGESIEQRRQQQARLQQIEQRLQELKQR 821
E +++ RQ + + L + ++R
Sbjct: 299 --------------------EILDKLRQTTDNIGLLTLELAKNEER 324



Score = 33.6 bits (77), Expect = 0.006
Identities = 27/214 (12%), Positives = 57/214 (26%), Gaps = 10/214 (4%)

Query: 253 QALQRLEGQQQWFTEEQRLLQSCEHAQGQLAEARQAWDALATERETLQWLERLAPVRGLI 312
L +L E L Q +L + R + + E L L+
Sbjct: 122 DVLLKLTALG---AEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQN 178

Query: 313 ERLKQLEQELRHSEQQQRQRTEQQAAGAERLQGLQARLQEARERQAQADNHLRQAQAPLR 372
+++ + ++Q Q+ L +A R N +
Sbjct: 179 VSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARI----NRYENLSRVEK 234

Query: 373 EAFQLESEARRLERTLAERQELHRQSNQRHAQQSDAARQL-DMEQQRHVAEQAQLQAALR 431
+L+ + L + + + Q N+ ++ +EQ A+ + L
Sbjct: 235 S--RLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLV 292

Query: 432 DSQALAALGDAWATHQGQLATFVQRRQRALESQA 465
+ D + + E Q
Sbjct: 293 TQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQ 326



Score = 30.6 bits (69), Expect = 0.045
Identities = 30/210 (14%), Positives = 59/210 (28%), Gaps = 14/210 (6%)

Query: 120 ADGALQKSQQSLQDLETQQMLAANKKSEFREQLEQKL-------GLNFAQFTRAVLLAQS 172
AD +S LE + ++ E + E KL ++ + R L +
Sbjct: 134 ADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKE 193

Query: 173 EFSAFLKASDNDRGALLEKLTDTGLYSQLSKAAYQRASQADEQRKQLEQ-RLEGSLPL-- 229
+FS + L +K + + + + ++
Sbjct: 194 QFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHA 253

Query: 230 ---AEQARAGLEAALESHAQARLQEQQALQRLEGQQQWFTEEQRLLQSCEHAQGQLAEAR 286
E L + Q + + + + Q T+ + + Q
Sbjct: 254 VLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTD-NIG 312

Query: 287 QAWDALATERETLQWLERLAPVRGLIERLK 316
LA E Q APV +++LK
Sbjct: 313 LLTLELAKNEERQQASVIRAPVSVKVQQLK 342


65  PAKAF_04686  PAKAF_04700  Y  N  N  Genomic Island
LocusTag     DNBias  CDNBias     %GCBias  Product
PAKAF_04686       2       18   -1.433234  cell division protein FtsL
PAKAF_04687       2       16   -1.483637  16S rRNA (cytosine(1402)-N(4))-methyltransferase
PAKAF_04688       2       15   -1.562521  transcriptional regulator MraZ
PAKAF_04690       1       14   -1.387607  methyltransferase
PAKAF_04691       1       15   -1.892166  penicillin-binding protein activator
PAKAF_04692       1       17   -4.260818  YraN family protein
PAKAF_04693       1       13   -5.106542  sedoheptulose 7-phosphate isomerase GmhA
PAKAF_04694       2       14   -4.963480  BON domain-containing protein
PAKAF_04695       2       19   -5.376638  stringent starvation protein B
PAKAF_04696       2       19   -5.772440  stringent starvation protein A
PAKAF_04697       1       15   -4.677245  probable cytochrome c1 precursor
PAKAF_04698       0       13   -4.255068  probable cytochrome b
PAKAF_04699      -1       12   -3.100479  probable iron-sulfur protein
PAKAF_04700       0       11   -3.057142  30S ribosomal protein S9
66  PAKAF_04711  PAKAF_04736  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias     %GCBias  Product
PAKAF_04711       2       10   -1.293738  ATP sulfurylase small subunit
PAKAF_04712       2       10   -0.609684  soluble and membrane-bound lytic
PAKAF_04713       4       12   -0.619317  Nif3-like dinuclear metal center hexameric
PAKAF_04714       4       12   -0.888816  AlgW protein
PAKAF_04715       3       12   -1.065602  histidinol-phosphate aminotransferase
PAKAF_04716       2       13   -1.492378  histidinol dehydrogenase
PAKAF_04717      -1       14   -2.852572  ATP-phosphoribosyltransferase
PAKAF_04718      -1       12   -3.199687  UDP-N-acetylglucosamine
PAKAF_04719       0       12   -3.594361  BolA family transcriptional regulator
PAKAF_04721       0       12   -3.778016  STAS domain-containing protein
PAKAF_04722       2       12   -3.824201  phospholipid-binding protein MlaC
PAKAF_04723       3       14   -2.913502  outer membrane lipid asymmetry maintenance
PAKAF_04724       3       14   -3.417871  probable permease of ABC transporter
PAKAF_04725       3       16   -3.564119  probable ATP-binding component of ABC
PAKAF_04726       3       17   -3.868940  hypothetical protein
PAKAF_04727       0       17   -3.968025  arabinose-5-phosphate isomerase KdsD
PAKAF_04728       1       20   -4.942004  HAD family hydrolase
PAKAF_04729       0       19   -5.367773  LPS export ABC transporter periplasmic protein
PAKAF_04730       0       18   -4.810676  lipopolysaccharide transport periplasmic protein
PAKAF_04731       0       18   -4.794808  probable ATP-binding component of ABC
PAKAF_04732       0       16   -3.406651  RNA polymerase sigma-54 factor
PAKAF_04733      -1       14   -2.504732  ribosome-associated translation inhibitor RaiA
PAKAF_04734      -1       13   -1.641579  nitrogen regulatory IIA protein
PAKAF_04735       1       12   -1.471300  RNase adapter RapZ
PAKAF_04736       2       13   -0.980327  probable phosphoryl carrier protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04711  TCRTETOQM        28  0.046    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 28.3 bits (63), Expect = 0.046
Identities = 17/90 (18%), Positives = 33/90 (36%), Gaps = 14/90 (15%)

Query: 94 GVAQG-INPFTHGSAKHTDVMKTEGLKQALDKYGFDAAFGGARRDEEKSRAKERVYSFRD 152
+ P HGSAK G+ ++ F + + +++ F
Sbjct: 207 RFHNCSLFPVYHGSAK-----NNIGIDNLIE--VITNKFYSS---THRGQSELCGKVF-- 254

Query: 153 SKHRWDPKNQRPELWNIYNGKVKKGESIRV 182
K + K QR +Y+G + +S+R+
Sbjct: 255 -KIEYSEKRQRLAYIRLYSGVLHLRDSVRI 283


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04714  V8PROTEASE       61  2e-12    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 60.8 bits (147), Expect = 2e-12
Identities = 33/163 (20%), Positives = 52/163 (31%), Gaps = 35/163 (21%)

Query: 118 LLTNNHVTAGADQIIVALR------------DGRETIAQLVGSDPETDLAVLKIDL---- 161
LLTN HV AL+ +G T Q+ E DLA++K
Sbjct: 114 LLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEGDLAIVKFSPNEQN 173

Query: 162 ----KNLPAMTLGRSDGIRTGDVCLAIGNPFGVGQTVTMGIISATGRNQLGLNTYEDFIQ 217
+ + T+ + + G P TM + G+ L +Q
Sbjct: 174 KHIGEVVKPATMSNNAETQVNQNITVTGYPGD-KPVATMW--ESKGK-ITYLKGE--AMQ 227

Query: 218 TDAAINPGNSGGALVDAAGNLIGINTAIFSKSGGSQGIGFAIP 260
D + GNSG + + +IGI+ G+
Sbjct: 228 YDLSTTGGNSGSPVFNEKNEVIGIHWG---------GVPNEFN 261


67  PAKAF_04773  PAKAF_04801  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias     %GCBias  Product
PAKAF_04773       2       12    1.420218  probable permease of ABC transporter
PAKAF_04774       2        9    2.573299  probable permease of ABC transporter
PAKAF_04775       0        8    2.962780  probable ATP-binding component of ABC
PAKAF_04776      -1        9    3.062723  probable ATP-binding component of ABC dipeptide
PAKAF_05938      -1       12    1.743928  hypothetical protein
PAKAF_04779       0        9    1.847366  probable transcriptional regulator
PAKAF_04780       0       10   -0.043359  biotin-dependent carboxyltransferase family
PAKAF_04781      -1       10   -0.664101  5-oxoprolinase subunit PxpB
PAKAF_04782       0        9   -0.614769  LamB/YcsF family protein
PAKAF_04783       0        8   -1.452701  lipopolysaccharide biosynthetic protein LpxO1
PAKAF_04784       0        8   -1.058365  probable oxidoreductase
PAKAF_04785       1        9   -3.189432  probable outer membrane receptor for iron
PAKAF_04786      -2        7   -2.026106  PKHD-type hydroxylase PLES_48951
PAKAF_04787      -2        7   -1.567188  sel1 repeat family protein
PAKAF_04788      -2        8   -2.350818  phosphoethanolamine transferase CptA
PAKAF_04789      -1       10   -1.743075  VOC family protein
PAKAF_04790      -1        8   -1.523245  ornithine decarboxylase
PAKAF_04791      -2       14   -1.886629  probable chemotaxis transducer
PAKAF_04792      -1       15   -3.408344  membrane protein
PAKAF_04793       1       19   -5.814121  beta-lactamase expression regulator AmpD
PAKAF_04794       0       18   -5.556652  DUF1631 domain-containing protein
PAKAF_04795       0       22   -6.650640  nicotinate-nucleotide pyrophosphorylase
PAKAF_05939       0       24   -7.592283  *type 4 fimbrial precursor PilA
PAKAF_04797       1       19   -6.354892  type 4 fimbrial biogenesis protein PilB
PAKAF_04798       2       20   -5.548100  type 4 fimbrial biogenesis protein PilC
PAKAF_04799       2       14   -1.263507  type 4 prepilin peptidase PilD
PAKAF_04800       0       17   -0.936381  dephosphocoenzyme A kinase
PAKAF_04801       2       17   -1.135914  DNA gyrase inhibitor YacG
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04782  PHPHTRNFRASE     30  0.012    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 29.8 bits (67), Expect = 0.012
Identities = 16/64 (25%), Positives = 27/64 (42%), Gaps = 11/64 (17%)

Query: 40 CGFHAGDPLTMRRAVELAVR----HGVSIG------AHPAYPDLSGFGRRSLAC-SAEEV 88
CG AGD + + + L + SI + +L F +++L +AEEV
Sbjct: 503 CGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQLLKLSKEELKPFAQKALMLDTAEEV 562

Query: 89 HAMV 92
+V
Sbjct: 563 EQLV 566


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04788  SECYTRNLCASE     32  0.004    Preprotein translocase SecY subunit signature.
		>SECYTRNLCASE#Preprotein translocase SecY subunit signature.

Length = 437

Score = 32.4 bits (74), Expect = 0.004
Identities = 15/63 (23%), Positives = 25/63 (39%)

Query: 64 SEYFSQYFNPWMTLGLVLYSLVAILLWRRLRPVYLPRFSALPVAVLLIVATIGYPFYKQL 123
+EY S N G + L+A++ L + +LI+ +G KQ+
Sbjct: 364 AEYLSYVLNRITWPGSLYLGLIALVPTMALVGFGASQNFPFGGTSILIIVGVGLETVKQI 423

Query: 124 VSQ 126
SQ
Sbjct: 424 ESQ 426


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04795  RTXTOXIND        29  0.021    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.0 bits (65), Expect = 0.021
Identities = 23/141 (16%), Positives = 45/141 (31%), Gaps = 4/141 (2%)

Query: 75 QVEDGQRVEPNQMLFQLKGP-ARALLTGERSALNFLQLLSGTATRSQHYADLVAGTAVKL 133
V++G+ V +L +L A A +S+L +L TR Q + + +
Sbjct: 111 IVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARL---EQTRYQILSRSIELNKLPE 167

Query: 134 LDTRKTLPGLRLAQKYAVTCGGCHNHRIGLYDAFLIKENHIAACGGIDRAIAEARRIAPG 193
L ++++ + + + ++ +R AR
Sbjct: 168 LKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYE 227

Query: 194 KPVEVEVENLDELRQALEAGA 214
VE LD+ L A
Sbjct: 228 NLSRVEKSRLDDFSSLLHKQA 248


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_05939  BCTERIALGSPG     47  1e-09    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 47.2 bits (112), Expect = 1e-09
Identities = 16/54 (29%), Positives = 34/54 (62%)

Query: 1 MKAQKGFTLIELMIVVAIIGILAAIAIPQYQNYVARSEGASALASVNPLKTTVE 54
Q+GFTL+E+M+V+ IIG+LA++ +P +++ A++ + L+ ++
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALD 57


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04798  BCTERIALGSPF    466  e-167    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 466 bits (1202), Expect = e-167
Identities = 119/382 (31%), Positives = 209/382 (54%), Gaps = 14/382 (3%)

Query: 3 VKAHLRKQGINPLKVR-------KKGISLLGA--GKKVKPMDIALFTRQMATMMGAGVPL 53
+ LR++G+ PL V K G + L ++ D+AL TRQ+AT++ A +PL
Sbjct: 28 ARQLLRERGLVPLSVDENRGDQQKSGSTGLSLRRKIRLSTSDLALLTRQLATLVAASMPL 87

Query: 54 LQSFDIIGEGFDNPNMRKLVDEIKQEVSSGNSLANSLRKKPQYFDELYCNLVDAGEQSGA 113
++ D + + + P++ +L+ ++ +V G+SLA++++ P F+ LYC +V AGE SG
Sbjct: 88 EEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLADAMKCFPGSFERLYCAMVAAGETSGH 147

Query: 114 LENLLDRVATYKEKTESLKAKIKKAMTYPIAVIIVALIVSAILLIKVVPQFQSVFEGFGA 173
L+ +L+R+A Y E+ + ++++I++AM YP + +VA+ V +ILL VVP+ F
Sbjct: 148 LDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLTVVAIAVVSILLSVVVPKVVEQFIHMKQ 207

Query: 174 ELPAFTQMIVNLSEFMQEW--WFFIILAIAIFGFAFKELHKRSQKFRDTLDRTILKLPIF 231
LP T++++ +S+ ++ + W + L F + R +K R + R +L LP+
Sbjct: 208 ALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMAF---RVMLRQEKRRVSFHRRLLHLPLI 264

Query: 232 GGIVYKSAVARYARTLSTTFAAGVPLVDALDSVSGATGNIVFKNAVSKIKQDVSTGMQLN 291
G I ARYARTLS A+ VPL+ A+ N ++ +S V G+ L+
Sbjct: 265 GRIARGLNTARYARTLSILNASAVPLLQAMRISGDVMSNDYARHRLSLATDAVREGVSLH 324

Query: 292 FSMRTTSVFPNMAIQMTAIGEESGSLDEMLSKVASYYEEEVDNAVDNLTTLMEPMIMAVL 351
++ T++FP M M A GE SG LD ML + A + E + + L EP+++ +
Sbjct: 325 KALEQTALFPPMMRHMIASGERSGELDSMLERAADNQDREFSSQMTLALGLFEPLLVVSM 384

Query: 352 GVLVGGLIVAMYLPIFQLGNVV 373
+V +++A+ PI QL ++
Sbjct: 385 AAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04799  PREPILNPTASE    354  e-125    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 354 bits (909), Expect = e-125
Identities = 165/283 (58%), Positives = 195/283 (68%), Gaps = 1/283 (0%)

Query: 3 LLDYLASHPLAFVLCAILLGLLVGSFLNVVVHRLPKMMERNWKAEAREALGLEPE-PKQA 61
LL+ P + L L++GSFLNVV+HRLP M+ER W+AE R + E +
Sbjct: 4 LLELAHGLPWLYFSLVFLFSLMIGSFLNVVIHRLPIMLEREWQAEYRSYFNPDDEGVDEP 63

Query: 62 TYNLVLPNSACPRCGHEIRPWENIPLVSYLALGGKCSSCKAAIGKRYPLVELATALLSGY 121
YNL++P S CP C H I ENIPL+S+L L G+C C+A I RYPLVEL TALLS
Sbjct: 64 PYNLMVPRSCCPHCNHPITALENIPLLSWLWLRGRCRGCQAPISARYPLVELLTALLSVA 123

Query: 122 VAWHFGFTWQAGAMLLLTWGLLAMSLIDADHQLLPDVLVLPLLWLGLIANHFGLFASLDD 181
VA W A LLLTW L+A++ ID D LLPD L LPLLW GL+ N G F SL D
Sbjct: 124 VAMTLAPGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGD 183

Query: 182 ALFGAVFGYLSLWSVFWLFKLVTGKEGMGYGDFKLLAMLGAWGGWQILPLTILLSSLVGA 241
A+ GA+ GYL LWS++W FKL+TGKEGMGYGDFKLLA LGAW GWQ LP+ +LLSSLVGA
Sbjct: 184 AVIGAMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGA 243

Query: 242 ILGVIMLRLRNAESGTPIPFGPYLAIAGWIALLWGDQITRTYL 284
+G+ ++ LRN PIPFGPYLAIAGWIALLWGD ITR YL
Sbjct: 244 FMGIGLILLRNHHQSKPIPFGPYLAIAGWIALLWGDSITRWYL 286


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04800  DHBDHDRGNASE     30  0.005    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 30.0 bits (67), Expect = 0.005
Identities = 23/88 (26%), Positives = 32/88 (36%), Gaps = 11/88 (12%)

Query: 5 WILGLTGGIGSGKSAAAEHFISLGVHLVDADHAARW--VVEPGRPALAKIVERFGDGILL 62
+I G GIG A A S G H+ D+ V A A+ E F
Sbjct: 12 FITGAAQGIGE---AVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF------ 62

Query: 63 PDGQLDRAALRERIFQAPEERRWLEQLL 90
P D AA+ E + E ++ L+
Sbjct: 63 PADVRDSAAIDEITARIEREMGPIDILV 90


68  PAKAF_04823  PAKAF_04829  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias     %GCBias  Product
PAKAF_04823      -1       17   -3.116715  probable D-amino acid oxidase
PAKAF_04824       1       12   -3.756730  type 4 fimbrial biogenesis protein FimT
PAKAF_04825       1       12   -3.891266  type 4 fimbrial biogenesis protein FimU
PAKAF_04826       2       11   -3.808838  type 4 fimbrial biogenesis protein PilV
PAKAF_04827       2       11   -3.637635  type 4 fimbrial biogenesis protein PilW
PAKAF_04828       2       10   -3.454617  type 4 fimbrial biogenesis protein PilX
PAKAF_04829       1       10   -3.591139  type 4 fimbrial biogenesis protein PilY1
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04824  BCTERIALGSPG     33  2e-04    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 32.9 bits (75), Expect = 2e-04
Identities = 12/47 (25%), Positives = 26/47 (55%)

Query: 4 RSQRALTLTELLFALVLLGILGSLALPGMAAWLDGNRQRSVLHELSA 50
QR TL E++ +V++G+L SL +P + + ++ + ++ A
Sbjct: 5 DKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVA 51


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04825  BCTERIALGSPG     41  5e-07    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 40.6 bits (95), Expect = 5e-07
Identities = 14/45 (31%), Positives = 30/45 (66%)

Query: 8 TGFTLIELLIIVVLLAIMASFAIPNFKQLTERNELQSAAEELNAM 52
GFTL+E+++++V++ ++AS +PN E+ + Q A ++ A+
Sbjct: 8 RGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVAL 52


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04826  PilS_PF08805     30  0.003    PilS N terminal
		>PilS_PF08805#PilS N terminal

Length = 185

Score = 30.3 bits (68), Expect = 0.003
Identities = 11/58 (18%), Positives = 24/58 (41%)

Query: 3 LKSRHRSLHQSGFSMIEVLVALLLISIGVLGMIAMQGKTIQYTADSVERNKAAMLGSN 60
L +R + G +++EVL+ + +I + + S E+N + +N
Sbjct: 16 LSARRKKEQDKGATLMEVLLVVGVIVVLAASAYKLYSMVQSNIQSSNEQNNVLTVIAN 73


69  PAKAF_04923  PAKAF_04930  Y  N  N  Genomic Island
LocusTag     DNBias  CDNBias     %GCBias  Product
PAKAF_04923       3        9   -1.013297  hypothetical protein
PAKAF_04924       4       11   -0.881728  cupin fold metalloprotein, WbuC family
PAKAF_04925       4       13   -1.140826  probable purine/pyrimidine phosphoribosyl
PAKAF_04926       4       11    0.053269  uracil phosphoribosyltransferase
PAKAF_04927       4       14    0.540227  uracil permease
PAKAF_04928       7       17    0.815495  Pilin subunit CupE1
PAKAF_04929       4       14    1.124316  Pilin subunit CupE2
PAKAF_04930       3       12    1.166954  Pilin subunit CupE3
70  PAKAF_04948  PAKAF_04970  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias     %GCBias  Product
PAKAF_04948       2       19   -3.727156  lipoprotein localization protein LolB
PAKAF_04949       3       20   -5.253726  isopentenyl monophosphate kinase
PAKAF_04950       2       18   -5.641115  ribose-phosphate pyrophosphokinase
PAKAF_04951       1       20   -5.055510  *probable ribosomal protein L25
PAKAF_04952       0       19   -4.780230  peptidyl-tRNA hydrolase
PAKAF_04953       1       23   -5.229138  redox-regulated ATPase YchF
PAKAF_04955       1       28   -5.442629  *integrase,Tyrosine recombinase
PAKAF_04957       1       28   -4.990697  hypothetical protein from bacteriophage Pf1
PAKAF_04959       3       31   -4.159506  probable coat protein A of bacteriophage Pf1
PAKAF_04960       4       25   -4.271850  DUF2523 domain-containing protein
PAKAF_04961       7       30   -3.897624  coat protein A of bacteriophage Pf1
PAKAF_04962       5       23   -3.123752  coat protein B of bacteriophage Pf1
PAKAF_04963       9       29   -5.327528  hypothetical protein of bacteriophage Pf1
PAKAF_04964       9       56  -15.052287  hypothetical protein of bacteriophage Pf1
PAKAF_04965       9       69  -18.852234  helix destabilizing protein of bacteriophage
PAKAF_04966       6       79  -20.945084  hypothetical protein of bacteriophage Pf1
PAKAF_04967       5       82  -21.914811  hypothetical protein
PAKAF_04968       5       46  -13.755062  helix-turn-helix domain-containing protein
PAKAF_04969       3       39  -12.235854  Uncharacterized conserved protein,Protein of
PAKAF_04970       1       20   -5.575719  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04961  cloacin          46  1e-07    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 46.2 bits (109), Expect = 1e-07
Identities = 22/56 (39%), Positives = 28/56 (50%)

Query: 227 GGDGGGDGNGGGNNNGGGNDGGTGNGDGSGGGDGNGAGDGSGDGDGSGTGGDGNGT 282
GG G GG ++ G + G GSG G G G G G+G G+G G G+GT
Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGT 77



Score = 41.2 bits (96), Expect = 6e-06
Identities = 29/65 (44%), Positives = 35/65 (53%), Gaps = 3/65 (4%)

Query: 223 PTTPGGDGGG-DGNG-GGNNNGGGNDGGTGNGDGSGGGDGNGAGDGSGDGDGSGTGGDGN 280
PT G GG DG+G NN G G+G G G G GNG G+G+ G GSGTGG+ +
Sbjct: 24 PTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNS-GGGSGTGGNLS 82

Query: 281 GTCDP 285
P
Sbjct: 83 AVAAP 87



Score = 40.5 bits (94), Expect = 9e-06
Identities = 23/66 (34%), Positives = 30/66 (45%)

Query: 234 GNGGGNNNGGGNDGGTGNGDGSGGGDGNGAGDGSGDGDGSGTGGDGNGTCDPAKENCSTG 293
G+G G+N G + G NG +G G G GA DGSG + G G+G+ G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 294 PEGPGG 299
G G
Sbjct: 64 NGGGNG 69



Score = 40.1 bits (93), Expect = 1e-05
Identities = 26/76 (34%), Positives = 32/76 (42%), Gaps = 3/76 (3%)

Query: 227 GGDGGGDGNGGGNNNGGGNDGGTGNGDGSGGGDGNGAGDGS---GDGDGSGTGGDGNGTC 283
GGDG G G + +G N G TG G G G DG+G + G G GSG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 284 DPAKENCSTGPEGPGG 299
N ++G G
Sbjct: 63 GNGGGNGNSGGGSGTG 78


71  PAKAF_04986  PAKAF_04994  Y  N  Y  Genomic Island
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_04986  -1      11       -3.695002   PqiB family protein
PAKAF_04988  0       16       -5.117369   paraquat-inducible protein,Inner membrane
PAKAF_04991  2       15       -4.964253   **protein-methionine-sulfoxide reductase
PAKAF_04992  3       14       -4.785480   protein-methionine-sulfoxide reductase catalytic
PAKAF_04993  2       14       -4.437360   phosphatidylserine synthase
PAKAF_04994  1       14       -3.260219   ketol-acid reductoisomerase
72  PAKAF_05110  PAKAF_05126  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_05110  -1      14       3.277146    *class I SAM-dependent methyltransferase
PAKAF_05111  -2      13       3.392185    probable amino acid permease
PAKAF_05112  -1      14       3.343372    probable class III aminotransferase
PAKAF_05113  -1      15       3.377484    probable transcriptional regulator
PAKAF_05114  -2      27       2.032546    selenocysteine-specific elongation factor
PAKAF_05115  0       24       0.448830    L-seryl-tRNA(ser) selenium transferase
PAKAF_05116  0       23       0.482364    FdhE protein
PAKAF_05117  0       24       0.159165    nitrate-inducible formate dehydrogenase, gamma
PAKAF_05118  -1      22       1.252537    nitrate-inducible formate dehydrogenase, beta
PAKAF_05120  -2      8        3.651247    lipase LipC
PAKAF_05121  0       9        3.806907    2,4-dienoyl-CoA reductase FadH2
PAKAF_05122  0       9        3.542976    DMT family transporter
PAKAF_05123  1       9        4.017009    DUF1835 domain-containing protein
PAKAF_05124  3       7        3.213211    hypothetical protein
PAKAF_05125  5       6        3.317657    phospholipid carrier-dependent
PAKAF_05126  3       7        3.335811    probable glycosyl transferase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_05114  TCRTETOQM  57  4e-10  Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.
Length = 639

Score = 56.8 bits (137), Expect = 4e-10
Identities = 42/143 (29%), Positives = 67/143 (46%), Gaps = 20/143 (13%)

Query: 491 VGTAGHIDHGKTSLLRAL---TGI--------EGDRRP----AERQRGITIDLGYLYADL 535
+G H+D GKT+L +L +G +G R ERQRGITI G
Sbjct: 6 IGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQW 65

Query: 536 GDGSPTGFIDVPGHERFVHNMLAGASGIDCVLLVVAADDGLMPQTREHLAIVELLGIRRA 595
+ + ID PGH F+ + S +D +L+++A DG+ QTR + +GI
Sbjct: 66 -ENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPT- 123

Query: 596 LVALTKIDR--VEPQRV-QQVRT 615
+ + KID+ ++ V Q ++
Sbjct: 124 IFFINKIDQNGIDLSTVYQDIKE 146


73  PAKAF_05194  PAKAF_05219  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_05194  2       10       1.436765    cytochrome b/b6 domain-containing protein
PAKAF_05195  2       10       2.555408    hypothetical protein
PAKAF_05196  2       11       2.536171    two-component response regulator
PAKAF_05197  2       11       3.150659    probable two-component sensor
PAKAF_05198  2       11       2.945763    probable major facilitator superfamily (MFS)
PAKAF_05199  0       11       3.344609    acyl-CoA delta-9-desaturase, DesB
PAKAF_05200  2       14       4.873796    probable oxidoreductase
PAKAF_05201  1       12       4.822323    DesT
PAKAF_05202  1       12       4.404172    urease accessory protein UreE
PAKAF_05203  -1      9        2.115366    urease accessory protein UreF
PAKAF_05204  0       8        1.474247    urease accessory protein UreG
PAKAF_05205  -1      8        1.848629    protein hupE
PAKAF_05206  -1      8        1.755842    probable transmembrane sensor
PAKAF_05207  0       8        1.538102    probable sigma-70 factor, ECF subfamily
PAKAF_05208  0       9        1.623834    TonB-dependent receptor
PAKAF_05209  -1      11       2.936150    vanillate porin OpdK
PAKAF_05210  -1      11       3.424436    probable aldehyde dehydrogenase
PAKAF_05211  -1      9        3.557439    probable major facilitator superfamily (MFS)
PAKAF_05212  -1      9        3.367508    benzoylformate decarboxylase
PAKAF_05213  -1      10       2.821263    probable transcriptional regulator
PAKAF_05214  0       10       3.374863    probable major facilitator superfamily (MFS)
PAKAF_05215  1       10       2.227896    vanillate O-demethylase oxygenase subunit
PAKAF_05216  0       10       1.515982    vanillate O-demethylase oxidoreductase
PAKAF_05217  1       12       0.832192    probable transcriptional regulator
PAKAF_05218  2       14       -0.142610   probable short-chain dehydrogenase
PAKAF_05219  2       15       -0.567113   ornithine cyclodeaminase family protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_05196  HTHFIS  76  4e-18  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 76.0 bits (187), Expect = 4e-18
Identities = 42/156 (26%), Positives = 74/156 (47%), Gaps = 6/156 (3%)

Query: 2 RILVIEDDTKTGEYLKKGLGESGYAVDWSQHGADGLYLALENRYDLVVLDVMLPGLDGWQ 61
ILV +DD L + L +GY V + + A DLVV DV++P + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 IMEVLRK-KHDVPVLFLTARDQLQDRIRGLELGADDYLVKPFSFTELLLRIRTLLRRGVV 120
++ ++K + D+PVL ++A++ I+ E GA DYL KPF TEL+ I L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 121 REAEQVQLADLQLDVLR-----RKVSRQGQVIALTN 151
R ++ + + ++ +++ R + T+
Sbjct: 125 RPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTD 160


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_05198  TCRTETA  33  0.002  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 32.9 bits (75), Expect = 0.002
Identities = 70/330 (21%), Positives = 115/330 (34%), Gaps = 37/330 (11%)

Query: 44 GGLMASYYFGLVCGGKFGHKLIASFGHIRSYVACAGI--ATVTVLLHALVDQLEVWLLLR 101
G L+A Y L FG R V + A V + A L V + R
Sbjct: 46 GILLALYALMQFACAPVLGALSDRFG--RRPVLLVSLAGAAVDYAIMATAPFLWVLYIGR 103

Query: 102 F---ITGAVMMNQYMVIESWLNEQAESHQRGKVFAGYMVA-VDLGLVLGQ---GLLA-LS 153
ITGA V +++ + + +R + F G+M A G+V G GL+ S
Sbjct: 104 IVAGITGATGA----VAGAYIADITDGDERARHF-GFMSACFGFGMVAGPVLGGLMGGFS 158

Query: 154 PTLDY---KPLLLVAICFASCLIPLAMTRRVHPAKLVAAPLEVRFFWQR----VPQALGT 206
P + L + L+P + P + A F W R V +
Sbjct: 159 PHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAV 218

Query: 207 IFIAGLMVGAFYGLAPVY-ANRNGLDASQSSF-FVGMCIVAGFCAQWPLG----WLSDRL 260
FI L+ L ++ +R DA+ I+ G L +R
Sbjct: 219 FFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERR 278

Query: 261 DRSWLIRGNAVLLCLASIPMWGLVTLPYWLLLANGFVTGMLLFTLYPLAVALANDHVEQP 320
+ + L + G + P +LLA+G G+ + P A+ + V++
Sbjct: 279 ALMLGMIADGTGYILLAFATRGWMAFPIMVLLASG---GIGM----PALQAMLSRQVDEE 331

Query: 321 RRVALSAMLLTTYGVGACIGPLVAGALMRH 350
R+ L L + + +GPL+ A+
Sbjct: 332 RQGQLQGSLAALTSLTSIVGPLLFTAIYAA 361


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_05201  HTHTETR  64  6e-15  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 64.3 bits (156), Expect = 6e-15
Identities = 32/179 (17%), Positives = 57/179 (31%), Gaps = 10/179 (5%)

Query: 1 MSSPRAEQKQQTRHALMSAARHLMESGRGFGSLSLREVTRAAGIVPAGFYRHFSDMDQLG 60
M+ ++ Q+TR ++ A L +G S SL E+ +AAG+ Y HF D L
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQ-QGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLF 59

Query: 61 LALVAEVDETFRATLR--AVRRNEFELGGLIDASVRIF-LDAVGANRSQF---LFLAREQ 114
+ + + L L + + + R +F E
Sbjct: 60 SEIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEF 119

Query: 115 YGGSLPIRQAIASLRQRITDDLAADLALLNKMPHLDGAALDVFADLVVKTVFATLPELI 173
G ++QA +L D + L D+ + + L+
Sbjct: 120 VGEMAVVQQAQRNLCLESYDRIE---QTLKHCIEAKMLPADLMTRRAAIIMRGYISGLM 175


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_05211  TCRTETB  46  1e-07  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 46.4 bits (110), Expect = 1e-07
Identities = 33/187 (17%), Positives = 75/187 (40%), Gaps = 5/187 (2%)

Query: 16 NRTHWLILGWGCFIMLFDGYDMVIYGSVVPRLMQEWQLSPVQAGTLGSCALFGMLFGGTL 75
N H IL W C + F + ++ +P + ++ P + + + G +
Sbjct: 9 NLRHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAV 68

Query: 76 LAPLADRFGRRRLV---IATTLLASLAAFLTGHARDPLELGAGRFFTGLALGALVPSAIN 132
L+D+ G +RL+ I S+ F+ GH+ L L RF G A +
Sbjct: 69 YGKLSDQLGIKRLLLFGIIINCFGSVIGFV-GHSFFSL-LIMARFIQGAGAAAFPALVMV 126

Query: 133 LISEFAPAGRRSTLVTVMSAFYSVGAVLSALLAIAMIPAWGWQSVFYVAVLPVLAVPLML 192
+++ + P R ++ + ++G + + + W + + ++ ++ VP ++
Sbjct: 127 VVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLM 186

Query: 193 RWLPESA 199
+ L +
Sbjct: 187 KLLKKEV 193



Score = 32.2 bits (73), Expect = 0.005
Identities = 28/152 (18%), Positives = 54/152 (35%), Gaps = 13/152 (8%)

Query: 258 VAFAMCMLMSYG------LNTWLPKLMAGGGYALGSSLAFLVTLNVGATLGALFGGWLAD 311
+ +C+L + LN LP + S+ + ++G G L+D
Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74

Query: 312 RLGAGRTLVLFFAL--AAASLAALGLGPGPWLLNGLLVVA--GATTIGTLAVIHAYAAQF 367
+LG R L+ + + + +G L+ + A + V+ A++
Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVV---VARY 131

Query: 368 YPAWVRSTGVGWAAGVGRLGAIAGPMLGGSLL 399
P R G + +G GP +GG +
Sbjct: 132 IPKENRGKAFGLIGSIVAMGEGVGPAIGGMIA 163


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_05214  TCRTETA  50  9e-09  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 50.2 bits (120), Expect = 9e-09
Identities = 40/147 (27%), Positives = 57/147 (38%), Gaps = 8/147 (5%)

Query: 55 AEIGLLLSAGLFGMAAGSLFIAPWADRWGRRPLILACLALSGLGMLASALSQAAWQLALL 114
A G+LL+ A + + +DR+GRRP++L LA + + A + W L +
Sbjct: 43 AHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIG 102

Query: 115 R---GLTGLGIGGILASSNVIASEYASRRWRGLAVSLQSTGYALGATLGGLLAVWLIGAW 171
R G+TG A I R G + G G LGGL+ G +
Sbjct: 103 RIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLM-----GGF 157

Query: 172 GWRSVFVFGAGLTLAVIPLVCLCLPES 198
+ F A L C LPES
Sbjct: 158 SPHAPFFAAAALNGLNFLTGCFLLPES 184



Score = 36.7 bits (85), Expect = 1e-04
Identities = 34/146 (23%), Positives = 59/146 (40%), Gaps = 7/146 (4%)

Query: 51 NLGGAEIGLLLSA-GLFGMAAGSLFIAPWADRWGRRPLILACLALSGLGMLASALSQAAW 109
+ IG+ L+A G+ A ++ P A R G R ++ + G G + A + W
Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301

Query: 110 QLALLRGLTGLGIGGILASSNVIASEYASRRW---RGLAVSLQSTGYALGATLGGLLAVW 166
+ L G G+ A +++ + R +G +L S +G L +
Sbjct: 302 MAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAA 361

Query: 167 LIGAW-GWRSVFVFGAGLTLAVIPLV 191
I W GW ++ GA L L +P +
Sbjct: 362 SITTWNGW--AWIAGAALYLLCLPAL 385



Score = 33.3 bits (76), Expect = 0.002
Identities = 41/189 (21%), Positives = 66/189 (34%), Gaps = 5/189 (2%)

Query: 253 RTTLLLWALFFLVMFGFYFIMSWTPKLLVAAGLSTAQGITGGTLLSIGGI---FGAALLG 309
R +++ + L G IM P LL S G LL++ + A +LG
Sbjct: 5 RPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLG 64

Query: 310 GLAARFRLERVLALFMLLTAALLALFSLSAGLPGAALPLGLLIGLCANACVAGLYALAPS 369
L+ RF VL + + A A+ + + L L +G ++ A A A
Sbjct: 65 ALSDRFGRRPVLLVSLAGAAVDYAIMATAPFL--WVLYIGRIVAGITGATGAVAGAYIAD 122

Query: 370 LYDASVRATGVGWGIGVGRGGAILSPLVAGLLLDDGWQPLSLYGAFAAVFVVAAAVLPLL 429
+ D RA G+ G + P++ GL+ A L
Sbjct: 123 ITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLP 182

Query: 430 GARRRERSP 438
+ + ER P
Sbjct: 183 ESHKGERRP 191


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_05218  DHBDHDRGNASE  79  1e-19  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.
Length = 261

Score = 79.3 bits (195), Expect = 1e-19
Identities = 61/244 (25%), Positives = 99/244 (40%), Gaps = 14/244 (5%)

Query: 6 FITGATSGFGEACARRFAEAGWSLVLTGRREERLQALAGELSAKTRVL-PLTLDVRDRAA 64
FITGA G GEA AR A G + E+L+ + L A+ R DVRD AA
Sbjct: 12 FITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSAA 71

Query: 65 MSAAVDNLPEEFATLRGLINNAGLALGTDPAQSCDLDDWDTMVDTNIKGLLYSTRLLLPR 124
+ + E + L+N AG+ L S ++W+ N G+ ++R +
Sbjct: 72 IDEITARIEREMGPIDILVNVAGV-LRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSKY 130

Query: 125 LIAHGAGASIVNLGSVAGKWPYPGSHVYGGTKAFVEQFSLNLRCDLQGTGVRVTNLEPGL 184
++ +G SIV +GS P Y +KA F+ L +L +R + PG
Sbjct: 131 MMDRRSG-SIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGS 189

Query: 185 CESEFSLV----------RFGGDQARYDKTYAGAHPIQPEDIAETI-FWIMNQPAHLNIN 233
E++ G + +P DIA+ + F + Q H+ ++
Sbjct: 190 TETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITMH 249

Query: 234 SLEI 237
+L +
Sbjct: 250 NLCV 253


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_05219  SECA  30  0.015  SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 29.8 bits (67), Expect = 0.015
Identities = 38/156 (24%), Positives = 60/156 (38%), Gaps = 22/156 (14%)

Query: 160 AKAAALAARLREEGYPARAADGLRAAVEA-----ADCVSCVTTSREALVRGAWLKPGVHL 214
K+ ++ L + G + A EA A + VT + RG + G
Sbjct: 460 EKSELVSNELTKAGIKHNVLNAKFHANEAAIVAQAGYPAAVTIATNMAGRGTDIVLGGSW 519

Query: 215 DLVGAFLPSMRETDALAVARARVVVDTRAGALEEAGDLLQAIAEGAIGREA-----ISTE 269
A + ++ A + + + R A+ EAG L IG E I +
Sbjct: 520 Q---AEVAALENPTAEQIEKIKADWQVRHDAVLEAGGLH------IIGTERHESRRIDNQ 570

Query: 270 LRDLLGGAGRRGDPGEITLFKSVGYALEDLVAARRV 305
LR G +GR+GD G + S+ AL + A+ RV
Sbjct: 571 LR---GRSGRQGDAGSSRFYLSMEDALMRIFASDRV 603


74  PAKAF_05288  PAKAF_05312  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_05288  0       7        3.093857    NAD(P)H quinone oxidoreductase
PAKAF_05289  0       7        2.799769    Arginine:Pyruvate Transaminase, AruH
PAKAF_05290  0       10       2.934167    2-ketoarginine decarboxylase, AruI
PAKAF_05291  -1      11       2.435154    acetate--CoA ligase family protein
PAKAF_05292  -1      13       2.181580    probable acyl-CoA dehydrogenase
PAKAF_05293  -2      14       2.751527    probable enoyl-CoA hydratase/isomerase
PAKAF_05294  -2      13       2.819994    lysine-specific permease
PAKAF_05295  -2      13       3.182826    probable two-component sensor
PAKAF_05297  0       10       3.342035    probable two-component response regulator
PAKAF_05298  0       10       3.599503    probable transcriptional regulator
PAKAF_05299  1       10       3.492087    ABC transporter substrate-binding protein
PAKAF_05300  0       9        3.536967    probable oxidoreductase
PAKAF_05301  -1      10       3.020829    probable transcriptional regulator
PAKAF_05302  0       9        3.448913    3-deoxy-D-manno-octulosonic-acid (KDO)
PAKAF_05303  -1      8        1.942997    probable transcriptional regulator
PAKAF_05304  -1      9        1.745611    SMR multidrug efflux transporter
PAKAF_05305  0       12       -1.408433   FAD-dependent oxidoreductase
PAKAF_05306  -1      12       -1.545567   aldo/keto reductase
PAKAF_05307  -2      11       -2.198664   hypothetical protein
PAKAF_05308  -2      13       -2.894377   probable acyl-CoA dehydrogenase
PAKAF_05309  -2      15       -3.207490   probable acyl-CoA dehydrogenase
PAKAF_05310  -2      18       -4.678215   asparagine synthetase,Asparagine synthetase
PAKAF_05311  -2      10       -3.015642   LPS biosynthesis protein RfaE
PAKAF_05312  -1      13       -3.262101   transport protein MsbA
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_05295  HTHFIS  53  5e-09  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 52.5 bits (126), Expect = 5e-09
Identities = 31/143 (21%), Positives = 52/143 (36%), Gaps = 8/143 (5%)

Query: 748 SALEVLLVEDVALNREVAQGLLERDGHRVMLAEDAGPALALCRQRRFDLILLDMHLPGMA 807
+ +L+ +D A R V L R G+ V + +A DL++ D+ +P
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 808 GLELCAGIRRQLDGLNRATPIFAFTASIQPDMVRRYFAAGMQGVLGKPLRMDEL----RR 863
+L I++ L P+ +A + G L KP + EL R
Sbjct: 62 AFDLLPRIKKARPDL----PVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117

Query: 864 ALGEVGTSVPALAVDAALDRQML 886
AL E L D+ ++
Sbjct: 118 ALAEPKRRPSKLEDDSQDGMPLV 140


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_05297  HTHFIS  92  1e-23  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 92.2 bits (229), Expect = 1e-23
Identities = 34/136 (25%), Positives = 60/136 (44%), Gaps = 6/136 (4%)

Query: 5 PRVLVVDDDPVIRELLQAYLGEEGYDVLCAGNAEQAEACLAECAHLGQPVELVLLDIRLP 64
+LV DDD IR +L L GYDV NA +A +LV+ D+ +P
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAA-----GDGDLVVTDVVMP 58

Query: 65 GKDGLTLTRELR-VRSEVGIILITGRNDEIDRIVGLECGADDYVIKPLNPRELVSRAKNL 123
++ L ++ R ++ +++++ +N + I E GA DY+ KP + EL+
Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 124 IRRVRHAQASAGPARQ 139
+ + + Q
Sbjct: 119 LAEPKRRPSKLEDDSQ 134


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_05298  HTHTETR  75  1e-18  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 74.7 bits (183), Expect = 1e-18
Identities = 34/184 (18%), Positives = 65/184 (35%), Gaps = 4/184 (2%)

Query: 6 RFSRLEPEQRKALLIEATLACLKRHGFQGASVRKICAEAGVSVGLINHHYDGKDALVAEA 65
R ++ E ++ + +++ L + G S+ +I AGV+ G I H+ K L +E
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 66 YLAVTGRVMRLLRGAIDTAPGGARPRLSAFFEASFSAELLDPQ---LLDAWLAFWGAVGS 122
+ + L PG L + + + + L++ VG
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 123 IEAIGRVHDHSYGEYRALLVGVLRQLAEEGGW-ADFDAELAAISLSALLDGLWLESGLNP 181
+ + + + E + L+ E AD AAI + + GL P
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAP 182

Query: 182 ATFT 185
+F
Sbjct: 183 QSFD 186


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_05311  LPSBIOSNTHSS  28  0.044  Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature.
Length = 166

Score = 28.2 bits (63), Expect = 0.044
Identities = 14/57 (24%), Positives = 26/57 (45%), Gaps = 7/57 (12%)

Query: 346 GCFDILHAGHVTYLEQARAQGDRLIVGVNDDASVTRLKGVGRPINSVDRRMAVLAGL 402
G FD + GH+ +E+ D++ V V + + +P+ SV R+ +A
Sbjct: 7 GSFDPITFGHLDIIERGCRLFDQVYVAVLRNPN-------KQPMFSVQERLEQIAKA 56


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_05312  ACRIFLAVINRP  31  0.013  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 31.3 bits (71), Expect = 0.013
Identities = 12/50 (24%), Positives = 23/50 (46%)

Query: 144 ITFNVTMVTGAATDAIKVVIREGLTVVFLFLYLLWMNWKLTLVMLAILPV 193
++ T + + + E + +VFL +YL N + TL+ +PV
Sbjct: 325 YPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPV 374


75  PAKAF_05371  PAKAF_05376  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_05371  3       12       -0.980460   DUF971 domain-containing protein
PAKAF_05372  3       11       -1.622580   poly(3-hydroxyalkanoic acid) synthase 1
PAKAF_05373  3       14       -1.198138   poly(3-hydroxyalkanoic acid) depolymerase
PAKAF_05374  4       14       -1.639624   poly(3-hydroxyalkanoic acid) synthase 2
PAKAF_05375  7       17       -1.158591   probable transcriptional regulator
PAKAF_05376  6       16       -0.729395   polyhydroxyalkanoate synthesis protein PhaF
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_05375  HTHTETR  60  2e-13  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 60.0 bits (145), Expect = 2e-13
Identities = 30/148 (20%), Positives = 57/148 (38%), Gaps = 8/148 (5%)

Query: 1 MKTRDRILECSLLLFNEQGEPNVSTLEIANELGISPGNLYYHFHGKEPLVMALFERFQAE 60
+TR IL+ +L LF++QG + S EIA G++ G +Y+HF K L ++E ++
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 61 LAPLL-----DPPEEVRLGAEDYWLFLHLIVERLAHYRFLFQDL---SNLTGRLPRLARG 112
+ L P + + + + R L + + G + + +
Sbjct: 70 IGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQA 129

Query: 113 IRTWLGALKRTLATLLARLKADRQLRSD 140
R + L + L +D
Sbjct: 130 QRNLCLESYDRIEQTLKHCIEAKMLPAD 157


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_05376  IGASERPTASE  47  7e-08  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 47.0 bits (111), Expect = 7e-08
Identities = 28/167 (16%), Positives = 50/167 (29%), Gaps = 3/167 (1%)

Query: 140 PAAKAAAKPAAKPAAKPAAKTAAAKPAAKPAAKAAAKPAAKPAAKKTAAKTAAAKPA--A 197
PA + +K +KT A + AK A A T + A
Sbjct: 1030 PATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSG 1089

Query: 198 KPAAKPTAKAAAKPATKPAA-KAAAKPAAKPAAAKPAAKPAAKPAAATAAKPAAKPAAKP 256
+ + AT KA + K ++ + K + +P A+PA +
Sbjct: 1090 SETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAREN 1149

Query: 257 AAKKPAAKKPAAKPAAAKPAAPAASSSAPAAPAATPAASAPAANAPA 303
+ + A PA +S+ T + + N+
Sbjct: 1150 DPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVV 1196



Score = 43.1 bits (101), Expect = 1e-06
Identities = 27/175 (15%), Positives = 42/175 (24%), Gaps = 6/175 (3%)

Query: 140 PAAKAAAKPAAKPAAKPAAKTAAAKPAAKPAAKAAAK----PAAKPAAKKTAAKTAAAKP 195
P + + A P+ + A+ P PA + T
Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAE 1042

Query: 196 AAKPAAKPTAKAAAKPA-TKPAAKAAAKPAAKPAAAKPAAKPAAKPAAATA-AKPAAKPA 253
+K +K K T + AK A A A+ + T +
Sbjct: 1043 NSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKE 1102

Query: 254 AKPAAKKPAAKKPAAKPAAAKPAAPAASSSAPAAPAATPAASAPAANAPATPSSQ 308
K+ AK K S + P A N P +
Sbjct: 1103 TATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKE 1157


76  PAKAF_05397  PAKAF_05413  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_05397  3       13       -0.907758   NUDIX hydrolase
PAKAF_05398  6       24       -3.882087   DguC
PAKAF_05399  9       37       -6.165878   DguB
PAKAF_05400  7       30       -5.125680   DguA
PAKAF_05401  5       27       -5.018717   DguR
PAKAF_05402  5       27       -5.325931   Tli5b
PAKAF_05403  3       19       -3.181809   Tli5b
PAKAF_05404  0       11       -0.686598   Tli5b
PAKAF_05405  -2      9        0.735265    PldB
PAKAF_05406  -1      6        1.320897    VgrG5
PAKAF_05407  -1      10       1.591457    N-formylglutamate amidohydrolase
PAKAF_05408  2       11       1.212324    imidazolone-5-propionate hydrolase HutI
PAKAF_05409  2       12       1.085164    probable histidine/phenylalanine ammonia-lyase
PAKAF_05410  3       11       0.349670    probable ATP-binding component of ABC
PAKAF_05411  2       11       0.779111    probable permease of ABC transporter
PAKAF_05412  0       11       1.064559    probable binding protein component of ABC
PAKAF_05413  2       10       1.323369    probable amino acid permease
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_05408  UREASE  36  2e-04  Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 36.3 bits (84), Expect = 2e-04
Identities = 17/33 (51%), Positives = 21/33 (63%)

Query: 341 LAGVTLHAARALGLEASHGSLEVGKLADFVAWD 373
+A T++ A A GL GSLEVGK AD V W+
Sbjct: 406 IAKYTINPAIAHGLSHEIGSLEVGKRADLVLWN 438


77  PAKAF_05572  PAKAF_05589  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_05572  3       16       -0.566167   c-type cytochrome
PAKAF_05573  4       19       -1.330452   hypothetical protein
PAKAF_05574  5       20       -1.835779   TerC family protein
PAKAF_05575  7       22       -1.485699   mechanosensitive ion channel family protein
PAKAF_05576  7       20       -0.808404   probable ATP-binding component of ABC
PAKAF_05578  10      23       -0.540453   alginate regulatory protein AlgP
PAKAF_05579  4       18       -0.470743   probable peptidyl-prolyl cis-trans isomerase,
PAKAF_05580  4       14       0.169377    Alginate regulatory protein AlgQ
PAKAF_05581  3       14       0.452285    disulfide bond formation protein
PAKAF_05582  3       15       0.038680    heme biosynthesis protein HemY
PAKAF_05583  0       9        0.045042    heme biosynthesis operon protein HemX
PAKAF_05584  -1      13       -2.294793   uroporphyrinogen-III synthetase
PAKAF_05585  0       19       -3.312995   porphobilinogen deaminase
PAKAF_05586  0       16       -2.710415   alginate biosynthesis regulatory protein AlgR
PAKAF_05587  0       15       -2.767942   alginate biosynthesis protein FimS
PAKAF_05588  -1      13       -3.010100   argininosuccinate lyase
PAKAF_05589  0       16       -3.648053   PA5264 ortholog, hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_05576  GPOSANCHOR  31  0.012  Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 31.2 bits (70), Expect = 0.012
Identities = 30/109 (27%), Positives = 44/109 (40%), Gaps = 12/109 (11%)

Query: 540 RTDKRAQRQAAAALRQQLAPHKREADKLERELGGLHEKLAA-------IEARLG----DS 588
R D A R+A L + + + E L L A +EA +
Sbjct: 315 RRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQN 374

Query: 589 ALYDVSRKDELRELLSEQSSLKVREGELEERWLEALETLEALQKELEAS 637
+ + SR+ R+L + + + K E LEE + L LE L KELE S
Sbjct: 375 KISEASRQSLRRDLDASREAKKQVEKALEEANSK-LAALEKLNKELEES 422


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_05578  IGASERPTASE  61  2e-12  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 61.2 bits (148), Expect = 2e-12
Identities = 44/220 (20%), Positives = 67/220 (30%), Gaps = 7/220 (3%)

Query: 134 KAKPATKPAAKAAAKPTVKTVAAKPAAKPAAKPAAKPA-AKPAAKTAAAKPAAKPTAKPA 192
T P A P+V + + A+ P PA A P+ T +K +K
Sbjct: 993 DTTNITTPNNIQADVPSVPSNNEE-IARVDEAPVPPPAPATPSETTETVAENSKQESKTV 1051

Query: 193 AKPAAKPAAKTAAAKPAAKPAAKPVAKPAAKPAAKTAAAKPAAKPAAKPVAKPTAKPAAK 252
K TA + AK A V A + A + K K TA +
Sbjct: 1052 EKNEQDATETTAQNREVAKEAKSNV--KANTQTNEVAQSGSETKETQTTETKETATVEKE 1109

Query: 253 TAAAKPAAKPAAKPAAKPAAKPVAKSAAAKPAAKPAAKPAAKPAAKPAAKPVAAKPAATK 312
A K P P +P A+PA + K ++ T
Sbjct: 1110 EKAKVETEKTQEVPKVTSQVSP---KQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTA 1166

Query: 313 PATAPAAKPAATPSAPAAASSAASATPAAGSNGAAPTSAS 352
PA + ++ P S+ + + N T A+
Sbjct: 1167 DTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPAT 1206



Score = 44.3 bits (104), Expect = 6e-07
Identities = 40/246 (16%), Positives = 80/246 (32%), Gaps = 19/246 (7%)

Query: 45 EKQRGKAQEKLHKARTKLQDAAKAGKTKAQAK--ARETISDLEEALDTLKARQADTRTYI 102
E A+ +++T ++ A +T AQ + A+E S+++ T + Q
Sbjct: 1035 ETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQ------- 1087

Query: 103 VGLKRDVQESLKLAQGVGKVKEAAGKA-LESRKAKPATKPAAKAAAKPTVKTVAAKPAAK 161
+ +E+ E KA +E+ K + K ++ + K ++ +P A+
Sbjct: 1088 --SGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQE-QSETVQPQAE 1144

Query: 162 PAAKPA----AKPAAKPAAKTAAAKPAAKPTAKPAAKPAAKPAAKTAAAKPAAKPAAKPV 217
PA + K TA + AK T+ +P + P +
Sbjct: 1145 PARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENP--ENT 1202

Query: 218 AKPAAKPAAKTAAAKPAAKPAAKPVAKPTAKPAAKTAAAKPAAKPAAKPAAKPAAKPVAK 277
+P + ++ + V T ++ + A V
Sbjct: 1203 TPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLS 1262

Query: 278 SAAAKP 283
A AK
Sbjct: 1263 DARAKA 1268



Score = 33.9 bits (77), Expect = 0.001
Identities = 17/119 (14%), Positives = 29/119 (24%), Gaps = 1/119 (0%)

Query: 233 PAAKPAAKPVAKPTAKPAAKTAAAKPAAKPAAKPAAKPAAKPVAKSAAAKPAAKPAAKPA 292
P + + V A P+ + A+ PV A A P+ A
Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSET-TETVA 1041

Query: 293 AKPAAKPAAKPVAAKPAATKPATAPAAKPAATPSAPAAASSAASATPAAGSNGAAPTSA 351
+ + A A A + A + A + + T
Sbjct: 1042 ENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTET 1100


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_05579  INFPOTNTIATR  96  2e-26  Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 95.8 bits (238), Expect = 2e-26
Identities = 61/202 (30%), Positives = 97/202 (48%), Gaps = 16/202 (7%)

Query: 22 KDELAYAVGARLGMRLQQEMPGLELSELLLGLRQAYRGEALEIPPERIEQLLLQHE---- 77
KD+L+Y++GA LG + + + L G++ G L + E+++ +L + +
Sbjct: 31 KDKLSYSIGADLGKNFKNQGIDINPDVLAKGMQDGMSGAQLILTEEQMKDVLSKFQKDLM 90

Query: 78 -------NATTETPRTTPAEARFLANEKARFGVRELTGGVLVSELRRGQGNGIGAATQVH 130
N E + +A FL+ K++ G+ L G+ + G G G + V
Sbjct: 91 AKRSAEFNKKAEENKAK-GDA-FLSANKSKPGIVVLPSGLQYKIIDAGTGAKPGKSDTVT 148

Query: 131 VRYRGLLADGQVFDQSESA---EWFALDSVIEGWRTALRAMPVGARWRVVIPSAQAYGHE 187
V Y G L DG VFD +E A F + VI GW AL+ MP G+ W V +P+ AYG
Sbjct: 149 VEYTGTLIDGTVFDSTEKAGKPATFQVSQVIPGWTEALQLMPAGSTWEVFVPADLAYGPR 208

Query: 188 GAGDLIPPDAPLVFEIDLLGFR 209
G I P+ L+F+I L+ +
Sbjct: 209 SVGGPIGPNETLIFKIHLISVK 230


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_05586  HTHFIS  79  4e-19  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.5 bits (196), Expect = 4e-19
Identities = 31/136 (22%), Positives = 57/136 (41%), Gaps = 5/136 (3%)

Query: 3 VLIVDDEPLARERLARLVGQLDGYRVLEPSASNGEEALTLIDSLKPDIVLLDIRMPGLDG 62
+L+ DD+ R L + + + GY V SN I + D+V+ D+ MP +
Sbjct: 6 ILVADDDAAIRTVLNQALSRA-GYDVR--ITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 63 LQVAARLCEREAPPAVIFCTAHDEF--ALEAFQVSAVGYLVKPVRSEDLAEALKKASRPN 120
+ R+ + V+ +A + F A++A + A YL KP +L + +A
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 121 RVQLAALTKPPASGGS 136
+ + + L G
Sbjct: 123 KRRPSKLEDDSQDGMP 138


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_05587  PF06580  182  1e-56  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 182 bits (463), Expect = 1e-56
Identities = 76/308 (24%), Positives = 137/308 (44%), Gaps = 23/308 (7%)

Query: 64 LFVQWIVLLSAALFCRLRPLLARLPVALAGSACCLLVVALT------LGCTAVAEHYQLG 117
+F I L+ L R + R +L V + A ++L
Sbjct: 43 IFNIAISLMGLVLTHAYRSFIKRQGWLKLNMGQIILRVLPACVVIGMVWFVANTSIWRLL 102

Query: 118 GELTRAGE-------VNLYLRHALIALIMSALVLRYFYLQS-------QWRRQQQAELQA 163
+ +++ ++ + S L + + ++ QW+ A+ +A
Sbjct: 103 AFINTKPVAFTLPLALSIIFNVVVVTFMWSLLYFGWHFFKNYKQAEIDQWKMASMAQ-EA 161

Query: 164 RLESLQARIRPHFLFNSLNSIASLIELDPLKAEHAVLDLSDLFRASLAK-PGTLVSWEEE 222
+L +L+A+I PHF+FN+LN+I +LI DP KA + LS+L R SL VS +E
Sbjct: 162 QLMALKAQINPHFMFNALNNIRALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADE 221

Query: 223 LALARRYLSIEQYRLGDRLQLDWQVHGVPANLPIPQLTLQPLLENALIYGIQPRVEGGLV 282
L + YL + + DRLQ + Q++ ++ +P + +Q L+EN + +GI +GG +
Sbjct: 222 LTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKI 281

Query: 283 QVEAVYREGVFQLCVSNPYDEALESPPSKGTRQALHNIDARLGALFGPKASLSVERRDGR 342
++ G L V N AL++ + T L N+ RL L+G +A + + + G+
Sbjct: 282 LLKGTKDNGTVTLEVENTGSLALKNTK-ESTGTGLQNVRERLQMLYGTEAQIKLSEKQGK 340

Query: 343 HYTCLRYP 350
+ P
Sbjct: 341 VNAMVLIP 348


78  PAKAF_05673  PAKAF_05679  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_05673  2       13       -1.025990   hypothetical protein
PAKAF_05674  -2      12       1.966655    probable DNA-binding protein
PAKAF_05675  -2      12       3.376427    rubredoxin reductase
PAKAF_05676  -2      11       3.268695    Rubredoxin 2
PAKAF_05677  -2      11       3.495725    Rubredoxin 1
PAKAF_05678  -2      10       3.945856    heme-binding protein
PAKAF_05679  -2      8        3.310507    glycolate oxidase subunit GlcF
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_05674  DNABINDINGHU  111  4e-36  Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 111 bits (280), Expect = 4e-36
Identities = 41/87 (47%), Positives = 58/87 (66%)

Query: 3 KPELAAAIAEKADLTKEQANRVLNALLDEITGALNRKDSVTLVGFGTFLQRHRGARTGKN 62
K +L A +AE +LTK+ + ++A+ ++ L + + V L+GFG F R R AR G+N
Sbjct: 4 KQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGRN 63

Query: 63 PQTGQPVKIKASNTVAFKPGKALRDAV 89
PQTG+ +KIKAS AFK GKAL+DAV
Sbjct: 64 PQTGEEIKIKASKVPAFKAGKALKDAV 90


79  PAKAF_05707  PAKAF_05733  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
PAKAF_05707  2       12       2.598973    L-serine dehydratase
PAKAF_05708  4       13       2.754947    GbdR
PAKAF_05709  3       12       3.653628    HTH-type transcriptional regulator CysL
PAKAF_05710  3       11       3.417576    probable transcriptional regulator
PAKAF_05711  2       11       2.828978    YeiH family protein
PAKAF_05712  2       13       2.745167    HocS
PAKAF_05713  0       12       2.438641    CdhB, Carnitine dehydrogenase-related protein B
PAKAF_05714  0       11       2.969804    CdhA, Carnitine dehydrogenase
PAKAF_05715  -1      11       2.143508    CdhC, Carnitine dehydrogenase-related protein C
PAKAF_05716  -1      9        2.284120    CaiX
PAKAF_05717  0       11       2.347691    CdhR, transcriptional regulator
PAKAF_05718  2       11       2.232926    probable peptidic bond hydrolase
PAKAF_05719  2       11       0.680580    DUF1028 domain-containing protein
PAKAF_05720  2       12       -0.227800   RidA family protein
PAKAF_05721  2       13       -0.245169   conserved hypothetical protein
PAKAF_05722  0       14       0.039616    cardiolipin synthase
PAKAF_05723  1       16       0.702455    DUF1348 family protein
PAKAF_05724  0       15       1.110675    peptidase M19
PAKAF_05725  -2      13       1.335148    hypothetical protein
PAKAF_05726  -2      14       1.386532    DgcA, Dimethylglycine catabolism
PAKAF_05727  0       14       1.967976    DgcB, Dimethylglycine catabolism
PAKAF_05728  1       18       1.449608    probable electron transfer flavoprotein alpha
PAKAF_05729  5       21       -1.145497   electron transfer flavoprotein subunit beta/FixA
PAKAF_05730  5       24       -2.680061   hypothetical protein
PAKAF_05731  5       25       -1.106926   probable transcriptional regulator
PAKAF_05732  3       21       -0.178853   hypothetical protein
PAKAF_05733  3       16       0.306483    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PAKAF_05727TCRTETA320.008 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 32.1 bits (73), Expect = 0.008
Identities = 35/155 (22%), Positives = 52/155 (33%), Gaps = 16/155 (10%)

Query: 5 LLPVLLFAALALAVLGAAKRFLMWRRGRPAKVDWIGGL----LQMPRRYLVDLHHVVERD 60
LL L AA+ A++ A + GR + G+ + Y+ D+ ER
Sbjct: 76 LLVSLAGAAVDYAIMATAPFLWVLYIGR-----IVAGITGATGAVAGAYIADITDGDERA 130

Query: 61 RYMSRTHVATAGGFVLAALLAILVHGFGLHGRILGFALLAATALMFVGALF--VARRRLD 118
R+ G V +L L+ GF H A L + L +
Sbjct: 131 RHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERR 190

Query: 119 PPSRLSKGP-----WMRLPKSLLAFAASFFLATLP 148
P R + P W R + A A FF+ L
Sbjct: 191 PLRREALNPLASFRWARGMTVVAALMAVFFIMQLV 225


80  PAKAF_05792  PAKAF_05800  Y  N  N  Genomic Island
LocusTag     DNBias  CDNBias    %GCBias  Product
PAKAF_05792       2       12   2.158717  DUF2388 domain-containing protein
PAKAF_05793       2       12   2.424255  DUF2388 domain-containing protein
PAKAF_05794       2       10   1.989890  DUF2388 domain-containing protein
PAKAF_05795       2       11   1.247153  DUF4105 domain-containing protein
PAKAF_05796       3       10   1.156573  GFA family protein
PAKAF_05797       3       10   1.196874  AEC family transporter
PAKAF_05798       3       10   0.536677  DUF2388 domain-containing protein
PAKAF_05799       3       10   0.367632  probable citrate transporter
PAKAF_05800       2        8   0.597987  TerC family protein
81  PAKAF_00159  PAKAF_00169  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias    %GCBias  Product
PAKAF_00159       2       10   0.249392  probable Resistance-Nodulation-Cell Division
PAKAF_00160       1        9  -0.733195  probable Resistance-Nodulation-Cell Division
PAKAF_00161       1       11  -0.996736  probable Resistance-Nodulation-Cell Division
PAKAF_00162      -1       10   0.911964  probable transcriptional regulator
PAKAF_00163       1       11  -0.199758  hypothetical protein
PAKAF_00164       0       10   0.579585  hypothetical protein
PAKAF_00165      -1       12   1.019713  histidine porin OpdC
PAKAF_00166      -1       14   2.224539  probable transcriptional regulator
PAKAF_00167      -1       13   1.992856  probable gamma-glutamyltranspeptidase
PAKAF_00168      -1       14   0.975911  hypothetical protein
PAKAF_05910       1       13   1.857535  probable transporter
PAKAF_00169      -1       16   2.077844  probable transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00159  RTXTOXIND        49  1e-08    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 49.1 bits (117), Expect = 1e-08
Identities = 23/129 (17%), Positives = 41/129 (31%), Gaps = 9/129 (6%)

Query: 75 VGGKIVERLVDVGDHVAAGQVLARLDP-------QDQRSNVENAQAAVAAQQAQSKLADL 127
+ E +V G+ V G VL +L +S++ A+ Q S+ +L
Sbjct: 103 ENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIEL 162

Query: 128 NYQRQKALLPKGYTSQSEYDQALASVRSAQSSLKAAQAQLANARDLLSYTELRASDAGVI 187
N + L + Y ++ L + Q Q L+ + RA V+
Sbjct: 163 NKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKE--LNLDKKRAERLTVL 220

Query: 188 TARQAEVGQ 196

Sbjct: 221 ARINRYENL 229



Score = 42.9 bits (101), Expect = 2e-06
Identities = 33/216 (15%), Positives = 71/216 (32%), Gaps = 26/216 (12%)

Query: 58 SITGDIQARVQADQSFRVGGKIVERLVDVGDHVAAGQVLARLDPQDQRSNVENAQAAVAA 117
++ I + + L+ A A L+ +++ N +
Sbjct: 218 TVLARINRYENLSRVEKSRLDDFSSLLHKQ----AIAKHAVLEQENKYVEAVNELRVYKS 273

Query: 118 QQAQSKLADLNYQRQKALLPKGYTSQSEYDQALASVRSAQSSLKAAQAQLANARDLLSYT 177
Q Q + L+ + + L+ + + ++ L +R ++ +LA + +
Sbjct: 274 QLEQIESEILSAKEEYQLVTQLFKNE-----ILDKLRQTTDNIGLLTLELAKNEERQQAS 328

Query: 178 ELRASDAGVITARQA-EVGQVVQATVPIFTLARDGERDAVFNVYESLFSHDVDGQRITVS 236
+RA + + + G VV + + + D V + + D+ I V
Sbjct: 329 VIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPE---DDTLEVTALVQNKDIG--FINVG 383

Query: 237 LLGKPEVTA---------SGKVREITP--TVDERSG 261
+V A GKV+ I D+R G
Sbjct: 384 QNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLG 419


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00160  RTXTOXIND        40  1e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 39.8 bits (93), Expect = 1e-05
Identities = 16/103 (15%), Positives = 32/103 (31%), Gaps = 3/103 (2%)

Query: 63 TNGRIASRLFDVGDFVGKGALLATLDPTDQQNQLRASQGDLASAEAQLIDAQANARRQEE 122
N + + G+ V KG +L L + +Q L A + Q +R E
Sbjct: 103 ENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIEL 162

Query: 123 --LFARSVTAQARLDDARTR-LKTSQASFDQAKAAVQQARDQL 162
L + + + + + + + Q + Q
Sbjct: 163 NKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQK 205



Score = 39.0 bits (91), Expect = 2e-05
Identities = 23/182 (12%), Positives = 59/182 (32%), Gaps = 31/182 (17%)

Query: 51 IQARYESVLGFRTNGRIASRLFDVGDFVGKGALLATLDPTDQQNQLRASQGDLASAEAQL 110
I ++ S L + K A+L +Q+N+ + +L ++QL
Sbjct: 223 INRYENLSRVEKSRLDDFSSLLHKQ-AIAKHAVL------EQENKYVEAVNELRVYKSQL 275

Query: 111 IDAQANARRQEELFARSVTAQARLDDARTRLKTSQASFDQAKAAVQQARDQLSYTRLVTD 170
++ +E + Q ++ +L+ + + + + ++ + +
Sbjct: 276 EQIESEILSAKE--EYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAP 333

Query: 171 FDGVITTW--HAEAGQVVSAGQAVVTLARPEVREAVFDLPTEVAESLPADARFLVSAQLD 228
+ H E G VV+ + ++ + P D V+A +
Sbjct: 334 VSVKVQQLKVHTE-GGVVTTAETLMVIV-------------------PEDDTLEVTALVQ 373

Query: 229 PQ 230
+
Sbjct: 374 NK 375


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00161  ACRIFLAVINRP    490  e-159    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 490 bits (1264), Expect = e-159
Identities = 240/1052 (22%), Positives = 444/1052 (42%), Gaps = 69/1052 (6%)

Query: 7 LSDWALRHQSLVWYLMAVSLVMGVFSYLNLGREEDPSFAIKTMVIQTRWPGATVDDTLEQ 66
++++ +R W L + ++ G + L L + P+ A + + +PGA +
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 67 VTDRIEKKLEELDSLDYVKSYT-RPGESTVFVYLKDTTKAGDIPDIWYQVRKKISDIQGE 125
VT IE+ + +D+L Y+ S + G T+ + + T D QV+ K+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGT---DPDIAQVQVQNKLQLATPL 117

Query: 126 FPQGIQGPG-FNDEFGDVFGSVYAFTADGLDFRQ--LRDYVEKVRLD-IRSVKDLGKVQM 181
PQ +Q G ++ + V F +D Q + DYV D + + +G VQ+
Sbjct: 118 LPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQL 177

Query: 182 IGAQNEV-IYLNFSTRKLAALGLDQRQVVQSLQAQNAVTPSGVVEAGPE------RISVR 234
GAQ + I+L+ L L V+ L+ QN +G + P S+
Sbjct: 178 FGAQYAMRIWLDAD--LLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASII 235

Query: 235 TSGNFRSEKDLQAVNLRVNDRFY--RLSDLASISRDFVDPPTSLFRYKGEPAIGLAVAMK 292
F++ ++ V LRVN RL D+A + + + R G+PA GL + +
Sbjct: 236 AQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARV-ELGGENYNVIARINGKPAAGLGIKLA 294

Query: 293 EGGNILEFGEALNARMQEITGELPVGVGVHQVSNQAQVVKKAVGGFTRALFEAVVIVLIV 352
G N L+ +A+ A++ E+ P G+ V + V+ ++ + LFEA+++V +V
Sbjct: 295 TGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLV 354

Query: 353 SFVSLG-LRAGLVVACSIPLVLAMVFVFMEYTDITMQRVSLGALIIALGLLVDDAMITVE 411
++ L +RA L+ ++P+VL F + ++ +++ +++A+GLLVDDA++ VE
Sbjct: 355 MYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVE 414

Query: 412 MMITRLELGDSLHDSATY-AYTSTAFPMLTGTLVTVAGFVPIGLNASSAGEYTFTLFAVI 470
+ + AT + + ++ +V A F+P+ S G I
Sbjct: 415 NVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITI 474

Query: 471 AVALLLSWIVAVLFAPVIAVHILPKTLKHKSEQKKG---RIAERFDSLLHLA-------M 520
A+ LS +VA++ P + +L E K G FD ++ +
Sbjct: 475 VSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKIL 534

Query: 521 RRRWTTIFLTALLFGVSLFLMKFVQHQFFPSSDRPELLVDLNLPQNSSIHETRAVMDR-L 579
+ + AL+ + L + F P D+ L + LP ++ T+ V+D+
Sbjct: 535 GSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVT 594

Query: 580 EATLKDDEDID-HWSAYVGEGAIRFYLPLDQQLQNNFYGQLVIVTKDLEAR---ERVAAR 635
+ LK+++ G Q QN + K E R E A
Sbjct: 595 DYYLKNEKANVESVFTVNGFS-------FSGQAQNAGMAF--VSLKPWEERNGDENSAEA 645

Query: 636 LRDRLRKDYVGI-STYVQPLEMGPPV--------GRPIQYRVSGPQIDKVREYAMGLAGV 686
+ R + + I +V P M P + +G D + + L G+
Sbjct: 646 VIHRAKMELGKIRDGFVIPFNM-PAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGM 704

Query: 687 LDGNP-NIGDIVYDWNEPGKMLKIDIAQDKARQLGLSSEDVAQIMNSVVTGSAVTQVRDD 745
+P ++ + + E K+++ Q+KA+ LG+S D+ Q +++ + G+ V D
Sbjct: 705 AAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDR 764

Query: 746 IYLVNVIGRAEDSERGSLETLESLQIVTPSGTSIPLKAFAKVSYELEQPLVWRRDRKPTI 805
+ + +A+ R E ++ L + + +G +P AF + P + R + P++
Sbjct: 765 GRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSM 824

Query: 806 TVKASLRGEIQPTDLVAKLAPEVKRFADGLPANYRIEVGGTVEESGKAEGPIAKVVPLML 865
+ +GE P ++ A LPA + G + + +V +
Sbjct: 825 EI----QGEAAPGTSSGDAMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISF 880

Query: 866 FLMATFLMIQLQSVQKLFLVASVAPLGLIGVVAALLPTGTPMGFVAILGILALIGIIIRN 925
++ L +S V V PLG++GV+ A ++G+L IG+ +N
Sbjct: 881 VVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKN 940

Query: 926 SVILVTQI-DAFEKDGKTPWEAVLEATHHRTRPILLTAAAASLGMIPIA------REVFW 978
++++V D EK+GK EA L A R RPIL+T+ A LG++P+A
Sbjct: 941 AILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQN 1000

Query: 979 GPMAYAMIGGIVAATLLTLIFLPALYVAWYRI 1010
+ ++GG+V+ATLL + F+P +V R
Sbjct: 1001 -AVGIGVMGGMVSATLLAIFFVPVFFVVIRRC 1031



Score = 78.7 bits (194), Expect = 8e-17
Identities = 79/517 (15%), Positives = 172/517 (33%), Gaps = 35/517 (6%)

Query: 518 LAMRRRWTTIFLTALLFGVSLFLMKFVQHQFFPSSDRPELLVDLNLPQNSSIHETRAVMD 577
+RR L +L + + +P+ P + V N P + V
Sbjct: 4 FFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQ 63

Query: 578 RLEATLKDDEDIDHWSAYVGEGAIRFYLPLDQQLQNNFYGQLVIVTKDLEARERVAARLR 637
+E + +++ + S+ + A + L Q + V V + L
Sbjct: 64 VIEQNMNGIDNLMYMSS-TSDSAGSVTITLTFQSGTDPDIAQVQVQ---NKLQLATPLLP 119

Query: 638 DRLRKDYVGISTYVQPLEMG----PPVGRPIQYRVSGPQIDKVREYAMGLAGVLDGNPNI 693
+++ + + M Q +S V++ L GV D
Sbjct: 120 QEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG 179

Query: 694 GDIVYDWNEPGKMLKIDIAQDKARQLGLSSEDVAQIMNS----VVTGSAVTQVRDDIYLV 749
++I + D + L+ DV + + G +
Sbjct: 180 AQ---------YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQL 230

Query: 750 NVIGRAEDSERGSLETLESLQI-VTPSGTSIPLKAFAKVSYELE-QPLVWRRDRKPTITV 807
N A+ + E + + V G+ + LK A+V E ++ R + KP +
Sbjct: 231 NASIIAQT-RFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGL 289

Query: 808 KASLRGEIQPTDLVAKLAPEVKRFADGLPANYRIEVGGTVEESGKAEGPIAKVVPLML-- 865
L D + ++ P ++ + + + I +VV +
Sbjct: 290 GIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLY--PYDTTPFVQLSIHEVVKTLFEA 347

Query: 866 -FLMATFLMIQLQSVQKLFLVASVAPLGLIGVVAALLPTGTPMGFVAILGILALIGIIIR 924
L+ + + LQ+++ + P+ L+G A L G + + + G++ IG+++
Sbjct: 348 IMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVD 407

Query: 925 NSVILVTQI-DAFEKDGKTPWEAVLEATHHRTRPILLTAAAASLGMIPIA-----REVFW 978
+++++V + +D P EA ++ ++ A S IP+A +
Sbjct: 408 DAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIY 467

Query: 979 GPMAYAMIGGIVAATLLTLIFLPALYVAWYRIPEPGR 1015
+ ++ + + L+ LI PAL +
Sbjct: 468 RQFSITIVSAMALSVLVALILTPALCATLLKPVSAEH 504


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00166  PHAGEIV          30  0.009    Gene IV protein signature.
		>PHAGEIV#Gene IV protein signature.

Length = 426

Score = 30.3 bits (68), Expect = 0.009
Identities = 15/76 (19%), Positives = 29/76 (38%), Gaps = 3/76 (3%)

Query: 105 TRCRVLEVTPLARELIKSFCELPVDYPEGDSAESRLVQVLLDQLRLLPEVAFSLPMPREP 164
R ++ + +KS + D + +V D L LP+ ++ +P +
Sbjct: 138 NNVRAKDLIRVVELFVKSNTSKSSNVLSVDGSNLLVVSAPKDILDNLPQFLSTVDLPTDQ 197

Query: 165 RLLRLCQALIDEPTQS 180
L+ + LI E Q
Sbjct: 198 ILI---EGLIFEVQQG 210


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00167  PF09025          30  0.016    YopR Core
		>PF09025#YopR Core

Length = 143

Score = 29.6 bits (66), Expect = 0.016
Identities = 26/90 (28%), Positives = 33/90 (36%), Gaps = 10/90 (11%)

Query: 134 LPFEQLL---RPAIELARDGFPVSPVIARLWQSGLDKFRAALPQRPELRAWFDEFLIDGR 190
L FEQ L PA G + RL Q + R EL+A L GR
Sbjct: 30 LAFEQALGGEPPAAGRRLAGLENGALGERLLQRFAQPLQGLEADRLELKAMLRAELPLGR 89

Query: 191 APRA------GEVFRQPGQADTLDELARSQ 214
+ G V PG + L +LAR +
Sbjct: 90 QQQTFLLQLLGAVEHAPG-GEYLAQLARRE 118


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00168  CHANNELTSX       46  7e-08    Nucleoside-specific channel-forming protein Tsx signa...
		>CHANNELTSX#Nucleoside-specific channel-forming protein Tsx

signature.
Length = 294

Score = 45.8 bits (108), Expect = 7e-08
Identities = 38/135 (28%), Positives = 58/135 (42%), Gaps = 9/135 (6%)

Query: 14 LLAAGQAVAEDHDMTPTHETDSGPLL---WHNESLTYLYGKNFKINPPIQQTFTLEHAS- 69
LLAAG VA + P W ++S+ + + + P I+ LE+ +
Sbjct: 5 LLAAGAVVALSTTFAAGAAENDKPQYLSDWWHQSVNVVGSYHTRFGPQIRNDTYLEYEAF 64

Query: 70 -GWTWGDLFIFFDQ-INYNGKEDAS---NGKNTYYGEITPRLSFGKLTGADLSFGPVKDV 124
W D + + D + + G A N + + EI PR S KLT DLSFGP K+
Sbjct: 65 AKKDWFDFYGYIDAPVFFGGNSTAKGIWNKGSPLFMEIEPRFSIDKLTNTDLSFGPFKEW 124

Query: 125 LLAGTYEFGEGDTEA 139
A Y + G ++
Sbjct: 125 YFANNYIYDMGRNDS 139


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00169  HTHTETR          66  2e-15    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 66.2 bits (161), Expect = 2e-15
Identities = 32/219 (14%), Positives = 68/219 (31%), Gaps = 30/219 (13%)

Query: 15 KPAGRIRQKNEEAILAAAEEEFARHGFKGTSMNTIAQNVGLPKANLHYYFGNKLGLYTAV 74
+ + Q+ + IL A F++ G TS+ IA+ G+ + ++++F +K L++ +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 75 LSNILELWDSTFNTLGVD--DDPAEALARYIRAKMEFSRRYPLASRIFA----------- 121
DP L + +E + +
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 122 MEIISGGECLTAHFNQDYRSWFRGRAAVFEAWIAAGRMDP-VDPVHLIFLLWGSTQHYAD 180
M ++ + + + + I A + + ++ G Y
Sbjct: 123 MAVVQQAQ---RNLCLESYDRI---EQTLKHCIEAKMLPADLMTRRAAIIMRG----YIS 172

Query: 181 FASQIGLVTGR-KRMSRQDFAAAADNLVRIILKGCGLTP 218
GL+ D A + V I+L+ L P
Sbjct: 173 -----GLMENWLFAPQSFDLKKEARDYVAILLEMYLLCP 206


82  PAKAF_00369  PAKAF_00376  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias    %GCBias  Product
PAKAF_00369       0       12   1.143708  probable transcriptional regulator
PAKAF_00371       1       14   1.609401  hydrolase
PAKAF_00372       1       14   0.946179  hypothetical protein
PAKAF_00373       1       11   0.577516  16S rRNA (guanine(966)-N(2))-methyltransferase
PAKAF_00374       1       11  -0.169161  insulinase family protein
PAKAF_00375       1       10   0.183384  probable zinc protease
PAKAF_00376       2        9   0.441084  signal recognition particle receptor FtsY
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00369  HTHTETR          59  5e-13    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 59.3 bits (143), Expect = 5e-13
Identities = 30/170 (17%), Positives = 63/170 (37%), Gaps = 11/170 (6%)

Query: 7 TRDRIAQASLELFNAQGERSVTTNHIATHLGISPGNLYYHYPNKQAIIAELFAEYESHVE 66
TR I +L LF+ QG S + IA G++ G +Y+H+ +K + +E++ ES++
Sbjct: 12 TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIG 71

Query: 67 SFLRLPEGRGLTVDDKTF--YLEALLAAMWRYRFLHRDLEHLLESD------PELAARYR 118
+ + L +L + +E + + R
Sbjct: 72 ELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQR 131

Query: 119 AFAQRCLVNAKAIYRGFTEAGILR-MNETQLEALTLNAWI--ILTSWVRF 165
+ + EA +L T+ A+ + +I ++ +W+
Sbjct: 132 NLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFA 181


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00371  PF06057          29  0.024    Type IV secretory pathway VirJ component
		>PF06057#Type IV secretory pathway VirJ component

Length = 243

Score = 29.0 bits (65), Expect = 0.024
Identities = 14/73 (19%), Positives = 28/73 (38%), Gaps = 15/73 (20%)

Query: 79 GLQRALLERGWASVALN-----WRGCSGEPNRLPRGYHSGVSDDLAEVVAHLRARRPQAP 133
+ L ++GW V + W+ + P+ V+ D ++ +A
Sbjct: 69 AVGGILQQQGWPVVGWSSLKYYWK------QKDPKD----VTQDTLAIIDKYQAEFGTQK 118

Query: 134 LYAVGYSLGGNVL 146
+ +GYS G V+
Sbjct: 119 VILIGYSFGAEVI 131


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00374  PHPHTRNFRASE     34  0.002    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase

signature.
Length = 572

Score = 33.6 bits (77), Expect = 0.002
Identities = 27/109 (24%), Positives = 46/109 (42%), Gaps = 16/109 (14%)

Query: 228 PTISREQLQAFHKKAYAAGN--VVIALVGDLS--RQEAEAIAAEVSKALPQGPALAKTVQ 283
I R QL+A +A GN V+ ++ L RQ + E K L +G ++ +++
Sbjct: 368 QDIFRTQLRAL-LRASTYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGVDVSDSIE 426

Query: 284 P----ETPKPGLT------HIDFPSEQTH-LMLAQLGIDRQDPDYAALY 321
E P + +DF S T+ L+ + DR + + LY
Sbjct: 427 VGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLY 475


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00376  PF03544          45  2e-07    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 45.4 bits (107), Expect = 2e-07
Identities = 24/105 (22%), Positives = 33/105 (31%)

Query: 38 VEPVSETAAAEQRAPADDVAQSLTEQPGRQQPSAAEPAEPAPVAEAPLASDEPASAEEHS 97
V V E A Q VA + E P QP EP P E + A
Sbjct: 37 VHQVIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEK 96

Query: 98 PRPEAPVAQPEPILAAEPEPEPEPEPEPEPVAPLAAAPAVSEPAT 142
P+P+ +P+ + +P APA +T
Sbjct: 97 PKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSST 141



Score = 33.0 bits (75), Expect = 0.002
Identities = 22/111 (19%), Positives = 34/111 (30%), Gaps = 14/111 (12%)

Query: 27 PQAGEQPADQPVEPVSETAAAEQRAPADDVAQSLTEQPGRQQPSAAEPAEPAPVAEAPLA 86
PQA + P + VEP E +P ++ P E +P P +
Sbjct: 63 PQAVQPPPEPVVEPEPEPEP--------------IPEPPKEAPVVIEKPKPKPKPKPKPV 108

Query: 87 SDEPASAEEHSPRPEAPVAQPEPILAAEPEPEPEPEPEPEPVAPLAAAPAV 137
+ P P + E A P +PV +A+ P
Sbjct: 109 KKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRA 159


83  PAKAF_00406  PAKAF_00418  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias    %GCBias  Product
PAKAF_00406      -2       12  -0.001064  aspartate carbamoyltransferase
PAKAF_00407       0       14  -0.700210  transcriptional regulator PyrR
PAKAF_00408       0       14  -1.417720  Holliday junction resolvase RuvX
PAKAF_00409       0       12  -1.810944  YqgE/AlgH family protein
PAKAF_00410       2       12  -2.162958  TonB3
PAKAF_00411       2       12  -2.404058  glutathione synthetase
PAKAF_00412       1        9   0.717746  twitching motility protein PilG
PAKAF_00413       1        9   1.167160  twitching motility protein PilH
PAKAF_00414       1        9   1.479464  twitching motility protein PilI
PAKAF_00415       1        8   1.788615  twitching motility protein PilJ
PAKAF_00416       1        8   2.516030  methyltransferase PilK
PAKAF_00417       1        7   2.981021  component of chemotactic signal transduction
PAKAF_00418       2        7   3.802520  probable methylesterase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00406  TYPE3IMPPROT     29  0.032    Type III secretion system inner membrane P protein ...
		>TYPE3IMPPROT#Type III secretion system inner membrane P protein

family signature.
Length = 224

Score = 28.6 bits (64), Expect = 0.032
Identities = 10/41 (24%), Positives = 17/41 (41%)

Query: 293 ADGAQSVILNQVTYGIAIRMAVLSMAMSGQNTQRQLEQEDA 333
A G Q + N G+A+ +++ M + E ED
Sbjct: 40 ALGLQQIPSNMTLNGVALLLSMFVMWPIMHDAYVYFEDEDV 80


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00410  PF03544          60  1e-12    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 60.4 bits (146), Expect = 1e-12
Identities = 31/183 (16%), Positives = 58/183 (31%), Gaps = 14/183 (7%)

Query: 117 APFQDNQVKKVAPPAT--------PKQARSEEAPKAAVTTTRQRQQKAPSKTQAQKAEQV 168
AP Q V VAP P + E P+ ++ + K +
Sbjct: 45 APAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPK 104

Query: 169 AKPAPHFDSTQLSAEIASLEADLAKEQQAYAKRPRIHRLSAASTMRDKGAWYKEDWRKKI 228
KP Q ++ + P S A+ K + +
Sbjct: 105 PKPVK--KVEQPKRDVKP--VESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRAL 160

Query: 229 ERIGNLNYPDEARRQKLYGSLRLLVSINRDGTIYEVQVLESSGEPILDQAAQRIVRLAAP 288
R YP A+ ++ G +++ + DG + VQ+L + + ++ + +R
Sbjct: 161 SRN-QPQYPARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNAMR-RWR 218

Query: 289 YAP 291
Y P
Sbjct: 219 YEP 221


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00412  HTHFIS           73  3e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 72.9 bits (179), Expect = 3e-18
Identities = 32/117 (27%), Positives = 51/117 (43%), Gaps = 2/117 (1%)

Query: 6 DGLKVMVIDDSKTIRRTAETLLKKVGCDVITAIDGFDALAKIADTHPNIIFVDIMMPRLD 65
G ++V DD IR L + G DV + IA +++ D++MP +
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 66 GYQTCALIKNNSAFKSTPVIMLSSKDGLFDKAKGRIVGSDQYLTKPFSKEELLGAIK 122
+ IK A PV+++S+++ K G+ YL KPF EL+G I
Sbjct: 62 AFDLLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00413  HTHFIS           80  8e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.9 bits (197), Expect = 8e-21
Identities = 33/119 (27%), Positives = 51/119 (42%), Gaps = 2/119 (1%)

Query: 2 ARILIVDDSPTEMYKLTAMLEKHGHQVLKAENGGDGVALARQEKPDVVLMDIVMPGLNGF 61
A IL+ DD L L + G+ V N D+V+ D+VMP N F
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 QATRQLTKDAETSAIPVIIVTTKDQETDKVWGKRQGARDYLTKPVDEETLLKTINAVLA 120
++ K +PV++++ ++ + +GA DYL KP D L+ I LA
Sbjct: 64 DLLPRIKKARPD--LPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00417  HTHFIS           68  2e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 68.3 bits (167), Expect = 2e-13
Identities = 26/113 (23%), Positives = 54/113 (47%), Gaps = 2/113 (1%)

Query: 2359 VMVVDDSVTVRKVTTRLLERNGMNVLTAKDGVDAIAQLQEHRPDILLLDIEMPRMDGFEV 2418
++V DD +R V + L R G +V + + D+++ D+ MP + F++
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 2419 ATLVRHDERLGNLPIIMITSRTGEKHRERALGIGVNQYLGKPYQETELLEAIQ 2471
++ + +LP+++++++ +A G YL KP+ TEL+ I
Sbjct: 66 LPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00418  HTHFIS           30  0.014    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.2 bits (68), Expect = 0.014
Identities = 21/82 (25%), Positives = 34/82 (41%), Gaps = 3/82 (3%)

Query: 7 PRVAVIADTSLQRHVLQQALLGHGYEVVLNADPARVDDAALECAPDLWLVDLTQQDDS-- 64
+ V D + R VL QAL GY+V + ++ A + DL + D+ D++
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 65 PLLDSLLEQD-RAPVLFGEGHA 85
LL + + PVL
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQN 85


84  PAKAF_00425  PAKAF_00432  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias    %GCBias  Product
PAKAF_00425       0       13  -1.989008  flavin monoamine oxidase family protein
PAKAF_00426       0       13  -1.628526  cytochrome b
PAKAF_00427       0       12  -1.006458  PasP
PAKAF_00428       0       10  -1.113302  multidrug resistance operon repressor MexR
PAKAF_00429       0        9  -0.966780  Resistance-Nodulation-Cell Division (RND)
PAKAF_00430      -1        9  -0.940218  Resistance-Nodulation-Cell Division (RND)
PAKAF_00431      -1       10  -0.637264  Major intrinsic multiple antibiotic resistance
PAKAF_00432      -1       10  -0.858941  probable ATP-dependent RNA helicase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00425  FLGFLGJ          30  0.016    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 30.5 bits (68), Expect = 0.016
Identities = 24/128 (18%), Positives = 43/128 (33%), Gaps = 6/128 (4%)

Query: 171 NLSPTAR----LLVNQRIRSRYDEPSRLSLLYLAQQGRAYRGVDDRDLRAARLPGGSQVL 226
N+ P AR + V ++S D + L+ ++ R Y + D+ + G L
Sbjct: 32 NIRPVARQVEGMFVQMMLKSMRDALPK-DGLFSSEHTRLYTSMYDQQIAQQMTAGKGLGL 90

Query: 227 AEAFVKQIKTIKTKSKVSSIVQAKDGVAVKAGSETYKADYVVLAVPLKALGQIQMTPSLS 286
AE VKQ+ + + S+ A ++ L P S
Sbjct: 91 AEMMVKQMTPEQPLPEEST-PAAPMKFPLETVVRYQNQALSQLVQKAVPRNYDDSLPGDS 149

Query: 287 GTQMSALK 294
++ L
Sbjct: 150 KAFLAQLS 157


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00429  RTXTOXIND        47  8e-08    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 47.1 bits (112), Expect = 8e-08
Identities = 24/93 (25%), Positives = 43/93 (46%), Gaps = 1/93 (1%)

Query: 62 RIAEVRPQVNGIILKRLFKEGSDVKAGQQLYQIDPATYEADYQSAQANLASTQEQAQRYK 121
R E++P N I+ + + KEG V+ G L ++ EAD Q++L + + RY+
Sbjct: 95 RSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQ 154

Query: 122 LLVADQAVSKQQYADANA-AYLQSKAAVEQARI 153
+L ++K Y Q+ + E R+
Sbjct: 155 ILSRSIELNKLPELKLPDEPYFQNVSEEEVLRL 187



Score = 42.9 bits (101), Expect = 1e-06
Identities = 42/268 (15%), Positives = 93/268 (34%), Gaps = 37/268 (13%)

Query: 37 EVGIVTLEAQTVTLNTELPGRTNAFRIAEVRPQVNGIILKRLFKEGSDVKAGQQLYQIDP 96
E+ + A+ +T+ + N R+ + R L + + K +
Sbjct: 206 ELNLDKKRAERLTVLARINRYENLSRVEKSRLD----DFSSLLHKQAIAKHAVLEQENKY 261

Query: 97 ATYEADYQSAQANLASTQEQAQRYK--LLVADQAVSKQ---QYADANAAYLQSKAAVEQA 151
+ + ++ L + + K + Q + + + +
Sbjct: 262 VEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKN 321

Query: 152 RINLRYTKVLSPISGRIGRSAV-TEGALVTNGQANAMATVQQLDPIYVDVTQPSTALLRL 210
+ + + +P+S ++ + V TEG +VT + M V + D + V + + +
Sbjct: 322 EERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE-TLMVIVPEDDTLEVTALVQNKDIGFI 380

Query: 211 RRELASGQLERAGDNAAKVSLKLE--DGSQYP-LEGRLE--FSEVSVDEGTGSVT--IRA 263
GQ +K+E ++Y L G+++ + D+ G V I +
Sbjct: 381 N----VGQ---------NAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIIS 427

Query: 264 V------FPNPNNELLPGMFVHAQLQEG 285
+ N N L GM V A+++ G
Sbjct: 428 IEENCLSTGNKNIPLSSGMAVTAEIKTG 455


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00430  ACRIFLAVINRP   1352  0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1352 bits (3501), Expect = 0.0
Identities = 691/1034 (66%), Positives = 838/1034 (81%), Gaps = 3/1034 (0%)

Query: 1 MSKFFIDRPIFAWVIALVIMLAGGLSILSLPVNQYPAIAPPAIAVQVSYPGASAETVQDT 60
M+ FFI RPIFAWV+A+++M+AG L+IL LPV QYP IAPPA++V +YPGA A+TVQDT
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VVQVIEQQMNGIDNLRYISSESNSDGSMTITVTFEQGTDPDIAQVQVQNKLQLATPLLPQ 120
V QVIEQ MNGIDNL Y+SS S+S GS+TIT+TF+ GTDPDIAQVQVQNKLQLATPLLPQ
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 EVQRQGIRVTKAVKNFLMVVGVVSTDGSMTKEDLSNYIVSNIQDPLSRTKGVGDFQVFGS 180
EVQ+QGI V K+ ++LMV G VS + T++D+S+Y+ SN++D LSR GVGD Q+FG+
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 QYSMRVWLDPAKLNSYQLTPGDVSSAIQAQNVQISSGQLGGLPAVKGQQLNATIIGKTRL 240
QY+MR+WLD LN Y+LTP DV + ++ QN QI++GQLGG PA+ GQQLNA+II +TR
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 241 QTAEQFENILLKVNPDGSQVRLKDVADVGLGGQDYSINAQFNGSPASGIAIKLATGANAL 300
+ E+F + L+VN DGS VRLKDVA V LGG++Y++ A+ NG PA+G+ IKLATGANAL
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 301 DTAKAIRQTIANLEPFMPQGMKVVYPYDTTPVVSASIHEVVKTLGEAILLVFLVMYLFLQ 360
DTAKAI+ +A L+PF PQGMKV+YPYDTTP V SIHEVVKTL EAI+LVFLVMYLFLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 361 NFRATLIPTIAVPVVLLGTFGVLAAFGFSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420
N RATLIPTIAVPVVLLGTF +LAAFG+SINTLTMFGMVLAIGLLVDDAIVVVENVERVM
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 421 AEEGLSPREAARKSMGQIQGALVGIAMVLSAVFLPMAFFGGSTGVIYRQFSITIVSAMAL 480
E+ L P+EA KSM QIQGALVGIAMVLSAVF+PMAFFGGSTG IYRQFSITIVSAMAL
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 481 SVIVALILTPALCATMLKPIEKGDHGEHKGGFFGWFNRMFLSTTHGYERGVASILKHRAP 540
SV+VALILTPALCAT+LKP+ H E+KGGFFGWFN F + + Y V IL
Sbjct: 481 SVLVALILTPALCATLLKPVSAE-HHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 541 YLLIYVVIVAGMIWMFTRIPTAFLPDEDQGVLFAQVQTPPGSSAERTQVVVDSMREYLLE 600
YLLIY +IVAGM+ +F R+P++FLP+EDQGV +Q P G++ ERTQ V+D + +Y L+
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 601 KESSSVSSVFTVTGFNFAGRGQSSGMAFIMLKPWEERPGGENSVFELAKRAQMHFFSFKD 660
E ++V SVFTV GF+F+G+ Q++GMAF+ LKPWEER G ENS + RA+M +D
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659

Query: 661 AMVFAFAPPSVLELGNATGFDLFLQDQAGVGHEVLLQARNKFLMLAAQNPA-LQRVRPNG 719
V F P+++ELG ATGFD L DQAG+GH+ L QARN+ L +AAQ+PA L VRPNG
Sbjct: 660 GFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNG 719

Query: 720 MSDEPQYKLEIDDEKASALGVSLADINSTVSIAWGSSYVNDFIDRGRVKRVYLQGRPDAR 779
+ D Q+KLE+D EKA ALGVSL+DIN T+S A G +YVNDFIDRGRVK++Y+Q R
Sbjct: 720 LEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFR 779

Query: 780 MNPDDLSKWYVRNDKGEMVPFNAFATGKWEYGSPKLERYNGVPAMEILGEPAPGLSSGDA 839
M P+D+ K YVR+ GEMVPF+AF T W YGSP+LERYNG+P+MEI GE APG SSGDA
Sbjct: 780 MLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDA 839

Query: 840 MAAVEEIVKQLPKGVGYSWTGLSYEERLSGSQAPALYALSLLVVFLCLAALYESWSIPFS 899
MA +E + +LP G+GY WTG+SY+ERLSG+QAPAL A+S +VVFLCLAALYESWSIP S
Sbjct: 840 MALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVS 899

Query: 900 VMLVVPLGVIGALLATSMRGLSNDVFFQVGLLTTIGLSAKNAILIVEFAKELHE-QGKGI 958
VMLVVPLG++G LLA ++ NDV+F VGLLTTIGLSAKNAILIVEFAK+L E +GKG+
Sbjct: 900 VMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGV 959

Query: 959 VEAAIEACRMRLRPIVMTSLAFILGVVPLAISTGAGSGSQHAIGTGVIGGMVTATVLAIF 1018
VEA + A RMRLRPI+MTSLAFILGV+PLAIS GAGSG+Q+A+G GV+GGMV+AT+LAIF
Sbjct: 960 VEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIF 1019

Query: 1019 WVPLFYVAVSTLFK 1032
+VP+F+V + FK
Sbjct: 1020 FVPVFFVVIRRCFK 1033


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00432  SECA             38  1e-04    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 37.9 bits (88), Expect = 1e-04
Identities = 28/108 (25%), Positives = 49/108 (45%), Gaps = 7/108 (6%)

Query: 212 IEVTPPNTTVERIEQ--RVFRLPAPQKRALLAHLVTVGAWEQ-VLVFTRTKHGANRLAEY 268
V P N + R + V+ A + +A++ + A Q VLV T + + ++
Sbjct: 409 TVVVPTNRPMIRKDLPDLVYMTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSELVSNE 468

Query: 269 LTKHGLPAAAIHG-NKSQNARTKALADFKANDVRILVATDIAARGLDI 315
LTK G+ ++ + A A A + A + +AT++A RG DI
Sbjct: 469 LTKAGIKHNVLNAKFHANEAAIVAQAGYPAA---VTIATNMAGRGTDI 513


85PAKAF_00605PAKAF_00611N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
PAKAF_0060509-0.472595DUF3530 family protein
PAKAF_00606-28-1.078078probable two-component sensor
PAKAF_00607-210-2.041188probable two-component response regulator
PAKAF_00608-210-2.211652probable binding protein component of ABC
PAKAF_00609-211-2.089417probable ATP-binding component of ABC
PAKAF_00610-112-2.345600probable binding protein component of ABC
PAKAF_00611112-2.113112probable permease of ABC transporter
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00605  CHANLCOLICIN  29     0.030    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 28.9 bits (64), Expect = 0.030
Identities = 31/124 (25%), Positives = 47/124 (37%), Gaps = 17/124 (13%)

Query: 120 AGWQTLSLALPDPQSTAPVTRPAESAASASADKDA---------------SAADSASKPD 164
A W T L + A AE+ A A A++DA +A+ + S +
Sbjct: 55 AKWSTAQLKKTQAEQAARAKAAAEAQAKAKANRDALTQRLKDIVNEALRHNASRTPSATE 114

Query: 165 VKGESGNA--PAPESTAEAGSGEPAQSEDQAPPPAIDPVEQRKAHAERVMARLQASIDLA 222
+ + A E A + E A+ E +A A EQR+ ER A + + LA
Sbjct: 115 LAHANNAAMQAEDERLRLAKAEEKARKEAEAAEKAFQEAEQRRKEIEREKAETERQLKLA 174

Query: 223 LQHE 226
E
Sbjct: 175 EAEE 178


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
PAKAF_00607  HTHFIS  56     1e-11    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 55.6 bits (134), Expect = 1e-11
Identities = 23/85 (27%), Positives = 38/85 (44%)

Query: 11 ATNGEQLLETLRGTPCEVVLLDISMPGVNGLEAIPRIRALNEPPAILVLSMHDEAQMAAR 70
+N L + ++V+ D+ MP N + +PRI+ +LV+S + A +
Sbjct: 33 TSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIK 92

Query: 71 ALKIGAAGYATKDSDPALLLTAIRR 95
A + GA Y K D L+ I R
Sbjct: 93 ASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_00609  PF05272  34     0.001    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 33.9 bits (77), Expect = 0.001
Identities = 15/88 (17%), Positives = 26/88 (29%), Gaps = 20/88 (22%)

Query: 40 LTLLGPSGSGKTTSLMMLAGFETPTAGEILLAGRSINNVPPHKRDIGMVFQNYALFPHMT 99
+ L G G GK+T + L G + + + +D
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDI---------GTGKDSYEQIAGIVA---YE 646

Query: 100 VAENLAFPLSVRGMSKTDVKERVKRALS 127
++E + + D E VK S
Sbjct: 647 LSE-------MTAFRRADA-EAVKAFFS 666


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_00611  PF07675  29     0.037    Cleaved Adhesin
		>PF07675#Cleaved Adhesin

Length = 1358

Score = 29.3 bits (65), Expect = 0.037
Identities = 10/34 (29%), Positives = 14/34 (41%)

Query: 132 EAPASYKDAMEQLDERWGDPAYWQVIRRNASSYT 165
APAS + + D G PA W+ I +
Sbjct: 1072 NAPASKRAEVLNEDFENGIPASWKTIDADGDGNN 1105


86  PAKAF_00746  PAKAF_00753  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias   Product
PAKAF_00746   0      10       1.264348  phenazine biosynthesis protein PhzD
PAKAF_00747  -1       9       0.863348  phenazine biosynthesis protein PhzC
PAKAF_00748  -1       7       0.344214  probable phenazine biosynthesis protein
PAKAF_00749  -2       8       1.360203  probable phenazine biosynthesis protein
PAKAF_00750  -1       9       1.858945  probable phenazine-specific methyltransferase
PAKAF_00751  -1      12       2.502147  probable outer membrane protein precursor
PAKAF_00752  -1      11       2.342783  probable Resistance-Nodulation-Cell Division
PAKAF_00753  -2       9       3.388992  probable Resistance-Nodulation-Cell Division
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00746  ISCHRISMTASE  351    e-125    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 351 bits (901), Expect = e-125
Identities = 102/207 (49%), Positives = 136/207 (65%), Gaps = 2/207 (0%)

Query: 3 GIPEITAYPLPTAQQLPANLARWSLEPRRAVLLVHDMQRYFLRPLPESLRA--GLVANAA 60
IP I Y +PTA +P N W +P RAVLL+HDMQ YF+ L AN
Sbjct: 2 AIPAIQPYQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANIR 61

Query: 61 RLRRWCVEQGVQIAYTAQPGSMTEEQRGLLKDFWGPGMRASPADREVVEELAPGPDDWLL 120
+L+ CV+ G+ + YTAQPGS + R LL DFWGPG+ + P + +++ ELAP DD +L
Sbjct: 62 KLKNQCVQLGIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLVL 121

Query: 121 TKWRYSAFFHSDLLQRMRAAGRDQLVLCGVYAHVGVLISTVDAYSNDIQPFLVADAIADF 180
TKWRYSAF ++LL+ MR GRDQL++ G+YAH+G L++ +A+ DI+ F V DA+ADF
Sbjct: 122 TKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVADF 181

Query: 181 SEAHHRMALEYAASRCAMVVTTDEVLE 207
S H+MALEYAA RCA V TD +L+
Sbjct: 182 SLEKHQMALEYAAGRCAFTVMTDSLLD 208


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
PAKAF_00751  RTXTOXIND  29     0.033    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.4 bits (66), Expect = 0.033
Identities = 18/120 (15%), Positives = 35/120 (29%), Gaps = 4/120 (3%)

Query: 334 LGSASRAFEL--APSVSWPAF-RLGNVRARLRAVEAQ-SDAALARYQRSLLLAQEDVGNA 389
SR+ EL P + P NV + +Q + ++
Sbjct: 153 YQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKK 212

Query: 390 LNQLAEHQRRLVALFQSATHGANALEIANERYRAGAGSYLAVLENQRALYQIREELAQAE 449
+ R+ + + L+ + A + AVLE + + EL +
Sbjct: 213 RAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYK 272


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00752  ACRIFLAVINRP  804    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 804 bits (2077), Expect = 0.0
Identities = 315/1029 (30%), Positives = 529/1029 (51%), Gaps = 29/1029 (2%)

Query: 5 DLFVRRPVLALVVSTLILLLGLFSLGKLPIRQYPLLESSTITVTTEYPGASADLMQGFVT 64
+ F+RRP+ A V++ ++++ G ++ +LP+ QYP + ++V+ YPGA A +Q VT
Sbjct: 3 NFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVT 62

Query: 65 QPIAQAVSSVEGIDYLSSTSVQ-GRSVVTIRMLLNRDSTQAMTETMAKVNSVRYKLPERA 123
Q I Q ++ ++ + Y+SSTS G +T+ D A + K+ LP+
Sbjct: 63 QVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEV 122

Query: 124 YDSVIERSSGETTAVAYVGFSS--KTLPIPALTDYLSRVVEPMFSSIDGVAKVQTFGGQR 181
I ++ + GF S ++DY++ V+ S ++GV VQ FG Q
Sbjct: 123 QQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY 182

Query: 182 LAMRLWLDADRLAGRGLTASDVAEAIRRNNYQAAPG------MVKGQYVLSNVRVNTDLT 235
AMR+WLDAD L LT DV ++ N Q A G + GQ + +++ T
Sbjct: 183 -AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241

Query: 236 NVDDFREMVIRNDGNG-LVRLRDVGTVELGAAATETSALMDGDPAVHLGLFPTPTGNPLV 294
N ++F ++ +R + +G +VRL+DV VELG A ++G PA LG+ N L
Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301

Query: 295 IVDGIRKLLPEIQKTLPPDVRVDLAYETSRFIQASIDEVVRTLVEALLIVVLVIYLCLGS 354
I+ L E+Q P ++V Y+T+ F+Q SI EVV+TL EA+++V LV+YL L +
Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361

Query: 355 LRSVLIPVATIPLSMLGAAALMLAFGFSVNLLTLLAMVLAIGLVVDDAIVVVENVHRHIE 414
+R+ LIP +P+ +LG A++ AFG+S+N LT+ MVLAIGL+VDDAIVVVENV R +
Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421

Query: 415 EGKS-PVAAALIGAREVAGPVIAMTITLAAVYTPIGLMGGLTGALFREFALTLAGAVIVS 473
E K P A ++ G ++ + + L+AV+ P+ GG TGA++R+F++T+ A+ +S
Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481

Query: 474 GVVALTLSPVMSSLLLQA-----HQNEGRMGRAAEWFFGGLTRRYGQVLEFSLDHRWLTG 528
+VAL L+P + + LL+ H+N+G F Y + L
Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541

Query: 529 GLALLVCISLPLLYSMPKRELAPTEDQAAVLTAIKAPQHANLDYVELFARKLDQVYTSIP 588
+ L+ + +L+ P EDQ LT I+ P A + + ++ Y
Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601

Query: 589 E------TVSTWIINGTDGPAASFGGINLAAWEKRERD---ASAIQSELQGKVGDVEGSS 639
+ A ++L WE+R D A A+ + ++G +
Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661

Query: 640 IFAFQLAA--LPGSTGGLPVQMVLRSPQDYPVLYRTMEEIKQKARQSGLFVV-VDSDLDY 696
+ F + A G+ G +++ ++ + L + ++ A Q +V V +
Sbjct: 662 VIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLE 721

Query: 697 NNPVVQVRIDRAKANSLGIRMQDIGESLAVLVGENYVNRFGMEGRSYDVIPQSLRDQRFT 756
+ ++ +D+ KA +LG+ + DI ++++ +G YVN F GR + Q+ R
Sbjct: 722 DTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRML 781

Query: 757 PQALARQFVRTQDGNLVPLSTVVRVELQVEPNKLIQFDQQNAATLQAIPAPGVSMGQAVA 816
P+ + + +VR+ +G +VP S +L +++ + +Q APG S G A+A
Sbjct: 782 PEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMA 841

Query: 817 FLDDVARGLPAGFSHDWQSDSRQYTQEGNTLVFAFLAALVVIYLVLAAQYESLADPLIIL 876
++++A LPAG +DW S Q GN + VV++L LAA YES + P+ ++
Sbjct: 842 LMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVM 901

Query: 877 ITVPLSICGALLPLALGYATMNIYTQIGLVTLIGLISKHGILMVEFANELQLHERLDRRA 936
+ VPL I G LL L ++Y +GL+T IGL +K+ IL+VEFA +L E
Sbjct: 902 LVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVE 961

Query: 937 AILRAAQIRLRPVLMTTAAMVFGLVPLLFASGAGAASRFGLGVVIVSGMLVGTLFTLFVL 996
A L A ++RLRP+LMT+ A + G++PL ++GAG+ ++ +G+ ++ GM+ TL +F +
Sbjct: 962 ATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFV 1021

Query: 997 PTVYTLLAR 1005
P + ++ R
Sbjct: 1022 PVFFVVIRR 1030



Score = 94.5 bits (235), Expect = 1e-21
Identities = 70/327 (21%), Positives = 136/327 (41%), Gaps = 13/327 (3%)

Query: 701 VQVRIDRAKANSLGIRMQDIGESLAV----LVGENYVNRFGMEGRSYDVIPQSLRDQRFT 756
+++ +D N + D+ L V + + G+ + + + +
Sbjct: 184 MRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIA-QTRFKN 242

Query: 757 PQALARQFVRT-QDGNLVPLSTVVRVELQVEP-NKLIQFDQQNAATLQAIPAPGVSMGQA 814
P+ + +R DG++V L V RVEL E N + + + + AA L A G +
Sbjct: 243 PEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDT 302

Query: 815 V----AFLDDVARGLPAGFSHDWQSDSRQYTQEG-NTLVFAFLAALVVIYLVLAAQYESL 869
A L ++ P G + D+ + Q + +V A+++++LV+ +++
Sbjct: 303 AKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNM 362

Query: 870 ADPLIILITVPLSICGALLPLALGYATMNIYTQIGLVTLIGLISKHGILMVEFANELQLH 929
LI I VP+ + G LA ++N T G+V IGL+ I++VE + +
Sbjct: 363 RATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMME 422

Query: 930 ERLDRRAAILRAAQIRLRPVLMTTAAMVFGLVPLLFASGAGAASRFGLGVVIVSGMLVGT 989
++L + A ++ ++ + +P+ F G+ A + IVS M +
Sbjct: 423 DKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSV 482

Query: 990 LFTLFVLPTV-YTLLARNHAEVDKSPR 1015
L L + P + TLL AE ++
Sbjct: 483 LVALILTPALCATLLKPVSAEHHENKG 509


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
PAKAF_00753  RTXTOXIND  46     1e-07    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 46.4 bits (110), Expect = 1e-07
Identities = 18/106 (16%), Positives = 43/106 (40%), Gaps = 2/106 (1%)

Query: 65 AGRQVQVAAEAAGRITRIAFESGQQVQQGQLLVQLNDAVEQAELIRLKAQLRNAEILHAR 124
+GR ++ + I + G+ V++G +L++L +A+ ++ ++ L A + +
Sbjct: 93 SGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARL--EQ 150

Query: 125 ARKLVERNVASQEQLDNAVAARDMALGAVRQTQALIDQKAIRAPFS 170
R + +L + V + + L I+ FS
Sbjct: 151 TRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFS 196



Score = 40.6 bits (95), Expect = 9e-06
Identities = 25/134 (18%), Positives = 60/134 (44%), Gaps = 6/134 (4%)

Query: 102 AVEQAELIRLKAQLRNAEILHARARKLVERNVASQ-EQLDNAVAARDMALGAVRQTQALI 160
V +++L ++++++ +A+ + +L + + + Q + + + L + Q
Sbjct: 269 RVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQAS 328

Query: 161 DQKAIRAPFSGQLGIRRVH-LGQYLGVAEPVASLV-DARTLKSNFSLDESTSPELKLGQP 218
IRAP S ++ +VH G + AE + +V + TL+ + + +GQ
Sbjct: 329 V---IRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQN 385

Query: 219 LEVLVDAYPGRSFP 232
+ V+A+P +
Sbjct: 386 AIIKVEAFPYTRYG 399


87  PAKAF_00795  PAKAF_00802  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias   Product
PAKAF_00795   0      12       4.400236  probable acetyltransferase
PAKAF_00796   0      11       5.455942  probable transcriptional regulator
PAKAF_00797   0      10       5.283625  hypothetical protein
PAKAF_00798   1       9       5.289847  hypothetical protein
PAKAF_00799   3      11       5.413996  probable short-chain dehydrogenase
PAKAF_00800  -2       9       3.704405  ferric enterobactin transport protein FepG
PAKAF_00801  -2      11       2.791679  ferric enterobactin transport protein FepD
PAKAF_00802  -1      10       1.929335  ferrienterobactin-binding periplasmic protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00795  SACTRNSFRASE  41     5e-07    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 40.7 bits (95), Expect = 5e-07
Identities = 18/63 (28%), Positives = 27/63 (42%), Gaps = 2/63 (3%)

Query: 76 RSTWAAQDVCYLEDLYVSPDVRGQQIGKQLIEYVRRQAEERRCARLYWHTQESNHRAQRL 135
RS W +ED+ V+ D R + +G L+ A+E L TQ+ N A
Sbjct: 83 RSNW--NGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHF 140

Query: 136 YDR 138
Y +
Sbjct: 141 YAK 143


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00799  DHBDHDRGNASE  119    6e-35    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 119 bits (299), Expect = 6e-35
Identities = 75/258 (29%), Positives = 117/258 (45%), Gaps = 32/258 (12%)

Query: 5 RTALVTGATRGIGLALARRLAASGWSVVGI-----------------ARHASDDFPGRLL 47
+ A +TGA +GIG A+AR LA+ G + + ARHA + FP
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHA-EAFP---- 63

Query: 48 CCDLADPAQTAETLRGLLSESA-VDALVNNAGIALPQSLENLDLAALQQVFDLNVRVAVQ 106
D+ D A E + E +D LVN AG+ P + +L + F +N
Sbjct: 64 -ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 107 LAQACLPGLKRSPAGRIVNLCSRAIHGAR-ERTAYAAAKSALVGVTRTWALELAPLGITV 165
+++ + +G IV + S R AYA++K+A V T+ LELA I
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 166 NAVAPGPIETELFRQTRPVGGEEERRILST-------IPMQRLGRPDEVAALIEFLLSEG 218
N V+PG ET++ E+ I + IP+++L +P ++A + FL+S
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 219 ASFVTGQVIGVDGGGSLG 236
A +T + VDGG +LG
Sbjct: 243 AGHITMHNLCVDGGATLG 260


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_00801  PF04335  30     0.010    VirB8 type IV secretion protein
		>PF04335#VirB8 type IV secretion protein

Length = 227

Score = 30.2 bits (68), Expect = 0.010
Identities = 13/43 (30%), Positives = 19/43 (44%), Gaps = 3/43 (6%)

Query: 7 RRRRLRAWGLLAGALLLALA---ALASLALGSRPVPLAVTLDA 46
R + AW + A LA A A+A+L P +T+D
Sbjct: 29 ERSKKLAWVVAGVAGALATAGVVAVAALTPLKTVEPYVITVDR 71


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00802  FERRIBNDNGPP  37     6e-05    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 36.8 bits (85), Expect = 6e-05
Identities = 53/289 (18%), Positives = 96/289 (33%), Gaps = 28/289 (9%)

Query: 2 PTRRRSALPLLALALSLFA-TLAAAGEPKPARIVSTTPSVTGILLAMDAPLVASAATTPS 60
RR L +AL+ L+ A A P RIV+ +LLA+ A T
Sbjct: 6 LISRRRLLTAMALSPLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGIVPYGVADTINY 65

Query: 61 RLTDAKGFFSQWAKVADQRGVEVLYRNLRFD--IEAVIAQDPDLLVASA---TGADSAAP 115
RL + S+ V+ LR + +E + P +V SA + A
Sbjct: 66 RL-----WVSEPPLPDS-----VIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEMLAR 115

Query: 116 Y-RAELEAQGVPTLVVDYSKHSWQELATELGRHTGLERQAQAAIQRFDAYTAEVAA-AIA 173
+ ++ S E+A L + A+ + +++ + + +
Sbjct: 116 IAPGRGFNFSDGKQPLAMARKSLTEMADLLNL----QSAAETHLAQYEDFIRSMKPRFVK 171

Query: 174 PPQGPVSVVGYNIAGSYSIGRQASPQARLLEALGFQVAELPEALAGKVTRASDFQFISRE 233
P+ + + S +L+ G +P A G+ +S +
Sbjct: 172 RGARPLLLTTLIDPRHMLVFGPNSLFQEILDEYG-----IPNAWQGETNFWG-STAVSID 225

Query: 234 NLPAAIAGDSVFLLGASDDDVQAFLADPVLANLSAVREKRVYALGPSSF 282
L A D + + D+ A +A P+ + VR R + F
Sbjct: 226 RLAAYKDVDVLCFDHDNSKDMDALMATPLWQAMPFVRAGRFQRVPAVWF 274


88  PAKAF_00813  PAKAF_00819  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias   Product
PAKAF_00813   0       9       2.603567  probable short-chain dehydrogenase
PAKAF_00814   1      11       2.331568  transcriptional regulator AcoR
PAKAF_00815   0      13       1.563130  hypothetical protein
PAKAF_00816  -1      13       1.723264  probable transcriptional regulator
PAKAF_00817  -1      13       1.437142  probable outer membrane protein precursor
PAKAF_00818  -1      11       0.875055  probable toxin transporter
PAKAF_00819  -1      10       0.576355  probable secretion protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00813  DHBDHDRGNASE  127    2e-37    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 127 bits (319), Expect = 2e-37
Identities = 75/262 (28%), Positives = 126/262 (48%), Gaps = 14/262 (5%)

Query: 11 LSSRVALVTGAGRGIGRGIALALARAGADVAVADLDPQVAEETAAAIRSLGRRSLALGVD 70
+ ++A +TGA +GIG +A LA GA +A D +P+ E+ +++++ R + A D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 71 VSDGDSVRAMVERVATEFGRLDVAVNNAGVISIRKVAELSLADWDRVMNVNARGVFLCCQ 130
V D ++ + R+ E G +D+ VN AGV+ + LS +W+ +VN+ GVF +
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 131 AELPLMQAQRWGRIVNLSSIAGKVGLPDLAHYCASKFAVIGFSNALAKEVARDGVTVNAL 190
+ M +R G IV + S V +A Y +SK A + F+ L E+A + N +
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 191 CPGIVGTGM----WRGEDGLSGRWRQAGESEAQSWERHQASLLPQGEAQTVEDMGQLVVY 246
PG T M W E+G + + E+ +P + D+ V++
Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTG--------IPLKKLAKPSDIADAVLF 237

Query: 247 LAC--APHVTGQAIAVDGGFSL 266
L A H+T + VDGG +L
Sbjct: 238 LVSGQAGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
PAKAF_00814  HTHFIS  339    e-112    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 339 bits (871), Expect = e-112
Identities = 134/390 (34%), Positives = 192/390 (49%), Gaps = 59/390 (15%)

Query: 273 FDLDALHAAADQAPCLLRGQAGELHVRLSAPRAKARRLEREVPDDAAL---DPRIAESLR 329
FDL L +A L+ P+ + +LE + D L + E R
Sbjct: 106 FDLTELIGIIGRA--------------LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYR 151

Query: 330 LAVRVKDRNLPVLIQGETGAGKEVFARQLHQASARRDKPFVALNCAAIPESLIESELFGY 389
+ R+ +L ++I GE+G GKE+ AR LH RR+ PFVA+N AAIP LIESELFG+
Sbjct: 152 VLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGH 211

Query: 390 VGGAFTGAAAKGMRGLLQQADGGTLFLDEIGDMPLGLQTRLLRVLAEGEVAPLGAARRQA 449
GAFTGA + G +QA+GGTLFLDEIGDMP+ QTRLLRVL +GE +G
Sbjct: 212 EKGAFTGAQTRST-GRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIR 270

Query: 450 VDIQVICATHRDLAALVAAGGFREDLYFRLGGARFELPPLRERSDRLALIRRILDEETAH 509
D++++ AT++DL + G FREDLY+RL LPPLR+R++ + + R ++
Sbjct: 271 SDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEK 330

Query: 510 CGVRI-ELGEAALECLLGYRWPGNVRQLRHVLRYACALCGGATLQLADLPAELRGEGRTP 568
G+ + + ALE + + WPGNVR+L +++R AL + + ELR E P
Sbjct: 331 EGLDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSE--IP 388

Query: 569 ASACESGGGP--------------------------------------ERDALLDALVRH 590
S E E +L AL
Sbjct: 389 DSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTAT 448

Query: 591 RWKPMAAARELGISRATLYRRVRRHGIRMP 620
R + AA LG++R TL +++R G+ +
Sbjct: 449 RGNQIKAADLLGLNRNTLRKKIRELGVSVY 478


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
PAKAF_00817  GPOSANCHOR  31     0.009    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 31.2 bits (70), Expect = 0.009
Identities = 41/184 (22%), Positives = 65/184 (35%), Gaps = 5/184 (2%)

Query: 144 SAALRNAQQLLLAANASQDATLQNTFALAAQAYYDALAAQRSLAASRQVAELAAQNLEAA 203
+A + AA A++ A L+ A A ++L A + E LE A
Sbjct: 209 DSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKA 268

Query: 204 DAKY---RAGAAALSDRLQAQTALSQASLAQVRDEGALSNALGVIALRMGLAPDTPLRLS 260
+A L+A+ A +A A + + + NA +LR L +
Sbjct: 269 LEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNA-NRQSLRRDLDASREAKKQ 327

Query: 261 GELEAQPDTVFVKAIDEMLAEARREHPALLAAQARLKAAAASVEESRAAGRPSLA-LSAN 319
E E Q K + RR+ A A+ +L+A +EE S L +
Sbjct: 328 LEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRD 387

Query: 320 LARS 323
L S
Sbjct: 388 LDAS 391


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
PAKAF_00819  RTXTOXIND  156    3e-45    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 156 bits (397), Expect = 3e-45
Identities = 79/431 (18%), Positives = 175/431 (40%), Gaps = 55/431 (12%)

Query: 22 RPVSFTFLTLLAAAMALLVVGF--FLFGSYTKRSTVSGQLVPASGQVKVHAPQAGIVLRK 79
PVS + M LV+ F + G +T +G+L + ++ + IV
Sbjct: 51 TPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEI 110

Query: 80 FVQEGQAVRRGERLMVLSSERYGSDAGPVQAG--ISRRLEQRRDSLRDELEKLRRLQDD- 136
V+EG++VR+G+ L+ L++ +D Q+ +R + R L +E + +
Sbjct: 111 IVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKL 170

Query: 137 ------------------------------ERDSLTSKVASLQRELTTLAAQTDSQRHLL 166
++ + + E T+ A+ + +L
Sbjct: 171 PDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLS 230

Query: 167 ALASDAAARYQGLMDKGYISMDQLQQRQAELLGQRQTLQGLERERTSLRQQLTERRNELA 226
+ + L+ K I+ + +++ + + L+ + + + ++ + E
Sbjct: 231 RVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQ 290

Query: 227 GLSAR----QANQLAETRRQLSAVEQDLAESEAKRTLL-VTAPESGIATAVLAEA-GQTV 280
++ ++L +T + + +LA++E ++ + AP S + G V
Sbjct: 291 LVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVV 350

Query: 281 DSSRPLLSIVPADTPLQAELYAPSKSIGFIRPGDAVLIRYQAYPYQKFGQYHGKVQSISR 340
++ L+ IVP D L+ +K IGFI G +I+ +A+PY ++G GKV++I+
Sbjct: 351 TTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINL 410

Query: 341 ASVSYAELSSMVGGVPGLGQDGEQLYRLRVTLDDQAVTAYGQPRPLQSGMLLDADILQDT 400
++ Q ++ + +++++ ++ + PL SGM + A+I
Sbjct: 411 DAI--------------EDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGM 456

Query: 401 RRLYEWVLEPL 411
R + ++L PL
Sbjct: 457 RSVISYLLSPL 467


89  PAKAF_00879  PAKAF_00885  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_00879   0      20        0.077940  usher CupB3
PAKAF_00880   0      13        0.509862  chaperone CupB4
PAKAF_00881   0      13        0.062595  adhesive protein CupB5
PAKAF_00882  -1      13       -0.229196  fimbrial subunit CupB6
PAKAF_00883  -1      13        0.175039  probable response regulator
PAKAF_00884  -2      12        1.718087  probable dehydrogenase
PAKAF_00885  -3      10        1.791006  probable nonribosomal peptide synthetase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_00879  PF00577  747    0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 747 bits (1929), Expect = 0.0
Identities = 271/871 (31%), Positives = 415/871 (47%), Gaps = 58/871 (6%)

Query: 80 APAAAVASPAGGLDAPSRRIVFDAQMLALGPGGRSIDTSRFERGDVIEPGRYRLDLLLNS 139
A A A+ A S + F+ + LA D SRFE G + PG YR+D+ LN+
Sbjct: 31 FVACAFAAQAP---LSSAELYFNPRFLA-DDPQAVADLSRFENGQELPPGTYRVDIYLNN 86

Query: 140 RWRGVEEVELRRQPGRESAVFCYDRGLLERAGIDLEKSARGQDRSSARDPLPEGLHCDPL 199
+ +V + V C R L G++ S + L C PL
Sbjct: 87 GYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTA--------SVSGMNLLADDACVPL 138

Query: 200 ERYVPGARVKLDIAEQSVYVSVPSYYLSLDSSKTYVDPASWDSGISAALLNYNSNL-HVR 258
+ A +LD+ +Q + +++P ++S + ++ Y+ P WD GI+A LLNYN + V+
Sbjct: 139 TSMIHDATAQLDVGQQRLNLTIPQAFMS-NRARGYIPPELWDPGINAGLLNYNFSGNSVQ 197

Query: 259 ENHGRSATSGYAGMNAGFNFGRARLRHNGTATWSRRMGS-----HYQRSATYVQTDLPAW 313
G ++ Y + +G N G RLR N T +++ S +Q T+++ D+
Sbjct: 198 NRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPL 257

Query: 314 RAQLLLGENSTSSEFFDAVSFRGVQLSSDDRMLPDSLRYYAPVVRGTASTNARVSVYQRG 373
R++L LG+ T + FD ++FRG QL+SDD MLPDS R +APV+ G A A+V++ Q G
Sbjct: 258 RSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNG 317

Query: 374 YLIYETTVAPGAFALDELQTASYGGDLEVRVTEASGEVRSFIVPFATTVQLLRPGTTRYS 433
Y IY +TV PG F ++++ A GDL+V + EA G + F VP+++ L R G TRYS
Sbjct: 318 YDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYS 377

Query: 434 LTAGRL-NDPSLERRPNMLQGVYQRGLGNDVTAYAGGAFTGSYMSGLMGAALNT-PVGGF 491
+TAG + + + +P Q GL T Y G Y + G N +G
Sbjct: 378 ITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGAL 437

Query: 492 SGDVTLARTEVPGDDRLSGSSYRLAYSKNLPNTGTNFSLLAYRYSTGGYLGLRDAAFMQD 551
S D+T A + +P D + G S R Y+K+L +GTN L+ YRYST GY D + +
Sbjct: 438 SVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRM 497

Query: 552 RVERGEPLE--------------SFSRLRNRLDANISQQLGNGGNLYLNGSSQRYWSGGG 597
E + R +L ++QQLG LYL+GS Q YW
Sbjct: 498 NGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSN 557

Query: 598 RAVNFSVGYSNQWRDVSYSISAQRLRSHYEGFSSGDKRGETSTLFSLNLSIPLGG----- 652
F G + + D+++++S ++ + + + +LN++IP
Sbjct: 558 VDEQFQAGLNTAFEDINWTLSYSLTKNAW--------QKGRDQMLALNVNIPFSHWLRSD 609

Query: 653 --AGRGSPTLSSYLTRDSNSGTQLTSGVSGMLGKRGEASYSLSASHDRDSRQTSKS---A 707
+ + S ++ D N +GV G L + SYS+ + S S A
Sbjct: 610 SKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYA 669

Query: 708 SLDYRLPQVELGSSLSQGPGYRQLSVKAAGGLVAHSGGITAAQTLGETIGLVHAPNARGA 767
+L+YR S +QL +GG++AH+ G+T Q L +T+ LV AP A+ A
Sbjct: 670 TLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDA 729

Query: 768 A-AGYSGSRIDRHGYAVIPNLLPYQLNSVDLDPNGMADEIELRSSSRNVAPTAGAVVRLD 826
+G R D GYAV+P Y+ N V LD N +AD ++L ++ NV PT GA+VR +
Sbjct: 730 KVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAE 789

Query: 827 YPTRVARPLLVDSRMPSGEPLPFAAEVLDAHSGQSVGAVGQGSRLVLRVEQDRGSVRVRW 886
+ RV LL+ + +PLPF A V S QS G V ++ L G V+V+W
Sbjct: 790 FKARVGIKLLMTLT-HNNKPLPFGAMVTSE-SSQSSGIVADNGQVYLSGMPLAGKVQVKW 847

Query: 887 GNEPQQQCLVDYALGPRETTPPVLQLA--CR 915
G E C+ +Y L P + QL+ CR
Sbjct: 848 GEEENAHCVANYQLPPESQQQLLTQLSAECR 878


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_00881  PF05860  79     8e-20    haemagglutination activity domain.
		>PF05860#haemagglutination activity domain.

Length = 117

Score = 79.5 bits (196), Expect = 8e-20
Identities = 31/108 (28%), Positives = 49/108 (45%), Gaps = 9/108 (8%)

Query: 54 LPSGGTVVGGSANGEIHLSGGNSLSVNQKVDKLIANWDSFSVAAGERVIFNQPSSSSIAL 113
LP + I Q L ++ FSV FN P++ +
Sbjct: 9 LPINSNITTEGNTRII-------ERGTQAGSNLFHSFQEFSVPTSGTAFFNNPTNIQNII 61

Query: 114 NRVIGTKASDIQGRIDANG--QVFLVNPNGVLFGRGAQVNVGGLVAST 159
+RV G S+I G I AN +FL+NPNG++FG+ A++++GG +
Sbjct: 62 SRVTGGSVSNIDGLIRANATANLFLINPNGIIFGQNARLDIGGSFVGS 109


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
PAKAF_00883  HTHFIS  57     1e-11    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 56.8 bits (137), Expect = 1e-11
Identities = 30/126 (23%), Positives = 54/126 (42%), Gaps = 5/126 (3%)

Query: 5 RIRVMVADDHPAISLGISYELSQCGSLEMLGQVSNSTELIGRLDEGDCDVVIVDYTMPGG 64
++VADD AI ++ LS+ G + SN+ L + GD D+V+ D MP
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAG--YDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 65 KYGDGLALLSLLRRRYPHLQLVVFTMLNNPGLIRAILKQGINCILSKSDSTSHLLAAVSA 124
+ LL +++ P L ++V + N ++G L K + L+ +
Sbjct: 61 ---NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117

Query: 125 AYSRNQ 130
A + +
Sbjct: 118 ALAEPK 123


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00884  DHBDHDRGNASE  64     2e-14    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 63.9 bits (155), Expect = 2e-14
Identities = 42/190 (22%), Positives = 72/190 (37%), Gaps = 9/190 (4%)

Query: 3 NVLIVGASRGIGLGLADAFLQRGAQVFAVARRPQGSPGLQALAERAGERLQAVTGDLNQR 62
I GA++GIG +A +GA + AV P+ + + + +A D+
Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69

Query: 63 DCAERIGEALGER--RIDRLIVNAGIYGPQQQDVAEIDAEQTAQLFLTNAIAPLRLARAL 120
+ I + ID L+ AG+ P + E+ F N+ +R++
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIH--SLSDEEWEATFSVNSTGVFNASRSV 127

Query: 121 SG--RVSRGGVVAFMSSQMASLALGLSATMPLYGASKAALNSLVRSWEGEFEELPFSLLL 178
S R G + + S A + +M Y +SKAA + E E +
Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVP---RTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184

Query: 179 LHPGWVRTEM 188
+ PG T+M
Sbjct: 185 VSPGSTETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_00885  NUCEPIMERASE  46     4e-07    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 45.9 bits (109), Expect = 4e-07
Identities = 48/201 (23%), Positives = 80/201 (39%), Gaps = 33/201 (16%)

Query: 621 ILLTGASGLMGAHLLAELLASREADLHCPVRAQNDAH--ALERLRQAARQHRIELAESDW 678
L+TGA+G +G H+ LL + ND + +L++ R LA+ +
Sbjct: 3 YLVTGAAGFIGFHVSKRLL--EAGHQVVGIDNLNDYYDVSLKQARLEL------LAQPGF 54

Query: 679 RRVRAYAADLAEPGFGLPAETYRELAGSVDQVFHSA--SAVNF-IQ-PYSYMKRDNVEGL 734
+ + AD + + G ++VF S AV + ++ P++Y N+ G
Sbjct: 55 QFHKIDLADRE-----GMTDLFAS--GHFERVFISPHRLAVRYSLENPHAYADS-NLTGF 106

Query: 735 GQVLRFCASGRCKPLMLLSSISVYSWGHLHTGKRLMREDDDIDQNLPAVVTDMGYVRSKW 794
+L C + + L+ SS SVY K DD +D P + Y +K
Sbjct: 107 LNILEGCRHNKIQHLLYASSSSVYGLNR----KMPFSTDDSVDH--PVSL----YAATKK 156

Query: 795 VMEKIADLAAE-RGLPLMTFR 814
E +A + GLP R
Sbjct: 157 ANELMAHTYSHLYGLPATGLR 177


90  PAKAF_01012  PAKAF_01019  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_01012  -1      10        0.890680  cysteine hydrolase
PAKAF_01013  -1      12       -0.528706  DUF2076 domain-containing protein
PAKAF_01014   0      12       -1.399649  conserved hypothetical protein
PAKAF_01015   0      12       -1.342404  probable ATP-dependent RNA helicase
PAKAF_01016   2      13       -1.529233  TIGR03862 family flavoprotein
PAKAF_01017   1      15       -1.153000  two-component response regulator RocA1
PAKAF_01018   1      14       -0.740187  RocR
PAKAF_01019   1      13        0.187081  probable two-component sensor
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01012  ISCHRISMTASE  46     2e-08    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 45.8 bits (108), Expect = 2e-08
Identities = 47/198 (23%), Positives = 68/198 (34%), Gaps = 33/198 (16%)

Query: 11 SQVALLIVDLQRGMQRHDLPPRNNPGAE--ARIVELLAAWRAAGWPVVHVRHVSRQPGSP 68
++ LLI D+Q +P E A I +L G PVV+ + QPGS
Sbjct: 29 NRAVLLIHDMQNYFVDA-FTAGASPVTELSANIRKLKNQCVQLGIPVVY----TAQPGSQ 83

Query: 69 -----------FAPGQPG----VEFQPALAPRDDEAVFEKNVPDAFINSGLQRWLHVRDI 113
+ PG + LAP DD+ V K AF + L +
Sbjct: 84 NPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEGR 143

Query: 114 RQVALVGVATENSVEASARSAGNLGFQTWVVADACFTFAKPDFHGTPRSADEVHAMALAN 173
Q+ + G+ +A A + + V DA F+ H MAL
Sbjct: 144 DQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVADFSLEK-----------HQMALEY 192

Query: 174 LHGEYAVVLRAAELLQRL 191
G A + LL +L
Sbjct: 193 AAGRCAFTVMTDSLLDQL 210


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
PAKAF_01015  TONBPROTEIN  32     0.003    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 32.3 bits (73), Expect = 0.003
Identities = 24/104 (23%), Positives = 35/104 (33%), Gaps = 16/104 (15%)

Query: 352 EVELLAAIETLIGQTLQRREEPDFEPEHRVPQTA----PGGVVLKKPKKPKKPKAAESVG 407
V ++ + Q +Q EP EPE VV++KPK KPK
Sbjct: 46 SVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKK 105

Query: 408 ---------KPGKIHLGSWFDSSAP---TVKAVRKAPGFGAGAA 439
KP + S F+++AP T A +
Sbjct: 106 VQEQPKRDVKPVESRPASPFENTAPARLTSSTATAATSKPVTSV 149


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
PAKAF_01017  HTHFIS  73     4e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 72.6 bits (178), Expect = 4e-17
Identities = 29/111 (26%), Positives = 51/111 (45%), Gaps = 1/111 (0%)

Query: 3 TVLIVDDHPVIRLAVRVLLEKHGLQVVAETDNGVDAIQLVREHEPDVVILDIGIPKLDGL 62
T+L+ DD IR + L + G V T N + + + D+V+ D+ +P +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 63 TVISRIKSLGLRSQVLVLTSQSAEAFCKRCIQVGARGFVNKEEDLNNLINA 113
++ RIK VLV+++Q+ + + GA ++ K DL LI
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGI 114


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
PAKAF_01018  HTHFIS  53     1e-09    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 52.9 bits (127), Expect = 1e-09
Identities = 27/140 (19%), Positives = 51/140 (36%), Gaps = 9/140 (6%)

Query: 1 MNDLNVLVLEDEPFQRLVAVTALKKVVPGSILEAADGKEAVAILESCGHVDIAICDLQMS 60
M +LV +D+ R V AL + + ++ + + G D+ + D+ M
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRA-GYDVRITSNAATLWRWIAA-GDGDLVVTDVVMP 58

Query: 61 GMDGLAFLRHASLSGKVHSVILSSEVDPILRQATI-SMIECLGLNFLGDLGKPFSLERIT 119
+ L + V++ S Q T + I+ L KPF L +
Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSA------QNTFMTAIKASEKGAYDYLPKPFDLTELI 112

Query: 120 ALLTRYNARRQDLPRQIEVA 139
++ R A + P ++E
Sbjct: 113 GIIGRALAEPKRRPSKLEDD 132


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01019  HTHFIS        64     2e-12    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 64.1 bits (156), Expect = 2e-12
Identities = 31/112 (27%), Positives = 49/112 (43%), Gaps = 5/112 (4%)

Query: 957 RLQVLVVDDHAVNRQILHQQLSFLGHDVEEAENGLSALNLWHGQPFDMVITDCHMPLMSG 1016
+LV DD A R +L+Q LS G+DV N + D+V+TD MP +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 1017 SDLARSIRQEERENGEEPVVIIGLTADAQPEEIERCIQAGMNECLIKPIGLD 1068
DL I+ + + PV+++ +A + + G + L KP L
Sbjct: 63 FDLLPRIK---KARPDLPVLVM--SAQNTFMTAIKASEKGAYDYLPKPFDLT 109


91  PAKAF_01085  PAKAF_01092  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_01085  1       14       2.414433   probable short-chain dehydrogenase
PAKAF_01086  0       14       0.981231   class I SAM-dependent methyltransferase
PAKAF_01087  0       14       0.672305   hypothetical protein
PAKAF_01088  0       16       0.350038   conserved hypothetical protein
PAKAF_01089  0       16       -0.020759  two-component response regulator NarL
PAKAF_01090  0       16       0.155200   two-component sensor NarX
PAKAF_01091  0       16       -0.203039  nitrite extrusion protein 1
PAKAF_01092  0       15       0.228598   nitrite extrusion protein 2
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01085  DHBDHDRGNASE  87     3e-22    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 87.0 bits (215), Expect = 3e-22
Identities = 54/180 (30%), Positives = 81/180 (45%), Gaps = 7/180 (3%)

Query: 5 VAFVTGCSSGIGRALADAFQRAGYRVWA----SARKEDDVRALAEAGFQAVQ--LDVNDA 58
+AF+TG + GIG A+A G + A + E V +L A DV D+
Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69

Query: 59 AALARLAEELGVEAAGLDVLVNNAGYGAMGPLLDGGVEAMRRQFETNVFAVVGVTRALFP 118
AA+ + + E +D+LVN AG G + E F N V +R++
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSK 129

Query: 119 -LLRRKSGLVVNVGSVSGVLVTPFAGAYCASKAAVHALSDALRLELAPFGVEVLEVQPGA 177
++ R+SG +V VGS + AY +SKAA + L LELA + + V PG+
Sbjct: 130 YMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGS 189


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01089  HTHFIS        80     1e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.9 bits (197), Expect = 1e-19
Identities = 44/197 (22%), Positives = 76/197 (38%), Gaps = 18/197 (9%)

Query: 13 RLLLVDDHPMMRKGVAQLLELEDDLSVVGEAGSGEEALRLAAELDPDMILLDLNMKGMNG 72
+L+ DD +R + Q L V + R A D D+++ D+ M N
Sbjct: 5 TILVADDDAAIRTVLNQALSRAG-YDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 73 LDTLRALREAGVDARIVVFTVSDDKGDVVNVLRAGADGYLLKDMEPERLLEHIRQAATGQ 132
D L +++A D ++V + + + GA YL K + L+ I +A
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA-- 120

Query: 133 MTLSPQLTQILAQALRGDD---RSKSLDELTERERQILRQIAHGYSNKMIARKLDITE-G 188
+ +++ + G RS ++ E+ ++L ++ MI E G
Sbjct: 121 -EPKRRPSKLEDDSQDGMPLVGRSAAMQEI----YRVLARLMQTDLTLMI-----TGESG 170

Query: 189 TVKVHVKRVLHKLGMRS 205
T K V R LH G R
Sbjct: 171 TGKELVARALHDYGKRR 187


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01090  PF06580       43     2e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 43.3 bits (102), Expect = 2e-06
Identities = 26/111 (23%), Positives = 48/111 (43%), Gaps = 12/111 (10%)

Query: 495 FGERGEVTIELDNRLQHVPLSPNEEIHVLQIVREALSNVVRHSQAQR---AWVRLSSQAD 551
F +R + +++ + V + P ++Q + E N ++H AQ + L D
Sbjct: 236 FEDRLQFENQINPAIMDVQVPP----MLVQTLVE---NGIKHGIAQLPQGGKILLKGTKD 288

Query: 552 -GQVSIAVEDDGVGFDPQQNRSGHYGLTIMQERGQTL-GSQLRFEARAPHG 600
G V++ VE+ G S GL ++ER Q L G++ + + G
Sbjct: 289 NGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQG 339


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01091  TCRTETA       39     3e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 39.0 bits (91), Expect = 3e-05
Identities = 60/350 (17%), Positives = 113/350 (32%), Gaps = 30/350 (8%)

Query: 39 ELGLSESQ---FGLMVALPILTGSLVRLPLGLITDRFGGRIVFFIHMLLVAIPIYGLAFA 95
+L S +G+++AL L LG ++DRFG R V + + A+ +A A
Sbjct: 34 DLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATA 93

Query: 96 SQYWHYLVLGLFVGLAGGSFAVGIAYTSAWFEKERQGTAMGIFGAGNAGAAITNLVAPMI 155
W + + G+ G + AV AY + + + + G A + V +
Sbjct: 94 PFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGL 153

Query: 156 VVAFGWRMVPQVYSVAMLVTAVLFWLFTWTDPAHLKGAAEASQRPTLAKQLAPLAELRVW 215
+ F P + A+ L F + + + + V
Sbjct: 154 MGGFSPHA-PFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVV 212

Query: 216 RFGLYYFFVFG--GFVALALWLPKYYIAEYGLDLKTASFITMLFTLPSGLIRA-LGGWFS 272
+ FF+ G V ALW+ + + D T F + L +A + G +
Sbjct: 213 AALMAVFFIMQLVGQVPAALWV-IFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVA 271

Query: 273 DHYGARS-VNWGVFWVCLVCLFFLSYPQTTMTIHGIQGDLSLGIGLNVWLFTFLVFVVGI 331
G R + G+ + + W+ F + V+
Sbjct: 272 ARLGERRALMLGMIADGTGYILLAFATR-------------------GWMA-FPIMVLLA 311

Query: 332 AQGFGKASVYRIIHDYYPSN-MGTVGGMVGVIGGLGGFCLPILFGYAADH 380
+ G G ++ ++ G + G + + L P+LF
Sbjct: 312 SGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAA 361


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01092  TCRTETA       30     0.019    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 30.2 bits (68), Expect = 0.019
Identities = 28/128 (21%), Positives = 47/128 (36%), Gaps = 11/128 (8%)

Query: 52 AVWMIWSTVTVRLNSAGFAFSNDQLFLLAALPSISGATLRVFYSFMVPIFGGRRWTALST 111
A+W+I+ ++ S L L S++ A + + G RR L
Sbjct: 231 ALWVIFGEDRFHWDATTIGIS---LAAFGILHSLAQA---MITGPVAARLGERRALMLGM 284

Query: 112 ASMLIPCIWLGFAVQDPSTPYWVFALIALLCGFGGGNFASSMSNISFFYPKSQQGTALGL 171
+ I L FA + W+ I +L GG + + +S + +QG G
Sbjct: 285 IADGTGYILLAFATR-----GWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGS 339

Query: 172 NAGLGNLG 179
A L +L
Sbjct: 340 LAALTSLT 347


92  PAKAF_01251  PAKAF_01260  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_01251  -1      8        -0.863460  elastase LasB
PAKAF_01252  0       13       0.910596   probable FMN oxidoreductase
PAKAF_01253  1       18       0.301839   hypothetical protein
PAKAF_01254  -3      13       -0.339520  NalC
PAKAF_01255  -3      12       -0.062331  hypothetical protein
PAKAF_05917  -2      11       -0.402543  antirepressor for MexR, ArmR
PAKAF_01256  -2      10       -0.270677  probable major facilitator superfamily (MFS)
PAKAF_01257  -1      12       -0.642753  probable peptidyl-prolyl cis-trans isomerase,
PAKAF_01258  -1      12       -0.225552  WG repeat-containing protein
PAKAF_01259  -1      11       0.670982   hypothetical protein
PAKAF_01260  -2      8        0.300404   probable two-component response regulator
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01251  THERMOLYSIN   399    e-136    Thermolysin metalloprotease (M4) family signature.
		>THERMOLYSIN#Thermolysin metalloprotease (M4) family signature.

Length = 544

Score = 399 bits (1027), Expect = e-136
Identities = 138/488 (28%), Positives = 206/488 (42%), Gaps = 59/488 (12%)

Query: 51 GAGGADELKAIRSTTLPNGKQVTRYEQFHNGVRVVGEAITEVKGPGKSVAAQRSGHFVAN 110
G + L I + G V R+EQ +G + G+ + SG + N
Sbjct: 69 GGQARERLSLIGNKLDELGHTVMRFEQAIAASLCMGAVLVAHVNDGE--LSSLSGTLIPN 126

Query: 111 IAADLPGSTTAAVSAEQVLAQAKS------LKAQGRKTENDKVELVIRLGENNIAQLVYN 164
+ T AA+S +Q AK K + E LVI E +L Y
Sbjct: 127 LDKRTL-KTEAAISIQQAEMIAKQDVADRVTKERPAAEEGKPTRLVIYPDEET-PRLAYE 184

Query: 165 VSYLIPGEGLSRPHFVIDAKTGEVLDQWEGLAHAEAGGPG---------------GNQKI 209
V+ ++IDA G+VL++W + A+ GG G+QK
Sbjct: 185 VNVRFLTPVPGNWIYMIDAADGKVLNKWNQMDEAKPGGAQPVAGTSTVGVGRGVLGDQKY 244

Query: 210 GKYTYGSDYGPLIVNDRCEMDDGNVITVDMNSSTDDSKTTPFRFACPTNTYKQVNGAYSP 269
TY S YG + D + T D + T + + Q +Y
Sbjct: 245 INTTYSSYYGYYYLQDNTR--GSGIFTYDGRNRTVLPGSLW------ADGDNQFFASYDA 296

Query: 270 -LNDAHFFGGVVFKLYRDWFG---TSPLTHKLYMKVHYGRSVENAYWDGTAMLFGDG-AT 324
DAH++ GVV+ Y++ G + VHYGR NA+W+G+ M++GDG
Sbjct: 297 AAVDAHYYAGVVYDYYKNVHGRLSYDGSNAAIRSTVHYGRGYNNAFWNGSQMVYGDGDGQ 356

Query: 325 MFYPLV-SLDVAAHEVSHGFTEQNSGLIYRGQSGGMNEAFSDMAGEAAEFYMRGKNDFLI 383
F P +DV HE++H T+ +GL+Y+ +SG +NEA SD+ G EFY D+ I
Sbjct: 357 TFLPFSGGIDVVGHELTHAVTDYTAGLVYQNESGAINEAMSDIFGTLVEFYANRNPDWEI 416

Query: 384 GYDIKK---GSGALRYMDQPSRDGRSIDNASQYYNGID----VHHSSGVYNRAFYLLANS 436
G DI ALR M P++ G D+ S+ Y G VH +SG+ N+A YLL+
Sbjct: 417 GEDIYTPGVAGDALRSMSDPAKYGDP-DHYSKRYTGTQDNGGVHTNSGIINKAAYLLSQG 475

Query: 437 --------PGWDTRKAFEVFVDANRYYWTATSNYNSGACGVIRSAQNRNYS----AADVT 484
G K ++F A YY T TSN++ +++A + S V
Sbjct: 476 GVHYGVSVTGIGRDKMGKIFYRALVYYLTPTSNFSQLRAACVQAAADLYGSTSQEVNSVK 535

Query: 485 RAFSTVGV 492
+AF+ VGV
Sbjct: 536 QAFNAVGV 543


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01254  HTHTETR       63     2e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 63.1 bits (153), Expect = 2e-14
Identities = 25/96 (26%), Positives = 44/96 (45%), Gaps = 5/96 (5%)

Query: 10 ERGRQRRRAMLDAATQAFLEHGFEGTTLDMVIERAGGSRGTLYSSFGGKEGLFAAVIA-- 67
+ ++ R+ +LD A + F + G T+L + + AG +RG +Y F K LF+ +
Sbjct: 7 QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELS 66

Query: 68 --HMIEEIFDDSADQPR-PAATLSATLEHFGRRFLT 100
++ E + A P P + L L H +T
Sbjct: 67 ESNIGELELEYQAKFPGDPLSVLREILIHVLESTVT 102


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01256  TCRTETB       60     8e-12    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 59.5 bits (144), Expect = 8e-12
Identities = 63/379 (16%), Positives = 130/379 (34%), Gaps = 55/379 (14%)

Query: 40 IALPSLQRSFGGDLAALSWIMSAFPFVGVFGGIAAGLLVRRWGDRRLLTGGLAILGGASL 99
++LP + F A+ +W+ +AF G G L + G +RLL G+ I S+
Sbjct: 35 VSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSV 94

Query: 100 LGASMQDFA-WLLATRFVEGLGFLIVVVAAPAVLHRITSETRRSVVFGLWSTFMAGGIAL 158
+G F L+ RF++G G V+ R + R FGL + +A G +
Sbjct: 95 IGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGV 154

Query: 159 SMLFGPLLADW-RADWQLSALLVLVAALLLPLSVPADDGCRAAGVRPAGLGTLLKVPAIT 217
G ++A + + L ++ + + + + + G+ +L I
Sbjct: 155 GPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGI--ILMSVGIV 212

Query: 218 LLALGFTTYNLQFFALMTF----------------------------------------- 236
L T+Y++ F +
Sbjct: 213 FFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTV 272

Query: 237 -----LPVFLMQR---LGVALETAGLIGAAIVAANALGNVAAGFILSRGIRPGALLASTA 288
+ ++M+ L A + +I ++ G + + RG + T
Sbjct: 273 AGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTF 332

Query: 289 ILMGLTGAAFFHAAMPGLLAIALGFVFSAVAGMLPTTVLATAPLASPAPSLTPLAIGWVM 348
+ + A+F + I + FV ++ TV++T +S + +
Sbjct: 333 LSVSFLTASFLLETTSWFMTIIIVFVLGGLSFT--KTVISTIVSSSLKQQEAGAGMSLLN 390

Query: 349 QGNYLGQVIGPLLIGLIVS 367
++L + G ++G ++S
Sbjct: 391 FTSFLSEGTGIAIVGGLLS 409


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01257  INFPOTNTIATR  80     5e-22    Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 80.4 bits (198), Expect = 5e-22
Identities = 42/104 (40%), Positives = 59/104 (56%), Gaps = 2/104 (1%)

Query: 5 LQIEDLLLGDGKEVVKGALITTQYKGTLEDGTLFDSSYERGRPFQCVIGTGRVIKGWDQG 64
LQ + + G G + K +T +Y GTL DGT+FDS+ + G+P +VI GW +
Sbjct: 128 LQYKIIDAGTGAKPGKSDTVTVEYTGTLIDGTVFDSTEKAGKP--ATFQVSQVIPGWTEA 185

Query: 65 LMGMKVGGKRRLFVPSHLAYGERQVGAHIKPHSNLLFEIELLEV 108
L M G +FVP+ LAYG R VG I P+ L+F+I L+ V
Sbjct: 186 LQLMPAGSTWEVFVPADLAYGPRSVGGPIGPNETLIFKIHLISV 229


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01260  HTHFIS        61     3e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.0 bits (148), Expect = 3e-13
Identities = 28/130 (21%), Positives = 57/130 (43%), Gaps = 11/130 (8%)

Query: 2 KTRVILVDDHALTLIGMRYLLSAYD-DLRIVAQAQDADGLLAQLEAHPCDLLITDLMMPG 60
+++ DD A + LS D+RI + A + A DL++TD++MP
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATL---WRWIAAGDGDLVVTDVVMPD 59

Query: 61 SQQADGLRLVQKVRRRYPDLPIIVVTMLGNPALVSSLLKLGIHGLVSK----RGMLDDLP 116
+ L+ ++++ PDLP++V++ + G + + K ++ +
Sbjct: 60 ---ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116

Query: 117 KAILHAGRRP 126
+A+ RRP
Sbjct: 117 RALAEPKRRP 126


93  PAKAF_01265  PAKAF_01275  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_01265  -1      8        0.952957   probable major facilitator superfamily (MFS)
PAKAF_01266  0       10       1.219088   probable chemotaxis transducer
PAKAF_01267  1       12       1.667961   purine-binding chemotaxis protein CheW
PAKAF_01268  0       20       0.846682   probable protein methyltransferase
PAKAF_01269  0       15       -0.147601  purine-binding chemotaxis protein CheW
PAKAF_01270  0       15       -0.290765  probable chemotaxis sensor/effector fusion
PAKAF_01271  1       18       -1.726461  probable methylesterase
PAKAF_01272  2       16       -1.919711  probable two-component response regulator
PAKAF_01273  2       16       -2.155520  peptide chain release factor I
PAKAF_01274  0       13       -1.327852  lysyl-tRNA synthetase
PAKAF_01275  -1      11       -0.951639  probable transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01265  TCRTETA       35     6e-04    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 35.2 bits (81), Expect = 6e-04
Identities = 15/27 (55%), Positives = 17/27 (62%)

Query: 304 VAGWLSDRIGRKPVLLAGLLLATLFYF 330
V G LSDR GR+PVLL L A + Y
Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYA 88



Score = 32.1 bits (73), Expect = 0.005
Identities = 24/113 (21%), Positives = 45/113 (39%), Gaps = 17/113 (15%)

Query: 63 IFALMAFAAGFLVRPFGALVFGRLGDMIGRKYTFLVTILLMGLSTFAVGLLPTYASIGVA 122
++ALM FA A V G L D GR+ LV++ + + P
Sbjct: 51 LYALMQFAC--------APVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLW----- 97

Query: 123 APIILVTLRMLQGLALGGEYGGAAIYVAEHAPANKRGSYTSWIQSTATLGLLL 175
+L R++ G+ G A Y+A+ ++R + ++ + G++
Sbjct: 98 ---VLYIGRIVAGIT-GATGAVAGAYIADITDGDERARHFGFMSACFGFGMVA 146


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01270  HTHFIS        74     7e-16    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 74.1 bits (182), Expect = 7e-16
Identities = 30/113 (26%), Positives = 52/113 (46%), Gaps = 2/113 (1%)

Query: 644 QRKRILVVDDSLTVRELERKLLLGRGYDVAVAVDGMDGWNALRSEHFDLLITDIDMPRMD 703
ILV DD +R + + L GYDV + + W + + DL++TD+ MP +
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 704 GIELVTLVRRDSRLQSLPVMVVSYKDREEDRRRGLDAGADYYLAKASFHDEAL 756
+L+ ++ LPV+V+S ++ + + GA YL K E +
Sbjct: 62 AFDLLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELI 112


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01271  HTHFIS        48     2e-08    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 48.3 bits (115), Expect = 2e-08
Identities = 29/129 (22%), Positives = 49/129 (37%), Gaps = 13/129 (10%)

Query: 6 EALRRALAFEPQHQIVWVASNGAEAVTQCAADTPDVVLMDLLMPVMDGVEATRRIMAESP 65
L +AL+ V + SN A AA D+V+ D++MP + + RI P
Sbjct: 17 TVLNQALSRAGY--DVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIKKARP 74

Query: 66 CAIVIVTVDIEQNVHRVFEAMGYGALDAVNTP----------ALGIGNPQTAAAPLLRKI 115
V+V + + +A GA D + P + P+ + L
Sbjct: 75 DLPVLV-MSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRPSKLEDDS 133

Query: 116 QNVGWLIGQ 124
Q+ L+G+
Sbjct: 134 QDGMPLVGR 142


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01272  HTHFIS        68     1e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 67.5 bits (165), Expect = 1e-14
Identities = 34/129 (26%), Positives = 53/129 (41%), Gaps = 3/129 (2%)

Query: 21 VLLVDDQAMIGEAVRRSLASEAGIDFHFCSDPQQAVAVANQIKPTVILQDLVMPGVDGLT 80
+L+ DD A I + ++L S AG D S+ +++ D+VMP +
Sbjct: 6 ILVADDDAAIRTVLNQAL-SRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 81 LLAAYRGNPATRDIPIIVLSTKEEPTVKSAAFAAGANDYLVKLPDAIELVARIRYHSRSY 140
LL + A D+P++V+S + A GA DYL K D EL+ I
Sbjct: 65 LLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 141 IALQQRDEA 149
+ E
Sbjct: 123 KRRPSKLED 131


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01275  HTHTETR       36     4e-05    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 36.1 bits (83), Expect = 4e-05
Identities = 22/119 (18%), Positives = 45/119 (37%), Gaps = 7/119 (5%)

Query: 1 MRLIVRDGVRAVRHRAVAAEAQVPLSATTYYFKDIDDLITDTFALFVERNAEALSAFWSS 60
+RL + GV + +A A V A ++FKD DL ++ + L E + +
Sbjct: 21 LRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAK 80

Query: 61 VEGDLQEMAAVLADD-------PGARGSLVERIVELAVQYVQVQLTERREHLLAEQAFR 112
GD + + R L+E I ++ + ++ + L +++
Sbjct: 81 FPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLESYD 139


94  PAKAF_01291  PAKAF_01298  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_01291  1       11       2.618532   DUF72 domain-containing protein
PAKAF_01292  0       9        3.185629   isocitrate lyase/phosphoenolpyruvate mutase
PAKAF_01293  -2      12       1.590581   extensin family protein
PAKAF_01294  -1      12       1.449064   ribosomal RNA small subunit methyltransferase J
PAKAF_01295  -2      11       0.975565   cytochrome P450
PAKAF_01296  -1      12       -0.135639  probable transcriptional regulator
PAKAF_01297  -2      11       0.190627   probable Resistance-Nodulation-Cell Division
PAKAF_01298  -2      13       -0.529897  probable Resistance-Nodulation-Cell Division
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01291  OUTRMMBRANEA  28     0.028    Outer membrane protein A signature.
		>OUTRMMBRANEA#Outer membrane protein A signature.

Length = 346

Score = 28.0 bits (62), Expect = 0.028
Identities = 15/38 (39%), Positives = 18/38 (47%), Gaps = 5/38 (13%)

Query: 183 PDNH-LAAQQAQRFHALLGQRLPGLPALPEPIPAPEVE 219
PDN L+ + RF GQ P P PAPEV+
Sbjct: 178 PDNGMLSLGVSYRF----GQGEAAPVVAPAPAPAPEVQ 211


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01296  HTHTETR       70     5e-17    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 70.0 bits (171), Expect = 5e-17
Identities = 37/180 (20%), Positives = 62/180 (34%), Gaps = 9/180 (5%)

Query: 9 GPGRPKDPAKREAILEAAKRLFLCNGYDGSSMEAIASEAGVSKLTVYSHFTDKETLFSEA 68
+ + R+ IL+ A RLF G +S+ IA AGV++ +Y HF DK LFSE
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 69 VKAKCAEQLPALYFQLAE---GAPLEKVLLNIARGFHRLI---NSHEAIALTRLMAAQAG 122
+ L + G PL + + + + + G
Sbjct: 63 W-ELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 123 QNPKLSELFFEAGPKQVIDEMERLLEQARRSGKLAFP-DARHAAEHFFMLVKGCANYRLL 181
+ + + + D +E+ L+ + L R AA + G L
Sbjct: 122 EMAVVQQA-QRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01297  RTXTOXIND     56     1e-10    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 55.6 bits (134), Expect = 1e-10
Identities = 25/205 (12%), Positives = 63/205 (30%), Gaps = 12/205 (5%)

Query: 78 ERVKKDQPLAE--LDPQDVRLQLEAARAQVSAAEANLQTVRAEYRRYRTLLDRNLVSHSQ 135
R+ L + L+ E + ++ + +Q
Sbjct: 235 SRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQ 294

Query: 136 --FENIQNSYRAGEARLKQIRAEFNVADNQAGYAVLRSPQDGVIASRRV-EVGQVVAAGQ 192
I + R + + E + + +V+R+P + +V G VV +
Sbjct: 295 LFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE 354

Query: 193 TVFSLAADGER-EVLIGLPEHSFERFRIGQPVSVELWSQRDRRF---AGHIRELSPAADP 248
T+ + + + EV + +GQ +++ + R+ G ++ ++ A
Sbjct: 355 TLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIE 414

Query: 249 QSRT---FAARVAFDDRATPAELGQ 270
R F ++ ++
Sbjct: 415 DQRLGLVFNVIISIEENCLSTGNKN 439



Score = 44.0 bits (104), Expect = 6e-07
Identities = 19/112 (16%), Positives = 34/112 (30%), Gaps = 21/112 (18%)

Query: 66 GGKVIRRLVEVGERVKKDQPLAELDPQDVRLQLEAARAQVSAAEANLQTVRAEYR----- 120
V +V+ GE V+K L +L ++ + A + R
Sbjct: 104 NSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELN 163

Query: 121 -----------RYRTLLDRNLVS-----HSQFENIQNSYRAGEARLKQIRAE 156
++ + + ++ QF QN E L + RAE
Sbjct: 164 KLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAE 215


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01298  ACRIFLAVINRP  483    e-156    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 483 bits (1244), Expect = e-156
Identities = 246/1044 (23%), Positives = 449/1044 (43%), Gaps = 53/1044 (5%)

Query: 5 LSAWALQNRQIVLYLMILLGAVGALSYSKLGQSEDPPFTFKAMVVQTNWPGASAEEVARQ 64
++ + ++ L I+L GAL+ +L ++ P A+ V N+PGA A+ V
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 65 VTERIEKKLMETGDYDRIVSFSRPGVS---QVTFMAREDIHSSEIPELWYQIRKKISDIR 121
VT+ IE+ + + + S S S +TF + D +++ Q++ K+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQV-----QVQNKLQLAT 115

Query: 122 ATLPQSIQGP-FFNDEFGTTYGNIYALTGKGFDY--AVMKDYADR-LQLQLQRIRNVGKV 177
LPQ +Q ++ ++Y + + DY ++ L R+ VG V
Sbjct: 116 PLLPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDV 175

Query: 178 ELIGLQDEKIWIDLSNTKLATLGLPLAAVQKALEEQNAVASSGFFETASD------RVQL 231
+L G + I L L L V L+ QN ++G +
Sbjct: 176 QLFG-AQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASI 234

Query: 232 RVSGRFDSVEEIRDFPIRVGD--RTFRIGDVAEVRRGFNDPPAPRMRFMGEDAIGLAVAM 289
RF + EE +RV R+ DVA V G + R G+ A GL + +
Sbjct: 235 IAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNV-IARINGKPAAGLGIKL 293

Query: 290 KPGGDILVLGKALETEFARLQQSLPAGLELRKVSDQPAAVRTGVGEFIRVLAEALVIVLL 349
G + L KA++ + A LQ P G+++ D V+ + E ++ L EA+++V L
Sbjct: 294 ATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFL 353

Query: 350 VSFFSLG-LRTGLVVALSIPLVLAMTFAAMHYFGIGLHKISLGALVLALGLLVDDAIIAV 408
V + L +R L+ +++P+VL TFA + FG ++ +++ +VLA+GLLVDDAI+ V
Sbjct: 354 VMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVV 413

Query: 409 EMMA-VKMEQGYDRLKAAAFAWTSTAFPMLTGTLITAAGFLPIATAQSGTGEYTRSLFQV 467
E + V ME +A + + ++ ++ +A F+P+A TG R
Sbjct: 414 ENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSIT 473

Query: 468 VTIALVVSWFAAVVFVPYLGAKLLPDLARLHAQKHGGSADGYDPYATAFYQRFRRLVEWC 527
+ A+ +S A++ P L A LL ++ H + GG ++ + V
Sbjct: 474 IVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKI 533

Query: 528 VRYRKTVIVLTLAAFVGALLLFRLVPQQFFPPSARLELLLDIKLAEGASLRSTGEEVQRL 587
+ +++ G ++LF +P F P + L I+L GA+ T + + ++
Sbjct: 534 LGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQV 593

Query: 588 EKMLQGHDGIDNYVAYVGTGSPRFYLPLDQQLPAASFAQVVVLAKDLESR---EALRKWL 644
++ + + G Q A A V + K E R E + +
Sbjct: 594 TDYYLKNEKANVESVFTVNG-----FSFSGQAQNAGMAFVSL--KPWEERNGDENSAEAV 646

Query: 645 IERMNEDFPHLRSRISRLENGPPV-------GYPVQ-FRVSGEDIPQVRELARKVADKMR 696
I R + +R N P + G+ + +G + + ++
Sbjct: 647 IHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAA 706

Query: 697 ENP-HVVNVHLDWEEPSKVVYLSIDQERARALGVSTASLSQFLQSALTGSHVSFFREDNE 755
++P +V+V + E + L +DQE+A+ALGVS + ++Q + +AL G++V+ F +
Sbjct: 707 QHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGR 766

Query: 756 LIEILLRGTEQERRDLSLLPSLAVPTENGRSVALSQIATLEYGFEEGIIWHRNRLPTVTV 815
+ ++ ++ + R + L V + NG V S T + + + N LP++ +
Sbjct: 767 VKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEI 826

Query: 816 RADIYDDSLPATLVAQIAPTLEPIRAELPDGYLLEVGGTVEDAAKGQSSVNAGVPLFIVV 875
+ + P T +E + ++LP G + G + A V + VV
Sbjct: 827 QGEA----APGTSSGDAMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVV 882

Query: 876 VLSLLMVQLRSFSRMAMVFLTAPLGLIGVTLFLLLFRQPFGFVAMLGTIALAGMIMRNSV 935
V L S+S V L PLG++GV L LF Q M+G + G+ +N++
Sbjct: 883 VFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAI 942

Query: 936 ILVDQIEQ-DISHGLDRWHAIIEATVRRFRPIVLTALAAVLAMIPLSRSVFFG-----PM 989
++V+ + G A + A R RPI++T+LA +L ++PL+ S G +
Sbjct: 943 LIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAV 1002

Query: 990 AVAIMGGLIVATVLTLLFLPALYA 1013
+ +MGG++ AT+L + F+P +
Sbjct: 1003 GIGVMGGMVSATLLAIFFVPVFFV 1026



Score = 78.0 bits (192), Expect = 1e-16
Identities = 78/509 (15%), Positives = 175/509 (34%), Gaps = 47/509 (9%)

Query: 533 TVIVLTLAAFVGALLLFRLVPQQFFPPSARLELLLDIKLAEGASLRSTGEEVQR-LEKML 591
VL + + L +P +P A + + GA ++ + V + +E+ +
Sbjct: 11 FAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSAN-YPGADAQTVQDTVTQVIEQNM 69

Query: 592 QGHDGIDN---YVAYVGTGSPRFYLPLDQQLPAASFAQVVVLAKDLESREALRKWLIERM 648
G D + G+ + AQV V K L+ +
Sbjct: 70 NGIDNLMYMSSTSDSAGSVTITLTFQSGTDP---DIAQVQVQNK-LQLATP-------LL 118

Query: 649 NEDFPHLRSRISRLENGPPVGYPVQFRVSGEDIPQVRELA-RKVADKMRENPHVVNVHLD 707
++ + + + + G + + V D + V +V L
Sbjct: 119 PQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLF 178

Query: 708 WEEPSKVVYLSIDQERARALGVS-----TASLSQFLQSALTGSHVSFFREDNEL-IEILL 761
+ + + +D + ++ Q Q A + +L I+
Sbjct: 179 GAQ--YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIA 236

Query: 762 RGTEQERRDLSLLPSLAVPTENGRSVALSQIATLEYGFEEGIIWHR-NRLPTVTVRADIY 820
+ + + + +G V L +A +E G E + R N P + +
Sbjct: 237 QTRFKNPEEFGKVTLRV--NSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLA 294

Query: 821 DDSLPATLVAQIAPTLEPIRAELPDGYLLEVGGTVEDAAKGQSSVNAGV-PLFIVVVLSL 879
+ I L ++ P G + + Q S++ V LF ++L
Sbjct: 295 TGANALDTAKAIKAKLAELQPFFPQGMKVLY--PYDTTPFVQLSIHEVVKTLFEAIMLVF 352

Query: 880 LMVQ--LRSFSRMAMVFLTAPLGLIGVTLFLLLFRQPFGF---VAMLGTIALA-GMIMRN 933
L++ L++ + + P+ L+G T +L FG+ + + LA G+++ +
Sbjct: 353 LVMYLFLQNMRATLIPTIAVPVVLLG-TFAILAA---FGYSINTLTMFGMVLAIGLLVDD 408

Query: 934 SVILVDQIEQ-DISHGLDRWHAIIEATVRRFRPIVLTALAAVLAMIPL-----SRSVFFG 987
++++V+ +E+ + L A ++ + +V A+ IP+ S +
Sbjct: 409 AIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYR 468

Query: 988 PMAVAIMGGLIVATVLTLLFLPALYAAWF 1016
++ I+ + ++ ++ L+ PAL A
Sbjct: 469 QFSITIVSAMALSVLVALILTPALCATLL 497


95  PAKAF_01507  PAKAF_01515  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_01507  1       11       1.755279   putative MFS transporter,Putative
PAKAF_01508  0       11       1.622459   putative ATP-dependent RNA
PAKAF_01509  0       12       2.120246   transmembrane protein,enterobactin exporter
PAKAF_01510  -1      10       1.773524   phospholipase
PAKAF_01511  -1      10       1.482719   YheU family protein
PAKAF_01512  -1      10       2.015183   probable sensor/response regulator hybrid
PAKAF_01513  -2      11       3.200287   osmoprotectant NAGGN system M42 family
PAKAF_01514  0       11       2.707385   probable acetyltransferase
PAKAF_01515  0       10       2.283003   probable glutamine amidotransferase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01507  TCRTETA       58     3e-11    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 57.9 bits (140), Expect = 3e-11
Identities = 46/209 (22%), Positives = 86/209 (41%), Gaps = 4/209 (1%)

Query: 26 VIIALAFFFDSMDLAMMTFLLGSIKAEFGLDSAQA---GLLASSSFFGMVIGAALSGMLA 82
++I D++ + ++ +L + + + G+L + A + G L+
Sbjct: 8 IVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALS 67

Query: 83 DRFGRKPVFQASIVLWGLASYLCSTAGDLDSLTFYRVLLGIGMGMEFPIAQSLLSEMIPA 142
DRFGR+PV S+ + + +TA L L R++ GI G +A + ++++
Sbjct: 68 DRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGI-TGATGAVAGAYIADITDG 126

Query: 143 SRRGKYIALMDGFWPLGFVAAGCLSYFLLPLTGWRSIFLVLALPAVFVLAIRFLIPESPR 202
R ++ M + G VA L + + F AL + L FL+PES +
Sbjct: 127 DERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHK 186

Query: 203 WLEQAGRREQADRVLRDIEARVMRSLGLT 231
+ RRE + + AR M +
Sbjct: 187 GERRPLRREALNPLASFRWARGMTVVAAL 215



Score = 30.6 bits (69), Expect = 0.012
Identities = 37/167 (22%), Positives = 61/167 (36%), Gaps = 9/167 (5%)

Query: 286 LSALLQQSGFAVTQSVYYTVLISLAGIPGFLCAAWL---VESWGRKPSCVLMLLGGGAMA 342
L LL+ + + +Y +L++L + F CA L + +GR+P ++ L G
Sbjct: 28 LPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDY 87

Query: 343 YAYGQTAVFGGSLALLIGFGLAMQFFLFGMWAVLYTYTPELYPTSARATGSGFASAVGRI 402
L +L G + AV Y ++ RA GF SA
Sbjct: 88 AIMA----TAPFLWVLY-IGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGF 142

Query: 403 GSLLGPLVTGLVLPLTGQGGVFTLGALCFGVAALVVWAFGIETRGRT 449
G + GP++ GL+ + F A G+ L E+
Sbjct: 143 GMVAGPVLGGLMGGFSPHAP-FFAAAALNGLNFLTGCFLLPESHKGE 188


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01509  TCRTETA       43     3e-06    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 42.5 bits (100), Expect = 3e-06
Identities = 57/281 (20%), Positives = 94/281 (33%), Gaps = 13/281 (4%)

Query: 79 ALPLVLLSILSGVIADNHDRRKIMLWGLSFEMTGAMFATLLAFLGYLDPVLLIISILWIS 138
AL + + G ++D RR ++L L A + + P L ++ I I
Sbjct: 53 ALMQFACAPVLGALSDRFGRRPVLLVSL-------AGAAVDYAIMATAPFLWVLYIGRIV 105

Query: 139 LGGS-VTIPAWQAAVNEQVPARMVSDAVLLNSVNYNVARAAGPALGGLLLSAVGPAWVFL 197
G + T A + + + S + AGP LGGL+ P F
Sbjct: 106 AGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLM-GGFSPHAPFF 164

Query: 198 FNSFCY-MALIWAIWQWRRDVPKRSLPPEGILEGVTAALRFTQYSTVTRLVMMRSFAFGL 256
+ + + + P A+ R+ + TV +M F L
Sbjct: 165 AAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQL 224

Query: 257 SASAVWALLPLLAHRNPDGDAAIYGYMLGALG-LGAILGSTQVSRLRQRIGSSRLISLAG 315
AL + DA G L A G L ++ + + R+G R + L
Sbjct: 225 VGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGM 284

Query: 316 FTLALILLTLGLVDNLWVLFPVLIL--GGGCWIGALATYNS 354
+ L W+ FP+++L GG + AL S
Sbjct: 285 IADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAMLS 325



Score = 40.6 bits (95), Expect = 1e-05
Identities = 32/189 (16%), Positives = 63/189 (33%), Gaps = 12/189 (6%)

Query: 12 PLKPEGQAAKPERTGTWAPFSIQAFRIIWICNLFANLGTWA--QSVAAAWVVTDA---HA 66
K E + + E A F + + Q AA WV+ H
Sbjct: 184 SHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHW 243

Query: 67 SPLMVA-MIQVAAALPLVLLSILSGVIADNHDRRKIMLWGLSFEMTGAMFATLLAFLGYL 125
+ + L + ++++G +A R+ ++ G+ + TG L +
Sbjct: 244 DATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTG------YILLAFA 297

Query: 126 DPVLLIISILWISLGGSVTIPAWQAAVNEQVPARMVSDAVLLNSVNYNVARAAGPALGGL 185
+ I+ + G + +PA QA ++ QV + ++ GP L
Sbjct: 298 TRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTA 357

Query: 186 LLSAVGPAW 194
+ +A W
Sbjct: 358 IYAASITTW 366



Score = 34.4 bits (79), Expect = 0.001
Identities = 32/142 (22%), Positives = 51/142 (35%), Gaps = 8/142 (5%)

Query: 277 AAIYGYMLGALGLGAILGSTQVSRLRQRIGSSR--LISLAGFTLALILLTLGLVDNLWVL 334
A YG +L L + + L R G L+SLAG + ++ LWVL
Sbjct: 42 TAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATA--PFLWVL 99

Query: 335 FPVLILGG-GCWIGALATYNSAVQILVPDWIKARALALYQTALYGGLALGSFLWGHLAET 393
+ I+ G GA+A + + + +AR G+ G L G +
Sbjct: 100 YIGRIVAGITGATGAVAG--AYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGG- 156

Query: 394 MTVHGALLAAGCLLLASVILLY 415
+ H AA L + +
Sbjct: 157 FSPHAPFFAAAALNGLNFLTGC 178


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01510  PRPHPHLPASEC  38     4e-05    Prokaryotic zinc-dependent phospholipase C signature.
		>PRPHPHLPASEC#Prokaryotic zinc-dependent phospholipase C signature.

Length = 398

Score = 38.5 bits (89), Expect = 4e-05
Identities = 13/38 (34%), Positives = 20/38 (52%)

Query: 241 QYFGLSRFAFANGHPYWGYRFLGWGMHYIQDITQPYHS 278
++ L+R+ + G+ +LG MHY DI PYH
Sbjct: 128 KFSALARYEWQRGNYKQATFYLGEAMHYFGDIDTPYHP 165


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01512  HTHFIS        70     2e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 69.9 bits (171), Expect = 2e-14
Identities = 34/132 (25%), Positives = 57/132 (43%), Gaps = 7/132 (5%)

Query: 786 LDAPCILVAEDNPVNQLVVRGFLAKRGYAVRLAGNGRLALDEYLRDPNGIQLILMDGEMP 845
+ ILVA+D+ + V+ L++ GY VR+ N L++ D MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGD--LVVTDVVMP 58

Query: 846 EMDGFEATRLIRREERAQGWPRVPIVALTAHILDEHRRAGIEAGMDAYLGKPVDRAELYA 905
+ + F+ I++ P +P++ ++A E G YL KP D EL
Sbjct: 59 DENAFDLLPRIKKA-----RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIG 113

Query: 906 TLERLLGQPSRQ 917
+ R L +P R+
Sbjct: 114 IIGRALAEPKRR 125


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01514  SACTRNSFRASE  35     3e-04    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 35.3 bits (81), Expect = 3e-04
Identities = 15/53 (28%), Positives = 19/53 (35%)

Query: 197 LAVDPQCSRPGVGEALVRHLVEHFMSRELAYLDLSVLHNNQQAKALYRKLGFR 249
+AV + GVG AL+ +E L L N A Y K F
Sbjct: 95 IAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01515  ANTHRAXTOXNA  33     0.003    Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 33.2 bits (75), Expect = 0.003
Identities = 25/68 (36%), Positives = 38/68 (55%), Gaps = 7/68 (10%)

Query: 192 APHTLLEGVKKLPPATW-MSVDLDGSCEQRTWWT---LDYG--PRPDERELTLDDWQERV 245
AP +L E K++P W V+ S E++ T + YG +PD + TL +WQ+++
Sbjct: 498 AP-SLTEIKKQIPQKEWDKVVNTPNSLEKQKGVTNLLIKYGIERKPDSTKGTLSNWQKQM 556

Query: 246 LDGLREAV 253
LD L EAV
Sbjct: 557 LDRLNEAV 564


96  PAKAF_01568  PAKAF_01575  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_01568  1       14       3.497981   heme acquisition protein HasAp
PAKAF_01569  1       14       3.098247   transport protein HasD
PAKAF_01570  2       14       1.637402   metalloprotease secretion protein
PAKAF_01571  2       15       0.992899   probable outer membrane protein precursor
PAKAF_01572  1       15       0.333604   phosphate-starvation-inducible protein PsiE
PAKAF_01573  1       16       0.051477   biotin/lipoyl-binding protein
PAKAF_01574  2       16       0.149579   ABC transporter permease
PAKAF_01575  3       17       1.068475   putative ABC-type multidrug transport system,
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_01568  PF06438  276    1e-97    Heme acquisition protein HasAp
		>PF06438#Heme acquisition protein HasAp

Length = 205

Score = 276 bits (706), Expect = 1e-97
Identities = 204/205 (99%), Positives = 205/205 (100%)

Query: 1 MSISISYSTTYSGWTVADYLADWSAYFGDVNHRPGQVVDGSNTGGFNPGPFDGSQYALKS 60
MSISISYSTTYSGWTVADYLADWSAYFGDVNHRPGQVVDGSNTGGFNPGPFDGSQYALKS
Sbjct: 1 MSISISYSTTYSGWTVADYLADWSAYFGDVNHRPGQVVDGSNTGGFNPGPFDGSQYALKS 60

Query: 61 TASDAAFIAGGDLHYTLFSNPSHTLWGKLDSIALGDTLTGGASSGGYALDSQEVSFSNLG 120
TASDAAFIAGGDLHYTLFSNPSHTLWGKLDSIALGDTLTGGASSGGYALDSQEVSFSNLG
Sbjct: 61 TASDAAFIAGGDLHYTLFSNPSHTLWGKLDSIALGDTLTGGASSGGYALDSQEVSFSNLG 120

Query: 121 LDSPIAQGRDGTVHKVVYGLMSGDSSALQGQIDALLKAVDPSLSINSTFDQLAAAGVAHA 180
LDSPIAQGRDGTVHKVVYGLMSGDSSALQGQIDALLKAVDPSLSINSTFDQLAAAGVAHA
Sbjct: 121 LDSPIAQGRDGTVHKVVYGLMSGDSSALQGQIDALLKAVDPSLSINSTFDQLAAAGVAHA 180

Query: 181 TPAAAAAEIGVVGVQELPHDLALAA 205
TPAAAAAE+GVVGVQELPHDLALAA
Sbjct: 181 TPAAAAAEVGVVGVQELPHDLALAA 205


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
PAKAF_01570  RTXTOXIND  417    e-145    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 417 bits (1073), Expect = e-145
Identities = 96/435 (22%), Positives = 170/435 (39%), Gaps = 8/435 (1%)

Query: 15 AALELDEK---RFSRLGWGLVLLGFVGFLLWAGLAPLDKGVGVSGTVMVAGSRKAVQHPT 71
A LEL E R RL ++ V + + L ++ +G + +G K ++
Sbjct: 44 AHLELIETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIE 103

Query: 72 GGLVRHIRVHEGERVEAGQVLLEMDATQARAQADGLFAQYLAALASLARLSAERDEKARI 131
+V+ I V EGE V G VLL++ A A A + L A R
Sbjct: 104 NSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELN 163

Query: 132 EFPAELLALDDPRLPTLLEQQ----RQLHDSRRRALRLELDGLAETVAGSQAQLDGLQAA 187
+ P EL D+P + E++ L + + + + +A+ + A
Sbjct: 164 KLP-ELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLAR 222

Query: 188 LRSKEQQRAALEEQLRGLRQLASEGYVPRNRLLDSERLLAQVNGEIAGDLGSLGSTRRQI 247
+ E + +L L + + ++ +L+ E + E+ L +I
Sbjct: 223 INRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEI 282

Query: 248 LELRLRMAQRREKFQEEVRASLADAQVRAEELRNRLASARFDLANSEVRAPVAGLVVGQE 307
L + + F+ E+ L L LA S +RAPV+ V +
Sbjct: 283 LSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLK 342

Query: 308 VFTEGGVIAPGQQLMEILPERQPLLVDARLPVEMVDKVRVGLPVELMFSAFNQSTTPRVE 367
V TEGGV+ + LM I+PE L V A + + + + VG + AF + +
Sbjct: 343 VHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLV 402

Query: 368 GEVTLVSADRLLDERSEAPYYRVRIRVGEEGVRRLAGLEIRPGMPVEAFVRSGERSLLNY 427
G+V ++ D + D+R + + + + GM V A +++G RS+++Y
Sbjct: 403 GKVKNINLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISY 462

Query: 428 LFKPLADRTHLALGE 442
L PL + +L E
Sbjct: 463 LLSPLEESVTESLRE 477


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
PAKAF_01571  RTXTOXIND  32     0.007    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 31.7 bits (72), Expect = 0.007
Identities = 20/171 (11%), Positives = 49/171 (28%), Gaps = 11/171 (6%)

Query: 60 LPSLRYDYNKARNDSTVSQGDARVERDYRSYASTLSLEQPLFDYEAYARYRQ-GEAQAL- 117
L +L + + + S++ Q R S + P ++ E + L
Sbjct: 127 LTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLR 186

Query: 118 ---FADEQFRGRSQELA---VRLFAAYSETLFAREQVVLAEAQRRALETQLAFNQRAFEE 171
EQF + + L +E L ++ E R +++L +
Sbjct: 187 LTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHK 246

Query: 172 GEGTRTDLLE---TRARLSLTRAEEIAASDRAAAARRTLEAMLGQALEDRE 219
+ +LE + ++ + + + + +
Sbjct: 247 QAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFK 297


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
PAKAF_01573  RTXTOXIND  56     6e-11    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 56.0 bits (135), Expect = 6e-11
Identities = 25/161 (15%), Positives = 59/161 (36%), Gaps = 17/161 (10%)

Query: 41 IVSSKAKGRVQVLHVRRGDEVKQGDLLISLDSPELEAQLDALHAARNQAQAQLDESLHGT 100
+ V+ + V+ G+ V++GD+L+ L + EA ++ QA+ + +
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILS 157

Query: 101 REESIRALKASLAQAEAELRNAESDFQRNQQMVERGFLSRTQFDLSRRERDVARDRVAEA 160
R + L E +N ++++ L + QF + ++ + +
Sbjct: 158 RSIELNKLPELKLPDEPYFQNV-----SEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKK 212

Query: 161 RANLDEGLKGDREERRQALQAAVRRADAQIAELQAQIDDLQ 201
RA + A + R + ++++DD
Sbjct: 213 RAER------------LTVLARINRYENLSRVEKSRLDDFS 241



Score = 52.9 bits (127), Expect = 7e-10
Identities = 29/205 (14%), Positives = 77/205 (37%), Gaps = 24/205 (11%)

Query: 75 LEAQLDALHAARNQAQAQLDESLHGTREESIRALKASLAQAEAELRNAESDFQRNQQMVE 134
++ Q + Q + LD+ + + A + + E R +S ++
Sbjct: 191 IKEQFSTWQNQKYQKELNLDK-----KRAERLTVLARINRYENLSRVEKSRLDDFSSLLH 245

Query: 135 RGFLSRTQFDLSRRERDVARDRVAEARANLDE------GLKGDREERRQALQAAV----R 184
+ +++ + A + + ++ L++ K + + Q + + R
Sbjct: 246 KQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLR 305

Query: 185 RADAQIAELQAQI----DDLQ---VRAPVNGEVGPIPA-EQGELINAYSPLLTLVRLDDS 236
+ I L ++ + Q +RAPV+ +V + +G ++ L+ +V DD+
Sbjct: 306 QTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDT 365

Query: 237 YFV-FNLREDILAKVRKGDRIVMQV 260
V ++ + + G +++V
Sbjct: 366 LEVTALVQNKDIGFINVGQNAIIKV 390


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01575  ABC2TRNSPORT  28     0.039    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 28.4 bits (63), Expect = 0.039
Identities = 27/122 (22%), Positives = 50/122 (40%), Gaps = 1/122 (0%)

Query: 246 LGYRQSASFFMLLGIVLPFLIAVIALSEFIAELLPTEESVYLTMTFITLPLFYMAGYSWP 305
LGY Q S L ++ +A +L + L P+ + T + P+ +++G +P
Sbjct: 139 LGYTQWLSLLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFP 198

Query: 306 EQAMPDWVRWLADAIPSTWAIRAIAEMNQMDLPLREVSDHALVLLGMAATYALLGTLLYQ 365
+P + A +P + +I I + + P+ +V H L L T L +
Sbjct: 199 VDQLPIVFQTAARFLPLSHSIDLIRPI-MLGHPVVDVCQHVGALCIYIVIPFFLSTALLR 257

Query: 366 YR 367
R
Sbjct: 258 RR 259


97  PAKAF_01779  PAKAF_01783  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_01779  2       16       0.690112   probable two-component sensor
PAKAF_01780  1       13       0.282899   hypothetical protein
PAKAF_01781  0       14       0.247557   two-component response regulator CpxR
PAKAF_01782  1       14       -0.525603  hypothetical protein
PAKAF_01783  3       11       0.057326   YciI family protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_01779  PF06580  29     0.026    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.4 bits (66), Expect = 0.026
Identities = 16/100 (16%), Positives = 35/100 (35%), Gaps = 17/100 (17%)

Query: 341 VDNLLRNAVRFNPVGQPLEVRASSAGDYLRLSVRDHGPGIAAELQEQLGEPFFRAPNQSS 400
V+N +++ + P G + ++ + + L V + G +E
Sbjct: 264 VENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE-------------- 309

Query: 401 PGHGLGLA-IARRAIERHGGHLRLG-NHPDGGFIATLSLP 438
G GL + R +G ++ + G A + +P
Sbjct: 310 -STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
PAKAF_01781  HTHFIS  103    9e-28    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 103 bits (258), Expect = 9e-28
Identities = 42/117 (35%), Positives = 63/117 (53%)

Query: 4 LLLIDDDRELCELLGTWLVQEGFSVRASHDGAQARRALAEQTPDAVVLDVMLPDGSGLEL 63
+L+ DDD + +L L + G+ VR + + A R +A D VV DV++PD + +L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 64 LKQLRGDHPDLPVLMLSARGEPLDRILGLELGADDYLAKPCDPRELTARLRAVLRRT 120
L +++ PDLPVL++SA+ + I E GA DYL KP D EL + L
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
PAKAF_01782  IGASERPTASE  28     0.010    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 28.1 bits (62), Expect = 0.010
Identities = 16/76 (21%), Positives = 31/76 (40%), Gaps = 8/76 (10%)

Query: 22 EEPAPAPIPAAQPSITQATAELERRLVETERQRDELVSRMRQENRQLREQ--------LQ 73
E P P P PA T+ AE ++ +T + ++ + +NR++ ++ Q
Sbjct: 1022 EAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQ 1081

Query: 74 AAQAQRQPPLLTEEQT 89
+ + E QT
Sbjct: 1082 TNEVAQSGSETKETQT 1097


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
PAKAF_01783  adhesinmafb  30     9e-04    Neisseria meningitidis: adhesin MafB signature.
		>adhesinmafb#Neisseria meningitidis: adhesin MafB signature.

Length = 467

Score = 30.4 bits (68), Expect = 9e-04
Identities = 13/45 (28%), Positives = 18/45 (40%)

Query: 53 AAGFTGSLIVAEFDSLAAAQSWAEADPYRAAGVYAEVVVKPFKKV 97
G GS+ E ++ A W + +P A V A V KV
Sbjct: 278 VIGGLGSVAGFEKNTREAVDRWIQENPNAAETVEAVFNVAAAAKV 322


98  PAKAF_01835  PAKAF_01843  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_01835  0       22       -4.206802  NAD dependent epimerase/dehydratase family-like
PAKAF_01836  -1      16       -3.464148  putative group 4 glycosyl
PAKAF_01837  0       8        -2.378537  nucleotide sugar epimerase/dehydratase WbpM
PAKAF_01838  -1      8        -1.586330  ComEA family DNA-binding protein
PAKAF_01840  -1      9        -1.633009  *probable amino acid aminotransferase
PAKAF_01841  -1      10       -1.479354  excinuclease ABC subunit B
PAKAF_01842  0       9        -0.687250  probable major facilitator superfamily (MFS)
PAKAF_01843  -1      12       -0.052014  probable secretion protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01835  NUCEPIMERASE  66     3e-14    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 66.0 bits (161), Expect = 3e-14
Identities = 67/356 (18%), Positives = 120/356 (33%), Gaps = 69/356 (19%)

Query: 5 NVLVTGATGFIGAALVNSLCSSGQ-----------YKVWAGCRRRGGAWPRGVTP----L 49
LVTGA GFIG + L +G Y V R G L
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 50 LLGELGSSVVWDAESAIDTVVHCAARVHV-MSETASDPLVEFRKANVQGT---LDLAREA 105
E + + + V R+ V S + +N+ G L+ R
Sbjct: 62 ADREGMTDLFASGH--FERVFISPHRLAVRYSLENPHAYAD---SNLTGFLNILEGCRHN 116

Query: 106 VSRGVRRFIFISSIKVNGEGTEPGRPY-TADSPPNPVDPYGVSKREAEQALLDLAEETGL 164
++ ++ SS V G + P+ T DS +PV Y +K+ E + GL
Sbjct: 117 ---KIQHLLYASSSSVYGLNRKM--PFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGL 171

Query: 165 EVVIIRPVLVYGPGVKAN--VQTMMRWLKRGVPLPL-GAIHNRRSLVSLDNLVDLIITCI 221
+R VYGP + + + + + G + + +R +D++ + II
Sbjct: 172 PATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQ 231

Query: 222 EHPA-----------------AVGQVFLVSDGEDLSTTELLRRMGRALGAPAR--LLPVP 262
+ A +V+ + + + + ++ + ALG A+ +LP+
Sbjct: 232 DVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQ 291

Query: 263 ASWIGAAAKVLNRQAFARRLCGSLQVDIMKTRQVLGWTPPVGVDQALEKTARSFLD 318
V + A D +V+G+TP V ++ + D
Sbjct: 292 ------PGDV--LETSA---------DTKALYEVIGFTPETTVKDGVKNFVNWYRD 330


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01837  NUCEPIMERASE  57     8e-11    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 57.1 bits (138), Expect = 8e-11
Identities = 46/292 (15%), Positives = 103/292 (35%), Gaps = 56/292 (19%)

Query: 301 VMVTGAGGSIGSELCRQIMSCSPSVLILFEHSEYNLYSIHQELERRIKRESLSVNLLPIL 360
+VTGA G IG + ++++ V+ + ++Y S+ Q + +
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQP----GFQFHK 58

Query: 361 GSVRNPERLVDVMRTWKVNTVYHAAAYKHVPIVEHNIAEGVLNNVIGTLHAVQAAVQVGV 420
+ + E + D+ + V+ + V N +N+ G L+ ++ +
Sbjct: 59 IDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKI 118

Query: 421 QNFVLIST---------------DKAVRPTNVMGSTKRLAEMVLQALSNESAPVLFGDRK 465
Q+ + S+ D P ++ +TK+ E++ S
Sbjct: 119 QHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYS------------ 166

Query: 466 DVHHVNKTRFTMVRFGNVLGSSGS---VIPLFREQIKRGGPVTV-THPSITRYFMTIPEA 521
H+ T +RF V G G + F + + G + V + + R F I +
Sbjct: 167 ---HLYGLPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDI 223

Query: 522 AQLVIQA----------GSMGQGGD--------VFVLDMGPPVKILELAEKM 555
A+ +I+ ++ G V+ + PV++++ + +
Sbjct: 224 AEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQAL 275


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_01842  TCRTETB  109    1e-27    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 109 bits (273), Expect = 1e-27
Identities = 92/412 (22%), Positives = 169/412 (41%), Gaps = 24/412 (5%)

Query: 34 WIAVLSAMLGAFMAVLDIQITNSSLKDIQGALAATLEEGSWISTSYLVAEIIMIPMTAWL 93
W+ +LS F +VL+ + N SL DI +W++T++++ I + L
Sbjct: 18 WLCILS-----FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKL 72

Query: 94 VQLLSARRLAVMISVGFLVSSLLCSFAWNLESMIVF-RAMQGFTGGALIPLAFTLALVKL 152
L +RL + + S++ + S+++ R +QG A L + +
Sbjct: 73 SDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYI 132

Query: 153 PEHHRPKGMALFAITATFAPSIGPTLGGWLTENFGWEYIFYINVPPGLLMIAGLLYGLEK 212
P+ +R K L +GP +GG + W Y+ I + ++ + L+ L+K
Sbjct: 133 PKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMIT-IITVPFLMKLLKK 191

Query: 213 KAPHWELLKSTDYAGIVTLGIGLGCLQVFLEEGHRKDWLESQLIVSLGSVALFSLVLFVI 272
+ D GI+ + +G+ +F + S LIVS + S ++FV
Sbjct: 192 EVRIKGHF---DIKGIILMSVGIVFFMLFT-----TSYSISFLIVS-----VLSFLIFVK 238

Query: 273 LQLSRPNPLIDLGILRNRNFGLASISSIGLGMGLYGSIYVLPLYLAQIQGYNAMQIGEVI 332
+P +D G+ +N F + + + + G + ++P + + + +IG VI
Sbjct: 239 HIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVI 298

Query: 333 MWMGIPQLFLIPLVPKLMKLVSPR-LLCAAGFGLFGLASFFSGVLNPDFAGPQFNQIQLL 391
++ G + +I LV R L G+ L+ F F I ++
Sbjct: 299 IFPG--TMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIV 356

Query: 392 RALG-QPMIMVTISLIATAYLQPQDAGSASSLFNILRNLGGAIGIALLATLL 442
LG IS I ++ L+ Q+AG+ SL N L GIA++ LL
Sbjct: 357 FVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLL 408


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
PAKAF_01843  RTXTOXIND  183    8e-56    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 183 bits (465), Expect = 8e-56
Identities = 73/417 (17%), Positives = 144/417 (34%), Gaps = 96/417 (23%)

Query: 7 RRLTVFLVAVGLIALAFFLHWWFIGRHVESTDNAYVQGEIT------RVASQLGARVEEV 60
R + F++ +IA + VE A G++T + + V+E+
Sbjct: 58 RLVAYFIMGFLVIAFI-----LSVLGQVEIV--ATANGKLTHSGRSKEIKPIENSIVKEI 110

Query: 61 LVRDNQHVDKGQLLVRLEDADF--------------KLAVERAQA--------------- 91
+V++ + V KG +L++L +L R Q
Sbjct: 111 IVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKL 170

Query: 92 -----------------------ALATREAELAQARSKLVQQGSLIAASAADVNASQATL 128
+T + + Q L ++ + A +N +
Sbjct: 171 PDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLS 230

Query: 129 GRAQIDLNRAEALRKPGYVS-------EERVTTLTADNHVARSQL---------AKARAD 172
+ L+ +L ++ E + + V +SQL AK
Sbjct: 231 RVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQ 290

Query: 173 LEAQRVQRDTLGAEIKRLEAQIASARTELAQAEINLSRTLIHSPINGLVGQRSAR-NGQY 231
L Q + + L ++++ I ELA+ E ++I +P++ V Q G
Sbjct: 291 LVTQLFKNEILD-KLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGV 349

Query: 232 VQVGTHLLSLVPDED-IWVQANFKETQVGRMRDGQKARLTFDAFPDT---PIDGRIDSLF 287
V L+ +VP++D + V A + +G + GQ A + +AFP T + G++ ++
Sbjct: 350 VTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNIN 409

Query: 288 AASGAQFSLLPPDNATGNFTKVVQRIPVKIVFEADNPLHGRIRPGMSVEAEVELRDR 344
+ D G V+ I + + + + GM+V AE++ R
Sbjct: 410 LDA-------IEDQRLGLVFNVIISIEENCLSTGNKNI--PLSSGMAVTAEIKTGMR 457


99  PAKAF_01873  PAKAF_01899  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_01873  1       9        -0.268775  SPOR domain-containing protein
PAKAF_01874  1       8        -0.761607  colicin V production family protein
PAKAF_01875  2       8        -0.228228  amidophosphoribosyltransferase
PAKAF_01876  0       6        0.762395   o-succinylhomoserine sulfhydrylase
PAKAF_01877  0       7        0.327879   probable short-chain dehydrogenase
PAKAF_01878  -1      8        0.429721   general secretion pathway protein D
PAKAF_01879  1       11       1.434992   secretion protein XcpP
PAKAF_01880  1       14       1.918256   general secretion pathway protein E
PAKAF_01881  0       15       1.340459   general secretion pathway protein F
PAKAF_01882  2       15       2.041600   general secretion pathway protein G
PAKAF_01883  4       16       2.825119   General secretion pathway outer membrane protein
PAKAF_01884  3       12       2.730901   general secretion pathway protein I
PAKAF_01885  2       11       2.666996   general secretion pathway protein J
PAKAF_01886  -1      10       2.111778   general secretion pathway protein K
PAKAF_01887  0       9        2.456004   general secretion pathway protein L
PAKAF_01888  -2      9        2.330639   general secretion pathway protein M
PAKAF_01892  -2      9        2.452968   ***probable transcriptional regulator
PAKAF_01893  -1      9        1.936239   carbon-nitrogen hydrolase family protein
PAKAF_01894  -1      8        2.022271   2,4-dienoyl-CoA reductase FadH1
PAKAF_01895  0       7        2.197078   chromosome partitioning protein ParA
PAKAF_01896  1       8        1.739448   PA3090 ortholog, hypothetical protein
PAKAF_01897  0       8        0.502693   DUF1853 family protein
PAKAF_01898  -1      10       -0.796871  NAD(+) kinase
PAKAF_01899  -1      11       -1.075856  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
PAKAF_01873  PERTACTIN  30     0.006    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 30.5 bits (68), Expect = 0.006
Identities = 28/85 (32%), Positives = 31/85 (36%), Gaps = 1/85 (1%)

Query: 84 AAGQPSQPIGGLPATPPATQPPAQAQAQAPAASLPPSQPQPPAAPPSPPPA-EKRLDANN 142
A P+ P P QPP Q P P Q QP A P PP E AN
Sbjct: 565 AKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAGRELSAAANA 624

Query: 143 LPQSWSVQLASLSNRARAEELQKTL 167
+ V LAS A + L K L
Sbjct: 625 AVNTGGVGLASTLWYAESNALSKRL 649


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01877  DHBDHDRGNASE  90     2e-24    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 90.1 bits (223), Expect = 2e-24
Identities = 56/198 (28%), Positives = 81/198 (40%), Gaps = 15/198 (7%)

Query: 1 MDVAQEGQVAMSVAEVLGQFGRLDGLVCNAAIANPRNTPLEALSLGEWTRTLAVNLTGPM 60
DV + A + + G +D LV A + P + +LS EW T +VN TG
Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRP--GLIHSLSDEEWEATFSVNSTGVF 121

Query: 61 LLAKYCTPYLRAH-NGAIVNIASTRAHQSEPDSEAYAASKGGLLALTHALAASLGPE-IR 118
++ + Y+ +G+IV + S A AYA+SK + T L L IR
Sbjct: 122 NASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIR 181

Query: 119 VNALSPG----------WIDTREAAEREAAPLTELDHDQHLVGRVGTVEDVASLVAWLLS 168
N +SPG W D A + L L ++ D+A V +L+S
Sbjct: 182 CNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPL-KKLAKPSDIADAVLFLVS 240

Query: 169 EDAGFVTGQEFLVDGGMT 186
AG +T VDGG T
Sbjct: 241 GQAGHITMHNLCVDGGAT 258


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01878  BCTERIALGSPD  594    0.0      Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 594 bits (1533), Expect = 0.0
Identities = 217/631 (34%), Positives = 345/631 (54%), Gaps = 35/631 (5%)

Query: 41 AFVPAGNQQEAHWTINLKDADIREFIDQISEITGETFVVDPRVKGQVSVVSKAQLSLSEV 100
F PA ++ ++ + K DI+EFI+ +S+ +T ++DP V+G ++V S L+ +
Sbjct: 21 LFRPAAAEE---FSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDMLNEEQY 77

Query: 101 YQLFLSVMSTHGFTVVAQGDQA-RIVPNAEAKTEAG--GGQSAP---DRLETRVIQVQQS 154
YQ FLSV+ +GF V+ + ++V + +AKT A +AP D + TRV+ +
Sbjct: 78 YQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVVTRVVPLTNV 137

Query: 155 PVSELIPLIRPLVPQYGHLAAV--PSANALIISDRSANIARIEDVIRQLDQKGSHDYSVI 212
+L PL+R L G + V +N L+++ R+A I R+ ++ ++D G +
Sbjct: 138 AARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGDRSVVTV 197

Query: 213 NLRYGWVMDAAEV---LNNAMSRGQAKGAAGAQVIADARTNRLIILGPPQARAKLVQLAQ 269
L + D ++ LN S+ G+ A V+AD RTN +++ G P +R +++ + +
Sbjct: 198 PLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSRQRIIAMIK 257

Query: 270 SLDTPTARSANTRVIRLRHNDAKTLAETLGQISEGMKNNGGQGGEQTGGGRPSNILIRAD 329
LD A NT+VI L++ A L E L IS M++ + NI+I+A
Sbjct: 258 QLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPVAALDK--NIIIKAH 315

Query: 330 ESTNALVLLADPDTVNALEDIVRQLDVPRAQVLVEAAIVEISGDIQDAVGVQWAINKGGM 389
TNAL++ A PD +N LE ++ QLD+ R QVLVEA I E+ +G+QWA GM
Sbjct: 316 GQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWANKNAGM 375

Query: 390 GGTKTNFANTGLSIGTLLQSLESNKAPESIP----------DGAIVGIGSSSFGALVTAL 439
T F N+GL I T + ++ +G G ++ L+TAL
Sbjct: 376 ----TQFTNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQGNWAMLLTAL 431

Query: 440 SANTKSNLLSTPSLLTLDNQKAEILVGQNVPFNTGSYTTNSEGASNPFTTVERKDIGVSL 499
S++TK+++L+TPS++TLDN +A VGQ VP TGS TT+ + N F TVERK +G+ L
Sbjct: 432 SSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGSQTTSGD---NIFNTVERKTVGIKL 488

Query: 500 KVTPHINDGAALRLEIEQEISALLPNAQQRNNT-DLITSKRSIKSTILAENGQVIVIGGL 558
KV P IN+G ++ LEIEQE+S++ A ++ + R++ + +L +G+ +V+GGL
Sbjct: 489 KVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGL 548

Query: 559 IQDDVSQAESKVPLLGDIPLLGRLFRSTKDTHTKRNLMVFLRPTVVRDSAGLAALSGKKY 618
+ VS KVPLLGDIP++G LFRST +KRNLM+F+RPTV+RD S +Y
Sbjct: 549 LDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYRQASSGQY 608

Query: 619 SDIR-VIDGTRGPEGRPSILPTNANQLFDGQ 648
+ RG E ++L + +++ Q
Sbjct: 609 TAFNDAQSKQRGKENNDAMLNQDLLEIYPRQ 639


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01879  BCTERIALGSPC  49     3e-09    Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C signature.

Length = 272

Score = 49.2 bits (117), Expect = 3e-09
Identities = 37/148 (25%), Positives = 57/148 (38%), Gaps = 19/148 (12%)

Query: 32 APALLAVALIIAMSISLAWQAAG--WLRLQRSPVAVAASPVSHESIRSDPTRLAR--LFG 87
+P+++ L + + Q A W V++ ++ R P L LFG
Sbjct: 10 SPSVIRRILFYLLMLLFCQQLAMIFWRIGLPDNAPVSSVQITPAQARQQPVTLNDFTLFG 69

Query: 88 TSAQDPNAPP----------PATNLDLVLKGSFVQSDPKLSSAIIQRQGDKPHRYAVGGE 137
S + N P + L+L L G D S AII + ++ V E
Sbjct: 70 VS-PEKNKAGALDASQMSNLPPSTLNLSLTGVMAGDDDSRSIAIISKDNEQ-FSRGVNEE 127

Query: 138 ISDG--VKLHAVYRDRVELQRGGRLESL 163
+ G K+ ++ DRV LQ GR E L
Sbjct: 128 V-PGYNAKIVSIRPDRVVLQYQGRYEVL 154


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01881  BCTERIALGSPF  501    e-180    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 501 bits (1292), Expect = e-180
Identities = 213/406 (52%), Positives = 278/406 (68%), Gaps = 2/406 (0%)

Query: 1 MAAFEYLALDPSGRQQKGVLEADSARQVRQLLRERQLAPLDVKPTRTREQSGQGGRLTFA 60
MA + Y ALD G++ +G EADSARQ RQLLRER L PL V R +Q L+
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 61 RG--LSARDLALVTRQLATLVQAALPIEEALRAAAAQSTSQRIQSMLLAVRAKVLEGHSL 118
R LS DLAL+TRQLATLV A++P+EEAL A A QS + ++ AVR+KV+EGHSL
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 119 AGSLREFPTAFPELYRATVAAGEHAGHLGPVLEQLADYTEQRQQSRQKIQLALLYPVILM 178
A +++ FP +F LY A VAAGE +GHL VL +LADYTEQRQQ R +IQ A++YP +L
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 179 VASLAIVGFLLGYVVPDVVRVFIDSGQTLPLLTRVLIGVSDWVKAWGALAFVAAIGGVIG 238
V ++A+V LL VVP VV FI Q LPL TRVL+G+SD V+ +G +A + G +
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240

Query: 239 FRYALRKDAFRERWHGFLLRVPLVGRLVRSTDTARFASTLAILTRSGVPLVEALAIAAEV 298
FR LR++ R +H LL +PL+GR+ R +TAR+A TL+IL S VPL++A+ I+ +V
Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 299 IANRIIRNEVVKAAQKVREGASLTRSLEATGQFPPMMLHMIASGERSGELDQMLARTARN 358
++N R+ + A VREG SL ++LE T FPPMM HMIASGERSGELD ML R A N
Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360

Query: 359 QENDLAAQIGLMVGLFEPFMLIFMGAVVLVIVLAILLPILSLNQLV 404
Q+ + ++Q+ L +GLFEP +++ M AVVL IVLAIL PIL LN L+
Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01882  BCTERIALGSPG  192    9e-67    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 192 bits (490), Expect = 9e-67
Identities = 68/127 (53%), Positives = 86/127 (67%), Gaps = 3/127 (2%)

Query: 1 MVVVVILGILAALVVPQVMSRPDQAKVTVAKGDIKAIAAALDMYKLDNFAYPSTQQGLEA 60
MVV+VI+G+LA+LVVP +M ++A A DI A+ ALDMYKLDN YP+T QGLE+
Sbjct: 16 MVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDNHHYPTTNQGLES 75

Query: 61 LVKKPTGNPQPKNWNKDGYLKKLPVDPWGNPYQYLAPGTKGPFDLYSLGADGKEGGSDND 120
LV+ PT P N+NK+GY+K+LP DPWGN Y + PG G +DL S G DG+ G D
Sbjct: 76 LVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLLSAGPDGEMGTED-- 133

Query: 121 ADIGNWD 127
DI NW
Sbjct: 134 -DITNWG 139


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01883  BCTERIALGSPH  143    3e-46    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 143 bits (361), Expect = 3e-46
Identities = 50/183 (27%), Positives = 85/183 (46%), Gaps = 32/183 (17%)

Query: 5 RGFTLIELMVVMVIISVLIGLAVLSTGFASTSRELDSEAERLAGL---IGVLTDEAVLDN 61
RGFTL+E+M++++++ V G+ +L+ SR+ DS A+ LA + + +
Sbjct: 4 RGFTLLEMMLILLLMGVSAGMVLLAFP---ASRD-DSAAQTLARFEAQLRFVQQRGLQTG 59

Query: 62 REYGLRLERDAYQVLRY------DEAKA-------RWLPVARDSHRLPEWAELTFELDGQ 108
+ +G+ + D +Q L D A A RWLP+ + + G
Sbjct: 60 QFFGVSVHPDRWQFLVLEARDGADPAPADDGWSGYRWLPLRAGR------VATSGSIAGG 113

Query: 109 PLVLAGSKGEKEQKKGTDQPQLLILSSGELSPFRLRLAERGPEGRALSLSSDGFRLPRVE 168
L LA ++GE D P +LI GE++PFRL L E ++ ++ G LP +
Sbjct: 114 KLNLAFAQGEAWTPG--DNPDVLIFPGGEMTPFRLTLG----EAPGIAFNARGESLPEPQ 167

Query: 169 VAR 171
A+
Sbjct: 168 EAQ 170


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01884  BCTERIALGSPG  35     2e-05    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 35.2 bits (81), Expect = 2e-05
Identities = 13/68 (19%), Positives = 30/68 (44%), Gaps = 4/68 (5%)

Query: 1 MKRARGFTLLEVLVALAIF----AMVAASVLSASARSLQNASRLEDKTLAMWIADNRLNE 56
+ RGFTLLE++V + I ++V +++ ++ + + + L + +L+
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDN 63

Query: 57 LQLEQTPP 64
T
Sbjct: 64 HHYPTTNQ 71


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01885  PilS_PF08805  36     7e-05    PilS N terminal
		>PilS_PF08805#PilS N terminal

Length = 185

Score = 35.7 bits (82), Expect = 7e-05
Identities = 11/45 (24%), Positives = 24/45 (53%)

Query: 1 MRLQRGFTLLELLIAIAIFALLALATYRMFDSVMQTDQATRVQEQ 45
+G TL+E+L+ + + +LA + Y+++ V Q++ Q
Sbjct: 22 KEQDKGATLMEVLLVVGVIVVLAASAYKLYSMVQSNIQSSNEQNN 66


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01888  PYOCINKILLER  28     0.027    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 27.8 bits (61), Expect = 0.027
Identities = 23/113 (20%), Positives = 33/113 (29%), Gaps = 5/113 (4%)

Query: 55 AERHLQSARQYFTEQRALHAYIQQQAPNVRQADAAAPQAQIDPAALQGMVTASAAQAGLS 114
A+R + + RA + Y +V A Q+ A S A A L
Sbjct: 230 AKRKAEEQARQQAAIRAANTYAMPANGSVVATAAGRGLIQVAQGAASLAQAISDAIAVLG 289

Query: 115 VERLDNEGEGAVQVALQPAPFAKLLPWLEQLNGQ-----GVQVAEAGLDRQVD 162
AV A W +Q G+ A+ GL V+
Sbjct: 290 RVLASAPSVMAVGFASLTYSSRTAEQWQDQTPDSVRYALGMDAAKLGLPPSVN 342


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
PAKAF_01895  RTXTOXIND  50     1e-08    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 50.2 bits (120), Expect = 1e-08
Identities = 33/206 (16%), Positives = 67/206 (32%), Gaps = 18/206 (8%)

Query: 165 AAVEPQRLQMAAEEQWYAAGPAAPKAPPAEPPRKQEDEQTARLAQLVKQQRQQLAALARQ 224
A +E R Q+ + P K P + +E+ RL L+K+Q Q
Sbjct: 146 ARLEQTRYQILSRSIELNKLPEL-KLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQ 204

Query: 225 QEQRVAGLARQHEEELARREQDARGQLDILRSEVLSLQQALERQARENAELQQRLLEQGE 284
+E + + + R + +S + + A + +LEQ
Sbjct: 205 KELNLDKKRAE-RLTVLARINRYENLSRVEKSRL----DDFSSLLHKQAIAKHAVLEQ-- 257

Query: 285 QFQRNREELTRQLRFIENQGRNETDLLRSEFADELEARVAAAVAGYKEQVSIRDVELAYR 344
+ E +LR +++ + + SE + +K ++ +L
Sbjct: 258 --ENKYVEAVNELR----VYKSQLEQIESEIL-SAKEEYQLVTQLFKNEILD---KLRQT 307

Query: 345 NELDQQLEQELAELRAERDRLAAQGP 370
+ L ELA+ + + P
Sbjct: 308 TDNIGLLTLELAKNEERQQASVIRAP 333


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits      Score  E-value  Comments
PAKAF_01897  FLGFLIJ   29     0.019    Flagellar FliJ protein signature.
		>FLGFLIJ#Flagellar FliJ protein signature.

Length = 147

Score = 28.6 bits (63), Expect = 0.019
Identities = 16/37 (43%), Positives = 23/37 (62%), Gaps = 2/37 (5%)

Query: 36 QRHPLAASRWRQEPERLAAWLREQERQPQHLAAWLAQ 72
Q+ +A + WR++ +RL AW QERQ AA LA+
Sbjct: 92 QKVDIALNSWREKKQRLQAWQTLQERQST--AALLAE 126


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_01898  PF06057  29     0.028    Type IV secretory pathway VirJ component
		>PF06057#Type IV secretory pathway VirJ component

Length = 243

Score = 28.7 bits (64), Expect = 0.028
Identities = 13/58 (22%), Positives = 23/58 (39%), Gaps = 7/58 (12%)

Query: 65 LVVVVGGDGSML----GAARALARHKVPVLGINRGSLG-FLTDIRPDELEAKVGEVLD 117
LV+ + GDG L + PV+G + SL + P ++ ++D
Sbjct: 53 LVIFLSGDGGWATLDKAVGGILQQQGWPVVGWS--SLKYYWKQKDPKDVTQDTLAIID 108


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_01899  ANTHRAXTOXNA  31     0.008    Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 30.9 bits (69), Expect = 0.008
Identities = 13/36 (36%), Positives = 19/36 (52%)

Query: 231 GDIVFQPDALPEAIAREPLSEEQKSSLLTYGADEPL 266
G+I F L E + LSEE+K+S+ + G P
Sbjct: 102 GEIYFTDIDLVEHKELQDLSEEEKNSMNSRGEKVPF 137


100  PAKAF_01907  PAKAF_01913  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_01907  1       11       1.111753   putative export protein
PAKAF_01908  2       14       2.934152   CprS
PAKAF_01909  2       13       3.675552   CprR
PAKAF_01911  3       12       3.808606   hypothetical protein
PAKAF_01912  2       12       3.327396   protein BatD
PAKAF_01913  2       13       2.681137   VWA domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_01907  ACRIFLAVINRP  71  1e-14  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 71.0 bits (174), Expect = 1e-14
Identities = 36/175 (20%), Positives = 77/175 (44%), Gaps = 11/175 (6%)

Query: 613 IEAATNEVIKQSELII-LVLVYICVAAMCMITFRSFAATLCIVLPLILTSVLGNALMAAL 671
++ + +EV+K L ++LV++ + + ++ ATL + + + + A++AA
Sbjct: 333 VQLSIHEVVK--TLFEAIMLVFLVMY----LFLQNMRATLIPTIAVPVVLLGTFAILAAF 386

Query: 672 GIGVKVATLPVIALGVGIGVDYGIYIYTRLESFLRM-GLPLQEAYYETLRSTGKAVLFTG 730
G + T+ + L +G+ VD I + +E + LP +EA +++ A++
Sbjct: 387 GYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIA 446

Query: 731 LCLAIGVATWIF---SAIKFQADMGLMLTFMLLWNMFGALWLLPALARFLINPAK 782
+ L+ F S + + + ++ AL L PAL L+ P
Sbjct: 447 MVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVS 501



Score = 42.9 bits (101), Expect = 6e-06
Identities = 35/221 (15%), Positives = 80/221 (36%), Gaps = 15/221 (6%)

Query: 251 LITLVLLYWFTKCIRSTIAVLITTLVAVLWQLGLLNLVGFGLDPYSMLVPFLIFAIGISH 310
++ +++Y F + +R+T+ I V +L +L G+ ++ +M L + +
Sbjct: 349 MLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDD 408

Query: 311 GVQKINGIA-LQSSGADNALMAARLTFRQLFLPGMIAILADAVGFITLLVID--IGVI-R 366
+ + + + A + Q+ + + + FI + G I R
Sbjct: 409 AIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYR 468

Query: 367 ELAIGASIGVAVIVFTNLILLPVAISYI--GISKKAVQRSKDDAVREHPFWRLLSNFASP 424
+ +I +A+ V LIL P + + +S + + + + N +
Sbjct: 469 QFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTN 528

Query: 425 KVAPV------SIAIALLMLGGGLWYGKHLKIG---DLDQG 456
V + + I L++ G + L + DQG
Sbjct: 529 SVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQG 569



Score = 33.3 bits (76), Expect = 0.005
Identities = 23/113 (20%), Positives = 46/113 (40%), Gaps = 5/113 (4%)

Query: 626 LIILVLVYICVAAMCMITFRSFAATLCIVLPLILTSVLGNALMAALGIGVKVATLPVIAL 685
I V+V++C+AA+ + S++ + ++L + L V V + +
Sbjct: 877 AISFVVVFLCLAAL----YESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLT 932

Query: 686 GVGIGVDYGIYIYTRLESFLRM-GLPLQEAYYETLRSTGKAVLFTGLCLAIGV 737
+G+ I I + + G + EA +R + +L T L +GV
Sbjct: 933 TIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGV 985


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_01908  PF07675  32  0.005  Cleaved Adhesin
		>PF07675#Cleaved Adhesin

Length = 1358

Score = 32.4 bits (73), Expect = 0.005
Identities = 23/88 (26%), Positives = 36/88 (40%), Gaps = 6/88 (6%)

Query: 64 PAPDSYYFKGSVGTAGLPPKLREMLDTPPYKSIGAMQLLGNWDDDDEEEDDDAPSDDAYV 123
PA + G G P + + K M+ G D D E +DD+P+ Y
Sbjct: 480 PASGKMWIAGDGGNQ--PARYDDFAFEAGKKYTFTMRRAGMGDGTDMEVEDDSPASYTYT 537

Query: 124 VVR--QPLADGKTLYLYDND--AAGSID 147
V R + +G T ++ D AAG+ +
Sbjct: 538 VYRDGTKIKEGLTATTFEEDGVAAGNHE 565


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_01909  HTHFIS  84  5e-21  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 84.1 bits (208), Expect = 5e-21
Identities = 30/129 (23%), Positives = 59/129 (45%)

Query: 3 IHVLVVEDNFDLAGTVIDYLEAAGVVCDHARDGQAGLNLARANRYDVILLDIMLPRINGR 62
+LV +D+ + + L AG + A D+++ D+++P N
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 63 QVCRQLREAGLQTPVLMLTALDTLQDKLDGFDAGADDYLLKPFELPELLVRLQALSRRRS 122
+ ++++A PVL+++A +T + + GA DYL KPF+L EL+ +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 123 GQAQRLQVD 131
+ +L+ D
Sbjct: 124 RRPSKLEDD 132


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_01913  TYPE4SSCAGX  37  2e-04  Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature.

Length = 522

Score = 37.5 bits (86), Expect = 2e-04
Identities = 39/158 (24%), Positives = 73/158 (46%), Gaps = 13/158 (8%)

Query: 340 LMLSLPQPAMAFQFEDLWLRPDQQGQRLLQRGQADEAAKRFEDFRWKGLSLYQARDYAAA 399
L++ P P + + L +++ + Q+ Q D+ KR E+ R K + +
Sbjct: 131 LIVDAPDPK-ELEEQKKALEKEKEAKEQAQKAQKDKREKRKEE-RAKNRA-----NLENL 183

Query: 400 AQAFAQGDQADDHYNRGNALARQGELEAAVDAYEQALERQPQLVAAQRNK-ALVEELLRQ 458
A + ++ N + +Q E E +D E+ + Q Q AQ N +EEL ++
Sbjct: 184 TNAMSNPQNLSNNKNLSELIKQQRENE--LDQMERLEDMQEQ---AQANALKQIEELNKK 238

Query: 459 RQEQAAQQQAGENKEQRQEASQQSPPSGSSQRPPRDAA 496
+ E+A +Q+A + + + SQ+SP S + P D+A
Sbjct: 239 QAEEAVRQRAKDKISIKTDKSQKSPEDNSIELSPSDSA 276


101  PAKAF_02074  PAKAF_02079  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_02074   0      11        2.731318  probable short-chain dehydrogenase
PAKAF_02075   1      12        3.854250  probable transcriptional regulator
PAKAF_02076   1      14        2.890364  LysE family translocator
PAKAF_02077   0      14        2.532122  MBL fold metallo-hydrolase
PAKAF_02078   0      14        3.315390  probable permease of ABC transporter
PAKAF_02079  -2      15        3.196206  ABC transporter substrate-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_02074  DHBDHDRGNASE  69  4e-16  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 69.3 bits (169), Expect = 4e-16
Identities = 58/191 (30%), Positives = 92/191 (48%), Gaps = 6/191 (3%)

Query: 6 IKGKTVLITGGAKNLGGLIARDLAAHGAKAIAIHYNSAASKADADATVAALQAAGAKAVA 65
I+GK ITG A+ +G +AR LA+ GA A+ YN + V++L+A A A
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKL----EKVVSSLKAEARHAEA 61

Query: 66 LQGDLTSAAAMEKLFADAIAAVGKPDIAINTVGKVLKKPITEINETEYDEMSAVNSKSAF 125
D+ +AA++++ A +G DI +N G + I +++ E++ +VNS F
Sbjct: 62 FPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVF 121

Query: 126 FFLREAGKHVND--NGKICTLVTSLLGAYTPYYAAYAGTKAPVEHFTRAASKEFGARGIS 183
R K++ D +G I T+ ++ G AAYA +KA FT+ E I
Sbjct: 122 NASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIR 181

Query: 184 VTAVGPGPMDT 194
V PG +T
Sbjct: 182 CNIVSPGSTET 192


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_02075  PF05272  28  0.034  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.5 bits (63), Expect = 0.034
Identities = 6/16 (37%), Positives = 8/16 (50%)

Query: 258 FRRAYGMTPAAYRRQC 273
+R AYG + RQ
Sbjct: 672 YRGAYGRYVQDHPRQV 687


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_02077  PF05932  27  0.049  Tir chaperone protein (CesT)
		>PF05932#Tir chaperone protein (CesT)

Length = 127

Score = 27.1 bits (60), Expect = 0.049
Identities = 9/53 (16%), Positives = 18/53 (33%), Gaps = 8/53 (15%)

Query: 74 ADHLSAAIFLQRELGGCLAIGARITQVQAKFSGLFNLGEAFPVDGRQFEHLFE 126
L+ A+ G L + + SGL++ ++ P + L
Sbjct: 66 QCLLAGALNPLLNAGPGLGLDEK--------SGLYHAYQSIPREKLSVPTLKR 110


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_02079  FERRIBNDNGPP  40  8e-06  Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 39.9 bits (93), Expect = 8e-06
Identities = 49/266 (18%), Positives = 96/266 (36%), Gaps = 43/266 (16%)

Query: 43 PSRAVSHDINLTEMMVALGLQTRMVGYTGISGW--WKNADPGLIAALKPLPELV-----A 95
P+R V+ + E+++ALG+ G + W + +P PLP+ V
Sbjct: 35 PNRIVALEWLPVELLLALGIVP--YGVADTINYRLWVS-EP-------PLPDSVIDVGLR 84

Query: 96 RYPTAETLLDVDADFFFAGWGYGMRVGGDLTPASLEPLG-VKVYELSESCAQIGEPRRAS 154
P E L ++ F GYG +P L + + + S+ + R++
Sbjct: 85 TEPNLELLTEMKPSFMVWSAGYGP------SPEMLARIAPGRGFNFSDGKQPLAMARKS- 137

Query: 155 LDELYRDLRNLGRIFDVEPRAERLVASLQARIERARAGIPANTEAPRVF--LYDSGEDRP 212
L + + +++ AE +A + I + P + L D
Sbjct: 138 -------LTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLV 190

Query: 213 FTSGRLGMPQALIEAAGGRSVTDDVAASW--TQVNWESVVA-RDPQVIVIVDYGETSAAQ 269
F + Q +++ G + W T V+ + + A +D V+ D+ +
Sbjct: 191 FGPN--SLFQEILDEYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCF-DHDNSKDMD 247

Query: 270 KQRFLEENPALRSLTAIRERRFIVLP 295
L P +++ +R RF +P
Sbjct: 248 A---LMATPLWQAMPFVRAGRFQRVP 270


102  PAKAF_02094  PAKAF_02113  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_02094   0      12        1.591457  probable outer membrane protein precursor
PAKAF_02095   2      12        1.461070  probable transcriptional regulator
PAKAF_02096   1      10        1.090725  SbrI
PAKAF_02097   0       9        1.130127  SbrR
PAKAF_02098   1       9        2.374310  probable sigma-70 factor, ECF subfamily
PAKAF_02099   2      10        2.834679  anti-sigma factor SbrR
PAKAF_02100  -1      13        1.572918  hypothetical protein
PAKAF_02101  -2      15        1.696546  putative very-long chain acyl-CoA synthetase
PAKAF_02102  -1      16        2.157952  putative short chain dehydrogenase involved in
PAKAF_02103  -1      14        2.138028  geranyl-CoA carboxylase, alpha-subunit
PAKAF_02104  -1      13        1.404239  putative isohexenylglutaconyl-CoA hydratase
PAKAF_02105  -1      13        0.767033  putative citronellyl-CoA dehydrogenase involved
PAKAF_02106  -1      11        1.268114  geranyl-CoA carboxylase, beta-subunit
PAKAF_02107   0      10        1.214908  putative dehydrogenase involved in catabolism of
PAKAF_02108   2      11        1.429042  expressed protein with apparent function in
PAKAF_02109   3      13        0.571076  putative repressor of atu genes
PAKAF_02110   1      12        1.286683  CPBP family intramembrane metalloprotease
PAKAF_02111   1      12        1.405116  DUF2897 family protein
PAKAF_02112  -1      10        1.690233  probable two-component sensor
PAKAF_02113  -1      10        1.922038  probable two-component response regulator
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_02094  OMPADOMAIN  102  2e-27  OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 102 bits (255), Expect = 2e-27
Identities = 44/126 (34%), Positives = 63/126 (50%), Gaps = 11/126 (8%)

Query: 155 DVLFDFNRAELKPAANRTALKLVQFL-QLNPRRV-IRIEGYTDSVGDRQANLDLSRERAQ 212
DVLF+FN+A LKP +L L L+P+ + + GYTD +G N LS RAQ
Sbjct: 220 DVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQ 279

Query: 213 AVADVLADLGVDPARMQVVGYGEAFPVTDNASNRGR---------AQNRRVEIVFSNDKG 263
+V D L G+ ++ G GE+ PVT N + + A +RRVEI K
Sbjct: 280 SVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEVKGIKD 339

Query: 264 QLSAPR 269
++ P+
Sbjct: 340 VVTQPQ 345


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_02095  HTHFIS  50  1e-09  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 50.2 bits (120), Expect = 1e-09
Identities = 35/157 (22%), Positives = 61/157 (38%), Gaps = 11/157 (7%)

Query: 10 LVIADSFPVMQWALQRYLSEECGRQVLAVVGDSDSLVERLADLPPESILITELGLPGQRS 69
+++AD ++ L + LS G V + +A +++T++ +P
Sbjct: 6 ILVADDDAAIRTVLNQALSRA-GYDVRITSNAATLW-RWIAAGDG-DLVVTDVVMPD--- 59

Query: 70 RDGIHLVEWLTRHCPQMKVMVYSVFSAPLLAKAVLRSGASAYISKRSPLETLKAALECMA 129
+ L+ + + P + V+V S + + A GA Y+ K L L + A
Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG-RA 118

Query: 130 LGQTFLDPG-LHPQRHTGKPL---SPTEVDILRRLAR 162
L + P L G PL S +I R LAR
Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLAR 155


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_02099  IGASERPTASE  36  1e-04  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 36.2 bits (83), Expect = 1e-04
Identities = 23/120 (19%), Positives = 41/120 (34%), Gaps = 8/120 (6%)

Query: 112 AAAAKRAMRAPAAPAPLSSEMSEP--PALLASYASSGEAPQLMAEAAPAAPAALADRPPA 169
+ + P + + P P+ A EAP + APA P+ +
Sbjct: 985 VEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAP--VPPPAPATPSETTETVAE 1042

Query: 170 QAAQQAK---VQAALAGDFVAQARGKAVAVKPEVLDEALGAVLALREQGKTEQAATQLAE 226
+ Q++K A + AQ R A K V +A + +T++ T +
Sbjct: 1043 NSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVA-QSGSETKETQTTETK 1101


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_02100  IGASERPTASE  42  1e-06  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 41.6 bits (97), Expect = 1e-06
Identities = 13/62 (20%), Positives = 21/62 (33%)

Query: 25 KAPEKKAPEAPPQEQRAPAKPVKPARAEPAPAVPAAPKTASKKVAPAAEQVAEPKPPAKP 84
+ P+ + +P QEQ +P E P V + EQ A+
Sbjct: 1121 EVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVE 1180

Query: 85 KP 86
+P
Sbjct: 1181 QP 1182



Score = 40.0 bits (93), Expect = 4e-06
Identities = 24/135 (17%), Positives = 37/135 (27%), Gaps = 16/135 (11%)

Query: 25 KAPEKKAPEAPPQEQRAPAKPVKPARAEPAPAVPAAPKTASKKVAPAAEQVAEPKPPAKP 84
+ + E E+ AK E PK S+ V+P EQ +P A+P
Sbjct: 1094 ETQTTETKETATVEKEEKAKVETEKTQE-------VPKVTSQ-VSPKQEQSETVQPQAEP 1145

Query: 85 KPAAAPPKPASRPVAKDKPAPAKRASTARLDPEVRKPLPSAKLDLRLPK-------ELVQ 137
P P ++ V +P+ +
Sbjct: 1146 ARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPA 1205

Query: 138 KMAPPGTEETH-KPK 151
P E+ KPK
Sbjct: 1206 TTQPTVNSESSNKPK 1220



Score = 38.1 bits (88), Expect = 1e-05
Identities = 26/153 (16%), Positives = 47/153 (30%), Gaps = 16/153 (10%)

Query: 24 SKAPEKKAPEAP--PQEQRAPAKPVKPARAEPAPAVPAA--------PKTASKKVAPAAE 73
P + P P A+ + PAPA P+ K SK V +
Sbjct: 997 ITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQ 1056

Query: 74 QVAEPKPPAKPKPAAAPPK----PASRPVAKDKPAPAKRASTARLDPEVRKPLPSAKLDL 129
E + A + VA+ + +T + + AK++
Sbjct: 1057 DATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVET 1116

Query: 130 RLPKELVQKMA--PPGTEETHKPKPLLPPMFEE 160
+E+ + + P E++ +P P E
Sbjct: 1117 EKTQEVPKVTSQVSPKQEQSETVQPQAEPAREN 1149



Score = 32.3 bits (73), Expect = 0.001
Identities = 13/84 (15%), Positives = 23/84 (27%), Gaps = 4/84 (4%)

Query: 24 SKAPEKKAPEAPPQEQRAP--AKPVKPARAEPAPAVPAAPKTASKKVAPAAEQVAEP--K 79
+ E+KA + Q P V P + + P A ++
Sbjct: 1105 TVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNT 1164

Query: 80 PPAKPKPAAAPPKPASRPVAKDKP 103
+PA +PV +
Sbjct: 1165 TADTEQPAKETSSNVEQPVTESTT 1188


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_02102  DHBDHDRGNASE  81  3e-20  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 81.3 bits (200), Expect = 3e-20
Identities = 54/191 (28%), Positives = 85/191 (44%), Gaps = 9/191 (4%)

Query: 3 LHGKTLFITGASRGIGREIALRAARDGANLVIAAKSAEPHPKLEGTIFSVAAEVEAAGGQ 62
+ GK FITGA++GIG +A A GA++ + E K+ ++ + A EA
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEA---- 61

Query: 63 ALPLQLDVRDEQAVAAAMARAAERFGGIDALVNNAGAIRLVGVEKLEPKRFDLMYQINTR 122
DVRD A+ AR G ID LVN AG +R + L + ++ + +N+
Sbjct: 62 ---FPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNST 118

Query: 123 AVLVCSQAALPYLRRSANGHILSLSPPINLAGRWFAQHGPYTVTKYGMSMLTLGMHEEFG 182
V S++ Y+ +G I+++ N AG Y +K M T + E
Sbjct: 119 GVFNASRSVSKYMMDRRSGSIVTVGS--NPAGVPRTSMAAYASSKAAAVMFTKCLGLELA 176

Query: 183 KYAISVNALWP 193
+Y I N + P
Sbjct: 177 EYNIRCNIVSP 187


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_02103  RTXTOXIND  38  2e-04  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 37.5 bits (87), Expect = 2e-04
Identities = 25/85 (29%), Positives = 42/85 (49%), Gaps = 6/85 (7%)

Query: 579 AGASAQVGASSGTLK-APMDGAIV-EVLVGEGERVGKGQLLLVLEAMKMEHPLKAGVDGV 636
A A+ ++ S + + P++ +IV E++V EGE V KG +LL L A+ E A
Sbjct: 84 ATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAE----ADTLKT 139

Query: 637 VRRVQVGRGEQVRNRQVLVEVEADA 661
+ R EQ R + + +E +
Sbjct: 140 QSSLLQARLEQTRYQILSRSIELNK 164


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_02104  adhesinmafb  28  0.037  Neisseria meningitidis: adhesin MafB signature.
		>adhesinmafb#Neisseria meningitidis: adhesin MafB signature.

Length = 467

Score = 28.1 bits (62), Expect = 0.037
Identities = 28/132 (21%), Positives = 46/132 (34%), Gaps = 3/132 (2%)

Query: 11 LEPIEGVLRITLNRPQSRNAMSLAMVGELRAVLAAVRDDRSVRALVLRGAGGHFCAGGDI 70
+E I GV LN S A +G++ D ++R + A G F G +
Sbjct: 225 MEFINGVAAGALNPFIS--AGEALGIGDILYGTRYAIDKAAMRNIAPLPAEGKFAVIGGL 282

Query: 71 KDMAGARAAGAEAYRTLNRAFGSLLEEAQAAPQLLVAL-VEGAVLGGGFGLACVSDVAIA 129
+AG EA + + E +A + A V G A VS
Sbjct: 283 GSVAGFEKNTREAVDRWIQENPNAAETVEAVFNVAAAAKVAKLAKAAKPGKAAVSGDFAD 342

Query: 130 AADAQFGLPETS 141
+ + L +++
Sbjct: 343 SYKKKLALSDSA 354


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_02107  DHBDHDRGNASE  119  3e-34  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 119 bits (299), Expect = 3e-34
Identities = 74/255 (29%), Positives = 121/255 (47%), Gaps = 10/255 (3%)

Query: 13 DGQTIIVTGGGSGIGRCTAHELAALGAHVVLVGRKAEKLEKTAGEIVEDGGSVSWHACDI 72
+G+ +TG GIG A LA+ GAH+ V EKLEK + + D+
Sbjct: 7 EGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADV 66

Query: 73 REEEAVKTLVANILAERGTIHHLVNNAGGQYPSPLASISQKGFETVLRTNLVGGFLVARE 132
R+ A+ + A I E G I LVN AG P + S+S + +E N G F +R
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 133 VFNQSMSKTGGSIVNMLADMWGGMP--GMGHSGAARSGMENFTRTAAVEWGHAGVRVNAV 190
V M + GSIV + ++ G+P M ++++ FT+ +E +R N V
Sbjct: 127 VSKYMMDRRSGSIVTVGSNP-AGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 191 APG-------WIASSGMDTYEGAFKAVIPTLREHVPLKRIGSESEVAAAIVFLLSPGAAF 243
+PG W + + E K + T + +PLK++ S++A A++FL+S A
Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245

Query: 244 VSGNTIRIDGAASQG 258
++ + + +DG A+ G
Sbjct: 246 ITMHNLCVDGGATLG 260


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_02109  HTHTETR  70  4e-17  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 70.0 bits (171), Expect = 4e-17
Identities = 32/190 (16%), Positives = 65/190 (34%), Gaps = 8/190 (4%)

Query: 14 ESARGKLLQTAAHLFRSKGYERTTVRDLASAVGIQSGSIFHHFKSKDEILRSVMEETILY 73
+ R +L A LF +G T++ ++A A G+ G+I+ HFK K ++ + E +
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 74 NTALMRAALAD-AEDLRERVLGLIRCELQSIMGGTGEAMAVLVYEWRSLSAEGQAYILGL 132
L A D + ++ L+S + + + + + A +
Sbjct: 70 IGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQA 129

Query: 133 RDIYEQMWLD----VLGEARLAGYCQG--DPFILRRFLTGALSWT-TTWFRPEGPMSLDQ 185
+ D L A + G +S W L +
Sbjct: 130 QRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKK 189

Query: 186 LAEEALALVI 195
A + +A+++
Sbjct: 190 EARDYVAILL 199


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_02112  PF06580  45  2e-07  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 45.2 bits (107), Expect = 2e-07
Identities = 35/172 (20%), Positives = 72/172 (41%), Gaps = 24/172 (13%)

Query: 198 QIGELVSGLKDFAR--LDRAFSEEVDLND---CVRNAVLIARTAIKDKAEISSQLGELPL 252
+ E+++ L + R L + + +V L D V + + +A +D+ + +Q+ +
Sbjct: 192 KAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIM 251

Query: 253 IACAPSQINQVLL-NLLTNAAQAMERFGRILLKSWADERQVFLSVQDNGKGMPAEVLGRI 311
P + Q L+ N + + + + G+ILLK D V L V++ G
Sbjct: 252 DVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLA-------- 303

Query: 312 FDPFFTTKPVGQGTGLGLSISYKIIQQHGG---TIRVASEPGRGTRFLISLP 360
K + TG GL + +Q G I+++ + G+ ++ +P
Sbjct: 304 ------LKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVN-AMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_02113  HTHFIS  98  5e-25  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 97.6 bits (243), Expect = 5e-25
Identities = 29/136 (21%), Positives = 57/136 (41%), Gaps = 2/136 (1%)

Query: 7 RILFVDDEERILRSLAMQF-RRHYEVLTESDPRRALERLKTERIQVLVSDQRMPQMSGAE 65
IL DD+ I L R Y+V S+ + ++V+D MP + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 66 LLAQARERYPETLRILLTGYSDLDAAVDALNDGGIFRYLTKPWNPQEMAFTLRQAAEIAS 125
LL + ++ P+ ++++ + A+ A + G + YL KP++ E+ + +A
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKAS-EKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 126 RQGLPAPPAATLAAPL 141
R+ + PL
Sbjct: 124 RRPSKLEDDSQDGMPL 139



Score = 54.8 bits (132), Expect = 1e-10
Identities = 27/139 (19%), Positives = 55/139 (39%), Gaps = 5/139 (3%)

Query: 142 SVLLLDDDPETLDCVGAFCHAGGHRLLRARNLAEALVWLNTEPVEVLVSDLKLAGEHTAP 201
++L+ DDD + G+ + N A W+ +++V+D+ + E+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 202 LLKSLAQAHPRLLSLVVTPFRDTQALLELINQAQIFRYLPKPIRRGLFEKGLKAAAEQAL 261
LL + +A P L LV++ ++ + + YLPKP L +AL
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKAS-EKGAYDYLPKP----FDLTELIGIIGRAL 119

Query: 262 LWRGRSLPEVDRLAEVPRD 280
R +++ ++
Sbjct: 120 AEPKRRPSKLEDDSQDGMP 138


103  PAKAF_02413  PAKAF_02417  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_02413   0       7        1.255442  two-component sensor PfeS
PAKAF_02414   0       6        1.019314  two-component response regulator PfeR
PAKAF_02415   0       7        1.491353  VgrG1c
PAKAF_02416   1       6        2.906542  Tse5
PAKAF_02417  -1       9        2.643635  Tsi5
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_02413  PF06580  40  2e-05  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 39.8 bits (93), Expect = 2e-05
Identities = 19/134 (14%), Positives = 46/134 (34%), Gaps = 31/134 (23%)

Query: 324 ARLPCRLGVDCRVEVHLDSLAQAM----------ENLLRNAIRHSPEDGTVSLDGEREGD 373
+ RL + ++ ++ EN +++ I P+ G + L G ++
Sbjct: 234 IQFEDRLQFENQIN---PAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNG 290

Query: 374 FWHLRLQDQGPGVAEDQLERIFLPYQRLDDSAGEGFGLGLAIARRAIELQGG---RLWAS 430
L +++ G ++ E G GL R +++ G ++ S
Sbjct: 291 TVTLEVENTGSLALKNT---------------KESTGTGLQNVRERLQMLYGTEAQIKLS 335

Query: 431 NGKPGLCLHLWLPA 444
+ + + +P
Sbjct: 336 EKQGKVNAMVLIPG 349


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_02414  HTHFIS  95  4e-24  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 95.3 bits (237), Expect = 4e-24
Identities = 40/149 (26%), Positives = 66/149 (44%), Gaps = 4/149 (2%)

Query: 78 PRLLLVEDDPRLREDLDAHFRRRGFRVTVCGDGSHGLEAAGREAFDLVLLDIMLPGLDGL 137
+L+ +DD +R L+ R G+ V + + + DLV+ D+++P +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 138 ALLESLRREQA-TPVMLMSALGAEQDRISGFTRGADDYLPKPFSLAELDARTDALL--RR 194
LL +++ + PV++MSA I +GA DYLPKPF L EL L +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 195 VRLDRLPSAQRRDTRLVFDDQA-QDVLHQ 222
R +L + LV A Q++
Sbjct: 124 RRPSKLEDDSQDGMPLVGRSAAMQEIYRV 152


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_02415  ICENUCLEATIN  31  0.018  Ice nucleation protein signature.
		>ICENUCLEATIN#Ice nucleation protein signature.

Length = 1258

Score = 31.3 bits (70), Expect = 0.018
Identities = 25/120 (20%), Positives = 44/120 (36%)

Query: 550 NDESHWVGHDRTKTIDHDETVHVKHDRTETVDNNETITVHANRSKTVDRNETVRIGMNKT 609
+D S G+ T+T D ++ + T+T +T + T + ++ G T
Sbjct: 252 DDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGST 311

Query: 610 ETILMASLQNVGMGRMENVGLGYSLNVGMMMNTVVGLNQSTQVMKKKTLSVGDSYEVSVG 669
+T S Q G G + G L G G + S T + G+ ++ G
Sbjct: 312 QTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAG 371



Score = 30.1 bits (67), Expect = 0.039
Identities = 24/137 (17%), Positives = 47/137 (34%), Gaps = 2/137 (1%)

Query: 551 DESHWVGHDRTKTIDHDETVHVKHDRTETVDNNETITVHANRSKTVDRNETVRIGMNKTE 610
D S G+ T+T + ++ + T+T +T + T + ++ G T+
Sbjct: 733 DSSLIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQ 792

Query: 611 TILMASLQNVGMGRMENVGLGYSLNVGMMMNTVVGLNQSTQVMKKKTLSVGDSYEVSVGG 670
T S+ G G + L G + G + S T + G Y +
Sbjct: 793 TAGYHSILTAGYGSTQTAQERSDLTTGYGSTSTAGADSSLIAGYGSTQTAG--YNSILTA 850

Query: 671 SDDGSKITLDGQSITLG 687
++ + +T G
Sbjct: 851 GYGSTQTAQENSDLTTG 867


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_02417  PREPILNPTASE  25  0.030  Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 25.1 bits (55), Expect = 0.030
Identities = 18/54 (33%), Positives = 24/54 (44%), Gaps = 5/54 (9%)

Query: 5 GRRRGVPAAMIKHYLLMTLVCIPLALLYVCLEWFFGNTWVTVG--VFFGVLVVL 56
GR RG A + Y L+ L+ ALL V + W T+ + VLV L
Sbjct: 97 GRCRGCQAPISARYPLVELLT---ALLSVAVAMTLAPGWGTLAALLLTWVLVAL 147


104  PAKAF_02423  PAKAF_02434  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_02423  -1      30       -3.119170  probable permease of ABC-2 transporter
PAKAF_02424   0      31       -2.098433  probable type II secretion protein
PAKAF_02425   1      36       -1.547822  probable type II secretion system protein
PAKAF_02426   1      42       -1.766994  probable type II secretion system protein
PAKAF_02427   2      43       -2.192386  probable type II secretion system protein
PAKAF_02428   2      43       -2.322453  probable type II secretion system protein
PAKAF_02429   1      39       -3.411090  probable type II secretion system protein
PAKAF_02430   1      38       -4.283237  hypothetical protein
PAKAF_02431   0      24       -2.119910  hypothetical protein
PAKAF_02432  -1      14       -1.466375  hypothetical protein
PAKAF_02433  -1      14       -0.778661  hypothetical protein
PAKAF_02434   1      11        0.227403  MvaU
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_02423  ABC2TRNSPORT  34  3e-04  ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 34.1 bits (78), Expect = 3e-04
Identities = 22/121 (18%), Positives = 42/121 (34%)

Query: 96 LTPLLAAFFNAMLGYLVLCIFLLFSGVEPGWQLVLLPLALLPFLLCVTGLAWFLAGLGVY 155
L + A A L + + G L+ + L L + L
Sbjct: 115 LGEMAWAATKAALAGAGIGVVAAALGYTQWLSLLYALPVIALTGLAFASLGMVVTALAPS 174

Query: 156 VRDIGQFVQFLLVLLLFISPVFYPLSSLPPVMQPYLYLNPLTIPVEMVRAILFDAPYPTL 215
+ ++ +LF+S +P+ LP V Q PL+ ++++R I+ P +
Sbjct: 175 YDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQTAARFLPLSHSIDLIRPIMLGHPVVDV 234

Query: 216 G 216

Sbjct: 235 C 235


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_02425  BCTERIALGSPF  185  9e-57  Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 185 bits (471), Expect = 9e-57
Identities = 104/404 (25%), Positives = 187/404 (46%), Gaps = 11/404 (2%)

Query: 2 NFIYQAVDRKGRRVRGELCLPTRQDALRQLQRQGLTPLSLEVKR----------RNLGSR 51
+ YQA+D +G++ RG + + A + L+ +GL PLS++ R +L +
Sbjct: 3 QYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLRRK 62

Query: 52 RRLKAEELNMAIHELATMLAAGVSMADAVEAQERGARHPKLITALQAMANGLRQGQSFPV 111
RL +L + +LAT++AA + + +A++A + + P L + A+ + + +G S
Sbjct: 63 IRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLAD 122

Query: 112 VLESAGLDLPRYVYQLVAAGEMTGNLAGALRDCATQMEYERRTRAELQGALIYPAILVLS 171
++ R +VAAGE +G+L L A E ++ R+ +Q A+IYP +L +
Sbjct: 123 AMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLTVV 182

Query: 172 GVLAVATLFVFVVPKFANLLNET-AQLPWLAWAVLSIGVWSNESSGLLAFAVLLLAGGIA 230
+ V+ L VVPK LP ++ + + A+L
Sbjct: 183 AIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMAFR 242

Query: 231 VALRNPALRAHALDQLVRLPVVGEWLMQAEIAQWSKVLGTLLGNRVPLVEALLLSAAGVR 290
V LR R +L+ LP++G A++++ L L + VPL++A+ +S +
Sbjct: 243 VMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDVMS 302

Query: 291 IARQRRTLERVTQDVRAGIALSAALEERQAVTSIGSSLVRVGEASGQLAEMLQSLATLYG 350
R L T VR G++L ALE+ + ++ GE SG+L ML+ A
Sbjct: 303 NDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADNQD 362

Query: 351 EAGQARMKKALVLIEPLAILLIGSVFGLIITGVVLAITSANDMV 394
++M AL L EPL ++ + +V I+ ++ I N ++
Sbjct: 363 REFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_02426  BCTERIALGSPG  118  3e-37  Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 118 bits (297), Expect = 3e-37
Identities = 47/128 (36%), Positives = 67/128 (52%), Gaps = 1/128 (0%)

Query: 6 QQGFTLLEMIVVLVIIGMLMGLVGPRLFNQADKAKAQTADTQVKMLKGALLTMRLDIGRL 65
Q+GFTLLE++VV+VIIG+L LV P L +KA Q A + + L+ AL +LD
Sbjct: 7 QRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDNHHY 66

Query: 66 PTEEEGLALLNTPPSDERLGAFWHGPYLEGGVPLDPWNRPYLYSDRPSAEQPFTLYSQGA 125
PT +GL L P+ L A ++ +P DPW Y+ + P + L S G
Sbjct: 67 PTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVN-PGEHGAYDLLSAGP 125

Query: 126 DGQPGGKG 133
DG+ G +
Sbjct: 126 DGEMGTED 133


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_02427  BCTERIALGSPG  39  1e-06  Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 38.7 bits (90), Expect = 1e-06
Identities = 17/43 (39%), Positives = 31/43 (72%)

Query: 12 QAAFTLLELLVVLVIVGAIAAVALPGLVRMQETWARRTALDDL 54
Q FTLLE++VV+VI+G +A++ +P L+ +E ++ A+ D+
Sbjct: 7 QRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDI 49


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_02428  PilS_PF08805  30  0.003  PilS N terminal
		>PilS_PF08805#PilS N terminal

Length = 185

Score = 29.5 bits (66), Expect = 0.003
Identities = 5/31 (16%), Positives = 16/31 (51%)

Query: 7 GFTLLEAVVALTLLAVVGGALFAWLNSAFRS 37
G TL+E ++ + ++ V+ + + + +
Sbjct: 27 GATLMEVLLVVGVIVVLAASAYKLYSMVQSN 57


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_02429  BCTERIALGSPH  30  0.004  Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 29.9 bits (67), Expect = 0.004
Identities = 14/48 (29%), Positives = 30/48 (62%), Gaps = 2/48 (4%)

Query: 7 KQGAFTLLEMIVVLLVVSFIGTLLMQGLSYASKANQSLHQSLGRGQVR 54
+Q FTLLEM+++LL++ +++ L++ + + S Q+L R + +
Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVL--LAFPASRDDSAAQTLARFEAQ 47


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_02434  PYOCINKILLER  29  0.005  Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 28.6 bits (63), Expect = 0.005
Identities = 24/84 (28%), Positives = 33/84 (39%), Gaps = 10/84 (11%)

Query: 21 LEKLKSDSSLKQELEFKDKLQALMDKYGMTLHNIIAILDPKAPVTVSAAPQRRA------ 74
+E L + ++K E LQ M+ +I A KA +A +R+A
Sbjct: 181 MEGLTAAYNVKLFTEAISSLQIRMNTLTAAKASIEAAAANKAREQAAAEAKRKAEEQARQ 240

Query: 75 ----RALKVYKNPNNGEVVETKGG 94
RA Y P NG VV T G
Sbjct: 241 QAAIRAANTYAMPANGSVVATAAG 264


105  PAKAF_02443  PAKAF_02448  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_02443   0      10        1.191005  hypothetical protein
PAKAF_02444   0       8        0.801313  hypothetical protein
PAKAF_02445   0       9        0.828808  two-component response regulator BqsR
PAKAF_02446   1       8        0.299657  two-component sensor BqsS
PAKAF_02447   0       8        0.470719  hypothetical protein
PAKAF_02448   0       9        0.456837  probable chemotaxis transducer
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PAKAF_02443  THERMOLYSIN  27  0.010  Thermolysin metalloprotease (M4) family signature.
		>THERMOLYSIN#Thermolysin metalloprotease (M4) family signature.

Length = 544

Score = 27.3 bits (60), Expect = 0.010
Identities = 21/83 (25%), Positives = 32/83 (38%), Gaps = 6/83 (7%)

Query: 21 QARDLGPDEALKLRDAGTIKSFEELNKNAIAKHPGSSVHDTELE----EEYGRYIYQVEL 76
R L + A+ ++ A I + ++ + T L EE R Y+V +
Sbjct: 128 DKRTLKTEAAISIQQAEMIAKQDVADRVTKERPAAEEGKPTRLVIYPDEETPRLAYEVNV 187

Query: 77 R--DPQGVKWDLELDAATGAVLK 97
R P W +DAA G VL
Sbjct: 188 RFLTPVPGNWIYMIDAADGKVLN 210


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_02445  HTHFIS        78     9e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 77.6 bits (191), Expect = 9e-19
Identities = 31/117 (26%), Positives = 54/117 (46%)

Query: 2 RLLLVEDHVPLADELMASLTRQGYAVDWLADGRDAAVQGASEPYDLIILDLGLPGRPGLE 61
+L+ +D + L +L+R GY V ++ A+ DL++ D+ +P +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 ILQEWRGLGLATPVLILTARGSWAERIDGLKAGADDYLTKPFHPEELALRIQALLRR 118
+L + PVL+++A+ ++ I + GA DYL KPF EL I L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_02446  PF06580       38     5e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 38.3 bits (89), Expect = 5e-05
Identities = 44/263 (16%), Positives = 87/263 (33%), Gaps = 71/263 (26%)

Query: 187 RLQIAQLQQGQRSQLDNQAPEELEPLVEQIN-HLLAHTEETLKRSRNALGNLGHALKTPL 245
+ A++ Q + + + +A +L L QIN H + + + L + A +
Sbjct: 143 NYKQAEIDQWKMASMAQEA--QLMALKAQINPHFMFNALNNI--RALILEDPTKARE--- 195

Query: 246 AVLVSLAE--REEMARQPELQQVLREQLEQIQQRLGRELGKARLVGEALPGAHFDCAEEL 303
+L SL+E R + Q L ++L + L +L +
Sbjct: 196 -MLTSLSELMRYSLRYSNARQVSLADELTVVDSYL--QLASIQ----------------- 235

Query: 304 PSLCDTLRLIHGPHLQVSWSAPPGL---RLPWDREDLLEMLGNLLDNACKWA------DS 354
LQ P + ++P +L + L++N K
Sbjct: 236 ----------FEDRLQFENQINPAIMDVQVP----PML--VQTLVENGIKHGIAQLPQGG 279

Query: 355 EVRLTVAQGEGMVRLKVDDDGPGILPDQRQAVLERGTRLDEQVSGHGLGLGIARD-IAEA 413
++ L + G V L+V++ G L + ++ G GL R+ +
Sbjct: 280 KILLKGTKDNGTVTLEVENTGSLALKNTKE--------------STGTGLQNVRERLQML 325

Query: 414 CGGRLSLE-DSPLGGLRVSVELP 435
G ++ G + V +P
Sbjct: 326 YGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_02448  FLAGELLIN     30     0.041    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 30.0 bits (67), Expect = 0.041
Identities = 17/98 (17%), Positives = 39/98 (39%), Gaps = 3/98 (3%)

Query: 438 DVKVSVRDARSTADQSAAISSQTSAGMQQQFREIDQVATASHEMTATAQDVARSAAQAAD 497
K+S +A + + I+ + + +A + + TA V+ + A
Sbjct: 352 SAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAA 411

Query: 498 AARGADQATRDGLALIDRTTQSIDSLAANLTSAMGQVE 535
AA+ + LA ID +D++ ++L + + +
Sbjct: 412 AAKKSTANP---LASIDSALSKVDAVRSSLGAIQNRFD 446


106  PAKAF_02583  PAKAF_02592  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_02583  2       16       1.395669   MuxA
PAKAF_02584  2       16       1.213837   MuxB
PAKAF_02585  1       13       1.796692   MuxC
PAKAF_02586  1       12       2.687945   OpmB
PAKAF_02587  0       12       1.869446   hypothetical protein
PAKAF_02588  0       12       1.794997   CzcS
PAKAF_02589  -1      12       1.425250   CzcR
PAKAF_02590  0       12       1.428965   outer membrane protein precursor CzcC
PAKAF_02591  -1      13       1.190648   Resistance-Nodulation-Cell Division (RND)
PAKAF_02592  0       13       0.980960   Resistance-Nodulation-Cell Division (RND)
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_02583  RTXTOXIND     44     8e-07    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 44.0 bits (104), Expect = 8e-07
Identities = 31/172 (18%), Positives = 65/172 (37%), Gaps = 16/172 (9%)

Query: 124 TYKAALAQAEGTLMQNQAQLKNAEIDLQRYKGLYAEDSIAKQTLDTQEAQVRQLQGTIRT 183
L + L Q ++++ +A+ + Q L+ + + ++RQ I
Sbjct: 263 EAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILD---------KLRQTTDNIGL 313

Query: 184 NQGQVDDARLNLTFTEVRAPISGR-LGLRQVDIGNLVTSGDTTPLVVITQVKPISVVFSL 242
++ + +RAP+S + L+ G +VT+ +T +V++ + + V +
Sbjct: 314 LTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETL-MVIVPEDDTLEVTALV 372

Query: 243 PQQQIGTVVEQMNGPGKLAVTALDRNQDKVLAEGTLT--TLDNQIDTTTGTV 292
+ IG + + V A + L G + LD D G V
Sbjct: 373 QNKDIGFINVGQ--NAIIKVEAFPYTRYGYL-VGKVKNINLDAIEDQRLGLV 421



Score = 41.4 bits (97), Expect = 6e-06
Identities = 26/125 (20%), Positives = 49/125 (39%), Gaps = 8/125 (6%)

Query: 80 ALGTVTAF-NTVNVKPRVNGELVKVLFQEGQEVKAGDLLAVVDPRTYKAALAQAEGTLMQ 138
A G +T + +KP N + +++ +EG+ V+ GD+L + +A + + +L+Q
Sbjct: 86 ANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQ 145

Query: 139 NQAQL--KNAEIDLQRYKGLYAEDSIAKQTLDT-QEAQVRQLQGTIR----TNQGQVDDA 191
+ + L + E +V +L I+ T Q Q
Sbjct: 146 ARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQK 205

Query: 192 RLNLT 196
LNL
Sbjct: 206 ELNLD 210


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_02584  ACRIFLAVINRP  840    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 840 bits (2171), Expect = 0.0
Identities = 301/1036 (29%), Positives = 514/1036 (49%), Gaps = 29/1036 (2%)

Query: 4 SRPFILRPVATTLLMVAILLSGLIAYRFLPISALPEVDYPTIQVVTLYPGASPEIMTSSI 63
+ FI RP+ +L + ++++G +A LP++ P + P + V YPGA + + ++
Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61

Query: 64 TAPLENQLGQIPGLNEMSSSS-SGGASVITLQFSLQSNLDVAEQEVQAAINAAQSLLPND 122
T +E + I L MSS+S S G+ ITL F ++ D+A+ +VQ + A LLP +
Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121

Query: 123 LPNQPVFSKVNPADAPILTLAVMSDG--MPLPQIQDLVDTRLAQKISQISGVGLVSISGG 180
+ Q + S + + ++ +SD I D V + + +S+++GVG V + G
Sbjct: 122 VQQQGI-SVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 QRPAVRVRANPTALAAAGLSLEDLRSTVTSNNLNGPKGSFDGPTRAS------TLDANDQ 234
Q A+R+ + L L+ D+ + + N G G ++ A +
Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 235 LRSADAYRDLII-AYKNGSPLRIRDVASVEDDAENVRLAAWANNLPAVVLNIQRQPGANV 293
++ + + + + +GS +R++DVA VE EN + A N PA L I+ GAN
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 294 IEVVDRIKALLPQLQSTLPGNLDVQVLTDRTTTIRASVKDVQFELALAVALVVMVTFLFL 353
++ IKA L +LQ P + V D T ++ S+ +V L A+ LV +V +LFL
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 354 RNVYATLIPSFAVPLSLIGTFGVMYLSGFSINNLTLMALTIATGFVVDDAIVMVENIARY 413
+N+ ATLIP+ AVP+ L+GTF ++ G+SIN LT+ + +A G +VDDAIV+VEN+ R
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 414 L-EQGDSPLEAALKGSKQIGFTIISLTFSLIAVLIPLLFMGDVAGRLFREFAITLAVAIL 472
+ E P EA K QI ++ + L AV IP+ F G G ++R+F+IT+ A+
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 473 ISGFVSLTLTPMLSAKLLRHIDEDQQ---GRFARAAGRVIDGLIAQYAKALRVVLRHQPL 529
+S V+L LTP L A LL+ + + G F D + Y ++ +L
Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 530 TLLVAIATLALTALLYLAMPKGFFPVQDTGVIQGVAEAPQSISFQAMSERQRALAEVVLK 589
LL+ +A +L+L +P F P +D GV + + P + + + + + LK
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 590 DPA--VASLSSYIGVDGSNPTLNTGRLLINLKPHSERDV---TASEVIQRLQPELDHLPG 644
+ V S+ + G S N G ++LKP ER+ +A VI R + EL +
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659

Query: 645 IKLYMQPVQDLTIEDRVARTQYQFTLQD---ADPDVLAEWVPKLVARLQELP-QLADVAS 700
+ P I + T + F L D D L + +L+ + P L V
Sbjct: 660 GFVI--PFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRP 717

Query: 701 DWQDKGLQAYLNIDRDTASRLGVKLSDIDSVLYNAFGQRLISTIFTQATQYRVVLEVAPQ 760
+ + Q L +D++ A LGV LSDI+ + A G ++ + ++ ++ +
Sbjct: 718 NGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAK 777

Query: 761 FQLGPQALEQLYVPSSDGTQVRLSSLAKVEERHTLLAINHIAQFPSATLSFNLAKGYSLG 820
F++ P+ +++LYV S++G V S+ + + PS + A G S G
Sbjct: 778 FRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSG 837

Query: 821 EAVEAIRGVEASLELPLSMQGSFRGAALAFEASLSNTLLLILASVVTMYIVLGILYESFI 880
+A+ + + + +LP + + G + S + L+ S V +++ L LYES+
Sbjct: 838 DAMALMENLAS--KLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWS 895

Query: 881 HPVTILSTLPSAGVGALLALMLAGQEIGIVAIIGIILLIGIVKKNAIMMIDFALDAERNE 940
PV+++ +P VG LLA L Q+ + ++G++ IG+ KNAI++++FA D E
Sbjct: 896 IPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKE 955

Query: 941 GKPPHEAIYQACLLRFRPILMTTMAALLGALPLMLAGGAGAELRQPLGITMVGGLLLSQV 1000
GK EA A +R RPILMT++A +LG LPL ++ GAG+ + +GI ++GG++ + +
Sbjct: 956 GKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATL 1015

Query: 1001 LTLFTTPVIYLYFDRL 1016
L +F PV ++ R
Sbjct: 1016 LAIFFVPVFFVVIRRC 1031


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_02585  ACRIFLAVINRP  816    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 816 bits (2109), Expect = 0.0
Identities = 290/1034 (28%), Positives = 512/1034 (49%), Gaps = 31/1034 (2%)

Query: 7 FIRRPVATTLLTLALLLAGTLSFGLLPVAPLPNVDFPAIVVSASLPGASPETMASSVATP 66
FIRRP+ +L + L++AG L+ LPVA P + PA+ VSA+ PGA +T+ +V
Sbjct: 5 FIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQV 64

Query: 67 LERSLGRIAGISEMTSSS-SLGSTTVVLVFDLEKDIDGAAREVQAAINGAMSLLPSGMPN 125
+E+++ I + M+S+S S GS T+ L F D D A +VQ + A LLP +
Sbjct: 65 IEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQ 124

Query: 126 NPSYRKANPSDMPIMVLTLTSET--QSRGEMYDLASTVLAPKLSQVQGVGQVSIGGSSLP 183
S +MV S+ ++ ++ D ++ + LS++ GVG V + G+
Sbjct: 125 -QGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY- 182

Query: 184 AVRVDLNPDAMSQYGLSLDSVRTAIAAANSNGPKG------AVEKDDKHWQVDANDQLRK 237
A+R+ L+ D +++Y L+ V + N G A+ + + A + +
Sbjct: 183 AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKN 242

Query: 238 AREYEPLVIHYNADNGAAVRLGDVAKVSDSVEDVRNAGFSDDLPAVLLIVTRQPGANIIE 297
E+ + + N+D G+ VRL DVA+V E+ + PA L + GAN ++
Sbjct: 243 PEEFGKVTLRVNSD-GSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301

Query: 298 ATDAIHAQLPVLQELLGPQVKLNVMDDRSPSIRASLEEAELTLLISVALVILVVFLFLRN 357
AI A+L LQ +K+ D +P ++ S+ E TL ++ LV LV++LFL+N
Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361

Query: 358 GRATLIPSLAVPVSLIGTFAVMYLCDFSLNNLSLMALIIATGFVVDDAIVVVENIARRI- 416
RATLIP++AVPV L+GTFA++ +S+N L++ +++A G +VDDAIVVVEN+ R +
Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421

Query: 417 EEGDPPIQAAITGARQVGFTVLSMTLSLVAVFIPLLLMGGLTGRLFREFAVTLSAAILVS 476
E+ PP +A Q+ ++ + + L AVFIP+ GG TG ++R+F++T+ +A+ +S
Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481

Query: 477 LVVSLTLTPMLCARLLRPLKRPEG---ASLARRSDRFFAAFMLRYRASLGWALEHSRLMV 533
++V+L LTP LCA LL+P+ + F + Y S+G L + +
Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541

Query: 534 VIMLACIAMNLWLFVVVPKGFLPQQDSGRLRGYAVADQSISFQSLSAKMGEYRKILSSDP 593
+I +A + LF+ +P FLP++D G + + + + +
Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601

Query: 594 AVE-----NVVGFIGGGRWQSSNTGSFFVTLKPIGERDP----VEKVLTRLRERIAKVPG 644
V GF G Q+ N G FV+LKP ER+ E V+ R + + K+
Sbjct: 602 KANVESVFTVNGFSFSG--QAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659

Query: 645 AALYLNAGQDVRLGGRDSNAQYEFTLRS-DDLTLLREWAPKVEAAMRKLP-QLVDVNSDS 702
+ + G + +E ++ L + ++ + P LV V +
Sbjct: 660 GFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNG 719

Query: 703 QDKGVQTRLVIDRDRAATLGINVEMVDAVLNDSFGQRQVSTIFNPLNQYRVVMEVDQQYQ 762
+ Q +L +D+++A LG+++ ++ ++ + G V+ + ++ ++ D +++
Sbjct: 720 LEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFR 779

Query: 763 QSPEILRQVQVIGNDGQRVPLSAFSHYEPSRAPLEVNHQGQFAATTLSFNLAPGAQIGPT 822
PE + ++ V +G+ VP SAF+ + + + APG G
Sbjct: 780 MLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDA 839

Query: 823 REAIMQALEPLHIPVDVQTSFEGNAGAVQDTQNQMPWLILLALLAVYIVLGILYESYVHP 882
+ L P + + G + + + NQ P L+ ++ + V++ L LYES+ P
Sbjct: 840 MALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIP 897

Query: 883 LTILSTLPSAGVGALLALILCRSELSLIALIGIILLIGIVKKNAIMMIDFALEAERNHGL 942
++++ +P VG LLA L + + ++G++ IG+ KNAI++++FA + G
Sbjct: 898 VSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGK 957

Query: 943 SPREAILEACMMRFRPIMMTTLAALLGALPLIFGIGGDAALRRPLGITIVGGLIGSQLLT 1002
EA L A MR RPI+MT+LA +LG LPL G + + +GI ++GG++ + LL
Sbjct: 958 GVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLA 1017

Query: 1003 LYTTPVVYLYLDRL 1016
++ PV ++ + R
Sbjct: 1018 IFFVPVFFVVIRRC 1031



Score = 80.7 bits (199), Expect = 2e-17
Identities = 72/366 (19%), Positives = 135/366 (36%), Gaps = 15/366 (4%)

Query: 665 QYEFTLRSDDLTL--LREWAPK-VEAAMRKLPQLVDVNSDSQDKGVQTRLVIDRDRAATL 721
F + T + ++ V+ + +L + DV R+ +D D
Sbjct: 139 VAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY--AMRIWLDADLLNKY 196

Query: 722 GINVEMVDAVL---NDSFGQRQVSTIFNPLNQYRVVMEVDQQYQQSPEILRQVQVIGN-D 777
+ V L ND Q+ Q + Q ++PE +V + N D
Sbjct: 197 KLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNPEEFGKVTLRVNSD 256

Query: 778 GQRVPLSAFSHYEPSRAPLE--VNHQGQFAATTLSFNLAPGAQIGPTREAIMQALEPLH- 834
G V L + E G+ A L LA GA T +AI L L
Sbjct: 257 GSVVRLKDVARVELGGENYNVIARINGK-PAAGLGIKLATGANALDTAKAIKAKLAELQP 315

Query: 835 -IPVDVQ-TSFEGNAGAVQDTQNQMPWLILLALLAVYIVLGILYESYVHPLTILSTLPSA 892
P ++ VQ + +++ + A++ V++V+ + ++ L +P
Sbjct: 316 FFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVV 375

Query: 893 GVGALLALILCRSELSLIALIGIILLIGIVKKNAIMMIDFALEAERNHGLSPREAILEAC 952
+G L ++ + + G++L IG++ +AI++++ L P+EA ++
Sbjct: 376 LLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSM 435

Query: 953 MMRFRPIMMTTLAALLGALPLIFGIGGDAALRRPLGITIVGGLIGSQLLTLYTTPVVYLY 1012
++ + +P+ F G A+ R ITIV + S L+ L TP +
Sbjct: 436 SQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCAT 495

Query: 1013 LDRLRH 1018
L +
Sbjct: 496 LLKPVS 501


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_02586  RTXTOXIND     38     6e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 38.3 bits (89), Expect = 6e-05
Identities = 25/216 (11%), Positives = 62/216 (28%), Gaps = 30/216 (13%)

Query: 230 RADVAQARTQLKSTQAQAIDLKYQ--RAQLEHAIAVLVGLPPAQFNLPPVASVPKLPDLP 287
+ A TQ+ + + + R Q+ L LP + P ++
Sbjct: 126 KLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVL 185

Query: 288 AVVP----------SQLLERRPDIASAERKVISANAQIGVAKAAY------FPDLTLSAA 331
+ +Q ++ ++ + ++ A+I + D +
Sbjct: 186 RLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLH 245

Query: 332 GGYRSGSLSNWISTPNRFWSIGPQFAMTLFDGGLIGSQVDQAEATYDQTVATYRQTVLDG 391
+ + N++ + + I S++ A+ Y ++ +LD
Sbjct: 246 KQA--IAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDK 303

Query: 392 FREVEDYLVQLSVLDEESGVQREALESAREALRLAE 427
R+ D + L L E + +
Sbjct: 304 LRQTTDNIGLL----------TLELAKNEERQQASV 329



Score = 31.3 bits (71), Expect = 0.009
Identities = 18/150 (12%), Positives = 43/150 (28%), Gaps = 18/150 (12%)

Query: 172 ASAADLAAVRLSQQSQLAQNYLQLRVMDEQIRLLNDTVTAYERSLKVAENK-------YR 224
+ + + Q+Q Q L L + + + YE +V +++
Sbjct: 186 RLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLH 245

Query: 225 AGIVTRADVAQARTQLKSTQAQAIDLKYQRAQLEHAIAVLVGLPPAQFNLPPVASVPKLP 284
+ + V + + + K Q Q+E I A+ V
Sbjct: 246 KQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILS------AKEEYQLVTQ----- 294

Query: 285 DLPAVVPSQLLERRPDIASAERKVISANAQ 314
+ +L + +I ++ +
Sbjct: 295 LFKNEILDKLRQTTDNIGLLTLELAKNEER 324


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_02588  PF06580       36     3e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 35.6 bits (82), Expect = 3e-04
Identities = 25/105 (23%), Positives = 40/105 (38%), Gaps = 24/105 (22%)

Query: 370 LVSNAVRH----TPQGGRIDVRIGERAGHTEVRVSNDGPGIPPEYLPHLFERFYRRAGRQ 425
LV N ++H PQGG+I ++ + G + V N G
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN---------------- 306

Query: 426 TGAQAGTGLGLAIV-QSIMAYHGGRAEAE-SVPQQKTHLRLLFPS 468
+ TG GL V + + +G A+ + S Q K + +L P
Sbjct: 307 --TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIPG 349


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_02589  HTHFIS        81     7e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 80.6 bits (199), Expect = 7e-20
Identities = 34/145 (23%), Positives = 63/145 (43%), Gaps = 8/145 (5%)

Query: 2 RILIIEDEVKTADYLHQGLTESGYIVDRANDGIDGLHMALQHPYELVILDVNLPGIDGWD 61
IL+ +D+ L+Q L+ +GY V ++ +LV+ DV +P + +D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 LLRRLRER-SSARVMMLTGHGRLTDKVRGLDLGADDFMVKPFQFPELLARVRSLLRRHDQ 120
LL R+++ V++++ ++ + GA D++ KPF EL+ + L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE--- 121

Query: 121 APMQDVLRVADLELDASRHRAFRGR 145
R + LE D+ GR
Sbjct: 122 ----PKRRPSKLEDDSQDGMPLVGR 142


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_02591  RTXTOXIND     46     2e-07    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 46.0 bits (109), Expect = 2e-07
Identities = 40/212 (18%), Positives = 82/212 (38%), Gaps = 22/212 (10%)

Query: 216 ISSPQLSDQRSEFAAAQRRLSLAQSTYKREQQLWKEGISAEQEFLLARQGLQ-EAEIALN 274
I+ + +Q +++ A L + +S + +Q+ E +SA++E+ L Q + E L
Sbjct: 249 IAKHAVLEQENKYVEAVNELRVYKS---QLEQIESEILSAKEEYQLVTQLFKNEILDKLR 305

Query: 275 NARAKIAALGG--NPSLQGGNRYELRAPFAGVLVE-KHLTQGEPVDGTANVFTLS-DLSS 330
I L + + +RAP + + + K T+G V + + + +
Sbjct: 306 QTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDT 365

Query: 331 VWATFNVPAQLLGQVRVGSKVKVLAQALDS----EVEGTVSYIG-DLLGEQTRAATARVT 385
+ T V + +G + VG + +A + G V I D + +Q V
Sbjct: 366 LEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVI 425

Query: 386 LSNPEST---------WRPGLFVSVQVAEATR 408
+S E+ G+ V+ ++ R
Sbjct: 426 ISIEENCLSTGNKNIPLSSGMAVTAEIKTGMR 457



Score = 31.3 bits (71), Expect = 0.010
Identities = 19/119 (15%), Positives = 42/119 (35%), Gaps = 13/119 (10%)

Query: 168 LAQVVSLPGEIRFNEDRTAHIVPRLPGIVDSVPANLGQAVKQGELLAVISSPQLSDQRSE 227
+ V + G++ + I P IV + G++V++G++L +++ ++
Sbjct: 80 VEIVATANGKLTHSGRSKE-IKPIENSIVKEIIVKEGESVRKGDVLLKLTALGA---EAD 135

Query: 228 FAAAQRRLSLAQSTYKREQQLWKE---------GISAEQEFLLARQGLQEAEIALNNAR 277
Q L A+ R Q L + + E F + +L +
Sbjct: 136 TLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQ 194


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_02592  ACRIFLAVINRP  811    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 811 bits (2097), Expect = 0.0
Identities = 237/1055 (22%), Positives = 435/1055 (41%), Gaps = 56/1055 (5%)

Query: 5 IIQFAIEQRWLVLLAVLGMAGVGIGSYQKLSIDAVPDITNVQVQINTAAPGYSPLEVEQR 64
+ F I + + + + G + +L + P I V ++ PG V+
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 65 ITYPVETVMAGLPGLQETRSLS-RPGISQVTVIFEEGTDIYFARQQVNERLSTAREQLPE 123
+T +E M G+ L S S G +T+ F+ GTD A+ QV +L A LP+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 124 DISPTLGPISTGLGEIYLWTVEAEEGATKEDGSAYTPTDLRTIQDWIIRPQLRNVKGVAE 183
++ + YL D T D+ ++ L + GV +
Sbjct: 121 EVQQQGISVEKSSS-SYLMVA-----GFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGD 174

Query: 184 INTIGGYAKQFLIAPDPKKLAAYKLTLGDLQNAVLRNNENVGAGYI------ERRGEQLL 237
+ G I D L YKLT D+ N + N+ + AG + +
Sbjct: 175 VQLFGA-QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNAS 233

Query: 238 IRAPGQVKDMDDIRGIIV-SNVDGVPIRIRDVAEVGLGKELRTGAATENGREVVLGTVFM 296
I A + K+ ++ + + N DG +R++DVA V LG E A NG+ + +
Sbjct: 234 IIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKL 293

Query: 297 LIGENSREVAQAVGQRLEEINRTLPKGVKAITVYDRTTLVDKAVATVKKNLVEGAALVIA 356
G N+ + A+A+ +L E+ P+G+K + YD T V ++ V K L E LV
Sbjct: 294 ATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFL 353

Query: 357 VLFLFLGNIRAALITATIIPLSMLFTFTGMVGNRVSANLMSLG--ALDFGIIVDGAVVIV 414
V++LFL N+RA LI +P+ +L TF + S N +++ L G++VD A+V+V
Sbjct: 354 VMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVV 413

Query: 415 ENAIRRLAHAQAHHGRQLTRAERFHEVFAASREARRALVFGQIIIMVVYLPIFALTGVEG 474
EN + + + + ALV +++ V++P+ G G
Sbjct: 414 EN---------VERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTG 464

Query: 475 KMFHPMAFTVVTALLGAMILSVTFVPAAIALFITGKVKEEE----------NFVMRRARL 524
++ + T+V+A+ ++++++ PA A + E N +
Sbjct: 465 AIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVN 524

Query: 525 AYEPALRWVLGHRALVVGGALGAILLTGLVASRMGSEFIPSLSEGDFAMQGLRVPGTSL- 583
Y ++ +LG + + ++ R+ S F+P +G F G +
Sbjct: 525 HYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQE 584

Query: 584 -TQSVEMQQTLERKLMGKFPEIERVFARTGTAEIASDLMPPNASDSYVMLKPQSQWPDPK 642
TQ V + Q + L + +E VF G + NA ++V LKP + +
Sbjct: 585 RTQKV-LDQVTDYYLKNEKANVESVFTVNG---FSFSGQAQNAGMAFVSLKPWEERNGDE 640

Query: 643 KSREALLEELQAAALEVP-GSVYEFSQPIQLRFNELISGVRSDVA-VKVFGDDMQVLNDT 700
S EA++ + ++ G V F+ P EL + D + G L
Sbjct: 641 NSAEAVIHRAKMELGKIRDGFVIPFNMP---AIVELGTATGFDFELIDQAGLGHDALTQA 697

Query: 701 AEKI-SKVLQGIDGASEVKVEQTTGLPVLTVDIDRDKAARFGLNVGDIQDTVATALGGRN 759
++ Q V+ +++D++KA G+++ DI T++TALGG
Sbjct: 698 RNQLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTY 757

Query: 760 AGTLFEGDRRFDIVIRLPETLRADLPALSNLLIPLPPNNLARIDFIPLSDVARLDLSPGP 819
+ R + ++ R + L + + +P S G
Sbjct: 758 VNDFIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANG-----EMVPFSAFTTSHWVYGS 812

Query: 820 NQISRENGKRRIVVSANVRGRDIGSFVLEAQQKLQDGVKIPAGYWTTWGGQFEQLQSAAK 879
++ R NG + + + + L K+PAG W G Q + +
Sbjct: 813 PRLERYNGLPSMEIQGEAAPGTSSGDAMALMENLAS--KLPAGIGYDWTGMSYQERLSGN 870

Query: 880 RLQVVVPVALLLVFTLLFAMFNNVKDGLLVFTGIPFALTGGVLALWLRGIPLSISAAVGF 939
+ +V ++ ++VF L A++ + + V +P + G +LA L + VG
Sbjct: 871 QAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGL 930

Query: 940 IALSGVAVLNGLVMISFIRNLL-QEGRSLDQAVWEGAITRLRPVLMTALVASLGFVPMAL 998
+ G++ N ++++ F ++L+ +EG+ + +A RLRP+LMT+L LG +P+A+
Sbjct: 931 LTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAI 990

Query: 999 ATGTGAEVQRPLATVVIGGILSSTMLTLLVLPVLY 1033
+ G G+ Q + V+GG++S+T+L + +PV +
Sbjct: 991 SNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFF 1025



Score = 70.6 bits (173), Expect = 2e-14
Identities = 70/527 (13%), Positives = 160/527 (30%), Gaps = 46/527 (8%)

Query: 2 FERIIQFAIEQRWLVLLAVLGMAGVGIGSYQKLSIDAVPDITNVQVQINTAAPGYSPLEV 61
+ + + LL + + + +L +P+ P + E
Sbjct: 526 YTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQER 585

Query: 62 EQRI---------TYPVETVMA--GLPGLQETRSLSRPGISQVTV-IFEEGTDIYFARQQ 109
Q++ V + + G + G++ V++ +EE + +
Sbjct: 586 TQKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEA 645

Query: 110 VNERLSTAREQLPED-ISPTLGPISTGLGEIYLWTVEAEEGATKEDGSAYTPTDLRTIQD 168
V R ++ + + P P LG D + L ++
Sbjct: 646 VIHRAKMELGKIRDGFVIPFNMPAIVELG------TATGFDFELIDQAGLGHDALTQARN 699

Query: 169 WIIRPQLRNVKGVAEINTIGGY-AKQFLIAPDPKKLAAYKLTLGDLQNAVLRNNENVGAG 227
++ ++ + + G QF + D +K A ++L D+ +
Sbjct: 700 QLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVN 759

Query: 228 YIERRGEQ--LLIRAPGQ-VKDMDDIRGIIVSNVDGVPIRIRDVAEVGLGKELRTGAAT- 283
RG L ++A + +D+ + V + +G + G+
Sbjct: 760 DFIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTS----HWVYGSPRL 815

Query: 284 --ENGREVVLGTVFMLIGENSREVAQAVGQRLEEINRTLPKGVKAITVYDRTTLVDKAVA 341
NG + G +S + + E + LP G+ + +
Sbjct: 816 ERYNGLPSMEIQGEAAPGTSSGDAMALM----ENLASKLPAGI-GYDWTGMSYQERLSGN 870

Query: 342 TVKKNLVEGAALVIAVLFLFLGNIRAALITATIIPLSMLFTFTGMVGNRVSANLMSLGAL 401
+ +V L + + ++PL ++ ++ + L
Sbjct: 871 QAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGL 930

Query: 402 --DFGIIVDGAVVIVENAIRRLAHAQAHHGRQLTRAERFHEVFAASREARRALVFGQIII 459
G+ A++IVE A G+ + A A R R ++ +
Sbjct: 931 LTTIGLSAKNAILIVEFAK----DLMEKEGKGVVEA-----TLMAVRMRLRPILMTSLAF 981

Query: 460 MVVYLPIFALTGVEGKMFHPMAFTVVTALLGAMILSVTFVPAAIALF 506
++ LP+ G + + V+ ++ A +L++ FVP +
Sbjct: 982 ILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVI 1028


107  PAKAF_03108  PAKAF_03111  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_03108  1       12       1.226375   DUF3203 family protein
PAKAF_03109  1       13       0.942325   probable transcriptional regulator
PAKAF_03110  1       13       1.060807   Resistance-Nodulation-Cell Division (RND)
PAKAF_03111  0       13       0.521087   Resistance-Nodulation-Cell Division (RND)
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03108  PF05272       28     0.002    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.1 bits (62), Expect = 0.002
Identities = 18/68 (26%), Positives = 26/68 (38%), Gaps = 11/68 (16%)

Query: 14 VEIEGSRHRAPVDSLRIGTDAEARLSVLYIDGKRLHISEED---------AQRLVVAGAE 64
V + G + + R AEA LY+ G+R S ED RLV G +
Sbjct: 711 VLVPGRANLVWLQKFRGQLFAEAL--HLYLAGERYFPSPEDEEIYFRPEQELRLVETGVQ 768

Query: 65 DQRRHLMA 72
+ L+
Sbjct: 769 GRLWALLT 776


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03109  HTHTETR       66     1e-15    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 65.8 bits (160), Expect = 1e-15
Identities = 38/176 (21%), Positives = 73/176 (41%), Gaps = 5/176 (2%)

Query: 1 MADLADAAGVSRGAVYGHYKNKIEVCLAMCDRAFGQI-EVPDENA--RVPALDILLRAGM 57
+ ++A AAGV+RGA+Y H+K+K ++ + + + I E+ E +LR +
Sbjct: 34 LGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPGDPLSVLREIL 93

Query: 58 -GFLRQCCEPGSVQRVLEILYLKCERSDENEPLLRRRELLEKQGQRFGLRQIRRAVERGE 116
L + ++EI++ KCE E + + + L + + ++ +E
Sbjct: 94 IHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKM 153

Query: 117 LPARLDVELASIYLQSLWDGICGTLAWTERLRDDPWNRAERMFRAGLDSLRSSPYL 172
LPA L A+I ++ G+ + + D A L+ P L
Sbjct: 154 LPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLK-KEARDYVAILLEMYLLCPTL 208


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03110  RTXTOXIND     42     3e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 41.7 bits (98), Expect = 3e-06
Identities = 14/83 (16%), Positives = 30/83 (36%), Gaps = 3/83 (3%)

Query: 117 ASHAAAADKLKRYADLIKDRAISERE--YTEAQTDARQALAQIASAKAELEQARLRLGYA 174
+ + ++++ K+ + E RQ I EL + R +
Sbjct: 269 RVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQAS 328

Query: 175 TVTAPIDGR-ARRALVTEGALVG 196
+ AP+ + + + TEG +V
Sbjct: 329 VIRAPVSVKVQQLKVHTEGGVVT 351



Score = 41.4 bits (97), Expect = 4e-06
Identities = 24/137 (17%), Positives = 47/137 (34%), Gaps = 7/137 (5%)

Query: 67 EVRARVAGIVTRRLYEEGQDVRAGTVLFQIDPAPLKAALDISRGALARAEASHAAAADKL 126
E++ IV + +EG+ VR G VL ++ +A ++ +L +A L
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLE-QTRYQIL 156

Query: 127 KRYADLIKDRAISEREYT------EAQTDARQALAQIASAKAELEQARLRLGYATVTAPI 180
R +L K + + E + +L + + + ++ + L A
Sbjct: 157 SRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAER 216

Query: 181 DGRARRALVTEGALVGE 197
R E E
Sbjct: 217 LTVLARINRYENLSRVE 233


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03111  ACRIFLAVINRP  1092   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1092 bits (2826), Expect = 0.0
Identities = 508/1033 (49%), Positives = 710/1033 (68%), Gaps = 6/1033 (0%)

Query: 1 MARFFIDRPVFAWVISLLIVLAGVLAIRFLPVAQYPDIAPPVVNVSASYPGASAKVVEEA 60
MA FFI RP+FAWV+++++++AG LAI LPVAQYP IAPP V+VSA+YPGA A+ V++
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTAIIEREMNGAPGLLYTKATS-STGQASLTLTFRQGVNADLAAVEVQNRLKIVESRLPE 119
VT +IE+ MNG L+Y +TS S G ++TLTF+ G + D+A V+VQN+L++ LP+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 120 SVRRDGIYVEKAADSIQLIVTLTSSSGRYDAMELGEIASSNVLQALRRVEGVGKVETWGA 179
V++ GI VEK++ S ++ S + ++ + +SNV L R+ GVG V+ +GA
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 180 EYAMRIWPDPAKLTSMNLSASDLVNAVRRHNARLTVGDIGNLGVPDSAPISATVKVDDTL 239
+YAMRIW D L L+ D++N ++ N ++ G +G ++A++
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 240 VTPEQFGEIPLRIRADGGAIRLRDVARVEFGQSEYGFVSRVNQMTATGLAVKMAPGSNAV 299
PE+FG++ LR+ +DG +RL+DVARVE G Y ++R+N A GL +K+A G+NA+
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 300 ATAKRIRATLDELSRYFPEGVSYNIPYDTSAFVEISIRKVVSTLLEAMLLVFAVMYLFMQ 359
TAK I+A L EL +FP+G+ PYDT+ FV++SI +VV TL EA++LVF VMYLF+Q
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 360 NFRATLIPTLVVPVALLGTFTVMLGLGFSINVLTMFGMVLAIGILVDDAIIVVENVERLM 419
N RATLIPT+ VPV LLGTF ++ G+SIN LTMFGMVLAIG+LVDDAI+VVENVER+M
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 420 AEEGLSPHDATVKAMRQISGAIVGITVVLVSVFVPMAFFSGAVGNIYRQFAVTLAVSIGF 479
E+ L P +AT K+M QI GA+VGI +VL +VF+PMAFF G+ G IYRQF++T+ ++
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 480 SAFLALSLTPALCATLLRPIDADHHE-KRGFFGWFNRAFLRLTGRYRNAVAGILARPIRW 538
S +AL LTPALCATLL+P+ A+HHE K GFFGWFN F Y N+V IL R+
Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540

Query: 539 MLVYALVIGVVALLFVRLPQAFLPEEDQGDFMIMVMQPEGTPMAETMANVGDVERYLAEH 598
+L+YAL++ + +LF+RLP +FLPEEDQG F+ M+ P G T + V Y ++
Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600

Query: 599 EP--VAYAYAVGGFSLYGDGTSSAMIFATLKDWSERREASQHVGAIVERINQRFAGLPNR 656
E V + V GFS G ++ M F +LK W ER A++ R + +
Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660

Query: 657 TVYAMNSPPLPDLGSTSGFDFRLQDRGGVGYEALVEARDQLLARAAEDP-RLANVMFAGQ 715
V N P + +LG+ +GFDF L D+ G+G++AL +AR+QLL AA+ P L +V G
Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720

Query: 716 GEAPQIRLDIDRRKAETLGVSMDEINTTLAVMFGSDYIGDFMHGSQVRKVVVQADGAKRL 775
+ Q +L++D+ KA+ LGVS+ +IN T++ G Y+ DF+ +V+K+ VQAD R+
Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780

Query: 776 GIDDIGRLHVRNEQGEMVPLATFAKAAWTLGPPQLTRYNGYPSFNLEGQAAPGYSSGEAM 835
+D+ +L+VR+ GEMVP + F + W G P+L RYNG PS ++G+AAPG SSG+AM
Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840

Query: 836 QAMEELMQGLPEGIAHEWSGQSFEERLSGAQAPALFALSVLIVFLALAALYESWSIPLAV 895
ME L LP GI ++W+G S++ERLSG QAPAL A+S ++VFL LAALYESWSIP++V
Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900

Query: 896 ILVVPLGVLGALLGVSLRGLPNDIYFKVGLITIIGLSAKNAILIIEVAKD-HYQEGMSLL 954
+LVVPLG++G LL +L ND+YF VGL+T IGLSAKNAILI+E AKD +EG ++
Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960

Query: 955 QATLEAARLRLRPIVMTSLAFGFGVVPLALSSGAGSGAQVAIGTGVLGGIVTATVLAVFL 1014
+ATL A R+RLRPI+MTSLAF GV+PLA+S+GAGSGAQ A+G GV+GG+V+AT+LA+F
Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020

Query: 1015 VPLFFLVVGRLFR 1027
VP+FF+V+ R F+
Sbjct: 1021 VPVFFVVIRRCFK 1033


108  PAKAF_03417  PAKAF_03424  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_03417  -1      7        -0.003887  ribonuclease E inhibitor RraB
PAKAF_03418  -1      7        0.011188   MucR
PAKAF_03419  0       9        0.051799   periplasmic beta-glucosidase
PAKAF_03420  4       18       -0.170097  type III export protein PscL
PAKAF_03421  4       19       -0.944546  type III export protein PscK
PAKAF_03422  3       18       -1.349641  type III export protein PscJ
PAKAF_03423  4       15       0.367498   type III export protein PscI
PAKAF_03424  1       14       0.536122   type III export protein PscH
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03417  SECBCHAPRONE  26     0.025    Bacterial protein-transport SecB chaperone protein ...
		>SECBCHAPRONE#Bacterial protein-transport SecB chaperone protein signature.

Length = 170

Score = 26.4 bits (58), Expect = 0.025
Identities = 8/30 (26%), Positives = 15/30 (50%)

Query: 19 EGGFDFARIHPIDFFAIFPSEREARQAAGQ 48
G F + P++F A+F + ++ A Q
Sbjct: 131 RGTFPALNLSPVNFDALFMDYLQRQEQAEQ 160


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03420  TYPE4SSCAGX   30     0.009    Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature.

Length = 522

Score = 29.8 bits (66), Expect = 0.009
Identities = 27/102 (26%), Positives = 45/102 (44%), Gaps = 8/102 (7%)

Query: 21 LRARDYQDYLSANRLVEAA--------RERAAEIEREAHEVYQEQKRLGWEAGLEEARLR 72
L RDYQ++L +L+ A +++A E E+EA E Q+ ++ E EE
Sbjct: 117 LMTRDYQEFLKTKKLIVDAPDPKELEEQKKALEKEKEAKEQAQKAQKDKREKRKEERAKN 176

Query: 73 QAGLIQETLLRCNRYYRQVDRQLGEVVLQAVRKVLRHYDAVE 114
+A L T N ++ L E++ Q L + +E
Sbjct: 177 RANLENLTNAMSNPQNLSNNKNLSELIKQQRENELDQMERLE 218


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03422  FLGMRINGFLIF  75     1e-17    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 75.0 bits (184), Expect = 1e-17
Identities = 33/165 (20%), Positives = 69/165 (41%), Gaps = 6/165 (3%)

Query: 27 LYTGISQKEGNEMLALLRSEGVSADKQADKDGTVRLLVEESDIAEAVEVLKRKGYPRENF 86
L++ +S ++G ++A L + + V + E L ++G P+
Sbjct: 53 LFSNLSDQDGGAIVAQLTQMNIPYR--FANGSGA-IEVPADKVHELRLRLAQQGLPKGG- 108

Query: 87 STLKDVFPKDGLISSPIEERARLNYAKAQEISHTLSEIDGVLVARVHVVLPEERDGLGRK 146
+ ++ ++ S E+ A E++ T+ + V ARVH+ +P + R+
Sbjct: 109 AVGFELLDQEKFGISQFSEQVNYQRALEGELARTIETLGPVKSARVHLAMP-KPSLFVRE 167

Query: 147 SSPASASVFIKHAADVQLD-AYVPQIKQLVNNGIEGLSYDRISVV 190
SASV + LD + + LV++ + GL +++V
Sbjct: 168 QKSPSASVTVTLEPGRALDEGQISAVVHLVSSAVAGLPPGNVTLV 212


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_03424  PF09025  205    2e-71    YopR Core
		>PF09025#YopR Core

Length = 143

Score = 205 bits (522), Expect = 2e-71
Identities = 143/143 (100%), Positives = 143/143 (100%)

Query: 1 MSRIDTPPGFAVYPSASPKAANLPAVDQVLAFEQALGGEPPAAGRRLAGLENGALGERLL 60
MSRIDTPPGFAVYPSASPKAANLPAVDQVLAFEQALGGEPPAAGRRLAGLENGALGERLL
Sbjct: 1 MSRIDTPPGFAVYPSASPKAANLPAVDQVLAFEQALGGEPPAAGRRLAGLENGALGERLL 60

Query: 61 QRFAQPLQGLEADRLELKAMLRAELPLGRQQQTFLLQLLGAVEHAPGGEYLAQLARRELQ 120
QRFAQPLQGLEADRLELKAMLRAELPLGRQQQTFLLQLLGAVEHAPGGEYLAQLARRELQ
Sbjct: 61 QRFAQPLQGLEADRLELKAMLRAELPLGRQQQTFLLQLLGAVEHAPGGEYLAQLARRELQ 120

Query: 121 VLIPLNGMLDNLVRNSHKLDLES 143
VLIPLNGMLDNLVRNSHKLDLES
Sbjct: 121 VLIPLNGMLDNLVRNSHKLDLES 143


109  PAKAF_03429  PAKAF_03439  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_03429  -2      18       -0.800843  Type III secretion outer membrane protein PscC
PAKAF_03430  1       26       -1.898200  type III export apparatus protein
PAKAF_03431  1       24       -2.222643  ExsD
PAKAF_03432  2       24       -2.739046  transcriptional regulator ExsA
PAKAF_03433  2       21       -1.016333  exoenzyme S synthesis protein B
PAKAF_03434  2       18       -1.497757  ExsE
PAKAF_03435  2       17       -1.507800  ExsC, exoenzyme S synthesis protein C precursor
PAKAF_03436  2       15       -0.672302  Translocator outer membrane protein PopD
PAKAF_03437  3       13       -0.989206  translocator protein PopB
PAKAF_03438  3       14       -0.536532  regulatory protein PcrH
PAKAF_03439  4       15       0.044286   type III secretion protein PcrV
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03429  TYPE3OMGPROT  816    0.0      Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature.

Length = 607

Score = 816 bits (2108), Expect = 0.0
Identities = 375/600 (62%), Positives = 472/600 (78%), Gaps = 7/600 (1%)

Query: 1 MRRLLIGGLLALLPGAVLRAQPLDWPSLPYDYVAQGESLRDVLANFGANYDASVIVSDKV 60
+R+L G LL L + AQ LDW +PY YVA+GESLRD+L +FGANYDA+V+VSDK+
Sbjct: 9 FKRVLTGTLLLLSSYSW--AQELDWLPIPYVYVAKGESLRDLLTDFGANYDATVVVSDKI 66

Query: 61 NDQVSGRFDLESPQAFLQLMASLYNLGWYYDGTVLYVFKTTEMQSRLVRLEQVGEAELKR 120
ND+VSG+F+ ++PQ FLQ +ASLYNL WYYDG VLY+FK +E+ SRL+RL++ AELK+
Sbjct: 67 NDKVSGQFEHDNPQDFLQHIASLYNLVWYYDGNVLYIFKNSEVASRLIRLQESEAAELKQ 126

Query: 121 ALTAAGIWEARFGWRADPSGRLVHVSGPGRYLELVEQTAQVLEQQYTLRSEKTGDLSVEI 180
AL +GIWE RFGWR D S RLV+VSGP RYLELVEQTA LEQQ +RSEKTG L++EI
Sbjct: 127 ALQRSGIWEPRFGWRPDASNRLVYVSGPPRYLELVEQTAAALEQQTQIRSEKTGALAIEI 186

Query: 181 FPLRYAVAEDRKIEYRDDEIEAPGIASILSRVLSDANVVAVGDEPGKLRPGP--QSSHAV 238
FPL+YA A DR I YRDDE+ APG+A+IL RVLSDA + V + ++ S+ A
Sbjct: 187 FPLKYASASDRTIHYRDDEVAAPGVATILQRVLSDATIQQVTVDNQRIPQAATRASAQAR 246

Query: 239 VQAEPSLNAVVVRDHKDRLPMYRRLIEALDRPSARIEVGLSIIDINAENLAQLGVDWSAG 298
V+A+PSLNA++VRD +R+PMY+RLI ALD+PSARIEV LSI+DINA+ L +LGVDW G
Sbjct: 247 VEADPSLNAIIVRDSPERMPMYQRLIHALDKPSARIEVALSIVDINADQLTELGVDWRVG 306

Query: 299 IRLGNNKSIQIRTTGQDSEEGGGAGNGAVGSLVDSRGLDFLLAKVTLLQSQGQAQIGSRP 358
IR GNN + I+TTG S A NGA+GSLVD+RGLD+LLA+V LL+++G AQ+ SRP
Sbjct: 307 IRTGNNHQVVIKTTGDQS---NIASNGALGSLVDARGLDYLLARVNLLENEGSAQVVSRP 363

Query: 359 TLLTQENTQAVLDQSETYYVRVTGERVAELKAITYGTMLKMTPRVVTLGDTPEISLSLHI 418
TLLTQEN QAV+D SETYYV+VTG+ VAELK ITYGTML+MTPRV+T GD EISL+LHI
Sbjct: 364 TLLTQENAQAVIDHSETYYVKVTGKEVAELKGITYGTMLRMTPRVLTQGDKSEISLNLHI 423

Query: 419 EDGSQKPNSAGLDKIPTINRTVIDTIARVGHGQSLLIGGIYRDELSQSQRKVPWLGDIPY 478
EDG+QKPNS+G++ IPTI+RTV+DT+ARVGHGQSL+IGGIYRDELS + KVP LGDIPY
Sbjct: 424 EDGNQKPNSSGIEGIPTISRTVVDTVARVGHGQSLIIGGIYRDELSVALSKVPLLGDIPY 483

Query: 479 LGALFRTTADTVRRSVRLFLIEPRLIDDGVGHYLALNNRRDLRGGLLEIDELSNQSLSLR 538
+GALFR ++ RR+VRLF+IEPR+ID+G+ H+LAL N +DLR G+L +DE+SNQS +L
Sbjct: 484 IGALFRRKSELTRRTVRLFIIEPRIIDEGIAHHLALGNGQDLRTGILTVDEISNQSTTLN 543

Query: 539 KLLGSARCQALAPARAEQERLRQAGQGSFLTPCRMGAQEGWRVTDGACPKDGAWCVGAER 598
KLLG ++CQ L A+ Q+ L Q + S+LT C+M GWRV +GAC +WCV A +
Sbjct: 544 KLLGGSQCQPLNKAQEVQKWLSQNNKSSYLTQCKMDKSLGWRVVEGACTPAQSWCVSAPK 603


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_03430  PF05932  93     2e-27    Tir chaperone protein (CesT)
		>PF05932#Tir chaperone protein (CesT)

Length = 127

Score = 92.6 bits (230), Expect = 2e-27
Identities = 25/120 (20%), Positives = 41/120 (34%), Gaps = 5/120 (4%)

Query: 2 DHLLSGLATRLGQGPFVADRTGSYHLRIDGQSVLLLRQGDDLLLESPLEHAPLDPQRDQQ 61
LL + L P V D G+ ++ ID L L D E L L+P +
Sbjct: 7 KTLLDDFSRSLEMQPLVFDDHGTCNMIIDNTFALTLS--CDYARERLLLIGLLEP--HKD 62

Query: 62 GLLRALLSRVASWSRRYPQAIVLDADGRLLLQA-RLGLDGLDPERLERALAAQVGLLEAL 120
+ LL+ + + LD L + + L L+R +A + +
Sbjct: 63 IPQQCLLAGALNPLLNAGPGLGLDEKSGLYHAYQSIPREKLSVPTLKREMAGLLEWMRGW 122


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_03435  PF05932  47     6e-10    Tir chaperone protein (CesT)
		>PF05932#Tir chaperone protein (CesT)

Length = 127

Score = 47.5 bits (113), Expect = 6e-10
Identities = 27/118 (22%), Positives = 49/118 (41%), Gaps = 4/118 (3%)

Query: 10 LLAEFAGRIGLPSLSLDEEGMASLLFDEQVGVTLLLLAERERLLLEADVAGIDVLGEGIF 69
LL +F+ + + L D+ G +++ D +TL RERLLL + +
Sbjct: 9 LLDDFSRSLEMQPLVFDDHGTCNMIIDNTFALTLSCDYARERLLLIGLLEP---HKDIPQ 65

Query: 70 RQLASFNRHWHRFDLH-FGFDELTGKVQLYAQILAAQLTLECFEATLANLLDHAEFWQ 126
+ L + + G DE +G Y I +L++ + +A LL+ W+
Sbjct: 66 QCLLAGALNPLLNAGPGLGLDEKSGLYHAYQSIPREKLSVPTLKREMAGLLEWMRGWR 123


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_03436  PF05844  385    e-137    YopD protein
		>PF05844#YopD protein

Length = 295

Score = 385 bits (989), Expect = e-137
Identities = 291/295 (98%), Positives = 293/295 (99%)

Query: 1 MIDTQYSLAATQAAIPSEPIAPGAAGRSVGTPQAAADLPQVPAARADRVELNAPRQVLDP 60
MIDTQYSLAATQAAIPSEPIAPGAAGRSVGTPQAAA+LPQVPAARADRVELNAPRQVLDP
Sbjct: 1 MIDTQYSLAATQAAIPSEPIAPGAAGRSVGTPQAAAELPQVPAARADRVELNAPRQVLDP 60

Query: 61 VRMEAAGSELDSSVELLLILFRIAQKARELGVLQRDNENQSIIHAQKAQVDEMRSGATLM 120
VRMEAAGSELDSSVELLLILFRIAQKARELGVLQRDNENQ+IIHAQKAQVDEMRSGATLM
Sbjct: 61 VRMEAAGSELDSSVELLLILFRIAQKARELGVLQRDNENQAIIHAQKAQVDEMRSGATLM 120

Query: 121 IAMAVIAGVGALASAVVGSLGALKNGKAISQEKTLQKNIDGRNELIDAKMQALGKTSDED 180
IAMAVIAGVGALASAVVGSLGALKNGKAISQEKTLQKNIDGRNELIDAKMQALGKTSDED
Sbjct: 121 IAMAVIAGVGALASAVVGSLGALKNGKAISQEKTLQKNIDGRNELIDAKMQALGKTSDED 180

Query: 181 RKIVGKVWAADQVQDSVALRAAGRAFESRNGALQVANTVIQSFVQMANASVQVRQGESQA 240
RKIVGKVWAADQ QDSVALRAAGRAFESRNGALQVANTVIQSFVQMANASVQVRQGESQA
Sbjct: 181 RKIVGKVWAADQAQDSVALRAAGRAFESRNGALQVANTVIQSFVQMANASVQVRQGESQA 240

Query: 241 SAREGEVNATIGQSQKQKVEDQMSFDAGFMKDVLQLIQQYTQSHNQAWRAAAGVV 295
SARE EVNATIGQSQKQKVEDQMSFDAGFMKDVLQLIQQYTQSHNQAWRAAAGVV
Sbjct: 241 SAREEEVNATIGQSQKQKVEDQMSFDAGFMKDVLQLIQQYTQSHNQAWRAAAGVV 295


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03438  SYCDCHAPRONE  202    2e-69    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 202 bits (514), Expect = 2e-69
Identities = 95/166 (57%), Positives = 126/166 (75%)

Query: 3 QQATPSDTDQQQALEAFLRDGGTLAMLRGLSEDTLEQLYALGFNQYQAGKWDDAQKIFQA 62
QQ T + Q A+E+FL+ GGT+AML +S DTLEQLY+L FNQYQ+GK++DA K+FQA
Sbjct: 2 QQETTDTQEYQLAMESFLKGGGTIAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQA 61

Query: 63 LCMLDHYDARYFLGLGACRQSLGLYEQALQSYSYGALMDINEPRFPFHAAECHLQLGDLD 122
LC+LDHYD+R+FLGLGACRQ++G Y+ A+ SYSYGA+MDI EPRFPFHAAEC LQ G+L
Sbjct: 62 LCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELA 121

Query: 123 GAESGFYSARALAAAQPAHEALAARAGAMLEAVTARKDRTYESDNA 168
AESG + A+ L A + + L+ R +MLEA+ +K+ +E +
Sbjct: 122 EAESGLFLAQELIADKTEFKELSTRVSSMLEAIKLKKEMEHECVDN 167


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
PAKAF_03439  LCRVANTIGEN  344    e-121    Low calcium response V antigen signature.
		>LCRVANTIGEN#Low calcium response V antigen signature.

Length = 326

Score = 344 bits (884), Expect = e-121
Identities = 115/296 (38%), Positives = 171/296 (57%), Gaps = 32/296 (10%)

Query: 25 ASAEQEELLALLRSERIVLAHAGQPLSEAQVL-------------KALAWLLAANPSAPP 71
S+ EEL+ L++ + I ++ P +++V K LA+ L +
Sbjct: 28 GSSVLEELVQLVKDKNIDISIKYDPRKDSEVFANRVITDDIELLKKILAYFLPEDAILKG 87

Query: 72 GQ-------GLEVLREVLQARRQPGAQWDLREFLVSAYFSLHG-RLDEDVIGVYKDVLQT 123
G G++ ++E L++ P QW+LR F+ +FSL R+D+D++ V D +
Sbjct: 88 GHYDNQLQNGIKRVKEFLES--SPNTQWELRAFMAVMHFSLTADRIDDDILKVIVDSMNH 145

Query: 124 QDGKRKALLDELKALTAELKVYSVIQSQINAALSAKQGIRIDAGGIDLVDPTLYGYAVGD 183
R L +EL LTAELK+YSVIQ++IN LS+ I I I+L+D LYGY +
Sbjct: 146 HGDARSKLREELAELTAELKIYSVIQAEINKHLSSSGTINIHDKSINLMDKNLYGYT-DE 204

Query: 184 PRWKDSPEYALLSNLDTFSGKL--------SIKDFLSGSPKQSGELKGLSDEYPFEKDNN 235
+K S EY +L + + ++ SIKDFL K++G L L + Y + KDNN
Sbjct: 205 EIFKASAEYKILEKMPQTTIQVDGSEKKIVSIKDFLGSENKRTGALGNLKNSYSYNKDNN 264

Query: 236 PVGNFATTVSDRSRPLNDKVNEKTTLLNDTSSRYNSAVEALNRFIQKYDSVLRDIL 291
+ +FATT SD+SRPLND V++KTT L+D +SR+NSA+EALNRFIQKYDSV++ +L
Sbjct: 265 ELSHFATTCSDKSRPLNDLVSQKTTQLSDITSRFNSAIEALNRFIQKYDSVMQRLL 320


110  PAKAF_03447  PAKAF_03455  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_03447  1       15       2.400556   Type III secretion outer membrane protein PopN
PAKAF_03448  0       15       2.055314   ATP synthase in type III secretion system
PAKAF_03449  3       17       1.532271   translocation protein in type III secretion
PAKAF_03450  3       17       0.472186   translocation protein in type III secretion
PAKAF_03451  2       11       -1.138468  translocation protein in type III secretion
PAKAF_03452  1       10       -1.652441  translocation protein in type III secretion
PAKAF_03453  1       11       -1.573790  probable translocation protein in type III
PAKAF_03454  0       9        -0.727132  translocation protein in type III secretion
PAKAF_03455  -1      9        -0.864181  translocation protein in type III secretion
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_03447  PF07201  284    4e-98    Hypersensitivity response secretion protein HrpJ
		>PF07201#Hypersensitivity response secretion protein HrpJ

Length = 293

Score = 284 bits (727), Expect = 4e-98
Identities = 134/294 (45%), Positives = 181/294 (61%), Gaps = 7/294 (2%)

Query: 1 MDILQSSSAAPLA-----PREAANAPAQQAGGSFQGERVHYVSVS-QSLADAAEELTFAF 54
M L + S P A++ Q G F+GE V VS + QS+AD AEE+TF F
Sbjct: 1 MTTLHNLSYGNTPLHNERPEIASSQIVNQTLGQFRGESVQIVSGTLQSIADMAEEVTFVF 60

Query: 55 SERAEKSLAKRRLSDAHARLSEVQAMLQEYWKRIPDLESQQKLEALIAHLGSGQLSSLAQ 114
SER E SL KR+LSD+ AR+S+V+ + +Y ++P+LE +Q + L++ L + SL+Q
Sbjct: 61 SERKELSLDKRKLSDSQARVSDVEEQVNQYLSKVPELEQKQNVSELLSLLSNSPNISLSQ 120

Query: 115 LSAYLEGFSSEISQRFLALSRARDVLAGRPEARAMLALVDQALLRMADEQGLEIELGLRI 174
L AYLEG S E S++F L RD L GRPE + LV+QAL+ MA+EQG I LG RI
Sbjct: 121 LKAYLEGKSEEPSEQFKMLCGLRDALKGRPELAHLSHLVEQALVSMAEEQGETIVLGARI 180

Query: 175 EPLAAEASAAGVGDIQALRDTYRDAVLDYRGLSAAWQDIQARFAATPLERVVAFLQKALS 234
P A S +GV +Q LRDTYRDAV+ Y+G+ A W D+Q RF ++ V+ FLQKALS
Sbjct: 181 TPEAYRESQSGVNPLQPLRDTYRDAVMGYQGIYAIWSDLQKRFPNGDIDSVILFLQKALS 240

Query: 235 ADLDSQSSRLDPVKLERVMSDMHKLRVLGGLAEQVGALWQVLVTGERGHGIRAF 288
ADL SQ S KL V+SD+ KL+ G +++QV WQ G + +G+R F
Sbjct: 241 ADLQSQQSGSGREKLGIVISDLQKLKEFGSVSDQVKGFWQFFSEG-KTNGVRPF 293


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
PAKAF_03450  IGASERPTASE  43     2e-06    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 43.1 bits (101), Expect = 2e-06
Identities = 35/188 (18%), Positives = 58/188 (30%), Gaps = 18/188 (9%)

Query: 45 PPFDKGDETTEAEEPAATADAPTSTPLADQPAAPAADRPPTTRQAPVPVAADATPTPTPT 104
P +K ++T + + P A R +APVP A ATP+ T
Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIA-RV---DEAPVPPPAPATPSETTE 1038

Query: 105 PTPTPTPTPTPTVSPSGSVARQAPAVSARVAASTQAREPASVSAPPVDEPPLVPVSSHPQ 164
+ + TV + A + A + VA ++ A+ V + +
Sbjct: 1039 TVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTT 1098

Query: 165 IAGRTHERPQPGPGFPAKAAAEVAPTAQASAQASPPAPTAGGEGRGEERRQPGETDPSAL 224
T + KA E T + S +P ++ Q P A
Sbjct: 1099 ETKETATVEKEE-----KAKVETEKTQEVPKVTSQVSP---------KQEQSETVQPQAE 1144

Query: 225 PPDDQAPV 232
P + P
Sbjct: 1145 PARENDPT 1152


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03451  TYPE3OMOPROT  84     2e-20    Type III secretion system outer membrane O protein ...
		>TYPE3OMOPROT#Type III secretion system outer membrane O protein family signature.

Length = 303

Score = 83.5 bits (206), Expect = 2e-20
Identities = 46/177 (25%), Positives = 73/177 (41%), Gaps = 14/177 (7%)

Query: 130 RLALWLDGDPATLLARLPPRPSAQRLAIPLRLSLQWPGLPLDASELRTLEPGDLLLLPAG 189
R LW + P L A RP R + + L L + GD+LL+
Sbjct: 126 RGGLWFEHLPE-LPAVGGGRPKMLRWPLRFVIGSSDTQRSL----LGRIGIGDVLLIRTS 180

Query: 190 HRPDAALLGVLEGRPWARCQLHSTQL-ELLDMH----DTPSLADGEDLHELDQLPIPVSF 244
A + + ++ + E LD+ + + E L L+QLP+ + F
Sbjct: 181 R----AEVYCYAKKLGHFNRVEGGIIVETLDIQHIEEENNTTETAETLPGLNQLPVKLEF 236

Query: 245 EVGRRTLDLHTLSTLQPGSLLDLDSALDGEVRILANQRCLGIGELVRLQDRLGVRVT 301
+ R+ + L L + LL L + + V I+AN LG GELV++ D LGV +
Sbjct: 237 VLYRKNVTLAELEAMGQQQLLSLPTNAELNVEIMANGVLLGNGELVQMNDTLGVEIH 293


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03452  TYPE3IMPPROT  246    3e-85    Type III secretion system inner membrane P protein ...
		>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature.

Length = 224

Score = 246 bits (629), Expect = 3e-85
Identities = 92/217 (42%), Positives = 142/217 (65%), Gaps = 7/217 (3%)

Query: 6 DELGLILGLALLALVPFIAVMATSFIKMTVVFSLLRNALGVQQIPPNMAMYGLAIILSLY 65
+++ LI LA L+PFI T F+K ++VF ++RNALG+QQIP NM + G+A++LS++
Sbjct: 3 NDISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLSMF 62

Query: 66 VMAPVGFATRDYLRNHDVSLSDSASVERFLDEGMAPYRNFLKRQIQEREHTFFMESTRQV 125
VM P+ Y + DV+ +D +S+ + +DEG+ YR++L + FF + +
Sbjct: 63 VMWPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQLKR 122

Query: 126 WPSEYAERLDPD-------SLLILLPAFTVSELTRAFEIGFLIYLPFIAIDLIISNILLA 178
E E + D S+ LLPA+ +SE+ AF+IGF +YLPF+ +DL++S++LLA
Sbjct: 123 QYGEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVLLA 182

Query: 179 MGMMMVSPMTISLPFKLLLFVLLDGWARLTHGLVISY 215
+GMMM+SP+TIS P KL+LFV LDGW L+ GL++ Y
Sbjct: 183 LGMMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQY 219


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03453  TYPE3IMQPROT  68     4e-19    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 68.3 bits (167), Expect = 4e-19
Identities = 35/78 (44%), Positives = 48/78 (61%)

Query: 5 DILHFTNQTLWLVLVLSLPPVLVAALIGTLVSLVQALTQIQEQTLGFVAKLVAVVVVLFA 64
D++ N+ L+LVL+LS P +VA +IG LV L Q +TQ+QEQTL F KL+ V + LF
Sbjct: 3 DLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFL 62

Query: 65 TSGWLGGELYRFAEMTLL 82
SGW G L + +
Sbjct: 63 LSGWYGEVLLSYGRQVIF 80


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03454  TYPE3IMRPROT  142    1e-43    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 142 bits (360), Expect = 1e-43
Identities = 47/245 (19%), Positives = 100/245 (40%), Gaps = 4/245 (1%)

Query: 9 LLLTYSLLLPRIISCFVVLPVLAKQTLGGGLVRNGVACSLALFAYPIVAGSLPPALDALD 68
L Y L R+++ P+L+++++ V+ G+A + P + + P
Sbjct: 12 WLNLYFWPLLRVLALISTAPILSERSVPK-RVKLGLAMMITFAIAPSLPANDVPVFSFFA 70

Query: 69 IALLIGKEVLLGLLIGFVATIPFWAMEATGFIIDNQRGAALASTFNPSLGSQTSPTGLLL 128
+ L + +++L+G+ +GF F A+ G II Q G + A+ +P+ ++
Sbjct: 71 LWLAV-QQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLARIM 129

Query: 129 TQTLITLFFSGGAFLALVGSLFRSYASWPVSSFFPQLGSQWVAFFYAQFSQMLMLCALFA 188
+ LF + L L+ L ++ + P+ S S + + + A
Sbjct: 130 DMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPL--NSNAFLALTKAGSLIFLNGLMLA 187

Query: 189 APLLIAMFLAEFGLALVSRFAPSLNVFILAMPIKSLVASLLLVLYLGILMEHAYDALLLA 248
PL+ + L L++R AP L++F++ P+ V L+ + ++
Sbjct: 188 LPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFCEHLFSEI 247

Query: 249 VDPLR 253
+ L
Sbjct: 248 FNLLA 252


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03455  TYPE3IMSPROT  422    e-150    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 422 bits (1086), Expect = e-150
Identities = 231/349 (66%), Positives = 294/349 (84%)

Query: 1 MSAEKTEQPTAKKLRDARRQGQVVKSKEIVSSALILSLVALLMGFSDYYLEHLGKLLLLP 60
MS EKTEQPT KK+RDAR++GQV KSKE+VS+ALI++L A+LMG SDYY EH KL+L+P
Sbjct: 1 MSGEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIP 60

Query: 61 AEYIDLPFRQALETILENLLQELLYLLAPVLLVAALVVVLSHVGQYGFLLSLDSVKPDLK 120
AE LPF QAL +++N+L E YL P+L VAAL+ + SHV QYGFL+S +++KPD+K
Sbjct: 61 AEQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIK 120

Query: 121 KINPVEGAKKIFSIRSLVEFLKSTLKVALLSLLVWLTLQGNLASLLRIPACGLDCVAPVS 180
KINP+EGAK+IFSI+SLVEFLKS LKV LLS+L+W+ ++GNL +LL++P CG++C+ P+
Sbjct: 121 KINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLL 180

Query: 181 GLMLRQLMLVCAVGFLAIAVADYAFERHQHYKQLRMSKDEVKREYKEMEGSPEIKSKRRQ 240
G +LRQLM++C VGF+ I++ADYAFE +Q+ K+L+MSKDE+KREYKEMEGSPEIKSKRRQ
Sbjct: 181 GQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQ 240

Query: 241 FHQELQSSNLRADVRRSSVIVANPTHVAIGIRYRRGETPLPLVTLKHTDALALRVRRIAE 300
FHQE+QS N+R +V+RSSV+VANPTH+AIGI Y+RGETPLPLVT K+TDA VR+IAE
Sbjct: 241 FHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAE 300

Query: 301 EEGIPVLQRVPLARALLRDGNVDQYIPADLIQATAEVLRWLESQQTDTP 349
EEG+P+LQR+PLARAL D VD YIPA+ I+ATAEVLRWLE Q +
Sbjct: 301 EEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNIEKQ 349


111  PAKAF_03701  PAKAF_03730  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_03701  1       12       -0.945610  MotD
PAKAF_03702  1       12       -1.092808  MotC
PAKAF_03703  1       11       -1.325721  probable methyltransferase
PAKAF_03704  -1      10       -1.028018  probable two-component sensor
PAKAF_03705  0       9        -1.563314  chemotaxis protein CheZ
PAKAF_03706  0       13       -0.391834  two-component response regulator CheY
PAKAF_03707  0       15       0.290361   sigma factor FliA
PAKAF_03708  0       15       0.536157   flagellar synthesis regulator FleN
PAKAF_03709  2       17       0.709487   flagellar biosynthesis protein FlhF
PAKAF_03710  2       17       0.268645   flagellar biosynthesis protein FlhA
PAKAF_03711  3       20       0.340276   YgcG family protein
PAKAF_03712  4       19       0.195674   Beta-propeller domains of methanol dehydrogenase
PAKAF_03713  6       19       -0.542127  flagellar biosynthetic protein FlhB
PAKAF_03714  5       19       -1.250198  flagellar biosynthetic protein FliR
PAKAF_03715  3       17       -1.720443  flagellar biosynthetic protein FliQ
PAKAF_03716  0       12       -0.451094  flagellar biosynthetic protein FliP
PAKAF_03717  1       9        -0.128731  flagellar protein FliO
PAKAF_03718  0       8        -0.801229  flagellar motor switch protein FliN
PAKAF_03719  0       11       0.199043   flagellar motor switch protein FliM
PAKAF_03720  0       11       0.599477   flagellar basal body-associated protein FliL
PAKAF_03721  0       12       0.927239   flagellar hook-length control protein FliK
PAKAF_03722  1       14       1.019663   YecA family protein
PAKAF_03723  1       14       1.338592   conserved hypothetical protein
PAKAF_03724  0       12       1.484622   probable two-component sensor
PAKAF_03725  -1      10       0.446187   probable two-component response regulator
PAKAF_03726  -2      9        0.125238   probable Resistance-Nodulation-Cell Division
PAKAF_03727  -1      13       -1.195929  probable Resistance-Nodulation-Cell Division
PAKAF_03728  -2      9        -0.520056  hypothetical protein
PAKAF_03729  -2      10       -0.766891  EAL domain-containing protein
PAKAF_03730  -2      16       -1.726856  autoinducer synthesis protein LasI
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
PAKAF_03701  OMPADOMAIN  69     1e-15    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 69.2 bits (169), Expect = 1e-15
Identities = 34/125 (27%), Positives = 52/125 (41%), Gaps = 16/125 (12%)

Query: 128 EITLNSSLLFPSGDALPNDAAFDIVEKVAKILAPYKNP---IHVEGFTDDVPIHSPRYPT 184
TL S +LF A ++++ L+ + V G+TD I S Y
Sbjct: 214 HFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDR--IGSDAY-- 269

Query: 185 NWELSAARAASIVRLLGNDGVEPSRMAAVGYGEFQPVADNASAEGR---------AKNRR 235
N LS RA S+V L + G+ +++A G GE PV N + A +RR
Sbjct: 270 NQGLSERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRR 329

Query: 236 VVLVI 240
V + +
Sbjct: 330 VEIEV 334


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
PAKAF_03703  HTHFIS  59     8e-12    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 59.5 bits (144), Expect = 8e-12
Identities = 35/142 (24%), Positives = 55/142 (38%), Gaps = 6/142 (4%)

Query: 2 AVKVLVVDDSGFFRRRVSEILSADGQIQVVGTATNGREAIEQALALRPDVITMDYEMPLM 61
+LV DD R +++ LS G +N A D++ D MP
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVR--ITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 62 DGITAVRNIMQRCP-TPVLMFSSLTHEGARVTLDALDAGAVDYLPKNFEDISRNPDKVRQ 120
+ + I + P PVL+ S+ + A + GA DYLPK F D++ + +
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAI--KASEKGAYDYLPKPF-DLTELIGIIGR 117

Query: 121 LLCEKVLTIARSNRRSISLPPL 142
L E ++ S PL
Sbjct: 118 ALAEPKRRPSKLEDDSQDGMPL 139


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_03704  PF06580  42     7e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 41.8 bits (98), Expect = 7e-06
Identities = 13/69 (18%), Positives = 30/69 (43%), Gaps = 10/69 (14%)

Query: 462 ETDLDKNLVEALADPLV--HLVRNAVDHGIESPEEREAAGKPRVGQVVLSAEQEGDHILL 519
E ++ +++ P++ LV N + HGI P+ G+++L ++ + L
Sbjct: 243 ENQINPAIMDVQVPPMLVQTLVENGIKHGIA--------QLPQGGKILLKGTKDNGTVTL 294

Query: 520 MITDDGKGM 528
+ + G
Sbjct: 295 EVENTGSLA 303


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
PAKAF_03706  HTHFIS  90     2e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.9 bits (223), Expect = 2e-24
Identities = 32/120 (26%), Positives = 55/120 (45%), Gaps = 3/120 (2%)

Query: 2 KILIVDDFSTMRRIIKNLLRDLGFTNTAEADDGTTALPMLHSGNFDFLVTDWNMPGMTGI 61
IL+ DD + +R ++ L G+ + T + +G+ D +VTD MP
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 DLLRAVRADERLKHLPVLMVTAEAKRDQIIEAAQAGVNGYVVKPFTAQVLKEKIEKIFER 121
DLL ++ + LPVL+++A+ I+A++ G Y+ KPF L I +
Sbjct: 64 DLLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_03711  cloacin  30     0.021    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 30.1 bits (67), Expect = 0.021
Identities = 15/48 (31%), Positives = 20/48 (41%)

Query: 398 SAGGSGGGRRRGGDYASSSGSSSSSSSSSSSDSFSGGGGSSGGGGASG 445
+ G GGG G ++S + S S G G+ GG G SG
Sbjct: 25 TGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSG 72


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_03712  cloacin  36     2e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 36.2 bits (83), Expect = 2e-04
Identities = 22/55 (40%), Positives = 24/55 (43%), Gaps = 9/55 (16%)

Query: 373 GQVRLSGGGGGSSGSS--------GGGSSSSSSSSSGGFSGGGGSSG-GGGASGS 418
G L GGG S GS GGGS S G G GG +G GG SG+
Sbjct: 23 GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGT 77



Score = 35.8 bits (82), Expect = 3e-04
Identities = 15/37 (40%), Positives = 19/37 (51%)

Query: 380 GGGGSSGSSGGGSSSSSSSSSGGFSGGGGSSGGGGAS 416
GGG SG GG S + G SGGG +GG ++
Sbjct: 47 GGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSA 83



Score = 33.5 bits (76), Expect = 0.001
Identities = 15/40 (37%), Positives = 19/40 (47%)

Query: 379 GGGGGSSGSSGGGSSSSSSSSSGGFSGGGGSSGGGGASGS 418
GGG GS GGGS + +G GG G+ G A +
Sbjct: 47 GGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 32.4 bits (73), Expect = 0.004
Identities = 16/40 (40%), Positives = 16/40 (40%)

Query: 380 GGGGSSGSSGGGSSSSSSSSSGGFSGGGGSSGGGGASGSW 419
GG G GG S S SS GGG SG GS
Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61



Score = 30.1 bits (67), Expect = 0.017
Identities = 12/36 (33%), Positives = 17/36 (47%)

Query: 382 GGSSGSSGGGSSSSSSSSSGGFSGGGGSSGGGGASG 417
GG G+ S+S + +GG +G G G SG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSG 38



Score = 30.1 bits (67), Expect = 0.021
Identities = 13/40 (32%), Positives = 18/40 (45%)

Query: 378 SGGGGGSSGSSGGGSSSSSSSSSGGFSGGGGSSGGGGASG 417
GG G S G SG G + S+ ++ F S+ G G
Sbjct: 65 GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 29.3 bits (65), Expect = 0.029
Identities = 13/38 (34%), Positives = 17/38 (44%), Gaps = 3/38 (7%)

Query: 385 SGSSGGGSSSSSSSSSG---GFSGGGGSSGGGGASGSW 419
SG G G ++ + S+SG G G G GG W
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGW 39


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03713  TYPE3IMSPROT  336    e-116    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 336 bits (864), Expect = e-116
Identities = 98/345 (28%), Positives = 183/345 (53%), Gaps = 2/345 (0%)

Query: 9 DKTEEPTEKRRREAREKGQLPRSRELNTLAILMAGAGGLLIYGADLAGALLRLMRSNFEL 68
+KTE+PT K+ R+AR+KGQ+ +S+E+ + A+++A + L+ +LM E
Sbjct: 4 EKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAEQ 63

Query: 69 SRETAMNTESMLQLLGASAYLAAQGLWPILLMLLVAAIVGPIALGGWLFSMDALQPKFSR 128
S ++++ ++ +P+L + + AI + G+L S +A++P +
Sbjct: 64 SYLPF--SQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKK 121

Query: 129 LNPLSGLKRMFSAKSLLELSKALIKFLVVLAVALLVLSADRDALLALAHQPLEQAILHSV 188
+NP+ G KR+FS KSL+E K+++K +++ + +++ + LL L +E
Sbjct: 122 INPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLG 181

Query: 189 RVVGWSAFWMACSLLLIAAVDVPYQIWDNRQKLLMTKQEVRDEYKDSEGKPEVKSKIRQM 248
+++ ++I+ D ++ + ++L M+K E++ EYK+ EG PE+KSK RQ
Sbjct: 182 QILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQF 241

Query: 249 QREMAQRRMMAAVPEADVVITNPTHFAVALKYDPAGGGAPLLLAKGNDFLALKIREVAQE 308
+E+ R M V + VV+ NPTH A+ + Y PL+ K D +R++A+E
Sbjct: 242 HQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEE 301

Query: 309 HKVMVMESPALARAVYYSTELDQEIPAGLYLAVAQVLAYVYQLKQ 353
V +++ LARA+Y+ +D IPA A A+VL ++ +
Sbjct: 302 EGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNI 346


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03714  TYPE3IMRPROT  135    7e-41    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 135 bits (341), Expect = 7e-41
Identities = 96/232 (41%), Positives = 143/232 (61%), Gaps = 2/232 (0%)

Query: 1 MLELTNAQIGGWIASFVLPLFRVAALLMTMPVIGTQLVPVRVRLYLALGVCVVLVPNLPP 60
ML++T+ Q W+ + PL RV AL+ T P++ + VP RV+L LA+ + + P+LP
Sbjct: 1 MLQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60

Query: 61 MPQVDALSMKAMLLIGEQILVGALLGFSLQLLFHAFVIAGQIISMQMGLGFASMVDPANG 120
S A+ L +QIL+G LGF++Q F A AG+II +QMGL FA+ VDPA+
Sbjct: 61 NDVPV-FSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASH 119

Query: 121 VSVPVLGQFFTMLVTLLFLAMNGHLVVFEVIAESFVTLPVGEGLSGNHFWI-IAGKLGWV 179
+++PVL + ML LLFL NGHL + ++ ++F TLP+G ++ ++ + +
Sbjct: 120 LNMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLI 179

Query: 180 MGAALLLALPAITALLVVNLAFGAMTRAAPQLNIFSIGFPLTLVLGLVILWI 231
L+LALP IT LL +NLA G + R APQL+IF IGFPLTL +G+ ++
Sbjct: 180 FLNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAA 231


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03715  TYPE3IMQPROT  55     9e-14    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 54.8 bits (132), Expect = 9e-14
Identities = 24/75 (32%), Positives = 43/75 (57%)

Query: 7 LDLFREALWLTAMIVGVLVVPSLLVGLVVAMFQAATQINEQTLSFLPRLMVILLTLIVLG 66
+ +AL+L ++ G + + ++GL+V +FQ TQ+ EQTL F +L+ + L L +L
Sbjct: 5 VFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLLS 64

Query: 67 PWLLRQLMEYTQTLI 81
W L+ Y + +I
Sbjct: 65 GWYGEVLLSYGRQVI 79


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03716  FLGBIOSNFLIP  264    2e-91    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 264 bits (676), Expect = 2e-91
Identities = 140/242 (57%), Positives = 176/242 (72%), Gaps = 3/242 (1%)

Query: 11 LAALCLLLLAPWPALAADPTSISAITVTTNGQGQQEYSVSLQILLIMTALSFIPAFVMLM 70
L+ +LL P A + IT G Q +S+ +Q L+ +T+L+FIPA +++M
Sbjct: 5 LSVAPVLLWLITPLAFA---QLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLMM 61

Query: 71 TSFTRIIIVFSILRQALGLQSTPSNQVLVGLALFLTMFVMAPVFDKINSQALQPYLNEQI 130
TSFTRIIIVF +LR ALG S P NQVL+GLALFLT F+M+PV DKI A QP+ E+I
Sbjct: 62 TSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEKI 121

Query: 131 PAQEALQKAEVPLKAFMLAQTRTSDLELFVRLSKRTDIGSPEATPLTILVPAFVTSELKT 190
QEAL+K PL+ FML QTR +DL LF RL+ + PEA P+ IL+PA+VTSELKT
Sbjct: 122 SMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELKT 181

Query: 191 AFQIGFMIFIPFLIIDLVVSSVLMAMGMMMLSPLIISLPFKIMLFVLVDGWALIIGTLAG 250
AFQIGF IFIPFLIIDLV++SVLMA+GMMM+ P I+LPFK+MLFVLVDGW L++G+LA
Sbjct: 182 AFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLAQ 241

Query: 251 SF 252
SF
Sbjct: 242 SF 243


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03718  FLGMOTORFLIN  119    1e-37    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 119 bits (300), Expect = 1e-37
Identities = 49/83 (59%), Positives = 73/83 (87%)

Query: 74 NLDVILDIPVTISMEVGHTDISIRNLLQLNQGSVIELDRLAGEPLDVLVNGTLIAHGEVV 133
++D+I+DIPV +++E+G T ++I+ LL+L QGSV+ LD LAGEPLD+L+NG LIA GEVV
Sbjct: 53 DIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVV 112

Query: 134 VVNEKFGIRLTDVISPSERIKKL 156
VV +K+G+R+TD+I+PSER+++L
Sbjct: 113 VVADKYGVRITDIITPSERMRRL 135


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03719  FLGMOTORFLIM  259    2e-87    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 259 bits (664), Expect = 2e-87
Identities = 98/326 (30%), Positives = 167/326 (51%), Gaps = 13/326 (3%)

Query: 5 DLLSQDEIDALLHGVDDGLVETEVEATPG-----SVKSYDLTSQDRIVRGRMPTLEMINE 59
++LSQDEID LL + G + +E + YD D+ + +M TL +++E
Sbjct: 3 EVLSQDEIDQLLTAISSG--DASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHE 60

Query: 60 RFARYTRISMFNLLRRSADVAVGGVQVMKFGEYVHSLYVPTSLNLVKMKPLRGTALFILD 119
FAR T S+ LR V V V + + E++ S+ P++L ++ M PL+G A+ +D
Sbjct: 61 TFARLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEVD 120

Query: 120 AKLVFKLVDNFFGGDGRHAKIEGREFTPTELRVVRMVLEQAFVDLKEAWQAVLEMNFEYV 179
+ F ++D FGG G+ AK++ R+ T E V+ V+ + +++E+W V+++
Sbjct: 121 PSITFSIIDRLFGGTGQAAKVQ-RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRLG 179

Query: 180 NSEVNPAMANIVSPSEVVVVSTFHIELDGGGGDLHITMPYSMIEPIREMLDAGF--QSDH 237
E NP A IV PSE+VV+ T ++ G ++ +PY IEPI L + F S
Sbjct: 180 QIETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSSVR 239

Query: 238 DDQDERWIKALREDVLDVQVPLGATVVRRQLKLRDILHMQPGDVIPVE---MPEHMVMRA 294
+++ LR+ + V + + A V +L +RDIL ++ GD+I + + + V+
Sbjct: 240 RSSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVLSI 299

Query: 295 NGVPAFKVKLGAHKGNLALQILEAVE 320
F + G +A QILE +E
Sbjct: 300 GNRKKFLCQPGVVGKKIAAQILERIE 325


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
PAKAF_03721  FLGHOOKFLIK  52     2e-09    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 52.1 bits (124), Expect = 2e-09
Identities = 73/300 (24%), Positives = 114/300 (38%), Gaps = 14/300 (4%)

Query: 128 DENTQATLLPPAVPTASSAPASLTEASSDPTLVKLNGVPAVNMALEQGAQDAAQTAKGGP 187
DE + +T L A A +A A + V A AL T K
Sbjct: 90 DEQSTSTPLTTAQTMALAAVADKNTTKDEKADDLNEDVTASLSALFAMLPGFDNTPKVTD 149

Query: 188 AKSADPRQANLGDTLAGLTSDSLTKAVDGKALEAQLQQTAEPAVASAASESLLESKAEPR 247
A S LTS+ LT A A Q P VA A S++ + S P
Sbjct: 150 APST-VLPTEKPTLFTKLTSEQLTTAQPDDAPGTPAQPLT-PLVAEAQSKAEVISTPSPV 207

Query: 248 GEPFAAKLNGLTQAMAQQALTNRPVNGTVPGQPVAMQQNGWSEAVVDRVMWMSSQNLKSA 307
AA +T Q T P + + W +++ + + Q +SA
Sbjct: 208 T---AAASPLITPHQTQPLPT-----VAAPVLSAPLGSHEWQQSLSQHISLFTRQGQQSA 259

Query: 308 EIQLDPAELGRLDVRIHMTADQTQVTFASPNAGVRDALESQMHRLRDMFSQQGMNQLDVN 367
E++L P +LG + + + + +Q Q+ SP+ VR ALE+ + LR ++ G+ N
Sbjct: 260 ELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAALPVLRTQLAESGIQLGQSN 319

Query: 368 VSDQSLARGWQGQQQGEGGSARGRGLAGEASGDEETLAGVSEIRSRPGASAARGLVDYYA 427
+S +S + G Q + S R A D++TL ++ G VD +A
Sbjct: 320 ISGESFS-GQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVSLQ---GRVTGNSGVDIFA 375


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_03724  PF06580  29     0.031    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.4 bits (66), Expect = 0.031
Identities = 17/93 (18%), Positives = 30/93 (32%), Gaps = 5/93 (5%)

Query: 351 EETSLAGEIATTVDFLEVI----FDEAGVGIEVRGEAR-ALVERALFQRAVTNLLYNAAQ 405
+ SLA E+ +L++ D ++ V L Q V N + +
Sbjct: 214 RQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGIA 273

Query: 406 HTAAGGTLRVGVERRGDEVRVAVSNPGVPIADE 438
GG + + + V + V N G
Sbjct: 274 QLPQGGKILLKGTKDNGTVTLEVENTGSLALKN 306


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
PAKAF_03725  HTHFIS  86     1e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 86.0 bits (213), Expect = 1e-21
Identities = 33/129 (25%), Positives = 62/129 (48%), Gaps = 2/129 (1%)

Query: 2 RVLIVEDEAKTADYLNRGLSEQGFTVDLADNGIDGRHLALHGEYDVIVLDVMLPGVDGYG 61
+L+ +D+A LN+ LS G+ V + N G+ D++V DV++P + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VLRALRERR-QTPVIMLTARERVEDRVRGLREGADDYLIKPFSFLELVARL-QALTRRGG 119
+L +++ R PV++++A+ ++ +GA DYL KPF EL+ + +AL
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 120 NHESHSQMR 128

Sbjct: 125 RPSKLEDDS 133


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03726  ACRIFLAVINRP  708    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 708 bits (1829), Expect = 0.0
Identities = 272/984 (27%), Positives = 460/984 (46%), Gaps = 36/984 (3%)

Query: 1 MASSVATPLEVQFSAIPGITEMTSSSA-LGTTTLTLQFSLDKSIDVAAQEVQAAINAAAG 59
+ +V +E + I + M+S+S G+ T+TL F D+A +VQ + A
Sbjct: 57 VQDTVTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATP 116

Query: 60 RLPVDMPNLPTWRKVNPADSPIMILRVNSE--MMPLIELSDYAETILARQLSQVNGVGQI 117
LP ++ + S +M+ S+ ++SDY + + LS++NGVG +
Sbjct: 117 LLPQEVQQ-QGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDV 175

Query: 118 FVVGQQRPAIRIQAQPEKLAAYQLTLADLRQSLQSASVNLAKGALYGEGRVS------TL 171
+ G Q A+RI + L Y+LT D+ L+ + +A G L G + ++
Sbjct: 176 QLFGAQY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASI 234

Query: 172 AANDQLFNASDYDDLVV-AYRQGAPVFLKDVARIVSAPEDDYVQAWPNGVPGVALVILRQ 230
A + N ++ + + G+ V LKDVAR+ E+ V A NG P L I
Sbjct: 235 IAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLA 294

Query: 231 PGANIVDTADAIQAALPRLREMLPATIEVDVLNDRTRTIRSSLHEVELTLLLTIGLVVLV 290
GAN +DTA AI+A L L+ P ++V D T ++ S+HEV TL I LV LV
Sbjct: 295 TGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLV 354

Query: 291 MGLFLRQLSATLIVATVLAVSLSASFAAMYVLGFTLNNLTLVALIIAVGFIVDDAIVVVE 350
M LFL+ + ATLI + V L +FA + G+++N LT+ +++A+G +VDDAIVVVE
Sbjct: 355 MYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVE 414

Query: 351 NIHRHL-EAGASKVEAALKGAAEIGFTVISISFSLIAAFIPLLFMGGIVGRLFREFAVSV 409
N+ R + E EA K ++I ++ I+ L A FIP+ F GG G ++R+F++++
Sbjct: 415 NVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITI 474

Query: 410 TVAILISVVASLTLAPMLASRFM-PALRHAEAPRKGFAEW-------LTGGYERGLRWAL 461
A+ +SV+ +L L P L + + P + GF W Y + L
Sbjct: 475 VSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKIL 534

Query: 462 GHQRLMLVGFAFTVLVAVAGYVGIPKGFFPLQDTAFVFGTSQAAEDISYDDMVAKHRQLA 521
G L+ +A V V ++ +P F P +D Q + + Q+
Sbjct: 535 GSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVT 594

Query: 522 EIIASDPA--VQSYNHAVGVTGGSQSLANGRFWIVLKDRGERDV---SVGEFIDRLRPQL 576
+ + V+S G + Q+ G ++ LK ER+ S I R + +L
Sbjct: 595 DYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMEL 654

Query: 577 AKVPGIMLYLRAAQDINLSSGPSRTQYQYAL---RSSDSTQLALWAQRLTERLKQVPG-L 632
K+ + I + T + + L L +L Q P L
Sbjct: 655 GKIRDGFVIPFNMPAI--VELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASL 712

Query: 633 MDVSNDLQVGASVTALDIDRVAAARFGLSAEDVSQTLYDAFGQRQVGEYQTEVNQYKVVL 692
+ V + + L++D+ A G+S D++QT+ A G V ++ K+ +
Sbjct: 713 VSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYV 772

Query: 693 ELDARQRGRAESLDWFYLRSPLSGEMVPLSAIAKVAAPRSGPLQINHNGMFPAVNLSFNL 752
+ DA+ R E +D Y+RS +GEMVP SA G ++ P++ +
Sbjct: 773 QADAKFRMLPEDVDKLYVRSA-NGEMVPFSAFTTSH-WVYGSPRLERYNGLPSMEIQGEA 830

Query: 753 AAGVSLGEAVQAVQRAQEEIGMPSTIIGVFQGAAQAFQSSLASQPLLILAALIAVYIILG 812
A G S G+A+ ++ + +P+ I + G + + S P L+ + + V++ L
Sbjct: 831 APGTSSGDAMALMENLASK--LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLA 888

Query: 813 VLYESFVHPLTILSTLPSAGIGAVFLLWAWGQDFSIMALIGIVLLIGIVKKNGILMVDFA 872
LYES+ P++++ +P +G + + Q + ++G++ IG+ KN IL+V+FA
Sbjct: 889 ALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFA 948

Query: 873 IVAQREQGMSAEQAIYQACLTRFRPIMMTTLAALLGAIPLMIGFGTGSELRQPLGIAVVG 932
++G +A A R RPI+MT+LA +LG +PL I G GS + +GI V+G
Sbjct: 949 KDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMG 1008

Query: 933 GLLVSQVLTLFSTPVVYLALERLF 956
G++ + +L +F PV ++ + R F
Sbjct: 1009 GMVSATLLAIFFVPVFFVVIRRCF 1032


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
PAKAF_03727  RTXTOXIND  51     6e-09    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 50.6 bits (121), Expect = 6e-09
Identities = 34/205 (16%), Positives = 66/205 (32%), Gaps = 60/205 (29%)

Query: 10 RVLVGVLAAGLVAFGGWAWLGGDAGAKAAPAPARVPVNVARVERRDVEQQVSGIGTVTSL 69
R++ + LV + LG VE + G +T
Sbjct: 58 RLVAYFIMGFLVIAFILSVLG------------------------QVEIVATANGKLTHS 93

Query: 70 HNV-VIRTQIDGQLTRLLVSEGQMVEAGELLATIDD-------RAVVAALEQAQASRASN 121
I+ + + ++V EG+ V G++L + ++L QA+ +
Sbjct: 94 GRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRY 153

Query: 122 QAQLKS--------------------AEQDLQRYRSLYAER--------AVSRQLLDQQQ 153
Q +S +E+++ R SL E+ LD+++
Sbjct: 154 QILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKR 213

Query: 154 ATVDQLLATLKANDATINAERVRLS 178
A +LA + + E+ RL
Sbjct: 214 AERLTVLARINRYENLSRVEKSRLD 238



Score = 39.4 bits (92), Expect = 2e-05
Identities = 40/207 (19%), Positives = 75/207 (36%), Gaps = 16/207 (7%)

Query: 110 ALEQAQASRASNQAQLKSAEQDLQRYRSLYAERAVSRQLLDQQ-QATVDQLLATLKANDA 168
A+ + + +L+ + L++ S QL+ Q + + L N
Sbjct: 253 AVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIG 312

Query: 169 TINAERV----RLSYTRITSPVSGKVGIRNV-DVGNLVRVGDSLGLFSVTQIAPISVVFS 223
+ E R + I +PVS KV V G +V ++L + V + + V
Sbjct: 313 LLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETL-MVIVPEDDTLEVTAL 371

Query: 224 LQQEQLPQLQALLGGEAAVRAY-SRDGGSALGEGRLLTIDNQIDSSTGTI-RVRASFD-- 279
+Q + + + V A+ G +G+ + + +D D G + V S +
Sbjct: 372 VQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEEN 431

Query: 280 -----NRQARLWPGQFVAVSLHTGVRR 301
N+ L G V + TG+R
Sbjct: 432 CLSTGNKNIPLSSGMAVTAEIKTGMRS 458


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03730  AUTOINDCRSYN  153    5e-49    Autoinducer synthesis protein signature.
		>AUTOINDCRSYN#Autoinducer synthesis protein signature.

Length = 216

Score = 153 bits (387), Expect = 5e-49
Identities = 41/177 (23%), Positives = 74/177 (41%), Gaps = 6/177 (3%)

Query: 14 KLLGEMHKLRAQVFKERKGWDVSVIDEMEIDGYDALSPYYMLIQEDTPEAQVFGCWRILD 73
GE+ LR + FK+R W V D ME D YD + Y+ +D V R ++
Sbjct: 15 TKSGELFTLRKETFKDRLNWAVQCTDGMEFDQYDNNNTTYLFGIKDN---TVICSLRFIE 71

Query: 74 TTGPYMLKNTFPELLHGKEAPCSPHIWELSRFAINSGQKGSLGFSDCTLEAMRALARYSL 133
T P M+ TF P + E SRF ++ + + ++ + +M L+ +
Sbjct: 72 TKYPNMITGTFFPYFKEINIPEGNY-LESSRFFVDKSRAKDILGNEYPISSMLFLSMINY 130

Query: 134 QND--IQTLVTVTTVGVEKMMIRAGLDVSRFGPHLKIGIERAVALRIELNAKTQIAL 188
D + T+ + + ++ R+G + L ER + + ++ + Q AL
Sbjct: 131 SKDKGYDGIYTIVSHPMLTILKRSGWGIRVVEQGLSEKEERVYLVFLPVDDENQEAL 187


112  PAKAF_03782  PAKAF_03789  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_03782   0      33       -5.786452  UDP-glucose 4-epimerase
PAKAF_03783   1      27       -4.930723  hypothetical protein
PAKAF_03784  -2      24       -3.177540  probable type II secretion system protein
PAKAF_03785  -3      13       -0.788237  hypothetical protein
PAKAF_03786  -1      13        1.127388  probable transcriptional regulator
PAKAF_03787  -2      13        0.870003  probable short-chain dehydrogenase
PAKAF_03788  -2      14        1.620314  hypothetical protein
PAKAF_03789  -2      10        1.098735  acetyltransferase, GNAT family
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03782  NUCEPIMERASE  181    1e-56    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 181 bits (460), Expect = 1e-56
Identities = 85/353 (24%), Positives = 142/353 (40%), Gaps = 51/353 (14%)

Query: 1 MRVLVTGGAGFIGSHVLVELLGQGAKVVVLDNLVNGSSESLK--RVERITGHPVGFVLGD 58
M+ LVTG AGFIG HV LL G +VV +DNL + SLK R+E + F D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 59 VRDNLLVERLLIGEKVDAVIHLAGLKAVGESVDDPLEYYESNVQGTISLLRAMQRVGVFK 118
+ D + L + V AV S+++P Y +SN+ G +++L + +
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 119 IVFSSSATIYQMPGTLPISESSKVGGVASPYGRTKLTAEHM------LDDLARSDARWSI 172
++++SS+++Y + +P S V S Y TK E M L L
Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGL-------PA 173

Query: 173 AVLRYFNPIGAHESGLIGEDPCGTPNNLLPYIAQVAVGRLSRLTVHGGDYPTI--DGTGV 230
LR+F G P G P+ +A+ + ++ + G + G
Sbjct: 174 TGLRFFTVYG----------PWGRPD--------MALFKFTKAMLEGKS-IDVYNYGKMK 214

Query: 231 RDYIHVCDLAAGHTRALEYLGQGHG---------------YHVWNLGTGTGYSVLQVIEA 275
RD+ ++ D+A R + + Y V+N+G + ++ I+A
Sbjct: 215 RDFTYIDDIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQA 274

Query: 276 FERVSGRRIPFTVSGRRPGDVAECWADVSKAERELGWKAGLGLECMIADAWRW 328
E G + +PGDV E AD +G+ ++ + + W
Sbjct: 275 LEDALGIEAKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNW 327


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03784  BCTERIALGSPD  198    1e-56    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 198 bits (506), Expect = 1e-56
Identities = 128/668 (19%), Positives = 250/668 (37%), Gaps = 109/668 (16%)

Query: 104 SLNVEDVQLAAFINEVFGNILGLPFEIESALKEKTDRVTVRLEQPQTAQMVYEVARQVLV 163
S + + + FIN V L I+ +++ +TVR + Y+ VL
Sbjct: 31 SASFKGTDIQEFINTV-SKNLNKTVIIDPSVRGT---ITVRSYDMLNEEQYYQFFLSVLD 86

Query: 164 NYGVEILHQGDIYRFQIKQVGLSPDEPPILISGEARPSVPIAYRPVFQFVALHSVDPKDV 223
YG +++ + ++ + ++ +A P I V + V L +V +D+
Sbjct: 87 VYGFAVINMNNGVLKVVRSKDAKTA--AVPVASDAAPG--IGDEVVTRVVPLTNVAARDL 142

Query: 224 IPWLN--SAYEKSGLSVMADGARSGLMLKGMSSIVNQATEAVRLLDQPFMRGRHSLRIDP 281
P L + G V + + L++ G ++++ + V +D G S+ P
Sbjct: 143 APLLRQLNDNAGVGSVVHYEPSNV-LLMTGRAAVIKRLLTIVERVDNA---GDRSVVTVP 198

Query: 282 -AFVSAADMASQLKSVIAAQGYSVGIGEAVGSIMLVPLESSNGLIVFANDGLLLDLVREW 340
++ SAAD+ + + S G V +++ E +N ++V ++
Sbjct: 199 LSWASAADVVKLVTELNKDTSKSALPGSMVANVV--ADERTNAVLVSGEPNSRQRIIAM- 255

Query: 341 AQQVDRAPMAVAAGIGEEKEGLFFYEARNTRVTELAKSLRALVSGFAGEGAYGITSGLQS 400
+Q+DR NT+V L + A + +T +
Sbjct: 256 IKQLDRQQATQG----------------NTKVIYLKYAK-------ASDLVEVLTGISST 292

Query: 401 SASKRSGGGRRAGEDGAAPAVAPLLQAAGAAALVGGDSANGLLGGLAAGISGSGTIVEDE 460
S++ A D + I
Sbjct: 293 MQSEKQAAKPVAALDK------------------------------------NIIIKAHG 316

Query: 461 NRNAILFRGAARTWQQMQGLLREMDKPARQVLIEVTVASVSLSDTQELGVEWEMLNGSFN 520
NA++ A ++ ++ ++D QVL+E +A V +D LG++W N
Sbjct: 317 QTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWANKNAGMT 376

Query: 521 SATSTGSK-GSAGKGGFNYVINT--------------------AGGNTAA-IQAMADNQR 558
T++G +A G Y + GN A + A++ + +
Sbjct: 377 QFTNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQGNWAMLLTALSSSTK 436

Query: 559 VRVLATPRILVKSGEQANINVGRDIPIPTAQVNDDSTTAGSTNLRNEIAYRSTGTILNVA 618
+LATP I+ +A NVG+++P+ T S T N+ N + ++ G L V
Sbjct: 437 NDILATPSIVTLDNMEATFNVGQEVPVLTG-----SQTTSGDNIFNTVERKTVGIKLKVK 491

Query: 619 PVVYSDSRVDLTVSQELSDSGGSSGGGGKASGGGISAPEISRTSLETSLTLKSGGSVLMG 678
P + V L + QE+S ++ G + ++ ++ + SG +V++G
Sbjct: 492 PQINEGDSVLLEIEQEVSSVADAASSTSSDLG-----ATFNTRTVNNAVLVGSGETVVVG 546

Query: 679 GLIRDNITDSNAGVPLLKDIPGIGFLFGRQKAVKTREEVIMLIQPYVLESDADAREVTEK 738
GL+ +++D+ VPLL DIP IG LF ++ +++ I+P V+ + R+ +
Sbjct: 547 GLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYRQASSG 606

Query: 739 LRAMLSKT 746
+
Sbjct: 607 QYTAFNDA 614


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03787  DHBDHDRGNASE  110    3e-31    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 110 bits (277), Expect = 3e-31
Identities = 61/185 (32%), Positives = 86/185 (46%), Gaps = 3/185 (1%)

Query: 5 KTLLITGASSGFGQALAREALDAGHRVVGTVRSEEARSALEAVAPGQAFGR---LLDVTD 61
K ITGA+ G G+A+AR G + + E + + +A DV D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 62 LAAIEPTVAAIERDIGPLDVLVNSAGYGHEGILEESPLAEMRRQFEVNLFGAVAMIQAVL 121
AAI+ A IER++GP+D+LVN AG G++ E F VN G ++V
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 122 PYMRRRRRGHILNITSMGGYITMPGIAYYCGSKFALEGLSEALGKEVASLGIAVTAVAPG 181
YM RR G I+ + S + +A Y SK A ++ LG E+A I V+PG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 182 SFRTD 186
S TD
Sbjct: 189 STETD 193


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03789  SACTRNSFRASE  37     9e-06    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 37.2 bits (86), Expect = 9e-06
Identities = 15/51 (29%), Positives = 23/51 (45%), Gaps = 1/51 (1%)

Query: 93 VAVAWQGKGVGSRLLGELLDIADNWMNLRRVELTVYTDNAPALALYRKFGF 143
VA ++ KGVG+ LL + ++ A + + L N A Y K F
Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKE-NHFCGLMLETQDINISACHFYAKHHF 146


113  PAKAF_03795  PAKAF_03799  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_03795   0      20       -3.346327  transcriptional regulator,Probable acrEF/envCD
PAKAF_03796   0      21       -3.065910  putative multidrug efflux gene product,Multidrug
PAKAF_03797   0      20       -3.370413  putative AcrB/AcrD/AcrF family protein,Swarming
PAKAF_03798  -1      17       -2.557173  short chain
PAKAF_03799  -1      15       -1.781582  transcriptional regulator, TetR family,HTH-type
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_03795  HTHTETR  60     2e-13    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 60.0 bits (145), Expect = 2e-13
Identities = 34/196 (17%), Positives = 63/196 (32%), Gaps = 14/196 (7%)

Query: 20 RDQIVAAATEHFRLYGYEKTTVSDLAKAIGFSKAYIYKFFESKQAIGEMICSSCLQQIQT 79
R I+ A F G T++ ++AKA G ++ IY F+ K + I I
Sbjct: 13 RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGE 72

Query: 80 EVNAAIHEV-DSPPEKLRRMLKVLMEACL-----RLFFQDRKLYEIAASAASGRWPATLL 133
+ P LR +L ++E+ + RL + + A
Sbjct: 73 LELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRN 132

Query: 134 YEGFIEQTLREILQQGRQSGDFERKTPLDETTRAIHLIMRPYFNPLLLQYGLETTD---- 189
+ + L+ ++ L AI IMR Y + L+ +
Sbjct: 133 LCLESYDRIEQTLKHCIEAKML--PADLMTRRAAI--IMRGYISGLMENWLFAPQSFDLK 188

Query: 190 EAPALLSSLVLRSLSP 205
+ +++L
Sbjct: 189 KEARDYVAILLEMYLL 204


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
PAKAF_03796  RTXTOXIND  43     2e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 42.5 bits (100), Expect = 2e-06
Identities = 17/112 (15%), Positives = 41/112 (36%), Gaps = 9/112 (8%)

Query: 68 VSGKVLQRLVDTGQTVKRGQVLLRLDPIDLQLAAHAQREAVTAARARAQQASDDEARYRA 127
+ V + +V G++V++G VLL+L + ++ QA ++ RY+
Sbjct: 103 ENSIVKEIIVKEGESVRKGDVLLKLTALGA-------EADTLKTQSSLLQARLEQTRYQI 155

Query: 128 LRGTGAVSASAYDQYKAAADAARAQLSAAEAQAKVAGNATRYAELLADADGI 179
L ++ + + K + +S E + +++
Sbjct: 156 LS--RSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQK 205



Score = 28.6 bits (64), Expect = 0.047
Identities = 27/245 (11%), Positives = 62/245 (25%), Gaps = 33/245 (13%)

Query: 74 QRLVDTGQTVKRGQV-LLRLDPIDLQLAAHAQREAVTAARARAQQASDDEARYRALRGTG 132
RL D + + + + + + V ++ ++ A+ T
Sbjct: 235 SRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQ 294

Query: 133 AVSASAYDQYKAAADAARAQLSAAEAQAKVAGNATRYAELLADADGIVME-TLAEPGQVV 191
D+ + + + + + + A V + + G VV
Sbjct: 295 LFKNEILDKLRQT----TDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVV 350

Query: 192 SAGQAVVRLAHAG-PREALVQLPETLRPAVGSSAEAKLFGRQDVSVATRLRQLSDVADRQ 250
+ + ++ + E + + A + V A
Sbjct: 351 TTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAII----KVE-----------AFPY 395

Query: 251 TRTFEARYVLEGELSDAPLGTTITV-QISDPQTASPSTVQVPIAALYDAGNGPGIWMIRG 309
TR L G++ I + I D + V + I + I + G
Sbjct: 396 TRY----GYLVGKV------KNINLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSG 445

Query: 310 EPAEV 314

Sbjct: 446 MAVTA 450


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03797  ACRIFLAVINRP  429    e-136    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 429 bits (1105), Expect = e-136
Identities = 226/1046 (21%), Positives = 426/1046 (40%), Gaps = 61/1046 (5%)

Query: 8 LSALAVRERSVTLFLVCLISLAGLVAFFKLGRAEDPAFTIKVMTIVTAWPGATAQEMQDQ 67
++ +R L ++ +AG +A +L A+ P +++ +PGA AQ +QD
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 68 VAEKIEKRLQELR--WYDRSETYTRPGLAFTTLSLLDTTPPSEVLEEFYQARKKISDEAK 125
V + IE+ + + Y S + G TL+ T P Q + K+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTS-DSAGSVTITLTFQSGTDPDIAQV---QVQNKLQLATP 116

Query: 126 TLPTGVIGPLVNDEYSDVTFALFAL--KARGEPQRHLVRD--AETLRQRLLHVPGVKKVN 181
LP V ++ E S ++ + A + + D A ++ L + GV V
Sbjct: 117 LLPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQ 176

Query: 182 IIGEQ-AERIFVEFSHERLATLGIGPQDVFAALNGQNALTPAGSVETRGP------DVFL 234
+ G Q A RI+++ + L + P DV L QN AG + + +
Sbjct: 177 LFGAQYAMRIWLDA--DLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASI 234

Query: 235 RLDGAFDELEKIRDTPIIAQ--GRTLKLSDVATVKRGYEDPATFLIRNGGEPALLLGIIM 292
F E+ + G ++L DVA V+ G E+ NG +PA LGI +
Sbjct: 235 IAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARING-KPAAGLGIKL 293

Query: 293 REGWNGLDLGKALNDEVQQINAELPLGMSLSKVTDQAVNISASVDEFMIKFFVALLVVML 352
G N LD KA+ ++ ++ P GM + D + S+ E + F A+++V L
Sbjct: 294 ATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFL 353

Query: 353 VCFVSMG-WRVGVVVAVAVPLTLAMVFVVMAATGKNFDRITLGSLILALGLLVDDAIIAI 411
V ++ + R ++ +AVP+ L F ++AA G + + +T+ ++LA+GLLVDDAI+ +
Sbjct: 354 VMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVV 413

Query: 412 EMMV-VKMEEGYDRVRASAYAWSHTAAPMLSGTLVTAVGFMPNGFARSTAGEYTSNMFWI 470
E + V ME+ A+ + S ++ +V + F+P F + G
Sbjct: 414 ENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSIT 473

Query: 471 VGIALIASWLVAVIFTPYFGVKLLPEIKKVE--------GGHDAIYDTPRYNHFRRILGK 522
+ A+ S LVA+I TP LL + G + +D NH+ +GK
Sbjct: 474 IVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFD-HSVNHYTNSVGK 532

Query: 523 VIAHKWLVAGSVIGLFVTAVLGMALVKKQFFPISDRPEVMVEVQMPYGTSILQTSAAAEK 582
++ + V+ + F P D+ + +Q+P G + +T ++
Sbjct: 533 ILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQ 592

Query: 583 VEAWLAQQAEAKIVTAYIGQGAPRFFMAMSPELPDPSFAKIVV-----RTDNPDEREALK 637
V + + E V + + S + + A + + R + + EA+
Sbjct: 593 VTDYY-LKNEKANVESVFTVNG----FSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVI 647

Query: 638 HRLRKAIAE-----GLASEAQVRVSQLVFGPYSPYPVAYRISGPDPQRLREIASEVRQVM 692
HR + + + + V + + +G L + +++ +
Sbjct: 648 HRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELID--QAGLGHDALTQARNQLLGMA 705

Query: 693 DASPL-MRTVNTDWGTRAPTLHFNLQQDRLQAVGLTSSAVAQQLQFLLSGVPVTAVREDI 751
P + +V + + Q++ QA+G++ S + Q + L G V +
Sbjct: 706 AQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRG 765

Query: 752 RTVQVMARAAGDIRLDPARVMDFTLAGTNGQRIPLSQIGEVEVRMEEPIMRWRDRVPTIT 811
R ++ +A R+ P V + NG+ +P S P + + +P++
Sbjct: 766 RVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSME 825

Query: 812 VRGDIAEGLQPPDVSTAISQQLQPIIDRLPSGYRIEQAGSIEESGKASKAMLPLFPIMLA 871
++G+ A G D + + +LP+G + G + + L I
Sbjct: 826 IQGEAAPGTSSGDAMALMEN----LASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFV 881

Query: 872 VTLIIIILQVRSIAAMIMVFLTSPLGLIGVVPTLILFQQPFGINALVGLIALSGILMRNT 931
V + + S + + V L PLG++GV+ LF Q + +VGL+ G+ +N
Sbjct: 882 VVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNA 941

Query: 932 LILIGQIRQ-NEAAGLDPFRAVVEATVQRARPVILTALAAILAFIPLTHSVFWGT----- 985
++++ + E G A + A R RP+++T+LA IL +PL S G+
Sbjct: 942 ILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNA 1001

Query: 986 LAYTLIGGTFAGTVLTLVFLPAMYAI 1011
+ ++GG + T+L + F+P + +
Sbjct: 1002 VGIGVMGGMVSATLLAIFFVPVFFVV 1027



Score = 74.5 bits (183), Expect = 1e-15
Identities = 57/329 (17%), Positives = 121/329 (36%), Gaps = 24/329 (7%)

Query: 712 LHFNLQQDRLQAVGLT----SSAVAQQLQFLLSGVPVTAVREDIRTVQVMARAAGDIRLD 767
+ L D L LT + + Q + +G + + A + +
Sbjct: 184 MRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK-N 242

Query: 768 PARVMDFTL-AGTNGQRIPLSQIGEVEVRMEE-PIMRWRDRVPTITVRGDIAEGLQPPDV 825
P TL ++G + L + VE+ E ++ + P + +A G D
Sbjct: 243 PEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDT 302

Query: 826 STAISQQLQPIIDRLPSGYRIE----QAGSIEESGKASKAMLPLFPIMLAVTLIIIILQV 881
+ AI +L + P G ++ ++ S L IML ++ + L
Sbjct: 303 AKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTL-FEAIMLVFLVMYLFL-- 359

Query: 882 RSIAAMIMVFLTSPLGLIGVVPTLILFQQPFGINA--LVGLIALSGILMRNTLILIGQI- 938
+++ A ++ + P+ L+G IL + IN + G++ G+L+ + ++++ +
Sbjct: 360 QNMRATLIPTIAVPVVLLGTF--AILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVE 417

Query: 939 RQNEAAGLDPFRAVVEATVQRARPVILTALAAILAFIPL-----THSVFWGTLAYTLIGG 993
R L P A ++ Q ++ A+ FIP+ + + + T++
Sbjct: 418 RVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSA 477

Query: 994 TFAGTVLTLVFLPAMYAIWFGIRPIPHEP 1022
++ L+ PA+ A H
Sbjct: 478 MALSVLVALILTPALCATLLKPVSAEHHE 506


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03798  DHBDHDRGNASE  101    1e-27    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 101 bits (252), Expect = 1e-27
Identities = 54/185 (29%), Positives = 83/185 (44%), Gaps = 8/185 (4%)

Query: 6 VVVITGVSSGIGRTAAEQFAGRGCRVFGSVRNPATAQAIPGV--------ELIHLDIRDE 57
+ ITG + GIG A A +G + NP + + E D+RD
Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69

Query: 58 ASIRQSIEHVIAEAGRIDVLVNNAGTTLLGATEETAIDEAQALFDTNVFGVLRVTQAVLP 117
A+I + + E G ID+LVN AG G + +E +A F N GV +++V
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSK 129

Query: 118 QMRKQSAGRIVNVSSVLGFLPAPYMGLYSASKHAVEGLSETLDHEVRRFGIRVALVEPSF 177
M + +G IV V S +P M Y++SK A ++ L E+ + IR +V P
Sbjct: 130 YMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGS 189

Query: 178 TKTSL 182
T+T +
Sbjct: 190 TETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_03799  HTHTETR  60     3e-13    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 59.6 bits (144), Expect = 3e-13
Identities = 31/174 (17%), Positives = 60/174 (34%), Gaps = 9/174 (5%)

Query: 7 ARRGRPTNEALAQTILDAASELFVELGFQATTLDKVAQRAKISKLSIYRRFENKEALFSA 66
AR+ + + Q ILD A LF + G +T+L ++A+ A +++ +IY F++K LFS
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 67 AIAAGCQQ-SFAPQALLEGVEGSVEDQLMAVGTSLLRTLLRPDVSSIEAMVMADKTSQNA 125
G L + +L + + + ++
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERR--RLLMEIIFHKCEF 119

Query: 126 LSKCHYEAGAA-----HVIADIDALLRQLHAKALLNVP-DPLQAARLFAALFKG 173
+ + A I+ L+ +L +AA + G
Sbjct: 120 VGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISG 173


114  PAKAF_03922  PAKAF_03932  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias   Product
PAKAF_03922  -1      11       1.018458  alkaline proteinase inhibitor AprI
PAKAF_03923  -2      10       0.684438  alkaline metalloproteinase precursor
PAKAF_03924  -3       9       1.777971  Alkaline protease secretion outer membrane
PAKAF_03925  -2       9       2.114708  alkaline protease secretion protein AprE
PAKAF_03926  -1       8       2.059322  alkaline protease secretion protein AprD
PAKAF_03927   0       9       1.840737  hypothetical protein
PAKAF_03929  -1       9       2.180584  hypothetical protein
PAKAF_03930  -1      10       2.623630  probable sensor/response regulator hybrid
PAKAF_03931  -1      10       2.859520  S8/S53 family peptidase
PAKAF_03932   2      12       4.448262  probable transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03922  MPTASEINHBTR  129    5e-42    Metalloprotease inhibitor signature.
		>MPTASEINHBTR#Metalloprotease inhibitor signature.

Length = 122

Score = 129 bits (325), Expect = 5e-42
Identities = 40/118 (33%), Positives = 58/118 (49%), Gaps = 9/118 (7%)

Query: 12 CLLCGFFSTGI-SMASSLILLSASDLAGQWTLQQDEAPAICHLELRDSEVAEASGYDLGG 70
F S G +MASS ++ S + +AGQ ++ +C +E A A L G
Sbjct: 11 VWQVLFVSAGAQAMASSFVVPSTAQMAGQLGIEATG-SGVC---AGPAEQANA----LAG 62

Query: 71 DTACLTRWLPSEPRAWRPTPAGIALLERGGLTLMLLGRQGEGDYRVQKGDGGQLVLRR 128
D AC +WL +P +W PTP GI L+ G + L RQ EG+Y + G + L+R
Sbjct: 63 DVACAEQWLGDKPVSWSPTPDGIWLMNAEGTGITHLNRQKEGEYTGRTPSGADVTLQR 120


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
PAKAF_03923  CABNDNGRPT  418    e-145    NodO calcium binding signature.
		>CABNDNGRPT#NodO calcium binding signature.

Length = 479

Score = 418 bits (1077), Expect = e-145
Identities = 254/480 (52%), Positives = 319/480 (66%), Gaps = 29/480 (6%)

Query: 10 GRSDAYTQVDNFLHAYARGGDELVNGHPSYTVDQAAEQILREQASWQKAPGDSVLTLSYS 69
S AY V +FL + RG VNG SY++DQAA QI RE SW G +V S
Sbjct: 19 NTSSAYNSVYDFLRYHDRGDGLTVNGKTSYSIDQAAAQITRENVSWN---GTNVFGKSA- 74

Query: 70 FLTKPNDFFNTPWKYVSDIYSLGK----FSAFSAQQQAQAKLSLQSWADVTNIHFVDAGQ 125
N +K++ + S+ F F+A+Q QAKLSLQSW+DV N+ F +
Sbjct: 75 ---------NLTFKFLQSVSSIPSGDTGFVKFNAEQIEQAKLSLQSWSDVANLTFTEVTG 125

Query: 126 GDQGDLTFGNFSSSVGG------AAFAFLPDVPDALKGQSWYLINSSYSANVNPANGNYG 179
++TFGN++ G A+A+ P G SWY N S NP + YG
Sbjct: 126 NKSANITFGNYTRDASGNLDYGTQAYAYYPGNYQG-AGSSWYNYNQSN--IRNPGSEEYG 182

Query: 180 RQTLTHEIGHTLGLSHPGDYNAGEGDPTYADATYAEDTRAYSVMSYWEEQNTGQDFKGAY 239
RQT THEIGH LGL+HPG+YNAGEGDP+Y DA YAED+ +S+MSYW E TG D+ G Y
Sbjct: 183 RQTFTHEIGHALGLAHPGEYNAGEGDPSYNDAVYAEDSYQFSIMSYWGENETGADYNGHY 242

Query: 240 SSAPLLDDIAAIQKLYGANLTTRTGDTVYGFNSNTERDFYSATSSSSKLVFSVWDAGGND 299
AP++DDIAAIQ+LYGAN+TTRTGD+VYGFNSNT+RDFY+AT SS L+FSVWDAGG D
Sbjct: 243 GGAPMIDDIAAIQRLYGANMTTRTGDSVYGFNSNTDRDFYTATDSSKALIFSVWDAGGTD 302

Query: 300 TLDFSGFSQNQKINLNEKALSDVGGLKGNVSIAAGVTVENAIGGSGSDLLIGNDVANVLK 359
T DFSG+S NQ+INLNE + SDVGGLKGNVSIA GVT+ENAIGGSG+D+L+GN N+L+
Sbjct: 303 TFDFSGYSNNQRINLNEGSFSDVGGLKGNVSIAHGVTIENAIGGSGNDILVGNSADNILQ 362

Query: 360 GGAGNDILYGGLGADQLWGGAGADTFVYGDIAESSAAAPDTLRDFVSGQDKIDLSGLDAF 419
GGAGND+LYGG GAD L+GGAG DTFVYG +S+ AA D + DF G DKIDLS AF
Sbjct: 363 GGAGNDVLYGGAGADTLYGGAGRDTFVYGSGQDSTVAAYDWIADFQKGIDKIDLS---AF 419

Query: 420 VNGGLVLQYVDAFAGKAGQAILSYDAASKAGSLAIDFSGDAHADFAINLIGQATQADIVV 479
N G + D F GK + +L +DAA+ +L + +G + DF + ++GQA Q+DI+V
Sbjct: 420 RNEGQLSFVQDQFTGKGQEVMLQWDAANSITNLWLHEAGHSSVDFLVRIVGQAAQSDIIV 479


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
PAKAF_03925  RTXTOXIND  438    e-154    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 438 bits (1128), Expect = e-154
Identities = 99/423 (23%), Positives = 181/423 (42%), Gaps = 2/423 (0%)

Query: 11 AYARLGWLLVLFGFGGALLWAAFAPLDQGVAVPATVIISGQRKSVQHPLGGVVKHILVRD 70
RL ++ A + + ++ + SG+ K ++ +VK I+V++
Sbjct: 55 RRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKE 114

Query: 71 GQHVEAGEPLIRMEPTQARANVDSLLNRYANARLNQARLQAEYDGRRTLEMPA-GLAEQA 129
G+ V G+ L+++ A A+ + ARL Q R Q ++P L ++
Sbjct: 115 GESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEP 174

Query: 130 PLPTLGERLEL-QRQLLHSRQTALANELSALRANIEGLRAQLEGLRQTEGNQRLQQRLLN 188
+ E L L+ + + N+ N++ RA+ + R+
Sbjct: 175 YFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEK 234

Query: 189 SQLSGARDLAEEGYMPRNQLLEQERQLAEVNARLSESSGRFGQIRQSIAEAQMRIAQREE 248
S+L L + + ++ +LEQE + E L + QI I A+ +
Sbjct: 235 SRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQ 294

Query: 249 EYRKEVNGQLAETQVNARTLWEELSSARYELRHAEIRAPVSGYVAGLKVFTDGGVIGPGE 308
++ E+ +L +T N L EL+ + + IRAPVS V LKV T+GGV+ E
Sbjct: 295 LFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE 354

Query: 309 LLMYIVPNSDSLEVEGQLAVNLVDRIHSGLPVEMLFTAFNQSKTPRVTGEVTMVSADRLL 368
LM IVP D+LEV + + I+ G + AF ++ + G+V ++ D +
Sbjct: 355 TLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIE 414

Query: 369 DEQNKQPYYALRAQVDAAAMGKLKGLQIRPGMAVQVFVRTGERSLLNYLFKPLFDRAHVA 428
D++ + + + + K + + GMAV ++TG RS+++YL PL + +
Sbjct: 415 DQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISYLLSPLEESVTES 474

Query: 429 LAE 431
L E
Sbjct: 475 LRE 477


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
PAKAF_03930  HTHFIS  82     3e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 81.8 bits (202), Expect = 3e-18
Identities = 33/119 (27%), Positives = 52/119 (43%), Gaps = 5/119 (4%)

Query: 742 THVLLVDDDRMVRYTTALLLGDLGYQVSEAASAEEALGEVERGLAPDLLVTDHLMADKTG 801
+L+ DDD +R L GY V ++A + G DL+VTD +M D+
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAG-DGDLVVTDVVMPDENA 62

Query: 802 VQLAEELRQRFPQLPVLVITGYANL----RPEQLNGFEVLTKPFRHNELAERLARLLEA 856
L +++ P LPVLV++ + + ++ L KPF EL + R L
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
PAKAF_03931  SUBTILISIN  88     3e-21    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 88.4 bits (219), Expect = 3e-21
Identities = 60/293 (20%), Positives = 104/293 (35%), Gaps = 51/293 (17%)

Query: 256 VRIGVIERDVDFDAPDFADYLGPCKAPAPRTCLYARDAERPDNHGSTVAGILAARWDQGG 315
V++ V++ D D PD + + + + HG+ VAG +AA
Sbjct: 43 VKVAVLDTGCDADHPDLKARIIGGRNFTDDDEGDPEIFKDYNGHGTHVAGTIAA----TE 98

Query: 316 NSGFLRGLDRASQGFEVIVERNSDAGITANVAASVN-LVEDGVRVLNWSWGIHRVGARDV 374
N + G+ + + V +G + + +E V +++ S G
Sbjct: 99 NENGVVGVAPEADLLIIKVLNKQGSGQYDWIIQGIYYAIEQKVDIISMSLG------GPE 152

Query: 375 DGDEVDSLVRSGIAMSGYEELLEEFFLWLRKEHPDVLVVNSAGN-GSSYSGTDEYRLPSS 433
D E+ V+ +A +LV+ +AGN G TDE P
Sbjct: 153 DVPELHEAVKKAVA-------------------SQILVMCAAGNEGDGDDRTDELGYPGC 193

Query: 434 FVTEQLLVVGGHQRSERQGLAVDDPAYAVKRSTSNVDMRVDVTAAACTHASTLERDARGE 493
+ +++ VG A++ +A SN + VD+ A ST+ +
Sbjct: 194 Y--NEVISVG----------AINFDRHAS--EFSNSNNEVDLVAPGEDILSTV-PGGKYA 238

Query: 494 VHCGTSYATPMVAGTVAAMLSLNPRLR-----PEEIRMLLRRSAMTIGGDYDF 541
GTS ATP VAG +A + L E+ L + + +G
Sbjct: 239 TFSGTSMATPHVAGALALIKQLANASFERDLTEPELYAQLIKRTIPLGNSPKM 291


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03932  HTHTETR       45     3e-08    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 44.6 bits (105), Expect = 3e-08
Identities = 20/150 (13%), Positives = 45/150 (30%), Gaps = 11/150 (7%)

Query: 1 MEMLSSACGLTKASFYHHYPNKEALLRDVLEWTHQRLAETLFSIAYDPLLTPRERLEKLG 60
+ ++ A G+T+ + Y H+ +K L ++ E + + E P L ++
Sbjct: 34 LGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPGDPLSVLREIL 93

Query: 61 RKAARLFQDDSIGCLMGVVAVDASYGRSELMAPIRSFLDDWAQAFAQLYRPAFDEA--QA 118
+ L+ + + + + ++
Sbjct: 94 IHVLESTVTEERRRLLMEIIF-----HKCEFVGEMAVVQQAQRNLCLESYDRIEQTLKHC 148

Query: 119 LERGRQLVADFEGAILLARIYGEPGYIDGV 148
+E L AD + GYI G+
Sbjct: 149 IEAK-MLPADLMTRRAAIIMR---GYISGL 174


115  PAKAF_03938  PAKAF_03944  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_03938  1       13       3.238908   probable multidrug resistance efflux pump
PAKAF_03939  1       14       2.518785   probable major facilitator superfamily (MFS)
PAKAF_03940  0       17       2.542453   probable transcriptional regulator
PAKAF_03941  0       15       2.111252   thioredoxin family protein
PAKAF_03942  0       14       2.465336   DUF2790 domain-containing protein
PAKAF_03943  1       14       2.654013   FUSC family protein
PAKAF_03944  -1      13       2.404349   HlyD family secretion protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03938  RTXTOXIND     121    1e-32    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 121 bits (305), Expect = 1e-32
Identities = 61/368 (16%), Positives = 110/368 (29%), Gaps = 68/368 (18%)

Query: 66 AVSAQVSGYVAEVLVADDADVQAGDLLLRLDPRDFR-------QRLRAAEAREAAAQAAL 118
+ + V E++V + V+ GD+LL+L L A + Q
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILS 157

Query: 119 EAQ-------------------------------RAKLETLDRQLLEQAQTISRARADGE 147
+ + + T Q ++ + + RA+
Sbjct: 158 RSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERL 217

Query: 148 AARAEWRRAETDWR-------RYRQLADEHATSRQRLENADAAHQRARAAARRANAEEGR 200
A R E R + L + A ++ + + + A R ++ +
Sbjct: 218 TVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQ 277

Query: 201 QRAARDVLKSR--------RREAEAALAQRQAELQEAAAARELARHALDDTEIRAPFAGR 252
+ K + E L Q + + IRAP + +
Sbjct: 278 IESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVK 337

Query: 253 VGQRKVRLR-QYVTPGLPLLAVVPLEQAYVV-ANYKETQLERIRPGQPVELEVDTFGRRW 310
V Q KV VT L+ +VP + V A + + I GQ ++V+ F
Sbjct: 338 VQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTR 397

Query: 311 RGRVDSVAPASGAVFALLPPDNATGNFTKIVQRFPVRIRLDADAAERG----RLLPGMSV 366
G + + D +V F V I ++ + G L GM+V
Sbjct: 398 YGYLV-------GKVKNINLDAIEDQRLGLV--FNVIISIEENCLSTGNKNIPLSSGMAV 448

Query: 367 IATVDTRE 374
A + T
Sbjct: 449 TAEIKTGM 456


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03939  TCRTETB       109    6e-28    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 109 bits (274), Expect = 6e-28
Identities = 79/402 (19%), Positives = 168/402 (41%), Gaps = 17/402 (4%)

Query: 23 FMAGMNVHVTSAALPEIEGALGATFEEGSWISTAYLVAEISMIPLTAWLVEVFSLRRVML 82
F + +N V + +LP+I +W++TA+++ + L + ++R++L
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 83 LGSLVFLLSSLSCALAPN-LSTLILIRVIQGASGAVLIPLSMQLILTELPSSRVPLGMAL 141
G ++ S+ + + S LI+ R IQGA A L M ++ +P L
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 142 FSLSNSVAQAAGPSIGGWLADAYSWRWIFLLQLLPGIALLAAVAWSIRPRDGDRERLRQA 201
++ + GP+IGG +A W ++ L+ ++ I + + + R +
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKL---LKKEVRIK-GHF 199

Query: 202 DWLGIGAMVAGLGALQIVLEEGGRRDWFESGFIRTFAVLAVLALLLFVQRQLWGARPFIN 261
D GI M G+ + F + + +F +++VL+ L+FV+ PF++
Sbjct: 200 DIKGIILMSVGIVFFML----------FTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVD 249

Query: 262 LRLLGSYNFGVSSLAMAVFGAATFGLVFLVPNYLSQLQGFNARQIGDSLILYGLVQLLL- 320
L + F + L + G V +VP + + + +IG +I G + +++
Sbjct: 250 PGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIF 309

Query: 321 APLLPRLMRWLNPKLLVAGGFAIMALGCWMGAHLNADAGRNVIIPSIVVRGIGQPLIMVA 380
+ L+ P ++ G +++ ++ A + + IV G
Sbjct: 310 GYIGGILVDRRGPLYVLNIGVTFLSVS-FLTASFLLETTSWFMTIIIVFVLGGLSFTKTV 368

Query: 381 LSVLAVKGLDKAQAGSASALISMLRNLGGAIGTALLTQLVSL 422
+S + L + +AG+ +L++ L G A++ L+S+
Sbjct: 369 ISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSI 410


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03940  HTHFIS        33     9e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 33.3 bits (76), Expect = 9e-04
Identities = 14/103 (13%), Positives = 31/103 (30%), Gaps = 6/103 (5%)

Query: 87 RHDLPRDCRVVDVPPLLRQLIVAAMRIAPDYPPGGRDERVMELILDELRVLPILALHVPQ 146
R + + R + + + ++ + D L + + +
Sbjct: 376 REIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAE 435

Query: 147 PVDPRLAALCRSLRAEPAADWSLGDAARRLGVSPRTLTRAFQR 189
P + L A A + AA LG++ TL + +
Sbjct: 436 MEYPLI------LAALTATRGNQIKAADLLGLNRNTLRKKIRE 472


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_03944  RTXTOXIND     65     6e-14    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 65.2 bits (159), Expect = 6e-14
Identities = 43/214 (20%), Positives = 76/214 (35%), Gaps = 39/214 (18%)

Query: 79 RSYRLAVRQREAELEQARETLRQRDEQWRRRMQLAGAVSREEVANAGRALRIARARAEAA 138
R Y+ + Q E+E+ A+E + + ++ + LR
Sbjct: 269 RVYKSQLEQIESEILSAKEEYQLVTQLFKNEIL--------------DKLRQTTDNIGLL 314

Query: 139 GAALDQARLDLRRSELRSPVDGYVTQLRVQ-PGDYAAAGRTNIFIV-DRRSFWVTGYFEE 196
L + + S +R+PV V QL+V G T + IV + + VT +
Sbjct: 315 TLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQN 374

Query: 197 TKLRNVQVGAPATIKLMGFD----PLLDGHVASIGRGVADLNESRADSGLPQVSPNFSWI 252
+ + VG A IK+ F L G V +I D+ Q
Sbjct: 375 KDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNIN----------LDAIEDQRLGLV--- 421

Query: 253 RLAQRVPVRIELDRVPS---GVVLAAGMTGSVEV 283
V + IE + + + + L++GM + E+
Sbjct: 422 ---FNVIISIEENCLSTGNKNIPLSSGMAVTAEI 452



Score = 47.5 bits (113), Expect = 3e-08
Identities = 18/114 (15%), Positives = 41/114 (35%), Gaps = 3/114 (2%)

Query: 41 VSAQVIRIAPEVSGSVEAVFVADNQRVARGDPLYRIDPRSYRLAVRQREAELEQARETLR 100
S + I P + V+ + V + + V +GD L ++ + ++ L QAR +
Sbjct: 92 HSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLE-Q 150

Query: 101 QRDEQWRRRMQLAGAVSREEVANAGRALRIARARAEAAGAALDQARLDLRRSEL 154
R + R ++L E + +L + + +++
Sbjct: 151 TRYQILSRSIELNKL--PELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQK 202


116  PAKAF_04010  PAKAF_04018  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_04010  -1      15       2.857679   PcpS
PAKAF_04011  0       15       2.132891   tRNA cyclic N6-threonylcarbamoyladenosine(37)
PAKAF_04012  -1      12       2.123375   NdvB
PAKAF_04013  -2      10       2.192849   succinyl-diaminopimelate desuccinylase
PAKAF_04014  -1      14       0.340086   rRNA methyltransferase
PAKAF_04015  0       15       -1.018905  hypothetical protein
PAKAF_04016  0       13       -1.480706  probable cold-shock protein
PAKAF_04017  0       12       -0.935461  probable two-component sensor
PAKAF_04018  0       14       -4.706819  probable two-component response regulator
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04010  ENTSNTHTASED  89     2e-23    Enterobactin synthetase component D signature.
		>ENTSNTHTASED#Enterobactin synthetase component D signature.

Length = 234

Score = 88.5 bits (219), Expect = 2e-23
Identities = 62/200 (31%), Positives = 93/200 (46%), Gaps = 12/200 (6%)

Query: 15 LDDRWPLPVALPGVQLRSTRFDPALLQPGDFALAGIQPPANILRAVAKRQAEFLAGRLCA 74
L +PLP A G +L FD + + D L + + A KR+AE LAGR+ A
Sbjct: 2 LTSHFPLPFA--GHRLHIVDFDASSFREHD--LLWLPHHDRLRSAGRKRKAEHLAGRIAA 57

Query: 75 RAALFALDGRAQTPAVGEDRAPVWPAAISGSITHGDRWAAALVAARGDWRGLGLDVETLL 134
AL + G P +G+ R P+WP + GSI+H A A+++ + +G+D+E ++
Sbjct: 58 VHALREV-GVRTVPGMGDKRQPLWPDGLFGSISHCATTALAVISR----QRIGIDIEKIM 112

Query: 135 EAERARYLHGEILTEGERLRFADDLERRTGLLVTLAFSLKESLFKALYPLVGKRFYFEHA 194
A L I+ ER L L +TLAFS KES++KA + F A
Sbjct: 113 SQHTATELAPSIIDSDERQILQASL-LPFPLALTLAFSAKESVYKA-FSDRVTLPGFNSA 170

Query: 195 ELLEWRADGQARLRLLTDLS 214
++ A L LL +
Sbjct: 171 KVTSLTA-THISLHLLPAFA 189


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04011  ISCHRISMTASE  30     0.009    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 30.0 bits (67), Expect = 0.009
Identities = 14/51 (27%), Positives = 18/51 (35%), Gaps = 3/51 (5%)

Query: 109 MAEYIVDF--DYLIDCIDSVAAKAALIAWCKRRKIPVITTGGAGGQVDPTQ 157
M Y VD + A L C + IPV+ T G Q +P
Sbjct: 38 MQNYFVDAFTAGASPVTELSANIRKLKNQCVQLGIPVVYTAQPGSQ-NPDD 87


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04012  PF05704       31     0.024    Capsular polysaccharide synthesis protein
		>PF05704#Capsular polysaccharide synthesis protein

Length = 307

Score = 30.6 bits (69), Expect = 0.024
Identities = 7/32 (21%), Positives = 20/32 (62%), Gaps = 1/32 (3%)

Query: 430 YNEPPELLKQTLDALARLDYPDYEVLVIDNNT 461
+ P +++Q + ++ + + D++V++ID N
Sbjct: 79 IEKAPYIVQQCVASV-KKNSGDFKVIIIDGNN 109


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04017  PF06580       42     3e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 41.8 bits (98), Expect = 3e-06
Identities = 21/107 (19%), Positives = 35/107 (32%), Gaps = 25/107 (23%)

Query: 345 LQNLLTNALRHA------DRRVRISYRVSLERCRVDVEDDGPGVPEAQWERLFTPFLRLD 398
+Q L+ N ++H ++ + ++VE+ G + E
Sbjct: 260 VQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE---------- 309

Query: 399 DSRTRASGGHGLGLSIVR-RIVYWHGGRASIGRSETLGGACFTLAWP 444
G GL VR R+ +G A I SE G + P
Sbjct: 310 --------STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04018  HTHFIS        81     3e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 80.6 bits (199), Expect = 3e-19
Identities = 37/148 (25%), Positives = 66/148 (44%), Gaps = 3/148 (2%)

Query: 72 RILIVEDDRRLAELTREYLEGNGLKVDIEANGALAAARILAERPDLVVLDLMLPGEDGLS 131
IL+ +DD + + + L G V I +N A I A DLVV D+++P E+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 132 ICRQVR-PQFDGPILMLTARTDDMDEVLGLEMGADDYVCKPVRPRVLLARIRALLRRSEA 190
+ +++ + D P+L+++A+ M + E GA DY+ KP L+ I L +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 191 PEAGAPAADSKRLAFGRLVIDNAMREAW 218
+ + + AM+E +
Sbjct: 125 RPSKLEDD--SQDGMPLVGRSAAMQEIY 150


117  PAKAF_04068  PAKAF_04079  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_04068  1       8        1.689914   probable major facilitator superfamily (MFS)
PAKAF_04069  1       10       0.100457   diguanylate cyclase
PAKAF_04070  0       11       0.105379   hypothetical protein
PAKAF_04071  0       9        -0.139323  flagellar protein FliJ
PAKAF_04072  0       8        0.391273   flagellum-specific ATP synthase FliI
PAKAF_04073  -1      10       0.312705   probable flagellar assembly protein
PAKAF_04074  -1      9        0.121024   flagellar motor switch protein FliG
PAKAF_04075  -2      9        0.205121   Flagella M-ring outer membrane protein
PAKAF_04076  0       10       -0.107215  flagellar hook-basal body complex protein FliE
PAKAF_04077  0       10       0.294956   two-component response regulator
PAKAF_04078  -1      11       -2.056257  two-component sensor
PAKAF_04079  -1      12       -3.244304  transcriptional regulator FleQ
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04068  TCRTETA       57     4e-11    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 57.1 bits (138), Expect = 4e-11
Identities = 82/337 (24%), Positives = 125/337 (37%), Gaps = 34/337 (10%)

Query: 4 RPRPPLLLVLALLALPQVAETILSPALPALASHWRLDDATSQWT------MALFFVGFAP 57
+P PL+++L+ +AL V ++ P LP L + + AL AP
Sbjct: 2 KPNRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAP 61

Query: 58 GIWLWGWLADRLGRRPALLGGLGLAALATFGAWASTDYSYLLACRLVQGLGLATCSVTVQ 117
+ G L+DR GRRP LL L AA+ + L R+V G+ AT +V
Sbjct: 62 ---VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAV-AG 117

Query: 118 ASLRDVLQGPALMSYFVTLGAVLAWSPAVGPLGGQWLADLGGH-PAVFATLAVLLASLAA 176
A + D+ G +F + A + GP+ G + H P A L L
Sbjct: 118 AYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTG 177

Query: 177 LVV---PAWPETRPLLAGTPEPATLAIFRRVLADRPLQTRALLVAVLNVLVFSFYAAGPF 233
+ E RPL P FR + + V + LV AA
Sbjct: 178 CFLLPESHKGERRPLRREALNPLAS--FRWARGMTVVAA-LMAVFFIMQLVGQVPAALWV 234

Query: 234 MVGDLPGLGFGW----IGLAIAIAGSLGAL----LNRRLPRTWNSARRVRLGLALAAAGA 285
+ G+ F W IG+++A G L +L + + R + LG+ A
Sbjct: 235 IFGEDR---FHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGM---IADG 288

Query: 286 TAQTLLAAVGYAEGLYWALPALPIFIGFGVAIPNLLG 322
T LLA + A P + + G+ +P L
Sbjct: 289 TGYILLAFATRG---WMAFPIMVLLASGGIGMPALQA 322


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04071  FLGFLIJ       54     2e-12    Flagellar FliJ protein signature.
		>FLGFLIJ#Flagellar FliJ protein signature.

Length = 147

Score = 54.4 bits (130), Expect = 2e-12
Identities = 46/134 (34%), Positives = 74/134 (55%)

Query: 8 LAPVVDMASKAERDAATQLGRCQQQLLAAQQKLAELERYRNDYQQQWISQGQKGVSGQWL 67
LA + D+A K DAA LG ++ A+++L L Y+N+Y+ S G++
Sbjct: 7 LATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAGITSNRW 66

Query: 68 MNYQRFLSQLETAVAQQANSVTWHREAVDKARLNWQERYARLEGLRKLVERYLEEARQAE 127
+NYQ+F+ LE A+ Q + + VD A +W+E+ RL+ + L ER A AE
Sbjct: 67 INYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQSTAALLAE 126

Query: 128 DKREQKQLDELAQR 141
++ +QK++DE AQR
Sbjct: 127 NRLDQKKMDEFAQR 140


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04073  FLGFLIH       47     3e-09    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 47.1 bits (111), Expect = 3e-09
Identities = 28/103 (27%), Positives = 57/103 (55%), Gaps = 2/103 (1%)

Query: 12 DALIEQGMVNLVNHVARQVIQRELHMDSSHVRQVLREALKLLPMGAANIRIHVNPQDFER 71
D++I ++ + ARQVI + +D+S + + +++ L+ P+ + ++ V+P D +R
Sbjct: 113 DSVIASRLMQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQR 172

Query: 72 VKAL--RERHEESWRILEDDSLLPGGCRIETEHSRIDATIETR 112
V + WR+ D +L PGGC++ + +DA++ TR
Sbjct: 173 VDDMLGATLSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATR 215


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04074  FLGMOTORFLIG  305    e-105    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 305 bits (784), Expect = e-105
Identities = 109/330 (33%), Positives = 204/330 (61%)

Query: 9 KLTKVDKAAILLLSLGETDAAQVLRHMGPKEVQRVGVAMASMRNVHREQVEQVMGEFVEV 68
LT KAAILL+S+G +++V +++ +E++ + +A + + E + V+ EF E+
Sbjct: 14 ALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFKEL 73

Query: 69 VGDQTSLGVGADGYIRKMLTQALGEDKANNLIDRILLGGSTSGLDSLKWMEPRAVADVIR 128
+ Q + G Y R++L ++LG KA ++I+ + + + ++ +P + + I+
Sbjct: 74 MMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFVRRADPANILNFIQ 133

Query: 129 YEHPQIQAIVVAYLDPDQAAEVLSHFDHKVRLDIVLRVSSLNTVQPSALKELNLILEKQF 188
EHPQ A++++YLDP +A+ +LS +V+ ++ R++ ++ P ++E+ +LEK+
Sbjct: 134 QEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLEKKL 193

Query: 189 AGNSNATRTTMGGVKRAADIMNYLDSSIEGQLMDSIREVDEDLSGQIEDLMFVFDNLADV 248
A S+ T+ GGV +I+N D E +++S+ E D +L+ +I+ MFVF+++ +
Sbjct: 194 ASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDIVLL 253

Query: 249 DDRGIQALLREVSSDVLVLALKGSDEAIREKVFKNMSKRAAELLRDDLEAKGPVRVSEVE 308
DDR IQ +LRE+ L ALK D ++EK+FKNMSKRAA +L++D+E GP R +VE
Sbjct: 254 DDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRKDVE 313

Query: 309 GAQKEILTIARRMAESGDIVLGGKGGEEMI 338
+Q++I+++ R++ E G+IV+ G E+++
Sbjct: 314 ESQQKIVSLIRKLEEQGEIVISRGGEEDVL 343


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04075  FLGMRINGFLIF  603    0.0      Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 603 bits (1557), Expect = 0.0
Identities = 206/577 (35%), Positives = 309/577 (53%), Gaps = 40/577 (6%)

Query: 30 LDNLSEMTMLRQIGLLVGLAASVAIGFAVVLWSQQPDYKPLYGSLNGVDANRVVEALTAA 89
L+ L+ + +I L+V +A+VAI A+VLW++ PDY+ L+ +L+ D +V LT
Sbjct: 13 LEWLNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQM 72

Query: 90 DIPYKVEPNSGALLVKADDLGRARMKVASAGVAPTDNNVGFEILDKEQALGTSQFMEATN 149
+IPY+ SGA+ V AD + R+++A G+ P VGFE+LD+E G SQF E N
Sbjct: 73 NIPYRFANGSGAIEVPADKVHELRLRLAQQGL-PKGGAVGFELLDQE-KFGISQFSEQVN 130

Query: 150 YRRGLEGELARTVSSLNNVKAARVHLAIPKSSVFVRDDRKPSASVLVELYPGRSLEPSQV 209
Y+R LEGELART+ +L VK+ARVHLA+PK S+FVR+ + PSASV V L PGR+L+ Q+
Sbjct: 131 YQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQI 190

Query: 210 MAIVNLVATSVPELDKSQVTVVDQKGNLLSDQQELSELTMAGKQFDFTRRMEGLLTQRVH 269
A+V+LV+++V L VT+VDQ G+LL+ Q S + Q F +E + +R+
Sbjct: 191 SAVVHLVSSAVAGLPPGNVTLVDQSGHLLT-QSNTSGRDLNDAQLKFANDVESRIQRRIE 249

Query: 270 NILQPVLGNGRYKAEVSADVDFSAVESTSEMYNPDQPA----LRSEQRNNEERQNSSGPQ 325
IL P++GNG A+V+A +DF+ E T E Y+P+ A LRS Q N E+ + P
Sbjct: 250 AILSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPG 309

Query: 326 GVPGALSNQPPGPASAPQQATASAPADYVAPGQPLKDANGQTIIDPKTGKPELAPYPTDK 385
GVPGALSNQP P AP + P P N Q T + P
Sbjct: 310 GVPGALSNQPAPPNEAP----IATP--------PTNQQNAQNTPQTSTSTNSNSAGPRST 357

Query: 386 RDQTTRNYELDRSISYTKQQQGRLRRLSVAVVLDDQMKVDAKTGEVSHQPWSADELARFT 445
+ T NYE+DR+I +TK G + RLSVAVV++ + D K P +AD++ +
Sbjct: 358 QRNETSNYEVDRTIRHTKMNVGDIERLSVAVVVNYKTLADGKP-----LPLTADQMKQIE 412

Query: 446 RLVQDSVGYDASRGDSVSVINAPFAPAQAEEIDSIPFYSQPWFWDIVKQVLGVLFILVLM 505
L ++++G+ RGD+++V+N+PF A +PF+ Q F D + L +LV+
Sbjct: 413 DLTREAMGFSDKRGDTLNVVNSPF-SAVDNTGGELPFWQQQSFIDQLLAAGRWLLVLVVA 471

Query: 506 F----GVLRPVLSNITGGGKGKSLAGGGGGRDGDLALGESGLEGSLADDRVSIGGPSSIL 561
+ +RP L+ K E +E L+ D ++
Sbjct: 472 WILWRKAVRPQLTRRVEEAKAAQEQAQVRQE------TEEAVEVRLSKDEQLQQRRANQR 525

Query: 562 LPSPTEGYDAQLNAIKNLVAQDPGRVAQVVKEWINAD 598
L G + I+ + DP VA V+++W++ D
Sbjct: 526 L-----GAEVMSQRIREMSDNDPRVVALVIRQWMSND 557


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04076  FLGHOOKFLIE   91     1e-27    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.

Length = 103

Score = 91.3 bits (226), Expect = 1e-27
Identities = 42/92 (45%), Positives = 56/92 (60%)

Query: 18 QMEAMAKAKPAQAPAEAGAPSFSEMLSQAVDKVNETQQASTAMANAFEVGQSGVDLTDVM 77
Q++A A + AQ SF+ L A+D++++TQ A+ A F +G+ GV L DVM
Sbjct: 12 QLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTLGEPGVALNDVM 71

Query: 78 IASQKASVSFQAMTQVRNKLVQAYQDIMQMPV 109
QKASVS Q QVRNKLV AYQ++M M V
Sbjct: 72 TDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04077  HTHFIS        503    e-179    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 503 bits (1298), Expect = e-179
Identities = 172/482 (35%), Positives = 255/482 (52%), Gaps = 18/482 (3%)

Query: 2 AAKVLLVEDDRALREALSDTLLLGGHEFVAVDSAEAALPVLAREAFSLVISDVNMPGMDG 61
A +L+ +DD A+R L+ L G++ +A +A LV++DV MP +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 62 HQLLGLIRTRYPHLPVLLMTAYGAVDRAVEAMRQGAADYLVKPF--------EARALLDL 113
LL I+ P LPVL+M+A A++A +GA DYL KPF RAL +
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 114 VARHALGQLPGCEEDGPVALEPASRQLLELAARVARSDSTVLISGESGTGKEVLANYIHQ 173
R + + + V A +++ + AR+ ++D T++I+GESGTGKE++A +H
Sbjct: 123 KRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHD 182

Query: 174 QSPRAGKPFIAINCAAIPDNMLEATLFGHEKGSFTGAIAAQPGKFELADGGTILLDEISE 233
R PF+AIN AAIP +++E+ LFGHEKG+FTGA G+FE A+GGT+ LDEI +
Sbjct: 183 YGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGD 242

Query: 234 MPLGLQAKLLRVLQEREVERVGARKPINLDIRVLATTNRDLAAEVAAGRFREDLYYRLSV 293
MP+ Q +LLRVLQ+ E VG R PI D+R++A TN+DL + G FREDLYYRL+V
Sbjct: 243 MPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNV 302

Query: 294 FPLAWRPLRERPADILPLAERLLRKHSRKMNLGAVALGPEAAQCLVRHAWPGNVRELDNA 353
PL PLR+R DI L +++ + K L EA + + H WPGNVREL+N
Sbjct: 303 VPLRLPPLRDRAEDIPDLVRHFVQQ-AEKEGLDVKRFDQEALELMKAHPWPGNVRELENL 361

Query: 354 IQRALILQQGGLIQPADLCLTAPIGMPLAAPVPVPMPAMPPATPPSVE------IPSPAA 407
++R L +I + +P + + + +VE S
Sbjct: 362 VRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGD 421

Query: 408 GQDASGALGDDLRRREFQVIIDTLRTERGRRKEAAERLGISPRTLRYKLAQMRDAGMDVE 467
SG L E+ +I+ L RG + +AA+ LG++ TLR K+ ++ G+ V
Sbjct: 422 ALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIREL---GVSVY 478

Query: 468 AY 469

Sbjct: 479 RS 480


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04078  PF06580       37     1e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 36.8 bits (85), Expect = 1e-04
Identities = 20/97 (20%), Positives = 32/97 (32%), Gaps = 19/97 (19%)

Query: 299 LVENA----IQACGPELRLKVHLYARADSLRLSVSDNGPGMDPATLARLGEPFFTTKTTG 354
LVEN I ++ + ++ L V + G T
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNT------------KES 310

Query: 355 TGLGLAVVKAVARAHQG---QLQLRSRPGRGTCATLI 388
TG GL V+ + G Q++L + G+ LI
Sbjct: 311 TGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLI 347


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04079  HTHFIS        510    0.0      FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 510 bits (1314), Expect = 0.0
Identities = 181/489 (37%), Positives = 256/489 (52%), Gaps = 14/489 (2%)

Query: 5 TKLLLIDDNLDRSRDLAVILNFLGEDQLTCNS--EDWREVAAGLSNSREALCVLLGSVES 62
+L+ DD+ L L+ G D ++ WR +AAG + V+ V
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGD-----LVVTDVVMP 58

Query: 63 KGGAVELLKQLASWDEYLPILLI-GEPAPADWPEELRRRVLASLEMPPSYNKLLDSLHRA 121
A +LL ++ LP+L++ + + + L P +L+ + RA
Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 122 QVYREMYDQARERGRSREPNLFRSLVGTSRAIQQVRQMMQQVADTDASVLILGESGTGKE 181
+ R + LVG S A+Q++ +++ ++ TD +++I GESGTGKE
Sbjct: 119 LAEP----KRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKE 174

Query: 182 VVARNLHYHSKRREGPFVPVNCGAIPAELLESELFGHEKGAFTGAITSRAGRFELANGGT 241
+VAR LH + KRR GPFV +N AIP +L+ESELFGHEKGAFTGA T GRFE A GGT
Sbjct: 175 LVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGT 234

Query: 242 LFLDEIGDMPLPMQVKLLRVLQERTFERVGSNKTQNVDVRIIAATHKNLEKMIEDGTFRE 301
LFLDEIGDMP+ Q +LLRVLQ+ + VG DVRI+AAT+K+L++ I G FRE
Sbjct: 235 LFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFRE 294

Query: 302 DLYYRLNVFPIEMAPLRERVEDIALLLNELISRMEHEKRGSIRFNSAAIMSLCRHDWPGN 361
DLYYRLNV P+ + PLR+R EDI L+ + + E E RF+ A+ + H WPGN
Sbjct: 295 DLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGN 354

Query: 362 VRELANLVERLAIMHPYGVIGVGELPKKFR-HVDDEDEQLASSLREELEERAAINAGLPG 420
VREL NLV RL ++P VI + + R + D + A++ L A+ +
Sbjct: 355 VRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQ 414

Query: 421 MDAPAM-LPAEGLDLKDYLANLEQGLIQQALDDAGGVVARAAERLRIRRTTLVEKMRKYG 479
A LA +E LI AL G +AA+ L + R TL +K+R+ G
Sbjct: 415 YFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELG 474

Query: 480 MSRRDDDLS 488
+S S
Sbjct: 475 VSVYRSSRS 483


118  PAKAF_04096  PAKAF_04109  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_04096  -1      8        -2.630911  short chain dehydrogenase/reductase family
PAKAF_04097  -1      8        -2.129672  3-oxoacyl-ACP
PAKAF_04098  -1      10       -1.813419  acyl carrier protein
PAKAF_04099  -1      11       -1.455439  nucleotide sugar
PAKAF_04100  1       12       -1.369225  flagellar hook-associated protein
PAKAF_04101  1       12       -0.983446  flagellar hook-associated protein 1 FlgK
PAKAF_04102  1       14       -0.985194  flagellar protein FlgJ
PAKAF_04103  3       14       -1.683982  flagellar P-ring protein precursor FlgI
PAKAF_04104  3       14       -2.497347  flagellar L-ring protein precursor FlgH
PAKAF_04105  3       14       -2.990902  flagellar basal-body rod protein FlgG
PAKAF_04106  0       13       -3.264497  flagellar basal-body rod protein FlgF
PAKAF_04107  1       12       -3.251842  flagellar hook protein FlgE
PAKAF_04108  0       14       -3.599933  flagellar basal-body rod modification protein
PAKAF_04109  0       13       -3.549588  flagellar basal-body rod protein FlgC
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04096  DHBDHDRGNASE  108    2e-30    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 108 bits (270), Expect = 2e-30
Identities = 69/260 (26%), Positives = 130/260 (50%), Gaps = 13/260 (5%)

Query: 7 FNPFSLSGRRILVTGASSGLGLAIAQSCARMGAELIVSGRDPQRLGASLEALQAISDLSH 66
N + G+ +TGA+ G+G A+A++ A GA + +P++L + +L+A +
Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEA-RHA 59

Query: 67 QAIQVDLTVAEQRAALVAALDGEIHGV---VHSAGISRLCPVRMMSEAHLREVQSINVDS 123
+A D+ + + A ++ E+ + V+ AG+ R + +S+ S+N
Sbjct: 60 EAFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTG 119

Query: 124 PMLLTQALLKRNLIAAGGSILFIASIAAHIGVAGVGAYSGTKAALIAMSRCLAMEVVKRR 183
++++ K + GSI+ + S A + + AY+ +KAA + ++CL +E+ +
Sbjct: 120 VFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYN 179

Query: 184 IRVNCLSPALVETPLLE-------ATAQVV-GSMDTERNNYPLG-FGKPEDIANAAIFML 234
IR N +SP ET + QV+ GS++T + PL KP DIA+A +F++
Sbjct: 180 IRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLV 239

Query: 235 SDASRWVTGTTLVMDGGLTI 254
S + +T L +DGG T+
Sbjct: 240 SGQAGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04100  FLAGELLIN     55     3e-10    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 55.1 bits (132), Expect = 3e-10
Identities = 62/369 (16%), Positives = 122/369 (33%), Gaps = 14/369 (3%)

Query: 1 MRISTIQAFNNSVNGISRNYADLNRTFEQISTGKRILTPADDPVGSVRLLRLD-QEQGLN 59
I+T + N ++++ + L+ E++S+G RI + DD G R +GL
Sbjct: 2 QVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLT 61

Query: 60 EQYKTGMTEAKNSLSQEETILRSVGNVLQRIREIAGQAGDGALDSNDKKSLASELRQRED 119
Q + + E L + N LQR+RE++ QA +G +D KS+ E++QR +
Sbjct: 62 -QASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 120 ELLNLLNSRDASGKYLFSGSQGSVQPFVRNEDGTYSYMGDESQREVQIASSTRIPVSDSG 179
E+ + N +G + S N+ T + + V+ V+
Sbjct: 121 EIDRVSNQTQFNGVKVLSQDNQMKIQVGANDGETITIDLQKID--VKSLGLDGFNVNGPK 178

Query: 180 KVLFEDIVNAARLDTKAAAGNTGDGRISVGLVEDELAFDSQFPASNPPAATDGFNIHFVS 239
+ D+ ++ + T G + V + + D+ P + N +
Sbjct: 179 EATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTT 238

Query: 240 DKEYVVYDPKSLPPGYDWTTYDPNSPPAWQLSKGAIDDDPKTIDKVLYAGVSVTIDGTPK 299
D T + A + K D Y GV+ TID
Sbjct: 239 DDAENNTAVDLF------KTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTG 292

Query: 300 AGDEFNVNYKPGSEKRSLLNVVSDLRKALESSTDNQAGNDAIRDATAVALTNLSAVAAAV 359
V+ + ++ ++ + A + ++ +
Sbjct: 293 NDGNGKVS----TTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKT 348

Query: 360 DGGQGKIGA 368
K+
Sbjct: 349 KNESAKLSD 357



Score = 37.3 bits (86), Expect = 1e-04
Identities = 24/94 (25%), Positives = 40/94 (42%)

Query: 326 KALESSTDNQAGNDAIRDATAVALTNLSAVAAAVDGGQGKIGARLNTVESTETFIDDVKL 385
A ST A + +TA L ++ + + VD + +GA N +S T + +
Sbjct: 398 TASGVSTLINEDAAAAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVT 457

Query: 386 VNASVMSQIQDLDYAEALSRLSLQSTIMDAAQQS 419
S S+I+D DYA +S +S + A
Sbjct: 458 NLNSARSRIEDADYATEVSNMSKAQILQQAGTSV 491


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04101  FLGHOOKAP1    244    1e-74    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 244 bits (624), Expect = 1e-74
Identities = 142/469 (30%), Positives = 236/469 (50%), Gaps = 23/469 (4%)

Query: 2 SDLLSIGLSGLGTSQTWLTITGHNITNVKTPGYSRQDAIQQTQVPQFSGAGYMGSGSQIV 61
S L++ +SGL +Q L +NI++ GY+RQ I G++G+G +
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 60

Query: 62 DVRRLASDFLTGQLRNATSQNSELSAFRSQIEQLDGLLSNTTTGVSPAMQRFFAALQAAA 121
V+R F+T QLR A +Q+S L+A Q+ ++D +LS +T+ ++ MQ FF +LQ
Sbjct: 61 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 120

Query: 122 NNPSSTEAREAVLAQAEGLGKTFNTLYDQLDKQNSLINQQLGALASQVNHLSQSVASYND 181
+N AR+A++ ++EGL F T L Q+ +N +GA Q+N+ ++ +AS ND
Sbjct: 121 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 180

Query: 182 AIAK--AKSAGAVPNDLMDARDEAVRKLSEMIGVTAVTQDDNSVSLFIGSGQPLVVGNTV 239
I++ AGA PN+L+D RD+ V +L++++GV QD + ++ + +G LV G+T
Sbjct: 181 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA 240

Query: 240 STLSVVPGLDDPTRYQVQLSNG--NSIQNVTGLVSGGEMGGLLAYRNSALDSSYNKLGQL 297
L+ VP DP+R V +G +I+ L++ G +GG+L +R+ LD + N LGQL
Sbjct: 241 RQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL 300

Query: 298 AITLADTINKQLGQGLDLAGKAGANLFGDINDPDITALRVLAKNGNTGNVHANLNITDTS 357
A+ A+ N Q G D G AG + F I VL N G+V +TD S
Sbjct: 301 ALAFAEAFNTQHKAGFDANGDAGEDFF------AIGKPAVLQNTKNKGDVAIGATVTDAS 354

Query: 358 KLNSSDFRLDFDGTNFTARRLGDDASMQVTVSGTGPYTLSFKDANGVDQGFNLTLDQLPA 417
+ ++D+++ FD + RL + + VT + G LT PA
Sbjct: 355 AVLATDYKISFDNNQWQVTRLASNTTFTVT---------PDANGKVAFDGLELTFTGTPA 405

Query: 418 AGDRFTLQPTRRGAADIEATLKNASQLAFAGTARTESTTENRGTGKIGA 466
D FTL+P +++ + + +++A +E + A
Sbjct: 406 VNDSFTLKPVSDAIVNMDVLITDEAKIA----MASEEDAGDSDNRNGQA 450



Score = 83.5 bits (206), Expect = 6e-19
Identities = 49/111 (44%), Positives = 65/111 (58%), Gaps = 3/111 (2%)

Query: 569 FNDKGISDNRNALNLLALQTKPTVGGTDNTGSTYNEAYGGLVERVGTLTAQVRASSEASA 628
D G SDNRN LL LQ+ G ++N+AY LV +G TA ++ SS
Sbjct: 437 EEDAGDSDNRNGQALLDLQSNSKTVGGA---KSFNDAYASLVSDIGNKTATLKTSSATQG 493

Query: 629 TVLKQAQDSRDSLSGVSLDEEAANLIQFQQYYGASAQVIQVARTLFDTLIG 679
V+ Q + + S+SGV+LDEE NL +FQQYY A+AQV+Q A +FD LI
Sbjct: 494 NVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04102  FLGFLGJ       148    1e-43    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 148 bits (374), Expect = 1e-43
Identities = 79/195 (40%), Positives = 114/195 (58%), Gaps = 7/195 (3%)

Query: 198 LPAQSYPAASRRGFSTDGVDSQGSRRIAQP-----PLARGKSMFASADEFIATMLPMAQK 252
LP +S PAA F + V ++ ++Q P S+ + F+A + AQ
Sbjct: 104 LPEESTPAA-PMKFPLETVVRYQNQALSQLVQKAVPRNYDDSLPGDSKAFLAQLSLPAQL 162

Query: 253 AAERIGVDARYLVAQAALETGWGKSIIRQQDGGSSHNLFGIKTGSRWDGASARALTTEYE 312
A+++ GV ++AQAALE+GWG+ IR+++G S+NLFG+K W G TTEYE
Sbjct: 163 ASQQSGVPHHLILAQAALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYE 222

Query: 313 GGKAVKEVAAFRSYSSFEQSFHDYVSFLQGNDRYQNALDSAANPERFMQELQRAGYATDP 372
G+A K A FR YSS+ ++ DYV L N RY A+ +AA+ E+ Q LQ AGYATDP
Sbjct: 223 NGEAKKVKAKFRVYSSYLEALSDYVGLLTRNPRYA-AVTTAASAEQGAQALQDAGYATDP 281

Query: 373 QYARKVAQIARQMQT 387
YARK+ + +QM++
Sbjct: 282 HYARKLTNMIQQMKS 296



Score = 67.8 bits (165), Expect = 1e-14
Identities = 36/90 (40%), Positives = 56/90 (62%), Gaps = 4/90 (4%)

Query: 20 DLNRLNQLKVGKDRDGEANIRKVAQEFESLFLNEMLKSMRSANEALGDGNFMNSQTTKQY 79
D LN+LK D ANIR VA++ E +F+ MLKSMR +AL +S+ T+ Y
Sbjct: 14 DAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMR---DALPKDGLFSSEHTRLY 70

Query: 80 QDMYDQQLSVSLSKNAGGIGLADVLVRQLS 109
MYDQQ++ ++ G+GLA+++V+Q++
Sbjct: 71 TSMYDQQIAQQMT-AGKGLGLAEMMVKQMT 99


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04103  FLGPRINGFLGI  436    e-155    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 436 bits (1122), Expect = e-155
Identities = 168/366 (45%), Positives = 224/366 (61%), Gaps = 10/366 (2%)

Query: 7 LLALAALLLAAGAAQAERLKDIASIQGVRTNQLIGYGLVVGLSGSGDQTTQTPFTLQTFN 66
AL L A R+KDIAS+Q R NQLIGYGLVVGL G+GD +PFT Q+
Sbjct: 13 FSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMR 72

Query: 67 NMLAQFGIKVPANVGNVQLKNVAAVSVHADLPPFAKPGQPIDVTVSSIGNAKSLRGGSLL 126
ML GI G KN+AAV V A+LPPFA PG +DVTVSS+G+A SLRGG+L+
Sbjct: 73 AMLQNLGITTQG--GQSNAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLI 130

Query: 127 MTPLKGIDGQVYAVAQGNLVVGGFDAEGRDGSKITVNVPSAGRIPAGATVERAVPSGFDQ 186
MT L G DGQ+YAVAQG L+V GF A+G D + +T V ++ R+P GA +ER +PS F
Sbjct: 131 MTSLSGADGQIYAVAQGALIVNGFSAQG-DAATLTQGVTTSARVPNGAIIERELPSKFKD 189

Query: 187 GNSLTLNLNRPDFTTAKRIVDRINEL----LGPGVAHAVDGGSVRVSAPLDPNQRVDYLS 242
+L L L PDF+TA R+ D +N G +A D + V P ++
Sbjct: 190 SVNLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKP-RVADLTRLMA 248

Query: 243 ILENLDVQPGEAVAKVIINSRTGTIVIGQNVKVSPAAVTHGSLTVSITEDPIVSQPGAFS 302
+ENL V+ + AKV+IN RTGTIVIG +V++S AV++G+LTV +TE P V QP FS
Sbjct: 249 EIENLTVET-DTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQPAPFS 307

Query: 303 NGQTAVVPRSRVNAEEETKPMFKFGPGTTLDDIVRAVNQVGAAPSDLMAILEALKQAGAL 362
GQTAV P++ + A +E + G L +V +N +G ++AIL+ +K AGAL
Sbjct: 308 RGQTAVQPQTDIMAMQEGSKVA-IVEGPDLRTLVAGLNSIGLKADGIIAILQGIKSAGAL 366

Query: 363 QADLIV 368
QA+L++
Sbjct: 367 QAELVL 372


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04104  FLGLRINGFLGH  180    3e-59    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 180 bits (459), Expect = 3e-59
Identities = 81/224 (36%), Positives = 112/224 (50%), Gaps = 13/224 (5%)

Query: 12 IATALGGCVNPPPKPNDPYYAPVLPRTPLPAAQNNGAIYQAGF-----EQNLYDDRKAFR 66
+ +L GC P P P P P NG+I+Q+ Q L++DR+
Sbjct: 15 LVLSLTGCAWIPSTPLVQGATSAQP-VPGPTPVANGSIFQSAQPINYGYQPLFEDRRPRN 73

Query: 67 VGDIITITLNEKTQASKKANSDIQKDSKTKMGLTSLFGSGMTTNNPIGGGDLSLSAEYGG 126
+GD +TI L E ASK ++++ +D KT G + G + E G
Sbjct: 74 IGDTLTIVLQENVSASKSSSANASRDGKTNFGFDT---VPRYLQGLFGNARADV--EASG 128

Query: 127 SRDAKGDSQAGQSNSLTGSITVTVAEVLPNGILSVRGEKWMTLNTGNELVRIAGLVRADD 186
G A SN+ +G++TVTV +VL NG L V GEK + +N G E +R +G+V
Sbjct: 129 GNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRFSGVVNPRT 188

Query: 187 IATDNTVSSTRVADARITYSGTGAFADASQPGWLDRFF--LSPL 228
I+ NTV ST+VADARI Y G G +A GWL RFF LSP+
Sbjct: 189 ISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 232


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
PAKAF_04105  FLGHOOKAP1  45     2e-07    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 44.6 bits (105), Expect = 2e-07
Identities = 13/51 (25%), Positives = 25/51 (49%)

Query: 209 NGLGTVAQNTLENSNVNVVEELVNMITTQRAYEMNSKVISTADQMLSFVTQ 259
N + ++ S VN+ EE N+ Q+ Y N++V+ TA+ + +
Sbjct: 494 NVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544



Score = 39.6 bits (92), Expect = 9e-06
Identities = 20/79 (25%), Positives = 35/79 (44%), Gaps = 14/79 (17%)

Query: 3 SALWVSKTGLSAQDMNLTTISNNLANVSTTGFKRDRAEFQDLLYQIRRQPGGQSTQDSEL 62
S + + +GL+A L T SNN+++ + G+ R + +S L
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTI--------------MAQANSTL 47

Query: 63 PSGLQLGTGVRVVGTQKIF 81
+G +G GV V G Q+ +
Sbjct: 48 GAGGWVGNGVYVSGVQREY 66


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
PAKAF_04107  FLGHOOKAP1  45     5e-07    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 44.9 bits (106), Expect = 5e-07
Identities = 17/49 (34%), Positives = 27/49 (55%)

Query: 414 ALQSGALEASNVDISNELVNLIVHQRNYQANAKTIQTEDAVTQTIINLR 462
L + S V++ E NL Q+ Y ANA+ +QT +A+ +IN+R
Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546



Score = 41.1 bits (96), Expect = 8e-06
Identities = 22/69 (31%), Positives = 34/69 (49%), Gaps = 3/69 (4%)

Query: 2 SFNIGLSGIQAASSGLNVTGNNIANAGTVGFKQSRAEFADVYAASVLGSGSNPQGSGVLL 61
N +SG+ AA + LN NNI++ G+ + A A S LG+G G+GV +
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQ--ANSTLGAGGW-VGNGVYV 59

Query: 62 SDVSQMFKQ 70
S V + +
Sbjct: 60 SGVQREYDA 68


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
PAKAF_04109  FLGHOOKAP1  36     3e-05    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 36.1 bits (83), Expect = 3e-05
Identities = 8/38 (21%), Positives = 20/38 (52%)

Query: 107 NVNVVEEMADMISASRAFQTNAEMMNTAKQMMQKVLTL 144
VN+ EE ++ + + NA+++ TA + ++ +
Sbjct: 508 GVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545



Score = 29.5 bits (66), Expect = 0.004
Identities = 15/54 (27%), Positives = 25/54 (46%), Gaps = 2/54 (3%)

Query: 4 ASVFNIAGSGMSAQSTRLNTVASNIANAETVSSSVDKTYRARHPVFSTMFQQAQ 57
+S+ N A SG++A LNT ++NI++ + T A ST+
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMA--QANSTLGAGGW 52


119  PAKAF_04212  PAKAF_04217  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_04212  1       20       -4.572427  tol-pal system protein YbgF
PAKAF_04213  0       21       -5.099523  Peptidoglycan associated lipoprotein OprL
PAKAF_04214  1       19       -4.841498  TolB protein
PAKAF_04215  1       23       -4.990618  TolA protein
PAKAF_04216  1       24       -4.387962  TolR protein
PAKAF_04217  1       22       -4.051332  TolQ protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
PAKAF_04212  RTXTOXIND  32     0.002    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 32.1 bits (73), Expect = 0.002
Identities = 10/53 (18%), Positives = 19/53 (35%)

Query: 69 QLQQMQDELARLRGTLEEQQNQIQQLKQESLERYQDLDRRISGGGAPAAQNSA 121
+ + +EL + LE+ +++I K+E Q I N
Sbjct: 260 KYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIG 312


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
PAKAF_04213  OMPADOMAIN  116    6e-34    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 116 bits (291), Expect = 6e-34
Identities = 34/112 (30%), Positives = 53/112 (47%), Gaps = 12/112 (10%)

Query: 68 YFEYDSSDLKPEAMRALDVHA---KDLKGSGQRVVLEGHTDERGTREYNMALGERRAKAV 124
F ++ + LKPE ALD +L VV+ G+TD G+ YN L ERRA++V
Sbjct: 222 LFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQSV 281

Query: 125 QRYLVLQGVSPAQLELVSYGKERPVATGHDEQS---------WAQNRRVELK 167
YL+ +G+ ++ G+ PV + A +RRVE++
Sbjct: 282 VDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIE 333


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
PAKAF_04215  IGASERPTASE  48     4e-08    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 47.8 bits (113), Expect = 4e-08
Identities = 36/204 (17%), Positives = 71/204 (34%), Gaps = 21/204 (10%)

Query: 25 QLKSKSQATTQTNQKIAGEAKKTASKQYE-----VEQLEQKKLEQQKLEQQKLEQQQVAA 79
Q + TT N + + + +++ + E +Q +
Sbjct: 990 QTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESK 1049

Query: 80 AKAAEQKKADEARKAEAQKAAEAKKADEAKKAAEAKAAEQKKQADIAKKRAEDEAKKKAA 139
++ A E + A EAK +A + ++A ++ E K+
Sbjct: 1050 TVEKNEQDATETTAQNREVAKEAKSNVKA----------NTQTNEVA--QSGSETKETQT 1097

Query: 140 EDAKKKAAEDAKKKAAEEAKKKAAAEAAKKKAAVEAAKKKAAAAAAAARKAAEDKKAQAL 199
+ K+ A + ++KA E +K E K + V ++++ A A E+ +
Sbjct: 1098 TETKETATVEKEEKAKVETEKT--QEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNI 1155

Query: 200 AELLS--DTTERQQALADEVGSEV 221
E S +TT + A E S V
Sbjct: 1156 KEPQSQTNTTADTEQPAKETSSNV 1179


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
PAKAF_04217  60KDINNERMP  29     0.017    60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 28.7 bits (64), Expect = 0.017
Identities = 17/72 (23%), Positives = 28/72 (38%), Gaps = 13/72 (18%)

Query: 2 WSLISNASIVVQLVMLTLVAASVTSWIMIFQRGNAMRAAKKALDAFEERFWS-----GID 56
+S+I + +V+ +M L A TS MR + + A ER +
Sbjct: 356 FSIII-ITFIVRGIMYPLTKAQYTSM-------AKMRMLQPKIQAMRERLGDDKQRISQE 407

Query: 57 LSKLYRQAGSNP 68
+ LY+ NP
Sbjct: 408 MMALYKAEKVNP 419


120  PAKAF_04461  PAKAF_04468  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_04461  -2      12       0.677380   probable two-component response regulator
PAKAF_04462  -2      11       -0.044526  cis-aconitate porin OpdH
PAKAF_04463  -1      9        0.361790   tripartite tricarboxylate transporter
PAKAF_04464  1       8        0.725488   tripartite tricarboxylate transporter TctB
PAKAF_04465  0       8        0.566406   tripartite tricarboxylate transporter permease
PAKAF_04466  -1      9        0.963810   AbrB family transcriptional regulator
PAKAF_04467  -2      12       0.860328   uracil-DNA glycosylase
PAKAF_04468  -2      10       1.447855   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
PAKAF_04461  HTHFIS  83     8e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 83.3 bits (206), Expect = 8e-21
Identities = 35/127 (27%), Positives = 62/127 (48%)

Query: 2 RILLVEDHPQLAESVVQALKGAGWTVDLLQDGVAADLALASEEYALAILDVGLPRMDGFE 61
IL+ +D + + QAL AG+ V + + +A+ + L + DV +P + F+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VLARLRGRGKTLPVLMLTARGEVKDRVHGLNLGADDYLAKPFELSELEARVKALLRRSVL 121
+L R++ LPVL+++A+ + GA DYL KPF+L+EL + L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 GGEQLQR 128
+L+
Sbjct: 125 RPSKLED 131


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
PAKAF_04463  AEROLYSIN  29     0.028    Aerolysin signature.
		>AEROLYSIN#Aerolysin signature.

Length = 493

Score = 28.8 bits (64), Expect = 0.028
Identities = 16/40 (40%), Positives = 23/40 (57%)

Query: 2 MMKLSFRPLALVAAGLLLAGAAVAEPKRPECIAPASPGGG 41
M K+ L+L+ +GLL+A A AEP P+ + S G G
Sbjct: 1 MQKIKLTGLSLIISGLLMAQAQAAEPVYPDQLRLFSLGQG 40


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04464  ACRIFLAVINRP  27     0.044    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 26.7 bits (59), Expect = 0.044
Identities = 15/58 (25%), Positives = 25/58 (43%), Gaps = 5/58 (8%)

Query: 99 LGFILSAALVGSCMAILYGARPIPAVVTASLL-----GIGLYWLFDRALDVPLPLGVL 151
+S +V C+A LY + IP V + + LF++ DV +G+L
Sbjct: 874 ALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLL 931


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_04468  PF05043  30     0.010    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 29.9 bits (67), Expect = 0.010
Identities = 7/27 (25%), Positives = 16/27 (59%)

Query: 47 YRYIKHTSKLIRRLGDSDLALQRNKVV 73
YR I +K+I+R +++L +++
Sbjct: 118 YRIISQINKVIKRQFQFEVSLTPVQII 144


121  PAKAF_04521  PAKAF_04531  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias   Product
PAKAF_04521  1       11       3.608110  probable type II secretion system protein
PAKAF_04522  0       11       3.045487  probable type II secretion system protein
PAKAF_04523  -1      12       2.800255  probable type II secretion system protein
PAKAF_04524  2       18       3.038679  probable type II secretion system protein
PAKAF_04525  1       15       3.385404  probable type II secretion system protein
PAKAF_04526  1       16       2.876951  HxcX atypical pseudopilin
PAKAF_04527  1       14       3.158381  HxcT pseudopilin
PAKAF_04528  -1      14       2.671338  HxcV putative pseudopilin
PAKAF_04529  -1      16       2.877198  hypothetical protein
PAKAF_04530  -1      15       2.392923  HxcU putative pseudopilin
PAKAF_04531  0       13       1.673753  HxcW putative pseudopilin
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04521  BCTERIALGSPF  378    e-131    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 378 bits (972), Expect = e-131
Identities = 187/407 (45%), Positives = 252/407 (61%), Gaps = 5/407 (1%)

Query: 1 MQTFRYEAADAQGRIETGTLEADSQRGALGQLRARGLTPLEVREQAGGGTGQGAGALFAP 60
M + Y+A DAQG+ GT EADS R A LR RGL PL V E G G+ L
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 61 R---LSDGDLAWATRQLASLLAASLPLEAALSATLDQAERKHIAQTLSAVRSDVRGGMRL 117
R LS DLA TRQLA+L+AAS+PLE AL A Q+E+ H++Q ++AVRS V G L
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 118 ADALAARPRDFPEIYRALVAAGEESGDLAQVMERLADYIEERNALRGKILTAFIYPAVVG 177
ADA+ P F +Y A+VAAGE SG L V+ RLADY E+R +R +I A IYP V+
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 178 VVSIGIVIFLLGYVVPQVVSAFSQARQDLPALTRAMLQASDFVRAWG-WLCAGAIGGAYW 236
VV+I +V LL VVP+VV F +Q LP TR ++ SD VR +G W+ + G
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFM- 239

Query: 237 GWCLYLRDPQARLGWHRRVLRLPLLGRFVLGVNTARFASTLAILGSAGVPLLRALDAARQ 296
+ + LR + R+ +HRR+L LPL+GR G+NTAR+A TL+IL ++ VPLL+A+ +
Sbjct: 240 AFRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGD 299

Query: 297 TLANDCLAQAVEEATAQVREGVSLASALRTRQVFPPILTHLIASGEKTGALPPMLDRAAQ 356
++ND + AT VREGVSL AL +FPP++ H+IASGE++G L ML+RAA
Sbjct: 300 VMSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAAD 359

Query: 357 TLSRDIERRAMGMTALLEPLMIVVMGGVVLTIVMAVLMPIIEMNQLV 403
R+ + L EPL++V M VVL IV+A+L PI+++N L+
Sbjct: 360 NQDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04523  BCTERIALGSPD  256    4e-77    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 256 bits (655), Expect = 4e-77
Identities = 150/562 (26%), Positives = 256/562 (45%), Gaps = 32/562 (5%)

Query: 230 PGNNTVVVTDYAENLDRVAGIIASIDIPSASD---TDVVPIQNGIAVDIASTVSELLDSQ 286
NN V+ +++ A +AS P D T VVP+ N A D+A + +L D+
Sbjct: 94 NMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVVTRVVPLTNVAARDLAPLLRQLNDNA 153

Query: 287 GSGGAEQGQKTVVLADPRSNSIVIRSPSPERTQLARDLIGKLDSVQSNPGNLHVVYLRNA 346
G G +VV +P SN +++ + +L ++ ++D+ ++ V L A
Sbjct: 154 GVG-------SVVHYEP-SNVLLMTGRAAVIKRL-LTIVERVDNAGDR--SVVTVPLSWA 202

Query: 347 QATRLAQALRGLITGDSG--GEGNEGDQ--QRARLSGGGMLGGGNSGTG----SQGLGTS 398
A + + + L S G+ R + + G NS + L
Sbjct: 203 SAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSRQRIIAMIKQLDRQ 262

Query: 399 GNTTGSGSSGLGGSNRSGGAYGAMGSGQGGAGPGAMGEENSAFSAGGVTVQADATTNTLL 458
T G+ ++ + + A + ++A TN L+
Sbjct: 263 QATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPVAALDKNIIIKAHGQTNALI 322

Query: 459 ISAPEPLYRNLREVIDLLDQRRAQVVIESLIVEVSEDDSSEFGIQWQAGNLGGNGVFG-G 517
++A + +L VI LD RR QV++E++I EV + D GIQW N G G
Sbjct: 323 VTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTNSG 382

Query: 518 VNFGQSALNTAGKNTIDVLPKGLNIGLVDGTVDIPGIGKILDLKVLARALKSRGGTNVLS 577
+ + N + L L G + + +L AL S ++L+
Sbjct: 383 LPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQG-NWAMLLTALSSSTKNDILA 441

Query: 578 TPNLLTLDNESASIMVGQTIPFVSGQYVTDGGGTSNNPFQTIQREDVGLKLNIRPQISEG 637
TP+++TLDN A+ VGQ +P ++G T G N F T++R+ VG+KL ++PQI+EG
Sbjct: 442 TPSIVTLDNMEATFNVGQEVPVLTGSQTTSGD----NIFNTVERKTVGIKLKVKPQINEG 497

Query: 638 GTVKLDVYQEVSSVDERASTAA---GVVTNKRAIDTSILLDDGQIMVLGGLLQDNVQDNT 694
+V L++ QEVSSV + AS+ + G N R ++ ++L+ G+ +V+GGLL +V D
Sbjct: 498 DSVLLEIEQEVSSVADAASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTA 557

Query: 695 DGVPGLSSLPGVGSLFRYQKRSRTKTNLMVFLRPYIVRDAAAGRSITLNRYDFIRRAQ-Q 753
D VP L +P +G+LFR + +K NLM+F+RP ++RD R + +Y AQ +
Sbjct: 558 DKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYRQASSGQYTAFNDAQSK 617

Query: 754 RVQPRHDWSVGDMQAPVLPPAQ 775
+ ++ ++ + + P Q
Sbjct: 618 QRGKENNDAMLNQDLLEIYPRQ 639



Score = 159 bits (404), Expect = 6e-43
Identities = 72/276 (26%), Positives = 127/276 (46%), Gaps = 7/276 (2%)

Query: 87 VAPVSATAAELGEQPVSLNFVDTEVEAVVRALSRATGRQFLVDPRVKGKLTLVSEGQVPA 146
A + A E +F T+++ + +S+ + ++DP V+G +T+ S +
Sbjct: 17 FAALLFRPAAAEEFSA--SFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDMLNE 74

Query: 147 RTAYRMLTSALRMQGFSVVDVD-GVSQVVPEADAKLLGGPVYGADRPA-ANGMVTRTFRL 204
Y+ S L + GF+V++++ GV +VV DAK PV P + +VTR L
Sbjct: 75 EQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVVTRVVPL 134

Query: 205 RYENAVNLIPVLRPIVAQNNPINA--YPGNNTVVVTDYAENLDRVAGIIASIDIPSASDT 262
A +L P+LR + + Y +N +++T A + R+ I+ +D
Sbjct: 135 TNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGDRSV 194

Query: 263 DVVPIQNGIAVDIASTVSELLDSQGSGGAEQGQKTVVLADPRSNSIVIRSPSPERTQLAR 322
VP+ A D+ V+EL V+AD R+N++++ P Q
Sbjct: 195 VTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGE-PNSRQRII 253

Query: 323 DLIGKLDSVQSNPGNLHVVYLRNAQATRLAQALRGL 358
+I +LD Q+ GN V+YL+ A+A+ L + L G+
Sbjct: 254 AMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGI 289



Score = 50.3 bits (120), Expect = 2e-08
Identities = 24/154 (15%), Positives = 63/154 (40%), Gaps = 19/154 (12%)

Query: 194 ANGMVTRTFRLRYENAVNLIPVLRPI----------VAQNNPINAYPGNNTVVVTDYAEN 243
A T L + +A +++ ++ + + + A N V+V+ +
Sbjct: 189 AGDRSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNS 248

Query: 244 LDRVAGIIASIDIPSAS--DTDVVPIQNGIAVDIASTVSELL-----DSQGSGGAEQGQK 296
R+ +I +D A+ +T V+ ++ A D+ ++ + + Q + K
Sbjct: 249 RQRIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPVAALDK 308

Query: 297 TV-VLADPRSNSIVIRSPSPERTQLARDLIGKLD 329
+ + A ++N++++ + P+ +I +LD
Sbjct: 309 NIIIKAHGQTNALIVTAA-PDVMNDLERVIAQLD 341



Score = 44.1 bits (104), Expect = 2e-06
Identities = 20/84 (23%), Positives = 35/84 (41%), Gaps = 16/84 (19%)

Query: 190 DRPAANGMVTRTFRLRYENAVNLIPVLR----------------PIVAQNNPINAYPGNN 233
DR A T+ L+Y A +L+ VL + +N I A+ N
Sbjct: 260 DRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPVAALDKNIIIKAHGQTN 319

Query: 234 TVVVTDYAENLDRVAGIIASIDIP 257
++VT + ++ + +IA +DI
Sbjct: 320 ALIVTAAPDVMNDLERVIAQLDIR 343


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04527  BCTERIALGSPG  167    1e-56    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 167 bits (425), Expect = 1e-56
Identities = 63/142 (44%), Positives = 87/142 (61%), Gaps = 6/142 (4%)

Query: 11 KGHRGQRGFTLIEIMVVVVILGILAAMVVPKVLDRPDQARATAARQDISGLMQALKLYRL 70
+ QRGFTL+EIMVV+VI+G+LA++VVP ++ ++A A DI L AL +Y+L
Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKL 61

Query: 71 DQGRYPSQAQGLKVLAERP-ADASASNWRS--YLERLPNDPWGKPYQYLNPGVNGEIDVF 127
D YP+ QGL+ L E P A+N+ Y++RLP DPWG Y +NPG +G D+
Sbjct: 62 DNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLL 121

Query: 128 SLGADGQPGGEGINADIGSWQL 149
S G DG+ G E DI +W L
Sbjct: 122 SAGPDGEMGTED---DITNWGL 140


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04528  BCTERIALGSPG  30     0.001    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 30.2 bits (68), Expect = 0.001
Identities = 19/62 (30%), Positives = 34/62 (54%), Gaps = 3/62 (4%)

Query: 8 RGFTLIEVLVALAIVAIALAAAIRAVGLMTDGNGLLRDKSLA-LLAAESRLAELRLGVGT 66
RGFTL+E++V IV I + A++ LM + + K+++ ++A E+ L +L
Sbjct: 8 RGFTLLEIMV--VIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDNHH 65

Query: 67 AP 68
P
Sbjct: 66 YP 67


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04530  BCTERIALGSPH  34     8e-05    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 33.8 bits (77), Expect = 8e-05
Identities = 23/119 (19%), Positives = 37/119 (31%), Gaps = 7/119 (5%)

Query: 1 MVVLVIVGIATAAISLSARPDPTGLLRQDAARLARLLEIAQGEARVRGTPILWQPSAKGY 60
M++L+++G++ + L+ Q AR L Q G +
Sbjct: 12 MLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQFFGVSVHPDRW 71

Query: 61 RFSPQAYRGKTDAFAADTELRARDWQAAPLRVSVRPPRPVLLDAEWIGAPLRITLSDGQ 119
+F R D AD W PLR V G L + + G+
Sbjct: 72 QFLVLEARDGADPAPADDGWSGYRWL--PLR-----AGRVATSGSIAGGKLNLAFAQGE 123


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04531  BCTERIALGSPG  32     6e-04    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 32.2 bits (73), Expect = 6e-04
Identities = 18/60 (30%), Positives = 32/60 (53%), Gaps = 3/60 (5%)

Query: 12 RRQAGFTLIEVMVAIMLMAIV-SLMAWRGLDSIARASAHLEDSTEQGAALLRALNQLERD 70
+Q GFTL+E+MV I+++ ++ SL+ + + +A S AL AL+ + D
Sbjct: 5 DKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVS--DIVALENALDMYKLD 62


122  PAKAF_04557  PAKAF_04571  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_04557  1       15       -0.455671  two-component response regulator, PprB,Alginate
PAKAF_04558  1       14       0.194177   protein TadG,Predicted membrane protein
PAKAF_04559  3       12       1.820081   Protein of unknown function (DUF3613)
PAKAF_04560  2       14       1.546300   pilus assembly protein,Flp pilus assembly
PAKAF_04561  2       13       1.400927   protein TadC,Flp pilus assembly protein
PAKAF_04562  1       12       1.159843   protein TadB,Flp pilus assembly protein
PAKAF_04563  0       7        0.173091   putative type II secretion system
PAKAF_04564  -1      7        0.313910   protein TadZ,Flp pilus assembly protein, ATPase
PAKAF_04565  -1      8        -0.570491  type II secretion system protein,Pullulanase
PAKAF_04566  -1      10       -0.976052  RcpC,Flp pilus assembly protein CpaB,Flp pilus
PAKAF_04567  -1      11       -1.125925  type IVb pilin, Flp,Flp pilus assembly protein,
PAKAF_04568  -1      10       -0.693233  chemotactic transducer PctC,H3,methyl-accepting
PAKAF_04569  -1      9        -0.324863  ATPase,Ornithine/acetylornithine
PAKAF_04570  -1      11       -0.565364  chemotactic transducer,H3,methyl-accepting
PAKAF_04571  0       9        -0.892945  chemotactic transducer PctB,H3,methyl-accepting
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
PAKAF_04557  HTHFIS  74     5e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 73.7 bits (181), Expect = 5e-17
Identities = 31/156 (19%), Positives = 65/156 (41%), Gaps = 4/156 (2%)

Query: 10 SVLIIDDEPQVTSELRELLENSGYRCVTSTHRESAIASFQADPNIGLVICDLYLGQDNGI 69
++L+ DD+ + + L + L +GY +++ + A LV+ D+ + +N
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDG-DLVVTDVVMPDENAF 63

Query: 70 RLIESLKEVAGNGRFFESIILTGHDGRQEVIEAMRVGAADYYQKPVAPQELLHGLERLEN 129
L+ +K+ + ++++ + I+A GA DY KP EL+ + R
Sbjct: 64 DLLPRIKKARPDLPV---LVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 130 RLHERVRSQLSLSHVNQRLEYLAESLNSIYRDIHKI 165
R S L + ++ IYR + ++
Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARL 156


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04558  BCTERIALGSPC  30     0.031    Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C signature.

Length = 272

Score = 29.5 bits (66), Expect = 0.031
Identities = 17/100 (17%), Positives = 34/100 (34%), Gaps = 1/100 (1%)

Query: 20 LLLALICLLLVVDTGRLYLEQRNLQRVADVAALESASQGALCGDQSSAQATSFAKASAML 79
LL+ L C L + R+ L + ++ Q D + + + L
Sbjct: 21 LLMLLFCQQLAMIFWRIGLPDNAPVSSVQITPAQARQQPVTLNDFTLFGVSPEKNKAGAL 80

Query: 80 N-GFDADAAGSSLSAEVGGVLSAGGLRSFIASASNAAVAN 118
+ ++ S+L+ + GV++ IA S
Sbjct: 81 DASQMSNLPPSTLNLSLTGVMAGDDDSRSIAIISKDNEQF 120


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04559  TYPE3OMGPROT  27     0.013    Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature.

Length = 607

Score = 26.8 bits (59), Expect = 0.013
Identities = 10/20 (50%), Positives = 13/20 (65%)

Query: 4 RILFGVLLLLSGTAWAADTP 23
R+L G LLLLS +WA +
Sbjct: 11 RVLTGTLLLLSSYSWAQELD 30


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
PAKAF_04564  HTHFIS  33     0.002    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 32.9 bits (75), Expect = 0.002
Identities = 22/105 (20%), Positives = 40/105 (38%), Gaps = 10/105 (9%)

Query: 18 LQNSLASAG-QVVPAGSASLEELLALLDVTAAGVLFISL---GKSNLVSQGALVEGLVSA 73
L +L+ AG V +A L + ++ + ++ L+ + A
Sbjct: 19 LNQALSRAGYDVRITSNA--ATLWRWIAAGDGDLVVTDVVMPDENAF----DLLPRIKKA 72

Query: 74 RPMLSVVAIGDGLDNQLVLAAMRAGARDFITYGARASELTGLIRR 118
RP L V+ + + A GA D++ +EL G+I R
Sbjct: 73 RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04565  BCTERIALGSPD  146    2e-40    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 146 bits (369), Expect = 2e-40
Identities = 67/253 (26%), Positives = 109/253 (43%), Gaps = 15/253 (5%)

Query: 131 PNQVQTDIRFVEVSRSKLKQASTSFVRRGGNLWVLG------APGSLGDIKVNADGSGLG 184
QV + EV + + + + + G + N DG+ +
Sbjct: 344 RPQVLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTNSGLPISTAIAGANQYNKDGT-VS 402

Query: 185 GTFGTGSSGFNLIFGG---GKWLSFMNALEGSGFAYTLARPSLVAMSGQSASFLAGGEFP 241
+ + S FN I G G W + AL S LA PS+V + A+F G E P
Sbjct: 403 SSLASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVP 462

Query: 242 IPVP--NGTNDNV--TIEYKEFGIRLTLTPTVMNNRRIALKVAPEVSELDYSAGIQSGGV 297
+ + DN+ T+E K GI+L + P + + L++ EVS + +A S +
Sbjct: 463 VLTGSQTTSGDNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDL 522

Query: 298 AVPALRVRRTDTSVMLADGESFVISGLTSSNSVSNVDKFPWLGDIPILGAFFRSTKLDKD 357
R + +V++ GE+ V+ GL + DK P LGDIP++GA FRST
Sbjct: 523 GA-TFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVS 581

Query: 358 DRELLMIVTPHLV 370
R L++ + P ++
Sbjct: 582 KRNLMLFIRPTVI 594


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
PAKAF_04568  RTXTOXINA  31     0.019    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 30.7 bits (69), Expect = 0.019
Identities = 29/179 (16%), Positives = 65/179 (36%), Gaps = 8/179 (4%)

Query: 299 LIRVLMQPLTDMGRAMQDIAQGEGDLTKRLKVTSNDEFGTLANAFNRFVERIHESIREVA 358
LI ++ + G ++ D+ + +L ++ + F + I + R V
Sbjct: 49 LILLIPKDYKGQGSSLNDLVRTADELGIEVQYDEKNGTAITKQVFGTAEKLIGLTERGVT 108

Query: 359 GTARQLHDVAQLVVNASN---SSMANSDEQSNRTNSVAAAI-NELGAAAQEIARNAADAS 414
A QL + Q A N N + + + + N LG A + +
Sbjct: 109 IFAPQLDKLLQKYQKAGNILGGGAENIGDNLGKAGGILSTFQNFLGTALSSMKIDELIKK 168

Query: 415 HHASDANHQAEDGKQVVEQTIRAMNELSEKISASCANIEALNSRTVNIGQILEVIKGIS 473
+ +E K +E N+L + +++ N+ + + + +G +L K ++
Sbjct: 169 QKSGGNVSSSELAKASIELI----NQLVDTVASLNNNVNSFSQQLNTLGSVLSNTKHLN 223


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
PAKAF_04570  RTXTOXINA  31     0.018    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 31.1 bits (70), Expect = 0.018
Identities = 24/167 (14%), Positives = 58/167 (34%), Gaps = 8/167 (4%)

Query: 308 GRAMQDIAQGEGDLTKRLAVTSRDEFGVLGDAFN---QFVERIHRSIREVAGTAHKLHDV 364
G ++ D+ + +L + ++ + F + + R + A KL
Sbjct: 61 GSSLNDLVRTADELGIEVQYDEKNGTAITKQVFGTAEKLIGLTERGVTIFAPQLDKLLQK 120

Query: 365 SQLVVNASNSSMANSDEQSNRTNSVAAAI-NELGAAAQEIARNAADASHHASDANHQAED 423
Q N N + + + + N LG A + + + +E
Sbjct: 121 YQKAGNILGGGAENIGDNLGKAGGILSTFQNFLGTALSSMKIDELIKKQKSGGNVSSSEL 180

Query: 424 GKQVVEQTIRAMNELSEKISASCANIEALNSRTVNIGQILEVIKGIS 470
K +E N+L + +++ N+ + + + +G +L K ++
Sbjct: 181 AKASIELI----NQLVDTVASLNNNVNSFSQQLNTLGSVLSNTKHLN 223


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
PAKAF_04571  RTXTOXINA  30     0.026    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 30.3 bits (68), Expect = 0.026
Identities = 29/179 (16%), Positives = 65/179 (36%), Gaps = 8/179 (4%)

Query: 296 LIRVLMQPLTDMGRAMQDIAQGEGDLTKRLKVTSNDEFGTLANAFNRFVERIHESIREVA 355
LI ++ + G ++ D+ + +L ++ + F + I + R V
Sbjct: 49 LILLIPKDYKGQGSSLNDLVRTADELGIEVQYDEKNGTAITKQVFGTAEKLIGLTERGVT 108

Query: 356 GTARQLHDVAQLVVNASN---SSMANSDEQSNRTNSVAAAI-NELGAAAQEIARNAADAS 411
A QL + Q A N N + + + + N LG A + +
Sbjct: 109 IFAPQLDKLLQKYQKAGNILGGGAENIGDNLGKAGGILSTFQNFLGTALSSMKIDELIKK 168

Query: 412 HHASDANHQAEDGKQVVEQTIRAMNELSEKISASCANIEALNSRTVNIGQILEVIKGIS 470
+ +E K +E N+L + +++ N+ + + + +G +L K ++
Sbjct: 169 QKSGGNVSSSELAKASIELI----NQLVDTVASLNNNVNSFSQQLNTLGSVLSNTKHLN 223


123  PAKAF_04617  PAKAF_04624  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias   Product
PAKAF_04617  -1      10       1.130869  probable major facilitator superfamily (MFS)
PAKAF_04618  0       10       0.592453  xenobiotic reductase
PAKAF_04619  -2      11       0.337489  ferrous iron-sensing transcriptional regulator
PAKAF_04620  -1      9        0.853334  probable ferrous iron transport protein
PAKAF_04621  -1      12       1.670295  ferrous iron transporter A
PAKAF_04622  -1      12       1.856162  ATPase
PAKAF_04623  -1      13       2.786480  hypothetical protein
PAKAF_04624  0       9        1.729750  probable oxidoreductase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_04617  TCRTETB  50     8e-09    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 49.9 bits (119), Expect = 8e-09
Identities = 32/155 (20%), Positives = 66/155 (42%), Gaps = 2/155 (1%)

Query: 26 LPQVAGDLRVSIPSAGWLISGYAFAVAFGAPLMAMATARLERKKALLALMGIFIVGNLLC 85
LP +A D S W+ + + + G + + +L K+ LL + I G+++
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96

Query: 86 AVAANY-GLLMLARIVTALCHGAFFGIGSVVAASLVAPNRRASAVALMFTGLTLANVLGV 144
V ++ LL++AR + AF + VV A + R A L+ + + + +G
Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156

Query: 145 PLGTALGQEAGWRATFWVVTLIGVVAFVGLARVLP 179
+G + W ++ +I ++ L ++L
Sbjct: 157 AIGGMIAHYIHWSYLL-LIPMITIITVPFLMKLLK 190


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
PAKAF_04620  TCRTETOQM  35     0.001    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 35.2 bits (81), Expect = 0.001
Identities = 40/179 (22%), Positives = 69/179 (38%), Gaps = 55/179 (30%)

Query: 1 MTALTLGLIGNPNSGKTTLFNQL---TGSRQRVGNW-AGVTV------ERKEG------- 43
M + +G++ + ++GKTTL L +G+ +G+ G T ER+ G
Sbjct: 1 MKIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGI 60

Query: 44 -AFHTARHAVRLVDLPGTYSLTSVSAQASLDEQIACRYIASGEVDVLVNVVDAANL---- 98
+F V ++D PG + EV ++V+D A L
Sbjct: 61 TSFQWENTKVNIIDTPGHMDFLA-------------------EVYRSLSVLDGAILLISA 101

Query: 99 -----ERNLYLTVQLREMGIPCIVALNMLDIARSQRIRIDIDGLAR----RLGCPVVPL 148
+ L LR+MGIP I +N +D + ID+ + + +L +V
Sbjct: 102 KDGVQAQTRILFHALRKMGIPTIFFINKID-----QNGIDLSTVYQDIKEKLSAEIVIK 155


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
PAKAF_04622  GPOSANCHOR  30     0.009    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 30.4 bits (68), Expect = 0.009
Identities = 22/111 (19%), Positives = 40/111 (36%)

Query: 95 EEAAGRLDDIRGKVVASESSVTSEREALRLQVKQLQEKLGSQERQQADVSNQFGGQGKRL 154
E+A + A ++ +E+ AL + +L++ L S +
Sbjct: 231 EKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEK 290

Query: 155 DQLASDLKAQQESAAQLVAQLDGKLQTLAAEQEKLKALQVELGKTNEQLKA 205
L ++ + + L A + L A +E K L+ E K EQ K
Sbjct: 291 AALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKI 341


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04624  NUCEPIMERASE  109    1e-29    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 109 bits (274), Expect = 1e-29
Identities = 81/362 (22%), Positives = 130/362 (35%), Gaps = 68/362 (18%)

Query: 1 MRILVTGATGFIGGRFARFALEQGLSVRV---------SGRRADAVEHLVARGAEFVPGD 51
M+ LVTGA GFIG ++ LE G V + +E L G +F D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 52 LADPALVLRLCED--VEAVVHCAGAVGV---WGPRERFLAANVGLAESVVEACMRQKVRR 106
LAD + L E V + V + +N+ +++E C K++
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 107 LVHLSSPSIYFDGRDHLDLNEEYVPRRFSDHYGATKYQAEQLVLSARDL-GLEVLALRPR 165
L++ SS S+Y R + + + Y ATK E + + L GL LR
Sbjct: 121 LLYASSSSVYGLNR-KMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLR-F 178

Query: 166 FVV----GAGDTSIFPRMIQAHRKGR-LRILGNGLNRVDFTSVHNLNDALFSCL------ 214
F V G D ++F + +A +G+ + + G + DFT + ++ +A+
Sbjct: 179 FTVYGPWGRPDMALF-KFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHA 237

Query: 215 ------LAGEPALG----KVYNISNGQPVPFWDAVNYVMRQLDLPPVGGHLPYAVGYGLA 264
G PA +VYNI N PV D + + L + LP
Sbjct: 238 DTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPL------- 290

Query: 265 ALNEGVCRILPGRPEPVLFRLGMAVMAKNFTLDINRAREYLDYDPRVSLWTALDEFCAWW 324
+P VL D E + + P ++ + F W+
Sbjct: 291 ------------QPGDVLETSA----------DTKALYEVIGFTPETTVKDGVKNFVNWY 328

Query: 325 RA 326
R
Sbjct: 329 RD 330


124  PAKAF_04708  PAKAF_04714  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_04708  -1      10       -2.176221  alpha/beta hydrolase
PAKAF_04709  0       10       -1.866523  DUF1043 family protein
PAKAF_04710  1       9        -2.011690  ATP sulfurylase GTP-binding subunit/APS kinase
PAKAF_04711  2       10       -1.293738  ATP sulfurylase small subunit
PAKAF_04712  2       10       -0.609684  soluble and membrane-bound lytic
PAKAF_04713  4       12       -0.619317  Nif3-like dinuclear metal center hexameric
PAKAF_04714  4       12       -0.888816  AlgW protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
PAKAF_04708  FLAGELLIN  29     0.008    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 29.2 bits (65), Expect = 0.008
Identities = 13/42 (30%), Positives = 20/42 (47%), Gaps = 5/42 (11%)

Query: 12 SARD--AGLA---TLRFNFRGVGQSAGSYGEGIGEIDDAEAA 48
SA+D AG A N +G+ Q++ + +GI E A
Sbjct: 39 SAKDDAAGQAIANRFTSNIKGLTQASRNANDGISIAQTTEGA 80


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
PAKAF_04710  TCRTETOQM  68     5e-14    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 68.0 bits (166), Expect = 5e-14
Identities = 53/150 (35%), Positives = 67/150 (44%), Gaps = 17/150 (11%)

Query: 33 VDDGKSTLIGRLLHDSKMIYEDHLEAITRDSKKVGTTGDDVDLALLVDGLQAEREQGITI 92
VD GK+TL LL++S I E K T D+ L ER++GITI
Sbjct: 12 VDAGKTTLTESLLYNSGAITE------LGSVDKGTTRTDNTLL---------ERQRGITI 56

Query: 93 DVAYRYFSTAKRKFIIADTPGHEQYTRNMATGASTCDLAIILIDARYGVQTQTRRHSFIA 152
F K I DTPGH + + S D AI+LI A+ GVQ QTR
Sbjct: 57 QTGITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHAL 116

Query: 153 SLLGIRHIVVAINKMDLKDFD-QGVFEQIK 181
+GI I INK+D D V++ IK
Sbjct: 117 RKMGIPTIFF-INKIDQNGIDLSTVYQDIK 145


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
PAKAF_04711  TCRTETOQM  28     0.046    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 28.3 bits (63), Expect = 0.046
Identities = 17/90 (18%), Positives = 33/90 (36%), Gaps = 14/90 (15%)

Query: 94 GVAQG-INPFTHGSAKHTDVMKTEGLKQALDKYGFDAAFGGARRDEEKSRAKERVYSFRD 152
+ P HGSAK G+ ++ F + + +++ F
Sbjct: 207 RFHNCSLFPVYHGSAK-----NNIGIDNLIE--VITNKFYSS---THRGQSELCGKVF-- 254

Query: 153 SKHRWDPKNQRPELWNIYNGKVKKGESIRV 182
K + K QR +Y+G + +S+R+
Sbjct: 255 -KIEYSEKRQRLAYIRLYSGVLHLRDSVRI 283


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
PAKAF_04714  V8PROTEASE  61     2e-12    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 60.8 bits (147), Expect = 2e-12
Identities = 33/163 (20%), Positives = 52/163 (31%), Gaps = 35/163 (21%)

Query: 118 LLTNNHVTAGADQIIVALR------------DGRETIAQLVGSDPETDLAVLKIDL---- 161
LLTN HV AL+ +G T Q+ E DLA++K
Sbjct: 114 LLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEGDLAIVKFSPNEQN 173

Query: 162 ----KNLPAMTLGRSDGIRTGDVCLAIGNPFGVGQTVTMGIISATGRNQLGLNTYEDFIQ 217
+ + T+ + + G P TM + G+ L +Q
Sbjct: 174 KHIGEVVKPATMSNNAETQVNQNITVTGYPGD-KPVATMW--ESKGK-ITYLKGE--AMQ 227

Query: 218 TDAAINPGNSGGALVDAAGNLIGINTAIFSKSGGSQGIGFAIP 260
D + GNSG + + +IGI+ G+
Sbjct: 228 YDLSTTGGNSGSPVFNEKNEVIGIHWG---------GVPNEFN 261


125  PAKAF_04795  PAKAF_04805  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_04795  0       22       -6.650640  nicotinate-nucleotide pyrophosphorylase
PAKAF_05939  0       24       -7.592283  *type 4 fimbrial precursor PilA
PAKAF_04797  1       19       -6.354892  type 4 fimbrial biogenesis protein PilB
PAKAF_04798  2       20       -5.548100  type 4 fimbrial biogenesis protein PilC
PAKAF_04799  2       14       -1.263507  type 4 prepilin peptidase PilD
PAKAF_04800  0       17       -0.936381  dephosphocoenzyme A kinase
PAKAF_04801  2       17       -1.135914  DNA gyrase inhibitor YacG
PAKAF_04802  0       17       -1.161029  hypothetical protein
PAKAF_04803  0       15       -0.848879  membrane protein
PAKAF_04804  -4      13       -0.330661  hypothetical protein
PAKAF_04805  -2      14       -0.934138  GNAT family N-acetyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
PAKAF_04795  RTXTOXIND  29     0.021    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.0 bits (65), Expect = 0.021
Identities = 23/141 (16%), Positives = 45/141 (31%), Gaps = 4/141 (2%)

Query: 75 QVEDGQRVEPNQMLFQLKGP-ARALLTGERSALNFLQLLSGTATRSQHYADLVAGTAVKL 133
V++G+ V +L +L A A +S+L +L TR Q + + +
Sbjct: 111 IVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARL---EQTRYQILSRSIELNKLPE 167

Query: 134 LDTRKTLPGLRLAQKYAVTCGGCHNHRIGLYDAFLIKENHIAACGGIDRAIAEARRIAPG 193
L ++++ + + + ++ +R AR
Sbjct: 168 LKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYE 227

Query: 194 KPVEVEVENLDELRQALEAGA 214
VE LD+ L A
Sbjct: 228 NLSRVEKSRLDDFSSLLHKQA 248


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_05939  BCTERIALGSPG  47     1e-09    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 47.2 bits (112), Expect = 1e-09
Identities = 16/54 (29%), Positives = 34/54 (62%)

Query: 1 MKAQKGFTLIELMIVVAIIGILAAIAIPQYQNYVARSEGASALASVNPLKTTVE 54
Q+GFTL+E+M+V+ IIG+LA++ +P +++ A++ + L+ ++
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALD 57


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04798  BCTERIALGSPF  466    e-167    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 466 bits (1202), Expect = e-167
Identities = 119/382 (31%), Positives = 209/382 (54%), Gaps = 14/382 (3%)

Query: 3 VKAHLRKQGINPLKVR-------KKGISLLGA--GKKVKPMDIALFTRQMATMMGAGVPL 53
+ LR++G+ PL V K G + L ++ D+AL TRQ+AT++ A +PL
Sbjct: 28 ARQLLRERGLVPLSVDENRGDQQKSGSTGLSLRRKIRLSTSDLALLTRQLATLVAASMPL 87

Query: 54 LQSFDIIGEGFDNPNMRKLVDEIKQEVSSGNSLANSLRKKPQYFDELYCNLVDAGEQSGA 113
++ D + + + P++ +L+ ++ +V G+SLA++++ P F+ LYC +V AGE SG
Sbjct: 88 EEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLADAMKCFPGSFERLYCAMVAAGETSGH 147

Query: 114 LENLLDRVATYKEKTESLKAKIKKAMTYPIAVIIVALIVSAILLIKVVPQFQSVFEGFGA 173
L+ +L+R+A Y E+ + ++++I++AM YP + +VA+ V +ILL VVP+ F
Sbjct: 148 LDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLTVVAIAVVSILLSVVVPKVVEQFIHMKQ 207

Query: 174 ELPAFTQMIVNLSEFMQEW--WFFIILAIAIFGFAFKELHKRSQKFRDTLDRTILKLPIF 231
LP T++++ +S+ ++ + W + L F + R +K R + R +L LP+
Sbjct: 208 ALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMAF---RVMLRQEKRRVSFHRRLLHLPLI 264

Query: 232 GGIVYKSAVARYARTLSTTFAAGVPLVDALDSVSGATGNIVFKNAVSKIKQDVSTGMQLN 291
G I ARYARTLS A+ VPL+ A+ N ++ +S V G+ L+
Sbjct: 265 GRIARGLNTARYARTLSILNASAVPLLQAMRISGDVMSNDYARHRLSLATDAVREGVSLH 324

Query: 292 FSMRTTSVFPNMAIQMTAIGEESGSLDEMLSKVASYYEEEVDNAVDNLTTLMEPMIMAVL 351
++ T++FP M M A GE SG LD ML + A + E + + L EP+++ +
Sbjct: 325 KALEQTALFPPMMRHMIASGERSGELDSMLERAADNQDREFSSQMTLALGLFEPLLVVSM 384

Query: 352 GVLVGGLIVAMYLPIFQLGNVV 373
+V +++A+ PI QL ++
Sbjct: 385 AAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04799  PREPILNPTASE  354    e-125    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 354 bits (909), Expect = e-125
Identities = 165/283 (58%), Positives = 195/283 (68%), Gaps = 1/283 (0%)

Query: 3 LLDYLASHPLAFVLCAILLGLLVGSFLNVVVHRLPKMMERNWKAEAREALGLEPE-PKQA 61
LL+ P + L L++GSFLNVV+HRLP M+ER W+AE R + E +
Sbjct: 4 LLELAHGLPWLYFSLVFLFSLMIGSFLNVVIHRLPIMLEREWQAEYRSYFNPDDEGVDEP 63

Query: 62 TYNLVLPNSACPRCGHEIRPWENIPLVSYLALGGKCSSCKAAIGKRYPLVELATALLSGY 121
YNL++P S CP C H I ENIPL+S+L L G+C C+A I RYPLVEL TALLS
Sbjct: 64 PYNLMVPRSCCPHCNHPITALENIPLLSWLWLRGRCRGCQAPISARYPLVELLTALLSVA 123

Query: 122 VAWHFGFTWQAGAMLLLTWGLLAMSLIDADHQLLPDVLVLPLLWLGLIANHFGLFASLDD 181
VA W A LLLTW L+A++ ID D LLPD L LPLLW GL+ N G F SL D
Sbjct: 124 VAMTLAPGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGD 183

Query: 182 ALFGAVFGYLSLWSVFWLFKLVTGKEGMGYGDFKLLAMLGAWGGWQILPLTILLSSLVGA 241
A+ GA+ GYL LWS++W FKL+TGKEGMGYGDFKLLA LGAW GWQ LP+ +LLSSLVGA
Sbjct: 184 AVIGAMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGA 243

Query: 242 ILGVIMLRLRNAESGTPIPFGPYLAIAGWIALLWGDQITRTYL 284
+G+ ++ LRN PIPFGPYLAIAGWIALLWGD ITR YL
Sbjct: 244 FMGIGLILLRNHHQSKPIPFGPYLAIAGWIALLWGDSITRWYL 286


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04800  DHBDHDRGNASE  30     0.005    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 30.0 bits (67), Expect = 0.005
Identities = 23/88 (26%), Positives = 32/88 (36%), Gaps = 11/88 (12%)

Query: 5 WILGLTGGIGSGKSAAAEHFISLGVHLVDADHAARW--VVEPGRPALAKIVERFGDGILL 62
+I G GIG A A S G H+ D+ V A A+ E F
Sbjct: 12 FITGAAQGIGE---AVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF------ 62

Query: 63 PDGQLDRAALRERIFQAPEERRWLEQLL 90
P D AA+ E + E ++ L+
Sbjct: 63 PADVRDSAAIDEITARIEREMGPIDILV 90


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04805  SACTRNSFRASE  28     0.013    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 27.6 bits (61), Expect = 0.013
Identities = 5/25 (20%), Positives = 10/25 (40%)

Query: 66 RRGYLQHLVVDPGYRGLGLARRMLD 90
++ + V YR G+ +L
Sbjct: 88 GYALIEDIAVAKDYRKKGVGTALLH 112


126  PAKAF_04811  PAKAF_04833  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_04811  -3      11       -0.844890  ShlB/FhaC/HecB family hemolysin
PAKAF_04812  -2      11       -0.811428  filamentous hemagglutinin N-terminal
PAKAF_04816  0       9        -0.551228  ***ClpB protein
PAKAF_04817  -2      13       0.171097   peptidoglycan editing factor PgeF
PAKAF_04818  -2      12       -0.505038  pseudouridine synthase
PAKAF_04819  0       13       -0.771265  competence protein ComL
PAKAF_04820  0       14       -0.991009  MYND finger
PAKAF_04821  1       12       -1.337278  two-component sensor PilS
PAKAF_04822  0       12       -2.173263  two-component response regulator PilR
PAKAF_04823  -1      17       -3.116715  probable D-amino acid oxidase
PAKAF_04824  1       12       -3.756730  type 4 fimbrial biogenesis protein FimT
PAKAF_04825  1       12       -3.891266  type 4 fimbrial biogenesis protein FimU
PAKAF_04826  2       11       -3.808838  type 4 fimbrial biogenesis protein PilV
PAKAF_04827  2       11       -3.637635  type 4 fimbrial biogenesis protein PilW
PAKAF_04828  2       10       -3.454617  type 4 fimbrial biogenesis protein PilX
PAKAF_04829  1       10       -3.591139  type 4 fimbrial biogenesis protein PilY1
PAKAF_04830  1       11       -2.815322  type 4 fimbrial biogenesis protein PilY2
PAKAF_04831  0       10       -2.232668  type 4 fimbrial biogenesis protein PilE
PAKAF_04832  -1      8        -2.179598  LytB protein
PAKAF_04833  -1      9        -2.327917  probable peptidyl-prolyl cis-trans isomerase,
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_04811  PF00577  37     2e-04    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 36.8 bits (85), Expect = 2e-04
Identities = 32/226 (14%), Positives = 69/226 (30%), Gaps = 19/226 (8%)

Query: 256 QRYYRAAYQLPLGSRGTRIGLAHAETTYRLVRDFSRLDAHGRAITDSLFVSQPLLRSRSL 315
Y+ A G I + A+ + L V+Q L R+ +L
Sbjct: 484 SGYFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTL 543

Query: 316 SLS-TQLQYENKRLRDDQERTG-RHSRKEIRLWTASISGNAQDRLFGGGQS-----GFSL 368
LS + Y D+Q + G + ++I ++S + + G+ ++
Sbjct: 544 YLSGSHQTYWGTSNVDEQFQAGLNTAFEDINW---TLSYSLTKNAWQKGRDQMLALNVNI 600

Query: 369 AYAHGQLAIDSGEERLLDRYTIGTAGSFDKIMLNAVRLQHLGDRLQLFAQLNAQWSGGNL 428
++H + + R + ++ A L + L + ++GG
Sbjct: 601 PFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGD 660

Query: 429 DSAEQFDMG-----GPYGVRAFPLGSYKGYGDEGWQASAELRYSLA 469
++ G YG + D+ Q + +
Sbjct: 661 GNSGSTGYATLNYRGGYGN----ANIGYSHSDDIKQLYYGVSGGVL 702


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_04812  PF05860  65     1e-14    haemagglutination activity domain.
		>PF05860#haemagglutination activity domain.

Length = 117

Score = 65.2 bits (159), Expect = 1e-14
Identities = 31/93 (33%), Positives = 52/93 (55%), Gaps = 5/93 (5%)

Query: 71 TDGRHMVID---QQSHKLITNWNEFSVRADERVSFHQPGQDAVALNRVIGRNGSDIQGRI 127
T+G +I+ Q L ++ EFSV F+ P ++RV G + S+I G I
Sbjct: 17 TEGNTRIIERGTQAGSNLFHSFQEFSVPTSGTAFFNNPTNIQNIISRVTGGSVSNIDGLI 76

Query: 128 DANGK--VFLVNPNGVVFGKSAQVNVGGLVAST 158
AN +FL+NPNG++FG++A++++GG +
Sbjct: 77 RANATANLFLINPNGIIFGQNARLDIGGSFVGS 109


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
PAKAF_04816  HTHFIS  43     4e-06    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 43.3 bits (102), Expect = 4e-06
Identities = 50/266 (18%), Positives = 94/266 (35%), Gaps = 45/266 (16%)

Query: 551 MLEGEREKLLRMEQELHRRVIGQDEAVVAVSNAVRRSRAGLADPNRPSGSFLFLGPTGVG 610
+ KL Q+ ++G+ A+ + + R L + + G +G G
Sbjct: 121 EPKRRPSKLEDDSQDGMP-LVGRSAAMQEIYRVLAR----LMQTDLT---LMITGESGTG 172

Query: 611 KTELCKALAEFLFDTEEALVRIDMSEFMEKHSVARLIGAPPGYVGFEEGGYLTEAIRRKP 670
K + +AL ++ V I+M+ + L G E G T A R
Sbjct: 173 KELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGH--------EKGAFTGAQTRST 224

Query: 671 YSV-------VLLDEVEKAHPDVFNILLQVLEDG---RLTDSHGRTVDFRNTVVVMTSNL 720
+ LDE+ D LL+VL+ G + D R +V +N
Sbjct: 225 GRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVR---IVAATN- 280

Query: 721 GSAQIQELAGDREAQRAAVMDAVNAHFRPEFINRIDEVVVFEPLAREQIAGIAEIQLGRL 780
++L + ++ FR + R++ V + P R++ I ++ +
Sbjct: 281 -----KDL-------KQSINQ---GLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFV 325

Query: 781 RKRLAERELSLELSQEALDKLIAVGF 806
++ E QEAL+ + A +
Sbjct: 326 QQAEKEGLDVKRFDQEALELMKAHPW 351



Score = 34.4 bits (79), Expect = 0.002
Identities = 25/177 (14%), Positives = 59/177 (33%), Gaps = 32/177 (18%)

Query: 49 LLMQVGFDIAALRSGLNKELDALPKIQSPTGDVNLSQDLARLLNQADRLAQQKGDQFISS 108
L + G+D+ + I + GD+ ++ D+ + + +
Sbjct: 22 ALSRAGYDVRITSNAA----TLWRWIAAGDGDLVVT-DV--------VMPDENAFDLLPR 68

Query: 109 ELVLLAAMDENTRLGKLLLGQGVSRKALENAVANLRGGEA-------VNDPNVEESRQAL 161
+ + + ++ Q A+ G + +AL
Sbjct: 69 ----IKKARPDLPV-LVMSAQN----TFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119

Query: 162 DKYTVDMTKRAEEG-KLDPVIGRDDEIRRTIQVLQRRTKNN-PVLI-GEPGVGKTAI 215
+ +K ++ P++GR ++ +VL R + + ++I GE G GK +
Sbjct: 120 AEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELV 176


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
PAKAF_04820  60KDINNERMP  25     0.028    60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 25.3 bits (55), Expect = 0.028
Identities = 14/43 (32%), Positives = 23/43 (53%), Gaps = 3/43 (6%)

Query: 1 MGLFRLLFWIALIAIAFWLWRRFTR---PTPRQQQRPQDEPSA 40
M R L IAL+ ++F +W+ + + P P+ QQ Q +A
Sbjct: 1 MDSQRNLLVIALLFVSFMIWQAWEQDKNPQPQAQQTTQTTTTA 43


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
PAKAF_04822  HTHFIS  525    0.0      FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 525 bits (1355), Expect = 0.0
Identities = 176/477 (36%), Positives = 261/477 (54%), Gaps = 33/477 (6%)

Query: 1 MSRQKALIVDDEPDIRELLEITLGRMKLDTRSARNVKEARELLAREPFDLCLTDMRLPDG 60
M+ L+ DD+ IR +L L R D R N +A DL +TD+ +PD
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 SGLDLVQYIQQRHPQTPVAMITAYGSLDTAIQALKAGAFDFLTKPVDLGRLRELVATALR 120
+ DL+ I++ P PV +++A + TAI+A + GA+D+L KP DL L ++ AL
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 121 LRNPEAEEAPVDNR----LLGESPPMRALRNQIGKLARSQAPVYISGESGSGKELVARLI 176
+ D++ L+G S M+ + + +L ++ + I+GESG+GKELVAR +
Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARAL 180

Query: 177 HEQGPRIERPFVPVNCGAIPSELMESEFFGHKKGSFTGAIEDKQGLFQAASGGTLFLDEV 236
H+ G R PFV +N AIP +L+ESE FGH+KG+FTGA G F+ A GGTLFLDE+
Sbjct: 181 HDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEI 240

Query: 237 ADLPMAMQVKLLRAIQEKAVRAVGGQQEVAVDVRILCATHKDLAAEVGAGRFRQDLYYRL 296
D+PM Q +LLR +Q+ VGG+ + DVRI+ AT+KDL + G FR+DLYYRL
Sbjct: 241 GDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRL 300

Query: 297 NVIELRVPPLRERREDIPLLAERILKRLAGDTGLPAARLTGDAQEKLKNYRFPGNVRELE 356
NV+ LR+PPLR+R EDIP L +++ + GL R +A E +K + +PGNVRELE
Sbjct: 301 NVVPLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALELMKAHPWPGNVRELE 359

Query: 357 NMLERAYTLCEDDQIQPHDLRL---------ADAPGASQEGAASLSEI------------ 395
N++ R L D I + A++ G+ S+S+
Sbjct: 360 NLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASF 419

Query: 396 -------DNLEDYLEDIERKLIMQALEETRWNRTAAAQRLGLTFRSMRYRLKKLGID 445
+ L ++E LI+ AL TR N+ AA LGL ++R ++++LG+
Sbjct: 420 GDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04824  BCTERIALGSPG  33     2e-04    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 32.9 bits (75), Expect = 2e-04
Identities = 12/47 (25%), Positives = 26/47 (55%)

Query: 4 RSQRALTLTELLFALVLLGILGSLALPGMAAWLDGNRQRSVLHELSA 50
QR TL E++ +V++G+L SL +P + + ++ + ++ A
Sbjct: 5 DKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVA 51


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04825  BCTERIALGSPG  41     5e-07    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 40.6 bits (95), Expect = 5e-07
Identities = 14/45 (31%), Positives = 30/45 (66%)

Query: 8 TGFTLIELLIIVVLLAIMASFAIPNFKQLTERNELQSAAEELNAM 52
GFTL+E+++++V++ ++AS +PN E+ + Q A ++ A+
Sbjct: 8 RGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVAL 52


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04826  PilS_PF08805  30     0.003    PilS N terminal
		>PilS_PF08805#PilS N terminal

Length = 185

Score = 30.3 bits (68), Expect = 0.003
Identities = 11/58 (18%), Positives = 24/58 (41%)

Query: 3 LKSRHRSLHQSGFSMIEVLVALLLISIGVLGMIAMQGKTIQYTADSVERNKAAMLGSN 60
L +R + G +++EVL+ + +I + + S E+N + +N
Sbjct: 16 LSARRKKEQDKGATLMEVLLVVGVIVVLAASAYKLYSMVQSNIQSSNEQNNVLTVIAN 73


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04831  BCTERIALGSPG  42     1e-07    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 41.8 bits (98), Expect = 1e-07
Identities = 18/49 (36%), Positives = 31/49 (63%), Gaps = 3/49 (6%)

Query: 4 RQKGFTLLEMVVVVAVIGILLGIAIPSYQNYVIRSNRTEGQALLSDAAA 52
+Q+GFTLLE++VV+ +IG+L + +P N + + + Q +SD A
Sbjct: 6 KQRGFTLLEIMVVIVIIGVLASLVVP---NLMGNKEKADKQKAVSDIVA 51


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_04832  PF06704  28     0.033    DspF/AvrF protein
		>PF06704#DspF/AvrF protein

Length = 129

Score = 27.5 bits (61), Expect = 0.033
Identities = 10/28 (35%), Positives = 18/28 (64%), Gaps = 2/28 (7%)

Query: 194 KNDIC--YATQNRQDAVKELADQCDMVL 219
+N +C Y +Q+ + AV E+ D +MV+
Sbjct: 26 QNGVCALYDSQDNEAAVIEMPDHSEMVI 53


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04833  INFPOTNTIATR  34     1e-04    Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 33.8 bits (77), Expect = 1e-04
Identities = 21/54 (38%), Positives = 33/54 (61%), Gaps = 5/54 (9%)

Query: 8 GQESRVTLHFALKLEDGNVVDSTFDK--QPASFKVGDGNLLPGFEQALFGLKAG 59
G+ VT+ + L DG V DST +K +PA+F+V ++PG+ +AL + AG
Sbjct: 142 GKSDTVTVEYTGTLIDGTVFDST-EKAGKPATFQV--SQVIPGWTEALQLMPAG 192


127  PAKAF_04869  PAKAF_04877  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_04869  -1      11       1.057154   probable permease of ABC transporter
PAKAF_04870  -2      8        -0.256474  probable ATP-binding component of ABC
PAKAF_04871  -2      10       0.246649   probable ATP-binding component of ABC
PAKAF_04872  -2      13       0.297830   hypothetical protein
PAKAF_04873  -1      7        0.632676   probable transcriptional regulator
PAKAF_04874  -2      10       0.456328   Multidrug efflux outer membrane protein OprJ
PAKAF_04875  -2      10       0.020423   Resistance-Nodulation-Cell Division (RND)
PAKAF_04876  -1      12       0.374813   Resistance-Nodulation-Cell Division (RND)
PAKAF_04877  -1      13       -0.182937  transcriptional regulator NfxB
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_04869  PF05844  29     0.033    YopD protein
		>PF05844#YopD protein

Length = 295

Score = 28.8 bits (64), Expect = 0.033
Identities = 34/123 (27%), Positives = 51/123 (41%), Gaps = 21/123 (17%)

Query: 193 VSPGADVYSVGAALGAALTARLPGHEAQV---QVSQQVLDGLKRQTRTFTYLLAGLGIIS 249
++PGA SVG AA ++P A +QVLD R + L + + ++
Sbjct: 20 IAPGAAGRSVGTPQAAAELPQVPAARADRVELNAPRQVLDP-VRMEAAGSELDSSVELLL 78

Query: 250 LLGGGVGVMNVMLMSVAERRREIGVRMALGARQRDIRNLFLIEAVTLTAAGALSGAVLGV 309
+L +A++ RE+GV QRD N +I A SGA L +
Sbjct: 79 IL-----------FRIAQKARELGVL------QRDNENQAIIHAQKAQVDEMRSGATLMI 121

Query: 310 AAA 312
A A
Sbjct: 122 AMA 124


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_04873  HTHTETR  35     8e-05    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 34.6 bits (79), Expect = 8e-05
Identities = 21/141 (14%), Positives = 50/141 (35%), Gaps = 8/141 (5%)

Query: 7 ATMGELAELAGVSRATLNRHCGTREGL-KRRLESHARSTLERLTHSAALQRLEPREALRE 65
++GE+A+ AGV+R + H + L E + E A +P LRE
Sbjct: 32 TSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPGDPLSVLRE 91

Query: 66 LIREHL-------AQRDLLALLMFEQNPGRQAGHGDASWQSYVEALDAFFLRGQQKRVFR 118
++ L +R L+ ++ + + + ++ + + +
Sbjct: 92 ILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEA 151

Query: 119 IDISAATFSELFIVLIYGMVD 139
+ A + +++ G +
Sbjct: 152 KMLPADLMTRRAAIIMRGYIS 172


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_04875  ACRIFLAVINRP  1169   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1169 bits (3025), Expect = 0.0
Identities = 517/1028 (50%), Positives = 715/1028 (69%), Gaps = 8/1028 (0%)

Query: 1 MSEFFIKRPNFAWVVALFISLAGLLVISKLPVAQYPNVAPPQITITATYPGASAKVLVDS 60
M+ FFI+RP FAWV+A+ + +AG L I +LPVAQYP +APP ++++A YPGA A+ + D+
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTSVLEESLNGAKGLLYFESTNNSNGTAEIVVTFEPGTDPDLAQVDVQNRLKKAEARMPQ 120
VT V+E+++NG L+Y ST++S G+ I +TF+ GTDPD+AQV VQN+L+ A +PQ
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 AVLTQGLQVEQTSAGFLLIYALSYKEGAQRSDTTALGDYAARNINNELRRLPGVGKLQFF 180
V QG+ VE++S+ +L++ D + DY A N+ + L RL GVG +Q F
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDD--ISDYVASNVKDTLSRLNGVGDVQLF 178

Query: 181 SSEAAMRVWIDPQKLVGFGLSIDDVSNAIRGQNVQVPAGAFGSAPGSSAQELTATLAVKG 240
++ AMR+W+D L + L+ DV N ++ QN Q+ AG G P Q+L A++ +
Sbjct: 179 GAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQT 238

Query: 241 TLDDPQEFGQVVLRANEDGSLVRLADVARLELGKESYNISSRLNGTPTVGGAIQLSPGAN 300
+P+EFG+V LR N DGS+VRL DVAR+ELG E+YN+ +R+NG P G I+L+ GAN
Sbjct: 239 RFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGAN 298

Query: 301 AIQTATLVKQRLAELSAFFPEDMQYSVPYDTSRFVDVAIEKVIHTLIEAMVLVFLVMFLF 360
A+ TA +K +LAEL FFP+ M+ PYDT+ FV ++I +V+ TL EA++LVFLVM+LF
Sbjct: 299 ALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLF 358

Query: 361 LQNVRYTLIPSIVVPVCLLGTLMVMYLLGFSVNMMTMFGMVLAIGILVDDAIVVVENVER 420
LQN+R TLIP+I VPV LLGT ++ G+S+N +TMFGMVLAIG+LVDDAIVVVENVER
Sbjct: 359 LQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVER 418

Query: 421 IMAEEGISPAEATVKAMKQVSGAIVGITLVLSAVFLPLAFMAGSVGVIYQQFSVSLAVSI 480
+M E+ + P EAT K+M Q+ GA+VGI +VLSAVF+P+AF GS G IY+QFS+++ ++
Sbjct: 419 VMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAM 478

Query: 481 LFSGFLALTFTPALCATLLKPIPEGHHE-KRGFFGAFNRGFARVTERYSLLNSKLVARAG 539
S +AL TPALCATLLKP+ HHE K GFFG FN F Y+ K++ G
Sbjct: 479 ALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTG 538

Query: 540 RFMLVYAGLVAMLGYFYLRLPEAFVPAEDLGYMVVDVQLPPGASRVRTDATGEE-LERFL 598
R++L+YA +VA + +LRLP +F+P ED G + +QLP GA++ RT ++ + +L
Sbjct: 539 RYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYL 598

Query: 599 KSREA-VASVFLISGFSFSGQGDNAALAFPTFKDWSER-GAEQSAAAEIAALNEHFALPD 656
K+ +A V SVF ++GFSFSGQ NA +AF + K W ER G E SA A I
Sbjct: 599 KNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIR 658

Query: 657 DGTVMAVSPPPINGLGNSGGFALRLMDRSGVGREALLQARDTLLGEIQTNPKFLYAMM-E 715
DG V+ + P I LG + GF L+D++G+G +AL QAR+ LLG +P L ++
Sbjct: 659 DGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPN 718

Query: 716 GLAEAPQLRLLIDREKARALGVSFETISGTLSAAFGSEVINDFTNAGRQQRVVIQAEQGN 775
GL + Q +L +D+EKA+ALGVS I+ T+S A G +NDF + GR +++ +QA+
Sbjct: 719 GLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKF 778

Query: 776 RMTPESVLELYVPNAAGNLVPLSAFVSVKWEEGPVQLVRYNGYPSIRIVGDAAPGFSTGE 835
RM PE V +LYV +A G +VP SAF + W G +L RYNG PS+ I G+AAPG S+G+
Sbjct: 779 RMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGD 838

Query: 836 AMAEMERLAAQLPAGIGYEWTGLSYQEKVSAGQATSLFALAILVVFLLLVALYESWSIPL 895
AMA ME LA++LPAGIGY+WTG+SYQE++S QA +L A++ +VVFL L ALYESWSIP+
Sbjct: 839 AMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPV 898

Query: 896 SVMLIVPIGAIGAVLAVMVSGMSNDVYFKVGLITIIGLSAKNAILIVEFAKELWE-QGHS 954
SVML+VP+G +G +LA + NDVYF VGL+T IGLSAKNAILIVEFAK+L E +G
Sbjct: 899 SVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKG 958

Query: 955 LRDAAIEAARLRFRPIIMTSMAFILGVIPLALASGAGAASQRAIGTGVIGGMLSATFLGV 1014
+ +A + A R+R RPI+MTS+AFILGV+PLA+++GAG+ +Q A+G GV+GGM+SAT L +
Sbjct: 959 VVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAI 1018

Query: 1015 LFVPICFV 1022
FVP+ FV
Sbjct: 1019 FFVPVFFV 1026



Score = 96.1 bits (239), Expect = 4e-22
Identities = 92/506 (18%), Positives = 179/506 (35%), Gaps = 40/506 (7%)

Query: 541 FMLVYAGLVAMLGYF-YLRLPEAFVPAEDLGYMVVDVQLP-PGAS-RVRTDATGEELERF 597
F V A ++ M G L+LP A P + V V PGA + D + +E+
Sbjct: 11 FAWVLAIILMMAGALAILQLPVAQYP--TIAPPAVSVSANYPGADAQTVQDTVTQVIEQN 68

Query: 598 LKSREAVASVFLISGFSFSGQGDNAALAFPTFKDWSERGAEQSAAAEIAALNEHFALPDD 657
+ + ++ +S S S L F + D A+ ++ LP +
Sbjct: 69 MNG---IDNLMYMSSTSDSAGSVTITLTFQSGTD--PDIAQVQVQNKLQLATP--LLPQE 121

Query: 658 GTVMAVSPPPINGLGNSGGFALRLMDRSGVGREALLQARDTLLGEIQTNPKFLYAMMEGL 717
V I+ +S + + S D + +N K + + G+
Sbjct: 122 -----VQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDY----VASNVKDTLSRLNGV 172

Query: 718 AEAP------QLRLLIDREKARALGVSFETISGTLSAA---FGSEVINDFTNAGRQQRVV 768
+ +R+ +D + ++ + L + + QQ
Sbjct: 173 GDVQLFGAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNA 232

Query: 769 IQAEQGNRMTPESVLELYVP-NAAGNLVPLSAFVSVKW-EEGPVQLVRYNGYPSIRIVGD 826
Q PE ++ + N+ G++V L V+ E + R NG P+ +
Sbjct: 233 SIIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIK 292

Query: 827 AAPGFSTGEA----MAEMERLAAQLPAGIGYEWTGLSYQEKVSAGQATSLFAL--AILVV 880
A G + + A++ L P G+ + V + L AI++V
Sbjct: 293 LATGANALDTAKAIKAKLAELQPFFPQGMKVLYP-YDTTPFVQLSIHEVVKTLFEAIMLV 351

Query: 881 FLLLVALYESWSIPLSVMLIVPIGAIGAVLAVMVSGMSNDVYFKVGLITIIGLSAKNAIL 940
FL++ ++ L + VP+ +G + G S + G++ IGL +AI+
Sbjct: 352 FLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIV 411

Query: 941 IVE-FAKELWEQGHSLRDAAIEAARLRFRPIIMTSMAFILGVIPLALASGAGAASQRAIG 999
+VE + + E ++A ++ ++ +M IP+A G+ A R
Sbjct: 412 VVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFS 471

Query: 1000 TGVIGGMLSATFLGVLFVPICFVWLL 1025
++ M + + ++ P LL
Sbjct: 472 ITIVSAMALSVLVALILTPALCATLL 497


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
PAKAF_04876  RTXTOXIND  43     1e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 42.9 bits (101), Expect = 1e-06
Identities = 22/104 (21%), Positives = 41/104 (39%), Gaps = 4/104 (3%)

Query: 99 LKAAVSRAEGELARNRAVLFEAQARVRRYEPLVKIQAVSQQDFDTATADLRSAEAATRSA 158
+ A EL ++ L + ++ + + + Q V+Q + LR
Sbjct: 257 QENKYVEAVNELRVYKSQLEQIESEILSAK--EEYQLVTQLFKNEILDKLRQTTDNIGLL 314

Query: 159 QADLETARLNLGYASVTAPISGRIGRALV-TEGALVGQGEATLM 201
+L + + AP+S ++ + V TEG +V E TLM
Sbjct: 315 TLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE-TLM 357



Score = 37.1 bits (86), Expect = 1e-04
Identities = 22/199 (11%), Positives = 67/199 (33%), Gaps = 28/199 (14%)

Query: 55 PGRIEPV-RVAEVRARVAGIVVRKRFEEGADVKAGDLLFQIDP-------APLKAAVSRA 106
G++ R E++ IV +EG V+ GD+L ++ ++++ +A
Sbjct: 87 NGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQA 146

Query: 107 EGELARNRAVLFEAQARVRRY--------------EPLVKIQAVSQQDFDTATADLRSAE 152
E R + + + E ++++ ++ ++ F T E
Sbjct: 147 RLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKE 206

Query: 153 AATRSAQADLETARLNLGYASVTAPISGR---IGRALVTEGALVGQGEATLMARIQQLDP 209
+A+ T + + + +L+ + A+ + ++ + +
Sbjct: 207 LNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAI---AKHAVLEQENKYVE 263

Query: 210 IYADFTQTAAEALRLRDAL 228
+ ++ ++ +
Sbjct: 264 AVNELRVYKSQLEQIESEI 282


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_04877  HTHTETR  38     8e-06    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 38.1 bits (88), Expect = 8e-06
Identities = 19/141 (13%), Positives = 52/141 (36%), Gaps = 7/141 (4%)

Query: 24 ATLKELAEAAGVSKATLHRFCGTRDNLVQMLEDHGETVLNQIIQACDLEHAEPLEALQRL 83
+L E+A+AAGV++ ++ + +L + + E+ + ++ + ++ R
Sbjct: 32 TSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPGDPLSVLRE 91

Query: 84 IKEHL-------THRELLVFLVFQYRPDFLDPHGEGARWQSYLEALDAFFLRGQQKGVFR 136
I H+ R LL+ ++F + ++ + + +
Sbjct: 92 ILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEA 151

Query: 137 IDITAAVFTELFITLVYGMVD 157
+ A + T ++ G +
Sbjct: 152 KMLPADLMTRRAAIIMRGYIS 172


128  PAKAF_05082  PAKAF_05088  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias   Product
PAKAF_05082  0       10       1.722808  PmrA: two-component regulator system response
PAKAF_05083  0       10       1.500086  PmrB: two-component regulator system signal
PAKAF_05084  -1      10       1.791017  CueR
PAKAF_05086  0       11       1.911556  EamA family transporter
PAKAF_05087  -2      10       1.735529  serine protein kinase RIO
PAKAF_05088  -2      12       2.497583  cyclic di-GMP phosphodiesterase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
PAKAF_05082HTHFIS802e-19 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.9 bits (197), Expect = 2e-19
Identities = 34/123 (27%), Positives = 58/123 (47%), Gaps = 1/123 (0%)

Query: 2 RILLAEDDLLLGDGIRAGLRLEGDTVEWVTDGVAAENALVTDEFDLLVLDIGLPRRSGLD 61
IL+A+DD + + L G V ++ + + DL+V D+ +P + D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 ILRNLRHQGRLTPVLLLTARDKVADRVAGLDSGADDYLTKPFDLDELQARV-RALTRRTT 120
+L ++ PVL+++A++ + + GA DYL KPFDL EL + RAL
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 121 GRA 123
+
Sbjct: 125 RPS 127


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_05083  PF06580  34     0.002    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 33.7 bits (77), Expect = 0.002
Identities = 15/81 (18%), Positives = 31/81 (38%), Gaps = 20/81 (24%)

Query: 360 LVGNALRY----TPAGGQVEIRVENRAQHAVLRVRDNGPGVALEEQQAIFTRFYRSPATS 415
LV N +++ P GG++ ++ L V + G +
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLA----------------LKN 306

Query: 416 SGEGSGLGLPIVKRIVELHFG 436
+ E +G GL V+ +++ +G
Sbjct: 307 TKESTGTGLQNVRERLQMLYG 327


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_05084  PF07675  30     0.002    Cleaved Adhesin
		>PF07675#Cleaved Adhesin

Length = 1358

Score = 30.4 bits (68), Expect = 0.002
Identities = 17/82 (20%), Positives = 35/82 (42%), Gaps = 2/82 (2%)

Query: 3 IGEAAKKSGLTPKMIRYYESIELLRPAGRSASGYRHYNENDLHTLAFIRRSRDLGFSLDE 62
G + + +G P+ + +++L PAG +RHYN +DL+ + +G S
Sbjct: 933 FGLSTEANGAKPQSVWIERTVDL--PAGTKYVAFRHYNCSDLNYILLDDIQFTMGGSPTP 990

Query: 63 VGKLLTLWQDRQRASADVKALA 84
T+++D + +
Sbjct: 991 TDYTYTVYRDGTKIKEGLTETT 1012


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
PAKAF_05087  IGASERPTASE  31     0.008    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.8 bits (69), Expect = 0.008
Identities = 15/75 (20%), Positives = 28/75 (37%), Gaps = 4/75 (5%)

Query: 218 PELRQTRYAKEMWALYEAGELTAETPLSGTFVEAEEAADVRAVLREIEAAQREEARRQAL 277
+ AKE + +A T E SG+ E +E +E ++EE +
Sbjct: 1061 TTAQNREVAKEAKSNVKANTQTNEVAQSGS--ETKETQ--TTETKETATVEKEEKAKVET 1116

Query: 278 RQADDAPRGEREEPP 292
+ + P+ + P
Sbjct: 1117 EKTQEVPKVTSQVSP 1131


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
PAKAF_05088  HTHFIS  74     2e-16    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 74.1 bits (182), Expect = 2e-16
Identities = 29/120 (24%), Positives = 52/120 (43%), Gaps = 6/120 (5%)

Query: 13 VLVVDDTPDNLLLMRELLE-EQYRVRTAGSGPAGLRAAVEEPRPDLILLDVNMPGMDGYE 71
+LV DD ++ + L Y VR + R + DL++ DV MP + ++
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRW-IAAGDGDLVVTDVVMPDENAFD 64

Query: 72 VCRRLKA-DPLTRDIPLMFLTARADRDDEQQGLALGAVDYLGKPVSPPIVLARVRTHLQL 130
+ R+K P D+P++ ++A+ + GA DYL KP ++ + L
Sbjct: 65 LLPRIKKARP---DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


129  PAKAF_05173  PAKAF_05179  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_05173  -1      18        3.224627  probable ATP-binding component of ABC
PAKAF_05174   2      20        2.668635  GNAT family N-acetyltransferase
PAKAF_05175   1      16        1.785571  urease accessory protein
PAKAF_05176   2      14        0.635086  urease gamma subunit
PAKAF_05177   1      13        1.410921  L-methionine sulfoximine N-acetyltransferase
PAKAF_05178   2      10        1.066616  urease beta subunit
PAKAF_05179   0      11        0.987177  urease alpha subunit
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_05173  PF05272  28     0.045    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 27.7 bits (61), Expect = 0.045
Identities = 13/37 (35%), Positives = 19/37 (51%)

Query: 14 SHILRGLSFEAKVGEVTCLLGRNGVGKTTLLRCLMGL 50
H+ R + K L G G+GK+TL+ L+GL
Sbjct: 583 GHVARVMEPGCKFDYSVVLEGTGGIGKSTLINTLVGL 619


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_05174  SACTRNSFRASE  37     8e-06    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 36.8 bits (85), Expect = 8e-06
Identities = 16/74 (21%), Positives = 34/74 (45%), Gaps = 1/74 (1%)

Query: 57 DGQPVGLLVTRETADGFL-VDNLAVLPECKGQGIGRQLLERAERDATSLGYRSLYLYTNE 115
+ +G + R +G+ ++++AV + + +G+G LL +A A + L L T +
Sbjct: 73 ENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQD 132

Query: 116 RMTENIELYARVGY 129
YA+ +
Sbjct: 133 INISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_05177  SACTRNSFRASE  40     8e-07    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 40.3 bits (94), Expect = 8e-07
Identities = 15/63 (23%), Positives = 26/63 (41%), Gaps = 1/63 (1%)

Query: 81 RGTVEHSVYVRDDQRGKGLGVQLLLALIERARAQGLHVMVAAIESGNAASIGLHRRLGFE 140
+E + V D R KG+G LL IE A+ ++ + N ++ + + F
Sbjct: 89 YALIED-IAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147

Query: 141 ISG 143
I
Sbjct: 148 IGA 150


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
PAKAF_05179  UREASE  1096   0.0      Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 1096 bits (2837), Expect = 0.0
Identities = 423/567 (74%), Positives = 479/567 (84%), Gaps = 2/567 (0%)

Query: 2 KISRQAYADMFGPTVGDRVRLADTDLWIEVERDFTVYGEEVKFGGGKVIRDGMGQSQL-G 60
++SR AYA+MFGPTVGD+VRLADT+L+IEVE+DFT +GEEVKFGGGKVIRDGMGQSQ+
Sbjct: 4 RMSRAAYANMFGPTVGDKVRLADTELFIEVEKDFTTHGEEVKFGGGKVIRDGMGQSQVTR 63

Query: 61 AAQVVDTVITNALILDHWGVVKADVGLKDGRIQAIGKAGNPDIQPGVNIAIGAGTEVIAG 120
VDTVITNALILDHWG+VKAD+GLKDGRI AIGKAGNPD+QPGV I +G GTEVIAG
Sbjct: 64 EGGAVDTVITNALILDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAG 123

Query: 121 EGMILTAGGIDTHIHFICPQQIEEALMSGVTTMIGGGTGPAAGTNATTCTSGPWHMARML 180
EG I+TAGG+D+HIHFICPQQIEEALMSG+T M+GGGTGPA GT ATTCT GPWH+ARM+
Sbjct: 124 EGKIVTAGGMDSHIHFICPQQIEEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHIARMI 183

Query: 181 QAADAFPMNIGFTGKGNASLPLPLEEQVLAGAIGLKLHEDWGSTPAAIDNCLEVAERHDI 240
+AADAFPMN+ F GKGNASLP L E VL GA LKLHEDWG+TPAAID CL VA+ +D+
Sbjct: 184 EAADAFPMNLAFAGKGNASLPGALVEMVLGGATSLKLHEDWGTTPAAIDCCLSVADEYDV 243

Query: 241 QVAIHTDTLNESGFVETTLGAFKGRTIHTYHTEGAGGGHAPDIIKACGFANVLPSSTNPT 300
QV IHTDTLNESGFVE T+ A KGRTIH YHTEGAGGGHAPDII+ CG NV+PSSTNPT
Sbjct: 244 QVMIHTDTLNESGFVEDTIAAIKGRTIHAYHTEGAGGGHAPDIIRICGQPNVIPSSTNPT 303

Query: 301 RPFTRNTIDEHLDMLMVCHHLDPAIAEDVAFAESRIRRETIAAEDILHDLGAFSMISSDS 360
RP+T NT+ EHLDMLMVCHHL P I ED+AFAESRIR+ETIAAEDILHD+GAFS+ISSDS
Sbjct: 304 RPYTVNTLAEHLDMLMVCHHLSPTIPEDIAFAESRIRKETIAAEDILHDIGAFSIISSDS 363

Query: 361 QAMGRVGEVITRTWQTADKMKRQRGRLDGDGARNDNFRARRYIAKYTINPAITHGISHEV 420
QAMGRVGEV RTWQTADKMKRQRGRL + NDNFR +RYIAKYTINPAI HG+SHE+
Sbjct: 364 QAMGRVGEVAIRTWQTADKMKRQRGRLKEETGDNDNFRVKRYIAKYTINPAIAHGLSHEI 423

Query: 421 GSVEAGKWADLVLWRPAFFGVKPSLILKGGAIAASLMGDINGSIPTPQPVHYRPMFASYA 480
GS+E GK ADLVLW PAFFGVKP ++L GG IAA+ MGD N SIPTPQPVHYRPMF +Y
Sbjct: 424 GSLEVGKRADLVLWNPAFFGVKPDMVLLGGTIAAAPMGDPNASIPTPQPVHYRPMFGAYG 483

Query: 481 GSRHATSLTFVSQAAFAAGVPQQLGLRKAIGVVSGCR-GVQKTDLIHNGYLPTIEVDAQN 539
SR +S+TFVSQA+ AG+ +LG+ K + V R G+ K +IHN P IEVD +
Sbjct: 484 RSRTNSSVTFVSQASLDAGLAGRLGVAKELVAVQNTRGGIGKASMIHNSLTPHIEVDPET 543

Query: 540 YQVRADGQLLWCEPADVLPMAQRYFLF 566
Y+VRADG+LL CEPA VLPMAQRYFLF
Sbjct: 544 YEVRADGELLTCEPATVLPMAQRYFLF 570


130  PAKAF_05477  PAKAF_05486  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_05477  -2      14        0.660478  probable outer membrane protein precursor
PAKAF_05478  -2      14       -0.345304  multidrug resistance protein
PAKAF_05479  -2      12        0.007881  drug efflux transporter
PAKAF_05481  -2      12        0.378066  *dTDP-D-glucose 4,6-dehydratase
PAKAF_05482  -1      11       -0.056837  dTDP-4-dehydrorhamnose reductase
PAKAF_05483  -1      14       -0.584421  glucose-1-phosphate thymidylyltransferase
PAKAF_05484   0      12       -0.765437  dTDP-4-dehydrorhamnose 3,5-epimerase
PAKAF_05485   0      12       -1.086306  DctB
PAKAF_05486   1      14       -1.760070  DctC
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
PAKAF_05477  RTXTOXIND  34     0.001    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 34.0 bits (78), Expect = 0.001
Identities = 27/213 (12%), Positives = 50/213 (23%), Gaps = 26/213 (12%)

Query: 79 EALQGTPDLQIAEARARQAAATAQAQDAARQPTLDAKASYSGIRAPTSVAPAPLGGRYSA 138
AL D ++ QA + K +
Sbjct: 128 TALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRL 187

Query: 139 IKYLSLGFNYDFDLWGGERAAWEAALGQANAARIDSQAARIGLSASIARAYSDLAHAFTV 198
F W ++ E L + A R+ A S L ++
Sbjct: 188 TSL----IKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSL 243

Query: 199 RD--------LAEEELKRSQRMTELSQKR------MSAGLDSKVQLQQ--------TQTQ 236
+ E+E K + + EL + S L +K + Q +
Sbjct: 244 LHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDK 303

Query: 237 LATARQQLSAAEQDIASARIALAVLLGKGPDRG 269
L + ++A + + P
Sbjct: 304 LRQTTDNIGLLTLELAKNEERQQASVIRAPVSV 336


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
PAKAF_05478  RTXTOXIND  77     2e-17    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 76.8 bits (189), Expect = 2e-17
Identities = 47/368 (12%), Positives = 105/368 (28%), Gaps = 90/368 (24%)

Query: 54 GNVVQITPQIVGTVVSIGADDGDLVRKGQELVRFDPSDADIALQRAEANLA--------- 104
G +I P V I +G+ VRKG L++ A+ + +++L
Sbjct: 94 GRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRY 153

Query: 105 -----------------------------HTVRQVRGLFSNVDGYRAEVATRKVALAKAE 135
+R + ++ + +++ L K
Sbjct: 154 QILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKR 213

Query: 136 ADYK----RRKNLADDGAISQEELAH----------ARDALDSAKASLTSSEQQLNTNRA 181
A+ R + + + L A+ A+ + + +L ++
Sbjct: 214 AERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKS 273

Query: 182 LVDDTQ---------------------ITSHPDVKAAAAQLRQ----AYLDDARSTIVAP 216
++ + + L S I AP
Sbjct: 274 QLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAP 333

Query: 217 VTGYVAKRSVQ-VGQRVQPGNALMAVVPLDQ-IWIDANFKETQLKHMRIGQPVEIRSDLY 274
V+ V + V G V LM +VP D + + A + + + +GQ I+ + +
Sbjct: 334 VSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAF 393

Query: 275 GSDV--RYSGTVDSLGVGTGSAFSLLPAQNATGNWIKIVQRVPVRIHIDPQELQKHPLRI 332
G V ++ + G ++ + + + PL
Sbjct: 394 PYTRYGYLVGKVKNINLDA-------IEDQRLGLVFNVIISIEE--NCLSTGNKNIPLSS 444

Query: 333 GLSMDVKV 340
G+++ ++
Sbjct: 445 GMAVTAEI 452


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_05479  TCRTETB  109    5e-28    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 109 bits (275), Expect = 5e-28
Identities = 74/397 (18%), Positives = 155/397 (39%), Gaps = 16/397 (4%)

Query: 17 IGLSLATFMQVLDTTIANVALPTISGNLGVSSEQGTWVITSFAVSNAIALPLTGWLARRV 76
I L + +F VL+ + NV+LP I+ + WV T+F ++ +I + G L+ ++
Sbjct: 17 IWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQL 76

Query: 77 GEVRLFIAAALLFVLASFLCGIAQSMPSLVGFRALQGFVAGPLYPITQTLLISIY-PPAK 135
G RL + ++ S + + S SL+ +P ++++ Y P
Sbjct: 77 GIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKEN 136

Query: 136 RGMALALLAMVTVVAPIAGPILGGWITDDYSWPWIFFINVPVGLFAAFVVYQQLKARPVV 195
RG A L+ + + GP +GG I W ++ I + + F++ + V
Sbjct: 137 RGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLL---KKEV 193

Query: 196 IKKAPMDYVGLIALVIGVGALQIVLDKGNDLDWFESNFIVGGALIAAIALAFFIIWEFTD 255
K D G+I + +G+ + F +++ + +++ ++ F+
Sbjct: 194 RIKGHFDIKGIILMSVGIVFFML----------FTTSYSISFLIVSVLSFLIFVKHIRKV 243

Query: 256 RHPIVNLRLFAHRNFAAGTLALVLGYAAFFGINLLLPQWLQTQMGYTATWAGLAAAPIGI 315
P V+ L + F G L + + G ++P ++ + G G
Sbjct: 244 TDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGT 303

Query: 316 LPV-FLSPLVGRYANHFDLRMLAGLSFLAMAITCFMRANFTTEVDYQHIAIVQLIMGLGV 374
+ V + G + + + ++++ F+ A+F E + I+ + + G+
Sbjct: 304 MSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVS-FLTASFLLETTSWFMTIIIVFVLGGL 362

Query: 375 AFFFMPILSILLSDLPPDQIADGSGLATFLRTLGGSF 411
+F I +I+ S L + G L F L
Sbjct: 363 SFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGT 399


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_05481  NUCEPIMERASE  176    9e-55    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 176 bits (449), Expect = 9e-55
Identities = 86/352 (24%), Positives = 136/352 (38%), Gaps = 42/352 (11%)

Query: 1 MTILVTGSAGFIGANFVLDWLALHDEPVVSLDKLT--YAGNRQNL-ASLDGDARHTFVAG 57
M LVTG+AGFIG + L + VV +D L Y + + L F
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQ-VVGIDNLNDYYDVSLKQARLELLAQPGFQFHKI 59

Query: 58 DIGDSQLVARLLAEHQPRAILNFAAESHVDRSIHGPEDFIQTNIVGTFRLLEEVRAYWGA 117
D+ D + + L A + V S+ P + +N+ G +LE R
Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHN--- 116

Query: 118 LEPEAKAAFRFLHVSTDEVYGSLAPSDPAFTENNRYEPNSPYSASKAASDHLVRAYHHTY 177
+ + L+ S+ VYG L P T+++ P S Y+A+K A++ + Y H Y
Sbjct: 117 -KIQ-----HLLYASSSSVYG-LNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLY 169

Query: 178 GLPVLTTNCSNNYGPYHFPEKLIPLVIHNALAGKPLPIYGDGQQIRDWLYVKDHCSAIRR 237
GLP YGP+ P+ + L GK + +Y G+ RD+ Y+ D AI R
Sbjct: 170 GLPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIR 229

Query: 238 VLEAGQL------------------GETYNVGGWNEKANLDVVETLCAILDQEQPRADGR 279
+ + YN+G + +D ++ L L E A
Sbjct: 230 LQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIE---AK-- 284

Query: 280 SYREQITFVKDRPGHDRRYAIDATRLERELGWKPAETFETGIRKTVRWYLDN 331
+ +PG + D L +G+ P T + G++ V WY D
Sbjct: 285 -----KNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDF 331


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_05482  NUCEPIMERASE  46     7e-08    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 46.3 bits (110), Expect = 7e-08
Identities = 32/158 (20%), Positives = 54/158 (34%), Gaps = 27/158 (17%)

Query: 3 RILLLGANGQVGWELQRALAPLGE--------------LLVCDRR-----------RADL 37
+ L+ GA G +G+ + + L G L R + DL
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 38 ADPEGLARLIRAERPQFIVNAGAYTAVDKAESDADNARLINARAVAVLAEEAAACG-AWL 96
AD EG+ L + + + + AV + + N + E L
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121

Query: 97 VHYSTDYVFDGAGSAPFAEDAPTG-PLSVYGQTKLEGE 133
++ S+ V+ PF+ D P+S+Y TK E
Sbjct: 122 LYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANE 159


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
PAKAF_05486  HTHFIS  445    e-156    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 445 bits (1147), Expect = e-156
Identities = 176/480 (36%), Positives = 240/480 (50%), Gaps = 48/480 (10%)

Query: 11 TQVLLIDDDPHLRQALRQTLDLAGLKVATLDDARQLDTAQCKDWPGVVVSDIRMPGIDGM 70
+L+ DDD +R L Q L AG V +A L +VV+D+ MP +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 71 ELLRQLHEQDADLPVILITGHGDVPLAVQAMRGGAYDFLEKPFPSDALLDSVRRALEVRR 130
+LL ++ + DLPV++++ A++A GAYD+L KPF L+ + RAL +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 131 LVLENRTLRLALAERHELHGRLIGRSAGMQRLREQVGSLAAIQADVLVLGETGAGKEVVA 190
R + + L+GRSA MQ + + L +++ GE+G GKE+VA
Sbjct: 124 -----RRPSKLEDDSQDGMP-LVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVA 177

Query: 191 RALHDLSSRRDGPFVAINAGALAESVVESELFGHEAGAFTGAQKRRIGKFEYANGGTLFL 250
RALHD RR+GPFVAIN A+ ++ESELFGHE GAFTGAQ R G+FE A GGTLFL
Sbjct: 178 RALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFL 237

Query: 251 DEIESMSLDVQVKLLRLLQERVVERLGSNQLIPLDIRIIAATKEDLRQAADQGRFRADLY 310
DEI M +D Q +LLR+LQ+ +G I D+RI+AAT +DL+Q+ +QG FR DLY
Sbjct: 238 DEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLY 297

Query: 311 YRLNVASLRIPPLRERGEDIPLLFRHFAEAGAMRYGLTPRELDAGQSARLLAYDWPGNVR 370
YRLNV LR+PPLR+R EDIP L RHF + A + GL + D + A+ WPGNVR
Sbjct: 298 YRLNVVPLRLPPLRDRAEDIPDLVRHFVQQ-AEKEGLDVKRFDQEALELMKAHPWPGNVR 356

Query: 371 ELQNAAERFAL-----------------------------------------GLGLSLDD 389
EL+N R +
Sbjct: 357 ELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYF 416

Query: 390 GVLPDAADEPHNLSAKVEAFERSLIAAELERPHNSLRSVAEALGIPRKTLHDKLRKHGLP 449
DA + E LI A L + A+ LG+ R TL K+R+ G+
Sbjct: 417 ASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476


131  PAKAF_05547  PAKAF_05555  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_05547   2      15       -1.647624  TIGR02449 family protein
PAKAF_05548   1       8       -0.231895  cell division protein ZapA
PAKAF_05550  -1       8       -0.052876  5-formyltetrahydrofolate cyclo-ligase
PAKAF_05551   0       9       -0.445085  hypothetical protein
PAKAF_05552  -1       7       -0.054651  EVE domain-containing protein
PAKAF_05553  -1       8       -0.348728  probable permease of ABC transporter
PAKAF_05554  -3       8        0.140181  probable ATP-binding/permease fusion ABC
PAKAF_05555  -2       7       -0.313958  HlyD family efflux transporter periplasmic
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
PAKAF_05547  ALARACEMASE  26     0.048    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 25.5 bits (56), Expect = 0.048
Identities = 10/34 (29%), Positives = 15/34 (44%), Gaps = 2/34 (5%)

Query: 8 PARPGHWPRPGTILYSCPITPIPKREPMEDADLQ 41
P W RPG ILY +P + + + L+
Sbjct: 199 PEAHFDWVRPGIILYG--ASPSGQWRDIANTGLR 230


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_05553  ABC2TRNSPORT  44     4e-07    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 43.8 bits (103), Expect = 4e-07
Identities = 35/149 (23%), Positives = 60/149 (40%), Gaps = 4/149 (2%)

Query: 196 AALIREREHGTVEHLLVMPLSAFEIMMAKV-WSMGLVVLVAAGLSLQWVVRGWLDVPISG 254
AA R T E +L L +I++ ++ W+ L AG+ + G+
Sbjct: 89 AAFGRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAALGYTQWL--- 145

Query: 255 SVGLFLLGAGLHLFATTSMGIFLGTVARSMPQLGLLTILVLLPLNILSGGTTPRESMPEL 314
S+ L L A S+G+ + +A S LV+ P+ LSG P + +P +
Sbjct: 146 SLLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIV 205

Query: 315 VQNIMLAAPTTHFVSLAQAILFRGAGFDI 343
Q P +H + L + I+ D+
Sbjct: 206 FQTAARFLPLSHSIDLIRPIMLGHPVVDV 234


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_05554  PF05272  32     0.013    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 32.0 bits (72), Expect = 0.013
Identities = 13/39 (33%), Positives = 17/39 (43%), Gaps = 2/39 (5%)

Query: 36 MVGLIGPDGVGKSSLLALLAGARKMQDGEIRVLDGDMRD 74
V L G G+GKS+L+ L G D G +D
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHF--DIGTGKD 634


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
PAKAF_05555  RTXTOXIND  81     6e-19    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 81.0 bits (200), Expect = 6e-19
Identities = 59/392 (15%), Positives = 126/392 (32%), Gaps = 85/392 (21%)

Query: 1 MKQESKRWLSRALIVAALLGVGVLVWQVSRPTGLGEGFASGNGRI--EATEVDVAAKLPG 58
++ R V + V E A+ NG++ ++
Sbjct: 49 IETPVSRRPRLVAYFIMGFLVIAFILSVLGQV---EIVATANGKLTHSGRSKEIKPIENS 105

Query: 59 RVAEIKVDEGDFVKAGEIVARMDTQVLEAQLAQAQAQVRQAENAKLTATSLVAQRESEKS 118
V EI V EG+ V+ G+++ ++ EA + Q+ + QA + L E K
Sbjct: 106 IVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKL 165

Query: 119 TAQAVVAQRQAELTAAQKRFTRTEALVKRNALPQQQLDDDRATLQSAQAALSAARSQV-- 176
+ + + + ++ T + ++ + Q Q L +A +++
Sbjct: 166 PELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINR 225

Query: 177 ------------------------------------ISAQAAIEAGRSQVIEAQSAIEAA 200
+ A + +SQ+ + +S I +A
Sbjct: 226 YENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSA 285

Query: 201 KASVARLQADIDD-----------------------------SLLKAPRNGRV-QYRVAQ 230
K + + S+++AP + +V Q +V
Sbjct: 286 KEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHT 345

Query: 231 PGEVLPAGGKLLNMVDLADVY-MTFFLPSMQAGRVGLGQEVRLVIDAVPDY---VIPAKV 286
G V+ L+ +V D +T + + G + +GQ + ++A P + KV
Sbjct: 346 EGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKV 405

Query: 287 SYVASVAQFTPKTVETANEREKLMFRVKARLD 318
+ +E ++R L+F V ++
Sbjct: 406 KNIN------LDAIE--DQRLGLVFNVIISIE 429


132  PAKAF_05649  PAKAF_05659  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_05649   0       9       -0.891040  acetylglutamate kinase
PAKAF_05650  -1       9       -0.767947  Sphingosine-responsive Regulator, SphR
PAKAF_05651  -1      10       -0.663966  Sphingosine-responsive Regulator, SphA
PAKAF_05652  -3      11       -0.146753  Sphingosine-responsive Regulator, SphD
PAKAF_05653  -3      12       -1.332312  Sphingosine-responsive Regulator, SphC
PAKAF_05654   0      13       -1.892124  Sphingosine-responsive Regulator, SphB
PAKAF_05655  -2      12       -1.696114  acyl-CoA thioesterase
PAKAF_05656   0      11       -0.787094  hypothetical protein
PAKAF_05657   0      13       -1.163673  orotate phosphoribosyltransferase
PAKAF_05658   1      15       -1.357519  catabolite repression control protein
PAKAF_05659  -1      16       -0.970017  DUF4870 domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_05649  CARBMTKINASE  55     8e-11    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 54.8 bits (132), Expect = 8e-11
Identities = 66/301 (21%), Positives = 116/301 (38%), Gaps = 61/301 (20%)

Query: 26 VGKTLVIKYGGNAMESEELKAGF----------ARDVVLMKAVGINPVVVHGGGPQIGDL 75
+GK +VI GGNA++ K + AR + + A G V+ HG GPQ+G L
Sbjct: 1 MGKRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSL 60

Query: 76 LKRLSIESHFIDGMRVTDAATMDVV-----------------EMVLGGQVNKDIVNLINR 118
L L +++ A MDV + + K +V +I +
Sbjct: 61 L--LHMDAG--QATYGIPAQPMDVAGAMSQGWIGYMIQQALKNELRKRGMEKKVVTIITQ 116

Query: 119 -----------HGGSAIG--LTGKDAELIRAKKLTVTRQ---------TPEMTKPEIIDI 156
+ +G + A+ + +K + ++ P ++
Sbjct: 117 TIVDKNDPAFQNPTKPVGPFYDEETAKRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEA 176

Query: 157 GHVGEVTGVNVGLLNMLVKGDFIPVIAPIGVGSNGESYNINADLVAGKVAEALKAEKLML 216
+ V G++ + G +PVI G E+ I+ DL K+AE + A+ M+
Sbjct: 177 ETIK--KLVERGVIVIASGGGGVPVILEDGEIKGVEAV-IDKDLAGEKLAEEVNADIFMI 233

Query: 217 LTNIAGLMDKQG----QVLTGLSTEQVNELIADGT-IYGGMLPKIRCALEAVQGGVTSAH 271
LT++ G G Q L + E++ + +G G M PK+ A+ ++ G A
Sbjct: 234 LTDVNGAALYYGTEKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGERAI 293

Query: 272 I 272
I
Sbjct: 294 I 294


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
PAKAF_05650  HTHFIS  29     0.029    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.0 bits (65), Expect = 0.029
Identities = 19/136 (13%), Positives = 41/136 (30%), Gaps = 18/136 (13%)

Query: 161 RALVSPAFEPLGIELIHAAPPYAGEYLRLLGPQVRFGCLHNRMAIASHWLDMRLPNHNLP 220
R + + E+I + R G L A+ + R +
Sbjct: 364 RLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENM---RQYFASFG 420

Query: 221 ALRQALALLEQESTQVHRKLDLVQAVERAIARDLSLGSQIERISAELNMSSRTLRRRLAE 280
L ++ ++ L ++ A+ + + L ++ TLR+++ E
Sbjct: 421 DALPPSGLYDRVLAEMEYPL-ILAALTAT-------RGNQIKAADLLGLNRNTLRKKIRE 472

Query: 281 HGLTFEALLEQVRRGR 296
G+ V R
Sbjct: 473 LGV-------SVYRSS 481


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
PAKAF_05652  ALARACEMASE  29     0.041    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 28.6 bits (64), Expect = 0.041
Identities = 26/147 (17%), Positives = 49/147 (33%), Gaps = 21/147 (14%)

Query: 33 IDLDRLDHNIDVVMRSVRRGGKHLRL--VEKSLPSPGLLAYIARRAGTRRLMSFHQPFLN 90
+DL L N+ VR+ H R+ V K+ + I G
Sbjct: 9 LDLQALKQNL----SIVRQAATHARVWSVVKANAYGHGIERIWSAIGATDGF-------- 56

Query: 91 HDAVAFADADILL---GKPLPVRSAELFYREHKGAFDPARQLQWLIDTPQRLRQYLALAQ 147
A+ + I L G P+ E F+ +L + + +L+
Sbjct: 57 --ALLNLEEAITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNAR- 113

Query: 148 GLGTRMRVNIELDVGLHRGGVADQAAL 174
L + + ++++ G++R G L
Sbjct: 114 -LKAPLDIYLKVNSGMNRLGFQPDRVL 139


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
PAKAF_05656  RTXTOXIND  30     0.008    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.8 bits (67), Expect = 0.008
Identities = 17/109 (15%), Positives = 36/109 (33%), Gaps = 1/109 (0%)

Query: 82 LRQRKAAQAQASSDAQLLRLYSSLEDVDRARERRLAELDGLSSVARGNLQSLKLQQANLQ 141
L + + + + L +SSL + + E + A L+ K Q ++
Sbjct: 220 LARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIE 279

Query: 142 GQAAN-QERAGRPVAQALVDQLDDLKQEEKRLQGEIGRFQKAREDAERT 189
+ + +E + LD L+Q + K E + +
Sbjct: 280 SEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQAS 328


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_05657  PF00577  28     0.041    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 27.9 bits (62), Expect = 0.041
Identities = 12/46 (26%), Positives = 23/46 (50%)

Query: 105 HGEGGTLVGAPLSGRVLIIDDVITAGTAIREVMQIIDAQGARAAGV 150
H + + +SG VL + +T G + + + ++ A GA+ A V
Sbjct: 686 HSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKV 731


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_05659  ACRIFLAVINRP  28     0.009    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 28.3 bits (63), Expect = 0.009
Identities = 7/41 (17%), Positives = 18/41 (43%)

Query: 65 FQITVALAMFVSFLLMLVVIGFFLLGLVCLAALVLTIIAGI 105
VA++ V FL + + + + + + + L I+ +
Sbjct: 872 APALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVL 912


133  PAKAF_05686  PAKAF_05691  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_05686   0      12       -0.548267  two-component response regulator PhoB
PAKAF_05687  -1      12       -0.874998  two-component sensor PhoR
PAKAF_05688  -2      11       -1.247184  HlyC/CorC family transporter
PAKAF_05689  -2      10       -1.173981  DUF4124 domain-containing protein
PAKAF_05690  -1      11       -1.927475  probable two-component response regulator
PAKAF_05691  -1      12       -2.027669  phosphate uptake regulatory protein PhoU
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
PAKAF_05686  HTHFIS  100    2e-26    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 99.5 bits (248), Expect = 2e-26
Identities = 39/124 (31%), Positives = 63/124 (50%), Gaps = 2/124 (1%)

Query: 1 MVGKTILIVDDEAPIREMIAVALEMAGYECLEAENTQQAHAVIVDRKPDLILLDWMLPGT 60
M G TIL+ DD+A IR ++ AL AGY+ N I DL++ D ++P
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 SGIELARRLKRDELTVDIPIIMLTAKGEEDNKIQGLEVGADDYITKPFSPRELVARLKAV 120
+ +L R+K+ D+P+++++A+ I+ E GA DY+ KPF EL+ +
Sbjct: 61 NAFDLLPRIKKARP--DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 121 LRRT 124
L
Sbjct: 119 LAEP 122


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_05687  PF06580  38     6e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.9 bits (88), Expect = 6e-05
Identities = 22/99 (22%), Positives = 36/99 (36%), Gaps = 25/99 (25%)

Query: 333 LVFNAVKY----TPDEGEIRIRWWADEQGAHLSVQDTGIGVDPKHLPRLTERFYRVDSSR 388
LV N +K+ P G+I ++ D L V++TG
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALK----------------- 305

Query: 389 ASNTGGTGLGLAIVKHVLIR---HRARLEISSVPGKGST 424
+ TG GL V+ L A++++S GK +
Sbjct: 306 -NTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNA 343


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
PAKAF_05690  HTHFIS  91     8e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 91.4 bits (227), Expect = 8e-23
Identities = 29/139 (20%), Positives = 63/139 (45%), Gaps = 4/139 (2%)

Query: 1 MSKVSALVVDDAPFIRDLMKKGLRDNFPGLHIEEAVNGRKAQQLLSRQNVDLILCDWEMP 60
M+ + LV DD IR ++ + L + G + N + ++ + DL++ D MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQAL--SRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMP 58

Query: 61 EMSGLELLTWCRAQENLKTTPFIMVTSRGDKENVVQAIQAGVSDYIGKPFSNDQLVAKIK 120
+ + +LL + P ++++++ ++A + G DY+ KPF +L+ I
Sbjct: 59 DENAFDLLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116

Query: 121 KALSRSGKLEALAAHAPRR 139
+AL+ + + +
Sbjct: 117 RALAEPKRRPSKLEDDSQD 135


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
PAKAF_05691  FLGHOOKAP1  28     0.033    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 28.4 bits (63), Expect = 0.033
Identities = 22/88 (25%), Positives = 41/88 (46%), Gaps = 9/88 (10%)

Query: 11 ISQQFNAELEDVRSHLLAMGGLVEKQVNDAVNALIDADSGLAQQVREIDDQINQMERNID 70
+ QF + +R +KQVN A+ A +D + A+Q+ ++DQI+++
Sbjct: 139 LVNQFKTTDQYLR--------DQDKQVNIAIGASVDQINNYAKQIASLNDQISRLTGVGA 190

Query: 71 EECVR-ILARRQPAASDLRLIISISKSV 97
+L +R S+L I+ + SV
Sbjct: 191 GASPNNLLDQRDQLVSELNQIVGVEVSV 218


134  PAKAF_05856  PAKAF_05863  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
PAKAF_05856   0      10       -1.470687  probable short-chain dehydrogenase
PAKAF_05857   0       9       -2.756088  probable transcriptional regulator
PAKAF_05858   0       9       -2.448317  YgdI/YgdR family lipoprotein
PAKAF_05859   0       9       -1.266363  hypothetical protein
PAKAF_05860   0       9       -1.124663  Tim44 domain-containing protein
PAKAF_05861  -1      11       -1.266571  probable sodium/proton antiporter
PAKAF_05862  -1      16       -0.779205  probable MFS dicarboxylate transporter
PAKAF_05863   1      19        0.257527  TonB1
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
PAKAF_05856  DHBDHDRGNASE  104    5e-29    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 104 bits (260), Expect = 5e-29
Identities = 78/261 (29%), Positives = 119/261 (45%), Gaps = 24/261 (9%)

Query: 9 GQVALISGAGSELGIGFAIARRLAREGVRLL-ITASSERIRQRAEELSACGHDVRAASAD 67
G++A I+GA GIG A+AR LA +G + + + E++ + L A A AD
Sbjct: 8 GKIAFITGAAQ--GIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 68 LTDEAQVQGLLDWAEAQWGRVDILVNNAGMAQLDSAEPFSAVEATSLRDWQLSLSRNLTS 127
+ D A + + E + G +DILVN AG+ + + + S +W+ + S N T
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRP------GLIHSLSDEEWEATFSVNSTG 119

Query: 128 AFLLTRGLLPGMRERGYGRIVNVASTTGTRGSNPGEAAYSAAKAGLVGWSMGLALEVAKS 187
F +R + M +R G IV V S AAY+++KA V ++ L LE+A+
Sbjct: 120 VFNASRSVSKYMMDRRSGSIVTVGSNPAGV-PRTSMAAYASSKAAAVMFTKCLGLELAEY 178

Query: 188 GITVNSVAPG-------WIATASSTAEER-------QAALASPSGRAGRPEEVAAAVAFL 233
I N V+PG W A E+ P + +P ++A AV FL
Sbjct: 179 NIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFL 238

Query: 234 ASPEASFVNGELLVVDGGNCL 254
S +A + L VDGG L
Sbjct: 239 VSGQAGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
PAKAF_05861  RTXTOXINA  32     0.008    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 31.9 bits (72), Expect = 0.008
Identities = 11/44 (25%), Positives = 23/44 (52%)

Query: 360 PVAVAVSAITTLLTPYLIRAADPLSQHLANAMPQRMARIFGHYG 403
PV+ V A+T +++ L + + +H+A+ M +A +G
Sbjct: 394 PVSALVGAVTGIISGILEASKQAMFEHVASKMADVIAEWEKKHG 437


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
PAKAF_05862  TCRTETA  35     6e-04    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 34.8 bits (80), Expect = 6e-04
Identities = 16/48 (33%), Positives = 26/48 (54%), Gaps = 1/48 (2%)

Query: 286 AATLFLFMLLQPIVGALSDKIGRRPILIAFGVLGTVFTYPILSTLHSV 333
A + P++GALSD+ GRRP+L+ + G Y I++T +
Sbjct: 50 ALYALMQFACAPVLGALSDRFGRRPVLLV-SLAGAAVDYAIMATAPFL 96



Score = 34.4 bits (79), Expect = 9e-04
Identities = 37/192 (19%), Positives = 74/192 (38%), Gaps = 33/192 (17%)

Query: 49 KAFFPQGDMTAQLLNTAAIFAVGFLMRPIGGWLMGIYADRKGRKAALLASVLLMCFGSLI 108
+ D+TA A++A LM+ ++G +DR GR+ LL S+ I
Sbjct: 33 RDLVHSNDVTAHYGILLALYA---LMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAI 89

Query: 109 IALTPSYETIGVAAPILLVVARLLQGLSVGGEYGTSATYLSEMANKEQR----GFFSSFQ 164
+A P +L + R++ G++ G + Y++++ + ++R GF S+
Sbjct: 90 MATAPFLW--------VLYIGRIVAGIT-GATGAVAGAYIADITDGDERARHFGFMSACF 140

Query: 165 YVTLISGQLIALAVLIVLQQTLTVEQLESWGWRVPFFIGA----LCAVVAMFLRRGMEET 220
+++G ++ + + PFF A L + FL +
Sbjct: 141 GFGMVAGPVLG-------------GLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKG 187

Query: 221 ESFSKKKEEPKE 232
E ++E
Sbjct: 188 ERRPLRREALNP 199


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
PAKAF_05863  TONBPROTEIN  115    2e-32    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 115 bits (288), Expect = 2e-32
Identities = 66/193 (34%), Positives = 93/193 (48%), Gaps = 17/193 (8%)

Query: 137 AEPTPQPPAAAPEPTPPKIEEPKPEPPKPKPVEKPKPKPKPKPKPVENAIPKAKPKPEPK 196
P A P P P EP+PEP P E P KPKPKP KPKP+P
Sbjct: 52 PADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKP--------KPKPKPV 103

Query: 197 PKPEPEPSTEASSQPSPSSAAPPPPAPTVGQSTPGAQTAPSGSQGPAGLPSGSLNDSDIK 256
K + +P + P +P + ++ + + + S + S +
Sbjct: 104 KKVQEQP------KRDVKPVESRPASPFENTAPARLTSSTATAATSKPVTSVA---SGPR 154

Query: 257 PLRMDPPVYPRMAQARGIEGRVKVLFTITSDGRIDDIQVLESVPSRMFDREVRQAMAKWR 316
L + P YP AQA IEG+VKV F +T DGR+D++Q+L + P+ MF+REV+ AM +WR
Sbjct: 155 ALSRNQPQYPARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNAMRRWR 214

Query: 317 FEPRVSGGKIVAR 329
+EP G IV
Sbjct: 215 YEPGKPGSGIVVN 227



 
Contact Sachin Pundhir for Bugs/Comments.