PredictBias

Identification of genomic and pathogenicity islands in prokaryotic genomes
 
A) Input parameters
Genome: sequence.gb
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic): (blank)
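The E-value parameter above can be read as an upper bound on the hits listed in section C. The following is a minimal sketch under that assumption; it is not PredictBias code. The names `E_VALUE_CUTOFF`, `passes_cutoff`, and `filter_hits` are hypothetical, and the three sample hits are copied from the hit tables for the first island.

```python
# Cutoff taken from the "E-value (RPSBlast)" input parameter of this report.
E_VALUE_CUTOFF = 0.05

# (locus tag, profile, E-value) copied from the hit tables of the first island.
hits = [
    ("AYJ58_RS00155", "HTHFIS", 8e-13),
    ("AYJ58_RS00160", "DNABINDINGHU", 2e-35),
    ("AYJ58_RS00185", "PF06580", 0.039),
]


def passes_cutoff(e_value, cutoff=E_VALUE_CUTOFF):
    """Assumption: a hit is kept when its E-value does not exceed the cutoff."""
    return e_value <= cutoff


def filter_hits(hit_list, cutoff=E_VALUE_CUTOFF):
    """Return only the hits whose E-value passes the cutoff."""
    return [hit for hit in hit_list if passes_cutoff(hit[2], cutoff)]


if __name__ == "__main__":
    for locus_tag, profile, e_value in filter_hits(hits):
        print(locus_tag, profile, e_value)
```

With the default cutoff all three sample hits are kept; a stricter cutoff such as `1e-5` would drop the PF06580 hit.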
 
 
C) Potential GIs and PAIs in NZ_CP015194
S.No  Start          End            Bias  Virulence  Insertion elements  Prediction
1     AYJ58_RS00155  AYJ58_RS00185  N     Y          N                   Pathogenicity Island (unbiased-composition)

LocusTag       DNBias            CDNBias  %GCBias    Product
AYJ58_RS00155  -153973748600967  15       45.670593  two-component system response regulator
AYJ58_RS00160  -153973748600967  15       44.874899  HU family DNA-binding protein
AYJ58_RS00165  -153973748600967  16       44.909001  hypothetical protein
AYJ58_RS00170  -153973748600967  18       44.559585  30S ribosomal protein S6--L-glutamate ligase
AYJ58_RS00175  -153973748600967  13       44.933275  GGDEF domain-containing response regulator
AYJ58_RS00180  -153973748600967  13       45.883627  response regulator
AYJ58_RS00185  -153973748600967  12       46.599732  PAS domain S-box protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
AYJ58_RS00155  HTHFIS  61     8e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.0 bits (148), Expect = 8e-13
Identities = 25/107 (23%), Positives = 44/107 (41%), Gaps = 3/107 (2%)

Query: 146 RVLVVDDSRMARNVIKRTIGNLGMKLITEAEDGAQAIALMKNNMFDLVITDYNMPSVDGL 205
+LV DD R V+ + + G + + A + DLV+TD MP +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 206 ALTQYIRSESQQSHIPILMVSSEANDTHLSNVSQAGVNALCDKPFEP 252
L I+ + +P+L++S++ S+ G KPF+
Sbjct: 64 DLLPRIKKA--RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDL 108



Score = 46.4 bits (110), Expect = 6e-08
Identities = 34/155 (21%), Positives = 60/155 (38%), Gaps = 6/155 (3%)

Query: 10 SILLVEPSDIQRRIIIQRLQQEGILSIQTAENIEAAKDIIARHKPDLIASAMHFDDGTAI 69
+IL+ + R ++ Q L + G ++ N IA DL+ + + D A
Sbjct: 5 TILVADDDAAIRTVLNQALSRAG-YDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 70 DLLGYLRASADCKDIQFMLVSSECRREQLEIFRQSGVVAILPKPFSADHLATALNATIDL 129
DLL ++ + D+ +++S++ + G LPKPF L + +
Sbjct: 64 DLLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 130 LSHDELDLSNFDVQDVRVLVVDDSRM--ARNVIKR 162
L + D QD LV + M V+ R
Sbjct: 122 PKRRPSKLED-DSQDGMPLVGRSAAMQEIYRVLAR 155
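Each alignment block in this report opens with a `Score = ..., Expect = ...` line followed by an `Identities = ...` line. The sketch below shows one way those two lines could be parsed to cross-check the summary tables. It assumes the standard BLAST text layout seen in this report; the helper names `parse_expect` and `parse_hsp` are hypothetical and not part of PredictBias or BLAST.

```python
import re

# Patterns for the two summary lines that open every alignment in this report.
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
IDENT_RE = re.compile(r"Identities\s*=\s*(\d+)/(\d+)\s*\((\d+)%\)")


def parse_expect(text):
    """Convert an Expect field to float; BLAST writes 1e-109 as 'e-109'."""
    if text.startswith("e"):
        text = "1" + text
    return float(text)


def parse_hsp(score_line, ident_line):
    """Extract bit score, raw score, E-value and identity from one alignment."""
    score = SCORE_RE.search(score_line)
    ident = IDENT_RE.search(ident_line)
    if score is None or ident is None:
        raise ValueError("lines do not look like a BLAST alignment summary")
    identical, aligned = int(ident.group(1)), int(ident.group(2))
    return {
        "bits": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "expect": parse_expect(score.group(3)),
        "identical": identical,
        "aligned_length": aligned,
        "percent_identity": 100.0 * identical / aligned,
    }


if __name__ == "__main__":
    hsp = parse_hsp(
        "Score = 61.0 bits (148), Expect = 8e-13",
        "Identities = 25/107 (23%), Positives = 44/107 (41%), Gaps = 3/107 (2%)",
    )
    print(hsp)
```

The `parse_expect` helper exists because one hit later in this report (AYJ58_RS01460) lists its E-value in the abbreviated form `e-109`, which `float()` alone would reject.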


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS00160  DNABINDINGHU  109    2e-35    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 109 bits (275), Expect = 2e-35
Identities = 45/88 (51%), Positives = 66/88 (75%)

Query: 2 NKTELIAKIAENADLTKVEAARALKSFEAAITESMKNGDKISIVGFGSFETATRAARTGR 61
NK +LIAK+AE +LTK ++A A+ + +A++ + G+K+ ++GFG+FE RAAR GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 62 NPQTGKEIQIAEATVPKFKAGKTLRDSV 89
NPQTG+EI+I + VP FKAGK L+D+V
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
AYJ58_RS00175  HTHFIS  61     6e-12    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.0 bits (148), Expect = 6e-12
Identities = 29/102 (28%), Positives = 48/102 (47%), Gaps = 3/102 (2%)

Query: 3 LLLIDDDEVDRTAIIRALRQSKLTFNVIEANCAFDGLNLALERHFDGILLDYLLPDANGL 62
+L+ DDD RT + +AL S+ ++V + A D ++ D ++PD N
Sbjct: 6 ILVADDDAAIRTVLNQAL--SRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 63 EVLIKLNAMTQDQTVVVMLSRYEDEKLAQRCIELGAQDFLLK 104
++L ++ D V+VM S A + E GA D+L K
Sbjct: 64 DLLPRIKKARPDLPVLVM-SAQNTFMTAIKASEKGAYDYLPK 104


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
AYJ58_RS00180  HTHFIS  48     2e-09    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 48.3 bits (115), Expect = 2e-09
Identities = 23/112 (20%), Positives = 44/112 (39%), Gaps = 10/112 (8%)

Query: 8 QQVTILLVDDDDVDYMAVQRAMRQLRLLNPLVRARDGIEALAILTSLDTIKGPYLILLDL 67
TIL+ DDD + +A+ + + + + + L++ D+
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGY--DVRITSNAATLWRWIAA----GDGDLVVTDV 55

Query: 68 NMPRMNGFEFLEQIRS-DPSLSSSVVFMLTTSSTDEDRMKAYSHHVAGYMVK 118
MP N F+ L +I+ P L V +++ +T +KA Y+ K
Sbjct: 56 VMPDENAFDLLPRIKKARPDL---PVLVMSAQNTFMTAIKASEKGAYDYLPK 104


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
AYJ58_RS00185  PF06580  30     0.039    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.8 bits (67), Expect = 0.039
Identities = 20/107 (18%), Positives = 37/107 (34%), Gaps = 22/107 (20%)

Query: 608 LVIRNLISNAIKH---HDLGTGVITVLCESTSKHYLFSVLDDGPGISSAYQNKVFEMFQT 664
++++ L+ N IKH G I + + V + G
Sbjct: 258 MLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGS---------------- 301

Query: 665 LKPRDEVEGSGLGLSLVKKTVESLGGN---IQLKSQGRGCCFYFTWP 708
L ++ E +G GL V++ ++ L G I+L + P
Sbjct: 302 LALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


2     AYJ58_RS00270  AYJ58_RS00310  N     Y          N                   Pathogenicity Island (unbiased-composition)

LocusTag       DNBias            CDNBias  %GCBias    Product
AYJ58_RS00270  -153973748600967  17       49.367351  GNAT family N-acetyltransferase
AYJ58_RS00275  -153973748600967  20       51.254622  MFS transporter
AYJ58_RS00280  -153973748600967  20       51.671522  HlyD family secretion protein
AYJ58_RS00285  -153973748600967  21       51.961170  LysR family transcriptional regulator
AYJ58_RS00290  -153973748600967  21       52.263202  large-conductance mechanosensitive channel
AYJ58_RS00295  -153973748600967  21       52.422021  antibiotic biosynthesis monooxygenase
AYJ58_RS00300  -153973748600967  22       52.421775  AcrB/AcrD/AcrF family protein
AYJ58_RS00305  -153973748600967  20       50.923295  RND transporter MFP subunit
AYJ58_RS00310  -153973748600967  20       49.985290  TolC family protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS00270  SACTRNSFRASE  38     6e-06    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 37.6 bits (87), Expect = 6e-06
Identities = 21/72 (29%), Positives = 31/72 (43%), Gaps = 5/72 (6%)

Query: 75 ASIGRVVVSPAGRGKGLAMPLMQHAIESALTTWPDAGIQIGAQDY-LKA--FYQKLGFVA 131
A I + V+ R KG+ L+ AIE A G+ + QD + A FY K F+
Sbjct: 90 ALIEDIAVAKDYRKKGVGTALLHKAIEWAKEN-HFCGLMLETQDINISACHFYAKHHFII 148

Query: 132 CS-EMYLEDGIP 142
+ + L P
Sbjct: 149 GAVDTMLYSNFP 160


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
AYJ58_RS00275  TCRTETB  128    2e-34    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 128 bits (324), Expect = 2e-34
Identities = 86/401 (21%), Positives = 169/401 (42%), Gaps = 19/401 (4%)

Query: 45 AFMAILDIQITNASMKEIQGSLGATLEEGSWISTAYLVAEMIAIPLSGWLSTGLSVRRYL 104
+F ++L+ + N S+ +I +W++TA+++ I + G LS L ++R L
Sbjct: 23 SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLL 82

Query: 105 LWTTAAFIFASVLCSMAWN-LEAMIAFRALQGFFGGALIPLAFRLILEFLPDNKRAVGMA 163
L+ F SV+ + + +I R +QG A L ++ ++P R
Sbjct: 83 LFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFG 142

Query: 164 LFGVTATFAPSIGPTLGGWLTEQFSWHYLFYINVPPGLLVMAMLAYGLEKQSVVWDKLKN 223
L G +GP +GG + W YL +P ++ L K+ V +
Sbjct: 143 LIGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPMITIITVPFLMKLLKKEVRIKG--H 198

Query: 224 VDLAGIVTMALGMGCLEVVLEEGNRKDWFGSELIRNLAIIAVVNLVLFVWIQLRRKEPLV 283
D+ GI+ M++G+ + F + + I++V++ ++FV + +P V
Sbjct: 199 FDIKGIILMSVGIVFFML----------FTTSYSISFLIVSVLSFLIFVKHIRKVTDPFV 248

Query: 284 NLRLLGKRDFVLSTVAYFLLGMALFGAIYLIPLYLSQVHDYTPLEIGGVIMWMGFPQLLV 343
+ L F++ + ++ + G + ++P + VH + EIG VI++ G +++
Sbjct: 249 DPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVII 308

Query: 344 L-PLVPKLMERFDSRYLAAFGFLMFAISYYMNSQMTADYAGPQMIASQVVRALG-QPFIL 401
+ L++R Y+ G ++S+ S + + +V LG F
Sbjct: 309 FGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFL--LETTSWFMTIIIVFVLGGLSFTK 366

Query: 402 VPIGMLATMHLKPHENASASTVLNVMRNLGGAFGIALVATL 442
I + + LK E + ++LN L GIA+V L
Sbjct: 367 TVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGL 407


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
AYJ58_RS00280  RTXTOXIND  95     1e-23    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 94.9 bits (236), Expect = 1e-23
Identities = 40/289 (13%), Positives = 94/289 (32%), Gaps = 32/289 (11%)

Query: 71 LAQLEDNQFSAKVSQAEASLASSKADLQTLAAKVELQRALITQASAGVVAAQADNTRAQQ 130
+ + + S + ++ + ++ +RA A + + + +
Sbjct: 176 FQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKS 235

Query: 131 QLSRSKKLKVSNYSSQDDVDQLQAGFDSAAARLDEAKA--------VLVAKQRELAVFN- 181
+L L ++ V + + + A L K+ +L AK+ V
Sbjct: 236 RLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQL 295

Query: 182 ------AQLDQAGSVVDQANATLELAKIQLNDTRVTAPFSGVIGKRGAM-VGQYVQPGQA 234
+L Q + L + + + + AP S + + G V +
Sbjct: 296 FKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET 355

Query: 235 LYSLVPDGAV-WITANFKETQIQHMQPGQSVQVSLDAFPDKTFIGVIDSLSPASGAKFSL 293
L +VP+ +TA + I + GQ+ + ++AFP + G + K
Sbjct: 356 LMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRY-GYLV-------GKVKN 407

Query: 294 LPAENATGNFTKIVQRIPVRIRLDLSEEEAH-----MLPGLSAVVKVDT 337
+ + +V V I ++ + + G++ ++ T
Sbjct: 408 INLDAIEDQRLGLVFN--VIISIEENCLSTGNKNIPLSSGMAVTAEIKT 454



Score = 52.9 bits (127), Expect = 8e-10
Identities = 21/128 (16%), Positives = 47/128 (36%), Gaps = 2/128 (1%)

Query: 59 VTDNQHVQKGELLAQLEDNQFSAKVSQAEASLASSKADLQTLAAKVELQRALITQASAGV 118
V + + V+KG++L +L A + ++SL ++ + L +
Sbjct: 112 VKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSR-SIELNKLPELKL 170

Query: 119 VAAQADNTRAQQQLSRSKKLKVSNYSS-QDDVDQLQAGFDSAAARLDEAKAVLVAKQREL 177
+++++ R L +S+ Q+ Q + D A A + +
Sbjct: 171 PDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLS 230

Query: 178 AVFNAQLD 185
V ++LD
Sbjct: 231 RVEKSRLD 238


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
AYJ58_RS00290  MECHCHANNEL  175    9e-60    Bacterial mechano-sensitive ion channel signature.
		>MECHCHANNEL#Bacterial mechano-sensitive ion channel signature.

Length = 136

Score = 175 bits (444), Expect = 9e-60
Identities = 87/136 (63%), Positives = 111/136 (81%), Gaps = 1/136 (0%)

Query: 1 MSLIKEFKAFASRGNVIDMAVGIIIGAAFGKIVSSFVADVIMPPIGIILGGVNFSDLSIV 60
MS+IKEF+ FA RGNV+D+AVG+IIGAAFGKIVSS VAD+IMPP+G+++GG++F ++
Sbjct: 1 MSIIKEFREFAMRGNVVDLAVGVIIGAAFGKIVSSLVADIIMPPLGLLIGGIDFKQFAVT 60

Query: 61 LQAAQGDAPSVVIAYGKFIQTIIDFTIIAFAIFMGLKAINTLKRKEEDAPKAPPAPTKEE 120
L+ AQGD P+VV+ YG FIQ + DF I+AFAIFM +K IN L RK+E+ P A PAPTKEE
Sbjct: 61 LRDAQGDIPAVVMHYGVFIQNVFDFLIVAFAIFMAIKLINKLNRKKEE-PAAAPAPTKEE 119

Query: 121 ELLSEIRDLLKAQQEK 136
LL+EIRDLLK Q +
Sbjct: 120 VLLTEIRDLLKEQNNR 135


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS00300  ACRIFLAVINRP  661    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 661 bits (1708), Expect = 0.0
Identities = 221/1078 (20%), Positives = 433/1078 (40%), Gaps = 73/1078 (6%)

Query: 9 AIKNRLLVVLALLAVIVGCVAMLPKLNLDAFPDVTNVQVTINTAAEGLAAEEVEKLISYP 68
I+ + + + +++ + +L + +P + V+++ G A+ V+ ++
Sbjct: 5 FIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQV 64

Query: 69 VESAMYALPAVTEVRSLS-RTGLSIVTVVFAEGTDIYFARQQVFEQLQAAREMIPSGVGV 127
+E M + + + S S G +T+ F GTD A+ QV +LQ A ++P V
Sbjct: 65 IEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQ 124

Query: 128 PEIGPNTSGLGQIYQYILRAEPSSGIDAAELRSLNDYLVKLIMMPVGGVTEVLSFGGDVR 187
I S + ++ + G ++ VK + + GV +V FG
Sbjct: 125 QGISVEKSSSSYLMVAGFVSD-NPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQ-Y 182

Query: 188 QYQVQVDPNKLRAYGLSMAQVTEALESNNRNAGGWFMDQGQE------QLVVRGYGMLPA 241
++ +D + L Y L+ V L+ N + +
Sbjct: 183 AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF-- 240

Query: 242 GVQGLAAIAQIPLTEDK-GTPVRVGDIAQVDFGSEIRVGAVTMTRRDESGKVQNLGEVVA 300
+ ++ L + G+ VR+ D+A+V+ G E + N
Sbjct: 241 --KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARI----------NGKPAAG 288

Query: 301 GVVLKRMGANTKATIDDIGARVSLIEQALPDGVSFEVFYDQAELVDKAVTTVRDALLMAF 360
+ GAN T I A+++ ++ P G+ YD V ++ V L A
Sbjct: 289 LGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAI 348

Query: 361 VFIVVILALFLVNIRATLLVLLSIPVSIGLALMVMSYYGLSANLMSLGGLAVAIGMLVDG 420
+ + +++ LFL N+RATL+ +++PV + +++ +G S N +++ G+ +AIG+LVD
Sbjct: 349 MLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDD 408

Query: 421 SVVMVENIFKHLTQPDRRHLAQARSRADGEIDPYHSDEDGGLQASMAVRIMLAAKEVCSP 480
++V+VEN+ + + + L A + ++
Sbjct: 409 AIVVVENVERVM-------------------------MEDKLPPKEATE--KSMSQIQGA 441

Query: 481 IFFATAIIIVVFAPLFALEGVEGKLFQPMAVSIILAMISALLVALIAVPALAVYLFK--- 537
+ ++ VF P+ G G +++ +++I+ AM ++LVALI PAL L K
Sbjct: 442 LVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVS 501

Query: 538 -------RGVVLKQSVVLAPLDAAYRKLLAATLARPKVVMLSALLMFALSLLLLPRLGTE 590
G + Y + L +L L+ A ++L RL +
Sbjct: 502 AEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSS 561

Query: 591 FVPELEEGTINLRVTLAPTASLGTSLAVAPKLEAILLEFPEVEYALSRIGAPELGGDPEP 650
F+PE ++G + L A+ + V ++ L+ + S +
Sbjct: 562 FLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNEKANVE-SVFTVNGFSFSGQA 620

Query: 651 VSNIEVYIGLKPISEWQSASSRLE--LQRLMEEKLSVFPGLLLTFSQPIATRVDELLSGV 708
+ ++ LKP E + E + R E + G ++ F+ P + EL +
Sbjct: 621 QNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGFVIPFNMP---AIVELGTAT 677

Query: 709 KAQLA-IKIFGPDLAVLSEKGQALSDLVAKIPGAV-DVSLEQVSGEAQLVVRPKRELLAR 766
I G L++ L + A+ P ++ V + AQ + +E
Sbjct: 678 GFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQA 737

Query: 767 YGISVDQVMSLVSQSIGGASAGQVIDGNARYDINVRLAAEFRQSPDAIKDLLLSGTNGAT 826
G+S+ + +S ++GG ID + V+ A+FR P+ + L + NG
Sbjct: 738 LGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEM 797

Query: 827 VRLGEVASVEVEMAPPNIRRDDVQRRVVVQANVA-GRDMGSVVKDIYALVPQADLPAGYT 885
V + P + R + + +Q A G G + + L + LPAG
Sbjct: 798 VPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMALMENLASK--LPAGIG 855

Query: 886 VIIGGQYENQQRAQQKLMLVVPISIALIALLLYFSFGSFKQVLLIMANVPLALIGGIVAL 945
G ++ + + +V IS ++ L L + S+ + +M VPL ++G ++A
Sbjct: 856 YDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAA 915

Query: 946 YVSGTYLSVPSSIGFITLFGVAVLNGVVLVDSINQ-RRQSGEALYDCVYEGTVGRLRPVL 1004
+ V +G +T G++ N +++V+ + G+ + + RLRP+L
Sbjct: 916 TLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPIL 975

Query: 1005 MTALTSALGLIPILLSSGVGSEIQKPLAVVIIGGLFSSTALTLLVLPTLYRWLYRRDK 1062
MT+L LG++P+ +S+G GS Q + + ++GG+ S+T L + +P + + R K
Sbjct: 976 MTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRCFK 1033



Score = 101 bits (253), Expect = 8e-24
Identities = 91/515 (17%), Positives = 193/515 (37%), Gaps = 36/515 (6%)

Query: 565 RPKVVMLSALLMFALSLLLLPRLGTEFVPELEEGTINLRVTLAPTASLGT-SLAVAPKLE 623
RP + A+++ L + +L P + +++ P A T V +E
Sbjct: 8 RPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSAN-YPGADAQTVQDTVTQVIE 66

Query: 624 AILLEFPEVEYALSRIGAPELGGDPEPVSNIEVYIGLKPISEWQSASSRLELQRLMEEKL 683
+ + Y S + ++ + + + ++ A Q ++ KL
Sbjct: 67 QNMNGIDNLMYMSST---------SDSAGSVTITLTFQSGTDPDIA------QVQVQNKL 111

Query: 684 SVFPGLLLTFSQPIATRVDELLSGVKAQLAIKIFGPDLAVLSEKGQA---LSDLVAKIPG 740
+ LL Q V++ S P + D ++++ G
Sbjct: 112 QLATPLLPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNG 171

Query: 741 AVDVSLEQVSGEAQLVVRPKRELLARYGISVDQVMSLVSQSIGGASAGQVIDGNARYD-- 798
DV L + + + +LL +Y ++ V++ + +AGQ+ A
Sbjct: 172 VGDVQL--FGAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQ 229

Query: 799 INVRLAAEFR-QSPDAIKDLLL-SGTNGATVRLGEVASVEVEMAPPNIR-RDDVQRRVVV 855
+N + A+ R ++P+ + L ++G+ VRL +VA VE+ N+ R + + +
Sbjct: 230 LNASIIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGL 289

Query: 856 QANVA-GRDMGSVVKDIYALVP--QADLPAGYTVIIGGQYENQQRAQQKLMLVVP---IS 909
+A G + K I A + Q P G V+ Y+ Q + VV +
Sbjct: 290 GIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLY--PYDTTPFVQLSIHEVVKTLFEA 347

Query: 910 IALIALLLYFSFGSFKQVLLIMANVPLALIGGIVALYVSGTYLSVPSSIGFITLFGVAVL 969
I L+ L++Y + + L+ VP+ L+G L G ++ + G + G+ V
Sbjct: 348 IMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVD 407

Query: 970 NGVVLVDSINQRRQS-GEALYDCVYEGTVGRLRPVLMTALTSALGLIPILLSSGVGSEIQ 1028
+ +V+V+++ + + + ++ A+ + IP+ G I
Sbjct: 408 DAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIY 467

Query: 1029 KPLAVVIIGGLFSSTALTLLVLPTLYRWLYRRDKR 1063
+ ++ I+ + S + L++ P L L +
Sbjct: 468 RQFSITIVSAMALSVLVALILTPALCATLLKPVSA 502


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
AYJ58_RS00305  RTXTOXIND  54     4e-10    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 54.1 bits (130), Expect = 4e-10
Identities = 36/182 (19%), Positives = 64/182 (35%), Gaps = 22/182 (12%)

Query: 109 RATATLVVDRDRTATLAPQLDVRVLARHVVPGQEVKKGEPLLTLGGSAVAQAQADYINAA 168
R V++ R + L + +A+H V QE + +N
Sbjct: 225 RYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQE----------------NKYVEAVNEL 268

Query: 169 AEWSRVKRMSEGAVSVSRRMQAQVDAELKRAILEAIKMTPTQIRALE----STPEAIGSY 224
+ E + ++ V K IL+ ++ T I L E +
Sbjct: 269 RVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQAS 328

Query: 225 QLLAPIDGRVQQ-DIAMLGQVFSAGTPLMQLT-DESYLWVEAQLTPTQTAHITVGGPALV 282
+ AP+ +VQQ + G V + LM + ++ L V A + I VG A++
Sbjct: 329 VIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAII 388

Query: 283 QV 284
+V
Sbjct: 389 KV 390



Score = 38.3 bits (89), Expect = 5e-05
Identities = 28/145 (19%), Positives = 56/145 (38%), Gaps = 7/145 (4%)

Query: 106 LDIRATA--TLVVDRDRTATLAPQLDVRVLARHVVPGQEVKKGEPLLTLGG----SAVAQ 159
++I ATA L R+ + P + V V G+ V+KG+ LL L + +
Sbjct: 80 VEIVATANGKLTHS-GRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLK 138

Query: 160 AQADYINAAAEWSRVKRMSEGAVSVSRRMQAQVDAELKRAILEAIKMTPTQIRALESTPE 219
Q+ + A E +R + +S D + + E + T + + +
Sbjct: 139 TQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTW 198

Query: 220 AIGSYQLLAPIDGRVQQDIAMLGQV 244
YQ +D + + + +L ++
Sbjct: 199 QNQKYQKELNLDKKRAERLTVLARI 223


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
AYJ58_RS00310  RTXTOXIND  30     0.023    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.8 bits (67), Expect = 0.023
Identities = 17/156 (10%), Positives = 48/156 (30%), Gaps = 13/156 (8%)

Query: 97 QLPEVQAQLTRQQQAELAIQAADRAVYNPELGLNYQNADTDSYSLGLSQTLDWADKRGVA 156
+ Q+ L + + + Q R++ L + Y +S+ +
Sbjct: 135 DTLKTQSSLLQARLEQTRYQILSRSI--ELNKLPELKLPDEPYFQNVSEEEVLRLTSLIK 192

Query: 157 TKRAELEAQILLADIGLERNQMLAERLLALAEQAQSRKALTFAEQQLRFTKAQLSIAKQR 216
+ + + Q ++ L++ + AE+ + E R K++L
Sbjct: 193 EQFSTWQNQKYQKELNLDKKR---------AERLTVLARINRYENLSRVEKSRLDDFSSL 243

Query: 217 LAAGDLSNVELQLMQLELATNTADYALAEQAALVAD 252
L ++ + + + + L + +
Sbjct: 244 LHKQAIAKHAVLEQENKYVEAVNE--LRVYKSQLEQ 277


3     AYJ58_RS00945  AYJ58_RS00980  N     Y          N                   Pathogenicity Island (unbiased-composition)

LocusTag       DNBias            CDNBias  %GCBias    Product
AYJ58_RS00945  -153973748600967  19       49.295570  guanosine-3',5'-bis(diphosphate)
AYJ58_RS00950  -153973748600967  21       49.274554  RidA family protein
AYJ58_RS00955  -153973748600967  20       50.367855  AMP-dependent synthetase
AYJ58_RS00960  -153973748600967  19       49.793651  sodium:calcium antiporter
AYJ58_RS00965  -153973748600967  18       50.361248  hypothetical protein
AYJ58_RS00970  -153973748600967  17       50.595813  sensor histidine kinase
AYJ58_RS00975  -153973748600967  18       51.253482  DNA-binding response regulator
AYJ58_RS00980  -153973748600967  17       50.585666  ATP-dependent DNA helicase RecG
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
AYJ58_RS00945  PF07328  32     0.003    T-DNA border endonuclease VirD1
		>PF07328#T-DNA border endonuclease VirD1

Length = 144

Score = 31.9 bits (72), Expect = 0.003
Identities = 17/69 (24%), Positives = 32/69 (46%), Gaps = 4/69 (5%)

Query: 505 APEQIEKVI----RDTKHTTLDSLLADIGLGNAMSIVIAQRLIGDNLENQESRDGHMMPI 560
P +++KVI + + D+ +A++GL ++ IA R IG +EN + +
Sbjct: 15 GPARVDKVISVKMTEAELAEFDAQIAELGLNRNRALRIAARRIGGFVENDAKTVELLRDM 74

Query: 561 RGAEGMLVT 569
A + T
Sbjct: 75 SRAIAGVAT 83


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
AYJ58_RS00970  PF06580  32     0.004    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 31.8 bits (72), Expect = 0.004
Identities = 20/71 (28%), Positives = 36/71 (50%), Gaps = 11/71 (15%)

Query: 276 LIVTELVNNILRHSGASQC------IIDFIQQPDRLILEVKDNGT----SKPIAEGNGLT 325
++V LV N ++H G +Q ++ + + LEV++ G+ + + G GL
Sbjct: 258 MLVQTLVENGIKH-GIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQ 316

Query: 326 GIRERLDILGG 336
+RERL +L G
Sbjct: 317 NVRERLQMLYG 327


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
AYJ58_RS00975  HTHFIS  78     5e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 77.6 bits (191), Expect = 5e-19
Identities = 29/117 (24%), Positives = 52/117 (44%), Gaps = 1/117 (0%)

Query: 2 KILLAEDQAMVRGALAALLTLAGGFNITQASDGDEALNLLKQQSFDLLLTDIEMPGRTGL 61
IL+A+D A +R L L+ AG +++ S+ + DL++TD+ MP
Sbjct: 5 TILVADDDAAIRTVLNQALSRAG-YDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 ELAAWVKEQHSQTKVVVITTFGRAGYIKRAIEAGVGGFLLKDAPSETLVNAIQQVMA 118
+L +K+ V+V++ +A E G +L K L+ I + +A
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits  Score  E-value  Comments
AYJ58_RS00980  SECA  41     1e-05    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 41.4 bits (97), Expect = 1e-05
Identities = 30/84 (35%), Positives = 39/84 (46%), Gaps = 8/84 (9%)

Query: 289 MRLVQGDV-----GSGKTLVAAMAA-LQAIENGYQVAMMAPTELLAEQHATNFAAWFEPL 342
M L + + G GKTL A + A L A+ G V ++ + LA++ A N FE L
Sbjct: 92 MVLNERCIAEMRTGEGKTLTATLPAYLNALT-GKGVHVVTVNDYLAQRDAENNRPLFEFL 150

Query: 343 GLKVGW-LAGKLKGKARAQSLADI 365
GL VG L G R ADI
Sbjct: 151 GLTVGINLPGMPAPAKREAYAADI 174


4     AYJ58_RS01360  AYJ58_RS01460  N     Y          N                   Pathogenicity Island (unbiased-composition)

LocusTag       DNBias            CDNBias  %GCBias    Product
AYJ58_RS01360  -153973748600967  19       47.764096  TetR/AcrR family transcriptional regulator
AYJ58_RS01365  -153973748600967  17       48.760331  NADH-dependent alcohol dehydrogenase
AYJ58_RS01370  -153973748600967  17       47.515976  type 1 glutamine amidotransferase
AYJ58_RS01375  -153973748600967  16       46.903042  glutathione S-transferase
AYJ58_RS01380  -153973748600967  14       46.749740  DUF4124 domain-containing protein
AYJ58_RS01385  -153973748600967  12       47.301785  nitrogen regulation protein NR(II)
AYJ58_RS01390  -153973748600967  12       46.657694  nitrogen regulation protein NR(I)
AYJ58_RS01395  -153973748600967  11       45.779295  porin family protein
AYJ58_RS01400  -153973748600967  12       45.997770  hypothetical protein
AYJ58_RS01405  -153973748600967  13       46.399676  cation diffusion facilitator family transporter
AYJ58_RS01410  -153973748600967  15       46.372881  hypothetical protein
AYJ58_RS01415  -153973748600967  13       46.118469  DNA-binding response regulator
AYJ58_RS01420  -153973748600967  14       46.998255  HAMP domain-containing protein
AYJ58_RS01425  -153973748600967  18       46.777164  TetR/AcrR family transcriptional regulator
AYJ58_RS01430  -153973748600967  14       47.080292  LysR family transcriptional regulator
AYJ58_RS01435  -153973748600967  15       48.125366  cytochrome c
AYJ58_RS01440  -153973748600967  9        48.868043  flavocytochrome c
AYJ58_RS01445  -153973748600967  10       49.312124  oxidoreductase
AYJ58_RS01450  -153973748600967  10       48.752518  ankyrin repeat domain-containing protein
AYJ58_RS01455  -153973748600967  11       48.790940  catalase
AYJ58_RS01460  -153973748600967  14       49.945055  sigma-54-dependent Fis family transcriptional
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
AYJ58_RS01360  HTHTETR  63     1e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 63.1 bits (153), Expect = 1e-14
Identities = 41/209 (19%), Positives = 66/209 (31%), Gaps = 13/209 (6%)

Query: 1 MKIETQSTRQHILDIGYKLIVRKGFSSVGLSLLLQAAEVPKGSFYHYFKSKEQFGEALIT 60
K E Q TRQHILD+ +L ++G SS L + +AA V +G+ Y +FK K +
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 61 DYFEKYQLDLDALFNDSTLTGHQRLMQYWQQWLHVQADGCVDQKCLVVKLSAEVADLSEA 120
L + L + + + A
Sbjct: 65 LSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMA 124

Query: 121 MRVALLKGSAG-IIDRLTTCVQVGINDSSI-ADQDPQSTAEM-------LYHMWLGAS-- 169
+ + DR+ ++ I + AD + A + L WL A
Sbjct: 125 VVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQS 184

Query: 170 --LMNKLGHSPAALERALVTTKAILTPKT 196
L + A L + + P T
Sbjct: 185 FDLKKEARDYVAILLEMYLLCPTLRNPAT 213


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
AYJ58_RS01385  PF06580  39     1e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 39.5 bits (92), Expect = 1e-05
Identities = 35/188 (18%), Positives = 70/188 (37%), Gaps = 33/188 (17%)

Query: 166 TLIIEQADRLRNLVDRL-------LGPQRPTQHSLHNIHQVVQKVYKLVEMALPANIQLK 218
LI+E + R ++ L L Q SL + VV +L + +Q +
Sbjct: 184 ALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFE 243

Query: 219 RDYDPSIPDIEMDPDQMQQAVLNILQNAVQALEHTGGEILIRTRTQHQVTIGSQRHKLVL 278
+P+I D+++ P +Q V N +++ + L GG+IL++ + +
Sbjct: 244 NQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQ-GGKILLKGTKDNGT----------V 292

Query: 279 TLSIIDNGPGIPPELMDTLFYPMVTGREQGSGLGLSIAHNIARLHSG---RIDCLSSAGH 335
TL + + G ++ +G GL ++ G +I G
Sbjct: 293 TLEVENTGSLALKN------------TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGK 340

Query: 336 TEFIISLP 343
++ +P
Sbjct: 341 VNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
AYJ58_RS01390  HTHFIS  560    0.0      FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 560 bits (1445), Expect = 0.0
Identities = 197/473 (41%), Positives = 294/473 (62%), Gaps = 11/473 (2%)

Query: 7 VWILDDDSSIRWVLEKALQGAKLSTASFAAAESLWQALEISQPHVIVSDIRMPGTDGLSL 66
+ + DDD++IR VL +AL A + A +LW+ + ++V+D+ MP + L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 67 LERLQVHYPHIPVIIMTAHSDLDSAVSAYQAGAFEYLPKPFDIDEAISLVERALTHATEQ 126
L R++ P +PV++M+A + +A+ A + GA++YLPKPFD+ E I ++ RAL +
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125

Query: 127 SPAPAQEAQVKTPEIIGEAPAMQEVFRAIGRLSRSSISVLINGQSGTGKELVAGALHKHS 186
P+ ++ ++G + AMQE++R + RL ++ ++++I G+SGTGKELVA ALH +
Sbjct: 126 -PSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYG 184

Query: 187 PRKDKPFIALNMAAIPKDLIESELFGHEKGAFTGAANVRQGRFEQANGGTLFLDEIGDMP 246
R++ PF+A+NMAAIP+DLIESELFGHEKGAFTGA GRFEQA GGTLFLDEIGDMP
Sbjct: 185 KRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMP 244

Query: 247 LDVQTRLLRVLADGQFYRVGGHNAVQVDVRIIAATHQDLELLVQKGGFREDLFHRLNVIR 306
+D QTRLLRVL G++ VGG ++ DVRI+AAT++DL+ + +G FREDL++RLNV+
Sbjct: 245 MDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVP 304

Query: 307 VHLPPLSQRREDIPQLATHFLASAAKEIGVETKIMTKETAVKLSQLPWPGNVRQLENTCR 366
+ LPPL R EDIP L HF+ A KE G++ K +E + PWPGNVR+LEN R
Sbjct: 305 LRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALELMKAHPWPGNVRELENLVR 363

Query: 367 WLTVMASGQEILPQDLPPELLKDPVSVTHTAKGSQDWQSALTEWIDQKLSE--------- 417
LT + I + + EL + ++ ++++ +++ + +
Sbjct: 364 RLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDAL 423

Query: 418 GNSDLLTEVQPAFERILLETALRHTQGHKQEAAKRLGWGRNTLTRKLKELSMD 470
S L V E L+ AL T+G++ +AA LG RNTL +K++EL +
Sbjct: 424 PPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
AYJ58_RS01395  OMPADOMAIN  64     1e-14    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 63.8 bits (155), Expect = 1e-14
Identities = 46/198 (23%), Positives = 70/198 (35%), Gaps = 18/198 (9%)

Query: 1 MNKLSIVAISLLSTLATAQVSAATDTTGFYVGGALNRVTVDGDDFTDSQSGT-----GAG 55
M K +I AI++ AA +Y G L F ++ T GAG
Sbjct: 1 MKKTAI-AIAVALAGFATVAQAAPKDNTWYTGAKLGWSQYHDTGFINNNGPTHENQLGAG 59

Query: 56 IYGGYNFNEWFGLEANLFATGDL----GDKDIDVSAGALSFTPKFTAQINDIFSAYAKVG 111
+GGY N + G E G + ++ A + T K I D Y ++G
Sbjct: 60 AFGGYQVNPYVGFEMGYDWLGRMPYKGSVENGAYKAQGVQLTAKLGYPITDDLDIYTRLG 119

Query: 112 IA--SMAIVLDGNGIDEDLTGFGWTYGVGVNAAVTEHLNIRVSYDVTTGKLDADNNYLGL 169
+ G + D TG + GV A+T + R+ Y T DA
Sbjct: 120 GMVWRADTKSNVYGKNHD-TGVSPVFAGGVEYAITPEIATRLEYQWTNNIGDAHTI---- 174

Query: 170 KDIDTDIKQFAVGMHYQF 187
D ++G+ Y+F
Sbjct: 175 -GTRPDNGMLSLGVSYRF 191


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS01410  OUTRMMBRANEA  28     0.016    Outer membrane protein A signature.
		>OUTRMMBRANEA#Outer membrane protein A signature.

Length = 346

Score = 28.4 bits (63), Expect = 0.016
Identities = 16/49 (32%), Positives = 23/49 (46%), Gaps = 4/49 (8%)

Query: 7 LKASLFAILASSAVFATAVNAAPKD----AGCDFSKSEFHKGDRMEHGG 51
+K + AI + A FAT AAPKD G S++H + + G
Sbjct: 1 MKKTAIAIAVALAGFATVAQAAPKDNTWYTGAKLGWSQYHDTGFINNNG 49


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
AYJ58_RS01415  HTHFIS  96     4e-25    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 96.1 bits (239), Expect = 4e-25
Identities = 44/163 (26%), Positives = 75/163 (46%), Gaps = 3/163 (1%)

Query: 2 SRILLIDDDLGLSELLGQLLELEGFQLTLAYDGKQGLDLALNADYDLILLDVMLPKLNGF 61
+ IL+ DDD + +L Q L G+ + + + D DL++ DV++P N F
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 EVLRALRQH-KQTPVLMLTARGDEIDRVVGLEIGADDYLPKPFNDRELIARIRAIIRRSN 120
++L +++ PVL+++A+ + + E GA DYLPKPF+ ELI I +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 121 LTTQEIHAAPAQEFGDLRLDPSRQEAYCNEQLIILTGTEFTLL 163
++ + + QE Y L L T+ TL+
Sbjct: 124 RRPSKLEDDSQDGMPLVGRSAAMQEIY--RVLARLMQTDLTLM 164


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
AYJ58_RS01420  PF06580  38     4e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 38.3 bits (89), Expect = 4e-05
Identities = 21/118 (17%), Positives = 47/118 (39%), Gaps = 12/118 (10%)

Query: 277 EAEQLEKLISELLELSRVKLSTNETKVHLGLAESLSQVLDDAEFEAEQQGKSIT--IDID 334
+ + ++++ L EL R L + + LA+ L+ V + + Q + I+
Sbjct: 189 DPTKAREMLTSLSELMRYSLRYSNARQVS-LADELTVVDSYLQLASIQFEDRLQFENQIN 247

Query: 335 EEIELAHFPKSLSRAIENLLRNAIRYAASD------IQLQASATADQVKITIKDDGPG 386
I P L ++ L+ N I++ + I L+ + V + +++ G
Sbjct: 248 PAIMDVQVPPML---VQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSL 302


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
AYJ58_RS01425  HTHTETR  37     3e-05    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 36.9 bits (85), Expect = 3e-05
Identities = 25/135 (18%), Positives = 56/135 (41%), Gaps = 9/135 (6%)

Query: 4 WEQRTDYLVEVAQRCL--RGHQSFDLYRSHLVAASQISKGTIYNHFTTEADLVVAVACAQ 61
++ ++++VA R +G S L + A+ +++G IY HF ++DL +
Sbjct: 9 AQETRQHILDVALRLFSQQGVSSTSL--GEIAKAAGVTRGAIYWHFKDKSDLFSEIWELS 66

Query: 62 YQDWL-ISAKQDRQQYSDP---FECDLFHHCQRLHDVLAHKRFVIERVMPNQELLQQASE 117
+ + + + DP L H + +R ++E + E + + +
Sbjct: 67 ESNIGELELEYQAKFPGDPLSVLREILIHVLESTV-TEERRRLLMEIIFHKCEFVGEMAV 125

Query: 118 VYRNRFNDLLDQYKK 132
V + + N L+ Y +
Sbjct: 126 VQQAQRNLCLESYDR 140


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
AYJ58_RS01440  HTHFIS  30     0.020    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.2 bits (68), Expect = 0.020
Identities = 10/31 (32%), Positives = 17/31 (54%), Gaps = 1/31 (3%)

Query: 39 KWDKEIEILIVGSGFAGLAAAIEATRKGAKD 69
K ++ +L++ S AI+A+ KGA D
Sbjct: 71 KARPDLPVLVM-SAQNTFMTAIKASEKGAYD 100


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS01445  NUCEPIMERASE  30     0.015    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 29.8 bits (67), Expect = 0.015
Identities = 13/28 (46%), Positives = 18/28 (64%), Gaps = 2/28 (7%)

Query: 151 ILVTGASGGVGS-VAVTLLANAGYRVVA 177
LVTGA+G +G V+ LL G++VV
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEA-GHQVVG 29


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
AYJ58_RS01460	HTHFIS	331	e-109	FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 331 bits (849), Expect = e-109
Identities = 130/468 (27%), Positives = 225/468 (48%), Gaps = 50/468 (10%)

Query: 201 DLAAQPNLLSSGWQGIVIADS-----DG-----RIVGYNPMAKQLL--NQAKVGDSVEQY 248
+ A +++G +V+ D + RI P L+ Q +++
Sbjct: 35 NAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKAS 94

Query: 249 LGDNWSRAGGFNQHLD-------LHLQTQSLNIPSAKSRVAMSNKSLNQLGVRFHDPQLE 301
G ++ + + ++L P + + + S + + + ++
Sbjct: 95 ------EKGAYDYLPKPFDLTELIGIIGRALAEPKRRPS-KLEDDSQDGMPLVGRSAAMQ 147

Query: 302 RAWQHANKVITKQIPLLVLGETGVGKEQFVKKLHAHSVRRSEPLVAVNCAALPAELVESE 361
++ +++ + L++ GE+G GKE + LH + RR+ P VA+N AA+P +L+ESE
Sbjct: 148 EIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESE 207

Query: 362 LFGYQAGAFTGANRTGFIGKIRQAHGGFLFLDEIGEMPLAAQSRLLRVLQEREVVPVGSN 421
LFG++ GAFTGA G+ QA GG LFLDEIG+MP+ AQ+RLLRVLQ+ E VG
Sbjct: 208 LFGHEKGAFTGAQTRS-TGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGR 266

Query: 422 QSFKVDIQIIAATHMDLEQQVAQGLFRQDLFYRLNGLQVRLPALRERQ-DIERIIH---K 477
+ D++I+AAT+ DL+Q + QGLFR+DL+YRLN + +RLP LR+R DI ++ +
Sbjct: 267 TPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQ 326

Query: 478 LHRKHRIAPQAICPELLGLLVQHDWPGNLRELDNLMQVACLMAEGDDTLTWQHLPDYLAQ 537
K + + E L L+ H WPGN+REL+NL++ + D +T + + + L
Sbjct: 327 QAEKEGLDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQ-DVITREIIENELRS 385

Query: 538 KLAAGPLTVDPLNTQLLNEEQPLGEEAKTGQHSASHPLAGKVVSGKVTRGNTATLPTTTA 597
++ P+ + + + + +
Sbjct: 386 EIPDSPIEKAAARS----GSLSISQAVEENMRQYFASFGD--------------ALPPSG 427

Query: 598 VQSDSLHEAIYSNVLQAHQACNGNVSQCAKRLGISRNALYRRLKQMGL 645
+ L E Y +L A A GN + A LG++RN L ++++++G+
Sbjct: 428 LYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475


5	AYJ58_RS02015	AYJ58_RS02050	N	Y	N	Pathogenicity Island (unbiased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
AYJ58_RS02015	-153973748600967	18	49.635922	type II secretion system protein GspJ
AYJ58_RS02020	-153973748600967	15	48.623707	type II secretion system protein GspI
AYJ58_RS02025	-153973748600967	14	48.422436	type II secretion system protein GspH
AYJ58_RS02030	-153973748600967	13	48.163143	type II secretion system protein GspG
AYJ58_RS02035	-153973748600967	13	48.527132	type II secretion system protein GspF
AYJ58_RS02040	-153973748600967	15	48.711723	type II secretion system protein GspE
AYJ58_RS02045	-153973748600967	14	48.872688	type II secretion system protein GspD
AYJ58_RS02050	-153973748600967	11	49.126702	type II secretion system protein GspC
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
AYJ58_RS02015	BCTERIALGSPG	32	0.001	Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 32.2 bits (73), Expect = 0.001
Identities = 16/41 (39%), Positives = 27/41 (65%), Gaps = 3/41 (7%)

Query: 3 LKLTSAQRGFTLLEMLIAIAIFAMIGLASNAVLSTVLTNDE 43
++ T QRGFTLLE+++ I I IG+ ++ V+ ++ N E
Sbjct: 1 MRATDKQRGFTLLEIMVVIVI---IGVLASLVVPNLMGNKE 38


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
AYJ58_RS02020	PilS_PF08805	29	0.003	PilS N terminal
		>PilS_PF08805#PilS N terminal

Length = 185

Score = 29.1 bits (65), Expect = 0.003
Identities = 11/32 (34%), Positives = 17/32 (53%)

Query: 5 KGMTLLEVIVALAVFSIAAVSITKSLGEQMAN 36
KG TL+EV++ + V + A S K +N
Sbjct: 26 KGATLMEVLLVVGVIVVLAASAYKLYSMVQSN 57


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
AYJ58_RS02025	BCTERIALGSPH	84	5e-23	Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 84.2 bits (208), Expect = 5e-23
Identities = 44/171 (25%), Positives = 70/171 (40%), Gaps = 39/171 (22%)

Query: 4 LRHAGFTLMEVMLVILLMGLTAAAVTMSIGNSGPQQALDRTARQFIAATEMVLDETVLSG 63
+R GFTL+E+ML++LLMG++A V ++ S A AR F A V + +G
Sbjct: 1 MRQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLAR-FEAQLRFVQQRGLQTG 59

Query: 64 QFIGIVIEKTSYQFVFYKDG---------------KWEPLDKDRLLSEKQMEPGVVMNLV 108
QF G+ + +QF+ + +W PL R+ +
Sbjct: 60 QFFGVSVHPDRWQFLVLEARDGADPAPADDGWSGYRWLPLRAGRVATSGS---------- 109

Query: 109 LDGLPLVQDDEEDDSWFEEPLIEPSADDKKKHPEPQVMLFPSGEMSAFELT 159
+ G L + ++W P V++FP GEM+ F LT
Sbjct: 110 IAGGKLNLAFAQGEAW-------------TPGDNPDVLIFPGGEMTPFRLT 147


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
AYJ58_RS02030	BCTERIALGSPG	229	6e-81	Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 229 bits (584), Expect = 6e-81
Identities = 97/144 (67%), Positives = 119/144 (82%)

Query: 1 MQMNKKHQGFTLLEVMVVIVILGILASMVVPNLMGNKDKADQQKAVSDIVALENALDMYK 60
M+ K +GFTLLE+MVVIVI+G+LAS+VVPNLMGNK+KAD+QKAVSDIVALENALDMYK
Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYK 60

Query: 61 LDNGIYPTTEQGLEALVQKPTISPEPRNYREDGYVKRLPEDPWRNKYLLLSPGENGKLDI 120
LDN YPTT QGLE+LV+ PT+ P NY ++GY+KRLP DPW N Y+L++PGE+G D+
Sbjct: 61 LDNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDL 120

Query: 121 FTAGPDGQPGTEDDIGNWNLQNFQ 144
+AGPDG+ GTEDDI NW L +
Sbjct: 121 LSAGPDGEMGTEDDITNWGLSKKK 144


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
AYJ58_RS02035	BCTERIALGSPF	505	0.0	Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 505 bits (1302), Expect = 0.0
Identities = 228/407 (56%), Positives = 304/407 (74%), Gaps = 1/407 (0%)

Query: 1 MPAFEYKALDAKGKQLKGVIEADTARHARSQLRDQRMMPLEILPVAEKEAKAKSSSFAF- 59
M + Y+ALDA+GK+ +G EAD+AR AR LR++ ++PL + + K+ S+ +
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 60 FKRGISVAELALITRQIATLVAAGLPIEESLKAVGQQCEKDRLASMIMAVRSRVVEGYSL 119
K +S ++LAL+TRQ+ATLVAA +P+EE+L AV +Q EK L+ ++ AVRS+V+EG+SL
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 120 ADSLAEFPHIFDDLYRAMVASGEKSGHLEVVLNRLADYTERRQQLKSKLTQAMIYPAVLT 179
AD++ FP F+ LY AMVA+GE SGHL+ VLNRLADYTE+RQQ++S++ QAMIYP VLT
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 180 TVAIGVISILLAAVVPKVVGQFEHMGAELPASTRFLISASDFVQNYGVFVVIALVMLFAL 239
VAI V+SILL+ VVPKVV QF HM LP STR L+ SD V+ +G ++++AL+ F
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240

Query: 240 FRRMLKSPAFRMKYDNFLLSMPVVGRVSKGLNTARFARTLSILSASSVPLLDGMRIASEV 299
FR ML+ R+ + LL +P++GR+++GLNTAR+ARTLSIL+AS+VPLL MRI+ +V
Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 300 LQNVRVRAAVDDATARVREGTSLGAALTNTKLFPAMMLYMIASGEKSGQLEQMLERAADN 359
+ N R + AT VREG SL AL T LFP MM +MIASGE+SG+L+ MLERAADN
Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360

Query: 360 QDREFEGNVNIALGVFEPMLVVSMACVVLFIVMAILQPILALNNLIS 406
QDREF + +ALG+FEP+LVVSMA VVLFIV+AILQPIL LN L+S
Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLMS 407


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
AYJ58_RS02045	BCTERIALGSPD	605	0.0	Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 605 bits (1561), Expect = 0.0
Identities = 327/681 (48%), Positives = 446/681 (65%), Gaps = 34/681 (4%)

Query: 6 IRRKLIAGVVAGAAMFSSQFAWSEQYAANFKGTDIQEFINIVGKNLNKTIIVDPTIRGKI 65
IR + ++ A +F A +E+++A+FKGTDIQEFIN V KNLNKT+I+DP++RG I
Sbjct: 7 IRSFSLTLLIFAALLFRP--AAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTI 64

Query: 66 NVRSYDLLNDEQYYQFFLNVLQVYGYAIVEMENNVIKVIKDKDAKTAAIRVANDNEPGMG 125
VRSYD+LN+EQYYQFFL+VL VYG+A++ M N V+KV++ KDAKTAA+ VA+D PG+G
Sbjct: 65 TVRSYDMLNEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIG 124

Query: 126 DEMVTRIVALYNTEAKQLAPLLRQLNDNAGGGNVVNYDPSNVLMLSGRAAVVNKLVEIVR 185
DE+VTR+V L N A+ LAPLLRQLNDNAG G+VV+Y+PSNVL+++GRAAV+ +L+ IV
Sbjct: 125 DEVVTRVVPLTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVE 184

Query: 186 RVDKQGDTSVQVVPLEYASAGEMVRIIDTLYRATANQAQLPGQAPKVVADERINAVVVSG 245
RVD GD SV VPL +ASA ++V+++ L + T+ A VVADER NAV+VSG
Sbjct: 185 RVDNAGDRSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSG 244

Query: 246 DEKSRQRVVELIHRLDAEQASTGNTKVRYLRYAKAEDLVEVLTGFAQKLEGDKDPSAQAA 305
+ SRQR++ +I +LD +QA+ GNTKV YL+YAKA DLVEVLTG + ++ +K A
Sbjct: 245 EPNSRQRIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEK--QAAKP 302

Query: 306 GGKRRNEINIMAHTETNALVISAEPDQMRTIESVINQLDIRRAQVLVEAIIVEVAEGDNV 365
I I AH +TNAL+++A PD M +E VI QLDIRR QVLVEAII EV + D +
Sbjct: 303 VAALDKNIIIKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGL 362

Query: 366 GFGVQWAAKAGGGTQFNNLGPTIGEIGAGVWAAQPIEGDLVCSGTNLDNCKKNPDKQGDV 425
G+QWA K G TQF N G I AG +G + S
Sbjct: 363 NLGIQWANKNAGMTQFTNSGLPISTAIAGA-NQYNKDGTVSSS----------------- 404

Query: 426 TLLAQALGKVNGMAWGVAMGDFGALIQAVSSDTNSNVLATPSITTLDNQEASFIVGDEVP 485
LA AL NG+A G G++ L+ A+SS T +++LATPSI TLDN EA+F VG EVP
Sbjct: 405 --LASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVP 462

Query: 486 ILTGSTASSNNSNPFQTVERKEVGVKLKVVPQINEGNAVKLTIEQEVSGVNG-----NTG 540
+LTGS +++ N F TVERK VG+KLKV PQINEG++V L IEQEVS V ++
Sbjct: 463 VLTGSQ-TTSGDNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSD 521

Query: 541 VDISFATRRLTTTVMADSGQIVVLGGLINEEVQESVQKVPFLGDIPILGHLFKSSSSKKT 600
+ +F TR + V+ SG+ VV+GGL+++ V ++ KVP LGDIP++G LF+S+S K +
Sbjct: 522 LGATFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVS 581

Query: 601 KKNLMIFIKPTIIRDGVTMEGIAGRKYNYFRALQLEQ--QERGVNLMPNTQVPVLEEWNQ 658
K+NLM+FI+PT+IRD + +Y F Q +Q +E ++ + + Q
Sbjct: 582 KRNLMLFIRPTVIRDRDEYRQASSGQYTAFNDAQSKQRGKENNDAMLNQDLLEIYP--RQ 639

Query: 659 SEYLPPEVNAILERYKEGKGL 679
+V+A ++ + G L
Sbjct: 640 DTAAFRQVSAAIDAFNLGGNL 660


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
AYJ58_RS02050	BCTERIALGSPC	179	4e-57	Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C signature.

Length = 272

Score = 179 bits (456), Expect = 4e-57
Identities = 69/288 (23%), Positives = 137/288 (47%), Gaps = 36/288 (12%)

Query: 17 KLLSRIVFWLGFIVIMLLAAQITWKL-VPTHSSASAWSPTPVSVNGKGAGQVDLAGLQQL 75
++ RI+F+L ++ A I W++ +P ++ S+ TP + L
Sbjct: 12 SVIRRILFYLLMLLFCQQLAMIFWRIGLPDNAPVSSVQITPAQARQQPVT------LNDF 65

Query: 76 GLFGKADATSDKPKVEVVETVTDAPKTTLSIQLTGVVASTADQKGLAIIESSGSQETYSL 135
LFG + + ++ +++ P +TL++ LTGV+A D + +AII Q + +
Sbjct: 66 TLFGVSPEKNKAGALDA-SQMSNLPPSTLNLSLTGVMAGDDDSRSIAIISKDNEQFSRGV 124

Query: 136 GDKIKGTSASLKEVYADRIIITNAGRYETLMLDGLVYTSQSPANQQLQQAKSNKADSAVS 195
+++ G +A + + DR+++ GRYE L L +
Sbjct: 125 NEEVPGYNAKIVSIRPDRVVLQYQGRYEVLGLYSQEDSGSDGVPG--------------- 169

Query: 196 RVDQRKNAEISQELAESRTELLADPSKITDYIAISPVRQGDSVAGYRLNPGKDANLFKQA 255
A+++++L + + ++DY++ SP+ + + GYRLNPG ++ F +
Sbjct: 170 -------AQVNEQLQQR------ASTTMSDYVSFSPIMNDNKLQGYRLNPGPKSDSFYRV 216

Query: 256 GFKANDLAKSINGYDLTIMSQALEMMSQLPELTEVSIMVEREGQLVEI 303
G + ND+A ++NG DL QA + M ++ ++ ++ VER+GQ +I
Sbjct: 217 GLQDNDMAVALNGLDLRDAEQAKKAMERMADVHNFTLTVERDGQRQDI 264


6	AYJ58_RS02460	AYJ58_RS02525	N	Y	N	Pathogenicity Island (unbiased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
AYJ58_RS02460	-153973748600967	15	49.578373	TonB-dependent receptor
AYJ58_RS02465	-153973748600967	17	49.648712	MFS transporter
AYJ58_RS02470	-153973748600967	16	50.054441	sucrose phosphorylase
AYJ58_RS02475	-153973748600967	20	49.771207	LacI family DNA-binding transcriptional
AYJ58_RS02480	-153973748600967	23	49.619410	MFS transporter
AYJ58_RS02485	-153973748600967	22	49.323621	peptidase P60
AYJ58_RS02490	-153973748600967	20	48.975087	sulfite exporter TauE/SafE family protein
AYJ58_RS02495	-153973748600967	22	49.586908	MFS transporter
AYJ58_RS02500	-153973748600967	22	49.269311	DNA-binding response regulator
AYJ58_RS02505	-153973748600967	21	49.576670	sensor histidine kinase
AYJ58_RS02510	-153973748600967	16	47.631854	pirin family protein
AYJ58_RS02515	-153973748600967	14	45.637997	DUF4118 domain-containing protein
AYJ58_RS02520	-153973748600967	17	45.309072	DNA-binding response regulator
AYJ58_RS02525	-153973748600967	18	44.678233	TrkA family potassium uptake protein
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
AYJ58_RS02460	ECOLIPORIN	33	0.004	E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 33.3 bits (76), Expect = 0.004
Identities = 59/276 (21%), Positives = 100/276 (36%), Gaps = 67/276 (24%)

Query: 411 DDTSVTLGYYNATQ-NIGMS----WMWNSYLMEVKGDNAALLDVVAADGTAYSDNGLYGY 465
D T + +G+ TQ N ++ W +N +G+ A +A G + D G + Y
Sbjct: 53 DQTYMRVGFKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDY 112

Query: 466 GVPYWGNCCQRNYDTDYSIKAPYLALASSFGDLSLDASVRYDSGDASG------------ 513
G RNY Y ++ + + FG S + Y +G A+G
Sbjct: 113 G---------RNYGVLYDVEG-WTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGL 162

Query: 514 ----NYAGSVQSQVDMNLDGVISIPEQSVSSIDNANPQPVNYDWSYTSYSLGANYQFSSD 569
N+A Q + + ++I + ++ D+ N D + + Y
Sbjct: 163 VDGLNFALQYQGKNESQSADDVNIGTNNRNNGDDIRYD--NGD----GFGISTTYDIGMG 216

Query: 570 LAAFGRLSHGGRANADRLLFGKVRADGSVAKEDAVDIVDQYELGVKYRYDDLSVFATAFY 629
+A + R N +++ G A G A D + G+KY D +++ Y
Sbjct: 217 FSAGAAYTTSDRTN-EQVNAGGTIAGGDKA--------DAWTAGLKY--DANNIYLATMY 265

Query: 630 SET-------------------EEQNFEATSQRFFD 646
SET + QNFE T+Q FD
Sbjct: 266 SETRNMTPYGKTDKGYDGGVANKTQNFEVTAQYQFD 301


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
AYJ58_RS02465	TCRTETA	60	6e-12	Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 59.8 bits (145), Expect = 6e-12
Identities = 69/368 (18%), Positives = 128/368 (34%), Gaps = 46/368 (12%)

Query: 22 LMFFMFAMTSDAVGV-----IIPELISQFGLSMSQASAFHYMPMIFIAMSGLF---LGFL 73
L+ + + DAVG+ ++P L+ S + + + ++ M LG L
Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66

Query: 74 ADKIGRKLTILLGLLLFALACFMFALGESFYYFLFLLAFVGTAIGVFKTGALGLIGDIST 133
+D+ GR+ +L+ L A+ + A F + L++ V G A I DI T
Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATA-PFLWVLYIGRIVAGITGATGAVAGAYIADI-T 124

Query: 134 SSKQHSSTMNTVEGYFGVGAMIGPAIVSYLLISGVSWKYLYFGAGC-----FCLVLCWL- 187
+ + + FG G + GP + + G S +F A F L
Sbjct: 125 DGDERARHFGFMSACFGFGMVAGPVLGGLM--GGFSPHAPFFAAAALNGLNFLTGCFLLP 182

Query: 188 ----AYRADYPQIKRSSTDAINLASTFKMMKNPYALGFSL-AIGLYVATEVAIYV----- 237
R + + + A ++ A+ F + +G A I+
Sbjct: 183 ESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFH 242

Query: 238 WMPTLLQSYQGDYTTLAAYALT-IFFTLRAGGRFLGGWVLERFPWQQVMFWFSFAISACY 296
W T + +LAA+ + G R +M +
Sbjct: 243 WDATTIG------ISLAAFGILHSLAQAMITGPVAARLGERRA----LMLGMIADGTGYI 292

Query: 297 LGSMI---YGIEAAVVLLPLSGLFMSMMYPTLNSKGISCFPVDQHGSVAGVILFFTAVSA 353
L + + +VLL G+ M + S+ + ++ G + G + T++++
Sbjct: 293 LLAFATRGWMAFPIMVLLASGGIGMP-ALQAMLSRQVD---EERQGQLQGSLAALTSLTS 348

Query: 354 AVGPLLMG 361
VGPLL
Sbjct: 349 IVGPLLFT 356


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
AYJ58_RS02480	TCRTETB	111	7e-29	Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 111 bits (280), Expect = 7e-29
Identities = 85/430 (19%), Positives = 171/430 (39%), Gaps = 29/430 (6%)

Query: 30 FLAAVDQTLLATATPAIVEDLGGLR-QASWITIGYMLAMAASVPIYGWLGDNFGRAKILM 88
F + +++ +L + P I D +W+ +ML + +YG L D G ++L+
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 89 IAIVIFALGSIVSA-SAGTMDHMIAGRILQGMGGGGLMSLSQSLIGELVPIRQRARFQGY 147
I+I GS++ +I R +QG G +L ++ +P R + G
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 148 FAAMFTLASVGGPVIGGIVVHAYSWHWLFWANIPLA-MLAVWRLNGLHKRSVKPVRQGKF 206
++ + GP IGG++ H W +L IP+ ++ V L L K+ V+ +G F
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPMITIITVPFLMKLLKKEVR--IKGHF 199

Query: 207 DLVGVVLFPTIITALLYWLSVAGQEFAWLSATSLGFAVFVVLGILGLLLWERRLASPFLP 266
D+ G++L I + + + S+ F + VL L + R++ PF+
Sbjct: 200 DIKGIILMSVGIVFFMLFTTSY----------SISFLIVSVLSFLIFVKHIRKVTDPFVD 249

Query: 267 LDLLAKKAVYMPLLTAALFAACLFAMIFFLPIYLQVGLHTNPAKTG-LLLLPMTFGIVTG 325
L + +L + + + +P ++ + A+ G +++ P T ++
Sbjct: 250 PGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIF 309

Query: 326 STIAGRLLSKDVAPKWLPTFGMGLAFIGLLLISFVPPNANVIGGLGV-LVGIGLGTVMPS 384
I G L+ + P ++ G+ + L SF+ + + + V GL
Sbjct: 310 GYIGGILVDR-RGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTV 368

Query: 385 VQLVVQSVSGKARLSQITAMVSLCRSMGAAIGTALFSVLLYSLLPLTGSELGIAAIKTLP 444
+ +V S + ++++ + G A+ LL + + + LP
Sbjct: 369 ISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLL---------SIPLLDQRLLP 419

Query: 445 TEVVHHAFQY 454
EV + Y
Sbjct: 420 MEVDQSTYLY 429


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
AYJ58_RS02495	TCRTETA	36	3e-04	Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 35.6 bits (82), Expect = 3e-04
Identities = 54/278 (19%), Positives = 101/278 (36%), Gaps = 39/278 (14%)

Query: 43 PVSQVAFVFGLL----SLSLAVASSMAGKLQERFGVRNVTLGAGLLLGLGFLLTAQASNL 98
+ V +G+L +L + + G L +RFG R V L + + + + A A L
Sbjct: 37 HSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFL 96

Query: 99 MMLYLCAGILVGFADGTGY--------LMTLSNCVKWFPERKGLISALAIGAYGLGSLGF 150
+LY+ I+ G TG + + F G +SA +G G +
Sbjct: 97 WVLYI-GRIVAGITGATGAVAGAYIADITDGDERARHF----GFMSA----CFGFGMVAG 147

Query: 151 KYINVLLLENTGLETTFQLWGLIAMALVLCGGMLMKDA------PAQSAASQQAESRDFT 204
+ L+ F + L G L+ ++ P + A S F
Sbjct: 148 PVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLAS--FR 204

Query: 205 LAEAMRKPQYWMLALMFLSACMSG----LYVIGVAKDIGEKMVDLPVLVAANAVAVIAMA 260
A M ++A+ F+ + L+VI + + +AA + +
Sbjct: 205 WARGMT-VVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGI----LH 259

Query: 261 NLSGRLVLGILSDKIPRIRVISLAQIITLVGMVLLLFI 298
+L+ ++ G ++ ++ R + L I G +LL F
Sbjct: 260 SLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFA 297


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
AYJ58_RS02500	HTHFIS	68	3e-15	FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 67.9 bits (166), Expect = 3e-15
Identities = 24/132 (18%), Positives = 59/132 (44%), Gaps = 6/132 (4%)

Query: 3 KAIIVEDEYLAREELE-YLVKSHSEIDIVASFEDGLEAFKYLQDHEVDVVFLDIQIPSID 61
++ +D+ R L L ++ ++ I ++ + + D+V D+ +P +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRW---IAAGDGDLVVTDVVMPDEN 61

Query: 62 GLLLAKNLHKSTHPPHVVFVTAHKEF--AVEAFELEAFDYILKPYNEPRIISLLQKIEQV 119
L + K+ V+ ++A F A++A E A+DY+ KP++ +I ++ +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 120 GRQAPKPQHEAA 131
++ P + +
Sbjct: 122 PKRRPSKLEDDS 133


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
AYJ58_RS02505	PF06580	203	1e-62	Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 203 bits (517), Expect = 1e-62
Identities = 60/205 (29%), Positives = 110/205 (53%), Gaps = 13/205 (6%)

Query: 345 EQLQEMTRKAEFTALQSKINPHFLFNALNAISSLIRIRPQQARELIANLADYLRYNLAKG 404
++ M ++A+ AL+++INPHF+FNALN I +LI P +ARE++ +L++ +RY+L
Sbjct: 152 WKMASMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKAREMLTSLSELMRYSLRYS 211

Query: 405 D-ELIDIQEEVKQVRDYVAIEQARFGDKLEVIFDVDD--VHFCVPCLLLQPLVENAILHG 461
+ + + +E+ V Y+ + +F D+L+ ++ + VP +L+Q LVEN I HG
Sbjct: 212 NARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHG 271

Query: 462 IQPRSAPGRVTIEVKKLDAGICVAVRDTGYGISQEVIDGVAAGRIESSSIGLTNVHQRVK 521
I G++ ++ K + + + V +TG ES+ GL NV +R++
Sbjct: 272 IAQLPQGGKILLKGTKDNGTVTLEVENTG--------SLALKNTKESTGTGLQNVRERLQ 323

Query: 522 LLYGE--GLQLKRLEPGTEVSFYLP 544
+LYG ++L + +P
Sbjct: 324 MLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
AYJ58_RS02520	HTHFIS	93	4e-24	FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 93.0 bits (231), Expect = 4e-24
Identities = 36/112 (32%), Positives = 60/112 (53%), Gaps = 1/112 (0%)

Query: 4 KVLVVDDEPQIHTFMRISLEAEGFEYISATSIATALKQYRSHQPHLIVLDLGLPDGDGIE 63
+LV DD+ I T + +L G++ ++ AT + + L+V D+ +PD + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 64 LLHALRQQDK-TPVLVLTARDQEEEKIRLLEAGANDYLSKPFGIRELIVRIK 114
LL +++ PVLV++A++ I+ E GA DYL KPF + ELI I
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
AYJ58_RS02525	NUCEPIMERASE	27	0.044	Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 27.4 bits (61), Expect = 0.044
Identities = 13/28 (46%), Positives = 16/28 (57%), Gaps = 1/28 (3%)

Query: 6 VIGLGRF-GVAVSRELIHLGHTVTGVDN 32
V G F G VS+ L+ GH V G+DN
Sbjct: 5 VTGAAGFIGFHVSKRLLEAGHQVVGIDN 32


7	AYJ58_RS03275	AYJ58_RS03315	N	Y	N	Pathogenicity Island (unbiased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
AYJ58_RS03275	-153973748600967	13	48.112908	glutamate 5-kinase
AYJ58_RS03280	-153973748600967	13	48.451644	sodium/proline symporter PutP
AYJ58_RS03285	-153973748600967	16	48.943788	FUSC family protein
AYJ58_RS03290	-153973748600967	20	48.499538	HlyD family secretion protein
AYJ58_RS03295	-153973748600967	20	48.383760	DUF1656 domain-containing protein
AYJ58_RS03300	-153973748600967	22	48.373984	TetR family transcriptional regulator
AYJ58_RS03305	-153973748600967	20	49.065934	alkene reductase
AYJ58_RS03310	-153973748600967	19	48.930971	SDR family NAD(P)-dependent oxidoreductase
AYJ58_RS03315	-153973748600967	19	48.390383	oxidoreductase
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
AYJ58_RS03275	CARBMTKINASE	46	1e-07	Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 46.0 bits (109), Expect = 1e-07
Identities = 33/126 (26%), Positives = 51/126 (40%), Gaps = 16/126 (12%)

Query: 116 KDTIFSLLEHGLL---------PIINENDAVTADKLKVGDNDNLSAMVAAAADADTLIIC 166
+TI L+E G++ P+I E+ + + V D D +A +AD +I
Sbjct: 176 AETIKKLVERGVIVIASGGGGVPVILEDGEIKGVE-AVIDKDLAGEKLAEEVNADIFMIL 234

Query: 167 SDVNGLYTQNPHENPDAQLIKQVNEINAEIYAMAGGASSAVGTGGMRTKIQAAKKAISHG 226
+DVNG + Q +++V Y G G M K+ AA + I G
Sbjct: 235 TDVNGAALY--YGTEKEQWLREVKVEELRKYYEEGH----FKAGSMGPKVLAAIRFIEWG 288

Query: 227 IETFII 232
E II
Sbjct: 289 GERAII 294


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
AYJ58_RS03290	RTXTOXIND	57	2e-11	Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 57.1 bits (138), Expect = 2e-11
Identities = 42/217 (19%), Positives = 85/217 (39%), Gaps = 39/217 (17%)

Query: 77 TRYKATIAELNAKAESQKLAWELAKHKYKRRIGLTNDNLVSKETFDEAFINTELARTSYE 136
YK+ + ++ ++ S K ++L +K I +L +T+
Sbjct: 269 RVYKSQLEQIESEILSAKEEYQLVTQLFKNEI------------------LDKLRQTTDN 310

Query: 137 LAQ--AQLNTAKIDLARTQIHAPENGTLINLSLR-NGNYVSKGNSVFSLV-KQDSLYITG 192
+ +L + + I AP + + L + G V+ ++ +V + D+L +T
Sbjct: 311 IGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTA 370

Query: 193 YFEETKIPLVHIGQNAEVSL----MSGGHVLHGKVVSIGKAIANTNVTTNGQLLPQIGQT 248
+ I +++GQNA + + + L GKV +I ++G
Sbjct: 371 LVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAI---------EDQRLGLV 421

Query: 249 FNWVRLSQRIPVDIQLDNIPKDIELSVGMTVSIQLQT 285
FN + I + K+I LS GM V+ +++T
Sbjct: 422 FNVII---SIEENCLSTGN-KNIPLSSGMAVTAEIKT 454



Score = 52.5 bits (126), Expect = 6e-10
Identities = 24/155 (15%), Positives = 57/155 (36%), Gaps = 9/155 (5%)

Query: 9 LTLIVVAVAGIAGYWIWSHYLYSPWTRDGRVRA--EIITIAPDVSGWVNQLNVKDNQVVN 66
+ ++ IA + T +G++ I P + V ++ VK+ + V
Sbjct: 60 VAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVR 119

Query: 67 KGDVLFTVDDTRYKATIAELNAKAESQKLA---WELAKHKYKR----RIGLTNDNLVSKE 119
KGDVL + +A + + +L +++ + + L ++
Sbjct: 120 KGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNV 179

Query: 120 TFDEAFINTELARTSYELAQAQLNTAKIDLARTQI 154
+ +E T L + + Q Q +++L + +
Sbjct: 180 SEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRA 214


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
AYJ58_RS03300	HTHTETR	59	4e-13	TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 58.9 bits (142), Expect = 4e-13
Identities = 25/63 (39%), Positives = 35/63 (55%)

Query: 1 MKVTKAQAQENREHIVKTASVLFRERGYDGVGIAELMSTAGFTHGGFYKHFKSKADLMAE 60
+ TK +AQE R+HI+ A LF ++G + E+ AG T G Y HFK K+DL +E
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 61 AAA 63

Sbjct: 62 IWE 64


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
AYJ58_RS03310	DHBDHDRGNASE	74	9e-18	2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 73.9 bits (181), Expect = 9e-18
Identities = 49/185 (26%), Positives = 90/185 (48%), Gaps = 2/185 (1%)

Query: 6 VLITGASTGIGAIYADRFARRGHDLVIVARDGAKLKALATRLRQEYRVNVDVLPADLTKS 65
ITGA+ GIG A A +G + V + KL+ + + L+ E R + + PAD+ S
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEAR-HAEAFPADVRDS 69

Query: 66 SDLA-IVEKRLQDDARIDTLINNAGIAQSGSFVEQTPDLIEKLIALNITALTRLANAVTP 124
+ + I + ++ ID L+N AG+ + G + + E ++N T + + +V+
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSK 129

Query: 125 RLVQQGKGAIVNVSSVVGLAPEYQMSVYGASKAFVLFLSQGMHQELSSKGIYVQALLPAG 184
++ + G+IV V S P M+ Y +SKA + ++ + EL+ I + P
Sbjct: 130 YMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGS 189

Query: 185 TYTEI 189
T T++
Sbjct: 190 TETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
AYJ58_RS03315	DHBDHDRGNASE	90	2e-23	2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 90.1 bits (223), Expect = 2e-23
Identities = 54/187 (28%), Positives = 89/187 (47%), Gaps = 10/187 (5%)

Query: 6 VALVTGASSGIGEATAIKLAAAGYRVYGTSRRG--------ATSSDARFP-MLALDVTND 56
+A +TGA+ GIGEA A LA+ G + + ++AR DV +
Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69

Query: 57 QSVDAAVTELLRLEGRIDLLVNNAGFGVAPAGAEESSIEQAKSILDTNFFGIVRMTRAIV 116
++D + R G ID+LVN AG + P S E+ ++ N G+ +R++
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGV-LRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 117 PYMRRQGAGRIINISSIIGVVPMPYVALYAASKHAVEGYSEALDHELRTRGIRVSLIEPA 176
YM + +G I+ + S VP +A YA+SK A +++ L EL IR +++ P
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 177 FTKTKFE 183
T+T +
Sbjct: 189 STETDMQ 195


8	AYJ58_RS04680	AYJ58_RS04710	N	Y	N	Pathogenicity Island (unbiased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
AYJ58_RS04680	-153973748600967	30	52.095808	OmpA family protein
AYJ58_RS04685	-153973748600967	29	52.138454	channel protein TolC
AYJ58_RS04690	-153973748600967	30	52.434021	HlyD family type I secretion periplasmic adaptor
AYJ58_RS04695	-153973748600967	31	52.741994	type I secretion system permease/ATPase
AYJ58_RS04700	-153973748600967	30	52.782928	type I secretion C-terminal target
AYJ58_RS04705	-153973748600967	12	48.189209	heme biosynthesis protein HemY
AYJ58_RS04710	-153973748600967	12	47.931748	uroporphyrinogen-III methylase
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
AYJ58_RS04680	OMPADOMAIN	90	7e-24	OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 90.4 bits (224), Expect = 7e-24
Identities = 33/118 (27%), Positives = 51/118 (43%), Gaps = 12/118 (10%)

Query: 77 SILFPNDSAYIAPEYYPQIEEVAVFLQQF--PTTKVTIEGHTSRTGTDERNLVLSQERAD 134
+LF + A + PE ++++ L V + G+T R G+D N LS+ RA
Sbjct: 220 DVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQ 279

Query: 135 AVTAVLAERFSIDRSRLTAIGYGPSRPVVLERTPEAEIR---------NRRVVAEVTG 183
+V L + I +++A G G S PV + R +RRV EV G
Sbjct: 280 SVVDYLISK-GIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEVKG 336


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
AYJ58_RS04690	RTXTOXIND	310	e-103	Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 310 bits (795), Expect = e-103
Identities = 87/431 (20%), Positives = 195/431 (45%), Gaps = 11/431 (2%)

Query: 29 RLIIWALAAMVVCFLLWAGFAKLDKVTTGSGKVIPSSQVQVIQSLDGGIMQELYVQEGEM 88
RL+ + + +V + + +++ V T +GK+ S + + I+ ++ I++E+ V+EGE
Sbjct: 58 RLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGES 117

Query: 89 VTKGQPLVRIDDTRFRSDFAQQEQEVLGLKTNAIRMRAELDSILISDMTSDWREQVLVTK 148
V KG L+++ +D + + +L + R + SI E + +
Sbjct: 118 VRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSI----------ELNKLPE 167

Query: 149 KALVFPESIITAEPALVKRQQEEYNGRLDNLSNQLEILVRQIQQRQQEIDDLASKTTTLT 208
L V R + NQ + +++ E + ++
Sbjct: 168 LKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYE 227

Query: 209 TSMQLISRELELTRPLAKKGIVPEVELLKLERTVNDLQGELNSMRLLRPKVKAAMDEAIL 268
++ L+ L K + + +L+ E + EL + ++++ + A
Sbjct: 228 NLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKE 287

Query: 269 KRREAVFVYAADLRAQLNETQTRLSRMNEAQVGAQDKVSKAIITSPVNGTIKTTHINTLG 328
+ + ++ ++ +L +T + + +++ ++I +PV+ ++ ++T G
Sbjct: 288 EYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEG 347

Query: 329 GVVQPGVDIIEIVPSEDQLLIETKILPKDIAFLHPGLPAVVKITAYDFTRYGGLKGTVEH 388
GVV ++ IVP +D L + + KDI F++ G A++K+ A+ +TRYG L G V++
Sbjct: 348 GVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKN 407

Query: 389 ISADTSQDEEGNSFYLIRVRTAESSLTKNDGTQMPIIPGMLTSVDVITGQRSILEYILNP 448
I+ D +D+ + + + E+ L+ +P+ GM + ++ TG RS++ Y+L+P
Sbjct: 408 INLDAIEDQRLGLVFNVIISIEENCLST-GNKNIPLSSGMAVTAEIKTGMRSVISYLLSP 466

Query: 449 ILRAKDTALRE 459
+ + +LRE
Sbjct: 467 LEESVTESLRE 477


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
AYJ58_RS04700	CABNDNGRPT	86	2e-18	NodO calcium binding signature.
		>CABNDNGRPT#NodO calcium binding signature.

Length = 479

Score = 86.2 bits (213), Expect = 2e-18
Identities = 53/228 (23%), Positives = 75/228 (32%), Gaps = 16/228 (7%)

Query: 7809 PYDTDGNVQTNIDADDLADAILGDQVNAIPGSDRFDGGAGDDVIFGDAIHFAGISGQGYA 7868
+T + + + D I Q + G++ F S
Sbjct: 231 ENETGADYNGHYGGAPMIDDIAAIQ--RLYGANMTTRTGDSVYGFN--------SNTDRD 280

Query: 7869 AIKAYVADKLGINEASDAQVHRYISEHTDEFDQS--GTNDKADILFGGAGNDILFGQGGN 7926
A + K I DA +Q + G GN +
Sbjct: 281 FYTATDSSKALIFSVWDAGGTDTFDFSGYSNNQRINLNEGSFSDVGGLKGNVSIAHGVTI 340

Query: 7927 DYLDGGAGKDTLYGGTGNDTLIGGAGNDTLIGGLGNDVLRGDGGADTFVWRYADADK--G 7984
+ GG+G D L G + ++ L GGAGND L GG G D L G G DTFV+
Sbjct: 341 ENAIGGSGNDILVGNSADNILQGGAGNDVLYGGAGADTLYGGAGRDTFVYGSGQDSTVAA 400

Query: 7985 TDHIVDFNVNEDKLDLSDLLQGETANTLDEYLSFRLDNGSTVIDIDAN 8032
D I DF DK+DLS + + F ++ DA
Sbjct: 401 YDWIADFQKGIDKIDLSAFRNEGQLSFVQ--DQFTGKGQEVMLQWDAA 446



Score = 49.2 bits (117), Expect = 5e-07
Identities = 31/89 (34%), Positives = 40/89 (44%), Gaps = 3/89 (3%)

Query: 7254 GDFTTAPFNTGTRTIDNTSGQDQLLGTGGNDHLVSANGGGDLLDGMDGDDILVGSDAVQG 7313
F + ++ R N + G GN + + G G+DILVG+ A
Sbjct: 302 DTFDFSGYSNNQRINLNEGSFSDVGGLKGNVSIAHGVTI-ENAIGGSGNDILVGNSA--D 358

Query: 7314 DSLYGGTGNDVLVAGLGNDGLYGDAGTDI 7342
+ L GG GNDVL G G D LYG AG D
Sbjct: 359 NILQGGAGNDVLYGGAGADTLYGGAGRDT 387



Score = 45.3 bits (107), Expect = 1e-05
Identities = 31/120 (25%), Positives = 46/120 (38%), Gaps = 3/120 (2%)

Query: 7221 ADKPVVNVILTDNGIPLYSNFKTSGITTEQFRTGDFTTAPFNTGTRTIDNTSGQDQLLGT 7280
+ K ++ + G + S G F+ G +I + + +G
Sbjct: 287 SSKALIFSVWDAGGTDTFDFSGYSNNQRINLNEGSFSDVGGLKGNVSIAHGVTIENAIGG 346

Query: 7281 GGNDHLVSANGGGDLLDGMDGDDILVGSDAVQGDSLYGGTGNDVLVAGLGNDGLYGDAGT 7340
GND LV N ++L G G+D+L G D+LYGG G D V G G D
Sbjct: 347 SGNDILV-GNSADNILQGGAGNDVLYGGAG--ADTLYGGAGRDTFVYGSGQDSTVAAYDW 403



Score = 38.0 bits (88), Expect = 0.001
Identities = 23/101 (22%), Positives = 42/101 (41%), Gaps = 8/101 (7%)

Query: 7264 GTRTIDNTSGQDQLLGTGGNDHLVSANGGGDLLD-----GMDGDDILVGSDAVQGDSLYG 7318
++ + +D T + L+ + D G + + ++ D + G
Sbjct: 269 SVYGFNSNTDRDFYTATDSSKALIFSVWDAGGTDTFDFSGYSNNQRINLNEGSFSD-VGG 327

Query: 7319 GTGNDVLVAGLGNDGLYGDAGTDIAVLLGNRADYIIEKGAG 7359
GN + G+ + G +G DI L+GN AD I++ GAG
Sbjct: 328 LKGNVSIAHGVTIENAIGGSGNDI--LVGNSADNILQGGAG 366


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS04710  RTXTOXIND  29  0.026  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.4 bits (66), Expect = 0.026
Identities = 14/77 (18%), Positives = 31/77 (40%), Gaps = 6/77 (7%)

Query: 81 FMLYQQMQQQLLAQDAKNIALQDQLQQALLQPNQRIGQLEQQQLNDAKT-----YQELTK 135
F +Q + Q K A + A + + + ++E+ +L+D +
Sbjct: 195 FSTWQNQKYQKELNLDKKRA-ERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHA 253

Query: 136 LAEDQNQLQDRVNKLAQ 152
+ E +N+ + VN+L
Sbjct: 254 VLEQENKYVEAVNELRV 270



Score = 28.6 bits (64), Expect = 0.047
Identities = 9/72 (12%), Positives = 27/72 (37%), Gaps = 5/72 (6%)

Query: 81 FMLYQQMQQQLLAQDAKNIALQDQLQQALLQPNQRIGQLEQQQLNDAKTYQELTKLAEDQ 140
+ + + + + + +Q++ +L + + Q N+ L KL +
Sbjct: 254 VLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEI-----LDKLRQTT 308

Query: 141 NQLQDRVNKLAQ 152
+ + +LA+
Sbjct: 309 DNIGLLTLELAK 320


9  AYJ58_RS05115  AYJ58_RS05150  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
AYJ58_RS05115-1539737486009671450.042151pseudouridine synthase
AYJ58_RS05120-1539737486009671149.619358hypothetical protein
AYJ58_RS05125-1539737486009671249.622101alkaline phosphatase family protein
AYJ58_RS05130-1539737486009671449.100014SDR family NAD(P)-dependent oxidoreductase
AYJ58_RS05135-1539737486009671348.331857sensor histidine kinase
AYJ58_RS05140-1539737486009671646.879287two-component system response regulator
AYJ58_RS05145-1539737486009672046.557601PKD domain-containing protein
AYJ58_RS05150-1539737486009671845.967330short chain dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS05115  TYPE3IMSPROT  28  0.044  Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 28.2 bits (63), Expect = 0.044
Identities = 15/61 (24%), Positives = 25/61 (40%), Gaps = 1/61 (1%)

Query: 126 GIDTIAPAHRLDRETAGVILMTVAPETRALYHQLFIDDAI-RKDYQAIAKLTPEIIQQYQ 184
D R E GV ++ P RALY +D I + +A A++ + +Q
Sbjct: 287 YTDAQVQTVRKIAEEEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNI 346

Query: 185 Q 185
+
Sbjct: 347 E 347


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS05130  NUCEPIMERASE  73  6e-17  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 73.3 bits (180), Expect = 6e-17
Identities = 44/182 (24%), Positives = 71/182 (39%), Gaps = 24/182 (13%)

Query: 3 KVMVTGATGLLGRAVVKQLELTGHEVV-----------------ATGFSRASERVHKLDL 45
K +VTGA G +G V K+L GH+VV ++ + HK+DL
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 46 TAPLAVEKFIAREQPQVIVHCAAERRPDVSEQNPQAALALNLTASQALAKAAKANN-AWL 104
+ A + + S +NP A NLT + + + N L
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121

Query: 105 IYISTDYVFDGTQ--PKYAEDAATHPVNFYGESKLKGEEIVLDTSADFAV----LRLPIL 158
+Y S+ V+ + P +D+ HPV+ Y +K E + S + + LR +
Sbjct: 122 LYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFTV 181

Query: 159 YG 160
YG
Sbjct: 182 YG 183


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS05140  HTHFIS  93  2e-24  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 92.6 bits (230), Expect = 2e-24
Identities = 26/129 (20%), Positives = 64/129 (49%)

Query: 3 RLLIVEDDLSLASILGRRLTRHGFECRLTHDASDALLVAREFRPSHILLDMKLAEANGLG 62
+L+ +DD ++ ++L + L+R G++ R+T +A+ ++ D+ + + N
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 63 LIVPLRNLLPKVTMVLLTGYASIATAVEAIRLGADNYLAKPVDTQTLLAALEMQGHSYTL 122
L+ ++ P + +++++ + TA++A GA +YL KP D L+ +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 123 QEDEVDDSP 131
+ +++D
Sbjct: 125 RPSKLEDDS 133



Score = 47.1 bits (112), Expect = 9e-09
Identities = 13/39 (33%), Positives = 22/39 (56%)

Query: 135 KRLEWEHIQQVLNANQGNVSATARQLGMHRRTLQRKLLK 173
+E+ I L A +GN A LG++R TL++K+ +
Sbjct: 434 AEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRE 472


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS05145  THERMOLYSIN  363  e-119  Thermolysin metalloprotease (M4) family signature.
		>THERMOLYSIN#Thermolysin metalloprotease (M4) family signature.

Length = 544

Score = 363 bits (933), Expect = e-119
Identities = 134/490 (27%), Positives = 192/490 (39%), Gaps = 51/490 (10%)

Query: 44 SQFNL--DAGSQLKVEKKLDLGQGKQKQRLQQYFHDVPVYGFSVATSQSSMGFYSDMSGR 101
+ F L A +L + G R +Q G + + S +SG
Sbjct: 64 NTFQLGGQARERLSLIGNKLDELGHTVMRFEQAIAASLCMGAVLVAHVNDGELSS-LSGT 122

Query: 102 VLKNIEKSADFVKPTLTANKALDIAIRGKSEK-AVAGLKAENKQAKLWLYLDDAAKTRLV 160
++ N++K + ++ +A IA + +++ AE + + D RL
Sbjct: 123 LIPNLDKRTLKTEAAISIQQAEMIAKQDVADRVTKERPAAEEGKPTRLVIYPDEETPRLA 182

Query: 161 YVTSFVVYGDEPSRPFTMIDAHSGEVLKRWEGINHA-ASGTGPGGNIKTGQYEYGTDFSY 219
Y + P MIDA G+VL +W ++ A G P T G
Sbjct: 183 YEVNVRFLTPVPGNWIYMIDAADGKVLNKWNQMDEAKPGGAQPVAGTSTVGVGRG----- 237

Query: 220 LDVEVSGDT---CTMNSPNVKTVNLNGATSGATAFSYTCPRNTV-----------KEING 265
V GD T S L T G+ F+Y TV +
Sbjct: 238 ----VLGDQKYINTTYSSYYGYYYLQDNTRGSGIFTYDGRNRTVLPGSLWADGDNQFFAS 293

Query: 266 AYSPLNDAHYFGNVIYNMYSEWYN---TAPLTFQLTMRVHYSSNYENAFWDGSAMTFGDG 322
+ DAHY+ V+Y+ Y + + VHY Y NAFW+GS M +GDG
Sbjct: 294 YDAAAVDAHYYAGVVYDYYKNVHGRLSYDGSNAAIRSTVHYGRGYNNAFWNGSQMVYGDG 353

Query: 323 -ATTFYPLV-SLDVSAHEVSHGFTEQNSGLIYDAQSGGMNEAFSDMAGEAAEFYMHGTND 380
TF P +DV HE++H T+ +GL+Y +SG +NEA SD+ G EFY + D
Sbjct: 354 DGQTFLPFSGGIDVVGHELTHAVTDYTAGLVYQNESGAINEAMSDIFGTLVEFYANRNPD 413

Query: 381 WLVGADIFK---GNGALRYMADPTLDGISIGHIDDYYDGID---VHHSSGVFNKAFYTLA 434
W +G DI+ ALR M+DP G + Y D VH +SG+ NKA Y L+
Sbjct: 414 WEIGEDIYTPGVAGDALRSMSDPAKYGDPDHYSKRYTGTQDNGGVHTNSGIINKAAYLLS 473

Query: 435 N--------LPGWDTRKAFQTFVVANQLYWTADSLFWQGACGVKSAATDLG----LSADD 482
+ G K + F A Y T S F Q AA DL +
Sbjct: 474 QGGVHYGVSVTGIGRDKMGKIFYRALVYYLTPTSNFSQLRAACVQAAADLYGSTSQEVNS 533

Query: 483 VVTAFAAVGI 492
V AF AVG+
Sbjct: 534 VKQAFNAVGV 543


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS05150  DHBDHDRGNASE  58  3e-12  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 58.1 bits (140), Expect = 3e-12
Identities = 44/241 (18%), Positives = 89/241 (36%), Gaps = 47/241 (19%)

Query: 3 VLIVGGSGGIGQAMVKRVQEAYPNATVHATYRHHLPQDRQNNIQW----------YQLDV 52
I G + GIG+A V + H + P+ + + + DV
Sbjct: 11 AFITGAAQGIGEA----VARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADV 66

Query: 53 TNEVEIKQLSEQLTE----LDWIINCVGILHTQDKGPEKSLQSLDIDFFQHNLTLNTLPS 108
+ I +++ ++ +D ++N G+L G + SL + ++ ++N+
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRP---GL---IHSLSDEEWEATFSVNSTGV 120

Query: 109 VMLAKHFCHALKQSDSARFAVISAKVDSITDNRLGGWYSYRASKAALNMFLKTLAIEWQR 168
++ + S + + + + +Y +SKAA MF K L +E
Sbjct: 121 FNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMA---AYASSKAAAVMFTKCLGLELAE 177

Query: 169 TMKHCVVLSLHPGTTDTPLSQP------------------FQQSVPKGKLFTPEYVANCL 210
C ++S PG+T+T + F+ +P KL P +A+ +
Sbjct: 178 YNIRCNIVS--PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAV 235

Query: 211 L 211
L
Sbjct: 236 L 236


10  AYJ58_RS05360  AYJ58_RS05460  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
AYJ58_RS05360-1539737486009671647.581990fimbrial assembly protein
AYJ58_RS05365-1539737486009671448.024409MSHA biogenesis protein MshJ
AYJ58_RS05370-1539737486009671548.491228MSHA biogenesis protein MshK
AYJ58_RS05375-1539737486009671548.444992pilus (MSHA type) biogenesis protein MshL
AYJ58_RS05380-1539737486009671548.773449DUF2075 domain-containing protein
AYJ58_RS05385-1539737486009671648.268176hypothetical protein
AYJ58_RS05390-1539737486009671647.363243type II/IV secretion system protein
AYJ58_RS05395-1539737486009671946.435897type II secretion system F family protein
AYJ58_RS05400-1539737486009672345.369584MSHA biogenesis protein MshF
AYJ58_RS05405-1539737486009672446.091954prepilin-type N-terminal cleavage/methylation
AYJ58_RS05410-1539737486009672146.748152type II secretion system protein
AYJ58_RS05415-1539737486009671847.190855type II secretion system protein
AYJ58_RS05420-1539737486009671747.980782type II secretion system protein
AYJ58_RS05425-1539737486009671747.970043type II secretion system protein
AYJ58_RS05430-1539737486009671648.342842MSHA biogenesis protein MshP
AYJ58_RS05435-1539737486009671448.107274rod shape-determining protein
AYJ58_RS05440-1539737486009671248.178047rod shape-determining protein MreC
AYJ58_RS05445-1539737486009671248.482759rod shape-determining protein MreD
AYJ58_RS05450-1539737486009671248.467936septum formation inhibitor Maf
AYJ58_RS05455-1539737486009671249.144158ribonuclease G
AYJ58_RS05460-1539737486009671349.689856TIGR02099 family protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS05360  PF06580  29  0.015  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 28.7 bits (64), Expect = 0.015
Identities = 11/53 (20%), Positives = 20/53 (37%), Gaps = 2/53 (3%)

Query: 25 LGAYAAGFLVLFAALGGYSYWQVSELQQAQQLAAQQ--KLQFDTQKQALEAQI 75
L +V F Y W + + ++ + + + Q AL+AQI
Sbjct: 118 LSIIFNVVVVTFMWSLLYFGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQI 170


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS05375  BCTERIALGSPD  177  3e-50  Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 177 bits (450), Expect = 3e-50
Identities = 72/293 (24%), Positives = 129/293 (44%), Gaps = 26/293 (8%)

Query: 257 PQAGLVTIRAFPSELRQVRTFLNSAESHLQRQVILEAKILEVTLSDGYQQGIQWDNVLGH 316
Q + + A P + + + + + QV++EA I EV +DG GIQW N
Sbjct: 316 GQTNALIVTAAPDVMNDLERVIAQLDIR-RPQVLVEAIIAEVQDADGLNLGIQWANKNAG 374

Query: 317 VGN-------TNVNFGTSKGPGLNDKITSAIGGVTS------LSIKGSDFTTMINLLDTQ 363
+ + + + ++S++ S ++ ++ L +
Sbjct: 375 MTQFTNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQGNWAMLLTALSSS 434

Query: 364 GDVDVLSSPRVTASNNQKAVIKVGTDEYFVTDVSSTTVAGTTPVTTPQVELTPFFSGIAL 423
D+L++P + +N +A VG + +T S T +G T + + GI L
Sbjct: 435 TKNDILATPSIVTLDNMEATFNVGQEVPVLT--GSQTTSGDNIFNTVERKTV----GIKL 488

Query: 424 DVTPQIDNDGNVLLHVHPSVIDVKEQTKDIKVSDASLELPLAQSEIRESDTVIRAASGDV 483
V PQI+ +VLL + V V + S S +L + R + + SG+
Sbjct: 489 KVKPQINEGDSVLLEIEQEVSSVADAA-----SSTSSDLGATFN-TRTVNNAVLVGSGET 542

Query: 484 VIIGGLMKSENIEVVSQVPLLGDIPFLGELFKNRSKQNKKTELIILLKPTVVG 536
V++GGL+ + +VPLLGDIP +G LF++ SK+ K L++ ++PTV+
Sbjct: 543 VVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIR 595


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS05385  SYCDCHAPRONE  29  0.022  Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 29.1 bits (65), Expect = 0.022
Identities = 10/70 (14%), Positives = 28/70 (40%)

Query: 339 GQFSLSEQAYRQLLQQEPQQAKWWMGLAYALDSQQQFPQARQAYRTALGHRGLSAQASAF 398
G++ + + ++ L + +++++GL + Q+ A +Y +
Sbjct: 50 GKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFH 109

Query: 399 IEQRLTQLGD 408
+ L Q G+
Sbjct: 110 AAECLLQKGE 119


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS05395  BCTERIALGSPF  302  e-102  Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 302 bits (775), Expect = e-102
Identities = 116/407 (28%), Positives = 207/407 (50%), Gaps = 6/407 (1%)

Query: 1 MPIYQYRGRSGQGQSVTGQLDAASESAAADMLLARGIIPLEVKVAKVVK----SFSLAKL 56
M Y Y+ QG+ G +A S A +L RG++PL V + + S L+
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 57 FGGKVALEELQIFTRQMYSLTRSGIPILRAIAGLSETAHSQRMKDALNDISEQLTAGRPL 116
+++ +L + TRQ+ +L + +P+ A+ +++ + + + + ++ G L
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 117 SSSMNQHPDVFDSLFVSMVHVGENTGKLEDAFIQLSGYIEREQETRRRIKSAMRYPMFVL 176
+ +M P F+ L+ +MV GE +G L+ +L+ Y E+ Q+ R RI+ AM YP VL
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPC-VL 179

Query: 177 IAIALAMV-ILNIMVIPKFAEMFSRFGADLPWATKVLIGTSNLFVNYWALMLVALIGTII 235
+A+A+V IL +V+PK E F LP +T+VL+G S+ + ML+AL+ +
Sbjct: 180 TVVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFM 239

Query: 236 GIRYWHHTEKGEKQWDKWKLHIPAVGSIIERSTLARYCRSFSMMLSAGVPMTQALSLVAD 295
R EK + + LH+P +G I ARY R+ S++ ++ VP+ QA+ + D
Sbjct: 240 AFRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGD 299

Query: 296 AVDNAYMHDKIVGMRRGIESGDSMLRVSNQSKLFTPLVLQMVAVGEETGQIDQLLNDAAD 355
+ N Y ++ + G S+ + Q+ LF P++ M+A GE +G++D +L AAD
Sbjct: 300 VMSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAAD 359

Query: 356 FYEGEVDYDLKNLTAKLEPILIGFVAVIVLVLALGIYLPMWDMLNVV 402
+ E + EP+L+ +A +VL + L I P+ + ++
Sbjct: 360 NQDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS05405  BCTERIALGSPG  44  6e-08  Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 43.7 bits (103), Expect = 6e-08
Identities = 15/36 (41%), Positives = 27/36 (75%)

Query: 4 KQTGFSLIELVIVIVILGLLAATAIPRFLNVTDDAE 39
KQ GF+L+E+++VIVI+G+LA+ +P + + A+
Sbjct: 6 KQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKAD 41


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS05410  BCTERIALGSPG  47  2e-09  Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 47.2 bits (112), Expect = 2e-09
Identities = 18/53 (33%), Positives = 32/53 (60%), Gaps = 4/53 (7%)

Query: 1 MKRQQGFTLIELVVVIIILGILAVTAAPKFINLQGDAR----VSALNGLKASI 49
+Q+GFTL+E++VVI+I+G+LA P + + A VS + L+ ++
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENAL 56


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS05415  BCTERIALGSPG  40  8e-07  Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 39.9 bits (93), Expect = 8e-07
Identities = 14/33 (42%), Positives = 22/33 (66%)

Query: 1 MSKQAGFTLVELVTTIILISILAVVVLPRLFTQ 33
KQ GFTL+E++ I++I +LA +V+P L
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGN 36


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS05420  BCTERIALGSPH  37  2e-05  Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 36.8 bits (85), Expect = 2e-05
Identities = 16/58 (27%), Positives = 33/58 (56%), Gaps = 6/58 (10%)

Query: 23 QQGFTLIELVIGMLVIAIAIVMLTSMLFPQA--DRAAKTLHRVKSA-ELA--HSVMNE 75
Q+GFTL+E+++ +L++ ++ M+ + FP + D AA+TL R ++ +
Sbjct: 3 QRGFTLLEMMLILLLMGVSAGMVL-LAFPASRDDSAAQTLARFEAQLRFVQQRGLQTG 59


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS05425  BCTERIALGSPG  31  0.004  Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 30.6 bits (69), Expect = 0.004
Identities = 10/18 (55%), Positives = 17/18 (94%)

Query: 14 QGFTLVEMVTVILILGIL 31
+GFTL+E++ VI+I+G+L
Sbjct: 8 RGFTLLEIMVVIVIIGVL 25


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS05440  SHAPEPROTEIN  558  0.0  Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 558 bits (1440), Expect = 0.0
Identities = 315/348 (90%), Positives = 332/348 (95%), Gaps = 1/348 (0%)

Query: 1 MFKKLRGIFSNDLSIDLGTANTLIYVRGEGIVLNEPSVVAIRGERSGSGQKSVAAVGTEA 60
M KK RG+FSNDLSIDLGTANTLIYV+G+GIVLNEPSVVAIR +R KSVAAVG +A
Sbjct: 1 MLKKFRGMFSNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDR-AGSPKSVAAVGHDA 59

Query: 61 KQMLGRTPGNIQAIRPMKDGVIADFYVTEKMLQHFIKQVHNNSFFRPSPRVLVCVPVGAT 120
KQMLGRTPGNI AIRPMKDGVIADF+VTEKMLQHFIKQVH+NSF RPSPRVLVCVPVGAT
Sbjct: 60 KQMLGRTPGNIAAIRPMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGAT 119

Query: 121 QVERRAIRESAMGAGAREVYLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAIISLN 180
QVERRAIRESA GAGAREV+LIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVA+ISLN
Sbjct: 120 QVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLN 179

Query: 181 GVVYSSSVRIGGDKFDDAIINYVRRNYGSLIGEATAERIKHTIGTAYPGDEVLEIEVRGR 240
GVVYSSSVRIGGD+FD+AIINYVRRNYGSLIGEATAERIKH IG+AYPGDEV EIEVRGR
Sbjct: 180 GVVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGR 239

Query: 241 NLAEGVPRSFTLNSNEILEALQEPLSGIVSAVMVALEQSPPELASDISERGMVLTGGGAL 300
NLAEGVPR FTLNSNEILEALQEPL+GIVSAVMVALEQ PPELASDISERGMVLTGGGAL
Sbjct: 240 NLAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGAL 299

Query: 301 LRDLDRLLMQETGIPVMVADDPLTCVARGGGKALEMIDMHGGDLFSEE 348
LR+LDRLLM+ETGIPV+VA+DPLTCVARGGGKALEMIDMHGGDLFSEE
Sbjct: 300 LRNLDRLLMEETGIPVVVAEDPLTCVARGGGKALEMIDMHGGDLFSEE 347


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS05445  IGASERPTASE  32  0.005  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 31.6 bits (71), Expect = 0.005
Identities = 21/97 (21%), Positives = 35/97 (36%), Gaps = 6/97 (6%)

Query: 237 EVLTEDGQSYARVTAQPLAALDRIRYVLLIWPSPDSGVTLPNQPTAPAADQSLSEEASKI 296
+V TE Q +VT+Q ++ V P + N PT + S+ +
Sbjct: 1113 KVETEKTQEVPKVTSQVSPKQEQSETVQ-----PQAEPARENDPTV-NIKEPQSQTNTTA 1166

Query: 297 GSASPAEGTSAETTKPVTTPAATVAKPATETTSPATE 333
+ PA+ TS+ +PVT + T
Sbjct: 1167 DTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTT 1203


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS05465  PF03544  36  7e-04  Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 36.1 bits (83), Expect = 7e-04
Identities = 17/115 (14%), Positives = 32/115 (27%), Gaps = 11/115 (9%)

Query: 1283 ILPVVGVDRPAAPTEAASHNTDITQPVETKSEPQLQ-----EAQSESLELQDIKPEEARS 1337
+L P A + + P + + +Q + E +P +
Sbjct: 32 LLYTSVHQVIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAP 91

Query: 1338 EESKSEEPKPATDQMKAPDLKLVQPQATP------SPEAPNTPATPENQPAVEPI 1386
+ +PKP ++ + P SP PA P + A
Sbjct: 92 VVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAAT 146


11  AYJ58_RS05750  AYJ58_RS05785  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
AYJ58_RS05750-1539737486009671749.743410prepilin-type N-terminal cleavage/methylation
AYJ58_RS05755-1539737486009671749.648935ATP-dependent RNA helicase
AYJ58_RS05760-1539737486009671249.623170peptidase M28
AYJ58_RS05765-1539737486009671249.087199excinuclease ABC subunit UvrA
AYJ58_RS05770-1539737486009671547.489511MFS transporter
AYJ58_RS05775-1539737486009671547.259412single-stranded DNA-binding protein
AYJ58_RS05780-1539737486009671646.439201insulinase family protein
AYJ58_RS05785-1539737486009671546.462264multidrug transporter MdtL
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS05750  BCTERIALGSPG  48  8e-10  Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 48.3 bits (115), Expect = 8e-10
Identities = 17/57 (29%), Positives = 29/57 (50%), Gaps = 3/57 (5%)

Query: 7 KRVQGFTLIELVVVIIILGILAVIAAPKFMNLQRDAKVNAVKGYLGQMQDMTKMLHM 63
+ +GFTL+E++VVI+I+G+LA + P M + A + + L M
Sbjct: 5 DKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAV---SDIVALENALDM 58


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS05770  TCRTETB  85  5e-20  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 85.3 bits (211), Expect = 5e-20
Identities = 73/358 (20%), Positives = 139/358 (38%), Gaps = 48/358 (13%)

Query: 48 LWVGIAIGAYGLTQAVLQIPMGILSDKYGRKPIILIGLVLFAIGSLIAANADSIYGV-VF 106
WV A+ LT ++ G LSD+ G K ++L G+++ GS+I S + + +
Sbjct: 52 NWV---NTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIM 108

Query: 107 GRAVQGMGAIA--AAVLALAADLTRDEQRTKVMAIIGMCIGGSFALSLLVGPIVAQHVGL 164
R +QG GA A A V+ + A E R K +IG + + +G ++A ++
Sbjct: 109 ARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHW 168

Query: 165 SGLFFLTAILAVTGMLIVQFLVPNPISHAP---KGDTLATPARLKRML-------TDPQL 214
S L + I +T +++ L KG L + + ML + +
Sbjct: 169 SYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIV 228

Query: 215 FRLDAGIFILHL-----------------VLTAVFVALPLDLVDAGLVKEKHWMLYF--- 254
L IF+ H+ + V + AG V +M+
Sbjct: 229 SVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQ 288

Query: 255 --PAFVGAFFL---MVPLIIIG------VKRKNTKAMFQIALVIMIVALLAMAAFSN-NL 302
A +G+ + + +II G V R+ + I + + V+ L +
Sbjct: 289 LSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTS 348

Query: 303 WVLSFAVVLFFTGFNYLEASLPSLIAKFCPVGEKGSAMGVYSTSQFLGAFCGGMLGGG 360
W ++ +V G ++ + + ++++ E G+ M + + + FL G + GG
Sbjct: 349 WFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGG 406



Score = 30.2 bits (68), Expect = 0.016
Identities = 18/111 (16%), Positives = 42/111 (37%)

Query: 274 RKNTKAMFQIALVIMIVALLAMAAFSNNLWVLSFAVVLFFTGFNYLEASLPSLIAKFCPV 333
+ K + ++I + + +L A + G A + ++A++ P
Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134

Query: 334 GEKGSAMGVYSTSQFLGAFCGGMLGGGAFQLVGAVGVFIVAVILMSIWLFL 384
+G A G+ + +G G +GG + + ++ +I + FL
Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFL 185


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS05775  PF03544  29  0.013  Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 29.2 bits (65), Expect = 0.013
Identities = 18/98 (18%), Positives = 33/98 (33%), Gaps = 4/98 (4%)

Query: 125 PMGGGMPQNAGYQSAPQQAAPAQNQYAPAPQAAPAYQAPAQQQYAAPAPAQQQYGQQQAQ 184
P+ M A + P + P P+ P + P + P + + +
Sbjct: 49 PISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKP----KPKPK 104

Query: 185 PQQGGYAPKPQAAPAPAYQAPAAPAQRPAPQPQQNFTP 222
P+ +P+ P PA+P + AP + T
Sbjct: 105 PKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTA 142


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS05790  TCRTETA  61  2e-12  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 61.4 bits (149), Expect = 2e-12
Identities = 66/306 (21%), Positives = 114/306 (37%), Gaps = 16/306 (5%)

Query: 12 VLLYPTGIDMYLVGLPQIAQDLGASEAQLHIAFSVYLGGMAATML----FAGSIADRLGR 67
V L GI + + LP + +DL S + + + L A G+++DR GR
Sbjct: 14 VALDAVGIGLIMPVLPGLLRDLVHSN-DVTAHYGILLALYALMQFACAPVLGALSDRFGR 72

Query: 68 KPVTLFGAIVFALASYLGGMAETSNQFLLIRFMQGIGAGSCYVVAFAILRDTLDDKRRAK 127
+PV L A+ + A + R + GI G+ VA A + D D RA+
Sbjct: 73 RPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGI-TGATGAVAGAYIADITDGDERAR 131

Query: 128 VLSMINGITCIIPVIAPVIGHVIMLKFPWQSLFSTMAGMGLLVCLLCIFILRETKPAQSR 187
++ V PV+G + M F + F A + L L F+L E+ + R
Sbjct: 132 HFGFMSACFGFGMVAGPVLGGL-MGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERR 190

Query: 188 IEDVNRQLAPQESFRQSFFLSRLVMTTLGVTTILSYVNVSPMLIMGQMGFDRGQYSNVM- 246
L P SFR + + +V + V I+ V P + G DR +
Sbjct: 191 -PLRREALNPLASFRWARGM-TVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTI 248

Query: 247 -----ALTALVSMLVSFSTPFALAVLKEKSLMLISQLLFAAAALVFVLTQVQYFSLNVNL 301
A L S+ + T A L E+ +++ ++ + + + + +
Sbjct: 249 GISLAAFGILHSLAQAMITGPVAARLGERRALMLG-MIADGTGYILLAFATRGWMAFPIM 307

Query: 302 LGFGFV 307
+
Sbjct: 308 VLLASG 313


12  AYJ58_RS05900  AYJ58_RS05930  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
AYJ58_RS05900-1539737486009671649.208599AcrB/AcrD/AcrF family protein
AYJ58_RS05905-1539737486009671448.178195efflux RND transporter periplasmic adaptor
AYJ58_RS05910-1539737486009671747.340426LysR family transcriptional regulator
AYJ58_RS05915-1539737486009672047.416805MATE family efflux transporter
AYJ58_RS05920-1539737486009672546.624023molecular chaperone GroES
AYJ58_RS05925-1539737486009672546.126761chaperonin GroEL
AYJ58_RS05930-1539737486009672639.400328group II intron reverse transcriptase/maturase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS05900  ACRIFLAVINRP  492  e-159  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 492 bits (1267), Expect = e-159
Identities = 213/1050 (20%), Positives = 441/1050 (42%), Gaps = 60/1050 (5%)

Query: 3 IAEYSIRHKVISWMFVLLLLVGGGVSFTGLGQLEFPEFTIKEALVITAYPGASPEQVEEE 62
+A + IR + +W+ ++L++ G ++ L ++P V YPGA + V++
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 63 VTLPLEDALQQLDAIKHVTSI-NSAGLSQIQIEIKESYDKTSLPQVWDEVRRKVNDTAGS 121
VT +E + +D + +++S +SAG I + + T +V+ K+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQ---SGTDPDIAQVQVQNKLQLATPL 117

Query: 122 LPPGTTAPQVMDDFGD---VYGILFNLSGPDYSNRELSNYAD-YLRRELVLVPGVKKVSV 177
LP + + + F P + ++S+Y ++ L + GV V +
Sbjct: 118 LPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQL 177

Query: 178 AGNVTEQVVIEISQQKLSALGLDQSYIYGLVNNQNVVSNAGSLVIGDN------RIRIHP 231
G + I + L+ L + + QN AG L I
Sbjct: 178 FGA-QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIA 236

Query: 232 TGEFSNVQDLARLIVSPPGSTELIYLGDIANIEKDYDETPNVLYHNKGEAALSLGISFSS 291
F N ++ ++ + ++ L D+A +E + + N G+ A LGI ++
Sbjct: 237 QTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARIN-GKPAAGLGIKLAT 295

Query: 292 GVNVVEVGQKVSNRLAELESQRPIGMNLATVYNQSQAVDETVNGFLINLLESIAIVIAVL 351
G N ++ + + +LAEL+ P GM + Y+ + V +++ + L E+I +V V+
Sbjct: 296 GANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVM 355

Query: 352 LLFMG-LRSGLLMGLILLLTILGTFIVMKVLGIELQLISLGALIIALGMLVDNAIVVTEG 410
LF+ +R+ L+ + + + +LGTF ++ G + +++ +++A+G+LVD+AIVV E
Sbjct: 356 YLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVEN 415

Query: 411 ILIGLRRGKTR-LEAAKQIVAQTQWPLLGATVIAIIAFAPIGLSQNAAGEFCRSLFQVLM 469
+ + K EA ++ ++Q Q L+G ++ F P+ + G R ++
Sbjct: 416 VERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIV 475

Query: 470 ISLFISWITAITLTPFFCHLLFKDAPSDE-EAQDPYKGWF-------FSLYRASLTLALR 521
++ +S + A+ LTP C L K ++ E + + GWF + Y S+ L
Sbjct: 476 SAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILG 535

Query: 522 FRLVSILLVVAMLFSAVVGFGHIKNVFFPASNTPIFFVDIWMPEGTDIKATERFTADIEK 581
+L+ ++ VV F + + F P + +F I +P G + T++ +
Sbjct: 536 STGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTD 595

Query: 582 LMLSLNEQKDVGLKHLTTVIGQGSQR------FVL--PYQPEKGYPAFAQFIVEMQDLAA 633
L NE+ +V Q FV P++ G A+ ++ A
Sbjct: 596 YYLK-NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIH----RA 650

Query: 634 VKAYMPELETLLNQRFPQAQYRLKNMENGPSPAAKIEARFYGDNPEVLRALGDQAEAIFH 693
+ + P + + ++ + + + +A
Sbjct: 651 KMELGKIRDGFV---IPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQ 707

Query: 694 AEPSMDGIRHNWRNQVPLIRPQLENAQARETGISKQDLDNALLVNFSGKQIGLYRETSHL 753
S+ +R N + +++ +A+ G+S D++ + G + + + +
Sbjct: 708 HPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRV 767

Query: 754 LPIVARAPAEERLQADSLWKLQIWSSEHNTFVPATQVVSNFNTEWEN--PLVMRRDRMRM 811
+ +A A+ R+ + + KL + S + VP + + + W P + R + +
Sbjct: 768 KKLYVQADAKFRMLPEDVDKLYV-RSANGEMVPFSAFTT---SHWVYGSPRLERYNGLPS 823

Query: 812 LAVMADPKLGSD-ETADSVLRKVKDKVEAISLPAGYHLEWGGEFETAGEAQTAVFSSIPM 870
+ + + G+ A +++ + K LPAG +W G + + + +
Sbjct: 824 MEIQGEAAPGTSSGDAMALMENLASK-----LPAGIGYDWTGMSYQERLSGNQAPALVAI 878

Query: 871 GYLAMFLITVFLFNSVRQPLVIWFTVPLALIGVSAGLLLFDAPFSFMALLGLLSLSGMVI 930
++ +FL L+ S P+ + VPL ++GV LF+ ++GLL+ G+
Sbjct: 879 SFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSA 938

Query: 931 KNGIVLVDQIN-LELSEGKPAYFALVDSCVSRVRPVMMAAITTMLGMIPLISDAFFGS-- 987
KN I++V+ L EGK A + + R+RP++M ++ +LG++PL GS
Sbjct: 939 KNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGA 998

Query: 988 ---MAITIIFGLGFASLLTLIVLPVMYSLV 1014
+ I ++ G+ A+LL + +PV + ++
Sbjct: 999 QNAVGIGVMGGMVSATLLAIFFVPVFFVVI 1028



Score = 74.5 bits (183), Expect = 1e-15
Identities = 39/209 (18%), Positives = 95/209 (45%), Gaps = 13/209 (6%)

Query: 822 SDETADSVLRKVKDKVEAI--SLPAGYHLEWGGEFETAGEAQTAVFSSIPMGYLAMFL-- 877
+ A + +K K+ + P G ++ ++T Q ++ + + A+ L
Sbjct: 295 TGANALDTAKAIKAKLAELQPFFPQG--MKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVF 352

Query: 878 ITVFLF-NSVRQPLVIWFTVPLALIGVSAGLLLFDAPFSFMALLGLLSLSGMVIKNGIVL 936
+ ++LF ++R L+ VP+ L+G A L F + + + G++ G+++ + IV+
Sbjct: 353 LVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVV 412

Query: 937 VDQINLELSEGKPAYFALVDSCVSRVR-PVMMAAITTMLGMIPL-----ISDAFFGSMAI 990
V+ + + E K + +S+++ ++ A+ IP+ + A + +I
Sbjct: 413 VENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSI 472

Query: 991 TIIFGLGFASLLTLIVLPVMYSLVFNIKA 1019
TI+ + + L+ LI+ P + + + +
Sbjct: 473 TIVSAMALSVLVALILTPALCATLLKPVS 501


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS05905  RTXTOXIND  46  2e-07  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 45.6 bits (108), Expect = 2e-07
Identities = 27/178 (15%), Positives = 62/178 (34%), Gaps = 28/178 (15%)

Query: 104 EAEHELLAADFKRKTELLNRKLISQSEFDSTQAQLKSAKAALAAARDQLSYTRLTAPFSG 163
+ E++L+ FK + + T + LA ++ + + AP S
Sbjct: 286 KEEYQLVTQLFKNEI---------LDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSV 336

Query: 164 TIAKRLVDNH-QIVQANQGVLTL-QNNNLLDVSIQVPEAMAAGLKQYTDQAHFAAKVRFS 221
+ + V +V + ++ + ++ L+V+ V + A ++
Sbjct: 337 KVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFI-----NVGQNAIIKVE 391

Query: 222 AFPEQSF---DAKFKEYSTQVTPGTQ---AYEVVFSLPQP------QDIQLLPGMSAE 267
AFP + K K + + + V+ S+ + ++I L GM+
Sbjct: 392 AFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVT 449



Score = 31.0 bits (70), Expect = 0.007
Identities = 14/104 (13%), Positives = 36/104 (34%), Gaps = 2/104 (1%)

Query: 68 SGQLTELTLVEGQRVAQGSLLAQLDDRDAKNNLMTREAEHELLAADFKRK-TELLNRKLI 126
+ + E+ + EG+ V +G +L +L A+ + + ++ + R + +L
Sbjct: 104 NSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELN 163

Query: 127 SQSEFD-STQAQLKSAKAALAAARDQLSYTRLTAPFSGTIAKRL 169
E + ++ L + + + K L
Sbjct: 164 KLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKEL 207


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS05915  SECFTRNLCASE  31  0.012  Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 30.6 bits (69), Expect = 0.012
Identities = 23/114 (20%), Positives = 44/114 (38%), Gaps = 14/114 (12%)

Query: 169 MALAAVINLILDPLLIFGIGPFPRLEIQGAAIATLFAWLIALSLSGYLLIVRRKMLEWAA 228
AL AV+ L+ D LL G+ +L+ +A L + S++ +++
Sbjct: 178 FALGAVVALVHDVLLTVGLFAVLQLKFDLTTVAALLT-ITGYSINDTVVV---------- 226

Query: 229 FDIDRMRANWTKLAHIAQPAALMNLINP-LANAVIMAMLAHIDHSAVAAFGAGT 281
DR+R N K + + +N L+ V+ M + + +G
Sbjct: 227 --FDRLRENLIKYKTMPLRDVMNLSVNETLSRTVMTGMTTLLALVPMLIWGGDV 278


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS05930  PYOCINKILLER  30  0.030  Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 29.8 bits (66), Expect = 0.030
Identities = 15/39 (38%), Positives = 17/39 (43%), Gaps = 1/39 (2%)

Query: 512 KETGWNIHHIVERVKGGS-DEMDNLVLLHPNCHRQIHSG 549
IHH V GG M NLV + P H +IH G
Sbjct: 577 GRIKIEIHHKVRVADGGGVYNMGNLVAVTPKRHIEIHKG 615


13  AYJ58_RS06885  AYJ58_RS06905  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
AYJ58_RS06885  -1539737486009672146.180837  hypothetical protein
AYJ58_RS06890  -1539737486009671947.019583  prepilin-type N-terminal cleavage/methylation
AYJ58_RS06895  -1539737486009671648.312011  prepilin-type cleavage/methylation
AYJ58_RS06900  -1539737486009671748.857492  prepilin-type N-terminal cleavage/methylation
AYJ58_RS06905  -1539737486009671549.693160  type IV pilin protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS06890  RTXTOXINA  30  0.042  Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 29.5 bits (66), Expect = 0.042
Identities = 21/75 (28%), Positives = 30/75 (40%), Gaps = 14/75 (18%)

Query: 143 KGAKGNNIFNDAIVSCESLNLTGSSTIDGYDSRKGAYGDSFNN----DQGNSQLNKHGKG 198
G+K +IF+ A G I+G D YGD N+ G+ QL G G
Sbjct: 732 FGSKFTDIFHGA---------DGDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYG-GDG 781

Query: 199 NVTTVEPNADVTLSG 213
N + + L+G
Sbjct: 782 NDKLIGVAGNNYLNG 796


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS06895  BCTERIALGSPG  34  9e-05  Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 34.5 bits (79), Expect = 9e-05
Identities = 18/60 (30%), Positives = 32/60 (53%), Gaps = 6/60 (10%)

Query: 12 IKLTSRQTGFSLSELMIAMV-LGLIIMIAVINFF-----APLKTTVEESKRLENAADTLR 65
++ T +Q GF+L E+M+ +V +G++ + V N A + V + LENA D +
Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYK 60


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS06905  BCTERIALGSPG  39  2e-06  Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 38.7 bits (90), Expect = 2e-06
Identities = 18/49 (36%), Positives = 26/49 (53%), Gaps = 7/49 (14%)

Query: 7 QGFTLVELMVTVAIIGILGSLALPSY-------RDVMAREQLTAAANEL 48
+GFTL+E+MV + IIG+L SL +P+ A + A N L
Sbjct: 8 RGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENAL 56


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS06910  BCTERIALGSPG  47  7e-10  Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 47.2 bits (112), Expect = 7e-10
Identities = 17/64 (26%), Positives = 35/64 (54%)

Query: 5 EKGFTLIELMIVVAVIGILAAIAIPSFSEYLKQGRRFDAQQYLMTSVQALERNYSRQGKY 64
++GFTL+E+M+V+ +IG+LA++ +P+ ++ + A ++ AL+ Y
Sbjct: 7 QRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDNHHY 66

Query: 65 PAAQ 68
P
Sbjct: 67 PTTN 70


14  AYJ58_RS07735  AYJ58_RS07765  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
AYJ58_RS07735  -1539737486009671347.298050  sigma-54-dependent Fis family transcriptional
AYJ58_RS07740  -1539737486009671446.772367  HAMP domain-containing protein
AYJ58_RS07745  -1539737486009671647.357876  curli production assembly protein CsgE
AYJ58_RS07750  -1539737486009671647.658196  curli production assembly protein CsgF
AYJ58_RS07755  -1539737486009671647.904339  transporter
AYJ58_RS07760  -1539737486009671546.238130  TetR/AcrR family transcriptional regulator
AYJ58_RS07765  -1539737486009671446.148202  coniferyl aldehyde dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS07735  HTHFIS  424  e-147  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 424 bits (1092), Expect = e-147
Identities = 162/506 (32%), Positives = 247/506 (48%), Gaps = 41/506 (8%)

Query: 3 TILIVDDNHAICQALGLMLELNDYRVLICHSPEDALGLLTTQDVDLVIQDMNFTRDTTSG 62
TIL+ DD+ AI L L Y V I + + D DLV+ D+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVM-----PD 59

Query: 63 EEGRQLFYALRERQGDLPIILMTAWTQLETAVELVKAGAADYMGKPWDDAKVLNSITNLI 122
E L +++ + DLP+++M+A TA++ + GA DY+ KP+D +++ I +
Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119

Query: 123 ALYKLSRANHRLERVNHQRQVAIADADLCGIVFGSGAMQRCIDLALQLARSDVSVLITGP 182
A K + + + +V S AMQ + +L ++D++++ITG
Sbjct: 120 AEPKRRPSKLEDDSQDGM-----------PLVGRSAAMQEIYRVLARLMQTDLTLMITGE 168

Query: 183 NGAGKDKLADILHANSPLKNKPFIKVNVGALPMDLLEAELFGAEAGAFTGATKARIGRFE 242
+G GK+ +A LH +N PF+ +N+ A+P DL+E+ELFG E GAFTGA GRFE
Sbjct: 169 SGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFE 228

Query: 243 AADGGTLFLDEIGNLPLSGQVKLLRVLQTGEFERLGSHKTQKVKVRVISATNADLAQDIA 302
A+GGTLFLDEIG++P+ Q +LLRVLQ GE+ +G + VR+++ATN DL Q I
Sbjct: 229 QAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSIN 288

Query: 303 EGRFREDLFYRLNVIELALSPLNQRTDDILPLVQHFI-------GSDFSLSKPALQALLH 355
+G FREDL+YRLNV+ L L PL R +DI LV+HF+ + AL+ +
Sbjct: 289 QGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKA 348

Query: 356 HRWPGNVRELENACKRAVLLAKSHVLTEADFGLAPVVSAARSATSHISAASAPRRLDQRQ 415
H WPGNVRELEN +R L V+T + + +++
Sbjct: 349 HPWPGNVRELENLVRRLTALYPQDVITREIIEN------------ELRSEIPDSPIEKAA 396

Query: 416 FEPRSFEQRQSDPQPVYVPSSITANATFSDSSARDQTQSANEAIEVSREDIEAALKQHHG 475
S Q+ + + + +A E+ I AAL G
Sbjct: 397 ARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLA------EMEYPLILAALTATRG 450

Query: 476 VIARVAKALGLSRQALYRRMDKFGLD 501
+ A LGL+R L +++ + G+
Sbjct: 451 NQIKAADLLGLNRNTLRKKIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS07740  GPOSANCHOR  29  0.047  Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 29.3 bits (65), Expect = 0.047
Identities = 14/47 (29%), Positives = 27/47 (57%), Gaps = 4/47 (8%)

Query: 54 ADLEKTNAAVELALKNTEQLQLAAAMQARQRTEQIKAELQKLEQLKQ 100
ADLE + + ++ + L A+ +A++ Q++AE QKLE+ +
Sbjct: 298 ADLEHQSQVLNANRQSLRR-DLDASREAKK---QLEAEHQKLEEQNK 340


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS07755  cdtoxinb  29  0.019  Cytolethal distending toxin B signature.
		>cdtoxinb#Cytolethal distending toxin B signature.

Length = 269

Score = 28.8 bits (64), Expect = 0.019
Identities = 15/50 (30%), Positives = 23/50 (46%)

Query: 144 ETNTSTGGSGVEYYGIGASEMYREDQVTIYLRAVDVHTGKVMMSVSTSKR 193
T + G V S R QV IY AVD G+V +++ +++R
Sbjct: 74 GTLIPSPGIPVRELIWNLSTNSRPQQVYIYFSAVDALGGRVNLALVSNRR 123


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS07760  HTHTETR  73  3e-18  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 72.7 bits (178), Expect = 3e-18
Identities = 29/146 (19%), Positives = 52/146 (35%), Gaps = 7/146 (4%)

Query: 2 TDKRQAILDTALALFVSQGFHGTSTASIAKQAGVATGTLFHHFSSKEALMESLFLTIKQE 61
+ RQ ILD AL LF QG TS IAK AGV G ++ HF K L ++ +
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 62 FADALLLNT-HNGSDLKLNALQLWQSAIDWSLDNPVKQLFFQQYSMSPM------IAAKV 114
+ L D ++ ++ ++ ++L + + +
Sbjct: 70 IGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQA 129

Query: 115 RDQAMNSILGFISELIKQGQREGLIA 140
+ I + +K ++
Sbjct: 130 QRNLCLESYDRIEQTLKHCIEAKMLP 155


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS07765  PF07299  29  0.038  Fibronectin-binding protein (FBP)
		>PF07299#Fibronectin-binding protein (FBP)

Length = 219

Score = 28.7 bits (64), Expect = 0.038
Identities = 18/75 (24%), Positives = 34/75 (45%), Gaps = 7/75 (9%)

Query: 319 SATDEAIDSQHRKLATQLITHTSEDMLLMQEEIFGPLLPIISYDTLDEAIQYINHRARPL 378
+A D + + LA + I H E++ Q+E+ +L + + + + + IN
Sbjct: 32 TANDRGVIQALKSLAIEKIIHVFENLTDEQKELIDTVLTVQNREDAESFLLKINP----- 86

Query: 379 ALYVMSFDEPTQQKI 393
YV+ F E T Q +
Sbjct: 87 --YVIPFQEVTAQTL 99


15  AYJ58_RS08380  AYJ58_RS08415  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
AYJ58_RS08380  -1539737486009674239.445251  type IV pilus modification protein PilV
AYJ58_RS08385  -1539737486009674239.008130  prepilin-type N-terminal cleavage/methylation
AYJ58_RS08390  -1539737486009674239.073781  pilus assembly protein PilX
AYJ58_RS08395  -1539737486009674139.026137  rRNA (guanine-N1)-methyltransferase
AYJ58_RS08400  -1539737486009673439.869281  type IV pilin protein
AYJ58_RS08405  -1539737486009673040.300752  hypothetical protein
AYJ58_RS08410  -1539737486009673041.333960  prepilin-type N-terminal cleavage/methylation
AYJ58_RS08415  -1539737486009672941.158949  prepilin-type N-terminal cleavage/methylation
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS08380  BCTERIALGSPG  32  6e-04  Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 32.2 bits (73), Expect = 6e-04
Identities = 10/24 (41%), Positives = 18/24 (75%), Gaps = 2/24 (8%)

Query: 13 QRGFSLIEVLVALVIL--VIGLIG 34
QRGF+L+E++V +VI+ + L+
Sbjct: 7 QRGFTLLEIMVVIVIIGVLASLVV 30


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS08400  BCTERIALGSPG  55  7e-13  Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 55.3 bits (133), Expect = 7e-13
Identities = 21/63 (33%), Positives = 37/63 (58%)

Query: 1 MKQSEGFTLIELMIVVVIIGILASIAYPSYTAYLAKGARGDGIAAVMRIANLQEQFYLDN 60
+ GFTL+E+M+V+VIIG+LAS+ P+ K + ++ ++ + N + + LDN
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDN 63

Query: 61 REY 63
Y
Sbjct: 64 HHY 66


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS08410  BCTERIALGSPG  36  2e-05  Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 36.0 bits (83), Expect = 2e-05
Identities = 16/50 (32%), Positives = 28/50 (56%), Gaps = 2/50 (4%)

Query: 1 MRK--KKQGFTLVELMVTIAVAAILLTIGVPSMISLYEGIRANTEIERIQ 48
MR K++GFTL+E+MV I + +L ++ VP+++ E + I
Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIV 50


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS08415  BCTERIALGSPG  30  0.002  Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 30.2 bits (68), Expect = 0.002
Identities = 10/28 (35%), Positives = 20/28 (71%)

Query: 5 QKGFSLIELMTSLSILTILLTGGIPSLT 32
Q+GF+L+E+M + I+ +L + +P+L
Sbjct: 7 QRGFTLLEIMVVIVIIGVLASLVVPNLM 34


16  AYJ58_RS09330  AYJ58_RS09380  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
AYJ58_RS09330  -1539737486009671250.516466  RNA helicase
AYJ58_RS09335  -1539737486009671149.525920  PAS domain S-box protein
AYJ58_RS09340  -1539737486009671049.404953  HD domain-containing protein
AYJ58_RS09345  -1539737486009671149.105461  aminopeptidase P family protein
AYJ58_RS09350  -1539737486009671249.353631  hypothetical protein
AYJ58_RS09355  -1539737486009671349.833133  glycoside hydrolase
AYJ58_RS09360  -1539737486009671149.548694  FKBP-type peptidyl-prolyl cis-trans isomerase
AYJ58_RS09365  -1539737486009671049.899699  short-chain fatty acid transporter
AYJ58_RS09370  -1539737486009671447.535842  HPP family protein
AYJ58_RS09375  -1539737486009671448.115550  TetR/AcrR family transcriptional regulator
AYJ58_RS09380  -1539737486009671847.137671  DUF1294 domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS09340  SECA  33  0.003  SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 32.9 bits (75), Expect = 0.003
Identities = 37/175 (21%), Positives = 65/175 (37%), Gaps = 46/175 (26%)

Query: 220 SQVVYPVEQRRKRELLSELIGK-KNWQQVLVFTATRDAADTLVKELNLDGIPSEVVHGEK 278
+VY E + + ++ ++ + Q VLV T + + ++ + EL GI V++ +
Sbjct: 424 PDLVYMTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSELVSNELTKAGIKHNVLNAKF 483

Query: 279 GQGSRRRALREFVAGDVR---VLVATEVAARGLDI---------------PSLEYVVNYD 320
A VA V +AT +A RG DI P+ E +
Sbjct: 484 ---HANEA--AIVAQAGYPAAVTIATNMAGRGTDIVLGGSWQAEVAALENPTAEQIEKIK 538

Query: 321 LPFLAED---------YV-----H---RI-----GRTGRAGKTGVAISFVSREEE 353
+ ++ H RI GR+GR G G + ++S E+
Sbjct: 539 ADWQVRHDAVLEAGGLHIIGTERHESRRIDNQLRGRSGRQGDAGSSRFYLSMEDA 593


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS09345  FLAGELLIN  30  0.019  Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 30.4 bits (68), Expect = 0.019
Identities = 14/87 (16%), Positives = 36/87 (41%), Gaps = 4/87 (4%)

Query: 282 QLSSAMEEMSSTITEVAQNTHLTSTSINTAYDLCLKSSANMKANTQKVEQLAKSVADAAN 341
+++ +T+ ++N + + T + + N Q+V +L+ + N
Sbjct: 48 AIANRFTSNIKGLTQASRNANDGISIAQTTE----GALNEINNNLQRVRELSVQATNGTN 103

Query: 342 NAHQLNKEAEQVANAMGEIDSIAEQTN 368
+ L +++ + EID ++ QT
Sbjct: 104 SDSDLKSIQDEIQQRLEEIDRVSNQTQ 130


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS09365  MICOLLPTASE  47  4e-07  Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 46.6 bits (110), Expect = 4e-07
Identities = 32/168 (19%), Positives = 59/168 (35%), Gaps = 11/168 (6%)

Query: 542 WEIDADNGDILNAMHEGLGHGEGTTPPVNKAPIANAGADVNVTGPADVTLNGSGSRDPEN 601
++D + + + + G+ T VNK P A +D +V ++ +G+ S+D +
Sbjct: 744 HKVDGNGNYVYDVVFHGMNTDTNTDVHVNKEPKAVIKSDSSVIVEEEINFDGTESKDEDG 803

Query: 602 GALTYLWTQVSDPTIAIANADMANAAIQLAATQTDVAYSFSLKVTDPEGLSATDSVTVTN 661
Y W + + +N A Y L VTD G T+S +
Sbjct: 804 EIKAYEWD--------FGDGEKSNEAKATHKYNKTGEYEVKLTVTDNNGGINTESKKIKV 855

Query: 662 KADTPNQAPVVSVAAT---ATVEAGKTVSIVASASDADGDALTYAWTV 706
D P + S + K+ +V + + Y + V
Sbjct: 856 VEDKPVEVINESEPNNDFEKANQIAKSNMLVKGTLSEEDYSDKYYFDV 903


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS09370  INFPOTNTIATR  137  3e-43  Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 137 bits (345), Expect = 3e-43
Identities = 65/132 (49%), Positives = 86/132 (65%), Gaps = 2/132 (1%)

Query: 25 KAAQENIRLGNEFLAQNKTKEGVITTASGLQYQVLTQGDGTVHPKASDTVTVHYHGTLID 84
K A+EN G+ FL+ NK+K G++ SGLQY+++ G G P SDTVTV Y GTLID
Sbjct: 99 KKAEENKAKGDAFLSANKSKPGIVVLPSGLQYKIIDAGTGA-KPGKSDTVTVEYTGTLID 157

Query: 85 GTVFDSSVDRGEPIAFPLNRVIKGWTEGVQLMVVGDKVRFFIPSELAYGNRST-GKIGGG 143
GTVFDS+ G+P F +++VI GWTE +QLM G F+P++LAYG RS G IG
Sbjct: 158 GTVFDSTEKAGKPATFQVSQVIPGWTEALQLMPAGSTWEVFVPADLAYGPRSVGGPIGPN 217

Query: 144 SVLIFDVELLKI 155
LIF + L+ +
Sbjct: 218 ETLIFKIHLISV 229


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS09385  HTHTETR  59  5e-13  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 58.9 bits (142), Expect = 5e-13
Identities = 20/112 (17%), Positives = 43/112 (38%), Gaps = 5/112 (4%)

Query: 11 SSDKRQQLVTTAFKLFYFQSVHGVGINQILQESAIAKKTLYHHFASKDELVEAVVRYRDE 70
+ + RQ ++ A +LF Q V + +I + + + + +Y HF K +L + +
Sbjct: 9 AQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSES 68

Query: 71 IFYQWLSERVNALPAG-----REGICGLFRALDDWFNQKVPQLCEFRGCFFI 117
+ E P RE + + + +++ F C F+
Sbjct: 69 NIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS09390  PREPILNPTASE  27  0.040  Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 26.7 bits (59), Expect = 0.040
Identities = 23/101 (22%), Positives = 34/101 (33%), Gaps = 12/101 (11%)

Query: 37 LMPLLLVL-IFLAGLIGAAQQHMLP-----YGIAGMYLTLSLLTFIAYAIDKSAAKRGKW 90
L+P L L + GL+ + G YL L L + + GK
Sbjct: 156 LLPDQLTLPLLWGGLLFNLLGGFVSLGDAVIGAMAGYLVLWSLYWAFKLL------TGKE 209

Query: 91 RTKESTLHLLALMGGWPGALFAQNVLRHKSVKASFRNVFWL 131
LLA +G W G VL S+ +F + +
Sbjct: 210 GMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGIGLI 250


17  AYJ58_RS09880  AYJ58_RS09940  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
AYJ58_RS09880  -1539737486009671448.790800  TetR/AcrR family transcriptional regulator
AYJ58_RS09885  -1539737486009671449.226641  DUF2461 domain-containing protein
AYJ58_RS09890  -1539737486009671548.888889  TonB-dependent receptor
AYJ58_RS09895  -1539737486009671548.670959  PhnA domain protein
AYJ58_RS09900  -1539737486009671748.430962  MFS transporter
AYJ58_RS09905  -1539737486009671648.247423  MarR family transcriptional regulator
AYJ58_RS09910  -1539737486009671748.013333  succinylglutamate desuccinylase
AYJ58_RS09915  -1539737486009671747.949608  methyl-accepting chemotaxis protein
AYJ58_RS09920  -1539737486009671547.477922  FAA hydrolase family protein
AYJ58_RS09925  -1539737486009671648.174603  hypothetical protein
AYJ58_RS09930  -1539737486009671548.663854  beta-N-acetylhexosaminidase
AYJ58_RS09935  -1539737486009671648.542641  ROK family protein
AYJ58_RS09940  -1539737486009671648.429952  MFS transporter
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS09880  HTHTETR  59  4e-13  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 59.3 bits (143), Expect = 4e-13
Identities = 29/163 (17%), Positives = 62/163 (38%), Gaps = 5/163 (3%)

Query: 8 DRQEKLI-LAMELFWQKGFAETSISDLVGHLGINRFSLYNSFGDKQNLYRECLRFYLDNY 66
+ ++ ++ +A+ LF Q+G + TS+ ++ G+ R ++Y F DK +L+ E N
Sbjct: 11 ETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNI 70

Query: 67 SFGASDTLLHEKAGLAE-IEAYLARFVALQREQKYGCFMQNAVLEKSL--DDESVLQECQ 123
+ + L + ++ + + K + +V+Q+ Q
Sbjct: 71 GELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQ 130

Query: 124 RLFC-RLQASFTQVLRDCQARGELLPHVQPHQVAAFLVLQLQG 165
R C Q L+ C L + + A + + G
Sbjct: 131 RNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISG 173


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS09885  FERRIBNDNGPP  28  0.046  Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 27.6 bits (61), Expect = 0.046
Identities = 13/46 (28%), Positives = 22/46 (47%), Gaps = 2/46 (4%)

Query: 50 PALQFIEQMQPSILALSPRLTAVPKKVGGSLMRPQRDSRFSKDKTP 95
P L+ + +M+PS + S P+ + + + P R FS K P
Sbjct: 87 PNLELLTEMKPSFMVWSAGYGPSPEML--ARIAPGRGFNFSDGKQP 130


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS09900  TCRTETB  29  0.029  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 29.5 bits (66), Expect = 0.029
Identities = 28/165 (16%), Positives = 60/165 (36%), Gaps = 4/165 (2%)

Query: 3 SPQTEKANLLLLGLAFILIGLNLRPILASVGPLLPSIQEDISLSFTLASMLTLLPVLAMG 62
S + N +L+ L + L ++ +V LP I D + + + +L
Sbjct: 6 SQSNLRHNQILIWLCILSFFSVLNEMVLNVS--LPDIANDFNKPPASTNWVNTAFMLTFS 63

Query: 63 LGCFAGFSIAKRLGFNTVMTGSLILLIIATAMRFWAMD-ANWLICSALLAGVGIA-LIQT 120
+G ++ +LG ++ +I+ + + F + LI + + G G A
Sbjct: 64 IGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPAL 123

Query: 121 IMPAMIKLNFGERVPFMMGLYVTAIMGGAALAASSAPFIGMNLGW 165
+M + + E GL + + G + + I + W
Sbjct: 124 VMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHW 168


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS09915  BACINVASINB  33  0.005  Salmonella/Shigella invasin protein B signature.
		>BACINVASINB#Salmonella/Shigella invasin protein B signature.

Length = 593

Score = 32.8 bits (74), Expect = 0.005
Identities = 49/223 (21%), Positives = 82/223 (36%), Gaps = 21/223 (9%)

Query: 383 IDNMNTLSDDLAKLLTNISQNAHSLDSSA---TQTNEQGQRIANAATAQISRVDETKALA 439
D ++ D K LT SLD + Q ++ AT +D+
Sbjct: 150 TDTAKSVYDAATKKLTQAQNKLQSLDPADPGYAQAEAAVEQAGKEATEAKEALDKATDAT 209

Query: 440 EQMFTSSSLVTEQASESAKQISQASKYG--NQVKQIADDNRTKIAELSARLSSSVEVMSR 497
+ T + E+A + + NQV Q DN + +A L+ ++ +E++ +
Sbjct: 210 VKAGTDAKAKAEKADNILTKFQGTANAASQNQVSQGEQDNLSNVARLTMLMAMFIEIVGK 269

Query: 498 LSIQSDNIGSILTTISSIAEQTNLLALNAAIEAARAGEHGRGFAVVADEVRSLASRTQAA 557
+ +S N LAL A++ R E + A +E R A T
Sbjct: 270 NTEES---------------LQNDLALFNALQEGRQAEMEKKSAEFQEETRK-AEETNRI 313

Query: 558 TAEIQTMITALQKETSMAANAITTGQNQANECVGQSQNLQDAI 600
I ++ AL S+ A T G + A VG + + D I
Sbjct: 314 MGCIGKVLGALLTIVSVVAAVFTGGASLALAAVGLAVMVADEI 356


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS09920  PF07824  28  0.011  Type III secretion chaperone
		>PF07824#Type III secretion chaperone

Length = 120

Score = 28.3 bits (63), Expect = 0.011
Identities = 13/54 (24%), Positives = 22/54 (40%)

Query: 80 AIGLDLTKRDLQSKLKAKGLPWERAKAFDGAALFSPFVPIDDAEAPLHFTLSIN 133
A+G+ D Q+ + + K D L PF + + L + LS+N
Sbjct: 11 ALGIPSIDTDDQAIMLDDDVLIYIEKEGDSINLLCPFCALPENINDLIYALSLN 64


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS09935  PERTACTIN  28  0.045  Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 28.1 bits (62), Expect = 0.045
Identities = 10/28 (35%), Positives = 15/28 (53%)

Query: 9 GGTKLMLAQVEGKTLLDTWRYPVPADGN 36
LA +GK + T+RY + A+GN
Sbjct: 530 SAATFTLANKDGKVDIGTYRYRLAANGN 557


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS09940  TCRTETB  30  0.020  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 29.8 bits (67), Expect = 0.020
Identities = 22/110 (20%), Positives = 46/110 (41%), Gaps = 5/110 (4%)

Query: 48 AFFIAYGVTSIPAGILVDKYGEKRVIILAFSLAFIGAVT-FVCLPTFSIAMFSLFCIGTG 106
AF + + + + G L D+ G KR+++ + G+V FV FS+ + + F G G
Sbjct: 57 AFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAG 116

Query: 107 ----MSLLQVAVNPLLRQAAGPEHFAFFSVLAQLAFGGAATLAPLVYQHF 152
+L+ V V + + + F + + G + ++ +
Sbjct: 117 AAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYI 166


18  AYJ58_RS10500  AYJ58_RS10530  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
AYJ58_RS10500  -1539737486009672151.180837  TetR/AcrR family transcriptional regulator
AYJ58_RS10505  -1539737486009671950.316122  hemolysin D
AYJ58_RS10510  -1539737486009671649.168101  ABC transporter ATP-binding protein
AYJ58_RS10515  -1539737486009671448.129881  ABC transporter permease
AYJ58_RS10520  -1539737486009671247.691597  4-hydroxybutyrate CoA-transferase
AYJ58_RS10525  -1539737486009671546.573751  enhanced serine sensitivity protein SseB
AYJ58_RS10530  -1539737486009671645.900094  N-acetyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS10500  HTHTETR  74  2e-18  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 74.3 bits (182), Expect = 2e-18
Identities = 29/162 (17%), Positives = 59/162 (36%), Gaps = 6/162 (3%)

Query: 31 SDARQRLITAALSLFSHRSYPTVSTREIAREAEVDAALIRYYFGSKAGLFEQMVRETLEP 90
+ RQ ++ AL LFS + + S EIA+ A V I ++F K+ LF ++ +
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 91 VFTRLREISAAEAPNN---VGEIMQTYYRVMAPNPGLPRLIIRVLQEGDGSEPYRIILSV 147
+ E A + + EI+ L+ + + + ++
Sbjct: 70 IGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQA 129

Query: 148 FEQVLTLSRQWLESTL---VNSGLLKEGVDPDLARLSFVSLM 186
+ S +E TL + + +L + A + +
Sbjct: 130 QRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYI 171


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS10505  RTXTOXIND  56  9e-11  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 55.6 bits (134), Expect = 9e-11
Identities = 31/176 (17%), Positives = 61/176 (34%), Gaps = 17/176 (9%)

Query: 51 TVERDRLTLTAPVGELITRVNVVEGQQVKAGEVLLTLDSTSANARLALRQAELEQAKAKL 110
T + ++ + V EG+ V+ G+VLL L + A A Q+ L Q A+L
Sbjct: 91 THSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQ--ARL 148

Query: 111 SEAVTGARLEDIERAKAVLDGANASVKEAQRAFERTNRLYATKVLSQADLDTARAARDTS 170
+ IE + + F+ + +VL L + T
Sbjct: 149 EQTRYQILSRSIEL-----NKLPELKLPDEPYFQNVS---EEEVLRLTSL--IKEQFSTW 198

Query: 171 LAKQAEAEQSLRLLENGTRSEQLEQAKAAVAAASASVAIEQKALADLSLVAARDSV 226
++ + E +L + + A + +E+ L D S + + ++
Sbjct: 199 QNQKYQKELNLD-----KKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAI 249



Score = 47.9 bits (114), Expect = 3e-08
Identities = 30/232 (12%), Positives = 77/232 (33%), Gaps = 15/232 (6%)

Query: 73 VEGQQVKAGEVLLTLDSTSANARLALRQAELEQAKAKLSEAVTGARLEDIERAKAVLDGA 132
V ++V L+ ++ + ++ L++ +A+ + AR+ E V
Sbjct: 179 VSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVL--ARINRYENLSRVEKSR 236

Query: 133 NASVKE-AQRAFERTNRLYATK---VLSQADLDTARAARDTSLAKQAEAEQSLRLLENGT 188
+ + + + V + +L ++ + ++ A++ +L+
Sbjct: 237 LDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLF 296

Query: 189 RSEQLEQAKAAVAAASASVAIEQKALADLS---LVAARDSVVDTLP-WRVGDRIAAGTQL 244
++E L++ + K + A V L G + L
Sbjct: 297 KNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETL 356

Query: 245 IGLLASEHPY-VRVYLPATWLDRVKAGDKVNIRVDG----REMPIAGTVRNI 291
+ ++ + V + + + G I+V+ R + G V+NI
Sbjct: 357 MVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNI 408


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS10510  adhesinb  29  0.017  Adhesin B signature.
		>adhesinb#Adhesin B signature.

Length = 310

Score = 29.4 bits (66), Expect = 0.017
Identities = 15/87 (17%), Positives = 26/87 (29%), Gaps = 12/87 (13%)

Query: 220 SPQQLMAAMGARVVEISGDDL------------RNLKQSLISESAVLSAAQIGSRLRVLV 267
P+ + A ++ +G +L N K+ + +S L
Sbjct: 73 LPEDVKKTSQADLIFYNGINLETGGNAWFTKLVENAKKKENKDYYAVSEGVDVIYLEGQS 132

Query: 268 RSDIEDPLAWLKPRVASRTMEEVRASL 294
EDP AWL + + L
Sbjct: 133 EKGKEDPHAWLNLENGIIYAQNIAKRL 159


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS10515  ABC2TRNSPORT  38  2e-05  ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 38.4 bits (89), Expect = 2e-05
Identities = 40/160 (25%), Positives = 70/160 (43%), Gaps = 11/160 (6%)

Query: 191 GVILTMTMVMFT----SAAIVREREQGNMEFLITTPVRPLELMLGKIVPYVMVGFVQVTI 246
G++ T M T AA R Q E ++ T +R +++LG++ +
Sbjct: 72 GMVATSAMTAATFETIYAAFGRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAG 131

Query: 247 ILSAGHLLFDVP---IRGGLDSIALAAMLFICASLTLGLVISTIAKTQLQSMQMTVFILL 303
I L + L IAL + F +LG+V++ +A + + ++
Sbjct: 132 IGVVAAALGYTQWLSLLYALPVIALTGLAFA----SLGMVVTALAPSYDYFIFYQTLVIT 187

Query: 304 PSILLSGFMFPYEAMPIAAQWIAEALPATHFMRMSRAIVL 343
P + LSG +FP + +PI Q A LP +H + + R I+L
Sbjct: 188 PILFLSGAVFPVDQLPIVFQTAARFLPLSHSIDLIRPIML 227


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS10530  SACTRNSFRASE  28  0.018  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 27.6 bits (61), Expect = 0.018
Identities = 15/50 (30%), Positives = 24/50 (48%), Gaps = 3/50 (6%)

Query: 87 VTKQARGQGVASKLVKEIISLAHKIGVKNLYLQTEDLTGG---LYLQHGF 133
V K R +GV + L+ + I A + L L+T+D+ Y +H F
Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146


19  AYJ58_RS10900  AYJ58_RS10940  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
AYJ58_RS10900  -1539737486009672546.307852  ATP-dependent protease ATP-binding subunit ClpX
AYJ58_RS10905  -1539737486009672146.439394  endopeptidase La
AYJ58_RS10910  -1539737486009672146.580484  DNA-binding protein HU-beta
AYJ58_RS10915  -1539737486009671946.708464  peptidylprolyl isomerase
AYJ58_RS10920  -1539737486009671747.186390  transporter
AYJ58_RS10930  -1539737486009671448.081044  enoyl-[acyl-carrier-protein] reductase FabV
AYJ58_RS10935  -1539737486009671348.212649  peptide ABC transporter ATP-binding protein
AYJ58_RS10940  -1539737486009671448.609169  peptide ABC transporter ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS10910  HTHFIS  30  0.017  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.2 bits (68), Expect = 0.017
Identities = 15/70 (21%), Positives = 29/70 (41%), Gaps = 13/70 (18%)

Query: 64 KLPTPHELRAHLDDYVIGQDRAKKVLSVAVYNHYKRLKNASPKDGVELGKSNILLIGPTG 123
+ P+ E + ++G+ S A+ Y+ L D +++ G +G
Sbjct: 124 RRPSKLEDDSQDGMPLVGR-------SAAMQEIYRVLARLMQTD------LTLMITGESG 170

Query: 124 SGKTLLAETL 133
+GK L+A L
Sbjct: 171 TGKELVARAL 180


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS10915  HTHFIS  34  0.002  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 34.4 bits (79), Expect = 0.002
Identities = 45/211 (21%), Positives = 76/211 (36%), Gaps = 37/211 (17%)

Query: 262 NMPAEAKEKALAELNKLRMMSP---MSAEATV---VRSY----VDWMTSVPWSQRSKIKR 311
MP E L + K R P MSA+ T +++ D++ P+ I
Sbjct: 56 VMPDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPK-PFDLTELIGI 114

Query: 312 D---------LAKAQEVLDTDHFGLEKVKERILEYLAVQSRVRQLKGPILCLVGPPGVGK 362
E D L + E V +R+ Q ++ + G G GK
Sbjct: 115 IGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLM-ITGESGTGK 173

Query: 363 TSLGQSIAKATGRK---YVRVALGGVRD---EAEIRGHRRTYIGSMPGKVIQKMAKVGVK 416
+ +++ R+ +V + + + E+E+ GH + G+ G + +
Sbjct: 174 ELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEK---GAFTGAQTRSTGRFEQA 230

Query: 417 N--PLFLLDEIDKMSSDMRGDPASALLEVLD 445
LFL DEI M D + + LL VL
Sbjct: 231 EGGTLFL-DEIGDMPMDAQ----TRLLRVLQ 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS10920  DNABINDINGHU  119  4e-39  Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 119 bits (300), Expect = 4e-39
Identities = 53/88 (60%), Positives = 69/88 (78%)

Query: 2 NKSELIEKIASGADISKAAAGRALDSFIAAVTEGLKEGDKISLVGFGTFEVRERAERTGR 61
NK +LI K+A +++K + A+D+ +AV+ L +G+K+ L+GFG FEVRERA R GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 62 NPQTGEEIKIAAAKIPAFKAGKALKDAV 89
NPQTGEEIKI A+K+PAFKAGKALKDAV
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS10940  HTHFIS  29  0.022  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.0 bits (65), Expect = 0.022
Identities = 13/28 (46%), Positives = 16/28 (57%)

Query: 42 TLAIVGEAGSGKSTLARILVGAEPRSGG 69
TL I GE+G+GK +AR L R G
Sbjct: 162 TLMITGESGTGKELVARALHDYGKRRNG 189


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS10945  HTHFIS  30  0.011  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.2 bits (68), Expect = 0.011
Identities = 9/16 (56%), Positives = 14/16 (87%)

Query: 38 LVGESGSGRSLLARAI 53
+ GESG+G+ L+ARA+
Sbjct: 165 ITGESGTGKELVARAL 180


20  AYJ58_RS11975  AYJ58_RS12005  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
AYJ58_RS11975  -1539737486009672045.427526  Ppx/GppA family phosphatase
AYJ58_RS11980  -1539737486009672246.263937  molecular chaperone
AYJ58_RS11985  -1539737486009671846.797505  hypothetical protein
AYJ58_RS11990  -1539737486009671646.930033  cystathionine beta-lyase
AYJ58_RS11995  -1539737486009671747.770609  proteobacterial dedicated sortase system
AYJ58_RS12000  -1539737486009671648.225989  proteobacterial dedicated sortase system
AYJ58_RS12005  -1539737486009671548.466864  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS12000  SHAPEPROTEIN  31     0.010    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 30.9 bits (70), Expect = 0.010
Identities = 16/36 (44%), Positives = 23/36 (63%)

Query: 122 NLVIDIGGGSTEVVIGKKNTPTQLSSLRCGCVSFNE 157
++V+DIGGG+TEV + N SS+R G F+E
Sbjct: 161 SMVVDIGGGTTEVAVISLNGVVYSSSVRIGGDRFDE 196


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS12010  SHAPEPROTEIN  41     6e-06    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 41.3 bits (97), Expect = 6e-06
Identities = 24/81 (29%), Positives = 42/81 (51%), Gaps = 11/81 (13%)

Query: 192 AAKRAGFVDVDFLFEPLAAGMDYEATLTDNKTVLVVDVGGGTTDCSVVKMGPAHKQKADR 251
+A+ AG +V + EP+AA + +++ +VVD+GGGTT+ +V+ +
Sbjct: 129 SAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLN--------- 179

Query: 252 SEDFLGHSGQRIGGNDLDIAL 272
+ S RIGG+ D A+
Sbjct: 180 --GVVYSSSVRIGGDRFDEAI 198


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS12030  HTHFIS        64     4e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 64.1 bits (156), Expect = 4e-14
Identities = 29/136 (21%), Positives = 53/136 (38%), Gaps = 7/136 (5%)

Query: 3 RIAIVEDEAAIRENYKDVLQQHGYSVQTYADRPSAMLAFNTRLPDLAIIDIGLGNEIDGG 62
I + +D+AAIR L + GY V+ ++ + DL + D+ + +E
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE--NA 62

Query: 63 FMLCQSLRAMSSTLPIIFLTARDSDFDTVCGLRLGADDYLSKEVSFPHLTARLAALFRRS 122
F L ++ LP++ ++A+++ + GA DYL K L
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGI-----IGR 117

Query: 123 ELAASQTAQENLLERG 138
LA + L +
Sbjct: 118 ALAEPKRRPSKLEDDS 133


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS12035  OMPADOMAIN    65     2e-14    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 65.3 bits (159), Expect = 2e-14
Identities = 27/92 (29%), Positives = 47/92 (51%), Gaps = 2/92 (2%)

Query: 152 ELALGLNVQFRTGSYEVESHFLPQLDDVAEVM-NLSP-ELNLELKGYADRRGDVSYNQAL 209
L +V F ++ LD + + NL P + ++ + GY DR G +YNQ L
Sbjct: 214 HFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGL 273

Query: 210 SEQRLLEVRGYLMKQGVAAERMTTQAFGALSP 241
SE+R V YL+ +G+ A++++ + G +P
Sbjct: 274 SERRAQSVVDYLISKGIPADKISARGMGESNP 305


21  AYJ58_RS13695  AYJ58_RS13735  N        Y        N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias            CDNBias  %GCBias    Product
AYJ58_RS13695  -153973748600967  24       41.890368  GGDEF domain-containing protein
AYJ58_RS13700  -153973748600967  17       43.070554  chemotaxis response regulator protein-glutamate
AYJ58_RS13705  -153973748600967  13       44.278359  chemoreceptor glutamine deamidase CheD
AYJ58_RS13710  -153973748600967  13       44.503460  chemotaxis protein CheR
AYJ58_RS13715  -153973748600967  14       45.011049  PAS domain-containing protein
AYJ58_RS13720  -153973748600967  13       46.640258  chemotaxis protein CheW
AYJ58_RS13725  -153973748600967  14       46.555355  chemotaxis protein CheA
AYJ58_RS13730  -153973748600967  12       46.393191  response regulator
AYJ58_RS13735  -153973748600967  11       46.120623  fused response regulator/phosphatase
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS13725  HTHFIS        72     1e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 72.2 bits (177), Expect = 1e-15
Identities = 28/164 (17%), Positives = 65/164 (39%), Gaps = 6/164 (3%)

Query: 255 KVLLVDDQQSMVDYFSSLLRSHGLMVKGLSSAEQVLPALEQFEPDLFIFDLYMPEVNGLE 314
+L+ DD ++ + L G V+ S+A + + + DL + D+ MP+ N +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 315 LAKMIRQLDKYTSSPILVLSSDDTMQNKVSIIQAGSDDLISKQTAPSLFVA---QVISRA 371
L I++ P+LV+S+ +T + + G+ D + K + + + ++
Sbjct: 65 LLPRIKKARPDL--PVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 372 QRGHDIRSSASRDSLTELLNHTQILVAARRCYNVAKRINSQVCI 415
+R + L+ + + R + + + I
Sbjct: 123 KRRPS-KLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMI 165



Score = 54.1 bits (130), Expect = 9e-10
Identities = 29/135 (21%), Positives = 59/135 (43%), Gaps = 2/135 (1%)

Query: 131 HIAIIEDDNNVGVMITKQLREFGFSVQHFLDFTSFLVVQNESPFDLILLDLILPDWTEEA 190
I + +DD + ++ + L G+ V+ + + DL++ D+++PD
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 191 LFEAATEFEKNNTRVFVLSSRGDFDMRLLAIRANVSEYFVKPAETTLLVRKIHQSLKMSE 250
L + + + V V+S++ F + A +Y KP + T L+ I ++L +
Sbjct: 65 LLPRIKK-ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 251 KQPLKVLLVDDQQSM 265
++P L D Q M
Sbjct: 124 RRP-SKLEDDSQDGM 137


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS13730  HTHFIS        69     5e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 69.1 bits (169), Expect = 5e-15
Identities = 30/118 (25%), Positives = 52/118 (44%), Gaps = 6/118 (5%)

Query: 3 IKVLVVDDSALMRSLLGKMIEADPELSLVGLAADAYEAKDLVNQFRPDVITLDIEMPKVD 62
+LV DD A +R++L + + V + ++A + D++ D+ MP +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYD--VRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 63 GLTFLDRLMKARPTAVVMISSLTEQG-ADATFNALALGAVDFIPKPKLDSPQGIHDYQ 119
L R+ KARP V++ ++ Q A GA D++PKP D + I
Sbjct: 62 AFDLLPRIKKARPDLPVLV--MSAQNTFMTAIKASEKGAYDYLPKP-FDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS13755  PF06580       42     5e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 42.2 bits (99), Expect = 5e-06
Identities = 24/151 (15%), Positives = 51/151 (33%), Gaps = 52/151 (34%)

Query: 435 EIDKGMIEKLVDPLT--HLVRNSLDHGIEKPEKRLAAGKSEAGVLSLKASQRGGNIVIAV 492
+I+ +++ V P+ LV N + HGI + + G + LK ++ G + + V
Sbjct: 245 QINPAIMDVQVPPMLVQTLVENGIKHGIAQ--------LPQGGKILLKGTKDNGTVTLEV 296

Query: 493 HDNGAGLNRERIIQKARENGLQVADNSSDKQVWQLIFAAGFSTALEVTDVSGRGVGMDVV 552
+ G+ + G G+ V
Sbjct: 297 ENTGSLALKNTK--------------------------------------ESTGTGLQNV 318

Query: 553 RRNIEALGG---RIDIESVEGQGSTFEIQLP 580
R ++ L G +I + +G+ + + +P
Sbjct: 319 RERLQMLYGTEAQIKLSEKQGKVNAM-VLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS13760  HTHFIS        86     5e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 86.0 bits (213), Expect = 5e-23
Identities = 29/122 (23%), Positives = 52/122 (42%), Gaps = 3/122 (2%)

Query: 1 MSK-KILVVDDSAAIRQMVEATLKSVSYQVVLAKDGREALDICNGQRFDFILTDQNMPRM 59
M+ ILV DD AAIR ++ L Y V + + D ++TD MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 60 DGLTLIKSLRAMSAFMRTPIIMLTTEAGDDMKAQGKAAGATGWMVKPFDPQKLLAITAKV 119
+ L+ ++ P+++++ + + GA ++ KPFD +L+ I +
Sbjct: 61 NAFDLLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 120 LG 121
L
Sbjct: 119 LA 120


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS13765  HTHFIS        65     3e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 65.2 bits (159), Expect = 3e-13
Identities = 28/104 (26%), Positives = 48/104 (46%)

Query: 11 ILVIDDDAIASQRISDFIHSKGYNVIVCSDLEEGLFEITQNTVDLILINYWLKDGTALAL 70
ILV DDDA ++ + GY+V + S+ I DL++ + + D A L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 71 LDNLNKDKKETPVIVMSETKENQSVLACFRMGVLDFVVKPINVE 114
L + K + + PV+VMS + + G D++ KP ++
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLT 109


22  AYJ58_RS14280  AYJ58_RS14300  N        Y        N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias            CDNBias  %GCBias    Product
AYJ58_RS14280  -153973748600967  17       46.619217  phosphoenolpyruvate--protein phosphotransferase
AYJ58_RS14285  -153973748600967  17       45.140248  PTS glucose transporter subunit IIA
AYJ58_RS14290  -153973748600967  17       44.986807  MFS transporter
AYJ58_RS14295  -153973748600967  17       44.984802  methyl-accepting chemotaxis protein
AYJ58_RS14300  -153973748600967  15       45.069313  porin
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS14295  PHPHTRNFRASE  535    0.0      Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 535 bits (1380), Expect = 0.0
Identities = 191/555 (34%), Positives = 309/555 (55%), Gaps = 5/555 (0%)

Query: 3 ITGIIVSSGIAFGQALHLIHTEHHLDYRPIPLSKIPQQQGKFAKALQELQAQLTH--SQA 60
ITGI SSG+A +A IH E ++D ++ + + K AL++ + +L Q
Sbjct: 5 ITGIAASSGVAIAKAF--IHLEPNVDIEKTSITDVSTEIEKLTAALEKSKEELRAIKDQT 62

Query: 61 ALDSDSENYQLIEADLLLLEDDELIEQVNDAIRTLQLSASVAVERIFAHQANELQSLDDP 120
++ ++ A LL+L+D EL++ + I Q++A A++ + + +S+D+
Sbjct: 63 EASMGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDNE 122

Query: 121 YLANRAQDVRCLGQRVVAAINGHLAQGLEKLDRPTILLAQDLTPAEFALLPRENLCGVVL 180
Y+ RA D+R + +RV+ + G L + T+++A+DLTP++ A L ++ + G
Sbjct: 123 YMKERAADIRDVSKRVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFAT 182

Query: 181 KTGGLTSHTAILARAAGIPAILSCQFDADSIPNGTPLVLDALNGELCVNPNPDQQARLTV 240
GG TSH+AI++R+ IPA++ + + I +G +++D + G + VNP ++
Sbjct: 183 DIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYEE 242

Query: 241 TFHHEQARRAALQTYKDVPAQTQDGHTVGLMANVGNLNDITHVSDVGADGIGLFRTEFML 300
+ ++ P+ T+DG V L AN+G D+ V G +GIGL+RTEF+
Sbjct: 243 KRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFLY 302

Query: 301 MDVSTLPDEKAQYSLYCDALHALGGKTFTIRTLDIGADKELPCLCQEIEDNPALGLRGIR 360
MD LP E+ Q+ Y + + + GK IRTLDIG DKEL L E NP LG R IR
Sbjct: 303 MDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTLDIGGDKELSYLQLPKELNPFLGFRAIR 362

Query: 361 YTLAHPDLFKTQLRAILRAANHGPIRLMFPMVNQVEELDEVFALIAQCQDALEEEEKGYG 420
L D+F+TQLRA+LRA+ +G +++MFPM+ +EEL + A++ + +D L E
Sbjct: 363 LCLEKQDIFRTQLRALLRASTYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGVDVS 422

Query: 421 E-LSYGIVVETPAAVFNLNAMLPRLDFVSIGTNDLTQYAMAADRTNPQLTRDYPSLSPAI 479
+ + GI+VE P+ N +DF SIGTNDL QY MAADR N +++ Y PAI
Sbjct: 423 DSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPYHPAI 482

Query: 480 LALINMTIVQAKAANVKVSLCGELASSPQIAPLLIGMGLDELSVNLSSLLEVKAAICQGN 539
L L++M I A + V +CGE+A PLL+G+GLDE S++ +S+L ++ + + +
Sbjct: 483 LRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQLLKLS 542

Query: 540 IQQFSALAHTALQQD 554
++ A AL D
Sbjct: 543 KEELKPFAQKALMLD 557


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS14305  TCRTETA       37     2e-04    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 36.7 bits (85), Expect = 2e-04
Identities = 31/161 (19%), Positives = 60/161 (37%), Gaps = 10/161 (6%)

Query: 33 MVWPFLAVILYE--KFALSATEVGMVLSSAAIISVFTSFVGSSLSDRIGRHKLMYVTGTL 90
++ P L +L + G++L+ A++ + V +LSDR GR ++ V+
Sbjct: 23 LIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAG 82

Query: 91 YIISFSLLAEANSVKGYVVVMTLCSMATSLWRPLTSAAIGDIIAD---PKTRELAMQSLY 147
+ ++++A A + + + T A G IAD R +
Sbjct: 83 AAVDYAIMATAP-----FLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGFMS 137

Query: 148 FIVNVGCAVGPMLGVWLGLTGKQSSFYLTAVAFAVLLIMLY 188
G GP+LG +G + F+ A + +
Sbjct: 138 ACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGC 178



Score = 33.3 bits (76), Expect = 0.002
Identities = 26/130 (20%), Positives = 50/130 (38%), Gaps = 9/130 (6%)

Query: 320 AMVIISTQFLLLKLMARFSLVKRIQIGLLLLICSQVWLAFNPLDLFWGW-IGAIVVMSVA 378
+ ++ + + AR + + +G++ + LAF GW I+V+ +
Sbjct: 257 ILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAF----ATRGWMAFPIMVLLAS 312

Query: 379 ETILFPTMNVHIDRLAPDHLRGAYFGA-ASFYDLGFALAPLGGGIILDHFGGQW---LFL 434
I P + + R + +G G+ A+ L + PL I W ++
Sbjct: 313 GGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASITTWNGWAWI 372

Query: 435 TGAALSALVI 444
GAAL L +
Sbjct: 373 AGAALYLLCL 382


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS14310  FLAGELLIN     30     0.035    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 30.0 bits (67), Expect = 0.035
Identities = 22/195 (11%), Positives = 52/195 (26%), Gaps = 3/195 (1%)

Query: 367 QSTNAMNSQLHELEQLATAMHEMATTSSDVARNAQGASSAAKEADEATNVGSKVVSDTTN 426
+ N L + + E + + +G + K + + +
Sbjct: 240 DAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKV 299

Query: 427 AINALSSKIDMAVAEVNTLGSATDNIATILKVINDIADQTNLLALNAAI--EAARAGDSG 484
+ K+ + VA++ + D + + E+A+ D
Sbjct: 300 STTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLE 359

Query: 485 RGFAVVADEVRTLAQRTQQSTTQIRNMIEQLQTGARAVAEVMSQSKDNAKDAVTLAQGAN 544
AV + T+ + + +T S +DA +
Sbjct: 360 ANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTAS-GVSTLINEDAAAAKKSTA 418

Query: 545 TALDKIREAILQISD 559
L I A+ ++
Sbjct: 419 NPLASIDSALSKVDA 433


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS14315  ECOLNEIPORIN  48     2e-08    E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 48.3 bits (115), Expect = 2e-08
Identities = 59/326 (18%), Positives = 110/326 (33%), Gaps = 49/326 (15%)

Query: 63 SLYGSLRPTLEYQDKADDVWD----------IGDALSRLGVKADTEFTPNWHAIAQGEWK 112
+LYG+++ +E I D S++G K + AI Q E K
Sbjct: 22 TLYGTIKAGVETSRSVAHNGAQAASVETGTGIVDLGSKIGFKGQEDLGNGLKAIWQVEQK 81

Query: 113 VRLDDNGRFGEARLAFAGIGSPFGQITFGRQRPVQY--------TLVAEYIDIFNNA--- 161
+ R +F G+ FG++ GR V ++Y+ + A
Sbjct: 82 ASIAGTDSGWGNRQSFIGLKGGFGKLRVGRLNSVLKDTGDINPWDSKSDYLGVNKIAEPE 141

Query: 162 NSPFGYNQESPFF--VNNALIYELKLKPVTVMASAIFDGNSGGSGADTINVGAGFDKNGL 219
+SP F ++ ++ Y L N+G +++ + G + G
Sbjct: 142 ARLISVRYDSPEFAGLSGSVQYALN-------------DNAGRHNSESYHAGFNYKNGGF 188

Query: 220 HLGAAYLQQDVYANANRTG---KEQLTGAVISYEFSSGIYAAVGYQAKDYEFETAVNRTG 276
+ + + K Q+ V Y+ + +YA+V Q +D +
Sbjct: 189 FVQYGGAYKR-HHQVQENVNIEKYQIHRLVSGYD-NDALYASVAVQQQDAKLVEENYSHN 246

Query: 277 STFDSA--LAIPFANAYKLKLGYFWFKDG-IEDTSSQD-YDGYNLTLEWQIAANVRTHLE 332
S + A LA F N ++ Y G + T+ + YD + E+ + +
Sbjct: 247 SQTEVAATLAYRFGNV-TPRVSYAHGFKGSFDATNYNNDYDQVVVGAEYDFSKRTSALVS 305

Query: 333 Y-LQQNNDQRED--DTIIALGIRYDF 355
Q T +G+R+ F
Sbjct: 306 AGWLQEGKGESKFVSTAGGVGLRHKF 331


23  AYJ58_RS16180  AYJ58_RS16200  N        Y        N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias            CDNBias  %GCBias    Product
AYJ58_RS16180  -153973748600967  16       48.493045  IucA/IucC family siderophore biosynthesis
AYJ58_RS16185  -153973748600967  16       47.962617  TonB-dependent siderophore receptor
AYJ58_RS16190  -153973748600967  16       48.335533  iron reductase
AYJ58_RS16195  -153973748600967  15       48.065027  intracellular septation protein A
AYJ58_RS16200  -153973748600967  14       46.453333  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS16195  PF04183       618    0.0      IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 618 bits (1596), Expect = 0.0
Identities = 165/593 (27%), Positives = 291/593 (49%), Gaps = 22/593 (3%)

Query: 41 LTPAYWQAANRHLVKKILCEFTHEKIITPTLYGQKAGLNHYELRLNNSTYYFSARHYQLD 100
+ W NR LV K+L E +E++ + G + Y + L + + F A
Sbjct: 1 MNHKDWDLVNRRLVAKMLSELEYEQVFHA----ESQGDDRYCINLPGAQWRFIAERGIWG 56

Query: 101 HLAIDADSIRVSVAGQEQILDAMSLIISLKNDLGISETLLPTYLEEITSTLYSKAYKL-A 159
L IDA ++R ++ + A +L++ LK L +S+ + +++++ +TL L A
Sbjct: 57 WLWIDAQTLRC----ADEPVLAQTLLMQLKQVLSMSDATVAEHMQDLYATLLGDLQLLKA 112

Query: 160 HQAIPAATLARADYQSIEAGMTEGHPVFIANNGRIGFDMQDYRQFAPESAVPMQLVWLGV 219
+ + A+ L + ++ + GHP F+ N GR G+ + ++APE A +L WL V
Sbjct: 113 RRGLSASDLINLNADRLQC-LLSGHPKFVFNKGRRGWGKEALERYAPEYANTFRLHWLAV 171

Query: 220 RKSKTTFAALENLSHDALLKKELG-QQFNDFQQHLKSQQHDPQDFYFMPVHPWQWREKIA 278
++ + + LL + Q+F F Q + D ++ +PVHPWQW++KIA
Sbjct: 172 KREHMIWRCDNEMDIHQLLTAAMDPQEFARFSQVWQENGLD-HNWLPLPVHPWQWQQKIA 230

Query: 279 RVFAGDIARGDLVYLGEGNEQYQVQQSIRTFFNLASPQKCYVKTALSILNMGFMRGLSPL 338
F D A G +V LGE +Q+ QQS+RT N + +K L+I N RG+
Sbjct: 231 TDFIADFAEGRMVSLGEFGDQWLAQQSLRTLTNASRRGGLDIKLPLTIYNTSCYRGIPGR 290

Query: 339 YMSCTPQINAWVANLVESDPYFTQQGFVILKEIAAIGYHHHYYEQALTQDSAYKKMLSAL 398
Y++ P + W+ + +D Q G VIL E AA H Y Y++ML +
Sbjct: 291 YIAAGPLASRWLQQVFATDATLVQSGAVILGEPAAGYVSHEGYAALARAPYRYQEMLGVI 350

Query: 399 WRESPLPHIEPKQNLMTMAALLHTDHEDKALIAALIAASGLPAKEWVNRYLNLYLSPLLH 458
WRE+P ++P ++ + MA L+ D ++ L A I SGL A+ W+ + + + PL H
Sbjct: 351 WRENPCRWLKPDESPVLMATLMECDENNQPLAGAYIDRSGLDAETWLTQLFRVVVVPLYH 410

Query: 459 AFFAYDLVFMPHGENLILVLDEYVPVKILMKDIGEEVAVLNG----AKPLPDDVKRLAVS 514
Y + + HG+N+ L + E VP ++L+KD ++ ++ LP +V+ +
Sbjct: 411 LLCRYGVALIAHGQNITLAMKEGVPQRVLLKDFQGDMRLVKEEFPEMDSLPQEVRDVTSR 470

Query: 515 LEEEMKLNYILLDIFDCIFRYLAPLLDEQTSVSESQFWELVADNVRDYQAQHPHLAAKFA 574
L + ++ + F + R+++PL+ + V E +F++L+A + DY +HP ++ +FA
Sbjct: 471 LSADYLIHDLQTGHFVTVLRFISPLMV-RLGVPERRFYQLLAAVLSDYMKKHPQMSERFA 529

Query: 575 QYDLFKDSFVRTCLNRIQLNNNQQMIDLADREKNL-RFAGGIDNPLAAFRQSH 626
+ LF+ +R LN ++L DL + L + + NPL Q +
Sbjct: 530 LFSLFRPQIIRVVLNPVKL----TWPDLDGGSRMLPNYLEDLQNPLWLVTQEY 578


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS16200  PRTACTNFAMLY  32     0.014    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 31.6 bits (71), Expect = 0.014
Identities = 31/130 (23%), Positives = 43/130 (33%), Gaps = 21/130 (16%)

Query: 224 DSGSVRGRAVAAYQDKDSFQDRYEQQRTTLYGIVETDIGDSTLFTLGVDYQDATPSGTMS 283
D+G GR A Q D+ R Q + G F LG D+ A G
Sbjct: 645 DAGGAWGRGFAQRQQLDNRAGRRFDQ--KVAG-----------FELGADHAVAVAGGRWH 691

Query: 284 GGLPLFYSDGSRTNYDRATSTAPDWGSAHTQGLNTFASLEHRFDNGWNLKGTYTYGDNSL 343
G Y+ G R G HT ++ + D+G+ L T
Sbjct: 692 LGGLAGYTRGDRG--------FTGDGGGHTDSVHVGGYATYIADSGFYLDATLRASRLEN 743

Query: 344 EFDVLWATGY 353
+F V + GY
Sbjct: 744 DFKVAGSDGY 753


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS16205  2FE2SRDCTASE  107    4e-29    Ferric iron reductase signature.
		>2FE2SRDCTASE#Ferric iron reductase signature.

Length = 262

Score = 107 bits (269), Expect = 4e-29
Identities = 69/242 (28%), Positives = 98/242 (40%), Gaps = 71/242 (29%)

Query: 127 KALHSLWGQWYFGLLVPPMMEWIFNAPETDLESIHWQPQSIFMQVHPSGRVAKFECNIAK 186
K L SLW QWY GL+VPP+M + + + P+ + H +GRVA F ++ +
Sbjct: 89 KPLISLWAQWYIGLMVPPLMLALLTQEKA----LDVSPEHFHAEFHETGRVACFWVDVCE 144

Query: 187 HQPNTALTFKKYHGFEPLCQTNTKSSFKTDTKVHSPLSLYKPTVDKELALQGLILNLLQP 246
+ T HSP + LI L P
Sbjct: 145 DKNATP---------------------------HSPQHRM----------ETLISQALVP 167

Query: 247 SVERLLTLSPVPVKLYWSHLGYLIHWYLGELG--LTEQHSQQLKQALFRRTTFLDGSTNP 304
V+ L + KL WS+ GYLI+WYL E+ L E + L+ ALF T +G NP
Sbjct: 168 VVQALEATGEINGKLIWSNTGYLINWYLTEMKQLLGEATVESLRHALFFEKTLTNGEDNP 227

Query: 305 LYNSINLLIEPVQSSADSSKDSSVKTSTTLNSKPSPKIHCIRRTCCLRYQLANTGQCHDC 364
L+ ++ L +D + +RRTCC RY+L + QC DC
Sbjct: 228 LWRTVVL------------RDGLL----------------VRRTCCQRYRLPDVQQCGDC 259

Query: 365 PL 366
L
Sbjct: 260 TL 261


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS16215  adhesinmafb   25     0.044    Neisseria meningitidis: adhesin MafB signature.
		>adhesinmafb#Neisseria meningitidis: adhesin MafB signature.

Length = 467

Score = 25.4 bits (55), Expect = 0.044
Identities = 9/44 (20%), Positives = 14/44 (31%)

Query: 54 AGFSGSLVVADFESLVAAKHWADADPYIEAGVYQSVVVKPFKRV 97
G GS+ + + A W +P V V +V
Sbjct: 279 IGGLGSVAGFEKNTREAVDRWIQENPNAAETVEAVFNVAAAAKV 322


24  AYJ58_RS17170  AYJ58_RS17435  N        Y        N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias            CDNBias  %GCBias    Product
AYJ58_RS17170  -153973748600967  21       44.067548  two-component system response regulator
AYJ58_RS17175  -153973748600967  21       43.981481  VacJ family lipoprotein
AYJ58_RS17180  -153973748600967  24       45.194647  flagellar biosynthesis protein FlhB
AYJ58_RS17185  -153973748600967  20       45.203160  DUF2802 domain-containing protein
AYJ58_RS17190  -153973748600967  25       45.035203  chemotaxis protein CheW
AYJ58_RS17195  -153973748600967  27       45.313415  chemotaxis protein CheW
AYJ58_RS17200  -153973748600967  26       44.956772  ParA family protein
AYJ58_RS17205  -153973748600967  26       44.974948  membrane anchored protein in chemotaxis locus
AYJ58_RS17210  -153973748600967  24       43.400103  chemotaxis response regulator protein-glutamate
AYJ58_RS17215  -153973748600967  25       43.776461  chemotaxis protein CheA
AYJ58_RS17220  -153973748600967  26       44.818890  protein phosphatase CheZ
AYJ58_RS17225  -153973748600967  25       45.149314  chemotaxis protein CheY
AYJ58_RS17230  -153973748600967  25       44.951848  RNA polymerase sigma factor FliA
AYJ58_RS17235  -153973748600967  23       45.029240  MinD/ParA family protein
AYJ58_RS17240  -153973748600967  24       44.857043  flagellar biosynthesis protein FlhF
AYJ58_RS17245  -153973748600967  22       45.080619  flagellar biosynthesis protein FlhA
AYJ58_RS17250  -153973748600967  22       44.876046  flagellar biosynthesis protein FlhB
AYJ58_RS17255  -153973748600967  21       44.766973  flagellar type III secretion system protein
AYJ58_RS17260  -153973748600967  20       45.101608  flagellar biosynthetic protein FliQ
AYJ58_RS17265  -153973748600967  19       45.219749  flagellar biosynthetic protein FliP
AYJ58_RS17270  -153973748600967  20       45.262035  flagellar biosynthetic protein FliO
AYJ58_RS17275  -153973748600967  19       44.983668  flagellar motor switch protein FliN
AYJ58_RS17280  -153973748600967  19       44.529680  flagellar motor switch protein FliM
AYJ58_RS17285  -153973748600967  22       43.903088  flagellar basal body-associated protein FliL
AYJ58_RS17295  -153973748600967  23       43.666962  flagellar hook-length control protein FliK
AYJ58_RS17300  -153973748600967  21       43.306788  flagellar export protein FliJ
AYJ58_RS17305  -153973748600967  22       43.285082  flagellar protein export ATPase FliI
AYJ58_RS17310  -153973748600967  17       45.334013  flagellar assembly protein FliH
AYJ58_RS17315  -153973748600967  18       45.635834  flagellar motor switch protein FliG
AYJ58_RS17320  -153973748600967  14       46.162080  flagellar basal body M-ring protein FliF
AYJ58_RS17325  -153973748600967  13       46.697807  flagellar hook-basal body complex protein FliE
AYJ58_RS17330  -153973748600967  13       48.267074  sigma-54-dependent Fis family transcriptional
AYJ58_RS17335  -153973748600967  14       48.341442  PAS domain-containing protein
AYJ58_RS17340  -153973748600967  13       47.181373  sigma-54-dependent Fis family transcriptional
AYJ58_RS17345  -153973748600967  12       46.911447  flagella export chaperone FliS
AYJ58_RS17350  -153973748600967  15       46.514213  hypothetical protein
AYJ58_RS17355  -153973748600967  19       45.146910  flagellar hook protein FliD
AYJ58_RS17360  -153973748600967  20       44.193062  flagellar biosynthesis protein FlaG
AYJ58_RS17365  -153973748600967  21       43.375426  flagellin
AYJ58_RS17370  -153973748600967  26       42.875817  flagellin
AYJ58_RS17375  -153973748600967  34       42.564675  flagellar hook-associated protein 3
AYJ58_RS17380  -153973748600967  30       42.674437  flagellar hook-associated protein FlgK
AYJ58_RS17385  -153973748600967  24       44.465894  peptidoglycan hydrolase FlgJ
AYJ58_RS17390  -153973748600967  22       45.275017  flagellar basal body P-ring protein FlgI
AYJ58_RS17395  -153973748600967  22       45.472132  flagellar basal body L-ring protein FlgH
AYJ58_RS17400  -153973748600967  17       45.951714  flagellar basal-body rod protein FlgG
AYJ58_RS17405  -153973748600967  13       46.444249  flagellar basal-body rod protein FlgF
AYJ58_RS17410  -153973748600967  14       47.370074  flagellar hook protein FlgE
AYJ58_RS17415  -153973748600967  19       46.098191  flagellar hook assembly protein FlgD
AYJ58_RS17420  -153973748600967  19       45.849439  flagellar basal body rod protein FlgC
AYJ58_RS17425  -153973748600967  19       45.628415  flagellar basal body rod protein FlgB
AYJ58_RS17430  -153973748600967  21       45.432264  protein-glutamate O-methyltransferase CheR
AYJ58_RS17435  -153973748600967  23       44.782705  chemotaxis protein CheV
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS17190  HTHFIS        90     7e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.9 bits (223), Expect = 7e-22
Identities = 29/114 (25%), Positives = 50/114 (43%), Gaps = 4/114 (3%)

Query: 8 VLLVEDDPVFRQIVASFLDSRGAQVTQACDGEEGLSFFKSQHFDVVLADLSMPKLGGLDM 67
+L+ +DD R ++ L G V + + + D+V+ D+ MP D+
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 68 LKEMTRLAPLVPSVVISGNNVMADVVEALRIGASDYLVKPVSDLFIIEQAIKQS 121
L + + P +P +V+S N ++A GA DYL KP F + + I
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKP----FDLTELIGII 115


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS17195  VACJLIPOPROT  232    6e-79    VacJ lipoprotein signature.
		>VACJLIPOPROT#VacJ lipoprotein signature.

Length = 251

Score = 232 bits (594), Expect = 6e-79
Identities = 96/265 (36%), Positives = 142/265 (53%), Gaps = 20/265 (7%)

Query: 1 MKLKWMGLSLGLLLLPKVQAAEVPVSDTIQQETPAKVQITYDDPRDPFEGFNRAMWDFNY 60
MKL+ L+LG LL +D + DP EGFNR M++FN+
Sbjct: 1 MKLRLSALALGTTLL---VGCASSGTDQQGRS-------------DPLEGFNRTMYNFNF 44

Query: 61 LYLDRYLYRPVAHGYNDYVPMPAKTGINNFVQNLEEPSSLVNNVLQGKWGWAANAGGRFT 120
LD Y+ RPVA + DYVP PA+ G++NF NLEEP+ +VN LQG RF
Sbjct: 45 NVLDPYIVRPVAVAWRDYVPQPARNGLSNFTGNLEEPAVMVNYFLQGDPYQGMVHFTRFF 104

Query: 121 INTTVGLLGVIDVADMMGMSRKQDE---FNEVLGYYGVPNGPYFMAPFAGPYVVRELASD 177
+NT +G+ G IDVA M ++ E F LG+YGV GPY PF G + +R+ D
Sbjct: 105 LNTILGMGGFIDVAGMANPKLQRTEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRDDGGD 164

Query: 178 WVDGLYFPLSELTMWQTIVKWGLKNLHSRASAIDQERLVDNALDPYAFVKDAYLQHMDYK 237
D LY LS LT ++ KW L+ + +RA +D + L+ + DPY V++AY Q D+
Sbjct: 165 MADALYPVLSWLTWPMSVGKWTLEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFI 224

Query: 238 VYDGNV-PQKQDDDELLEQYMQELE 261
G + PQ+ + + ++ +++++
Sbjct: 225 ANGGELKPQENPNAQAIQDDLKDID 249


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS17215  TYPE3IMSPROT  58     1e-13    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 58.2 bits (141), Expect = 1e-13
Identities = 16/93 (17%), Positives = 34/93 (36%), Gaps = 9/93 (9%)

Query: 10 AVALSYDG--HNAPKIVATGEGLIAEEIIALAKANGVYIHQDPHLSHFL-QLLELGDEIP 66
A+ + Y P + + + +A+ GV I Q L+ L + IP
Sbjct: 268 AIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEEEGVPILQRIPLARALYWDALVDHYIP 327

Query: 67 KELYLLIAELIAFVYMLDGKFPEQWNNMHQKIV 99
E AE++ ++ + + H +++
Sbjct: 328 AEQIEATAEVLRWLERQNIE------KQHSEML 354


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS17230  IGASERPTASE   32     0.004    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 32.0 bits (72), Expect = 0.004
Identities = 24/151 (15%), Positives = 46/151 (30%), Gaps = 4/151 (2%)

Query: 19 EDDAPSQAVSPQSIVKADSNQNVATNQTEAALLDAAILAENGRTAASQAHERAITPTVQL 78
++ + + Q+ A S QT A + E A + + P ++
Sbjct: 1070 KEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKE--EKAKVETEKTQEVP--KV 1125

Query: 79 QPKATPRQNEFVEPVSKIASQVDKQALEKLLAPVLKAKTVEPTPSFVEVTPAVDVQPVVE 138
+ +P+Q + + + + P + T T + T + QPV E
Sbjct: 1126 TSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTE 1185

Query: 139 VPQAQPALEEPVNDEIIAEEVVEPVVVPETQ 169
N E +P V E+
Sbjct: 1186 STTVNTGNSVVENPENTTPATTQPTVNSESS 1216


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS17245  HTHFIS        66     4e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 66.4 bits (162), Expect = 4e-14
Identities = 31/168 (18%), Positives = 65/168 (38%), Gaps = 9/168 (5%)

Query: 2 AIKVLVVDDSSFFRRRVSEIVNQDPELEVIATASNGAEAVKMAAELNPQVITMDIEMPVM 61
+LV DD + R +++ +++ + SN A + A + ++ D+ MP
Sbjct: 3 GATILVADDDAAIRTVLNQALSR--AGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 62 DGITAVREIMAKCP-TPILMFSSLTHDGAKATLDALDAGALDFLPKRFEDIATNKDDAIL 120
+ + I P P+L+ S+ + + A + GA D+LPK F D+
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSA--QNTFMTAIKASEKGAYDYLPKPF-DLTELIGIIGR 117

Query: 121 LLQQRVKALGRRRMFRPIARPVVTSTPSVRPTSSVLGTTSIASHTPAT 168
L + + + P+V + +++ + + T T
Sbjct: 118 ALAEPKRRPSKLEDDSQDGMPLVGRSAAMQ---EIYRVLARLMQTDLT 162


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS17250  PF06580       45     6e-07    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 45.2 bits (107), Expect = 6e-07
Identities = 18/105 (17%), Positives = 37/105 (35%), Gaps = 23/105 (21%)

Query: 435 TLNKEIDLIMV---------GEETDLDKNLVEALADPLVH------LVRNSVDHGIEMPN 479
+L E+ ++ + + + A+ D V LV N + HGI
Sbjct: 217 SLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGIA--- 273

Query: 480 EREASGKPRTGTITLSASQEGDHILLKIEDDGAGMDPEKLKKIAI 524
P+ G I L +++ + L++E+ G+ +
Sbjct: 274 -----QLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGT 313


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS17260  HTHFIS        90     3e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.9 bits (223), Expect = 3e-24
Identities = 33/105 (31%), Positives = 51/105 (48%), Gaps = 3/105 (2%)

Query: 6 KILIVDDFSTMRRIIKNLLRDLGFNNTQEADDGSTALPMLQKGDFDFVVTDWNMPGMQGI 65
IL+ DD + +R ++ L G++ + +T + GD D VVTD MP
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 66 DLLKAIRADDSLKHLPVLMVTAEAKREQIIAAAQAGVNGYVVKPF 110
DLL I+ LPVL+++A+ I A++ G Y+ KPF
Sbjct: 64 DLLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF 106


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS17275  PF05272       31     0.013    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.8 bits (69), Expect = 0.013
Identities = 9/25 (36%), Positives = 12/25 (48%)

Query: 240 VKQGGVVALVGPTGVGKTTSLAKLA 264
K V L G G+GK+T + L
Sbjct: 593 CKFDYSVVLEGTGGIGKSTLINTLV 617


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS17285  TYPE3IMSPROT  326    e-112    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 326 bits (838), Expect = e-112
Identities = 94/347 (27%), Positives = 179/347 (51%), Gaps = 2/347 (0%)

Query: 6 SGERSEEPTGRRLEQAREKGQIARSKELGTAAVLISAACGFYMLGPSLAKSLTRVFETVF 65
SGE++E+PT +++ AR+KGQ+A+SKE+ + A++++ + L + +++ +
Sbjct: 2 SGEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLM--LI 59

Query: 66 TMDRAQVFDTEEMFNVWGVVAGEIVWPMAKIMLLIVVVAFIGNVALGGMNFSTQAMMPKA 125
+++ + ++ + V V E + ++ + ++A +V G S +A+ P
Sbjct: 60 PAEQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDI 119

Query: 126 SKMSPAAGLKRMFGVQALVELTKGIAKFSVVAFSAYLLLSFYFNEILLLSSDHLPGNVYH 185
K++P G KR+F +++LVE K I K +++ ++++ +L L + +
Sbjct: 120 KKINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPL 179

Query: 186 ALDLLVWMFILLCSSILLIVVIDVPFQLWNHNKQLKMTKQEVKDEYKDTEGKPEVKGRVR 245
+L + ++ ++I + D F+ + + K+LKM+K E+K EYK+ EG PE+K + R
Sbjct: 180 LGQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRR 239

Query: 246 QMQRELAQRRMMAEVPNADVIVVNPEHFAVAIKYDVQRSVAPFVVAKGVDDVAFKIREIA 305
Q +E+ R M V + V+V NP H A+ I Y + P V K D +R+IA
Sbjct: 240 QFHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIA 299

Query: 306 RAHNVTIVSAPPLARAIYHTTKLDQQIPEGLFTAVAQILAYVFQLRQ 352
V I+ PLARA+Y +D IP A A++L ++ +
Sbjct: 300 EEEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNI 346


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS17290  TYPE3IMRPROT  122    5e-36    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 122 bits (309), Expect = 5e-36
Identities = 93/249 (37%), Positives = 145/249 (58%), Gaps = 1/249 (0%)

Query: 9 VQTIAAYMWPLFRVASMLMVMVVFGAATTPTRVRLLLAMAITFAIAPVLPPVQNADLFSL 68
+ + Y WPL RV +++ + + P RV+L LAM ITFAIAP LP + +FS
Sbjct: 10 LSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLP-ANDVPVFSF 68

Query: 69 SAVFITAQQIIIGVAMGFVTQMVMQVFVLTGQIIGMQTSLGFASMVDPGSGQQTPVIGNF 128
A+++ QQI+IG+A+GF Q G+IIG+Q L FA+ VDP S PV+
Sbjct: 69 FALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLARI 128

Query: 129 FLLLATLIFLAVDGHLLMIRMLVVSFETLPISNQGLTLTSYRALADWGSYMFGAALTMSI 188
+LA L+FL +GHL +I +LV +F TLPI + L ++ AL GS +F L +++
Sbjct: 129 MDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIFLNGLMLAL 188

Query: 189 SAIIALLLVNLSFGVMTRAAPQLNIFSIGFPITMIGGLFILWLTLTPVMAHFDEVWAAAQ 248
I LL +NL+ G++ R APQL+IF IGFP+T+ G+ ++ + + + +++
Sbjct: 189 PLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFCEHLFSEIF 248

Query: 249 VLLCDMLAL 257
LL D+++
Sbjct: 249 NLLADIISE 257


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS17295  TYPE3IMQPROT  49  2e-11  Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein

family signature.
Length = 86

Score = 48.6 bits (116), Expect = 2e-11
Identities = 22/78 (28%), Positives = 41/78 (52%)

Query: 4 EALIDIFREALSVIVIMVSAIVLPGLGIGLIVAVFQAATSINEQTLSFLPRLIITLLALM 63
+ L+ +AL +++I+ + IGL+V +FQ T + EQTL F +L+ L L
Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61

Query: 64 FMGHWLVETLMDFFVEMV 81
+ W E L+ + +++
Sbjct: 62 LLSGWYGEVLLSYGRQVI 79


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS17300  FLGBIOSNFLIP  278  4e-97  Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP

signature.
Length = 245

Score = 278 bits (712), Expect = 4e-97
Identities = 126/244 (51%), Positives = 183/244 (75%), Gaps = 3/244 (1%)

Query: 4 RILALVGFVILLCMPSAWAADGVLPAVTVTTGPDGSTEYSVTMQILLLMTSLSFLPAMLI 63
R+L++ ++ L P A+A LP +T P G +S+ +Q L+ +TSL+F+PA+L+
Sbjct: 3 RLLSVAPVLLWLITPLAFAQ---LPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILL 59

Query: 64 MLTSFTRIIIVLSILRQAIGLQQTPSNQVLIGMSLFMTFFIMAPVFDRIYDEGVKPYIEE 123
M+TSFTRIIIV +LR A+G P NQVL+G++LF+TFFIM+PV D+IY + +P+ EE
Sbjct: 60 MMTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEE 119

Query: 124 QLTLQQAFEKGKEPIRSFMLGQVRTTDLKTFIEISGYKNIKSPEEAPMSVLIPAFITSEL 183
++++Q+A EKG +P+R FML Q R DL F ++ ++ PE PM +L+PA++TSEL
Sbjct: 120 KISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSEL 179

Query: 184 KTAFQIGFMLFVPFLVLDLVVASILMAMGMMMLSPMIVSLPFKIMLFVLVDGWSLVLGTL 243
KTAFQIGF +F+PFL++DLV+AS+LMA+GMMM+ P ++LPFK+MLFVLVDGW L++G+L
Sbjct: 180 KTAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSL 239

Query: 244 ANSF 247
A SF
Sbjct: 240 AQSF 243


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS17310  FLGMOTORFLIN  109  6e-34  Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 109 bits (273), Expect = 6e-34
Identities = 53/119 (44%), Positives = 79/119 (66%)

Query: 7 DDWAAAMAEQALEEANAIELDELVDDSRPISKAEAAKLDTILDIPVTISMEVGRSYISIR 66
D WA A+ EQ + +D I+DIPV +++E+GR+ ++I+
Sbjct: 17 DLWADALNEQKATTTKSAADAVFQQLGGGDVSGAMQDIDLIMDIPVKLTVELGRTRMTIK 76

Query: 67 NLLQLNQGSVVELDRVAGEPLDVMVNGTLIAHGEVVVVNDKFGIRLTDVISQTERIKKL 125
LL+L QGSVV LD +AGEPLD+++NG LIA GEVVVV DK+G+R+TD+I+ +ER+++L
Sbjct: 77 ELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGVRITDIITPSERMRRL 135


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS17315  FLGMOTORFLIM  248  1e-82  Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 248 bits (635), Expect = 1e-82
Identities = 87/327 (26%), Positives = 164/327 (50%), Gaps = 12/327 (3%)

Query: 1 MSDLLSQDEIDALLHGVDDVDDDDEIDSAG----HDARSYDFSSQDRIVRGRMPTLEIVN 56
M+++LSQDEID LL + D E D+ YDF D+ + +M TL +++
Sbjct: 1 MTEVLSQDEIDQLLTAISSGDASIE-DARPISDTRKITLYDFRRPDKFSKEQMRTLSLMH 59

Query: 57 ERFARHLRISMFNMMRRAAEVSINGVQMLKFGEYVHTLFVPTSLNMVRFSPLKGTALITM 116
E FAR S+ +R V + V L + E++ ++ P++L ++ PLKG A++ +
Sbjct: 60 ETFARLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEV 119

Query: 117 EARLVFILVDNFFGGDGRFHAKIEGREFTPTERRIVQLLLKIIFEDYKDAWAPVMDVEFD 176
+ + F ++D FGG G+ R+ T E +++ ++ I + +++W V+D+
Sbjct: 120 DPSITFSIIDRLFGGTGQAAKVQ--RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPR 177

Query: 177 YLDSEVNPAMANIVSPTEVVVINSFHIEVDGGGGDFHITMPYSMIEPIRELLDAG--VQS 234
E NP A IV P+E+VV+ + +V G + +PY IEPI L + S
Sbjct: 178 LGQIETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSS 237

Query: 235 DKQDTDMRWSQALHDEIMDVKVGFDANIVEHELTLKDVMNFKAGDIIPIE---LPEYIMM 291
++ + ++ L D++ V + A + L+++D++ + GDII + + + ++
Sbjct: 238 VRRSSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVL 297

Query: 292 KIEDLPTYRCKMGRSRDNLALKIYEKI 318
I + + C+ G +A +I E+I
Sbjct: 298 SIGNRKKFLCQPGVVGKKIAAQILERI 324


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS17325  FLGHOOKFLIK  52  4e-09  Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 52.1 bits (124), Expect = 4e-09
Identities = 36/132 (27%), Positives = 65/132 (49%), Gaps = 5/132 (3%)

Query: 592 MKQQLITMVSQGIQHAEIRLDPPELGHMLVKIQVHGDQTQVQFHVTQTQTRDLVEQAMPR 651
+ Q + QG Q AE+RL P +LG + + ++V +Q Q+Q R +E A+P
Sbjct: 244 LSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAALPV 303

Query: 652 LRELLQEQGMQLADSHVSQGGQGERREGSFGDGGGSSGADVDEISAEE-----LHLGLNQ 706
LR L E G+QL S++S +++ + A+ + ++ E+ + + L
Sbjct: 304 LRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVSLQG 363

Query: 707 ASSVNSGIDYYA 718
+ NSG+D +A
Sbjct: 364 RVTGNSGVDIFA 375


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS17330  FLGFLIJ  44  2e-08  Flagellar FliJ protein signature.
		>FLGFLIJ#Flagellar FliJ protein signature.

Length = 147

Score = 44.0 bits (103), Expect = 2e-08
Identities = 39/145 (26%), Positives = 70/145 (48%)

Query: 1 MANADPLLLVLKLANDAEEQAALLLKSAQLECQKRLNQLSALNNYRLEYMKQMQSQQGQA 60
MA L + LA E AA LL + CQ+ QL L +Y+ EY + S
Sbjct: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60

Query: 61 ISASHYHQFHRFIRQIDDAITQQNRVVADGEKQKEYRQQHWLEKQKKRKAVELLLASKEK 120
I+++ + + +FI+ ++ AITQ + + ++ + W EK+++ +A + L +
Sbjct: 61 ITSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120

Query: 121 KRQVVEQKREQKMTDEFASQQFYRR 145
+ E + +QK DEFA + R+
Sbjct: 121 AALLAENRLDQKKMDEFAQRAAMRK 145


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS17340  FLGFLIH  89  6e-23  Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 89.1 bits (220), Expect = 6e-23
Identities = 57/201 (28%), Positives = 102/201 (50%), Gaps = 4/201 (1%)

Query: 50 AAKPTTVESVSPPTMAEIEDIRAQAEEEGFA---EGKQQGYEQGLEKGRLEGLEQGHTEG 106
A + P IE+ E++ + +QGY+ G+ +GR +G +QG+ EG
Sbjct: 16 APPQAEFVPIVEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGIAEGRQQGHKQGYQEG 75

Query: 107 LAQGHEQGLETGLAQAKVLLSRFEALLTQFEKPLQLLDGDIELSLLNLSMTLAKSVIGHE 166
LAQG EQGL +Q + +R + L+++F+ L LD I L+ +++ A+ VIG
Sbjct: 76 LAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRLMQMALEAARQVIGQT 135

Query: 167 LKTHPEQVLSALRLGIESLPIKEQAVTIRLHPDDVILVEQLYSTAQLTRSKWELEVDPTL 226
++ ++ ++ P+ +R+HPDD+ V+ + A L+ W L DPTL
Sbjct: 136 PTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLG-ATLSLHGWRLRGDPTL 194

Query: 227 SAGDCILSSHRSLVDLTLSSR 247
G C +S+ +D ++++R
Sbjct: 195 HPGGCKVSADEGDLDASVATR 215


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS17345  FLGMOTORFLIG  286  3e-97  Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 286 bits (733), Expect = 3e-97
Identities = 109/350 (31%), Positives = 195/350 (55%), Gaps = 7/350 (2%)

Query: 1 MAENKTKEVAPAAAPAFNIKDISGVEKTAILLLSLSEADAASILKHLEPKQVQKVGMAMA 60
M E K KE+ ++ ++G +K AILL+S+ ++ + K+L ++++ + +A
Sbjct: 1 MEEKKEKEIL-------DVSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIA 53

Query: 61 AMDEFGQEKVIGVHKLFLDDIQKYSSIGFNSEEFVRKALTAALGEDKAGNLIEQIIMGSG 120
++ E V F + + I ++ R+ L +LG KA ++I +
Sbjct: 54 KLETITSELKDNVLLEFKELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQ 113

Query: 121 AKGLDSLKWMDARQVATIIQNEHPQIQTIVLSYLEPDQAAEIFGQFPENTRLDLMMRIAN 180
++ + ++ D + IQ EHPQ ++LSYL+P +A+ I P + ++ RIA
Sbjct: 114 SRPFEFVRRADPANILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIAL 173

Query: 181 LEEVQPAALQELNDIMEKQFAGQGGAQAAKMGGLKAAANIMNYLDTGIESQLMETMRESD 240
++ P ++E+ ++EK+ A GG+ I+N D E ++E++ E D
Sbjct: 174 MDRTSPEVVREVERVLEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEED 233

Query: 241 EEMAQQIQDLMFVFENLIDVDDRGIQALLREVQQDVLMKALKGTDDQLKEKILGNMSKRA 300
E+A++I+ MFVFE+++ +DDR IQ +LRE+ L KALK D ++EKI NMSKRA
Sbjct: 234 PELAEEIKKKMFVFEDIVLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRA 293

Query: 301 AELLRDDLEAMGPIRISEVEVAQKEILSIARRLSDSGEIMLGGGGGDEFL 350
A +L++D+E +GP R +VE +Q++I+S+ R+L + GEI++ GG ++ L
Sbjct: 294 ASMLKEDMEFLGPTRRKDVEESQQKIVSLIRKLEEQGEIVISRGGEEDVL 343


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS17350  FLGMRINGFLIF  305  5e-99  Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 305 bits (782), Expect = 5e-99
Identities = 161/560 (28%), Positives = 266/560 (47%), Gaps = 42/560 (7%)

Query: 29 NLGGVDMMRQVTMILALAICLALAVFVMLWAQEPEYRPL-GKMETQEMVQVLDVLDKNKV 87
L + ++ +I+A + +A+ V ++LWA+ P+YR L + Q+ ++ L + +
Sbjct: 15 WLNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNI 74

Query: 88 KYQIDVD--VIKVPEDKYQEVKMMLSRAGVDSPAASQDFLNQDSGFGVSQRMEQARLKHS 145
Y+ I+VP DK E+++ L++ G+ A L FG+SQ EQ + +
Sbjct: 75 PYRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQVNYQRA 134

Query: 146 QEENLARAIEQLQSVSRAKVILALPKENVFARNASKPSATVVINTRRG-GLGQGEVDAIV 204
E LAR IE L V A+V LA+PK ++F R PSA+V + G L +G++ A+V
Sbjct: 135 LEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQISAVV 194

Query: 205 DIVASAVQGLEPSRVTVTDSNGRLLNSGSQDGASATARRELELVQQKEAEYRTKIESILV 264
+V+SAV GL P VT+ D +G LL + G +L+ E+ + +IE+IL
Sbjct: 195 HLVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDL-NDAQLKFANDVESRIQRRIEAILS 253

Query: 265 PILGPDNFTSQVDVSMDFTAVEQTSKRYNPDLPALRSEMTVENNTT-----GGSSGGIPG 319
PI+G N +QV +DF EQT + Y+P+ A ++ + G GG+PG
Sbjct: 254 PIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPGGVPG 313

Query: 320 ALSNQPP---------------MESNIPQDAT-KATESVTAGNSHREATRNFELDTTISH 363
ALSNQP N PQ +T + S ++ R T N+E+D TI H
Sbjct: 314 ALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSNYEVDRTIRH 373

Query: 364 TRQQVGAVRRISVSVAVDFKPGAAGENGQVARVARTEQELTNIRRLLEGAVGFSSQRGDV 423
T+ VG + R+SV+V V++K A G+ + T ++ I L A+GFS +RGD
Sbjct: 374 TKMNVGDIERLSVAVVVNYKTLADGKP-----LPLTADQMKQIEDLTREAMGFSDKRGDT 428

Query: 424 LEVVTVPFMDQLVEDLPALELWEQPWFWRAIKLGIGALVILVLILAVVRPMLKRLIYPDS 483
L VV PF + L W+Q F + L L+L V + ++ + P
Sbjct: 429 LNVVNSPF-SAVDNTGGELPFWQQQSFIDQLLAAGRWL----LVLVVAWILWRKAVRPQL 483

Query: 484 VNMPEDGRLGNELAEIEDQYAADTLGMLNTQEAEYSYADDGSIHIPNLHKDDDMIKAIRA 543
E+ + E A++ + L+ + E + + + M + IR
Sbjct: 484 TRRVEEAKAAQEQAQVRQETEEAVEVRLS--KDEQLQQRRANQRLGA----EVMSQRIRE 537

Query: 544 LVANEPELSTQVVKNWLQDN 563
+ N+P + V++ W+ ++
Sbjct: 538 MSDNDPRVVALVIRQWMSND 557


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS17355  FLGHOOKFLIE  57  6e-14  Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE

signature.
Length = 103

Score = 56.6 bits (136), Expect = 6e-14
Identities = 29/86 (33%), Positives = 45/86 (52%)

Query: 26 QPNIMQQVNNTSGADFGQLLSQAVGNVSGLQSTSSNLATRLEMGDTTVTLSDTVIAREKA 85
Q+ F L A+ +S Q+ + A + +G+ V L+D + +KA
Sbjct: 18 MSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTLGEPGVALNDVMTDMQKA 77

Query: 86 SVAFEATVQVRNKLVEAYKEIMSMPV 111
SV+ + +QVRNKLV AY+E+MSM V
Sbjct: 78 SVSMQMGIQVRNKLVAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS17360  HTHFIS  456  e-161  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 456 bits (1176), Expect = e-161
Identities = 167/483 (34%), Positives = 248/483 (51%), Gaps = 42/483 (8%)

Query: 1 MSEAKLLLVEDDASLREALLDTLMLAQYECIDVASGEDAILALKQHQFDLVISDVQMQGI 60
M+ A +L+ +DDA++R L L A Y+ ++ + DLV++DV M
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 GGLGLLNFLQQHHPKLPVLLMTAYATIGSAVDAIKLGAVDYLAKPFAPEVLLNQVSRYLP 120
LL +++ P LPVL+M+A T +A+ A + GA DYL KPF L+ + R L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 121 LKQNVDQPVVAD-----------EKSLALLALAQRVAASDASVMILGPSGSGKEVLARYI 169
+ + D + + R+ +D ++MI G SG+GKE++AR +
Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARAL 180

Query: 170 HQHSSRADQAFVAINCAAIPENMLEATLFGYEKGAFTGAYQACPGKFEQAQGGTLLLDEI 229
H + R + FVAIN AAIP +++E+ LFG+EKGAFTGA G+FEQA+GGTL LDEI
Sbjct: 181 HDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEI 240

Query: 230 SEMDLGLQAKLLRVLQEREVERLGGRKTIKLDVRVLATSNRDLKAVVAAGGFREDLYYRI 289
+M + Q +LLRVLQ+ E +GGR I+ DVR++A +N+DLK + G FREDLYYR+
Sbjct: 241 GDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRL 300

Query: 290 NVFPLAWPALSQRPADILPLARHLLVKHAKALNVADVPELDENARRRLLSHRWPGNVREL 349
NV PL P L R DI L RH + + A+ + V D+ A + +H WPGNVREL
Sbjct: 301 NVVPLRLPPLRDRAEDIPDLVRHFV-QQAEKEGLD-VKRFDQEALELMKAHPWPGNVREL 358

Query: 350 DNVIQRALILRAGQVITANDIIIDAQDVILG--------------------------GED 383
+N+++R L VIT I + + I
Sbjct: 359 ENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFAS 418

Query: 384 LDQFAAEPDGLGEELKAQEHVIILETLNQCQGSRKLVAEKLGISARTLRYKMARMRDMGI 443
L E+ +IL L +G++ A+ LG++ TLR K +R++G+
Sbjct: 419 FGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKK---IRELGV 475

Query: 444 QLP 446
+
Sbjct: 476 SVY 478


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS17365  PF06580  33  0.002  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 32.5 bits (74), Expect = 0.002
Identities = 19/95 (20%), Positives = 37/95 (38%), Gaps = 19/95 (20%)

Query: 256 LVMNSIEAGAT------EIRIQAAEEGEQLLLNVIDNGKGLDANMQQKVLEPFFTTKSQG 309
LV N I+ G +I ++ ++ + L V + G N + +
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTK------------ES 310

Query: 310 TGLGLA-VVQSVVRNHGGQLQLSCLPNKGCTVSLV 343
TG GL V + + +G + Q+ +G ++V
Sbjct: 311 TGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV 345


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS17370  HTHFIS  435  e-151  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 435 bits (1119), Expect = e-151
Identities = 170/481 (35%), Positives = 263/481 (54%), Gaps = 21/481 (4%)

Query: 7 RILLIGPPSERLNRLCCIFDFLGEQIVQI-DAEKLSASLQDTRFRALVILTDVMDAEM-- 63
IL+ + L G + +A L + +++TDV+ +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGD--LVVTDVVMPDENA 62

Query: 64 ---LKNIAGQHPWQPMLLL---GNVDDLQVSNILG---NIEEPLTYPQLTELLHFCQVFG 114
L I P P+L++ ++ G + +P +L ++
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 115 QVKRPQVPTSANQTKLFRSLVGRSDGIANVRHLINQVATSEATVLVLGQSGTGKEVVARN 174
+ + ++ + LVGRS + + ++ ++ ++ T+++ G+SGTGKE+VAR
Sbjct: 123 KRRPSKLEDDSQD---GMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARA 179

Query: 175 IHYLSERRDGPFIPVNCGAIPPELLESELFGHEKGSFTGAISSRKGRFELAEGGTLFLDE 234
+H +RR+GPF+ +N AIP +L+ESELFGHEKG+FTGA + GRFE AEGGTLFLDE
Sbjct: 180 LHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDE 239

Query: 235 IGDMPLQMQVKLLRVLQERVFERVGGTKTINADVRVVAATHRDLETMISVNEFREDLYYR 294
IGDMP+ Q +LLRVLQ+ + VGG I +DVR+VAAT++DL+ I+ FREDLYYR
Sbjct: 240 IGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYR 299

Query: 295 LNVFPIEMPALCDRKDDVPLLLQELVSRVYNEGRGKVRFTQRAIESLKEHAWSGNVRELS 354
LNV P+ +P L DR +D+P L++ V + EG RF Q A+E +K H W GNVREL
Sbjct: 300 LNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVRELE 359

Query: 355 NLVERLTILYPGGLVDVNDLPVKYRHIDVPEYCVEMSEEQQERDALASIFSDEEPVEIPE 414
NLV RLT LYP ++ + + R ++P+ +E + + +++ EE +
Sbjct: 360 NLVRRLTALYPQDVITREIIENELRS-EIPDSPIEKAAARSGSLSISQAV--EENMRQYF 416

Query: 415 TRFPSELPPEGVNLKDLLAELEIDMIRQALELQDNVVARAAEMLGIRRTTLVEKMRKYGM 474
F LPP G+ +LAE+E +I AL +AA++LG+ R TL +K+R+ G+
Sbjct: 417 ASFGDALPPSGL-YDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475

Query: 475 T 475
+
Sbjct: 476 S 476


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS17395  FLAGELLIN  209  4e-64  Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 209 bits (534), Expect = 4e-64
Identities = 158/508 (31%), Positives = 236/508 (46%), Gaps = 29/508 (5%)

Query: 2 AISVNTNVTSMKAQNQLNGANSKLSTSMERLSSGMRINSAKDDAAGLQISNRFTSQINGI 61
A +NTN S+ QN LN + S LS+++ERLSSG+RINSAKDDAAG I+NRFTS I G+
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 GVAMRNANDGISIAQTAEGAMSESTNILQRMRDLSLQSANGSNSADDRASMQKEVIALQS 121
A RNANDGISIAQT EGA++E N LQR+R+LS+Q+ NG+NS D S+Q E+
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 ELTRISDTTSFGGQKLLNGSFGTQSFQVGAQANETIDVSLKSVGAADIGNYKAKALGTAA 181
E+ R+S+ T F G K+L+ QVGA ETI + L+ + +G G
Sbjct: 121 EIDRVSNQTQFNGVKVLSQDN-QMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 182 GDYGNGFSNTADNGVAGGTMILTKGSKSTTVTALAGDTALETASRINKAGTSVKATAQTV 241
G+ ++ N T + V + A T + +K + T
Sbjct: 180 ATVGD-LKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTT 238

Query: 242 VQADVTTAFATGTLTMNVNDGTTTSALDLSGVSDNEQLTKLINEFSGESGVSAKLEDGKV 301
A+ TA T + A+ + E T + +
Sbjct: 239 DDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGK 298

Query: 302 TITSSTGKDISFA---------------------------GGATGTGGLSVSNMSSGVLT 334
T+ G+ ++ G + + +
Sbjct: 299 VSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDL 358

Query: 335 GTDTIVGGTAKVVTAAGSVSLSSSSDFTLGGTGAGELSTDTGGNFTGVNLVDISTAAGAQ 394
+ V G +K+ + +++ D + G T +N +
Sbjct: 359 EANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTA 418

Query: 395 AALATIDGAISGIDSSRADLGAVQNRMNFTISNLNNIQSNVTDARSRIQDVDFASETAQL 454
LA+ID A+S +D+ R+ LGA+QNR + I+NL N +N+ ARSRI+D D+A+E + +
Sbjct: 419 NPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNM 478

Query: 455 TKQQILSQTSSAMLAQANQIPQTALSLL 482
+K QIL Q +++LAQANQ+PQ LSLL
Sbjct: 479 SKAQILQQAGTSVLAQANQVPQNVLSLL 506


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS17400  FLAGELLIN  214  1e-65  Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 214 bits (545), Expect = 1e-65
Identities = 154/508 (30%), Positives = 229/508 (45%), Gaps = 29/508 (5%)

Query: 2 AISVNTNVTSMKAQNQLNGANSRLSTSMERLSSGMRINSAKDDAAGLQISNRMSSQINGI 61
A +NTN S+ QN LN + S LS+++ERLSSG+RINSAKDDAAG I+NR +S I G+
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 GVAMRNANDGISIAQTAEGAMQESTNILQRMRDLSLQSANGSNSGDDRAAMQKELSALQS 121
A RNANDGISIAQT EGA+ E N LQR+R+LS+Q+ NG+NS D ++Q E+
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 ELTRISDTTSFGGQKLLDGSYGTQSFQVGAQANETIDVSLKSVGAADIGNYKAKALGTAA 181
E+ R+S+ T F G K+L QVGA ETI + L+ + +G G
Sbjct: 121 EIDRVSNQTQFNGVKVLSQD-NQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 182 GDYGNGFSNTADNGVAGGTMILTKGSKSTTVTALAGDTALETASRINKAGTSVKATAQTV 241
G+ ++ N T + V + A T + +K + T
Sbjct: 180 ATVGD-LKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTT 238

Query: 242 VQADVTTAFATGTLTMNVNDGTTTSALDLSGVSDNEQLTKLINEFSGESGVSAKLEDGKV 301
A+ TA T + A+ + E T + +
Sbjct: 239 DDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGK 298

Query: 302 TITSSTGKDISFA---------------------------GGATGTGGLSVSNMSSGVLT 334
T+ G+ ++ G + + +
Sbjct: 299 VSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDL 358

Query: 335 GTDTIVGGTAKVVTAAGSVSLSSSSDFTLGGTGAGELSTDTGGNFTGVNLVDISTAAGAQ 394
+ V G +K+ + +++ D + G T +N +
Sbjct: 359 EANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTA 418

Query: 395 AAIDIIDGAISGIDGSRSDLGAVQNRMSFTISNLSNIQSNVTDARSRIQDVDFASETAQL 454
+ ID A+S +D RS LGA+QNR I+NL N +N+ ARSRI+D D+A+E + +
Sbjct: 419 NPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNM 478

Query: 455 TKQQILSQTSSAMLAQANQIPQTALSLL 482
+K QIL Q +++LAQANQ+PQ LSLL
Sbjct: 479 SKAQILQQAGTSVLAQANQVPQNVLSLL 506


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS17405  FLAGELLIN  48  4e-08  Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 48.1 bits (114), Expect = 4e-08
Identities = 23/122 (18%), Positives = 66/122 (54%)

Query: 20 QTATSKILDQLSSGKKVNTAGDDPVASQGIDNLNQKNALVDQFIKNIDYATNHLQQTESQ 79
Q++ S +++LSSG ++N+A DD + + Q +N + + Q TE
Sbjct: 21 QSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNANDGISIAQTTEGA 80

Query: 80 LGQADTLVSSMKDMMLRGSNGSMTEAERQTIADDMRKSLDQLLTIANTKDESGNYLFAGN 139
L + + + ++++ ++ +NG+ ++++ ++I D++++ L+++ ++N +G + + +
Sbjct: 81 LNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSNQTQFNGVKVLSQD 140

Query: 140 KT 141

Sbjct: 141 NQ 142


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS17410  FLGHOOKAP1  215  2e-64  Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 215 bits (549), Expect = 2e-64
Identities = 124/455 (27%), Positives = 194/455 (42%), Gaps = 19/455 (4%)

Query: 4 DLLNIARTGVLASQSQLGVTSNNIANANTAGYHRQVATQSTLESQRLGSSFYGTGTYVND 63
L+N A +G+ A+Q+ L SNNI++ N AGY RQ + S + G G YV+
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSG 61

Query: 64 VKRIYNDYAARELRIGQTTLSAAEASYGKLSELDQLFSQIGKVVPQSLNDLFSGLNSVAD 123
V+R Y+ + +LR QT S A Y ++S++D + S + + D F+ L ++
Sbjct: 62 VQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLVS 121

Query: 124 LPADLGIRSSTLTNAQQVASSLNQMQSYLNGQLDQTNDQITGMTKRINEIGTELANLNLE 183
D R + + ++ + + YL Q Q N I +IN ++A+LN +
Sbjct: 122 NAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLNDQ 181

Query: 184 LMKSPNQDA-----QLLDKQDALVQELSQYAQVNVIPQENGAKSIMLGGSVMLVSGEIAM 238
+ + A LLD++D LV EL+Q V V Q+ G +I + LV G A
Sbjct: 182 ISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTAR 241

Query: 239 SMGTQAGNPFPKELQLNSSIGSQSVTVDPSKL--GGQLGAMFDYRDQTLIPAGHELDQLA 296
+ + P + G+ P KL G LG + +R Q L + L QLA
Sbjct: 242 QLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQLA 301

Query: 297 LGVADNFNKMQAQGIDLNGQVGANIFKDINDPMMSLGRVAGFSDNAGNATLGVTIDDTSL 356
L A+ FN G D NG G + F + V + N G+ +G T+ D S
Sbjct: 302 LAFAEAFNTQHKAGFDANGDAGEDFFA------IGKPAVLQNTKNKGDVAIGATVTDASA 355

Query: 357 LTGGAYELSF--TAPATYELRDTETGAITPLTLTGSTLSGGAGFSIDIKAGAMASGDRFA 414
+ Y++SF L T +TP G G A D F
Sbjct: 356 VLATDYKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFT----GTPAVNDSFT 411

Query: 415 IRPTAGASNGIEVVMKDPKGIAAASPKITADAANS 449
++P + A ++V++ D IA AS + D+ N
Sbjct: 412 LKPVSDAIVNMDVLITDEAKIAMASEEDAGDSDNR 446



Score = 88.1 bits (218), Expect = 2e-20
Identities = 38/104 (36%), Positives = 56/104 (53%)

Query: 535 AEGDNSNAVAMAKLSESKVMNNGKSTLADVFENTKLDIGSKTKAAEVRTGSAEAVYQQAY 594
+ DN N A+ L + G + D + + DIG+KT + + + V Q
Sbjct: 441 GDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGNKTATLKTSSATQGNVVTQLS 500

Query: 595 ARVESESGVNLDEEAANLMRFQQAYQASARIMTTAQQIFDTLLS 638
+ +S SGVNLDEE NL RFQQ Y A+A+++ TA IFD L++
Sbjct: 501 NQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS17415  FLGFLGJ  155  1e-46  Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 155 bits (393), Expect = 1e-46
Identities = 66/151 (43%), Positives = 94/151 (62%), Gaps = 1/151 (0%)

Query: 219 GSREEFLATLYPHAEKAAKALGTQPEVLLAQSALETGWGQKIVRGNNGAPSHNLFNIKAD 278
G + FLA L A+ A++ G ++LAQ+ALE+GWGQ+ +R NG PS+NLF +KA
Sbjct: 147 GDSKAFLAQLSLPAQLASQQSGVPHHLILAQAALESGWGQRQIRRENGEPSYNLFGVKAS 206

Query: 279 RRWQGDKANVSTLEFEHGVAVQQKADFRVYSDFEHSFNDFVSFIANGDRYQDAKKVAASP 338
W+G ++T E+E+G A + KA FRVYS + + +D+V + RY A AAS
Sbjct: 207 GNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYLEALSDYVGLLTRNPRYA-AVTTAASA 265

Query: 339 TQFIRALQDAGYATDPRYAEKVIKVMQSISE 369
Q +ALQDAGYATDP YA K+ ++Q +
Sbjct: 266 EQGAQALQDAGYATDPHYARKLTNMIQQMKS 296



Score = 88.2 bits (218), Expect = 1e-21
Identities = 39/91 (42%), Positives = 61/91 (67%), Gaps = 3/91 (3%)

Query: 12 DLGGLDSLRAQAQKDEKGALKKVAQQFEGIFVQMLMKSMRDANAVFQSDSPLNSQYTQFY 71
D L+ L+A+A +D ++ VA+Q EG+FVQM++KSMRDA D +S++T+ Y
Sbjct: 14 DAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALP---KDGLFSSEHTRLY 70

Query: 72 EQMRDQQLSVDLSDKGVLGLADMMVQQLSPE 102
M DQQ++ ++ LGLA+MMV+Q++PE
Sbjct: 71 TSMYDQQIAQQMTAGKGLGLAEMMVKQMTPE 101


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS17420  FLGPRINGFLGI  373  e-131  Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 373 bits (958), Expect = e-131
Identities = 159/367 (43%), Positives = 221/367 (60%), Gaps = 14/367 (3%)

Query: 5 LMLTIAVLVFSLPSQAE--RIKDIANVQGVRSNQLIGYGLVVGLPGTGEKTS---YTEQT 59
L+ + + + P+QA+ RIKDIA++Q R NQLIGYGLVVGL GTG+ +TEQ+
Sbjct: 11 LVFSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQS 70

Query: 60 FMTMLKNFGINLPDNVKPKIKNVAVVAVHADMPAFIKPGQDLDVTVSSLGEAKSLRGGTL 119
ML+N GI + KN+A V V A++P F PG +DVTVSSLG+A SLRGG L
Sbjct: 71 MRAMLQNLGITTQGG-QSNAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNL 129

Query: 120 LQTFLKGVDGNVYAIAQGSLVVSGFSADGLDGSKVIQNTPTVGRIPNGAIVERSVASPFS 179
+ T L G DG +YA+AQG+L+V+GFSA G D + + Q T R+PNGAI+ER + S F
Sbjct: 130 IMTSLSGADGQIYAVAQGALIVNGFSAQG-DAATLTQGVTTSARVPNGAIIERELPSKFK 188

Query: 180 TGDYLTFNLRRSDFSTAQRMADAINDL----LGPDMARPLDATSVQVTAPRDVSQRVSFL 235
L LR DFSTA R+AD +N G +A P D+ + V PR V+ +
Sbjct: 189 DSVNLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPR-VADLTRLM 247

Query: 236 ATLENLDVVPAEESAKVIVNSRTGTIVVGQNVRLLPAAITHGGMTVTIAEATQVSQPNAL 295
A +ENL V + AKV++N RTGTIV+G +VR+ A+++G +TV + E+ QV QP
Sbjct: 248 AEIENLTVET-DTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQPAPF 306

Query: 296 ANGQTVVTANTTIGVEESDRRMFMFNPGTTLDELVRAVNLVGAAPSDVLAILEALKVAGA 355
+ GQT V T I + ++ + G L LV +N +G ++AIL+ +K AGA
Sbjct: 307 SRGQTAVQPQTDIMAMQEGSKVAIVE-GPDLRTLVAGLNSIGLKADGIIAILQGIKSAGA 365

Query: 356 LHGELII 362
L EL++
Sbjct: 366 LQAELVL 372


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS17425  FLGLRINGFLGH  142  3e-44  Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 142 bits (358), Expect = 3e-44
Identities = 75/227 (33%), Positives = 113/227 (49%), Gaps = 18/227 (7%)

Query: 4 YLVVAVALL-LAACSSTQKKPLADDPFYAPVYPEAPPTKIAATGSIYQDSQ-----ASSL 57
Y + ++ +L L C+ PL A P P A GSI+Q +Q L
Sbjct: 9 YAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTP---VANGSIFQSAQPINYGYQPL 65

Query: 58 YSDIRAHKVGDIITIVLKESTQAKKSAGNQIKKGSDM-----TLDPIFAGGSNVSIGGVP 112
+ D R +GD +TIVL+E+ A KS+ + T+ G G
Sbjct: 66 FEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGL----FGNAR 121

Query: 113 IDLRYKDSMNTKRESDADQSNSLDGSISANVMQVLNNGSLVIRGEKWISINNGDEFIRVT 172
D+ + A+ SN+ G+++ V QVL NG+L + GEK I+IN G EFIR +
Sbjct: 122 ADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRFS 181

Query: 173 GLVRSQDIKPDNTIDSTRMANARIQYSGTGTFADAQKVGWLSQFFMS 219
G+V + I NT+ ST++A+ARI+Y G G +AQ +GWL +FF++
Sbjct: 182 GVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLN 228


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS17430  FLGHOOKAP1  43  7e-07  Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 43.0 bits (101), Expect = 7e-07
Identities = 18/119 (15%), Positives = 40/119 (33%), Gaps = 4/119 (3%)

Query: 145 DNATSITVSAEGEVSVKTPGTAENQVVGQLTMTDFINPSGLDPMGQNLYTETG---ASGT 201
+ I +++E + + + Q + +L ++ G A+
Sbjct: 427 TDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGNKTATLK 486

Query: 202 PIQGTASLDGMGAIRQGALETSNVNVTEELVNLIESQRIYEMNSKVISAVDQMLSYVTQ 260
T + + S VN+ EE NL Q+ Y N++V+ + + +
Sbjct: 487 TSSATQGNV-VTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544



Score = 35.7 bits (82), Expect = 2e-04
Identities = 9/36 (25%), Positives = 20/36 (55%)

Query: 5 LWISKTGLDAQQTDIAVISNNVANASTVGYKKSRAV 40
+ + +GL+A Q + SNN+++ + GY + +
Sbjct: 4 INNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTI 39


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS17440  FLGHOOKAP1  40  2e-05  Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 39.9 bits (93), Expect = 2e-05
Identities = 15/35 (42%), Positives = 22/35 (62%)

Query: 2 SFNIALSGISAAQKDLNTTANNIANANTIGFKESR 36
N A+SG++AAQ LNT +NNI++ N G+
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQT 37



Score = 37.6 bits (87), Expect = 9e-05
Identities = 12/49 (24%), Positives = 25/49 (51%)

Query: 405 SISSSALEQSNIDLTTELVDLISAQRNFQANSRTLEVNNTLQQTVLQIR 453
+S+ S ++L E +L Q+ + AN++ L+ N + ++ IR
Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS17450  FLGHOOKAP1  33  3e-04  Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 33.0 bits (75), Expect = 3e-04
Identities = 9/38 (23%), Positives = 18/38 (47%)

Query: 99 NVNVMEEMADMISASRSYQMNVQVAEAAKSMLQQTLGM 136
VN+ EE ++ + Y N QV + A ++ + +
Sbjct: 508 GVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545



Score = 29.9 bits (67), Expect = 0.003
Identities = 16/67 (23%), Positives = 29/67 (43%), Gaps = 6/67 (8%)

Query: 5 SIFDVAGSGMSAQSVRLNTTASNIANADSVSSSVDKTYRSRHPIFEAEMAKAQSQQQASQ 64
S+ + A SG++A LNT ++NI++ + Y + I + +
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYN------VAGYTRQTTIMAQANSTLGAGGWVGN 55

Query: 65 GVAVKGI 71
GV V G+
Sbjct: 56 GVYVSGV 62


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS17465  HTHFIS  61  1e-12  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.4 bits (149), Expect = 1e-12
Identities = 23/128 (17%), Positives = 52/128 (40%), Gaps = 12/128 (9%)

Query: 180 HIMVIDDSAVARKQIIRSLESLNLQIDTAKDGREALDKLKEIAKEMDNVADEIPLIISDI 239
I+V DD A R + ++L + + + A + L+++D+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA---------AGDGDLVVTDV 55

Query: 240 EMPEMDGYTLTAEIRDDPKLKHIKVVLHTSLSGVFNQAMVQKVGANDFIAK-FNPDELAA 298
MP+ + + L I+ + V++ ++ + + GA D++ K F+ EL
Sbjct: 56 VMPDENAFDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIG 113

Query: 299 AVNKHLSL 306
+ + L+
Sbjct: 114 IIGRALAE 121


25  AYJ58_RS17495  AYJ58_RS17525  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
AYJ58_RS17495-1539737486009671646.728016TetR/AcrR family transcriptional regulator
AYJ58_RS17510-1539737486009671649.268070efflux RND transporter periplasmic adaptor
AYJ58_RS17515-1539737486009671648.956380AcrB/AcrD/AcrF family protein
AYJ58_RS17520-1539737486009671648.989265DUF465 domain-containing protein
AYJ58_RS17525-1539737486009671449.017311methyl-accepting chemotaxis protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AYJ58_RS17520  HTHTETR  73  3e-18  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 73.1 bits (179), Expect = 3e-18
Identities = 43/200 (21%), Positives = 74/200 (37%), Gaps = 11/200 (5%)

Query: 13 RSEQKRQQVLAAAIDLFCRQGFPHTSMDEVAKLAGVSKQTVYSHYGSKDELFVAAIE--S 70
+++ RQ +L A+ LF +QG TS+ E+AK AGV++ +Y H+ K +LF E
Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSE 67

Query: 71 KCVGHNLNDDLLSDPTKPEFALTQFALQFGEMIVSPEAITVFKACVAQSESHP---EVSQ 127
+G + P P L + + E V+ E + + V Q
Sbjct: 68 SNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127

Query: 128 LFFDAGPKHIVGLLADYLVKVEALGQYRFGNAHHSAVRLCLMLFGELKLKLELGLAAD-- 185
+ L A R +++ G + +E L A
Sbjct: 128 QAQRNLCLESYDRIEQTLKHCIEAKMLP---ADLMTRRAAIIMRGYISGLMENWLFAPQS 184

Query: 186 -ELANDRNDYILGCADMFLR 204
+L + DY+ +M+L
Sbjct: 185 FDLKKEARDYVAILLEMYLL 204


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
AYJ58_RS17525  RTXTOXIND  46     1e-07    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 46.4 bits (110), Expect = 1e-07
Identities = 26/144 (18%), Positives = 52/144 (36%), Gaps = 12/144 (8%)

Query: 101 RLLEAERQ--EIQASLAQTQADVDLATSTL---KRNQELKKSGYVSEQLLDENRSQLNSL 155
+LE E + E L ++ ++ S + K +L + +E L ++ N +
Sbjct: 253 AVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDN-I 311

Query: 156 AAAKNRLLASQHANQLKLDKSILVAPFDGTISQ-RLHNLGEVVAAGSPIFSLVGNINP-E 213
L N+ + S++ AP + Q ++H G VV + +V + E
Sbjct: 312 GLLTLEL----AKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLE 367

Query: 214 AYIGVPVALADQFHPKQQVQVSVQ 237
V + Q + V+
Sbjct: 368 VTALVQNKDIGFINVGQNAIIKVE 391



Score = 31.3 bits (71), Expect = 0.006
Identities = 25/108 (23%), Positives = 44/108 (40%), Gaps = 9/108 (8%)

Query: 76 GKLNELQADSGIKVKQDQILAILDTRLLEAERQEIQASLAQTQADVDLATSTLKRNQELK 135
+ E+ G V++ +L L EA+ + Q+SL Q + + L R+ EL
Sbjct: 105 SIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQ-TRYQILSRSIELN 163

Query: 136 KSGYVS-------EQLLDENRSQLNSLAAAKNRLLASQHAN-QLKLDK 175
K + + + +E +L SL + +Q +L LDK
Sbjct: 164 KLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDK 211


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS17530  ACRIFLAVINRP  382    e-118    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 382 bits (982), Expect = e-118
Identities = 207/1048 (19%), Positives = 422/1048 (40%), Gaps = 56/1048 (5%)

Query: 1 MIKAFVENGRLVSLVIALLLVAGFGAISSLPRTEDPHITNRFASVITPYPGASAERVEAL 60
M F+ ++ +L++AG AI LP + P I SV YPGA A+ V+
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTEVLENQLRRLEEIKLIQSTS-RPGISVIQLELKDTVMETDPVWSR--ARDLLADAKVN 117
VT+V+E + ++ + + STS G I L + TDP ++ ++ L A
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQS---GTDPDIAQVQVQNKLQLATPL 117

Query: 118 LPEGIPSPTL---DDQIGYAYTAILSLVWNSNTPVRVDMLNRYAKE-LQSRLRLLSGTDF 173
LP+ + + Y A T D ++ Y ++ L L+G
Sbjct: 118 LPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQ---DDISDYVASNVKDTLSRLNGVGD 174

Query: 174 VKLYGAPTEEMLVQLDGNKMSQLELTPAAIAHILSGADSKIAAGEINN------HEFRAL 227
V+L+GA M + LD + +++ +LTP + + L + +IAAG++ + A
Sbjct: 175 VQLFGAQ-YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNAS 233

Query: 228 VEVSGELDSLTRIRQVPLKIDSQGQVIRLGDIATVTRQAKTPADSIALVDGEQGVFVAAR 287
+ + +V L+++S G V+RL D+A V + + IA ++G+ + +
Sbjct: 234 IIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENY-NVIARINGKPAAGLGIK 292

Query: 288 MLNNSRVDLWQAQVNRIVDELNREMPANIQIQWLFEQNSYTSDRLGGLVVNLLQGFVIIL 347
+ + + + EL P +++ + ++ + + +V L + +++
Sbjct: 293 LATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVF 352

Query: 348 LVLLLTLG-LRNAIIVAISLPLTALFTLACMKYIGLPIHQMSVTGLVVALGIMVDNAIVI 406
LV+ L L +R +I I++P+ L T A + G I+ +++ G+V+A+G++VD+AIV+
Sbjct: 353 LVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVV 412

Query: 407 VDAISQRRQ-QGMSRLAAVSETIHHLWLPLAGSTITTILAFAPIVLMPGAAGEFVGGIAM 465
V+ + + + A +++ + L G + F P+ G+ G ++
Sbjct: 413 VENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSI 472

Query: 466 SVMFALLGSYLISHTIIAGLAGRF-------SHEGKHDVWYQHGINLPLVSQYFQASLRI 518
+++ A+ S L++ + L HE K + ++ S+
Sbjct: 473 TIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGK 532

Query: 519 ALKRPILSALLIGITPVLGFYASGKMTEQFFPPSDRDMFQIEVYLAPHVSLDNSLNQVQL 578
L L+ + ++ F P D+ +F + L + + + +
Sbjct: 533 ILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQ 592

Query: 579 MDKKL--HAVNGITQVDWVVGGNTPSFYYNLTQRQQGATNYAQAMVK-----VTDFERAN 631
+ + + V V G + Q A +K D A
Sbjct: 593 VTDYYLKNEKANVESVFTVNGFSFSG--------QAQNAGMAFVSLKPWEERNGDENSAE 644

Query: 632 ELIPELQQQFDS---AFPEAQVLVRKLEQGPPFNAPVELM-IFGPNLDTLRTLGDEVRNI 687
+I + + F + +E G EL+ G D L +++ +
Sbjct: 645 AVIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGM 704

Query: 688 LAQTP-DVLHTRATLSAGAPKLWLQVNEDASLMSGLSLTDIAKQIQMSTTGVIGGSVLEQ 746
AQ P ++ R + L+V+++ + G+SL+DI + I + G +++
Sbjct: 705 AAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDR 764

Query: 747 TESLPVRVRLGDGSREQVTRLSEIQLVTPSGESVALSALAHNEVQVSRGAIPRRNGQRVN 806
+ V+ R + ++ + + +GE V SA + + R NG
Sbjct: 765 GRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSM 824

Query: 807 TIEAYIVSGVLPAQVLNDVKDKVAQLALPSGYRIEIGGESAKRNEAVGNLLSNVMLVVTL 866
I+ G + +++ ++ LP+G + G S + + + V + +
Sbjct: 825 EIQGEAAPGTSSGDAMALMENLASK--LPAGIGYDWTGMSYQERLSGNQAPALVAISFVV 882

Query: 867 LLATVVLSFNSFRLTAIILLSAIQSAGLGLLAVYVFGYPFGFPVIIGLLGLMGLAINAAI 926
+ + + S+ + ++L LLA +F ++GLL +GL+ AI
Sbjct: 883 VFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAI 942

Query: 927 VILAELEDMPSAR-LGDMETIVSTVSSCGRHISSTTITTVGGFIPLII---AGGGFWPPF 982
+I+ +D+ G +E + V R I T++ + G +PL I AG G
Sbjct: 943 LIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAV 1002

Query: 983 AIAIAGGTLLTTLLSLVWVPTMYLLLMK 1010
I + GG + TLL++ +VP ++++ +
Sbjct: 1003 GIGVMGGMVSATLLAIFFVPVFFVVIRR 1030


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
AYJ58_RS17540  FLAGELLIN  30     0.040    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 29.6 bits (66), Expect = 0.040
Identities = 19/87 (21%), Positives = 34/87 (39%), Gaps = 4/87 (4%)

Query: 426 QIATAIEEMSTSIRDVANHAQDGAVQSQQVDAAAKEGQGQQTKVVQDLLKLSEQLSNSHQ 485
IA + + +A DG +Q + A E +Q + +LS Q +N
Sbjct: 48 AIANRFTSNIKGLTQASRNANDGISIAQTTEGALNE----INNNLQRVRELSVQATNGTN 103

Query: 486 SVEKVSHESEAISKVTEVINSIAEQTN 512
S + + I + E I+ ++ QT
Sbjct: 104 SDSDLKSIQDEIQQRLEEIDRVSNQTQ 130


26  AYJ58_RS17630  AYJ58_RS17660  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias            CDNBias  %GCBias    Product
AYJ58_RS17630  -153973748600967  21       45.122681  phosphate regulon transcriptional regulatory
AYJ58_RS17635  -153973748600967  16       45.614828  porin
AYJ58_RS17640  -153973748600967  8        45.956305  recombination-associated protein RdgC
AYJ58_RS17650  -153973748600967  11       45.011999  M61 family peptidase
AYJ58_RS17660  -153973748600967  10       45.509499  tetratricopeptide repeat protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
AYJ58_RS17645  HTHFIS  91     1e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 91.4 bits (227), Expect = 1e-23
Identities = 34/130 (26%), Positives = 64/130 (49%), Gaps = 4/130 (3%)

Query: 3 ARILIVEDELAIREMLTFVMEQHGFTTSAAEDFDSAIALLKEPYPDLILLDWMFPGGSGI 62
A IL+ +D+ AIR +L + + G+ + + + DL++ D + P +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 63 QLAKRLKQDEFTRQIPIIMLTARGEEEDKVKGLEVGADDYITKPFSPKELVARIKAVL-- 120
L R+K+ +P+++++A+ +K E GA DY+ KPF EL+ I L
Sbjct: 64 DLLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 121 RRSAPTRLEE 130
+ P++LE+
Sbjct: 122 PKRRPSKLED 131


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS17650  ECOLNEIPORIN  84     2e-20    E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 83.7 bits (207), Expect = 2e-20
Identities = 77/335 (22%), Positives = 128/335 (38%), Gaps = 33/335 (9%)

Query: 6 KTLLASALASATLASAYAAEPLTVYGKLNV---TAQSNDEKGDAT------TTIQSNASR 56
K+L+A LA+ +A A +T+YG + T++S G T I S+
Sbjct: 3 KSLIALTLAALPVA---AMADVTLYGTIKAGVETSRSVAHNGAQAASVETGTGIVDLGSK 59

Query: 57 FGVKGNFELSSSLEAFYTVEYEVDTGAATSDNFKARNQFVGLKGAFGSFSVGRNDTLLKI 116
G KG +L + L+A + VE + A T + R F+GLKG FG VGR +++LK
Sbjct: 60 IGFKGQEDLGNGLKAIWQVEQKASI-AGTDSGWGNRQSFIGLKGGFGKLRVGRLNSVLK- 117

Query: 117 SQGNVDQFNDLSGDL--KSLFKGDNRLGQTATYLSPSISGFVFGATYAAEGDADQQGQDG 174
G+++ ++ S L + + + RL + Y SP +G YA +A + +
Sbjct: 118 DTGDINPWDSKSDYLGVNKIAEPEARL-ISVRYDSPEFAGLSGSVQYALNDNAGRHNSES 176

Query: 175 FSLAAMYGDAKLKKSPIYAAIAYDSDVKGYEILRASVQGKIANLTLGGMYQQQEETYKNA 234
+ Y + A + + I + + ++ +Y ++A
Sbjct: 177 YHAGFNYKNGGFFVQYGGAYKRHHQVQENVNIEKYQIHRLVSGYDNDALYASVAVQQQDA 236

Query: 235 LPVTTD----SVNGYLFSAAYDIDAVTLKAQY-----QDMEDKGDS-----WSVGADYAL 280
V + S + AY VT + Y + + VGA+Y
Sbjct: 237 KLVEENYSHNSQTEVAATLAYRFGNVTPRVSYAHGFKGSFDATNYNNDYDQVVVGAEYDF 296

Query: 281 AKPTKVFAFYT--NRSLEASADDDKYIGVGLEHKF 313
+K T S GVGL HKF
Sbjct: 297 SKRTSALVSAGWLQEGKGESKFVSTAGGVGLRHKF 331


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits  Score  E-value  Comments
AYJ58_RS17655  SECA  31     0.006    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 31.0 bits (70), Expect = 0.006
Identities = 11/41 (26%), Positives = 24/41 (58%), Gaps = 1/41 (2%)

Query: 81 EALEEKVALIEDEENRKMAKKEKDALKD-EIITSLLPRAFS 120
A+E ++ + DEE + + + L+ E++ +L+P AF+
Sbjct: 29 NAMEPEMEKLSDEELKGKTAEFRARLEKGEVLENLIPEAFA 69


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS17665  BCTERIALGSPD  32     0.012    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D

signature.
Length = 660

Score = 31.8 bits (72), Expect = 0.012
Identities = 25/119 (21%), Positives = 49/119 (41%), Gaps = 7/119 (5%)

Query: 4 LTVILFTLLLSLPFSVQSRDIEADEVELRESPQQMYDALNQSISFPLSFQNR---DQFER 60
LT+++F LL P + + +++E + LN+++ S + ++
Sbjct: 12 LTLLIFAALLFRPAAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDM 71

Query: 61 AAQEQGYSPTEFEQIL--YLLTRLNMEPNVKTKVGFQDAKSLIAVLSTAAQSPYELAMV 117
+EQ Y F +L Y +NM V V +DAK+ +++ A +V
Sbjct: 72 LNEEQYY--QFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVV 128


27  AYJ58_RS18125  AYJ58_RS18175  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias            CDNBias  %GCBias    Product
AYJ58_RS18125  -153973748600967  9        47.266570  bifunctional diguanylate
AYJ58_RS18130  -153973748600967  9        47.041042  ATP-dependent protease
AYJ58_RS18135  -153973748600967  11       47.699317  alkene reductase
AYJ58_RS18140  -153973748600967  11       47.761708  TetR/AcrR family transcriptional regulator
AYJ58_RS18145  -153973748600967  11       47.600000  TIGR02922 family protein
AYJ58_RS18150  -153973748600967  11       48.182442  methyl-accepting chemotaxis protein
AYJ58_RS18155  -153973748600967  10       48.290189  YebC/PmpR family DNA-binding regulatory protein
AYJ58_RS18160  -153973748600967  13       48.057868  carotenoid oxygenase
AYJ58_RS18165  -153973748600967  14       48.183978  DUF2141 domain-containing protein
AYJ58_RS18170  -153973748600967  9        48.332167  two-component sensor histidine kinase
AYJ58_RS18175  -153973748600967  8        47.964826  DNA-binding response regulator
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
AYJ58_RS18130  PF05307  34     0.001    Bundlin
		>PF05307#Bundlin

Length = 193

Score = 34.0 bits (77), Expect = 0.001
Identities = 17/52 (32%), Positives = 29/52 (55%)

Query: 74 ATLSAATSSIATQSKAVPSNTTSTLSSNTFETSVDAQLAQEQLISPNESISV 125
A +S AT ++ T +K N + S+N+F TS + A+ I+P E+ +V
Sbjct: 127 ACVSLATLNLGTSAKGYGVNINNPASTNSFNTSATSSNAKSSAITPAEAATV 178


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
AYJ58_RS18135  HTHFIS  31     0.027    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.6 bits (69), Expect = 0.027
Identities = 12/39 (30%), Positives = 17/39 (43%), Gaps = 6/39 (15%)

Query: 339 LFGYVENATFRGTVFTDFSLIRPGSLHKANGGVLLMDAV 377
LFG+ + A FT G +A GG L +D +
Sbjct: 208 LFGHEKGA------FTGAQTRSTGRFEQAEGGTLFLDEI 240


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
AYJ58_RS18145  HTHTETR  58     7e-13    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 58.5 bits (141), Expect = 7e-13
Identities = 27/167 (16%), Positives = 60/167 (35%), Gaps = 3/167 (1%)

Query: 2 RNAEFDREQVLRGAMAAFMHKGYTKTSMQDLTQATGLHPGSIYCAFTNKRGLLIAAIEQY 61
+ A+ R+ +L A+ F +G + TS+ ++ +A G+ G+IY F +K L E
Sbjct: 7 QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELS 66

Query: 62 QLDRNVQFNSLFAN-SENVQTNLKNYLNHIVAECLSCDSAQACLLTKALNEVAEQDVEIR 120
+ + A + + L+ L H++ ++ + + + ++ +
Sbjct: 67 ESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVV 126

Query: 121 D-IINQYLQSWQQALTQQFTRAAQQGLLQGHRSDEQRAQYFMMGIYG 166
+ Q + +L +RA M G
Sbjct: 127 QQAQRNLCLESYDRIEQTLKHCIEAKMLPADL-MTRRAAIIMRGYIS 172


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
AYJ58_RS18155  FLAGELLIN  31     0.013    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 30.8 bits (69), Expect = 0.013
Identities = 29/256 (11%), Positives = 71/256 (27%), Gaps = 14/256 (5%)

Query: 74 AHDISVQTSKIAIGSAEVSHFIDLLNKSIESNGEHASAIAVAAGQLSHTTAQLGDNAADI 133
K+ + +A D + + + + A G
Sbjct: 217 DTTAPTVPDKVYVNAANGQLTTDDAENNTAVDLFKTTKSTAGTAE---AKAIAGAIKGGK 273

Query: 134 LGQAQEAERVSVQGRSQAQKG-----VAAIRSLSTDIDTAAEQVQALKSRAEEIQKITEV 188
G + + V+ ++ I + A A A +Q V
Sbjct: 274 EGDTFDYKGVTFTIDTKTGNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNV 333

Query: 189 INSVAEQTNLLALNAAIEAARAGEQGRGFAVVADEVRSLAGKTAGATQDIGKMLLEIRSE 248
SV E+A+ + AV + ++ G A K+ L ++
Sbjct: 334 YTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTM 393

Query: 249 TDKTSGLMERVVTQTADVVT------AMGELDAHFTEISASVTQSAHALGDMEDSLKQYN 302
+ + + +D+ +++ A + + ++
Sbjct: 394 FIDKTASGVSTLINEDAAAAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLG 453

Query: 303 NTTNDISRSVTQIRDS 318
NT +++ + ++I D+
Sbjct: 454 NTVTNLNSARSRIEDA 469


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
AYJ58_RS18175  PF06580  37     1e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 36.8 bits (85), Expect = 1e-04
Identities = 20/101 (19%), Positives = 44/101 (43%), Gaps = 15/101 (14%)

Query: 291 LVLQEGISNAVRHG-----KANQLQLSMEDSQNALVLQLSDNGMGLTRGTARNVSAKNGT 345
+++Q + N ++HG + ++ L + L++ + G + KN
Sbjct: 258 MLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSL---------ALKNTK 308

Query: 346 ELNGTGLGGMQERLQP-FNGKVQLRANDSAPGCQLTLTLPA 385
E GTGL ++ERLQ + + Q++ ++ + +P
Sbjct: 309 ESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIPG 349


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
AYJ58_RS18180  HTHFIS  74     2e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 73.7 bits (181), Expect = 2e-17
Identities = 21/112 (18%), Positives = 54/112 (48%), Gaps = 2/112 (1%)

Query: 12 LVEDQQLVRQGIASLLAISDNIRVLWQAEDGQDALKQLASNPVDVLLSDIRMPNLDGIAM 71
+ +D +R + L+ + V + + +A+ D++++D+ MP+ + +
Sbjct: 8 VADDDAAIRTVLNQALSRAG-YDVR-ITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 72 LKQIRQSANRLPVIMLTTFDDSELFLNSLQAGANGFLLKDVSLDKLLHAIET 123
L +I+++ LPV++++ + + + + GA +L K L +L+ I
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


28  AYJ58_RS18455  AYJ58_RS18490  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias            CDNBias  %GCBias    Product
AYJ58_RS18455  -153973748600967  16       47.666946  two-component sensor histidine kinase BarA
AYJ58_RS18460  -153973748600967  13       47.083076  DUF416 domain-containing protein
AYJ58_RS18465  -153973748600967  12       48.348694  DUF3319 domain-containing protein
AYJ58_RS18470  -153973748600967  13       48.503439  LysR family transcriptional regulator
AYJ58_RS18475  -153973748600967  13       48.255697  AEC family transporter
AYJ58_RS18480  -153973748600967  11       48.490196  DNA repair protein RecN
AYJ58_RS18485  -153973748600967  12       49.219131  phosphatidylglycerophosphatase A
AYJ58_RS18490  -153973748600967  11       49.873578  thiamine-phosphate kinase
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
AYJ58_RS18455  HTHFIS  63     2e-12    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 63.3 bits (154), Expect = 2e-12
Identities = 28/124 (22%), Positives = 48/124 (38%), Gaps = 2/124 (1%)

Query: 678 QSLTVLAVDDNFANLKLIDTLLSELVTTVIAVNSGDEAVKQAKTRTFDLIFMDIQMPGTD 737
T+L DD+ A +++ LS V ++ + DL+ D+ MP +
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 738 GISATKQIRQGSMNRNTPIIAVTAHAIAEERELILGSGMDGYLPKPIDEAALKDVIHRWI 797
+I+ + P++ ++A G YLPKP D L +I R +
Sbjct: 62 AFDLLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119

Query: 798 TRPK 801
PK
Sbjct: 120 AEPK 123


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
AYJ58_RS18460  BACINVASINB  28     0.025    Salmonella/Shigella invasin protein B signature.
		>BACINVASINB#Salmonella/Shigella invasin protein B signature.

Length = 593

Score = 28.2 bits (62), Expect = 0.025
Identities = 17/41 (41%), Positives = 25/41 (60%)

Query: 142 EALDDFVFAHEVMEEEKELQNSLLEIIEENPKITAELVKGL 182
EAL DF+ A M++ ++ +EI EN K+TAEL K +
Sbjct: 533 EALADFMLARFAMDQIQQWLKQSVEIFGENQKVTAELQKAM 573


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
AYJ58_RS18480  GPOSANCHOR  38     1e-04    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 37.7 bits (87), Expect = 1e-04
Identities = 56/328 (17%), Positives = 105/328 (32%), Gaps = 28/328 (8%)

Query: 59 ANKTEVSAR--FSLDDIPLAKRWLEDNDLELDDECILRRTIGSDGRSRAYINGNPVPLTQ 116
T + L K + E+++ + + ++A +
Sbjct: 34 VVNTNEVSAVATRSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKD-------H 86

Query: 117 LKLLGQLLIGIHGQHAHHAMLKSEHQLTLLDSYANHRLLIDTVAASFQRCKQIEADLKQL 176
L + L + + SE + + A L + + A +K L
Sbjct: 87 NDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTL 146

Query: 177 EASQHERIARKQLVQYQVEELDEFDLKVDEFDEIEQEHKRLANGTELIDTCQASLDILTE 236
EA + ARK ++ +E F + K L ++ QA L E
Sbjct: 147 EAEKAALAARKADLEKALEGAMNFST------ADSAKIKTLEAEKAALEARQAEL----E 196

Query: 237 GEENNIESLLNRVVSLAEDLQSYDPALSNINTMLNDALIQVQESAGELQHYLSKLELDPA 296
+ + + L++ AL+ L AL + + LE A
Sbjct: 197 KALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLE---A 253

Query: 297 HFAYLEERLSKAMQLARKHHVSPNKLAEHHLALKAELSTLDSDESKLEEIQLQVDASRAA 356
A LE R ++ + + L+AE + L+++++ LE ++A+R
Sbjct: 254 EKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQ- 312

Query: 357 YLSNAQKLSQSRARYAK---ELDKLVTQ 381
S + L SR + E KL Q
Sbjct: 313 --SLRRDLDASREAKKQLEAEHQKLEEQ 338



Score = 36.6 bits (84), Expect = 2e-04
Identities = 40/217 (18%), Positives = 72/217 (33%), Gaps = 14/217 (6%)

Query: 167 KQIEADLKQLEASQHERIARKQLVQYQVEELD-EFDLKVDEFDEIEQEHKRLANGTELID 225
+ A A K ++ + EL+ + ++ + K L +
Sbjct: 165 EGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALA 224

Query: 226 TCQASLDILTEGEENNIESLLNRVVSLAEDLQSYDPALSNINTMLNDALIQVQESAGELQ 285
+A L+ EG N + ++ +L + + L L AL +
Sbjct: 225 ARKADLEKALEGAMNFSTADSAKIKTLEAEKAA----LEARQAELEKALEGAMNFSTADS 280

Query: 286 HYLSKLELDPAHFAYLEERLSKAMQLARKHHVSPNKLAEHHLALKAELSTLDSDESKLEE 345
+ LE A A LE + ++ + + L A + L+++ KLEE
Sbjct: 281 AKIKTLE---AEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEE 337

Query: 346 IQLQVDASRAAYLSNAQKLSQSRARYAK---ELDKLV 379
Q S A+ S + L SR + E KL
Sbjct: 338 ---QNKISEASRQSLRRDLDASREAKKQLEAEHQKLE 371


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS18490  TYPE3IMQPROT  28     0.014    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein

family signature.
Length = 86

Score = 27.8 bits (62), Expect = 0.014
Identities = 9/39 (23%), Positives = 16/39 (41%)

Query: 71 LSDLAAMGAEPAWMTLALTLPAVDETWLSGFSEGLFEAA 109
+ DL G + ++ L L+ + G GLF+
Sbjct: 1 MDDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTV 39


29  AYJ58_RS19330  AYJ58_RS19395  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias            CDNBias  %GCBias    Product
AYJ58_RS19330  -153973748600967  28       46.917699  translation initiation factor IF-2
AYJ58_RS19335  -153973748600967  27       46.949686  transcription termination/antitermination
AYJ58_RS19340  -153973748600967  30       46.737899  ribosome maturation factor RimP
AYJ58_RS19345  -153973748600967  26       46.981293  **preprotein translocase subunit SecG
AYJ58_RS19350  -153973748600967  17       46.845817  triose-phosphate isomerase
AYJ58_RS19355  -153973748600967  16       47.038205  phosphoglucosamine mutase
AYJ58_RS19370  -153973748600967  17       47.380952  dihydropteroate synthase
AYJ58_RS19375  -153973748600967  17       47.262149  ATP-dependent zinc metalloprotease FtsH
AYJ58_RS19380  -153973748600967  19       47.149564  23S rRNA (uridine(2552)-2'-O)-methyltransferase
AYJ58_RS19385  -153973748600967  16       47.399472  ribosome assembly RNA-binding protein YhbY
AYJ58_RS19390  -153973748600967  17       47.204869  protein translocase subunit SecF
AYJ58_RS19395  -153973748600967  18       46.817895  protein translocase subunit SecD
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
AYJ58_RS19345  TCRTETOQM  72     5e-15    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 72.2 bits (177), Expect = 5e-15
Identities = 51/202 (25%), Positives = 78/202 (38%), Gaps = 30/202 (14%)

Query: 387 IMGHVDHGKTSLLDYIRRAKVAAGEAG------------------GITQHIGAYHVETEN 428
++ HVD GKT+L + + A E G GIT G + EN
Sbjct: 8 VLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQWEN 67

Query: 429 GMITFLDTPGHAAFTAMRARGAKATDIVVLVVAADDGVMPQTIEAIQHAKAGNVPLIVAV 488
+ +DTPGH F A R D +L+++A DGV QT + +P I +
Sbjct: 68 TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPTIFFI 127

Query: 489 NKMDKPEADIDRV----KSELSQHGVMS-------EDWGGDNMFAFVSAKTGEGVDELLE 537
NK+D+ D+ V K +LS V+ + + EG D+LLE
Sbjct: 128 NKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEGNDDLLE 187

Query: 538 GILLQAEVLELKAVRDGMAAGV 559
+ + LE + +
Sbjct: 188 K-YMSGKSLEALELEQEESIRF 208


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
AYJ58_RS19370  SECGEXPORT  118    4e-38    Protein-export SecG membrane protein signature.
		>SECGEXPORT#Protein-export SecG membrane protein signature.

Length = 110

Score = 118 bits (297), Expect = 4e-38
Identities = 61/111 (54%), Positives = 82/111 (73%), Gaps = 1/111 (0%)

Query: 1 MYEVLVVIYLLVALGLIGLVLIQQGKGADMGASFGAGASGTLFGSSGSGNFLTRTTAILA 60
MYE L+V++L+VA+GL+GL+++QQGKGADMGASFGAGAS TLFGSSGSGNF+TR TA+LA
Sbjct: 1 MYEALLVVFLIVAIGLVGLIMLQQGKGADMGASFGAGASATLFGSSGSGNFMTRMTALLA 60

Query: 61 IAFFTLSLLIGNLSANHAKNEDSWKNLGSDTEQVTQPVEQGTQKSETKIPD 111
FF +SL++GN+++N W+NL S + Q K + IP+
Sbjct: 61 TLFFIISLVLGNINSNKTNKGSEWENL-SAPAKTEQTQPAAPAKPTSDIPN 110


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits      Score  E-value  Comments
AYJ58_RS19375  adhesinb  31     0.003    Adhesin B signature.
		>adhesinb#Adhesin B signature.

Length = 310

Score = 31.4 bits (71), Expect = 0.003
Identities = 18/95 (18%), Positives = 34/95 (35%), Gaps = 16/95 (16%)

Query: 142 REARRTFEVIAEELDIVIQKNGTMAFDNAIIAY----EPLWAVGTGKSATPEQAQEVHAF 197
+EA+ F I E +++ G F AY +W + T + TP+Q + +
Sbjct: 186 KEAKEKFNNIPGEKKMIVTSEG--CFKYFSKAYNVPSAYIWEINTEEEGTPDQIKTLVEK 243

Query: 198 IRKRLSEVSPFIGENIRILYGGSVTPSNAADLFAQ 232
+RK + L+ S ++
Sbjct: 244 LRKT----------KVPSLFVESSVDDRPMKTVSK 268


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
AYJ58_RS19390  HTHFIS  34     0.002    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 34.0 bits (78), Expect = 0.002
Identities = 23/82 (28%), Positives = 32/82 (39%), Gaps = 18/82 (21%)

Query: 193 VLMVGPPGTGKTLLAKAIAGESK---VPFFT-----ISGSDFVEMFVGV------GASRV 238
+++ G GTGK L+A+A+ K PF I G GA
Sbjct: 163 LMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTR 222

Query: 239 RD-MFEQAKKSAPCIIFIDEID 259
FEQA+ +F+DEI
Sbjct: 223 STGRFEQAEGGT---LFLDEIG 241


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS19405  SECFTRNLCASE  248    3e-83    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 248 bits (634), Expect = 3e-83
Identities = 91/306 (29%), Positives = 160/306 (52%), Gaps = 14/306 (4%)

Query: 2 KNINLTKWRYVSSAISIFLMITSLAIIGVKGFNWGLDFTGGVVTEVQLDRKITSSELQPL 61
N + +W++ + +I +MI S+ + V G N+G+DF GG + I +
Sbjct: 12 TNFDFFRWQWATFGAAIVMMIASVILPLVIGLNFGIDFKGGTTIRTESTTAIDVGVYRAA 71

Query: 62 LNAAYQQEVSVISASEPG----------RWVLRYGDIKSADTEQSNVDIQ----QTLAPL 107
L +V + +P R ++ + ++ L +
Sbjct: 72 LEPLELGDVIISEVRDPSFREDQHVAMIRIQMQEDGQGAEGQGAQGQELVNKVETALTAV 131

Query: 108 NSEVQVLNSSVVGPQIGQELAEQGGLALLVAMLCILGYLSYRFEWRLASGALFALVHDVV 167
+ +++ + VGP++ EL +LL A + I+ Y+ RFEW+ A GA+ ALVHDV+
Sbjct: 132 DPALKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVL 191

Query: 168 FVLAFFALTQMEFNLTVLAAVLAILGYSLNDSIIIADRIRELLIAKPKLSIQEINNQAIV 227
+ FA+ Q++F+LT +AA+L I GYS+ND++++ DR+RE LI + ++++ N ++
Sbjct: 192 LTVGLFAVLQLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVN 251

Query: 228 ATFSRTMVTSGTTLMTVGALWIMGGGPLEGFSIAMFIGILTGTFSSISVGTSLPEFLGLT 287
T SRT++T TTL+ + + I GG + GF AM G+ TGT+SS+ V ++ F+GL
Sbjct: 252 ETLSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNIVLFIGLD 311

Query: 288 PEHYKE 293
K+
Sbjct: 312 RNKEKK 317


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS19410  SECFTRNLCASE  78     9e-18    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 78.0 bits (192), Expect = 9e-18
Identities = 30/172 (17%), Positives = 82/172 (47%), Gaps = 4/172 (2%)

Query: 422 VTIVEERTIGPTLGAENIENGFAALGLGMGITLLFMALWYR-RLGWVANIALISNMVILF 480
+ I ++GP + E + +L + + ++ + + + A +AL+ ++++
Sbjct: 135 LKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVLLTV 194

Query: 481 GLLALIPGAVLTLPGIAGLVLTVGMAVDTNVLIFERIKDKLKEGRSFALA--IDTGFDSA 538
GL A++ L +A L+ G +++ V++F+R+++ L + ++ L ++ +
Sbjct: 195 GLFAVL-QLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVNET 253

Query: 539 FSTIFDANFTTMITAVVLYSIGNGPIQGFALTLGLGLLTSMFTGIFASRALI 590
S TT++ V + G I+GF + G+ T ++ ++ ++ ++
Sbjct: 254 LSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNIV 305


30  AYJ58_RS20385  AYJ58_RS20405  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias            CDNBias  %GCBias    Product
AYJ58_RS20385  -153973748600967  10       46.588103  HD domain-containing protein
AYJ58_RS20390  -153973748600967  11       47.584338  PAS domain S-box protein
AYJ58_RS20395  -153973748600967  12       47.517532  ATP-dependent RNA helicase SrmB
AYJ58_RS20400  -153973748600967  12       47.633409  efflux RND transporter periplasmic adaptor
AYJ58_RS20405  -153973748600967  15       48.302377  AcrB/AcrD/AcrF family protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
AYJ58_RS20395  HTHFIS  83     1e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 83.0 bits (205), Expect = 1e-19
Identities = 33/172 (19%), Positives = 61/172 (35%), Gaps = 23/172 (13%)

Query: 7 SKQTLLLVDDEPVNLRVLKQILHQ-DYQLIFAKNGEEALRLAQIEKPSLILLDIMMPNMT 65
+ T+L+ DD+ VL Q L + Y + N R L++ D++MP+
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 66 GFEVCQILKTMTETQSIPVIFVTALNDEHDEAAGFAAGGVDYIAKPISAAIVKARVKTHL 125
F++ +K +PV+ ++A N G DY+ KP + + L
Sbjct: 62 AFDLLPRIKKARP--DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119

Query: 126 SLVQADELKRTR---------------LQVIQRLGRAAEYKDN-----ETGT 157
+ + K ++ + L R + E+GT
Sbjct: 120 AEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGT 171


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
AYJ58_RS20400  HTHFIS  68     1e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 67.5 bits (165), Expect = 1e-13
Identities = 27/124 (21%), Positives = 47/124 (37%), Gaps = 7/124 (5%)

Query: 780 TILVVDDIQQNIDLLTVLFTRLGHKVITARDGQQALVRMQKGSIDITLMDLQMPIMDGLT 839
TILV DD +L +R G+ V + + G D+ + D+ MP +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 840 A-AKIRRQQEAEEQLVHMPIIALTASVLEQDQNAAVQAGMDGFANKPIDFAHLSREIARV 898
+I++ + +P++ ++A A + G + KP D L I R
Sbjct: 65 LLPRIKKARP------DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 899 LKLE 902
L
Sbjct: 119 LAEP 122


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
AYJ58_RS20410  RTXTOXIND  41     6e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 41.0 bits (96), Expect = 6e-06
Identities = 23/121 (19%), Positives = 47/121 (38%), Gaps = 7/121 (5%)

Query: 106 DYEADLMQAEATLAQATAALNEEIARGEVAKIEFKGYDKGLPPELGLRIPQLKKEQANVK 165
+ E ++A L + L + + AK E++ + E+ + +L++ N+
Sbjct: 256 EQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEI---LDKLRQTTDNIG 312

Query: 166 YAQAALARAQRNLERTVIRAPFDGIIKARNV-DLGQYVSLGTNLGELY---DTSVAEIRL 221
LA+ + + +VIRAP ++ V G V+ L + DT +
Sbjct: 313 LLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALV 372

Query: 222 P 222

Sbjct: 373 Q 373



Score = 39.4 bits (92), Expect = 2e-05
Identities = 25/125 (20%), Positives = 53/125 (42%), Gaps = 11/125 (8%)

Query: 62 GVVTPKYKTQLVTEVQGRMLSISADFVA-GGIVKKGDQLAQIEPSDYEADLMQAEATLAQ 120
G +T +++ + ++ + + V G V+KGD L ++ EAD ++ +++L Q
Sbjct: 88 GKLTHSGRSKEIKPIENSI--VKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQ 145

Query: 121 ATA------ALNEEIARGEVAKIEFKGYD--KGLPPELGLRIPQLKKEQANVKYAQAALA 172
A L+ I ++ +++ + + E LR+ L KEQ + Q
Sbjct: 146 ARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQK 205

Query: 173 RAQRN 177
+
Sbjct: 206 ELNLD 210


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS20415  ACRIFLAVINRP  497    e-161    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 497 bits (1282), Expect = e-161
Identities = 214/1042 (20%), Positives = 446/1042 (42%), Gaps = 48/1042 (4%)

Query: 11 FARNSVAANLLMWALLVGGLFSTVLINKEVFPSFELNLLNISVAYPGAAPQEIEEGINIK 70
F R + A +L L++ G + + + +P+ +++S YPGA Q +++ +
Sbjct: 5 FIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQV 64

Query: 71 IEEAIQDINGIKKVTSVA-SEGVGSITVEVEDGYEVQTVLDEAKLRLDAI-STFPVNIEK 128
IE+ + I+ + ++S + S G +IT+ + G + + + +L P +++
Sbjct: 65 IEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQ 124

Query: 129 PNIYQIKPENNVIWV----SVYGDMTLHDMKELAKS-VRDDLTQLPAVTRAKVTGVRDYE 183
I K ++ + V S T D+ + S V+D L++L V ++ G Y
Sbjct: 125 QGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG-AQYA 183

Query: 184 IGIEVSENKLREYGLTFSQVALAVQNSSFDLPGGSIRA------QDGDILLRTKGQAYTG 237
+ I + + L +Y LT V ++ + + G + Q + + + +
Sbjct: 184 MRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNP 243

Query: 238 DDFANIVVTTRPDGSRVMLPQVATIKDDFEERLEYTRFNGKPAAIIEVTSVDDQNALDIA 297
++F + + DGS V L VA ++ E R NGKPAA + + NALD A
Sbjct: 244 EEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDTA 303

Query: 298 AQVKQYVEDRRATLPANAQLDTWGDMTHYLKGRLNMMMSNMFYGALLVFIILALFL-DLK 356
+K + + + P ++ D T +++ ++ ++ +F +LVF+++ LFL +++
Sbjct: 304 KAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMR 363

Query: 357 LAFWVMMGLPVCFLGTMLIMPLEPFSMTINMLTLFAFILVLGIVVDDAIVIGESAYTE-V 415
+ +PV LGT I+ F +IN LT+F +L +G++VDDAIV+ E+ +
Sbjct: 364 ATLIPTIAVPVVLLGTFAILAA--FGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421

Query: 416 ERHGHSVENVIRGAQKVAMPATFGVLTTIAAFIPMLMVSGPMGIIWKSIGMVVILCLAFS 475
E E + ++ + A FIPM G G I++ + ++ +A S
Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481

Query: 476 LVESKFILPAHLAHM----KFKKPGAPRGFFGRLKANFNDRVQHFIHHSYRNFLERCIKQ 531
++ + + PA A + + GFFG F D + +S L
Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTF-DHSVNHYTNSVGKILGSTG-- 538

Query: 532 RYNVVAAFIGVLVLSIALVASGKVRWVFFPDIPSDFIQVQLEMDEGSSEDNTLKVVQSIE 591
RY ++ A I ++ + L ++ F P+ +++ G++++ T KV+ +
Sbjct: 539 RYLLIYALIVAGMVVLFL----RLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVT 594

Query: 592 EALYKMNDKMEQDNGYQVVKHSFINMSSRTSAFIFAELTKGEDR---EVDGVTIAAAWRE 648
+ K N+K ++ + V SF ++ + F L E+R E + +
Sbjct: 595 DYYLK-NEKANVESVFTVNGFSFSGQ-AQNAGMAFVSLKPWEERNGDENSAEAVIHRAKM 652

Query: 649 QLPELLSVKKLSFNASTNDAGGDIS------FRLTSSDLDELSAASKELKQKLASY-EGV 701
+L ++ + FN G + D L+ A +L A + +
Sbjct: 653 ELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASL 712

Query: 702 YDIADNFSSGSHEIRLKI-RPEAEALGLTLSDLARQVRYGFYGYEAQRILRNKEEIKVMV 760
+ N + + +L++ + +A+ALG++LSD+ + + G + K+ V
Sbjct: 713 VSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYV 772

Query: 761 RYPLEQRRTVGYLENMLIRTPTGTSVPFSTVAQIEKGESYASITRVDGKRAITITANANK 820
+ + R ++ + +R+ G VPFS + R +G ++ I A
Sbjct: 773 QADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAP 832

Query: 821 NIVEPSKVVQEIQKDYLPQLQAKYPKIQTALDGGSLDEQNAMVGLLQGFFFALFTIYALM 880
S + ++ +L A G S E+ + + ++ +
Sbjct: 833 GTS--SGDAMALMENLASKLPAGI-GYDWT--GMSYQERLSGNQAPALVAISFVVVFLCL 887

Query: 881 AIPLKSYSQPLIIMSVIPFGIIGALFGHLIQGLAMSVLSLCGIVALAGVVVNDSLILVDF 940
A +S+S P+ +M V+P GI+G L + V + G++ G+ +++++V+F
Sbjct: 888 AALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEF 947

Query: 941 VNRARE-QGQSVRQAAVDSGCYRFRAIILTSLTTFVGLVPIILERSLQAQIVIPMATSLA 999
E +G+ V +A + + R R I++TSL +G++P+ + + + +
Sbjct: 948 AKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVM 1007

Query: 1000 FGILFSTVVTLILVPLLYIILD 1021
G++ +T++ + VP+ ++++
Sbjct: 1008 GGMVSATLLAIFFVPVFFVVIR 1029


31  AYJ58_RS20610  AYJ58_RS20635  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
AYJ58_RS20610-1539737486009671248.323917NADP(H)-dependent aldo-keto reductase
AYJ58_RS20615-1539737486009671247.765363hypothetical protein
AYJ58_RS20620-1539737486009671346.713465MipA/OmpV family protein
AYJ58_RS20625-1539737486009671446.142433hypothetical protein
AYJ58_RS20630-1539737486009671346.756453two-component sensor histidine kinase
AYJ58_RS20635-1539737486009671747.159377DNA-binding response regulator
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
AYJ58_RS20620  HELNAPAPROT  30     0.007    Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature.

Length = 153

Score = 30.2 bits (68), Expect = 0.007
Identities = 22/89 (24%), Positives = 34/89 (38%), Gaps = 20/89 (22%)

Query: 112 AVDASLERLQIDTIDLY----QVHWPDRNTNFFG--ELFYEAQDQETQTPILETLEALAE 165
V+ SL + LY + HW + +FF E F +E ET++ +AE
Sbjct: 12 LVENSLNTQLSNWFLLYSKLHRFHWYVKGPHFFTLHEKF-----EELYDHAAETVDTIAE 66

Query: 166 VIRQGKVRYIGVSNETPWGLMK-YLQLAE 193
+ IG P +K Y + A
Sbjct: 67 RLLA-----IGGQ---PVATVKEYTEHAS 87


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
AYJ58_RS20630  IGASERPTASE  29     0.026    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 28.9 bits (64), Expect = 0.026
Identities = 27/115 (23%), Positives = 44/115 (38%), Gaps = 6/115 (5%)

Query: 127 IHLGTGTLSTKFQ--HDVTNVYDGFQADITYYHPINLGFGDLVPYAGVHYFSKDFANYYT 184
I LG G +K Q H+ Q +T NLG + P GV Y A++
Sbjct: 1377 IDLGYGKFQSKLQTNHNAKFARHTAQFGLTAGKAFNLGNFGITPIVGVRYSYLSNADFAL 1436

Query: 185 G---VTSSEATAQRPAYQADGTFAYKLGYALVIPV-TKHLDITQATGYSHIAANM 235
+ + + + Q D ++ Y LG V P+ + D Q +G ++
Sbjct: 1437 DQARIKVNPISVKTAFAQVDLSYTYHLGEFSVTPILSARYDANQGSGKINVNGYD 1491


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
AYJ58_RS20640  PF06580  29     0.039    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.1 bits (65), Expect = 0.039
Identities = 23/156 (14%), Positives = 46/156 (29%), Gaps = 36/156 (23%)

Query: 258 RDLDTMEDLVMTLLSYARLDEANIQPDWQSIELNAWLLEKYQGQVYPDFSVELVPYPTAL 317
D +++ +L R S+ +++ Y + + + L
Sbjct: 188 EDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSY-------LQLASIQFEDRL 240

Query: 318 K--IKTDPKYLSMQVNNLL-----NNALRFG------KAKIRLTLAVEEGATWLHVDDDG 364
+ + +P + +QV +L N ++ G KI L + G L V++ G
Sbjct: 241 QFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTG 300

Query: 365 PGIDELESAQVIKPFVRGQHSRGNSGHGMGLAIVDR 400
+ G GL V
Sbjct: 301 SLALK----------------NTKESTGTGLQNVRE 320


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
AYJ58_RS20645  HTHFIS  76     6e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 75.6 bits (186), Expect = 6e-18
Identities = 37/126 (29%), Positives = 56/126 (44%), Gaps = 1/126 (0%)

Query: 6 HILVVEDDISLAEWISDYLLDHGYEVTVASQGDFALEMIAEEIPDLVLLDVMLPVKNGFD 65
ILV +DD ++ ++ L GY+V + S IA DLV+ DV++P +N FD
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 66 VCKEARAFYAG-PILFMTACVEDGDEIRGLDVGADDYLTKPIRPQVLLARIKALLRRVGD 124
+ + P+L M+A I+ + GA DYL KP L+ I L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 125 EEQKQQ 130
K +
Sbjct: 125 RPSKLE 130


32  AYJ58_RS21375  AYJ58_RS21410  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
AYJ58_RS21375-1539737486009671447.648456nitrate/nitrite two-component system sensor
AYJ58_RS21380-1539737486009671748.204883DNA-binding response regulator
AYJ58_RS21385-1539737486009671749.099099hypothetical protein
AYJ58_RS21390-1539737486009671348.803058zinc transporter ZntB
AYJ58_RS21395-1539737486009671248.987269succinylglutamate desuccinylase
AYJ58_RS21400-1539737486009671148.710189lysine-sensitive aspartokinase 3
AYJ58_RS21405-1539737486009671348.225887DUF3293 domain-containing protein
AYJ58_RS21410-1539737486009671347.230209two-component system response regulator ArcA
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
AYJ58_RS21380  PF06580  42     4e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 42.2 bits (99), Expect = 4e-06
Identities = 27/147 (18%), Positives = 55/147 (37%), Gaps = 17/147 (11%)

Query: 410 INEGVSTAYVQLRELLSTFRLTIK-EPDLKSALEAMLEQLRAKTNI-------KITLDYK 461
I E + A L L R +++ + +L L + + + ++ + +
Sbjct: 186 ILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQ 245

Query: 462 LAPQWLEAKQHIHILQITREATLNAIKHA-----EASLINIHCYKDDKGMVNIDVCDNGI 516
+ P ++ + ++Q E N IKH + I + KD+ G V ++V + G
Sbjct: 246 INPAIMDVQVPPMLVQTLVE---NGIKHGIAQLPQGGKILLKGTKDN-GTVTLEVENTGS 301

Query: 517 GIGHLKERDQHFGIGIMHERASKLSGK 543
+ G+ + ER L G
Sbjct: 302 LALKNTKESTGTGLQNVRERLQMLYGT 328


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
AYJ58_RS21385  HTHFIS  61     2e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.4 bits (149), Expect = 2e-13
Identities = 27/159 (16%), Positives = 61/159 (38%), Gaps = 9/159 (5%)

Query: 6 SVLVVDDHPLLRKGICQLIASDPDFSLFGEAGGGLDALTAVATDEPDIILLDLNMKGMTG 65
++LV DD +R + Q S + + +A + D+++ D+ M
Sbjct: 5 TILVADDDAAIRTVLNQ-ALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 66 LDTLNAMRQEGVTSRIVILTVSDAKQDVIRLLRAGADGYLLKDTEPDLLLEKLKNAMLGH 125
D L +++ +++++ + I+ GA YL K + L+ + A+
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL--- 119

Query: 126 RVISDEVEEYLYELKDATDEQEWISSLTPRELQILEQLA 164
E + +L+D + + + + +I LA
Sbjct: 120 ----AEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLA 154


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
AYJ58_RS21405  DHBDHDRGNASE  29     0.026    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 29.2 bits (65), Expect = 0.026
Identities = 24/86 (27%), Positives = 42/86 (48%), Gaps = 5/86 (5%)

Query: 10 GTSVADYNAMNRCADIVLANPHCRLVVVSASSGVTNLLVELTQESINDDGRLQRLK-QIA 68
+S A +C + LA + R +VS S T++ L + ++G Q +K +
Sbjct: 158 ASSKAAAVMFTKCLGLELAEYNIRCNIVSPGSTETDMQWSLWAD---ENGAEQVIKGSLE 214

Query: 69 QIQYAI-LDKLGRPNDVAAALDKLLS 93
+ I L KL +P+D+A A+ L+S
Sbjct: 215 TFKTGIPLKKLAKPSDIADAVLFLVS 240


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
AYJ58_RS21415  HTHFIS  83     1e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 83.3 bits (206), Expect = 1e-20
Identities = 31/131 (23%), Positives = 61/131 (46%), Gaps = 1/131 (0%)

Query: 1 MQNPHILIVEDEAVTRNTLRSIFEAEGYVVTEANDGAEMHKAMQENKINLVVMDINLPGK 60
M IL+ +D+A R L GY V ++ A + + + +LVV D+ +P +
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 NGLLLARELREIN-NIGLIFLTGRDNEVDKILGLEIGADDYITKPFNPRELTIRARNLLT 119
N L +++ ++ ++ ++ ++ + I E GA DY+ KPF+ EL L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 120 RVNSAGAEVEE 130
+++E+
Sbjct: 121 EPKRRPSKLED 131



 
Contact Sachin Pundhir for Bugs/Comments.