PredictBias

Identification of genomic and pathogenicity islands in prokaryotic genomes
 
A) Input parameters
Genome                       CA382.gbff
Threshold dinucleotide bias  2
Threshold codon bias         4
Threshold %GC bias           3
E-value (RPSBlast)           0.05
Genome (non-pathogenic)
 
 
C) Potential GIs and PAIs in NC_022048
S.No  Start         End           Bias  Virulence  Insertion elements  Prediction
1     L144_RS00335  L144_RS00385  Y     N          N                   Genomic Island
LocusTag      DNBias  CDNBias  %GCBias    Product
L144_RS00335  2       35       -2.255740  Cof-type HAD-IIB family hydrolase
L144_RS00340  3       33       -1.500508  aminopeptidase
L144_RS00345  2       33       -2.715377  divergent PAP2 family protein
L144_RS00350  3       33       -3.278916  hypothetical protein
L144_RS00355  4       33       -3.299458  membrane protein
L144_RS00360  4       27       -3.662518  hypothetical protein
L144_RS00365  3       27       -3.237347  peptide chain release factor 2
L144_RS00370  2       29       -5.153466  hypothetical protein
L144_RS00375  0       20       -4.457408  signal recognition particle-docking protein
L144_RS00380  0       17       -4.217081  hypothetical protein
L144_RS00385  -1      15       -3.478572  ABC transporter ATP-binding protein
2     L144_RS00860  L144_RS00915  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
L144_RS00860  2       9        2.670566   MoxR family ATPase
L144_RS00865  2       9        2.261221   16S rRNA (guanine(527)-N(7))-methyltransferase
L144_RS00870  2       12       2.682508   tRNA uridine-5-carboxymethylaminomethyl(34)
L144_RS00875  4       19       1.429676   tRNA uridine-5-carboxymethylaminomethyl(34)
L144_RS00880  3       26       1.112964   FlbF protein
L144_RS00885  2       21       1.406133   flagellar hook-associated protein FlgK
L144_RS00890  0       14       0.356771   flagellar hook-associated protein 3
L144_RS00895  0       10       -1.340268  flagellar assembly protein FliW
L144_RS00900  2       14       -1.537618  carbon storage regulator CsrA
L144_RS00905  1       17       0.019703   tRNA
L144_RS00910  2       11       -1.087308  tRNA
L144_RS00915  2       15       -2.958830  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
L144_RS00860  HTHFIS  41     4e-06    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 41.0 bits (96), Expect = 4e-06
Identities = 36/143 (25%), Positives = 55/143 (38%), Gaps = 15/143 (10%)

Query: 33 KEMIDAILMGLLTDGHVLLEGVPGLAKTL---AIQTVSDVLDLEFKRIQ---FTPDLLPS 86
+E+ + + TD +++ G G K L A+ + F I DL+ S
Sbjct: 147 QEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIES 206

Query: 87 DLTGNMVYKSA-TGTFKVRKGPVFS----NVILADEINRAPAKVQSALLEAMGERQVT-L 140
+L G K A TG G F + DEI P Q+ LL + + + T +
Sbjct: 207 ELFG--HEKGAFTGAQTRSTG-RFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTV 263

Query: 141 GDETHRLPDPFFVLATQNPIEQE 163
G T D V AT ++Q
Sbjct: 264 GGRTPIRSDVRIVAATNKDLKQS 286
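
Note: the per-hit statistics lines in this report (Score, Expect, Identities, Positives, Gaps) follow the legacy BLAST text layout and can be read mechanically. The following is a minimal Python sketch under that assumption. The names `parse_expect`, `parse_hsp_stats` and `EVALUE_CUTOFF` are hypothetical helpers and are not part of PredictBias or BLAST. The cutoff is the "E-value (RPSBlast)" value listed under Input parameters; how the server applies it (strict or non-strict comparison) is not stated here, so the `<=` below is an assumption.

```python
import re

# Assumed cutoff: the "E-value (RPSBlast)" input parameter shown in this report.
EVALUE_CUTOFF = 0.05

# Patterns for the two statistics lines of a legacy BLAST text alignment.
_SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
_FRACTION_RE = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)")


def parse_expect(text):
    """Convert an E-value string to float.

    Legacy BLAST output may drop the leading digit (e.g. 'e-165'),
    which float() rejects, so a '1' is prepended in that case.
    """
    text = text.rstrip(",")
    if text.startswith("e"):
        text = "1" + text
    return float(text)


def parse_hsp_stats(score_line, identity_line):
    """Return a dict with bits, raw_score, expect and (count, length) pairs."""
    match = _SCORE_RE.search(score_line)
    if match is None:
        raise ValueError("not a BLAST score line: %r" % score_line)
    stats = {
        "bits": float(match.group(1)),
        "raw_score": int(match.group(2)),
        "expect": parse_expect(match.group(3)),
    }
    for name, count, length in _FRACTION_RE.findall(identity_line):
        stats[name.lower()] = (int(count), int(length))
    # The Gaps field is omitted for gap-free alignments; record it as zero.
    if "gaps" not in stats and "identities" in stats:
        stats["gaps"] = (0, stats["identities"][1])
    # Assumption: a hit is kept when its E-value does not exceed the cutoff.
    stats["passes_cutoff"] = stats["expect"] <= EVALUE_CUTOFF
    return stats


if __name__ == "__main__":
    hit = parse_hsp_stats(
        "Score = 41.0 bits (96), Expect = 4e-06",
        "Identities = 36/143 (25%), Positives = 55/143 (38%), Gaps = 15/143 (10%)",
    )
    print(hit)
```

The special case in `parse_expect` matters for hits in this report whose E-value is printed as `e-165` or `e-153`, which would otherwise fail numeric conversion.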


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
L144_RS00875  TCRTETOQM  34     0.002    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 33.7 bits (77), Expect = 0.002
Identities = 34/112 (30%), Positives = 48/112 (42%), Gaps = 14/112 (12%)

Query: 229 GSVNAGKSSLFNLFLKKDRSIVSSYPGTTRDYIEASFELDGILFNLFDTAGLRDADNFVE 288
GSV+ G + N L++ R I T + I SF+ + N+ DT G D V
Sbjct: 34 GSVDKGTTRTDNTLLERQRGI------TIQTGI-TSFQWENTKVNIIDTPGHMDFLAEVY 86

Query: 289 RLGIEKSNSLIKEASLVIYVIDVSSNLTKDDFLFIDSNKSNSKILFVLNKID 340
R S S++ A L+I D T+ LF K +F +NKID
Sbjct: 87 R-----SLSVLDGAILLISAKDGVQAQTR--ILFHALRKMGIPTIFFINKID 131


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
L144_RS00885  FLGHOOKAP1  521    0.0      Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 521 bits (1344), Expect = 0.0
Identities = 138/633 (21%), Positives = 260/633 (41%), Gaps = 105/633 (16%)

Query: 6 SGIEIGKRSLFAHKDAMNTVGHNLSNATKPGYSRQRVTMKTEIPLYAPQLNRAKKQGQLG 65
S I L A + A+NT +N+S+ GY+RQ M A + G +G
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIM-------AQANSTLGAGGWVG 54

Query: 66 QGIVVQSIDRVKDELLNTRIIEESHRLGYWTSQDKFISILEDVYNEPEDQSIRKRLNDFW 125
G+ V + R D + ++ + T++ + +S ++++ + S+ ++ DF+
Sbjct: 55 NGVYVSGVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLST-STSSLATQMQDFF 113

Query: 126 ESWHDLANQPQGLAERKIILERGKSFCEGIRNRFHSLERIYIMANDEIKI----TTDEAN 181
S L + + A R+ ++ + EG+ N+F + ++ + ++ I + D+ N
Sbjct: 114 TSLQTLVSNAEDPAARQALIGKS----EGLVNQFKTTDQYLRDQDKQVNIAIGASVDQIN 169

Query: 182 NYIRNIANLNKQISKSQAMK--DNPNDLMDARDLMVEKLGNIISVSIENKQDPNEFLIHA 239
NY + IA+LN QIS+ + +PN+L+D RD +V +L I+ V + + + A
Sbjct: 170 NYAKQIASLNDQISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMA 229

Query: 240 EGRHLVQGSIANEF-KLEATNGPTRTRWNIL--WANN---DKAYLKTGKLGSLLNIRDEE 293
G LVQGS A + + ++ P+RT + A N + L TG LG +L R ++
Sbjct: 230 NGYSLVQGSTARQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQD 289

Query: 294 IKNEINELNNIAANIIEIVNEIHEAGRGMDKKNGRSFFSQELKLTDDRGRYDTNGNGQFD 353
+ N L +A E N H+AG + G FF+ + N + D
Sbjct: 290 LDQTRNTLGQLALAFAEAFNTQHKAGFDANGDAGEDFFAIG------KPAVLQNTKNKGD 343

Query: 354 -SVHIFKINSTNEIFPEEKLGFYGTLKFEATNSNEIVEIPYNAPDTVQDVINRINNSNAQ 412
++ +++ + + K+ F +++ T A +T V N A
Sbjct: 344 VAIGATVTDASAVLATDYKISFDNN-QWQVTRL---------ASNTTFTVTPDANGKVAF 393

Query: 413 VTARINSEGKLEIKAVKEQEDENITFKIKHIEDSGSFLTKYTGILNASGPEGAYDYKNID 472
+ G AV + +F +K + D I+N ++
Sbjct: 394 DGLELTFTGTP---AVND------SFTLKPVSD---------AIVNM----------DVL 425

Query: 473 TTD--KLAPKSTYSISPLKNPAAWIKVADIIDSDPSKIASGIKNPTNEISIGDNQAALRI 530
TD K+A S + A DSD + + +N ++G ++
Sbjct: 426 ITDEAKIAMAS-------EEDAG--------DSDNRNGQALLDLQSNSKTVGGAKSF--N 468

Query: 531 SSFGNSQIMIGKNLTLNDYFANTASNIAIKGQISEITKESQSQILKDLTDLRMSISGVNK 590
++ + IG + T N+ +++++ + Q SISGVN
Sbjct: 469 DAYASLVSDIGNKTATLKTSSATQGNV-----VTQLSNQQQ------------SISGVNL 511

Query: 591 DEELANMIEFQQAFIAASKFITVSVELIDTVIN 623
DEE N+ FQQ ++A ++ + + + D +IN
Sbjct: 512 DEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
L144_RS00890  FLAGELLIN  58     4e-11    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 57.7 bits (139), Expect = 4e-11
Identities = 37/133 (27%), Positives = 59/133 (44%), Gaps = 5/133 (3%)

Query: 10 TYENFKTSSAEQESKITKLLENLYKGGKRIVKLRNDPTGVTHAIRLDNDIFKLNVYIKNI 69
T N S + S I +L G RI ++D G A R ++I L +N
Sbjct: 13 TQNNLNKSQSSLSSAIERL-----SSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNA 67

Query: 70 DTSKSNLRYTEGYLQSLTNILTRAKEIAIQGASGTYESDDKKMISKEVNALLEDVVAIAN 129
+ S + TEG L + N L R +E+++Q +GT D K I E+ LE++ ++N
Sbjct: 68 NDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSN 127

Query: 130 AKGPDGYSIFSGT 142
+G + S
Sbjct: 128 QTQFNGVKVLSQD 140



Score = 31.2 bits (70), Expect = 0.008
Identities = 33/358 (9%), Positives = 87/358 (24%), Gaps = 12/358 (3%)

Query: 61 KLNVYIKNIDTSKSNLRYTEGYLQSLTNILTRAKEIAIQGASGTYESDDKKMISKEVNAL 120
+ + ++ ID L + TY K +
Sbjct: 154 TITIDLQKIDVKSLGLDGFNVNGPKEATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGA 213

Query: 121 LEDVVAIANAKGPDGYSIFSGTKIDSEAFKVTRENKISKTSKDGAGPQIIKVEYNGNQAE 180
+ + +G +A T + T + + +
Sbjct: 214 VVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGK 273

Query: 181 KKTEVYNDIHMSNNYPGNVIFFLQNQNIISSINTNGFAVKENTKIYIDNIEIGLTAGDTA 240
+ + + +S+ I + ++
Sbjct: 274 EGDTFDYK---GVTFTIDTKTGNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSS 330

Query: 241 LDIVAKINESSAPVEASIDPVLNSLSIKTTTPHQIWITEEKESNVLQTLGILTKNNDTKL 300
++ + + LS + + + K+
Sbjct: 331 KNVYTSVVNGQFTFDDKTKNESAKLSDLEANN----AVKGESKITVNGAEYTANAAGDKV 386

Query: 301 PPYNLSSSTEVRSRSIFDALIELRDTLYNNKEELVGSRSLAEIDESLKRLLISVADLGAK 360
+ + + + + E + + S ID +L ++ + LGA
Sbjct: 387 TLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLAS-----IDSALSKVDAVRSSLGAI 441

Query: 361 ENRLDRSYERISKEAADMKEDMIQYTDLDVTKAITNLNMASLAYQVSIGISAKIMQTT 418
+NR D + + ++ + D D ++N++ A + Q + A+ Q
Sbjct: 442 QNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAGTSVLAQANQVP 499


3     L144_RS01380  L144_RS01575  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
L144_RS01380  1       17       3.157351   flagella biosynthesis regulatory protein FliZ
L144_RS01385  1       18       4.113312   flagellar motor switch protein FliN
L144_RS01390  1       16       4.870427   flagellar motor switch protein FliM
L144_RS01395  0       19       4.464363   flagellar basal body-associated protein FliL
L144_RS01400  -2      19       2.527963   flagellar motor protein MotB
L144_RS01405  -2      19       1.911203   motility protein A
L144_RS01410  -1      15       0.372353   flagellar protein FlbD
L144_RS01415  -2      15       1.119653   flagellar hook protein FlgE
L144_RS01420  1       16       -0.322670  flagellar hook assembly protein FlgD
L144_RS01425  5       17       0.685982   flagellar hook-length control protein FliK
L144_RS01430  6       17       2.962283   flagellar protein
L144_RS01435  5       18       3.234829   flagellar protein FlbA
L144_RS01440  5       19       4.228737   flagellar protein export ATPase FliI
L144_RS01445  6       19       3.954428   flagellar assembly protein FliH
L144_RS01450  4       19       3.543925   flagellar motor switch protein FliG
L144_RS01455  2       19       3.427574   flagellar basal body M-ring protein FliF
L144_RS01460  2       19       2.137844   flagellar hook-basal body complex protein FliE
L144_RS01465  1       18       1.512151   flagellar basal body rod protein FlgC
L144_RS01470  1       19       2.796799   flagellar basal body rod protein FlgB
L144_RS01475  0       19       3.869604   HslU--HslV peptidase ATPase subunit
L144_RS01480  -1      19       3.316218   ATP-dependent protease subunit HslV
L144_RS01485  -2      17       2.495167   DNA-protecting protein DprA
L144_RS01490  -1      16       2.598200   hypothetical protein
L144_RS01495  -1      15       2.513109   cell division protein FtsZ
L144_RS01500  0       14       0.773577   cell division protein FtsA
L144_RS01505  1       13       -0.600032  cell division protein FtsQ/DivIB
L144_RS01510  3       15       -0.748834  putative lipid II flippase FtsW
L144_RS01515  1       15       -2.282650  phospho-N-acetylmuramoyl-pentapeptide-
L144_RS01520  -1      16       -3.086143  UDP-N-acetylmuramoyl-tripeptide--D-alanyl-D-
L144_RS01525  0       18       -3.428453  hypothetical protein
L144_RS01530  0       16       -2.846612  16S rRNA (cytosine(1402)-N(4))-methyltransferase
L144_RS01535  2       17       -3.233511  hypothetical protein
L144_RS01540  -1      13       -3.130300  hypothetical protein
L144_RS01545  -1      13       -2.380622  hypothetical protein
L144_RS01550  -1      13       -2.752786  NAD(+)/NADH kinase
L144_RS01555  -1      13       -3.304311  chemotaxis protein CheW
L144_RS01560  0       10       -3.295684  RlmE family RNA methyltransferase
L144_RS01565  0       10       -3.147009  polyprenyl synthetase family protein
L144_RS01570  0       11       -3.176554  hypothetical protein
L144_RS01575  1       15       -3.239931  ABC transporter permease
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
L144_RS01385  FLGMOTORFLIN  103    7e-32    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 103 bits (258), Expect = 7e-32
Identities = 40/77 (51%), Positives = 62/77 (80%)

Query: 35 NFGLLMDVSMQLTVELGRTERKIKDILGMSEGTIITLDKLAGEPVDILVNGKIVAKGEVV 94
+ L+MD+ ++LTVELGRT IK++L +++G+++ LD LAGEP+DIL+NG ++A+GEVV
Sbjct: 53 DIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVV 112

Query: 95 VIDENFGVRITEIIKTK 111
V+ + +GVRIT+II
Sbjct: 113 VVADKYGVRITDIITPS 129


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
L144_RS01390  FLGMOTORFLIM  458    e-165    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 458 bits (1181), Expect = e-165
Identities = 195/345 (56%), Positives = 259/345 (75%), Gaps = 9/345 (2%)

Query: 8 LSQDDIDSLLESINSSESLSLDESLSNVISSPTGKKQKVKVYDFKRPDKFSKEQVRTVSS 67
LSQD+ID LL +I+S ++ D + P +K+ +YDF+RPDKFSKEQ+RT+S
Sbjct: 5 LSQDEIDQLLTAISSGDASIED-------ARPISDTRKITLYDFRRPDKFSKEQMRTLSL 57

Query: 68 FHEAFARYTTTSLSALLRKMVHVHVASVDQLTYEEFIRSIPNPTTLAIINMDPLKGSAIF 127
HE FAR TTTSLSA LR MVHVHVASVDQLTYEEFIRSIP P+TLA+I MDPLKG+A+
Sbjct: 58 MHETFARLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVL 117

Query: 128 EVDPTIAFAIVDRLFGGDGDTIKDKSRDLTEIEQSVMESVIIRILANMREAWSQVVDLRP 187
EVDP+I F+I+DRLFGG G K + RDLT+IE SVME VI+RILAN+RE+W+QV+DLRP
Sbjct: 118 EVDPSITFSIIDRLFGGTGQAAKVQ-RDLTDIENSVMEGVIVRILANVRESWTQVIDLRP 176

Query: 188 RFGHIEVNPQFAQIVPPTEMIILVTLEVKIGKVEGLMNFCLPYITIEPIVSKLSTRYWHS 247
R G IE NPQFAQIVPP+EM++LVTLE K+G+ EG+MNFC+PYITIEPI+SKLS+++W S
Sbjct: 177 RLGQIETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFS 236

Query: 248 LIGVGTTSENLDALREKLENTAMPLVAEIGEVKLKVREILSLDKGDVLNLESSLINKDLT 307
+ +T++ + LR+KL M +VAE+G ++L VR+IL L GD++ L + +
Sbjct: 237 SVRRSSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFV 296

Query: 308 LKVGTKEKFKCRMGLMGNKVSVQITEKIGDIKGFDLLKELTEEVE 352
L +G ++KF C+ G++G K++ QI E+I D +EL+ + E
Sbjct: 297 LSIGNRKKFLCQPGVVGKKIAAQILERIESTSQED-FEELSADEE 340


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
L144_RS01400  OMPADOMAIN  55     8e-11    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 54.6 bits (131), Expect = 8e-11
Identities = 28/132 (21%), Positives = 57/132 (43%), Gaps = 21/132 (15%)

Query: 124 ISLAADAFFDSASADVKLEENRDSIQKIASFIGFLSPRGYNFKIEGHTDNIDTDVNGPWK 183
+L +D F+ A +K E + ++ ++ S + L P+ + + G+TD I +D
Sbjct: 215 FTLKSDVLFNFNKATLK-PEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSD-----A 268

Query: 184 SNWELSAARSVNMLEHILNYLDQSDVKRIENNFEVSGFGGSRPIATDDT---------PE 234
N LS R+ + +++YL + + G G S P+ + +
Sbjct: 269 YNQGLSERRA----QSVVDYLISKGIPA--DKISARGMGESNPVTGNTCDNVKQRAALID 322

Query: 235 GRAYNRRIDILI 246
A +RR++I +
Sbjct: 323 CLAPDRRVEIEV 334


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
L144_RS01415  FLGHOOKAP1  50     1e-08    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 50.3 bits (120), Expect = 1e-08
Identities = 18/89 (20%), Positives = 33/89 (37%), Gaps = 13/89 (14%)

Query: 4 SLYSGVSGLQNHQTRMDVVGNNIANVNTIGFKKGRINFQDMISQSISGASRPTDARGGTN 63
+ + +SGL Q ++ NNI++ N G+ + I + T GG
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTT---------IMAQANSTLGAGGW- 52

Query: 64 PKQVGLGMNVASIDTIHTQGAFQSTQKAS 92
VG G+ V+ + + + A
Sbjct: 53 ---VGNGVYVSGVQREYDAFITNQLRAAQ 78



Score = 40.7 bits (95), Expect = 1e-05
Identities = 9/48 (18%), Positives = 28/48 (58%)

Query: 394 IRSGVLEMANVDLAEQFTDMIVTQRGFQANAKTITTSDQLLQELVRLK 441
+ + ++ V+L E++ ++ Q+ + ANA+ + T++ + L+ ++
Sbjct: 499 LSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
L144_RS01425  FLGHOOKFLIK  40     1e-05    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 39.8 bits (92), Expect = 1e-05
Identities = 24/82 (29%), Positives = 45/82 (54%), Gaps = 5/82 (6%)

Query: 281 KLVLKPKELGSIRINLNLDSNNNLLGKIVVDNQNVKMLFDQNMHSLNKMLGESGFNASLN 340
+L L P++LG ++I+L +D N + ++V +Q+V+ + + L L ESG +
Sbjct: 260 ELRLHPQDLGEVQISLKVDDNQAQI-QMVSPHQHVRAALEAALPVLRTQLAESGIQLGQS 318

Query: 341 LFLAGENLNSFTGNFKDDSKDQ 362
++GE SF+G + S+ Q
Sbjct: 319 -NISGE---SFSGQQQAASQQQ 336


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
L144_RS01430  TYPE4SSCAGX  29     0.016    Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature.

Length = 522

Score = 29.0 bits (64), Expect = 0.016
Identities = 34/143 (23%), Positives = 66/143 (46%), Gaps = 20/143 (13%)

Query: 39 RYFPEFVRTKLLGETSLVFDHNSNIILDEAR--LVKEREAIDIKNQQIEKLKEDLKLKED 96
R + EF++TK L+ D L+E + L KE+EA + + Q+ +K K + + +E
Sbjct: 120 RDYQEFLKTK-----KLIVDAPDPKELEEQKKALEKEKEAKE-QAQKAQKDKREKRKEER 173

Query: 97 SLNKLEFELKQKQKDLDLKQKIIDDIINKYNDEEANILQTAVYLMNMPPEDAVKRLEDLN 156
+ N+ E + + + N N L + D ++RLED+
Sbjct: 174 AKNRANLE------------NLTNAMSNPQNLSNNKNLSELIKQQRENELDQMERLEDMQ 221

Query: 157 PELAISYMRKIEELSKKEGRLSI 179
+ + +++IEEL+KK+ ++
Sbjct: 222 EQAQANALKQIEELNKKQAEEAV 244


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
L144_RS01445  FLGFLIH  47     3e-08    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 46.7 bits (110), Expect = 3e-08
Identities = 43/197 (21%), Positives = 96/197 (48%), Gaps = 22/197 (11%)

Query: 115 KESIETESNAEIER-LAREYEEKLKTDLDIAIAKGREEGYSKGY--------ESGFEDFD 165
+E+I E+ +E+ LA+ + + IA+GR++G+ +GY E G +
Sbjct: 29 EETIIEEAEPSLEQQLAQLQMQAHEQGYQAGIAEGRQQGHKQGYQEGLAQGLEQGLAEAK 88

Query: 166 KVMRKLHAIIASLIAERKGILESSSGQIVSLVMQIAIKVIKRITDSQKDI----VLENVN 221
+HA + L++E + L++ I S +MQ+A++ +++ + +++ +
Sbjct: 89 SQQAPIHARMQQLVSEFQTTLDALDSVIASRLMQMALEAARQVIGQTPTVDNSALIKQIQ 148

Query: 222 EVLKR---VKDKTQITIRVNLDDLDIVRHKKSDFISRFDIIENLEIIEDPNIGKGGCIIE 278
++L++ K Q +RV+ DDL V +S + + DP + GGC +
Sbjct: 149 QLLQQEPLFSGKPQ--LRVHPDDLQRVDDMLGATLS----LHGWRLRGDPTLHPGGCKVS 202

Query: 279 TNFGEIDARISSQLDKI 295
+ G++DA ++++ ++
Sbjct: 203 ADEGDLDASVATRWQEL 219


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
L144_RS01450  FLGMOTORFLIG  427    e-153    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 427 bits (1100), Expect = e-153
Identities = 344/344 (100%), Positives = 344/344 (100%)

Query: 1 MEEKKEKEILDVSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITS 60
MEEKKEKEILDVSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITS
Sbjct: 1 MEEKKEKEILDVSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITS 60

Query: 61 ELKDNVLLEFKELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFV 120
ELKDNVLLEFKELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFV
Sbjct: 61 ELKDNVLLEFKELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFV 120

Query: 121 RRADPANILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPE 180
RRADPANILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPE
Sbjct: 121 RRADPANILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPE 180

Query: 181 VVREVERVLEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEI 240
VVREVERVLEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEI
Sbjct: 181 VVREVERVLEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEI 240

Query: 241 KKKMFVFEDIVLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKED 300
KKKMFVFEDIVLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKED
Sbjct: 241 KKKMFVFEDIVLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKED 300

Query: 301 MEFLGPTRRKDVEESQQKIVSLIRKLEEQGEIVISRGGEEDVLV 344
MEFLGPTRRKDVEESQQKIVSLIRKLEEQGEIVISRGGEEDVLV
Sbjct: 301 MEFLGPTRRKDVEESQQKIVSLIRKLEEQGEIVISRGGEEDVLV 344


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
L144_RS01455  FLGMRINGFLIF  162    1e-45    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 162 bits (412), Expect = 1e-45
Identities = 112/565 (19%), Positives = 222/565 (39%), Gaps = 55/565 (9%)

Query: 23 QKIALGLIIFFVILALVFLIGFSTKSQSIALF-GVEIKDQYLLDRISQRLDRENVKYFLS 81
+I L + + +V ++ ++ LF + +D I +L + N+ Y +
Sbjct: 23 PRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQD---GGAIVAQLTQMNIPYRFA 79

Query: 82 SDGRIYLDDEKLAKKMRAILVREELVPVHMDPWALFDIDRWTITDFERSINLRRSITRAV 141
+ ++R L ++ L + L D +++ I+ F +N +R++ +
Sbjct: 80 NGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQVNYQRALEGEL 139

Query: 142 EQHIVALDDVDAVSVNLVMPEKALFKESQEPVKASVRITPRPGSDIITNRKKVEGLVKLI 201
+ I L V + V+L MP+ +LF Q+ ASV +T PG + + ++ +V L+
Sbjct: 140 ARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRAL--DEGQISAVVHLV 197

Query: 202 QYAIEGLESDNIAIVDNSGTILNDFSNLDGIDRIDLAEKERKLKLKYEAMLRGEIDSALS 261
A+ GL N+ +VD SG +L SN G DL + + K E+ ++ I++ LS
Sbjct: 198 SSAVAGLPPGNVTLVDQSGHLL-TQSNTSG---RDLNDAQLKFANDVESRIQRRIEAILS 253

Query: 262 KVLSVDRFMIARVNVKLDTSKETTESKEYAPIELQSQDPKASYNTRKVSDSTIISSQTQK 321
++ A+V +LD + + + Y+P KA+ +R+++ S + +
Sbjct: 254 PIVGNGNV-HAQVTAQLDFANKEQTEEHYSP---NGDASKATLRSRQLNISEQVGAGYPG 309

Query: 322 KEYQGQGYSPWGPPGQEGNTPPEYQDLSD-------------ITGKYNESQEIKNVALNE 368
P P TPP Q + + + E N ++
Sbjct: 310 GVPGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSNYEVDR 369

Query: 369 KKSTSEKEPARIVGVSLGIFVDGIWNFVYDEKGDFVIENGMRKREYKPMALEEIKNIEDV 428
++ I +S+ + V+ + + P+ +++K IED+
Sbjct: 370 TIRHTKMNVGDIERLSVAVVVNY---------------KTLADGKPLPLTADQMKQIEDL 414

Query: 429 LQSSFEYKPERGDSITVRNISFDRMNEFREIDENYFASER--FKYFLFIASIVFSLLILV 486
+ + + +RGD++ V N F ++ E F ++ L + L++
Sbjct: 415 TREAMGFSDKRGDTLNVVNSPFSAVDN--TGGELPFWQQQSFIDQLLAAGRWLLVLVVAW 472

Query: 487 FTIFFAISRELERRRRLREEELAKQAHLRRQQALMDG-----GDDIGVDDVVGGIREGDE 541
A+ +L RR EE A Q + +Q + D + R G E
Sbjct: 473 ILWRKAVRPQLTRR---VEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQQRRANQRLGAE 529

Query: 542 LQSNA-ELLAREKPEDVAKLIRTWL 565
+ S ++ P VA +IR W+
Sbjct: 530 VMSQRIREMSDNDPRVVALVIRQWM 554


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
L144_RS01460  FLGHOOKFLIE  86     2e-25    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.

Length = 103

Score = 85.9 bits (212), Expect = 2e-25
Identities = 15/71 (21%), Positives = 38/71 (53%)

Query: 40 TFKDVLINSITDVNKSQLNVSKVTEQAILKPSSIDVHDVVIAMSKANMNLSILKAVVERG 99
+F L ++ ++ +Q E+ L + ++DV+ M KA++++ + V +
Sbjct: 32 SFAGQLHAALDRISDTQTAARTQAEKFTLGEPGVALNDVMTDMQKASVSMQMGIQVRNKL 91

Query: 100 VKAYQDIINIR 110
V AYQ++++++
Sbjct: 92 VAAYQEVMSMQ 102


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
L144_RS01465  FLGHOOKAP1  48     3e-09    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 47.6 bits (113), Expect = 3e-09
Identities = 21/80 (26%), Positives = 35/80 (43%), Gaps = 15/80 (18%)

Query: 5 SSINVASTGLTAQRLRIDVISNNIANVSTSRTPDGGPYRRQRIIFAPRVNNPYWKGPFIP 64
S IN A +GL A + ++ SNNI++ + + Y RQ I A N
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVA------GYTRQTTIMAQ--ANSTLGA---- 49

Query: 65 DYLDNGIGQGVRVASIEKDK 84
+G GV V+ ++++
Sbjct: 50 ---GGWVGNGVYVSGVQREY 66



Score = 44.2 bits (104), Expect = 5e-08
Identities = 11/46 (23%), Positives = 23/46 (50%)

Query: 104 KKGYVELPNVNLVEEMVDMISASRAYEANSTVINSSKSMFRSALAI 149
+ VNL EE ++ + Y AN+ V+ ++ ++F + + I
Sbjct: 500 SNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
L144_RS01475  HTHFIS  31     0.011    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 31.0 bits (70), Expect = 0.011
Identities = 13/70 (18%), Positives = 30/70 (42%), Gaps = 15/70 (21%)

Query: 20 KYIIGQDEAKKLVSIALVNRYIRSRLPKEIKDEVMPKNIIMIGSTGIGKTEIAR---RLS 76
++G+ A + + ++ R +++ L +++ G +G GK +AR
Sbjct: 137 MPLVGRSAAMQEI-YRVLARLMQTDLT-----------LMITGESGTGKELVARALHDYG 184

Query: 77 KLIKAPFIKV 86
K PF+ +
Sbjct: 185 KRRNGPFVAI 194


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
L144_RS01490  SYCDCHAPRONE  29     0.012    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 28.7 bits (64), Expect = 0.012
Identities = 16/91 (17%), Positives = 34/91 (37%), Gaps = 2/91 (2%)

Query: 69 ARFFNLIGLEFFKLGQYGPAIEYFAKNLEINPNNYLSHFYIGVASYNLAKNLRVKDEVEK 128
+RFF +G +GQY AI ++ ++ F+ + + +
Sbjct: 70 SRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFL 129

Query: 129 YI-ILAENSFLKSLSIR-DDFKDSLFAISNM 157
++A+ + K LS R +++ M
Sbjct: 130 AQELIADKTEFKELSTRVSSMLEAIKLKKEM 160


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
L144_RS01500  SHAPEPROTEIN  56     8e-11    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 55.9 bits (135), Expect = 8e-11
Identities = 50/226 (22%), Positives = 87/226 (38%), Gaps = 24/226 (10%)

Query: 160 TGSSSSSQNLVR-CVNRAGFAVDEVVLGSLASSYATLSKEEREMGVLFIDMGKGTTDIIL 218
G++ + +R AG ++ +A++ G + +D+G GTT++ +
Sbjct: 116 VGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAV 175

Query: 219 YIDGSPYYTGVIPIGVNRVTLDIAQVWK------VPEDVAENIKITAGIAHPSILESQME 272
Y+ + IG +R I + + E AE IK G A+P ++E
Sbjct: 176 ISLNGVVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIE 235

Query: 273 TVIIPNLGTRPPQ--EKSRKELSVIINSRLREIFEMMKAEI------LKRGLYNKINGGI 324
V NL P+ + E+ + L I + + L + + G+
Sbjct: 236 -VRGRNLAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISER---GM 291

Query: 325 VLTGGGALFPGISNLIEEVFNYPARIGL-PMSINGIGE----EHID 365
VLTGGGAL + L+ E P + P++ G E ID
Sbjct: 292 VLTGGGALLRNLDRLLMEETGIPVVVAEDPLTCVARGGGKALEMID 337


4     L144_RS01730  L144_RS01755  Y     N          N                   Genomic Island
LocusTag      DNBias  CDNBias  %GCBias    Product
L144_RS01730  -1      10       -3.017724  pyruvate kinase
L144_RS01735  0       10       -5.154275  AmmeMemoRadiSam system protein B
L144_RS01740  0       13       -4.644554  50S ribosomal protein L28
L144_RS01745  0       13       -4.984776  hypothetical protein
L144_RS01750  0       13       -5.160375  hypothetical protein
L144_RS01755  0       17       -3.519261  hypothetical protein
5     L144_RS02000  L144_RS04320  Y     N          N                   Genomic Island
LocusTag      DNBias  CDNBias  %GCBias    Product
L144_RS02000  2       11       -0.201571  dicarboxylate/amino acid:cation symporter
L144_RS02005  0       11       0.900572   proline--tRNA ligase
L144_RS02010  3       12       2.005677   DUF2259 domain-containing protein
L144_RS02015  4       11       2.438633   hypothetical protein
L144_RS02020  3       11       2.411959   DUF3996 domain-containing protein
L144_RS02025  4       10       0.996038   DUF3996 domain-containing protein
L144_RS02030  3       9        0.510417   mannose-6-phosphate isomerase, class I
L144_RS04320  3       12       -0.204841  PTS transporter subunit EIIA
6     L144_RS03195  L144_RS03265  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
L144_RS03195  2       24       -2.918977  PTS transporter subunit EIIA
L144_RS03200  2       24       -3.713696  1-phosphofructokinase
L144_RS03205  2       21       -3.814880  hypothetical protein
L144_RS03215  2       18       -3.405510  *exodeoxyribonuclease V subunit alpha
L144_RS03220  2       12       -2.605401  exodeoxyribonuclease V subunit beta
L144_RS03225  1       10       -1.562680  nicotinate phosphoribosyltransferase
L144_RS03230  1       9        0.223154   glucose-6-phosphate dehydrogenase
L144_RS03235  2       8        0.158790   Na+/H+ antiporter NhaC family protein
L144_RS03240  3       13       0.719277   Na+/H+ antiporter NhaC family protein
L144_RS03245  3       18       -0.024437  ABC transporter substrate-binding protein
L144_RS03250  4       22       -0.256756  ABC transporter permease
L144_RS03255  1       15       1.728440   ABC transporter permease
L144_RS03260  2       13       2.156090   ABC transporter ATP-binding protein
L144_RS03265  2       14       2.433103   ribosome biogenesis GTPase YlqF
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits      Score  E-value  Comments
L144_RS03215  MYCMG045  30     0.020    Hypothetical mycoplasma lipoprotein (MG045) signature.
		>MYCMG045#Hypothetical mycoplasma lipoprotein (MG045) signature.

Length = 483

Score = 30.4 bits (68), Expect = 0.020
Identities = 30/113 (26%), Positives = 58/113 (51%), Gaps = 12/113 (10%)

Query: 76 LLAKDIQNTIIFTKDNLEKTNKSYNKLIKILKGLETFGNLETIKNIVLLLK--KNNILME 133
L+ +D+ + I +++ NL+K++ S +K+ + F +++IK I K KNN L+
Sbjct: 84 LIERDLLSPIDWSQFNLKKSSSSSDKVNNASDAKDLF--IDSIKEISQQTKDSKNNELLH 141

Query: 134 FNKLKITTPLILENNIYIYTQKNYREEEE---LIKQIIKRLENHKSELNDNKI 183
+ P L+N +++Y + E E+ +IK + HK NDN++
Sbjct: 142 W-----AVPYFLQNLVFVYRGEKISELEQENVSWTDVIKAIVKHKDRFNDNRL 189


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits      Score  E-value  Comments
L144_RS03250  MYCMG045  35     5e-04    Hypothetical mycoplasma lipoprotein (MG045) signature.
		>MYCMG045#Hypothetical mycoplasma lipoprotein (MG045) signature.

Length = 483

Score = 34.7 bits (79), Expect = 5e-04
Identities = 25/120 (20%), Positives = 59/120 (49%), Gaps = 4/120 (3%)

Query: 1 MKKIFILIAILTTFACTNKDTITLNVFNWAEYIDETLLDQFEKENNIKINYEIFHNNEEM 60
+K F + + + ++ + T + N+ YI LL++ ++++ + + + +NE++
Sbjct: 5 LKYCFFSLFVSLSSILSSCGSTTFVLANFESYISPLLLERVQEKH--PLTFLTYPSNEKL 62

Query: 61 MAKFNNTKNYYDIIVPSEYLIQELIDEGKIEKLDYSKLPNVTKNITQNLTNLEHDPGNLY 120
+ F N N Y + V S Y + ELI+ + +D+S+ + + + N D +L+
Sbjct: 63 INGFAN--NTYSVAVASTYAVSELIERDLLSPIDWSQFNLKKSSSSSDKVNNASDAKDLF 120


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
L144_RS03265  PF05272  31     0.008    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.8 bits (69), Expect = 0.008
Identities = 10/20 (50%), Positives = 12/20 (60%)

Query: 36 ITLLGPSGCGKTTLIKILGG 55
+ L G G GK+TLI L G
Sbjct: 599 VVLEGTGGIGKSTLINTLVG 618


7     L144_RS03525  L144_RS03615  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
L144_RS03525  3       20       2.886183   signal recognition particle protein
L144_RS03530  1       19       1.454997   30S ribosomal protein S16
L144_RS03535  1       20       0.688189   KH domain-containing protein
L144_RS03540  1       20       0.170403   16S rRNA processing protein RimM
L144_RS03545  2       22       0.518622   tRNA (guanosine(37)-N1)-methyltransferase TrmD
L144_RS03550  1       25       0.312323   50S ribosomal protein L19
L144_RS03555  -2      15       -2.422889  hypothetical protein
L144_RS03560  -1      10       -2.418160  pantetheine-phosphate adenylyltransferase
L144_RS03565  0       10       -2.943384  50S ribosomal protein L32
L144_RS03570  -1      9        -2.642078  acyl carrier protein
L144_RS03575  -2      10       -2.496671  ribonuclease III
L144_RS03580  0       9        -1.614831  CCA tRNA nucleotidyltransferase
L144_RS03585  2       15       -0.582854  hypothetical protein
L144_RS03590  3       14       -0.416839  hypothetical protein
L144_RS03605  2       14       0.980444   **endolytic transglycosylase MltG
L144_RS03610  3       14       1.014337   RNA polymerase sigma factor RpoD
L144_RS03615  3       14       0.923863   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
L144_RS03560  LPSBIOSNTHSS  198    4e-68    Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature.

Length = 166

Score = 198 bits (505), Expect = 4e-68
Identities = 57/157 (36%), Positives = 91/157 (57%), Gaps = 3/157 (1%)

Query: 4 AVFPGSFDPITWGHIDLIKRSLAIFDKVIVLVAKNKSKKYFLSDIERFSLTKDVISSLNF 63
A++PGSFDPIT+GH+D+I+R +FD+V V V +N +K+ S ER I+ L
Sbjct: 3 AIYPGSFDPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPMFSVQERLEQIAKAIAHL-- 60

Query: 64 SNVLVDRYSGFIVDYALINSIKFIVRGIRAFNDFDIEFERYLVNNKLNFEIDTIFLPSSA 123
N VD + G V+YA I+RG+R +DF++E + N L +++T+FL +S
Sbjct: 61 PNAQVDSFEGLTVNYARQRQAGAILRGLRVLSDFELELQMANTNKTLASDLETVFLTTST 120

Query: 124 EHLYVRSDFVKELMLKKDVDLSNFVPELVFNRLKSKF 160
E+ ++ S VKE+ + ++ +FVP V L +F
Sbjct: 121 EYSFLSSSLVKEVA-RFGGNVEHFVPSHVAAALYDQF 156


8     L144_RS04065  L144_RS04160  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
L144_RS04065  2       17       4.075631   transcription termination/antitermination
L144_RS04070  0       15       2.864291   translation initiation factor IF-2
L144_RS04075  -1      15       0.169193   30S ribosome-binding factor RbfA
L144_RS04080  -1      15       0.008010   tRNA pseudouridine(55) synthase TruB
L144_RS04085  0       15       0.565598   30S ribosomal protein S15
L144_RS04090  -1      14       0.176916   polyribonucleotide nucleotidyltransferase
L144_RS04095  1       14       -1.545518  hypothetical protein
L144_RS04100  2       11       -1.163427  YjgP/YjgQ family permease
L144_RS04105  2       11       -0.513569  YjgP/YjgQ family permease
L144_RS04110  4       12       -0.944115  tRNA guanosine(34) transglycosylase Tgt
L144_RS04115  3       12       -1.707995  murein biosynthesis integral membrane protein
L144_RS04120  2       10       -1.501095  HEAT repeat domain-containing protein
L144_RS04125  4       12       -2.084023  bifunctional phosphopantothenoylcysteine
L144_RS04355  2       15       -1.963804  DUF997 family protein
L144_RS04130  0       15       -1.641961  sodium/pantothenate symporter
L144_RS04135  0       14       -1.101863  hypothetical protein
L144_RS04140  0       13       -1.097502  UDP-N-acetylmuramate--L-alanine ligase
L144_RS04145  0       13       -1.287175  YicC family protein
L144_RS04150  0       14       -0.554731  AAA family ATPase
L144_RS04155  -1      12       -0.175543  DNA-directed RNA polymerase subunit omega
L144_RS04160  2       14       -2.100052  tRNA (adenosine(37)-N6)-dimethylallyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
L144_RS04070  TCRTETOQM  76     2e-16    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 76.4 bits (188), Expect = 2e-16
Identities = 41/144 (28%), Positives = 60/144 (41%), Gaps = 22/144 (15%)

Query: 385 ITIMGHVDHGKTKLLSVL------------------QNIDINQTESGGITQHIGAYTIVY 426
I ++ HVD GKT L L + + GIT G + +
Sbjct: 6 IGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQW 65

Query: 427 NDREITFLDTPGHEAFTMMRSRGAQVTDIVVLVVSAIDGVMPQTIEAINHAKEANVPIIV 486
+ ++ +DTPGH F R V D +L++SA DGV QT + ++ +P I
Sbjct: 66 ENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPTIF 125

Query: 487 AINKIDLPDSNPDK----IKHQLS 506
INKID + IK +LS
Sbjct: 126 FINKIDQNGIDLSTVYQDIKEKLS 149


9  L144_RS00860  L144_RS00890  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
L144_RS00860  2       9        2.670566   MoxR family ATPase
L144_RS00865  2       9        2.261221   16S rRNA (guanine(527)-N(7))-methyltransferase
L144_RS00870  2       12       2.682508   tRNA uridine-5-carboxymethylaminomethyl(34)
L144_RS00875  4       19       1.429676   tRNA uridine-5-carboxymethylaminomethyl(34)
L144_RS00880  3       26       1.112964   FlbF protein
L144_RS00885  2       21       1.406133   flagellar hook-associated protein FlgK
L144_RS00890  0       14       0.356771   flagellar hook-associated protein 3
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
L144_RS00860  HTHFIS  41     4e-06    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 41.0 bits (96), Expect = 4e-06
Identities = 36/143 (25%), Positives = 55/143 (38%), Gaps = 15/143 (10%)

Query: 33 KEMIDAILMGLLTDGHVLLEGVPGLAKTL---AIQTVSDVLDLEFKRIQ---FTPDLLPS 86
+E+ + + TD +++ G G K L A+ + F I DL+ S
Sbjct: 147 QEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIES 206

Query: 87 DLTGNMVYKSA-TGTFKVRKGPVFS----NVILADEINRAPAKVQSALLEAMGERQVT-L 140
+L G K A TG G F + DEI P Q+ LL + + + T +
Sbjct: 207 ELFG--HEKGAFTGAQTRSTG-RFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTV 263

Query: 141 GDETHRLPDPFFVLATQNPIEQE 163
G T D V AT ++Q
Sbjct: 264 GGRTPIRSDVRIVAATNKDLKQS 286


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
L144_RS00875  TCRTETOQM  34     0.002    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 33.7 bits (77), Expect = 0.002
Identities = 34/112 (30%), Positives = 48/112 (42%), Gaps = 14/112 (12%)

Query: 229 GSVNAGKSSLFNLFLKKDRSIVSSYPGTTRDYIEASFELDGILFNLFDTAGLRDADNFVE 288
GSV+ G + N L++ R I T + I SF+ + N+ DT G D V
Sbjct: 34 GSVDKGTTRTDNTLLERQRGI------TIQTGI-TSFQWENTKVNIIDTPGHMDFLAEVY 86

Query: 289 RLGIEKSNSLIKEASLVIYVIDVSSNLTKDDFLFIDSNKSNSKILFVLNKID 340
R S S++ A L+I D T+ LF K +F +NKID
Sbjct: 87 R-----SLSVLDGAILLISAKDGVQAQTR--ILFHALRKMGIPTIFFINKID 131


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
L144_RS00885  FLGHOOKAP1  521    0.0      Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 521 bits (1344), Expect = 0.0
Identities = 138/633 (21%), Positives = 260/633 (41%), Gaps = 105/633 (16%)

Query: 6 SGIEIGKRSLFAHKDAMNTVGHNLSNATKPGYSRQRVTMKTEIPLYAPQLNRAKKQGQLG 65
S I L A + A+NT +N+S+ GY+RQ M A + G +G
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIM-------AQANSTLGAGGWVG 54

Query: 66 QGIVVQSIDRVKDELLNTRIIEESHRLGYWTSQDKFISILEDVYNEPEDQSIRKRLNDFW 125
G+ V + R D + ++ + T++ + +S ++++ + S+ ++ DF+
Sbjct: 55 NGVYVSGVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLST-STSSLATQMQDFF 113

Query: 126 ESWHDLANQPQGLAERKIILERGKSFCEGIRNRFHSLERIYIMANDEIKI----TTDEAN 181
S L + + A R+ ++ + EG+ N+F + ++ + ++ I + D+ N
Sbjct: 114 TSLQTLVSNAEDPAARQALIGKS----EGLVNQFKTTDQYLRDQDKQVNIAIGASVDQIN 169

Query: 182 NYIRNIANLNKQISKSQAMK--DNPNDLMDARDLMVEKLGNIISVSIENKQDPNEFLIHA 239
NY + IA+LN QIS+ + +PN+L+D RD +V +L I+ V + + + A
Sbjct: 170 NYAKQIASLNDQISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMA 229

Query: 240 EGRHLVQGSIANEF-KLEATNGPTRTRWNIL--WANN---DKAYLKTGKLGSLLNIRDEE 293
G LVQGS A + + ++ P+RT + A N + L TG LG +L R ++
Sbjct: 230 NGYSLVQGSTARQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQD 289

Query: 294 IKNEINELNNIAANIIEIVNEIHEAGRGMDKKNGRSFFSQELKLTDDRGRYDTNGNGQFD 353
+ N L +A E N H+AG + G FF+ + N + D
Sbjct: 290 LDQTRNTLGQLALAFAEAFNTQHKAGFDANGDAGEDFFAIG------KPAVLQNTKNKGD 343

Query: 354 -SVHIFKINSTNEIFPEEKLGFYGTLKFEATNSNEIVEIPYNAPDTVQDVINRINNSNAQ 412
++ +++ + + K+ F +++ T A +T V N A
Sbjct: 344 VAIGATVTDASAVLATDYKISFDNN-QWQVTRL---------ASNTTFTVTPDANGKVAF 393

Query: 413 VTARINSEGKLEIKAVKEQEDENITFKIKHIEDSGSFLTKYTGILNASGPEGAYDYKNID 472
+ G AV + +F +K + D I+N ++
Sbjct: 394 DGLELTFTGTP---AVND------SFTLKPVSD---------AIVNM----------DVL 425

Query: 473 TTD--KLAPKSTYSISPLKNPAAWIKVADIIDSDPSKIASGIKNPTNEISIGDNQAALRI 530
TD K+A S + A DSD + + +N ++G ++
Sbjct: 426 ITDEAKIAMAS-------EEDAG--------DSDNRNGQALLDLQSNSKTVGGAKSF--N 468

Query: 531 SSFGNSQIMIGKNLTLNDYFANTASNIAIKGQISEITKESQSQILKDLTDLRMSISGVNK 590
++ + IG + T N+ +++++ + Q SISGVN
Sbjct: 469 DAYASLVSDIGNKTATLKTSSATQGNV-----VTQLSNQQQ------------SISGVNL 511

Query: 591 DEELANMIEFQQAFIAASKFITVSVELIDTVIN 623
DEE N+ FQQ ++A ++ + + + D +IN
Sbjct: 512 DEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
L144_RS00890  FLAGELLIN  58     4e-11    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 57.7 bits (139), Expect = 4e-11
Identities = 37/133 (27%), Positives = 59/133 (44%), Gaps = 5/133 (3%)

Query: 10 TYENFKTSSAEQESKITKLLENLYKGGKRIVKLRNDPTGVTHAIRLDNDIFKLNVYIKNI 69
T N S + S I +L G RI ++D G A R ++I L +N
Sbjct: 13 TQNNLNKSQSSLSSAIERL-----SSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNA 67

Query: 70 DTSKSNLRYTEGYLQSLTNILTRAKEIAIQGASGTYESDDKKMISKEVNALLEDVVAIAN 129
+ S + TEG L + N L R +E+++Q +GT D K I E+ LE++ ++N
Sbjct: 68 NDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSN 127

Query: 130 AKGPDGYSIFSGT 142
+G + S
Sbjct: 128 QTQFNGVKVLSQD 140



Score = 31.2 bits (70), Expect = 0.008
Identities = 33/358 (9%), Positives = 87/358 (24%), Gaps = 12/358 (3%)

Query: 61 KLNVYIKNIDTSKSNLRYTEGYLQSLTNILTRAKEIAIQGASGTYESDDKKMISKEVNAL 120
+ + ++ ID L + TY K +
Sbjct: 154 TITIDLQKIDVKSLGLDGFNVNGPKEATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGA 213

Query: 121 LEDVVAIANAKGPDGYSIFSGTKIDSEAFKVTRENKISKTSKDGAGPQIIKVEYNGNQAE 180
+ + +G +A T + T + + +
Sbjct: 214 VVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGK 273

Query: 181 KKTEVYNDIHMSNNYPGNVIFFLQNQNIISSINTNGFAVKENTKIYIDNIEIGLTAGDTA 240
+ + + +S+ I + ++
Sbjct: 274 EGDTFDYK---GVTFTIDTKTGNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSS 330

Query: 241 LDIVAKINESSAPVEASIDPVLNSLSIKTTTPHQIWITEEKESNVLQTLGILTKNNDTKL 300
++ + + LS + + + K+
Sbjct: 331 KNVYTSVVNGQFTFDDKTKNESAKLSDLEANN----AVKGESKITVNGAEYTANAAGDKV 386

Query: 301 PPYNLSSSTEVRSRSIFDALIELRDTLYNNKEELVGSRSLAEIDESLKRLLISVADLGAK 360
+ + + + + E + + S ID +L ++ + LGA
Sbjct: 387 TLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLAS-----IDSALSKVDAVRSSLGAI 441

Query: 361 ENRLDRSYERISKEAADMKEDMIQYTDLDVTKAITNLNMASLAYQVSIGISAKIMQTT 418
+NR D + + ++ + D D ++N++ A + Q + A+ Q
Sbjct: 442 QNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAGTSVLAQANQVP 499


10  L144_RS01350  L144_RS01500  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
L144_RS01350  -1      18       1.260337   flagellar biosynthesis protein FlhF
L144_RS01355  -1      17       0.952054   flagellar biosynthesis protein FlhA
L144_RS01360  0       18       -0.145149  flagellar biosynthesis protein FlhB
L144_RS01365  -1      17       0.588591   flagellar biosynthetic protein FliR
L144_RS01370  0       17       1.734156   flagellar biosynthesis protein FliQ
L144_RS01375  0       16       2.300564   flagellar type III secretion system pore protein
L144_RS01380  1       17       3.157351   flagella biosynthesis regulatory protein FliZ
L144_RS01385  1       18       4.113312   flagellar motor switch protein FliN
L144_RS01390  1       16       4.870427   flagellar motor switch protein FliM
L144_RS01395  0       19       4.464363   flagellar basal body-associated protein FliL
L144_RS01400  -2      19       2.527963   flagellar motor protein MotB
L144_RS01405  -2      19       1.911203   motility protein A
L144_RS01410  -1      15       0.372353   flagellar protein FlbD
L144_RS01415  -2      15       1.119653   flagellar hook protein FlgE
L144_RS01420  1       16       -0.322670  flagellar hook assembly protein FlgD
L144_RS01425  5       17       0.685982   flagellar hook-length control protein FliK
L144_RS01430  6       17       2.962283   flagellar protein
L144_RS01435  5       18       3.234829   flagellar protein FlbA
L144_RS01440  5       19       4.228737   flagellar protein export ATPase FliI
L144_RS01445  6       19       3.954428   flagellar assembly protein FliH
L144_RS01450  4       19       3.543925   flagellar motor switch protein FliG
L144_RS01455  2       19       3.427574   flagellar basal body M-ring protein FliF
L144_RS01460  2       19       2.137844   flagellar hook-basal body complex protein FliE
L144_RS01465  1       18       1.512151   flagellar basal body rod protein FlgC
L144_RS01470  1       19       2.796799   flagellar basal body rod protein FlgB
L144_RS01475  0       19       3.869604   HslU--HslV peptidase ATPase subunit
L144_RS01480  -1      19       3.316218   ATP-dependent protease subunit HslV
L144_RS01485  -2      17       2.495167   DNA-protecting protein DprA
L144_RS01490  -1      16       2.598200   hypothetical protein
L144_RS01495  -1      15       2.513109   cell division protein FtsZ
L144_RS01500  0       14       0.773577   cell division protein FtsA
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
L144_RS01350  PF05272  31     0.011    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.8 bits (69), Expect = 0.011
Identities = 8/23 (34%), Positives = 12/23 (52%)

Query: 176 VFILVGPTGVGKTTTIAKLAAIY 198
+L G G+GK+T I L +
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLD 620


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
L144_RS01360  TYPE3IMSPROT  339    e-117    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 339 bits (871), Expect = e-117
Identities = 101/345 (29%), Positives = 182/345 (52%), Gaps = 9/345 (2%)

Query: 25 RTELPTDQKKQKAREEGRVLKSTEINTAVSLLLLFALFFFMLSYFA---LDLIAVFKEQA 81
+TE PT +K + AR++G+V KS E+ + ++ L A+ + Y+ L+ + EQ+
Sbjct: 5 KTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAEQS 64

Query: 82 IKLPEVMRMSVYTMGFAYIRSIMGYVVLFFFASLAVNFFVNIIQVGFFITFKSLEPRWDK 141
LP +S + + +L A +A+ +++Q GF I+ ++++P K
Sbjct: 65 Y-LPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIA--SHVVQYGFLISGEAIKPDIKK 121

Query: 142 ISFNFSRWAKNSFFSAGAFFNLFKSLLKVVIICLIYYFIIENNIGKISKLSEYTLQSGIS 201
I N AK FS + KS+LKVV++ ++ + II+ N+ + +L ++
Sbjct: 122 I--NPIEGAKR-IFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITP 178

Query: 202 IVLVIAYKICFFSVMFLAIVGVFDYLFQRSQYIESLKMTKEEVKQERKEMEGDPLLRSRI 261
++ I ++ + ++ + DY F+ QYI+ LKM+K+E+K+E KEMEG P ++S+
Sbjct: 179 LLGQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKR 238

Query: 262 KERMRVILSTNLRVAIPQADVVITNPEHFAVAIKWDSETMLAPKVLAKGQDEIALTIKKI 321
++ + I S N+R + ++ VV+ NP H A+ I + P V K D T++KI
Sbjct: 239 RQFHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKI 298

Query: 322 ARENNVPLMENKLLARALYANVKVNEEIPREYWEIVSKILVRVYS 366
A E VP+++ LARALY + V+ IP E E +++L +
Sbjct: 299 AEEEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLER 343
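
Legacy BLAST text output prints very small E-values without a mantissa, as in `Expect = e-117` above, which a plain float conversion rejects. The sketch below shows one way to read the `Score = ... Expect = ...` line and compare the E-value with the RPSBlast threshold of 0.05 listed in the report's input parameters. The helper names are hypothetical, and applying the cutoff as "less than or equal" is an assumption, since the report does not say how PredictBias applies it.

```python
import re

# Assumed cutoff, taken from "E-value (RPSBlast) 0.05" in the input parameters.
E_VALUE_THRESHOLD = 0.05

# Matches e.g. "Score = 76.4 bits (188), Expect = 2e-16"
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)


def parse_expect(text):
    """Convert a BLAST E-value string to float, accepting forms like 'e-117'."""
    text = text.strip().rstrip(",")
    if text.lower().startswith("e"):
        text = "1" + text  # 'e-117' is shorthand for 1e-117
    return float(text)


def parse_score_line(line):
    """Return (bit_score, raw_score, e_value), or None if the line does not match."""
    match = SCORE_RE.search(line)
    if match is None:
        return None
    return float(match.group(1)), int(match.group(2)), parse_expect(match.group(3))


def passes_threshold(e_value, threshold=E_VALUE_THRESHOLD):
    """True when the hit is at least as significant as the cutoff (assumed rule)."""
    return e_value <= threshold


bits, raw, e_value = parse_score_line("Score = 339 bits (871), Expect = e-117")
print(bits, raw, e_value, passes_threshold(e_value))
```

The same parser handles ordinary forms such as `2e-16`, `0.011`, and `0.0`.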


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
L144_RS01365  TYPE3IMRPROT  113    2e-32    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 113 bits (285), Expect = 2e-32
Identities = 46/242 (19%), Positives = 107/242 (44%), Gaps = 4/242 (1%)

Query: 16 VLVRIFMFLKFSPFFSTIKI-GYFNFFFSLILSVIVVEKIKIIYPLDNMLSFALILLGEA 74
L+R+ + +P S + +++++ + + + + +
Sbjct: 19 PLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPANDVPVFSFFALWLAVQQI 78

Query: 75 ILGLIQAFFVNIIFNVFHLVGFFFSNQIGLAYANIFDVFSEEDSMIISQIFAYLFLLLFL 134
++G+ F + F G Q+GL++A D S + ++++I L LLLFL
Sbjct: 79 LIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLARIMDMLALLLFL 138

Query: 135 SSDFLLRFFVIGIHDSVLNIRVEHLVNMRNSGFVKLLLMSFGFLFEKALLISFPILSLLL 194
+ + L + + + D+ + + NS L + +F L+++ P+++LLL
Sbjct: 139 TFNGHL-WLISLLVDTFHTLPI--GGEPLNSNAFLALTKAGSLIFLNGLMLALPLITLLL 195

Query: 195 LFYLVLGILSKSSPQINLLIISFSTSLFLGLLILYIGFPSLAISSKRVIELSLDSLASFL 254
L LG+L++ +PQ+++ +I F +L +G+ ++ P +A + + + LA +
Sbjct: 196 TLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFCEHLFSEIFNLLADII 255

Query: 255 KL 256

Sbjct: 256 SE 257


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
L144_RS01370  TYPE3IMQPROT  61     2e-16    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 61.3 bits (149), Expect = 2e-16
Identities = 21/76 (27%), Positives = 43/76 (56%)

Query: 6 ILYLIRISIENIIILSAPMLIIALIVGLLISIFQAITSIQDQTLSFIPKIIVILLVIVIF 65
+++ ++ ++ILS I+A I+GLL+ +FQ +T +Q+QTL F K++ + L + +
Sbjct: 4 LVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLL 63

Query: 66 GPWILNKLMQFTYMIF 81
W L+ + +
Sbjct: 64 SGWYGEVLLSYGRQVI 79


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
L144_RS01375  FLGBIOSNFLIP  260    3e-90    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 260 bits (667), Expect = 3e-90
Identities = 97/213 (45%), Positives = 138/213 (64%), Gaps = 3/213 (1%)

Query: 41 GGSEIAFSLQLLILLTIITLSPAFLVLMTSFLRISIVLDFIRRALSLQQSPPTQIVMGLA 100
GG + +Q L+ +T +T PA L++MTSF RI IV +R AL +PP Q+++GLA
Sbjct: 34 GGQSWSLPVQTLVFITSLTFIPAILLMMTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLA 93

Query: 101 LFLTIFTMWPTFNSIYEQAYLPLKESKINFNEFYNKGIAPLRIFMYKQMSDGRHEEIRLF 160
LFLT F M P + IY AY P E KI+ E KG PLR FM +Q R ++ LF
Sbjct: 94 LFLTFFIMSPVIDKIYVDAYQPFSEEKISMQEALEKGAQPLREFMLRQT---READLGLF 150

Query: 161 MSMSNYDRPKNFSEVPTHVLIAAFILHELKVAFKMGILIFLPFIVLDIIVASVLMAMGMI 220
++N + VP +L+ A++ ELK AF++G IF+PF+++D+++ASVLMA+GM+
Sbjct: 151 ARLANTGPLQGPEAVPMRILLPAYVTSELKTAFQIGFTIFIPFLIIDLVIASVLMALGMM 210

Query: 221 MLPPVMISLPFKLILFVMVDGWTLITSGLIKSF 253
M+PP I+LPFKL+LFV+VDGW L+ L +SF
Sbjct: 211 MVPPATIALPFKLMLFVLVDGWQLLVGSLAQSF 243


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
L144_RS01385  FLGMOTORFLIN  103    7e-32    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 103 bits (258), Expect = 7e-32
Identities = 40/77 (51%), Positives = 62/77 (80%)

Query: 35 NFGLLMDVSMQLTVELGRTERKIKDILGMSEGTIITLDKLAGEPVDILVNGKIVAKGEVV 94
+ L+MD+ ++LTVELGRT IK++L +++G+++ LD LAGEP+DIL+NG ++A+GEVV
Sbjct: 53 DIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVV 112

Query: 95 VIDENFGVRITEIIKTK 111
V+ + +GVRIT+II
Sbjct: 113 VVADKYGVRITDIITPS 129


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
L144_RS01390  FLGMOTORFLIM  458    e-165    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 458 bits (1181), Expect = e-165
Identities = 195/345 (56%), Positives = 259/345 (75%), Gaps = 9/345 (2%)

Query: 8 LSQDDIDSLLESINSSESLSLDESLSNVISSPTGKKQKVKVYDFKRPDKFSKEQVRTVSS 67
LSQD+ID LL +I+S ++ D + P +K+ +YDF+RPDKFSKEQ+RT+S
Sbjct: 5 LSQDEIDQLLTAISSGDASIED-------ARPISDTRKITLYDFRRPDKFSKEQMRTLSL 57

Query: 68 FHEAFARYTTTSLSALLRKMVHVHVASVDQLTYEEFIRSIPNPTTLAIINMDPLKGSAIF 127
HE FAR TTTSLSA LR MVHVHVASVDQLTYEEFIRSIP P+TLA+I MDPLKG+A+
Sbjct: 58 MHETFARLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVL 117

Query: 128 EVDPTIAFAIVDRLFGGDGDTIKDKSRDLTEIEQSVMESVIIRILANMREAWSQVVDLRP 187
EVDP+I F+I+DRLFGG G K + RDLT+IE SVME VI+RILAN+RE+W+QV+DLRP
Sbjct: 118 EVDPSITFSIIDRLFGGTGQAAKVQ-RDLTDIENSVMEGVIVRILANVRESWTQVIDLRP 176

Query: 188 RFGHIEVNPQFAQIVPPTEMIILVTLEVKIGKVEGLMNFCLPYITIEPIVSKLSTRYWHS 247
R G IE NPQFAQIVPP+EM++LVTLE K+G+ EG+MNFC+PYITIEPI+SKLS+++W S
Sbjct: 177 RLGQIETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFS 236

Query: 248 LIGVGTTSENLDALREKLENTAMPLVAEIGEVKLKVREILSLDKGDVLNLESSLINKDLT 307
+ +T++ + LR+KL M +VAE+G ++L VR+IL L GD++ L + +
Sbjct: 237 SVRRSSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFV 296

Query: 308 LKVGTKEKFKCRMGLMGNKVSVQITEKIGDIKGFDLLKELTEEVE 352
L +G ++KF C+ G++G K++ QI E+I D +EL+ + E
Sbjct: 297 LSIGNRKKFLCQPGVVGKKIAAQILERIESTSQED-FEELSADEE 340


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
L144_RS01400  OMPADOMAIN  55     8e-11    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 54.6 bits (131), Expect = 8e-11
Identities = 28/132 (21%), Positives = 57/132 (43%), Gaps = 21/132 (15%)

Query: 124 ISLAADAFFDSASADVKLEENRDSIQKIASFIGFLSPRGYNFKIEGHTDNIDTDVNGPWK 183
+L +D F+ A +K E + ++ ++ S + L P+ + + G+TD I +D
Sbjct: 215 FTLKSDVLFNFNKATLK-PEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSD-----A 268

Query: 184 SNWELSAARSVNMLEHILNYLDQSDVKRIENNFEVSGFGGSRPIATDDT---------PE 234
N LS R+ + +++YL + + G G S P+ + +
Sbjct: 269 YNQGLSERRA----QSVVDYLISKGIPA--DKISARGMGESNPVTGNTCDNVKQRAALID 322

Query: 235 GRAYNRRIDILI 246
A +RR++I +
Sbjct: 323 CLAPDRRVEIEV 334


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
L144_RS01415  FLGHOOKAP1  50     1e-08    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 50.3 bits (120), Expect = 1e-08
Identities = 18/89 (20%), Positives = 33/89 (37%), Gaps = 13/89 (14%)

Query: 4 SLYSGVSGLQNHQTRMDVVGNNIANVNTIGFKKGRINFQDMISQSISGASRPTDARGGTN 63
+ + +SGL Q ++ NNI++ N G+ + I + T GG
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTT---------IMAQANSTLGAGGW- 52

Query: 64 PKQVGLGMNVASIDTIHTQGAFQSTQKAS 92
VG G+ V+ + + + A
Sbjct: 53 ---VGNGVYVSGVQREYDAFITNQLRAAQ 78



Score = 40.7 bits (95), Expect = 1e-05
Identities = 9/48 (18%), Positives = 28/48 (58%)

Query: 394 IRSGVLEMANVDLAEQFTDMIVTQRGFQANAKTITTSDQLLQELVRLK 441
+ + ++ V+L E++ ++ Q+ + ANA+ + T++ + L+ ++
Sbjct: 499 LSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
L144_RS01425  FLGHOOKFLIK  40     1e-05    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 39.8 bits (92), Expect = 1e-05
Identities = 24/82 (29%), Positives = 45/82 (54%), Gaps = 5/82 (6%)

Query: 281 KLVLKPKELGSIRINLNLDSNNNLLGKIVVDNQNVKMLFDQNMHSLNKMLGESGFNASLN 340
+L L P++LG ++I+L +D N + ++V +Q+V+ + + L L ESG +
Sbjct: 260 ELRLHPQDLGEVQISLKVDDNQAQI-QMVSPHQHVRAALEAALPVLRTQLAESGIQLGQS 318

Query: 341 LFLAGENLNSFTGNFKDDSKDQ 362
++GE SF+G + S+ Q
Sbjct: 319 -NISGE---SFSGQQQAASQQQ 336


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
L144_RS01430  TYPE4SSCAGX  29     0.016    Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature.

Length = 522

Score = 29.0 bits (64), Expect = 0.016
Identities = 34/143 (23%), Positives = 66/143 (46%), Gaps = 20/143 (13%)

Query: 39 RYFPEFVRTKLLGETSLVFDHNSNIILDEAR--LVKEREAIDIKNQQIEKLKEDLKLKED 96
R + EF++TK L+ D L+E + L KE+EA + + Q+ +K K + + +E
Sbjct: 120 RDYQEFLKTK-----KLIVDAPDPKELEEQKKALEKEKEAKE-QAQKAQKDKREKRKEER 173

Query: 97 SLNKLEFELKQKQKDLDLKQKIIDDIINKYNDEEANILQTAVYLMNMPPEDAVKRLEDLN 156
+ N+ E + + + N N L + D ++RLED+
Sbjct: 174 AKNRANLE------------NLTNAMSNPQNLSNNKNLSELIKQQRENELDQMERLEDMQ 221

Query: 157 PELAISYMRKIEELSKKEGRLSI 179
+ + +++IEEL+KK+ ++
Sbjct: 222 EQAQANALKQIEELNKKQAEEAV 244


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
L144_RS01445  FLGFLIH  47     3e-08    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 46.7 bits (110), Expect = 3e-08
Identities = 43/197 (21%), Positives = 96/197 (48%), Gaps = 22/197 (11%)

Query: 115 KESIETESNAEIER-LAREYEEKLKTDLDIAIAKGREEGYSKGY--------ESGFEDFD 165
+E+I E+ +E+ LA+ + + IA+GR++G+ +GY E G +
Sbjct: 29 EETIIEEAEPSLEQQLAQLQMQAHEQGYQAGIAEGRQQGHKQGYQEGLAQGLEQGLAEAK 88

Query: 166 KVMRKLHAIIASLIAERKGILESSSGQIVSLVMQIAIKVIKRITDSQKDI----VLENVN 221
+HA + L++E + L++ I S +MQ+A++ +++ + +++ +
Sbjct: 89 SQQAPIHARMQQLVSEFQTTLDALDSVIASRLMQMALEAARQVIGQTPTVDNSALIKQIQ 148

Query: 222 EVLKR---VKDKTQITIRVNLDDLDIVRHKKSDFISRFDIIENLEIIEDPNIGKGGCIIE 278
++L++ K Q +RV+ DDL V +S + + DP + GGC +
Sbjct: 149 QLLQQEPLFSGKPQ--LRVHPDDLQRVDDMLGATLS----LHGWRLRGDPTLHPGGCKVS 202

Query: 279 TNFGEIDARISSQLDKI 295
+ G++DA ++++ ++
Sbjct: 203 ADEGDLDASVATRWQEL 219


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
L144_RS01450  FLGMOTORFLIG  427    e-153    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 427 bits (1100), Expect = e-153
Identities = 344/344 (100%), Positives = 344/344 (100%)

Query: 1 MEEKKEKEILDVSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITS 60
MEEKKEKEILDVSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITS
Sbjct: 1 MEEKKEKEILDVSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITS 60

Query: 61 ELKDNVLLEFKELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFV 120
ELKDNVLLEFKELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFV
Sbjct: 61 ELKDNVLLEFKELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFV 120

Query: 121 RRADPANILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPE 180
RRADPANILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPE
Sbjct: 121 RRADPANILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPE 180

Query: 181 VVREVERVLEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEI 240
VVREVERVLEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEI
Sbjct: 181 VVREVERVLEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEI 240

Query: 241 KKKMFVFEDIVLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKED 300
KKKMFVFEDIVLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKED
Sbjct: 241 KKKMFVFEDIVLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKED 300

Query: 301 MEFLGPTRRKDVEESQQKIVSLIRKLEEQGEIVISRGGEEDVLV 344
MEFLGPTRRKDVEESQQKIVSLIRKLEEQGEIVISRGGEEDVLV
Sbjct: 301 MEFLGPTRRKDVEESQQKIVSLIRKLEEQGEIVISRGGEEDVLV 344


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
L144_RS01455  FLGMRINGFLIF  162    1e-45    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 162 bits (412), Expect = 1e-45
Identities = 112/565 (19%), Positives = 222/565 (39%), Gaps = 55/565 (9%)

Query: 23 QKIALGLIIFFVILALVFLIGFSTKSQSIALF-GVEIKDQYLLDRISQRLDRENVKYFLS 81
+I L + + +V ++ ++ LF + +D I +L + N+ Y +
Sbjct: 23 PRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQD---GGAIVAQLTQMNIPYRFA 79

Query: 82 SDGRIYLDDEKLAKKMRAILVREELVPVHMDPWALFDIDRWTITDFERSINLRRSITRAV 141
+ ++R L ++ L + L D +++ I+ F +N +R++ +
Sbjct: 80 NGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQVNYQRALEGEL 139

Query: 142 EQHIVALDDVDAVSVNLVMPEKALFKESQEPVKASVRITPRPGSDIITNRKKVEGLVKLI 201
+ I L V + V+L MP+ +LF Q+ ASV +T PG + + ++ +V L+
Sbjct: 140 ARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRAL--DEGQISAVVHLV 197

Query: 202 QYAIEGLESDNIAIVDNSGTILNDFSNLDGIDRIDLAEKERKLKLKYEAMLRGEIDSALS 261
A+ GL N+ +VD SG +L SN G DL + + K E+ ++ I++ LS
Sbjct: 198 SSAVAGLPPGNVTLVDQSGHLL-TQSNTSG---RDLNDAQLKFANDVESRIQRRIEAILS 253

Query: 262 KVLSVDRFMIARVNVKLDTSKETTESKEYAPIELQSQDPKASYNTRKVSDSTIISSQTQK 321
++ A+V +LD + + + Y+P KA+ +R+++ S + +
Sbjct: 254 PIVGNGNV-HAQVTAQLDFANKEQTEEHYSP---NGDASKATLRSRQLNISEQVGAGYPG 309

Query: 322 KEYQGQGYSPWGPPGQEGNTPPEYQDLSD-------------ITGKYNESQEIKNVALNE 368
P P TPP Q + + + E N ++
Sbjct: 310 GVPGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSNYEVDR 369

Query: 369 KKSTSEKEPARIVGVSLGIFVDGIWNFVYDEKGDFVIENGMRKREYKPMALEEIKNIEDV 428
++ I +S+ + V+ + + P+ +++K IED+
Sbjct: 370 TIRHTKMNVGDIERLSVAVVVNY---------------KTLADGKPLPLTADQMKQIEDL 414

Query: 429 LQSSFEYKPERGDSITVRNISFDRMNEFREIDENYFASER--FKYFLFIASIVFSLLILV 486
+ + + +RGD++ V N F ++ E F ++ L + L++
Sbjct: 415 TREAMGFSDKRGDTLNVVNSPFSAVDN--TGGELPFWQQQSFIDQLLAAGRWLLVLVVAW 472

Query: 487 FTIFFAISRELERRRRLREEELAKQAHLRRQQALMDG-----GDDIGVDDVVGGIREGDE 541
A+ +L RR EE A Q + +Q + D + R G E
Sbjct: 473 ILWRKAVRPQLTRR---VEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQQRRANQRLGAE 529

Query: 542 LQSNA-ELLAREKPEDVAKLIRTWL 565
+ S ++ P VA +IR W+
Sbjct: 530 VMSQRIREMSDNDPRVVALVIRQWM 554


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
L144_RS01460  FLGHOOKFLIE  86     2e-25    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.

Length = 103

Score = 85.9 bits (212), Expect = 2e-25
Identities = 15/71 (21%), Positives = 38/71 (53%)

Query: 40 TFKDVLINSITDVNKSQLNVSKVTEQAILKPSSIDVHDVVIAMSKANMNLSILKAVVERG 99
+F L ++ ++ +Q E+ L + ++DV+ M KA++++ + V +
Sbjct: 32 SFAGQLHAALDRISDTQTAARTQAEKFTLGEPGVALNDVMTDMQKASVSMQMGIQVRNKL 91

Query: 100 VKAYQDIINIR 110
V AYQ++++++
Sbjct: 92 VAAYQEVMSMQ 102


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
L144_RS01465  FLGHOOKAP1  48     3e-09    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 47.6 bits (113), Expect = 3e-09
Identities = 21/80 (26%), Positives = 35/80 (43%), Gaps = 15/80 (18%)

Query: 5 SSINVASTGLTAQRLRIDVISNNIANVSTSRTPDGGPYRRQRIIFAPRVNNPYWKGPFIP 64
S IN A +GL A + ++ SNNI++ + + Y RQ I A N
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVA------GYTRQTTIMAQ--ANSTLGA---- 49

Query: 65 DYLDNGIGQGVRVASIEKDK 84
+G GV V+ ++++
Sbjct: 50 ---GGWVGNGVYVSGVQREY 66



Score = 44.2 bits (104), Expect = 5e-08
Identities = 11/46 (23%), Positives = 23/46 (50%)

Query: 104 KKGYVELPNVNLVEEMVDMISASRAYEANSTVINSSKSMFRSALAI 149
+ VNL EE ++ + Y AN+ V+ ++ ++F + + I
Sbjct: 500 SNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
L144_RS01475  HTHFIS  31     0.011    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 31.0 bits (70), Expect = 0.011
Identities = 13/70 (18%), Positives = 30/70 (42%), Gaps = 15/70 (21%)

Query: 20 KYIIGQDEAKKLVSIALVNRYIRSRLPKEIKDEVMPKNIIMIGSTGIGKTEIAR---RLS 76
++G+ A + + ++ R +++ L +++ G +G GK +AR
Sbjct: 137 MPLVGRSAAMQEI-YRVLARLMQTDLT-----------LMITGESGTGKELVARALHDYG 184

Query: 77 KLIKAPFIKV 86
K PF+ +
Sbjct: 185 KRRNGPFVAI 194


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
L144_RS01490  SYCDCHAPRONE  29     0.012    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 28.7 bits (64), Expect = 0.012
Identities = 16/91 (17%), Positives = 34/91 (37%), Gaps = 2/91 (2%)

Query: 69 ARFFNLIGLEFFKLGQYGPAIEYFAKNLEINPNNYLSHFYIGVASYNLAKNLRVKDEVEK 128
+RFF +G +GQY AI ++ ++ F+ + + +
Sbjct: 70 SRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFL 129

Query: 129 YI-ILAENSFLKSLSIR-DDFKDSLFAISNM 157
++A+ + K LS R +++ M
Sbjct: 130 AQELIADKTEFKELSTRVSSMLEAIKLKKEM 160


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
L144_RS01500  SHAPEPROTEIN  56     8e-11    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 55.9 bits (135), Expect = 8e-11
Identities = 50/226 (22%), Positives = 87/226 (38%), Gaps = 24/226 (10%)

Query: 160 TGSSSSSQNLVR-CVNRAGFAVDEVVLGSLASSYATLSKEEREMGVLFIDMGKGTTDIIL 218
G++ + +R AG ++ +A++ G + +D+G GTT++ +
Sbjct: 116 VGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAV 175

Query: 219 YIDGSPYYTGVIPIGVNRVTLDIAQVWK------VPEDVAENIKITAGIAHPSILESQME 272
Y+ + IG +R I + + E AE IK G A+P ++E
Sbjct: 176 ISLNGVVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIE 235

Query: 273 TVIIPNLGTRPPQ--EKSRKELSVIINSRLREIFEMMKAEI------LKRGLYNKINGGI 324
V NL P+ + E+ + L I + + L + + G+
Sbjct: 236 -VRGRNLAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISER---GM 291

Query: 325 VLTGGGALFPGISNLIEEVFNYPARIGL-PMSINGIGE----EHID 365
VLTGGGAL + L+ E P + P++ G E ID
Sbjct: 292 VLTGGGALLRNLDRLLMEETGIPVVVAEDPLTCVARGGGKALEMID 337


11  L144_RS01870  L144_RS01925  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
L144_RS01870  -1      11       0.803655   S-ribosylhomocysteine lyase
L144_RS01875  -1      15       1.312535   hypothetical protein
L144_RS01880  0       18       1.687228   HIT family protein
L144_RS01885  0       17       2.163502   magnesium transporter
L144_RS01890  0       17       2.367083   hypothetical protein
L144_RS01895  -1      18       3.645882   BMP family protein
L144_RS01900  0       20       4.142684   BMP family protein
L144_RS01905  1       19       4.498236   BMP family protein
L144_RS01910  1       19       5.057160   BMP family protein
L144_RS01915  2       19       4.982574   30S ribosomal protein S7
L144_RS01920  1       19       4.793305   30S ribosomal protein S12
L144_RS01925  0       19       4.770614   DNA-directed RNA polymerase subunit beta'
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
L144_RS01870  LUXSPROTEIN  188    3e-64    Bacterial autoinducer-2 (AI-2) production protein Lu...
		>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS signature.

Length = 171

Score = 188 bits (479), Expect = 3e-64
Identities = 52/162 (32%), Positives = 86/162 (53%), Gaps = 11/162 (6%)

Query: 4 ITSFTIDHTKLN-PGIYVSR-KDTFENVIFTTIDIRIKAPNIEPIIENAAIHTIEHIGAT 61
+ SFT+DHT++N P + V++ T + T D+R APN + I+ IHT+EH+ A
Sbjct: 3 LDSFTVDHTRMNAPAVRVAKTMQTPKGDTITVFDLRFTAPN-KDILSEKGIHTLEHLYAG 61

Query: 62 LLRNN-EVWTEKIVYFGPMGCRTGFYLIIFGDYESKDLVDLVSWLFSE----IVNFSEPI 116
+RN+ + +I+ PMGCRTGFY+ + G + + D +W+ + V I
Sbjct: 62 FMRNHLNGDSVEIIDISPMGCRTGFYMSLIGTPSEQQVAD--AWIAAMEDVLKVENQNKI 119

Query: 117 PGASDKECGNYKEHNLDMAKYESSKYLQI-LNNIKEENLKYP 157
P ++ +CG H+LD AK + L++ + K + L P
Sbjct: 120 PELNEYQCGTAAMHSLDEAKQIAKNILEVGVAVNKNDELALP 161


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
L144_RS01895LIPPROTEIN48748e-17 Mycoplasma P48 major surface lipoprotein signature.
		>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature.

Length = 428

Score = 74.3 bits (182), Expect = 8e-17
Identities = 83/340 (24%), Positives = 127/340 (37%), Gaps = 34/340 (10%)

Query: 7 IFGILLTSCFSRNGIESSS-KKIKISMLVD-GVLDDKSFNSSANEALLRLKKDFPENIEE 64
I T+ + ++++ K+K ++ D G +DDKSFN SA EAL + K IE
Sbjct: 40 ISKYTTTNANGKQVVKNAELLKLKPVLITDEGKIDDKSFNQSAFEALKAINKQ--TGIEI 97

Query: 65 VFSCAISGVYSSYVSDLDNLKRNGSDLIWLVGYMLTDASL--LVSSENPKISYGIIDPIY 122
S S+Y S L + IW++ S+ + + ++ I I
Sbjct: 98 NNVEPSSNFESAYNSALSAGHK-----IWVLNGFKHQQSIKQYIDAHREELERNQIK-II 151

Query: 123 GDDVQIPEN---LIAVVFRVEQGAFLAGYIAAKKSFSGK------IGFIGGMKGNIVDAF 173
G D I ++ F +++ AF GY A S + + GG V F
Sbjct: 152 GIDFDIETEYKWFYSLQFNIKESAFTTGY-AIASWLSEQDESKRVVASFGGGAFPGVTTF 210

Query: 174 RYGYESGAKYANKDIEIISEYSNSFSDVDIGRT-----------IASKMYSKGIDVIHFA 222
G+ G Y N+ + Y S +D G T + S + H
Sbjct: 211 NEGFAKGILYYNQKHKSSKIYHTSPVKLDSGFTAGEKMNTVINNVLSSTPADVKYNPHVI 270

Query: 223 AGLAGIGVIETAKNLGDGYYVIGADQDQSY-LAPKNFITSVIKNIGDALYLITGEYIKNN 281
+AG ET + G YVIG D DQ +TSV+K+I A+Y + I
Sbjct: 271 LSVAGPATFETVRLANKGQYVIGVDSDQGMIQDKDRILTSVLKHIKQAVYETLLDLILEK 330

Query: 282 NVWEGGKVVQMGLRDGVIGLPNANEFEYIKVLERKIINKE 321
VV+ D + ++I V E N E
Sbjct: 331 EEGYKPYVVKDKKADKKWSHFGTQKEKWIGVAENHFSNTE 370


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
L144_RS01900LIPPROTEIN48702e-15 Mycoplasma P48 major surface lipoprotein signature.
		>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature.

Length = 428

Score = 69.7 bits (170), Expect = 2e-15
Identities = 66/261 (25%), Positives = 99/261 (37%), Gaps = 29/261 (11%)

Query: 30 KVSLIID-GTFDDKSFNESALNGVKKVKEEFKIELVLKESSSNSYLSDLEGLKDAGSDLI 88
K LI D G DDKSFN+SA +K + ++ IE+ E SSN + S AG +
Sbjct: 63 KPVLITDEGKIDDKSFNQSAFEALKAINKQTGIEINNVEPSSN-FESAYNSALSAGHKIW 121

Query: 89 WLIGYRFS------DVAKVAALQNPDMKYAIID-PIYSNDPIPANLVGMTFRAQEGAFLT 141
L G++ A L+ +K ID I + + F +E AF T
Sbjct: 122 VLNGFKHQQSIKQYIDAHREELERNQIKIIGIDFDIETEYKW---FYSLQFNIKESAFTT 178

Query: 142 GYIAAKLSKTGK-----IGFLGGIEGEIVDAFRYGYEAGAKYANKD-----------IKI 185
GY A + GG V F G+ G Y N+ +K+
Sbjct: 179 GYAIASWLSEQDESKRVVASFGGGAFPGVTTFNEGFAKGILYYNQKHKSSKIYHTSPVKL 238

Query: 186 STQYIGSFADLEAGRSVATRMYSDEIDIIHHAAGLGGIGAIEVAKELGSGHYIIGVDEDQ 245
+ + +V + +D H + G E + G Y+IGVD DQ
Sbjct: 239 DSGFTAGEKMNTVINNVLSSTPADVKYNPHVILSVAGPATFETVRLANKGQYVIGVDSDQ 298

Query: 246 AY-LAPDNVITSTTKDVGRAL 265
D ++TS K + +A+
Sbjct: 299 GMIQDKDRILTSVLKHIKQAV 319


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
L144_RS01905LIPPROTEIN48492e-08 Mycoplasma P48 major surface lipoprotein signature.
		>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature.

Length = 428

Score = 48.8 bits (116), Expect = 2e-08
Identities = 58/296 (19%), Positives = 103/296 (34%), Gaps = 50/296 (16%)

Query: 18 FKSNKKSIKSDKV----VVGVLAHGSFYDKGYNQSVHDGVVKLRDNFGIKLITKSLRPYP 73
+ K+ +K+ ++ V + G DK +NQS + + + GI++
Sbjct: 47 NANGKQVVKNAELLKLKPVLITDEGKIDDKSFNQSAFEALKAINKQTGIEINNVE----- 101

Query: 74 IEGKRLLTVDEAMTEDAYEVQKNPLNLFW-LIGYRFSDLSVKL------SYERPDIYYGI 126
+ E AY + + W L G++ + ER I
Sbjct: 102 ---------PSSNFESAYNSALSAGHKIWVLNGFKHQQSIKQYIDAHREELERNQI---K 149

Query: 127 IDAFDYGDIQVPKNSLAIKFRNEEAAFLAGYIAA-----KMSRKEKIGFLTGPMSEHLKD 181
I D+ K +++F +E+AF GY A + K + G +
Sbjct: 150 IIGIDFDIETEYKWFYSLQFNIKESAFTTGYAIASWLSEQDESKRVVASFGGGAFPGVTT 209

Query: 182 FKFGFKAGIFYAN---PKLRLVSK---KAPSLFDKEKGKAMAL-------FMYKEDKVGV 228
F GF GI Y N ++ K S F + + + V
Sbjct: 210 FNEGFAKGILYYNQKHKSSKIYHTSPVKLDSGFTAGEKMNTVINNVLSSTPADVKYNPHV 269

Query: 229 IFPIAGITGLGVYDAAKELGPKYYVIGLNQDQSYI-APQNVITSIIKDIGKVIYSI 283
I +AG ++ + YVIG++ DQ I ++TS++K I + +Y
Sbjct: 270 ILSVAGPA---TFETVRLANKGQYVIGVDSDQGMIQDKDRILTSVLKHIKQAVYET 322


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
L144_RS01910  LIPPROTEIN48  60     3e-12    Mycoplasma P48 major surface lipoprotein signature.
		>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature.

Length = 428

Score = 60.0 bits (145), Expect = 3e-12
Identities = 55/272 (20%), Positives = 97/272 (35%), Gaps = 31/272 (11%)

Query: 28 KTVSLIVDGAFDDKGFNESSSKAIRKLKADLNINIIEKASTGNSYLGDIANLEDGNSNLI 87
K V + +G DDK FN+S+ +A++ + I I +++ + +
Sbjct: 63 KPVLITDEGKIDDKSFNQSAFEALKAINKQTGIE-INNVEPSSNFESAYNSALSAGHKIW 121

Query: 88 WGIGFRLSDILFQ---RASENVSVNYAIIEGV-YDEIQIPKNLLNISFRSEEVAFLAGY- 142
GF+ + Q E + N I G+ +D K ++ F +E AF GY
Sbjct: 122 VLNGFKHQQSIKQYIDAHREELERNQIKIIGIDFDIETEYKWFYSLQFNIKESAFTTGYA 181

Query: 143 ----FASKASKTGKIGFVGGVRGKVLESFMYGYEAGAKYANSNIKVVSQYVGTFGDFGLG 198
+ + + GG + +F G+ G Y N K Y + G
Sbjct: 182 IASWLSEQDESKRVVASFGGGAFPGVTTFNEGFAKGILYYNQKHKSSKIYHTSPVKLDSG 241

Query: 199 ---------------RSTASNMYRDGVDIIFAAAGLSGIGVIEAAKELGPDHYIIGVDQD 243
ST +++ + I+ A + E + Y+IGVD D
Sbjct: 242 FTAGEKMNTVINNVLSSTPADVKYNPHVILSVAGPATF----ETVRLANKGQYVIGVDSD 297

Query: 244 QSY-LAPNNVIVSAVKKVDSLMYS-LTKKYLE 273
Q + ++ S +K + +Y L LE
Sbjct: 298 QGMIQDKDRILTSVLKHIKQAVYETLLDLILE 329


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
L144_RS01925  RTXTOXIND  31     0.037    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 31.0 bits (70), Expect = 0.037
Identities = 7/17 (41%), Positives = 14/17 (82%)

Query: 1192 KHLLVRDGDVVKAGDML 1208
K ++V++G+ V+ GD+L
Sbjct: 108 KEIIVKEGESVRKGDVL 124
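
The percentages on an `Identities` line are the matching counts divided by the alignment length. A minimal sketch of reading such a line and recomputing the percentages follows. The function names are hypothetical, and the truncation (rather than rounding) is an assumption inferred from the values on this page, for example 14/17 shown as 82%.

```python
import re

# Matches each "Name = count/length (percent%)" field on a statistics line.
STATS_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")

def parse_alignment_stats(line):
    """Return {name: (count, length, reported_percent)} for one statistics line."""
    return {name: (int(c), int(n), int(p)) for name, c, n, p in STATS_RE.findall(line)}

def truncated_percent(count, length):
    # Assumption: the report truncates the percentage instead of rounding it.
    return count * 100 // length

line = "Identities = 7/17 (41%), Positives = 14/17 (82%)"
stats = parse_alignment_stats(line)
for name, (count, length, reported) in stats.items():
    # Print the recomputed percentage next to the reported one.
    print(name, truncated_percent(count, length), reported)
```

A line without a `Gaps` field, as in the alignment above, simply yields no `Gaps` entry.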


12  L144_RS03915  L144_RS03935  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias   Product
L144_RS03915  1       22       0.737489  muramidase
L144_RS03920  1       17       1.356275  flagellar P-ring protein
L144_RS03925  0       12       2.595460  hypothetical protein
L144_RS03930  0       11       3.450788  flagellar basal-body rod protein FlgG
L144_RS03935  3       16       2.786620  flagellar basal-body rod protein FlgF
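
The prediction labels on this page are consistent with a simple reading of the Bias and Virulence columns. The sketch below encodes that reading. It is a hypothetical rule inferred only from the rows shown here, not PredictBias's documented logic; the function name is invented, the Insertion elements column is ignored, and the result for the N/N case is an assumption.

```python
def classify_island(bias, virulence):
    """Map the Bias and Virulence flags ('Y' or 'N') to a prediction label.

    Hypothetical rule inferred from the rows on this page.
    """
    if virulence == "Y":
        kind = "biased-composition" if bias == "Y" else "unbiased-composition"
        return f"Pathogenicity Island ({kind})"
    if bias == "Y":
        return "Genomic Island"
    return None  # assumption: with neither signal, no island is reported

# Island 12 has Bias = N and Virulence = Y.
print(classify_island("N", "Y"))
```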
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
L144_RS03915  FLGFLGJ  48     7e-10    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 47.8 bits (113), Expect = 7e-10
Identities = 21/69 (30%), Positives = 39/69 (56%), Gaps = 2/69 (2%)

Query: 34 DLRKASLEFEAMFIKQMLESMKKTLNKDQNLLNGGQVEEIFEDMLCEQRAKQMAQAQSFG 93
++R + + E MF++ ML+SM+ L KD L + ++ M +Q A+QM + G
Sbjct: 32 NIRPVARQVEGMFVQMMLKSMRDALPKDG--LFSSEHTRLYTSMYDQQIAQQMTAGKGLG 89

Query: 94 LADLIYNQL 102
LA+++ Q+
Sbjct: 90 LAEMMVKQM 98


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
L144_RS03920  FLGPRINGFLGI  257    9e-86    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 257 bits (658), Expect = 9e-86
Identities = 83/352 (23%), Positives = 153/352 (43%), Gaps = 60/352 (17%)

Query: 35 SLSESVKLKEIADIYPTNTNFLTGIGIVAGLAGKGDSIKQKDL----IIKILEENNIINE 90
+ +++ ++K+IA + N L G G+V GL G GDS++ + +L+ I +
Sbjct: 24 AQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMRAMLQNLGITTQ 83

Query: 91 IGSNNIESKNIALVNVSLQVKGNTIKGSKHKACVASILDSKDLTNGILLKTNLKNKEGEI 150
G +N +KNIA V V+ + GS+ V+S+ D+ L G L+ T+L +G+I
Sbjct: 84 GGQSN--AKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLIMTSLSGADGQI 141

Query: 151 IAIASGITQPNN-KLKGSGYTI-----------DSVIINEN--QNINHSYNIILKKGN-- 194
A+A G N +G T+ + II S N++L+ N
Sbjct: 142 YAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAIIERELPSKFKDSVNLVLQLRNPD 201

Query: 195 YTLINRIHKILTS---KKINNKI---KSDSTIEIEAKNIS----LLEEIENIKIETN--P 242
++ R+ ++ + + + I + I ++ ++ L+ EIEN+ +ET+
Sbjct: 202 FSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPRVADLTRLMAEIENLTVETDTPA 261

Query: 243 KILIDKKNGIILASENAKI-------GTFTFSIEKDNQNI----FLSKNNKTTIQVNSMK 291
K++I+++ G I+ + +I GT T + + Q I F Q + M
Sbjct: 262 KVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQPAPFSRGQTAVQPQTDIMA 321

Query: 292 LNE----FILK-----------NSNNLSNKELIQIIQAAQKINKLNGELILE 328
+ E I++ NS L +I I+Q + L EL+L+
Sbjct: 322 MQEGSKVAIVEGPDLRTLVAGLNSIGLKADGIIAILQGIKSAGALQAELVLQ 373


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
L144_RS03930  FLGHOOKAP1  45     2e-07    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 44.9 bits (106), Expect = 2e-07
Identities = 10/44 (22%), Positives = 23/44 (52%)

Query: 220 ILEMSNVSIAEEMVTMIVAQRAYEINSKAIQTSDNMLGIANNLK 263
+S V++ EE + Q+ Y N++ +QT++ + N++
Sbjct: 503 QQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546



Score = 39.6 bits (92), Expect = 1e-05
Identities = 19/79 (24%), Positives = 34/79 (43%), Gaps = 14/79 (17%)

Query: 5 LWTAASGMTAQQYNVDTIANNLSNVNTTGFKKIRAEFEDLIYQTHNRAGTPATENTLRPL 64
+ A SG+ A Q ++T +NN+S+ N G+ + A N+
Sbjct: 4 INNAMSGLNAAQAALNTASNNISSYNVAGYTRQTT--------------IMAQANSTLGA 49

Query: 65 GNQVGHGTKIAATQRIFEQ 83
G VG+G ++ QR ++
Sbjct: 50 GGWVGNGVYVSGVQREYDA 68


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
L144_RS03935  FLGHOOKAP1  43     8e-07    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 43.0 bits (101), Expect = 8e-07
Identities = 14/64 (21%), Positives = 28/64 (43%)

Query: 214 DTKTSGKAQEIDISLRPKIETETLEASNVNAVKEMVLMIEINRAYEANQKTIQTEDSLLG 273
T T + ++ ++ + S VN +E + + Y AN + +QT +++
Sbjct: 481 KTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFD 540

Query: 274 KLIN 277
LIN
Sbjct: 541 ALIN 544



Score = 38.4 bits (89), Expect = 2e-05
Identities = 13/39 (33%), Positives = 23/39 (58%)

Query: 4 GIYTAASGMMAERRKLDTVSNNLANIDLIGYKKDLSIQK 42
I A SG+ A + L+T SNN+++ ++ GY + +I
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMA 41



 
Contact Sachin Pundhir for Bugs/Comments.