[Biodevelopers] Blast - exact same sequence only gives 98%

Phil Princely phil.princely at gmail.com
Thu Mar 22 22:40:29 EDT 2007


Hi all,

I'm new here, so sorry if this is a bit of an obvious question. I've
been using blast for a while now, but am still learning. Here's my
problem:

I used formatdb to make a blast database from a text file with about
2000 genes. Everything went well, and I could query the database. But
when I input a sequence from the original text file, the result isn't
always 100%. Sometimes it comes out 98% or 95%, when it should always
be 100%. When I look at the results, I find one or more series of xs,
signifying a missing part of data. For example:

Score = 1815 bits (4702), Expect = 0.0
 Identities = 913/953 (95%), Positives = 913/953 (95%)

LTLDRLSNTLSGGESQRISLATQXXXXXXXXXXXXDEPSIGLHQ (Query)
LTLDRLSNTLSGGESQRISLATQ            DEPSIGLHQ
LTLDRLSNTLSGGESQRISLATQLGSSLVGSLYVLDEPSIGLHQ (Subject)

Is there a way to make this 100%. I want to run the 2000 genes against
another genome to find 100% similar regions, 95% similar regions and
so on.

Thanks

Phil P


More information about the Biodevelopers mailing list