[Bioclusters] NCBI database download and format code

Joseph Landman bioclusters@bioinformatics.org
02 May 2003 02:05:46 -0400


On Thu, 2003-05-01 at 18:29, Jeremy Mann wrote:
> I am curious if any knows of any commercial or open source solution to
> breaking up the NCBI dbs into various sizes. Here, our present solution is

You can use the "formatdb -v N" option to have the database
automatically divided into groups of N x 10**6 letters.  I would
recommend this route for the database formatting side.  Keep the
original db around for the other tools.

I am working on a fast segmenter.  Should be done soon.

-- 
Joseph Landman, Ph.D
Scalable Informatics LLC
email: landman@scalableinformatics.com
  web: http://scalableinformatics.com
phone: +1 734 612 4615