[BiO BB] protein sequence for all organism
yvan.strahm at gmail.com
Tue Nov 18 04:17:41 EST 2008
Thanks every one for the tips. I ended up using the taxonomy files from
genbank and uniprot and sorting them according to the organism.
On Tue, Nov 11, 2008 at 9:50 PM, Hongyu Zhang <me at hongyu.org> wrote:
> My solution is to download the taxonomy files from Genebank, which contain
> the information of the taxonomy numbers for all GI numbers and the
> hierarchical taxonomy tree structure. You can write a program to partition
> the protein NR file into separated files/folders, each belonging to a
> specific taxonomy number that is a descendant of the eukaryote node in the
> taxonomy tree.
> The location of the Genbank taxonomy files is
> BBB mailing list
> BBB at bioinformatics.org
More information about the BBB