[BiO BB] Re: Quickly retrieving cross-referenced records from NCBI
J.W. Bizzaro
jeff at bioinformatics.org
Wed Dec 20 17:18:26 EST 2006
Stan, the attached script got removed. Please resend it in-line (in the message body), or send a link. Thanks, Jeff
Gaj Stan (BIGCAT) wrote:
> Dear Dale,
>
> I encountered the same question a few weeks ago, but my focus was the
> other way around: go from NM to NP. For that I've written a Perl script
> that I've adjusted to fit your needs (so going for NP to NM).
>
> If I'm correct, RefSeq splits it's database in three parts: genomic,
> mRNA and protein. For this script to work, you need a) to download a
> species-specific RefSeq mRNA database (ends with .rna.gbff) for the NCBI
> ftp and b) to have your own file of convertable IDs, sorted in a
> list-form..
> Note that this script will NOT detect version numbers: e.g. XP_12345.1
> needs to be converted to XP_12345 in your list before it does it's job!
>
> Although the code is far from perfect, it fulfills your question
> perfectly (-;
>
> Best wishes,
>
> Stan
--
J.W. Bizzaro
Bioinformatics Organization, Inc. (Bioinformatics.Org)
E-mail: jeff at bioinformatics.org
Phone: +1 508 890 8600
--
More information about the BBB
mailing list