[BiO BB] command-line (scriptable) ORF finders?
sariego9 at yahoo.com
Sun Sep 17 14:06:58 EDT 2006
There is also the SEALS package from Koonin's group at NCBI,
we use that alot. it has a bunch of command line tools, I believe it
is all in PERL, so you can gut it and reuse.
if you are looking at ESTs, you may also want to look at estscan,
or there is a genewise like est Gene modeler tool the Wise2
package by Birney and Durbin that you may want to look at.
.=$=. .=$=. .=$=. .=$=. .=$=. .=$=.
@ @ | | | @ | | | @ @ | | | @ | | | @ @ | | | @ | | |
| @ @ | | | @ @ | | | @ @ | | | @ @ | | | @ @ | | | @ @ | |
| | @ | | | @ @ | | | @ | | | @ @ | | | @ | | | @ @ |
~' `~$~' `~$~' `~$~' `~$~' `~$~' `~
----- Original Message ----
From: Ann Loraine <aloraine at gmail.com>
To: General Forum at Bioinformatics.Org <bio_bulletin_board at bioinformatics.org>
Sent: Sunday, September 17, 2006 1:19:38 AM
Subject: [BiO BB] command-line (scriptable) ORF finders?
I'm hoping someone on the list who is involved with EST or full-length
cDNA sequencing projects can help me with something (well..two
(1) I am looking for a command-line, scriptable tool that can take as
input an EST, cDNA, or assembled EST contig ("unigene") sequence and
return the most likely or longest open reading frame. This is for a
plant EST project. It should also pay attention to codon usage rules.
(2) I am also looking for a tool that can take as input a set of exon
annotations (or mRNA-to-genome alignments) and return the most likely
CDS start and end for the given gene structure. Tools that can jigger
the alignment/exon boundaries to optimize the ORF *and* which pay
attention to codon usage rules would be extra great. This is for
deducing novel gene structures from cross-species mRNA-to-genome
alignments. Maybe there is a gene-finder that does this?
I've found a variety of web sites that claim to do this, but, as you
know, Web sites don't really cut it when you are working with
thousands of sequences. And also, I would like to see the code in case
I run into problems.
Any thoughts or suggestions (other than pointers to Web tools, please)
would be greatly appreciated!
Section on Statistical Genetics
University of Alabama at Birmingham
General Forum at Bioinformatics.Org - BiO_Bulletin_Board at bioinformatics.org
More information about the BBB