Table of Contents

Module: __init__ Bio/SeqUtils/__init__.py
Imported modules   
from Bio import Fasta, Alphabet
from Bio.Alphabet import IUPAC
from Bio.Data import IUPACData, CodonTable
from Bio.Seq import Seq
from Bio.Tools import Translate
import os
import sys
Functions   
GC
GC123
GC_Frame
GC_skew
antiparallel
apply_on_multi_fasta
complement
fasta_uniqids
makeTableX
molecular_weight
nt_search
quick_FASTA_reader
quicker_apply_on_multi_fasta
reverse
seq3
six_frame_translations
translate
  GC 
GC ( seq )

calculates G+C content

  GC123 
GC123 ( seq )

calculates totla G+C content plus first, second and third position

  GC_Frame 
GC_Frame ( seq,  genetic_code=1 )

just an alias for six_frame_translations

  GC_skew 
GC_skew ( seq,  window=100 )

calculates GC skew (G-C)/(G+C)

  antiparallel 
antiparallel ( seq )

returns reversed complementary sequence ( = other strand )

  apply_on_multi_fasta 
apply_on_multi_fasta (
        file,
        function,
        *args,
        )

apply function on each sequence in a multiple FASTA file

Exceptions   
NotImplementedError, "%s not implemented" % function
  complement 
complement ( seq )

returns the complementary sequence (NOT antiparallel)

  fasta_uniqids 
fasta_uniqids ( file )

checks and changes the name/ID's to be unique identifiers by adding numbers

  makeTableX 
makeTableX ( table )

  molecular_weight 
molecular_weight ( seq )

  nt_search 
nt_search ( seq,  subseq )

search for a DNA subseq in sequence use ambiguous values (like N = A or T or C or G, R = A or G etc.) searches only on forward strand

  quick_FASTA_reader 
quick_FASTA_reader ( file )

simple and FASTA reader, preferable to be used on large files

  quicker_apply_on_multi_fasta 
quicker_apply_on_multi_fasta (
        file,
        function,
        *args,
        )

apply function on each sequence in a multiple FASTA file

Exceptions   
NotImplementedError, "%s not implemented" % function
  reverse 
reverse ( seq )

reverse the sequence

  seq3 
seq3 ( seq )

Method that returns the amino acid sequence as a list of three letter codes. Output follows the IUPAC standard plus Ter for terminator. Any unknown character, including the default unknown character X, is changed into Xaa. A noncoded aminoacid selenocystein is recognized (Sel, U).

  six_frame_translations 
six_frame_translations ( seq,  genetic_code=1 )

nice looking 6 frame translation with GC content - code from xbbtools similar to DNA Striders six-frame translation

  translate 
translate (
        seq,
        frame=1,
        genetic_code=1,
        translator=None,
        )

translation of DNA in one of the six different reading frames

Exceptions   
ValueError, 'invalid frame'
Classes   
MissingTable
ProteinX

Table of Contents

This document was automatically generated on Mon Jul 1 12:03:20 2002 by HappyDoc version 2.0.1