Members:
data Record with SwissProt data.
Methods
|
|
|
|
__init__
|
__init__ ( self )
|
|
_chomp
|
_chomp (
self,
word,
to_chomp='.,;',
)
|
|
_clean_record
|
_clean_record ( self, rec )
|
|
_clean_references
|
_clean_references ( self, ref )
|
|
_fix_varsplic_sequences
|
_fix_varsplic_sequences ( self, description )
Remove unwanted spaces in sequences.
During line carryover, the sequences in VARSPLIC can get mangled
with unwanted spaces like:
DISSTKLQALPSHGLESIQT -> PCRATGWSPFRRSSPC LPTH
We want to check for this case and correct it as it happens.
|
|
accession
|
accession ( self, line )
|
|
comment
|
comment ( self, line )
|
|
database_cross_reference
|
database_cross_reference ( self, line )
|
|
date
|
date ( self, line )
Exceptions
|
|
SyntaxError, "I don't understand the date line %s" % line
|
|
|
description
|
description ( self, line )
|
|
end_record
|
end_record ( self )
|
|
feature_table
|
feature_table ( self, line )
|
|
gene_name
|
gene_name ( self, line )
|
|
identification
|
identification ( self, line )
Exceptions
|
|
SyntaxError, "Unrecognized data class %s in line\n%s" %( self.data.data_class, line )
SyntaxError, "Unrecognized molecule type %s in line\n%s" %( self.data.molecule_type, line )
|
|
|
keyword
|
keyword ( self, line )
|
|
organelle
|
organelle ( self, line )
|
|
organism_classification
|
organism_classification ( self, line )
|
|
organism_species
|
organism_species ( self, line )
|
|
reference_author
|
reference_author ( self, line )
|
|
reference_comment
|
reference_comment ( self, line )
|
|
reference_cross_reference
|
reference_cross_reference ( self, line )
Exceptions
|
|
AssertionError( "I don't understand RX line %s" % line )
|
|
|
reference_location
|
reference_location ( self, line )
|
|
reference_number
|
reference_number ( self, line )
|
|
reference_position
|
reference_position ( self, line )
|
|
reference_title
|
reference_title ( self, line )
|
|
sequence_data
|
sequence_data ( self, line )
|
|
sequence_header
|
sequence_header ( self, line )
|
|
start_record
|
start_record ( self )
|
|
taxonomy_id
|
taxonomy_id ( self, line )
The OX line is in the format:
OX DESCRIPTION=ID[, ID]...;
If there are too many id's to fit onto a line, then the ID's
continue directly onto the next line, e.g.
OX DESCRIPTION=ID[, ID]...
OX ID[, ID]...;
Currently, the description is always "NCBI_TaxID".
|
|
terminator
|
terminator ( self, line )
|