Preparing document for printing:
When printing, backgrounds are automatically turned to white.

Introduction

[Menu-bar> Help>Selected topics]

Making sequence alignments of related protein sequences is a common task in biology. It reveals regions which are highly conserved and are functionally important. Automatic approaches alone are often not fully satisfactory and manual refinement is necessary in most cases particularly when the sequences are very dissimilar.

STRAP is a comfortable and comprehensive tool to edit multiple protein sequence alignments. A wide range of functions related to protein sequences and protein structures are accessible with an intuitive graphical interface.
STRAP is tightly integrated into your desktop environment supporting Cut_paste and Word_completion and spell check. Drag_and_drop is available for proteins, residue selections, nucleotide structures and hetero structures. Context_menu for proteins, residue selections and files are triggered by right mouse click.

To users that are not familiar with object oriented graphical programs such as CorelDraw or PowerPoint, the program appears to be complicated in the beginning. With the help of the integrated tutorials you will learn how to apply the currently available methods to compare proteins sequences and structures.

The key features are:

What it cannot do:
- Alignment of nucleotide sequences
- Docking
- Gene structure, promotor analysis
- Structure modelling
- Molecular dynamics
- Gene expression
- RNA-3D-structure



Frequently occurring buttons


Selecting objects and using context menus

[Menu-bar> Help>Selected topics] Context menus in STRAP: The term is commonly used for menus which pop up when right clicking (right mouse button) an item in a graphical user interface, offering a list of options which vary depending on the item selected. Watch movie MOVIE:Context_Menu*



Context menus are available for example for proteins, for residue selections, for the alignment panel and for the rubber band selection.

Selecting single items: List items are selected simply by left-click the respective node. Single residue selections in the alignment panel can be selected by clicking with the Ctrl-key.

Selecting more than one item: Selecting more than one list item requires the Shift and Ctrl keys. The Ctrl-key is located at the lower left of the key board and is sometimes labeled Strg and the Shift-key is sometimes termed "UMSCHALT".

Selecting residue selections: Residue selections which are highlighted in the alignment panel can be selected by Ctrl+left-click. Selected residue selections are indicated by marching ants. By dragging a rectangular region in the alignment, all contained residue selections are selected. With the Shift or Ctrl-key the union-set or cut-set is formed between already selected items and those inside the rectangle.

Frequently used menu items can be dragged out the menu and placed on the desktop. Instead of clicking a menu-item the user drags the menu-item to the desktop. Being on the desktop, it can now be accessed much easier. When the menu-item on the desktop is not needed any more, it can be deleted.

The tree view is located at the left of the application. It is usually hidden and can be opened by dragging the vertical divider bar. The tree contains all loaded proteins and their child objects.

Menu-bar

Undock: Frequently used menu items can be dragged to the desktop.


Context-Menus in STRAP

are opened by right click (see MOVIE:Context_Menu*

Context menu of Proteins


Context menu of residue annotations


Context menu of residue selections


Context menu of files


Context menu of rubber band selection


Context menu of alignment panel



Project folders

[Menu-bar> Help>Selected topics>Loading files] Each alignment project is stored in one file directory. This file directory is specified by the user at the beginning of the session and is valid during the entire session. Two instances of Strap for different projects can be run at the same time. Proteins and residue selections can be copied from one to the other via Drag'n drop.

Loading protein files: Protein files are loaded into STRAP by dragging one or more files from the desktop or any other location into the STRAP-application.

Web-pages may contain links to protein databases. These links can be dragged from the web browser into STRAP ( DRAG_LINK* for details).

Several STRAP-sessions can be opened at the same time and proteins can be dragged from one to the other.

The sequence alignment panel

[Menu-bar> Help>Selected topics>Alignment panel] The central view of the STRAP application is the alignment panel. It shows the names and sequences of loaded proteins. The protein labels can be dragged up or down in the row header to change their order. In the amino acid sequences white space can be added or removed, so-called alignment gaps.

Multiple alignment windows: If two or more alignment views are displayed at the same time (New alignment panel[Alignment-Contextmenu>New ] ), they display the same alignment. But if two independent Strap sessions are run in parallel, both display different alignments. Proteins and residue selections can be exchange copied from one to the other by Drag-n-Drop.

Shading: Three residue shadings are available in the tool-bar: "charge", "hydropathy" and "chemical". Alternatively, secondary structure can be highlighted: Helices are painted red and sheets yellow. Further GUI controls are in the context-menu.

Editing: Manipulation of the multiple sequence alignment is performed with the keyboard. STRAP has many sophisticated keyboard commands. Please see Keyboard[Menu-bar> Help>Selected topics>Alignment panel] .

Cursor: The cursor position is shown as a flashing white rectangle in the alignment window. Usually only the alignment row containing the cursor is changed when for example a gap is introduced by pressing the space bar. If more than one protein is selected, then gaps are introduced into all selected proteins.
Residue selections are highlighted in the multiple sequence alignment. When the mouse is over a highlighted residue a balloon message appears. Right-click opens the context menu and Ctrl+left-click selects the selection. It can also be selected by dragging a rectangular region.

Scrolling: The horizontal scroll-bar outlines the entire alignment and shows selections and plots. It can be enlarged with the mouse. If there are many proteins the vertical scroll-bar is visible. But also the horizontal scroll-bar can scroll vertically. Try mouse wheel turning with and without Shift and Ctrl.

Keyboard

[Menu-bar> Help>Selected topics>Alignment panel] The keyboard is used to move the alignment cursor and to introduced/removed alignment gaps.
Cursor Movements
Left
Right
Ctrl+←To first residue
Ctrl+→To last residue
Alt+→ To next gap
Alt+← To previous gap
Home To first row
End To last row
Ctrl+Home To first residue
Ctrl+End to last residue

Movement of the view-port
PgDown Scroll down
PgUp Scroll up
Shift+PgUp Scroll left

Deleting gaps
Delete Delete gap right from or under cursor
Backspace Delete gap left from cursor
Ctrl+Delete Delete next gap right from the cursor
Ctrl+Backspace Delete next gap left from cursor
Ctrl+Shift+Delete Delete entire white space under cursor
Ctrl+Shift+BackspaceDelete entire white space left from cursor

Inserting gaps
Space Insert gap under cursor
Insert Insert gaps until next residue in above row is reached.
Ctrl+Ins Remove gaps until previous residue in above row is reached
Shift+InsInsert gaps until next residue in below row is reached

Repetition number. Performing an operation n times
4 2 Space Insert 42 gaps
4 2 DeleteDelete 42 gaps

Moving a sequence blocks: A group of consecutive residues that are not interrupted by gaps is moved left or right.
>Move right
<Move left

Window
Ctrl+WClose alignment panel

Miscellaneous
Ctrl+KClose protein
Ctrl+*Display letters in better quality
U Upper Case
L Lower Case

Font Size
Ctrl++Zoom in
Ctrl+-Zoom out

Go to position
4 2 iMove cursor to residue index 42
4 r Move cursor to 4th row
4 2 cMove cursor to column 42
4 2 nMove cursor pdb-number 42

Save / restore position
SSave current cursor position
sRevert cursor position previously saved
4 SSave current cursor position to "4"
4 sRestore cursor position previously saved in "4"



The mouse actions follow general conventions:
  1. Sequence alignment
    • Left-click sets the cursor.
    • Middle-click sets the focus without changing the cursor.
    • Right-click opens a context menu either for the alignment pane, for a residue selection or the rubber band selection.
    • Dragging creates a mouse selection.
    • Dragging over more than one rows creates a rectangular region. Residue selections and annotations within the rectangle are selected. Shift-key for union-set and Ctrl-key for cut-set similar to the program gimp or the MS-Window desktop.
    • Alt+Drag scrolls two-dimensionally. Under Unix/Linux also hold Shift-key
  2. Row-header
    • Dragging up/down changes the order of proteins.
    • Ctrl+click selects or deselects the protein.
    • Right-click opens a context menu for the protein.
  3. Scroll-bar or alignment pane
    • Wheel: Scrolls.
    • Shift+Wheel: Scrolls vertically
    • Ctrl+Wheel: Zoom.
Sequence groups
G xDefine all currently selected proteins as group "x".
g xLoad and select proteins of sequence group "x" if not already loaded. Select these proteins.
h xHide all proteins of sequence group "x".
u xUnselect all proteins of sequence group "x".

Sequence Groups:
Sequences can be grouped. Groups are designated by a digit or letter. The names of all proteins of a group are listed in a file in ./annotations/sequenceGroups/.

Alignment Gaps

[Menu-bar> Help>Selected topics>Alignment panel] When there are gaps occurring in all loaded sequences at the same position they are usually not displayed.
Consider the following case:
ASGATA  YTG
ATGATG  YTA
ASGGTAG FSG
       ^
This is a common gap which is not displayed. The view shows:
ASGATA YTG
ATGATG YTA
ASGGTAGFSG
When the sequence "ASGGTAGAFSG" was added to the multiple sequence alignment the gap would not be a common gap any more and would become visible. In the multiple sequence alignment panel is the following:
ASGATA  YTG
ATGATG  YTA
ASGGTAG FSG
ASGGTAGAFSG
This is the reason for the following phenomenon: When gaps are added to one sequence it might seem that gaps are erased in all other sequences instead.

Drag-and-drop

Drag and Drop

Actions which are performed when objects are dropped on certain targets.
D r o p p e d   O b j e c t s
Drop target Proteins Residue selections Hetero groups Images
Proteins
 
Copy residue selection to target protein Add heteros to target protein Set icon image
Residue selections
 
Set background image
3D-View
 
Add 3D-structure to view Highlight atoms/amino acids in 3D Add hetero structure to view
Alignment panel
 
Load or un-hide protein Copy selection to protein Add heteros to protein
Desktop or file browser
 
Copy protein file Write PDB-file

Objects are dragged with the mouse and dropped on a target (see ). The effect depends on the objects that are dragged and the destination where they are dragged onto. The presumed operation is written in the Strap tool-bar. The page Dragging Web-links contains examples of protein and alignment Web links that can be dragged.

Translate Genbank nucleotide files

The Genbank file format and Embl file format are widely used file formats for annotated nucleotide sequences (format specifications: http://www.ncbi.nlm.nih.gov/Sitemap/samplerecord.html and ftp://ftp.ncbi.nih.gov/genbank/gbrel.txt and http://www.ebi.ac.uk/embl/Documentation/User_manual/usrman.html). After loading a Genbank or Embl file, the nucleotide sequence is initially shown. Only after translation into amino acids, the amino acid sequence is shown. The translation is directed by a the CDS-sequence-features within the protein file. Example:
   CDS     5. .799
   /gene="Psmb5"
   /product="proteasome subunit X"
If the Genbank file contains several genes the user can choose one of them.


3D-visualization in Strap



Menu-bar: If the user clicks inside the 3D-view, the menu-bar of the 3D-view and the 3D-tool bar are shown. In addition to this generic menu bar, 3D-programs may or may not have their own native menu bar which can be activated.

Undock menu items: Menu items like "spheres on" or "sticks on" in the style menu might be frequently used. For convenience, menu items can be dragged to the desktop with the mouse and are easier accessible.

Undock the viewer pane: 3D-viewers that are located inside the STRAP-application frame can be moved outside (undocking) even to a 2nd screen. For this purpose the header needs to be dragged with the mouse.

Always floating on top of other windows All Strap windows can be set "Always floating on top of all other windows". Type Ctrl+T.

Picking Atoms in the 3D-View: When atoms are clicked in the 3D-view, Strap will display a toolbar with buttons that act upon the clicked amino acid.
By picking an atom, one amino acid is selected. By picking atoms with certain modifier keys (Shift, Ctrl, Alt), more than one amino acid can be selected. This corresponds to the notion of selecting more than one files in a file browser. The modifier keys depend on the 3D-viewer.
3D-ViewerContinuous selectionDiscontinuous selection
OpenAstexCtrl.Shift+ClickCtrl+Click
JMol Shift+ClickCtrl+Click


Selecting amino acids in the alignment panel: Using the mouse or using Arrow-Left/ Arrow-Right keys together with Shift, a number of amino acids can be selected in Strap. This selection is passed to 3D. Alternatively, a residue selection can be dragged with the mouse into the 3D-view.

Adding proteins: Proteins can be added by . See MOVIE:Drag_to_another_STRAP*

Displaying Sequence Features: Sequence Features can highlighted in the 3D-view using . See MOVIE:Sequence_Features_in_3D*

3D-Superposition: 3D-Superposition allows to compare two or more protein backbones. Two options are available:
  1. All peptide chains in the view: One medium structure will be used as a reference. All other proteins are overlaid upon this reference peptide chain.
  2. All multi-chain complexes in the view: Different chains of the same PDB entry form a complex and their relative position is sustained. For superimposing two multi-chain complexes, the best structurally matching pair of chains is determined. I.E. for superimposing two trimeric G-proteins, the beta propeller structures may be selected. The superposition is computed for this pair of peptide chains and the transformation is applied to the entire complexes. Complexes might also be defined by the user or by a script - see context menu of proteins.

3D-Backbone view

[Menu-bar> Help>Selected topics>3D] The simple 3D-viewer visualizes the C-alpha trace of one or several proteins as a polygon. Helices may be drawn in red and sheets in yellow or the entire chain is drawn in one color. There are a few advantage of the full featured sophisticated viewers also available in Strap:

Residue Selections

Residue selections are objects attached to proteins which select one or more amino acid positions from the amino acid sequence. For example dragging the mouse in the alignment pane selects a continuous chain of residues which may be pasted e.g. into a BLAST web form.

Annotated residue selections

[Menu-bar> Help>Selected topics] Annotated residue selections, allow the assignment of information to specific amino acids or nucleotides of proteins. A residue annotation has a list of entries each having a type and a modifiable text. The user can change these entries and add new entries. See Modifying annotations Annotated residue selections can be created in different ways: Residue selections can be selected in the alignment pane in two different ways: By Ctrl-click or by dragging a rectangular box. Selected residue selections have .

Background Image

Image files and images in the browser can be dragged onto an annotated residue selection. See DRAG_LINK*

Modifying annotations

To edit a * residue annotation the menu item ResidueSelectionPopup#ACTION_edit of the context menu must be clicked. Alternatively the residue underlining in the alignment panel can be double clicked. The annotation data is shown as a table. The rows contain the name, the group name, the residue positions and additional annotation.
Figure: The left column gives the type and the right column the annotation text. Each row has a toggle button to activate and inactivate the entry.
ResidueAnnotationView#docuView(  )

Drag and drop: Rows can be reordered with the mouse or can be dropped on other residue selections.

Export protein files

Using the export dialog (file-menu)

Single protein files can be exported in various output formats.

Dragging proteins with the mouse

Proteins can be dragged from the alignment to the Desktop or the file browser. This requires dragging a protein label such as in the row header of the alignment with the mouse. Proteins can be dropped in other Desktop applications that support . They may also be dropped into another STRAP instance.

By default, the original file that the protein was loaded from, is directly transferred even if the 3D orientation has been changed in Strap or the N-terminus or C-terminus have been cleaved off. This behavior can be changed with an option pane. The button to open this option pane appears in the Strap tool pane when the proteins are dragged.

Export PDF

Multiple sequence alignments can be exported to PDF and with the LaTeX extension TeXshade written by Eric Beitz (see , http://www.pharmazie.uni-kiel.de/chem/Prof_Beitz/texshade.htm).

Installation
LaTeX is required. It provides the two shell commands "latex" and "pdflatex" which will be run by Strap. A graphical front end is not needed.

Exporting the sequence alignment to PDF
By pushing "Generate PDF" in the main card of the TeXshade-Dialog, two files are generated: The LaTeX file with ending .ps and the alignment file with ending .msf. These two files are then processed by the command pdflatex. The result is a pdf file. As soon as the pdf file exists, the "PDF" button to start the PDF viewer becomes active. The file path of pdflatex and of the PDF viewer can be customized.

Including the Alignment in Text-processor documents
Modern textprocessors allow insertion of PDF or Encapsulated PostScript images. For PostScript output, the respective check-box needs to be activated. Subsequently, the PostScript file (Suffix .ps) needs to be converted to Encapsulated PostScript (File suffix .eps) as described in "How do I convert PostScript to EPS?" of page http://www.postscript.org/FAQs/language/ .

Annotations
Residue selections can contain one or more annotations labeled with Texshade. Those entries usually contain the variables "PROTEIN" and "RESIDUES" which stand for the current protein number and the sequence positions.
Useful TeXshade-expressions which can be added to selections
  \feature{top}{PROTEIN}{RESIDUES}{box[Red]}{peptide}
  \feature{ttop}{PROTEIN}{RESIDUES}{box[Red]}{peptide}
  \feature{bottom}{PROTEIN}{RESIDUES}{fill:$\uparrow$}{$S_{129}$}
  \feature{bbottom}{PROTEIN}{RESIDUES}{fill:X}{cleavage}
  \fingerprint{300}
            
TeXshade examples with variables
   \feature{bottom}{PROTEIN}{RESIDUES}{fill:$\uparrow$}{$$FIRST_Aaa_{$FIRST_INDEX}$}
   \feature{bottom}{PROTEIN}{RESIDUES}{fill:$\uparrow$}{$$FIRST_A_{$FIRST_INDEX}$}
            
A detailed description is found on the TeXshade home page.

Plotting
Numeric values computed for each residue with a class ValueOfResidue can be plotted along the sequence alignment. Classes implementing ValueOfResidue have a getValues-method which returns a numeric value for each residue. The user chooses a class in the "plotting"-card IC_PLOT. The plot has four alternative locations: top, bottom, ttop, bbottom.

Fixing LaTeX-errors
Like in any programming language PDF-LaTeX stops when the text contains syntax errors. LaTeX reports the line number where the error occurred. For example 1.42 indicates an error at line 42. Typically the error is an incorrect TeXshade command assigned by the user to a residue annotation. The most frequent errors are unbalanced parentheses or characters that have a special syntactical meaning like underscore or "%".

Memory limitation: With large alignments the pdflatex-run may terminate with an ... memory exceeded ...-error. No PDF-output would be generated. If this happens the LaTeX heap size must be increased. Miktex users should set pdf_mem_size to a higher values in the file miktex.ini. For other LaTeX systems the memory settings in texmf.cnf must be increased ( e.g. multiplied by 10) and the program fmtutil run as root. Typical locations of the texmf.cnf are /etc/texmf/texmf.cnf /usr/share/texmf/web2c/texmf.cnf and /usr/local/teTeX/share/texmf/web2c/texmf.cnf.
 # fmtutil  --byfmt=pdflatex

Publish interactive alignment

Figure: Generation of a Html link. The html code is generated automatically and can be tested in the web browser.
new DialogPublishAlignment(  )
With this dialog a Web-Link can be formed which loads the proteins from the public databases into Strap and displays an alignment. The only requirement for the client machine is Java. Depending on the amount of information two different types are available:

1. Single web address

The web address can not only be included in web-pages, but also in e-mails and Office-documents. The generation is conducted in two steps:
  1. DialogPublishAlignment#BUT_LAB_P The parameter String is written into the first text field and can be modified by the user. Not the sequence but the database accession id is stored. For technical details see http://www.bioinformatics.org/strap/createStrapLinks.html.
  2. DialogPublishAlignment#BUT_LAB_U From the text in the first text field the web address will be generated using and written into the 2nd text field.
  3. DialogPublishAlignment#BUT_LAB_B The generated web address can be tested by clicking. A new STRAP session will be opened in web-mode and an alignment will be loaded with the specified information.

2. Web form

Since the web form has no size limitation, the entire information for the alignment can be included. The draw-back is that it can only be included in web-pages, but not in office documents or e-mails.
  1. DialogPublishAlignment#BUT_LAB_F A html-page including the web link for the selected proteins is generated.
  2. DialogPublishAlignment#BUT_LAB_B The link can be tested in the web browser. From this html code the text between the opening and closing and <body> tags can be used in any html-page.


Stdout/Stderr

Exceptions and the streams stdout and stderr

Stdout and stderr are collected in files residing in ".StrapAlign/log/" unless STRAP was started with the command line parameter "-stderr" and "-stdout". The Java-console can be switched on with ControlPanel in /local/java/jdk1.6.0_20/jre/bin/.

STRAP on Windows

[Menu-bar> Help>Selected topics>Computer system]
Text Editor: does not work properly. or us is used.

The directory for program settings: Since cause problems the Strap directory is named "StrapAlign" instead of ".StrapAlign".
To avoid white space in file paths on Windows XP, the home directory is "C:\StrapAlign\" instead of "%HOMEPATH\StrapAlign". An alternative directory can be provided in the file %HOMEPATH%\StrapAlign\location_of_StrapAlign.ini.

Installing Bioinformatics programs: Strap uses the of if no other specified.