virubanker style=

VirusBanker Bunyaviridae v1.0

Run Virusbanker using Java WebStart
If for some reason virusbanker does not work using Java Web Start, download the zip file

Download Virusbanker1.0 (source level 1.6)

Download Documentation

Content update: Sequences last updated Ausgust 8th 2009.


System requirements

Virusbanker runs on Windows, Mac and Linux in a graphic mode. Make sure you have a version of Java.

Java on Windows:
Virusbanker requires the Java Runtime Environment downloadable on the sun website. Follow the link J2SE Runtime Environment (JRE).

Java on Mac OS X:
A recent version of Mac OS X should have a Java version already installed otherwise the latest version is downloadable from the Apple website. Follow the link Java 5.0.

Java on Linux:
Virusbanker requires the Java Runtime Environment downloadable from the sun website. Follow the link J2SE Runtime Environment (JRE).


Installation:

  1. Install the Java Runtime Envirnment if not alread present.
  2. Download the virusbanker.zip file.
  3. Unzip it wherever you want.


Getting started with Virusbanker

Virusbanker is a MySQL database of virus sequences and alignments that is accessed via a Java program. To use the database you need to download the Java program. The program creates an interface with three spreadsheets and a selection menu. We describe the process of setting up VirusBanker below; here we describe the process of starting to use it.

To start searching the database and see all the nucleotide sequences from a family press the “Find nucleotide” button on the interface.

image1

The sequences will be displayed in the Main spreadsheet. Sequences are displayed as rows in the spreadsheets. To switch and view all protein sequences or all coding nucleotide sequences, press the button called “Find protein” or “Find ORF”. There will be a delay while the sequence records are loaded from the server into the interface.

image2

To select which characteristics are displayed, click the check boxes at the top of the menu. Characteristics are displayed as columns of a spreadsheet. When a new selection of check boxes has been made, press the Find nucleotide, Find protein or ORF button so that the selected characteristics are displayed.

image3

To narrow the range of sequences that are displayed on the Main spreadsheet, use the menu to select a genus, species, serotype (strain) or gene. There will be a short delay while records are loaded. Note that a genus has to be selected before a species or gene/product may be selected, and a genus and species has to be selected before a serotype (strain) can be selected. Once the selection has been made press the Find nucleotide, Find protein or ORF button so that the selected characteristics are displayed.

image4

When sequence records are displayed on the Main spread sheet, they may be sorted by clicking the column titles. They are sorted alphabetically or numerically depending on which column is clicked. To find the longest sequences, select the sequence length check box, then press the Find nucleotide, protein or ORF button, and then click on the title of the sequence length column. You may wish to press it again if it lists the shortest sequences at the top.

image5


Defining a dataset

The main use of VirusBanker is to define a subset of the sequences and download them. Sequence records are selected from the main menu by clicking and dragging the cursor across the records – they should then be highlighted in colour. They may then be saved to the “Set” spreadsheet by pressing the “Save to set” button.

image6

You may view the sequences in the Set spreadsheet by pressing the folder tab at the top of the spreadsheet. Other sequences may be added to the Set spreadsheet by returning to the Main spreadsheet, selecting the sequences, and again pressing the Save to set button. Sequences that are displayed in the Set spreadsheet can be sorted by clicking the column titles. Sequences can be deleted from the Set spreadsheet by selecting them, and pressing the “Delete from set” button.

image7

The characteristics that are displayed may be changed while working with the Set spreadsheet. However, the type of sequence that is selected and displayed in the Set (nucleotide, protein or coding sequence) may not be changed and although it is possible to change the type that is selected in the Main spreadsheet , while working on a set, it is not possible to save a mixture of different types to the set.
Linking to Jalview.

Downloading:

Sequences that are listed on the Set spreadsheet may be downloaded directly into Jalview. Click on the “Jalview” button on the menu, and select either the aligned sequences or the raw sequences. Jalview will then popup. Jalview was created by a research group headed by Geoff Barton at The University of Glasgow. Please see Clamp, M., Cuff, J., Searle, S. M. and Barton, G. J. (2004), "The Jalview Java Alignment Editor," Bioinformatics, 20, 426-7 . For more information visit Jalview .
If you view alignments using Jalview, it is important to ensure that the aligned sequences come from the same gene or coding region. If the Set spreadsheet contains unrelated sequences, such as sequences from different genes, it is still possible to display the aligned sequences in Jalview, however, they will have come from different alignments and the combined alignment will be nonsense. Please check our website for a list of the available alignments.
Downloading

Sequences that are listed on the Set spreadsheet may be downloaded in FASTA or NBRF/PIR format. To download the sequences click on the “Download “ button on the menu, or the File button at the top of the interface and then select the Download set menu item. A download menu will then popup. Press on the “Browse” button and type in the file name, and choose a location for the file on your system. Select a file type from the menu.

image9

The names on the headers of each sequence can be changed by adding or deleting terms to the header box. Terms must be separated by the “+” symbol and “|” will insert a vertical bar into the sequence header. Currently the following terms are available: “gi” for the gi number, “source” for the source feature from the GenBank file, “definition” for the definition feature from the GenBank file, “accession” for the GenBank accession code. When you have completed the menu and header details and nominated a file name, then press the “OK” button on the download menu.



Certificate:

The names on the headers of each sequence can be changed by adding or deleting terms to the header box. Terms must be separated by the “+” symbol and “|” will insert a vertical bar into the sequence header. Currently the following terms are available: “gi” for the gi number, “source” for the source feature from the GenBank file, “definition” for the definition feature from the GenBank file, “accession” for the GenBank accession code. When you have completed the menu and header details and nominated a file name, then press the “OK” button on the download menu.