UGENE

From Bioinformatics.Org Wiki

(Difference between revisions)
Jump to: navigation, search
 
(10 intermediate revisions not shown)
Line 1: Line 1:
 +
'''UGENE''' is a free cross-platform [[bioinformatics]] software package.
 +
{{Infobox Software
{{Infobox Software
| name                  = UGENE
| name                  = UGENE
-
| logo                  = [[Image:UGENE_software_package_logo.svg | thumb | UGENE logo]]
+
| logo                  = [[Image:UGENE software package logo.svg|50px|UGENE logo]]
-
| screenshot            = [[Image:UGENE_1_2_0_screenshot.png | thumb | UGENE viewing Escherichia coli complete genome]]
+
| screenshot            = [[Image:UGENE-1.9.0-overview.png|250px|Working with genomic data in UGENE 1.9.0]]
| developer              = Unipro
| developer              = Unipro
-
| latest_release_version = 1.2.0
+
| latest_release_version = 1.9.0
-
| latest_release_date    = {{release date|2008|09|19}}
+
| latest_release_date    = {{release date|2010|12|15|df=yes}}
| programming language  = [[C++]], [[QtScript]]
| programming language  = [[C++]], [[QtScript]]
| operating system      = [[Cross-platform]]
| operating system      = [[Cross-platform]]
-
| language              = [[English_language | English]], [[Russian_language | Russian]]
+
| language              = [[English language|English]], [[Russian language|Russian]], [[Czech language|Czech]]
| genre                  = [[Bioinformatics]] toolkit
| genre                  = [[Bioinformatics]] toolkit
-
| license                = [[GNU General Public License | GPL]]
+
| license                = [[GNU General Public License|GPL]]
| website                = http://ugene.unipro.ru
| website                = http://ugene.unipro.ru
}}
}}
-
'''UGENE''' is a free cross-platform [[bioinformatics]] software package.  
+
'''UGENE''' is [[Free software|free]] [[Open source|open-source]] [[cross-platform]] [[bioinformatics]] software.  
-
== Features ==
+
It integrates dozens of well-known [[Biology|biological]] tools and [[Algorithm|algorithms]], providing both [[GUI|graphical user]] and [[Command-line_interface|command line]] [[Interface|interfaces]]. Using [[#UGENE Workflow Designer|UGENE Workflow Designer]] one can arrange the required tools and [[Algorithm|algorithms]] into a [[Scientific_workflow_system|workflow]] schema.  
-
[[UGENE]] has modular structure and almost all its functionality comes within plugins. Internally [[UGENE]] is designed with support of parallelism to make profit from using multicore computers.
+
-
* View and analysis of annotated [[DNA]] or [[protein]] sequences
+
In order to provide maximum possible [[Computer_performance|performance]] '''UGENE''' utilizes multicore [[CPU|CPUs]] and [[GPU|GPUs]] to [[Program_optimization|optimize]] some of its [[Subroutine|computational routines]]. Another way to speed up [[Computing|computations]] is to use [[Amazon_EC2|Amazon EC2]] [[Cloud_computing|cloud]] resources.
-
* Huge files support, native support of gzipped documents
+
-
* [[Chromatogram | Chromatograms]] visualization
+
-
* Number of [[DNA]] plots supported
+
-
* Exact local [[sequence alignment]] with [[Smith-Waterman]] algorithm
+
-
* Search for [[Open reading frames | ORFs]]
+
-
* Integrated support of search requests to [[National Center for Biotechnology Information|NCBI]] databases
+
-
* [[Multiple sequence alignment]] with port of [[MUSCLE]]
+
-
* Easy-to-use search with [[Hidden Markov model | HMM]] profiles using adopted [[HMMER]] package
+
-
== External links ==
 
 +
== Key features ==
 +
The software supports the following features:
 +
* Creating, editing  and annotating [[Nucleic acid sequence|nucleic acid]] and [[Protein|protein]] [[Sequence_(biology)|sequences]]
 +
* Search through [[Online_database|online databases]]:  [[National Center for Biotechnology Information|NCBI]], [[Protein Data Bank|PDB]], [[Swissprot|UniProtKB/Swiss-Prot]], [[TrEMBL|UniProtKB/TrEMBL]]
 +
* [[Multiple sequence alignment|Multiple sequence alignment]]:  [[Clustal|Clustal]], [[Multiple Sequence Comparison by Log-Expectation|MUSCLE]], [[List of sequence alignment software|Kalign]], [[MAFFT|MAFFT]], [[T-Coffee|T-Coffee]]
 +
* Online and local [[BLAST|BLAST]] search
 +
* [[Restriction enzyme|Restriction analysis]] with integrated [http://rebase.neb.com/rebase/rebase.html REBASE] [[List_of_restriction_enzyme_cutting_sites|restriction enzyme database]]
 +
* Integrated [http://frodo.wi.mit.edu/ Primer3] package for [[Primer (molecular biology)|PCR primers design]]
 +
* Search for [[Direct repeat|direct]], [[Inverted repeat|inverted]] and [[Tandem repeat|tandem]] [[Repeated sequence (DNA)|repeats]] in [[DNA]] sequences
 +
* Constructing [[Dot plot (bioinformatics)|dotplots]] for [[Nucleic acid sequence|nucleic acid sequences]]
 +
* Search for [[transcription factor]] [[binding site]]s ([[TFBS]]) with [[Position-specific scoring matrix|weight matrix]] and [http://wwwmgs.bionet.nsc.ru/cgi-bin/mgs/sitecon/sitecon.pl?stage=0 SITECON] algorithms
 +
* Aligning short reads with [http://bowtie-bio.sourceforge.net/index.shtml Bowtie] and UGENE genome aligner
 +
* Search for [[Open reading frames|ORFs]]
 +
* [[Cloning]] [[in silico]]
 +
* [[Protein structure|3D structure]] viewer for files in [[Protein Data Bank|PDB]] and [http://www.ncbi.nlm.nih.gov/Structure/MMDB/mmdb.shtml MMDB] formats, [[Anaglyphs | anaglyph]] view support
 +
* [[Protein secondary structure]] prediction with [[GOR method|GOR IV]] and PSIPRED algorithms
 +
* [[HMMER|HMMER2]] and [[HMMER|HMMER3]] packages integration
 +
* Building (using integrated [[PHYLIP]] package) and viewing [[phylogenetic tree]]s
 +
* [[Sequence alignment|Local sequence alignment]] with optimized [[Smith-Waterman algorithm]]
 +
* Combining various algorithms into custom [[workflow]]s with [[#UGENE Workflow Designer|UGENE Workflow Designer]]
 +
* Search for a pattern of various algorithms’ results in a [[nucleic acid sequence]] with [[#UGENE Query Designer|UGENE Query Designer]]
 +
 +
==User interface==
 +
<!-- [[Image:UGENE-1.9.0-alignment editor.png|thumb|right|250px|Alignment editor|250px]] -->
 +
 +
The software has three main views to display [[Biological data|biological data]] on the user’s [[Display device|screen]].
 +
 +
1. The '''Sequence view''' is used to visualize, analyze and modify [[Nucleic acid sequence|nucleic acid]] or [[Protein sequence|protein]] sequences. Depending on the sequence type and the options selected the followings views can be presented inside the '''Sequence view''' window:
 +
 +
* [[Protein structure|3D structure]] view
 +
* Circular view
 +
* [[Chromatogram|Chromatogram]] view
 +
* [[Dot plot (bioinformatics)|Dotplot]] view
 +
 +
2. The '''Alignment editor''' is used to visualize, analyze and modify a nucleic acid or protein [[multiple sequence alignment]].
 +
 +
3. The '''Phylogenetic tree viewer'''.
 +
 +
==UGENE Workflow Designer==
 +
<!-- [[Image:UGENE-1.9.0-wd.png|thumb|right|250px|Workflow Designer|250px]] -->
 +
'''UGENE Workflow Designer''' allows creating and running complex computational [[workflow]] schemas.
 +
 +
The elements that a schema consists of correspond to the bulk of algorithms integrated into '''UGENE'''. Using the '''Workflow Designer''' one can also create custom workflow elements.
 +
 +
The workflow schemas can be run both locally and remotely, either using the [[GUI|graphical interface]] or launched from the [[Command-line interface|command line]].
 +
 +
==UGENE Query Designer==
 +
<!-- [[Image:UGENE-1.9.0-qd.png|thumb|right|250px|Query Designer|250px]] -->
 +
'''UGENE Query Designer''' allows a user to analyze a nucleotide sequence using different algorithms ([[Repeated sequence (DNA)|Repeats]] finder, [[Open reading frames|ORF]] finder, [[Position-specific scoring matrix|Weight matrix]] matching, etc.) at the same time imposing constraints on the positional relationship of the results obtained from the algorithms.
 +
 +
A schema of the algorithms and constraints is either created from the [[GUI]] or edited as a [[plain text]].
 +
 +
The results are saved as a set of [[Annotation#Computational biology|annotation]]s to a specified file in the [[GenBank]] format.
 +
 +
==Supported biological data formats==
 +
* [[Sequence (biology)|Sequence]]s and [[Annotation#Computational biology|annotation]]s: [[FASTA format|FASTA]] (.fa), [[GenBank]] (.gb), [[EMBL]] (.emb), [[GFF]](.gff)
 +
* [[Multiple sequence alignment]]s: [[Clustal]] (.aln), MSF (.msf),  [[Stockholm format|Stockholm]] (.sto), [[Nexus file|Nexus]] (.nex)
 +
* [[Protein structure|3D structures]]: [[Protein Data Bank|PDB]] (.pdb), [http://www.ncbi.nlm.nih.gov/Structure/MMDB/mmdb.shtml MMDB] (.prt)
 +
* [[Chromatogram|Chromatograms]]: ABIF (.abi), SCF (.scf)
 +
* Short reads: Sequence Alignment/Map(SAM) (.sam), [[ACE file format|ACE]] (.ace), FASTQ (.fastq)
 +
* [[Phylogenetic tree]]s: [[Newick format|Newick]] (.nwk)
 +
* Other formats: Bairoch ([[enzyme]]s info), HMM ([[HMMER]] profiles), PWM and PFM ([[Position-specific scoring matrix|position matrices]]), etc.
 +
 +
==Release cycle==
 +
 +
'''UGENE''' is primarily developed by [http://unipro.ru/ Unipro LLC]. Each [[Iteration#Project_management|iteration]] lasts about 6 weeks. By the end of [[Iteration#Project_management|iteration]] a [[Software release life cycle|release]] comes out. One can also download a development snapshot of the software.
 +
 +
The features to be included into the next release are mostly initiated by users.
 +
 +
==Awards==
 +
 +
'''UGENE''' was awarded [http://www.linuxformat.ru/foss-contest#foss2010-results "The best FOSS project in Russia – 2010"] in the category "Group project" at the [[Linux Format]] magazine contest.
 +
 +
==See also==
 +
* [[Sequence alignment software]]
 +
* [[Bioinformatics]], [[Computational biology]]
 +
* [[Free open source bioinformatics projects]]
 +
 +
==Related software==
 +
*[http://www.csc.fi/english/research/sciences/bioscience/programs/ds Discovery Studio]
 +
*[[Gene Designer]]
 +
*[[Vector NTI]]
 +
*[[Geneious]]
 +
*[http://www.clcbio.com/main CLC Main Workbench]
 +
*[[MacVector]]
 +
*[http://www.crimsonbase.com/ QuickGene]
 +
*[http://www.biology.utah.edu/jorgensen/wayned/ape/ Ape]
 +
*[http://www.serialbasics.com/Serial_Cloner.html SerialCloner]
 +
 +
==References==
 +
*{{cite journal | last=Fursov | first=M. | coauthors= Novikova, O. | year=2008 | title=Multitasking software system for DNA analysis |journal=Proceedings of the Sixth International Conference on Bioinformatics of Genome Regulation and Structure | volume=1 | pages=78 |url=http://www.bionet.nsc.ru/meeting/bgrs2008/BGRS2008_Proceedings.pdf | isbn=978-5-91291-005-0}}
 +
*{{cite journal | last=Fursov | first=M. Y.| coauthors= Oshchepkov, D. Y; Novikova, O. S. | year=2009 | title=UGENE: interactive computational schemes for genome analysis |journal=Proceedings of the Fifth Moscow International Congress on Biotechnology | volume=3 | pages=14–15 |url=http://mosbiotechworld.ru/app/tez_09_03.pdf | isbn=5-7237-0372-2}}
 +
*{{cite journal | last=Efremov | first=I. E.| coauthors= Fursov, M. Y; Danilova, Yu. E. | year=2009 | title=UGENE:  high performance genome analysis suite |journal=Proceedings of the Fifth Moscow International Congress on Biotechnology | volume=2 | pages=405–406 | isbn=5-7237-0372-2}}
 +
 +
==External links==
* [http://ugene.unipro.ru/ UGENE homepage]
* [http://ugene.unipro.ru/ UGENE homepage]
 +
* [http://ugene.unipro.ru/podcast.html UGENE podcast]
* [http://ugene.unipro.ru/documentation.html UGENE documentation]
* [http://ugene.unipro.ru/documentation.html UGENE documentation]
 +
* [http://ugene.unipro.ru/forum UGENE forum]
 +
* http://www.linuxformat.ru/foss-contest#foss2010-results
 +
* [https://ugene.unipro.ru/tracker UGENE bug tracker]
 +
 +
[[Category:Bioinformatics]]
 +
[[Category:Bioinformatics software]]
 +
[[Category:Free science software]]
 +
[[Category:Free software programmed in C++]]
 +
[[Category:Computational science]]
 +
[[Category:Software]]

Latest revision as of 15:39, 31 March 2011

UGENE is a free cross-platform bioinformatics software package.

UGENE
Working with genomic data in UGENE 1.9.0
Developer Unipro
Latest release 1.9.0 / 15 December 2010
Operating system N/A
Programming language N/A
Language English, Russian, Czech
Genre Bioinformatics toolkit
License GPL
Website http://ugene.unipro.ru

UGENE is free open-source cross-platform bioinformatics software.

It integrates dozens of well-known biological tools and algorithms, providing both graphical user and command line interfaces. Using UGENE Workflow Designer one can arrange the required tools and algorithms into a workflow schema.

In order to provide maximum possible performance UGENE utilizes multicore CPUs and GPUs to optimize some of its computational routines. Another way to speed up computations is to use Amazon EC2 cloud resources.


Contents

Key features

The software supports the following features:

User interface

The software has three main views to display biological data on the user’s screen.

1. The Sequence view is used to visualize, analyze and modify nucleic acid or protein sequences. Depending on the sequence type and the options selected the followings views can be presented inside the Sequence view window:

2. The Alignment editor is used to visualize, analyze and modify a nucleic acid or protein multiple sequence alignment.

3. The Phylogenetic tree viewer.

UGENE Workflow Designer

UGENE Workflow Designer allows creating and running complex computational workflow schemas.

The elements that a schema consists of correspond to the bulk of algorithms integrated into UGENE. Using the Workflow Designer one can also create custom workflow elements.

The workflow schemas can be run both locally and remotely, either using the graphical interface or launched from the command line.

UGENE Query Designer

UGENE Query Designer allows a user to analyze a nucleotide sequence using different algorithms (Repeats finder, ORF finder, Weight matrix matching, etc.) at the same time imposing constraints on the positional relationship of the results obtained from the algorithms.

A schema of the algorithms and constraints is either created from the GUI or edited as a plain text.

The results are saved as a set of annotations to a specified file in the GenBank format.

Supported biological data formats

Release cycle

UGENE is primarily developed by Unipro LLC. Each iteration lasts about 6 weeks. By the end of iteration a release comes out. One can also download a development snapshot of the software.

The features to be included into the next release are mostly initiated by users.

Awards

UGENE was awarded "The best FOSS project in Russia – 2010" in the category "Group project" at the Linux Format magazine contest.

See also

Related software

References


External links

Personal tools
Namespaces
Variants
Actions
wiki navigation
Toolbox