Metannogen documentation

Note: Documentation is also available from the Help-menu in the running application. An online version of the documentation can be viewed here: Manual

Datasets and annotations of reactions

Datasets/annotations of reactions contain expert opinion entered by curators in form of structured information or free text. Depending on whether reactions of an existing network are annotated or whether a new network is reconstructed the term "reaction" or "dataset" is used, respectively.

Annotating existing networks

In case Metannogen is used to annotate an already existing network, a dataset annotates a reaction with the same identifier. Datasets that annotate reactions of a network will be called "annotations". Conversely, datasets with IDs not matching any reaction in the network are found in the container "Datasets not annotating a reaction". Networks are displayed in the left part of the application as a browsable tree. It is shown in a tabbed pane since more than one networks may be loaded at a time. Reactions appear as child nodes of pathways. Annotations for reactions can be created with the context menu. The context menu is opened by right clicking the tree node of a reaction. Annotated reactions are indicated by a traffic light sign. Annotations are displayed and can be edited in the right part of the application window when the tree node of an annotated reaction is activated. There are text fields, toggle buttons and choice menus for the curator to enter and modify the text-fields of the annotation. Each dataset is saved as one single plain text line with tab separated text fields. Consequently, the information is stored independently of the SBML-file to be annotated. In single-user mode all dataset lines are stored in one single text file on the user's hard-disk. In multi-user mode, each dataset is stored in its own file on the server. The file name is the dataset ID.

Metannogen as a primary network reconstruction tool

If Metannogen is used to reconstruct a metabolic network, the information for one reaction or a class of reactions is contained in a dataset. Usually each dataset describes one biochemical conversion in one or several subcellular compartments. Some enzymes and transport carriers are unspecific. To avoid redundancy and to reduce typing, the syntax can be used. Brace expansion provides a compact form for expressing several different but similar biochemical conversions within one single dataset. See
Metannogen brace expansion for details.

The resulting network consisting of metabolites and reactions can be exported as or any user defined format such as SQL and . The SBML-export allows further analysis such as or analysis in programs like Copasy. Metannogen is very flexible concerning adaptations of the output format towards special needs.

Multi-user mode

When several people work on the same project, the data can be kept on a Web-server. There are two options:
  1. A user can register a project on the www.bioinformatics.org server in the start dialog of Metannogen. The server will return a password.
  2. Repository may be created on any Web server with PERL or PHP. This is explained in a tutorial.
Metannogen prevents data loss through concurrent modification of the same data by different users.

Different identifier systems

The loaded networks might use different identifier systems for metabolites. This problem is solved by dictionaries to translate the identifiers. With the default Web-start link, the dictionary for translating Palsson-group type identifiers into Kegg is loaded. For example the dictionary contains the line "pyr C00022" telling that the metabolite identifier "pyr" corresponds to the identifier "C00022". Both denote Pyruvate. By explicitely mapping the metabolite identifiers of Palsson-group and Kegg networks, the reactions are indirectly assigned. When users choose the context menu item "Find reactions with same stoichiometry" on for example Lactate-dehydrogenase (LDH), they would find two objects: The LDH in Kegg and the LDH in the Palsson network.

These dictionary files are loaded with the program option -dictionaryOfSpecies. Above mapping would be realized by the text line
        pyr C00022
      

Resolving names of metabolites

Dictionary files can be provided with the program parameter "-names" to translate metabolite IDs into human readable names such as
        ...
        C00022\tPyruvate
        C05928\t10-Formyltetrahydrofolyl
        C05931\tN-Succinyl-L-glutamate
        ...
	  
In addition or alternatively, another network can be loaded. Metannogen will also use the metabolite names in all loaded networks.
Figure: The pathway map for Glycolysis from KEGG. Reactions for which a corresponding dataset exist have a green or red bar at the left. Green dataset describe reactions of the network and red dataset contain evidence that the described reaction is not part of the network. Selected objects are highlighted by and by color marks on the scroll-bar. All objects on the map have a and a .
screenshot of Metannogen

Cross-references:

Metannogen provides customizable references to various databases to allow documentation of the sources of evidence for the reaction under consideration. Some cross-references are automatically generated. Others can be entered during the network reconstruction to explain the curator's opinions. Due to its many functions regarding cross-references and browsing existing networks, it can be conveniently used just as a browser for Kegg or other networks.

Graphical pathway maps:

provides graphical Pathway views which are used in Metannogen to map reactions and metabolites graphically. The progress of reconstruction particular metabolic pathways can be followed since those reaction boxes in the graphical pathway maps are highlighted for which an equivalent reaction exists in the network under reconstruction. Metabolite circles for which a transport reaction is defined are also highlighted. All objects in the graphical maps have a context menu and a . They can be selected which is indicated by . In addition KEGG provides image files of single metabolites. They appear when the mouse is over a metabolite name or metabolite identifier in a text-component of the graphical user interface or when the mouse is over the metabolite circle in the graphical pathway map.

Literature management:

Metannogen comes with a powerful publication management system. It is based on semi-automated association of publication full text files in PDF format with Pubmed IDs. Downloaded full text files are shared by all curators. Text expression higlighting and search includes the abstract text as well as the full text. The publication management is also available in form of a smaller program package that starts much faster then Metannogen:
Webstart only publication manager.
Typing aids: When writing annotation text in the multi line text field of datasets, tab-word-completion is convenient and avoids mistyping of long words. For example metabolite names are sometimes complicated. Consider a long metabolite name such as "UDP-N-acetylmuramoyl-L-alanyl-D-glutamate". It is hard to type such complicated words without error. But with word expansion one does only need to type the first few letters like "UDP-N-a" and hit the tabulator key once or a few times until the desired word appears.