GSFLogo GSF

 

mipsLogo mips
GenRE
Info
» Introduction

» Key Concepts
  » Genome Annotation
  » Meaning and Data
  » Knowledge Generation

» Architecture / Technical
  » Data Management
  » Data Integration
  » Data Processing

» Developed with ...
Main
» News
» About
Projects

GenRE is currently associated with the following projects ...

» CYGD
» GenRE-Int
» MosDB
» QuIPoS
mips home mail to webmaster print view

Introduction

Background

Systematic sequencing of genomes has resulted in substantial amounts of sequence data encoding for the structures of all biological macromolecules in a species. Traditionally, this information has been translated into sequence data representing genetic elements, i.e. the functional segments of the DNA such as coding open reading frames or exons, various kinds of RNAs, and last, but not least, promoter elements responsible for the condition dependent, controlled expression.

The complex annotation associates biological information to sequence data. This substantial information can be inferred by the application of computational methods that is systematically applied like in the PEDANT sequence analysis package, but other, more specific information needs careful manual curation. Computational methods for example can transfer information from known sequence properties such as membrane spanning segments, secondary structures, folds, PFAM-domains, motifs and the like, whereas annotating the most important information such as sub-cellular location, protein/protein interaction, co-regulation, membership in pathways or cellular networks requires expert skills.

Analysis of high-throughput experimental data needs advanced data structures that must be able to allow computational methods to access information beyond the individual genetic element. On the one hand, basic information as compiled by the generic public data collections must be included; on the other hand, dynamic, inferred data such as homology, fold, and protein/protein data have to be integrated. The conceptual challenge meets the technical one. For instance, one could ask that a system should be able to cope with the following examples of complex questions:

  1. visualize all known genetic elements and their individual evidence of a small genome based only on the primary information from the sequence contig and the database information
  2. map all assigned functional classifications to a set of expression analysis data and correlate them to all putative protein/protein interactions
  3. compare two genomes dynamically for functional neighbourhood relations

It is evident that more than some simple databases and CGI-scripts are required to achieve the answering of complex biological questions. A system capable to integrate various data sources and a flexible processing layer together with a convenient publishing interface is required.  This is the foundation for GenRE a Genome Research Environment.

© 2003 GSF - Forschungszentrum für Umwelt und Gesundheit, GmbH Ingolstädter Landstraße 1, D-85764 Neuherberg

Imprint

Disclaimer:
MIPS Databases and associated information are protected by copyright. This server and its associated data and services are for academic, non-commercial use only. The GSF has no liability for the use of results, data or information which have been provided through this server. Neither the use for commercial purposes, nor the redistribution of MIPS database files to third parties nor the distribution of parts of files or derivative products to any third parties is permitted. Commercial users may contact the distributor Biomax Informatics AG.