Abstract
To a great extent, our phenotype is determined by our genetic material. Many genotypic modifications may ultimately become manifest in more or less pronounced changes in phenotype. Despite the importance of how specific genetic alterations contribute to the development of diseases, surprisingly little effort has been made towards exploiting systematically the current knowledge of genotype-phenotype relationships. In the past, genes were characterized with the help of so-called "forward genetics" studies in model organisms, relating a given phenotype to a genetic modification. Analogous studies in higher organisms were hampered by the lack of suitable high-throughput genetic methods. This situation has now changed with the advent of new screening methods, especially RNA interference (RNAi) which allows to specifically silence gene by gene and to observe the phenotypic outcome. This ongoing large-scale characterization of genes in mammalian in-vitro model systems will increase phenotypic information exponentially in the very near future. But will our knowledge grow equally fast? As in other scientific areas, data integration is a key problem. It is thus still a major bioinformatics challenge to interpret the results of large-scale functional screens, even more so if sets of heterogeneous data are to be combined. It is now time to develop strategies to structure and use these data in order to transform the wealth of information into knowledge and, eventually, into novel therapeutic approaches. In light of these developments, we thoroughly surveyed the available phenotype resources and reviewed different approaches to analyzing their content. We discuss hurdles yet to be overcome, i.e. the lack of data integration, the missing adequate phenotype ontologies and the shortage of appropriate analytical tools. This review aims to assist researchers keen to understand and make effective use of these highly valuable data.
Keywords: Phenotype, genotype, comparative phenomics, database, RNA interference, analysis