License
when quoting this document, please refer to the following
URN: urn:nbn:de:0030-drops-15133
URL: http://drops.dagstuhl.de/opus/volltexte/2008/1513/

Leser, Ulf ; Groth, Philip ; Weiss, Bertram ; Pohlenz, Hans-Dieter

Mining Phenotypes for Protein Function Prediction

pdf-format:
Dokument 1.pdf (14 KB)


Abstract

Until very recently, phenotypes only very rarely were studied in a systematic manner. While ontologies for describing gene functions now have a 10 year long tradition, similar vocabularies for describing the phenotype of genes are only emerging now; similarly, the techniques for determining phenotypes on a large scale (especially RNAi) are available only for a few years, while genomic sequencing or gene expression studies are already established for a much longer time. In this talk, we describe results from a study for exploiting phenotype descriptions for protein function prediction. We used the data from PhenomicsDB, a phenotype database integrated from several publicly available data sources. Due to the lack of standardization, phenotypes in PhenomicsDB can only be viewed as text (short statements, abstracts, singular terms, …). We clustered these texts and analyzed the corresponding gene clusters in terms of their coherence in functional annotation and their interconnectedness by protein-protein-interactions. We also devised a method for using the close similarity in their phenotype descriptions to predict the function of proteins. We show that this methods yields a very good precision at acceptable coverage.

BibTeX - Entry

@InProceedings{leser_et_al:DSP:2008:1513,
  author =	{Ulf Leser and Philip Groth and Bertram Weiss and Hans-Dieter Pohlenz},
  title =	{Mining Phenotypes for Protein Function Prediction},
  booktitle =	{Ontologies and Text Mining for Life Sciences : Current Status and Future Perspectives},
  year =	{2008},
  editor =	{Michael Ashburner and Ulf Leser and Dietrich Rebholz-Schuhmann},
  number =	{08131},
  series =	{Dagstuhl Seminar Proceedings},
  ISSN =	{1862-4405},
  publisher =	{Schloss Dagstuhl - Leibniz-Zentrum fuer Informatik, Germany},
  address =	{Dagstuhl, Germany},
  URL =		{http://drops.dagstuhl.de/opus/volltexte/2008/1513},
  annote =	{Keywords: Data mining, funciton prediction, bioinformatics, phenotypes, text mining}
}

Keywords: Data mining, funciton prediction, bioinformatics, phenotypes, text mining
Seminar: 08131 - Ontologies and Text Mining for Life Sciences : Current Status and Future Perspectives
Issue date: 2008
Date of publication: 03.06.2008


DROPS-Home | Fulltext Search | Imprint Published by LZI