Document Open Access Logo

Utilization of ordinal response structures in classification with high-dimensional expression data

Authors Andreas Leha, Klaus Jung, Tim Beißbarth



PDF
Thumbnail PDF

File

OASIcs.GCB.2013.90.pdf
  • Filesize: 0.58 MB
  • 11 pages

Document Identifiers

Author Details

Andreas Leha
Klaus Jung
Tim Beißbarth

Cite AsGet BibTex

Andreas Leha, Klaus Jung, and Tim Beißbarth. Utilization of ordinal response structures in classification with high-dimensional expression data. In German Conference on Bioinformatics 2013. Open Access Series in Informatics (OASIcs), Volume 34, pp. 90-100, Schloss Dagstuhl - Leibniz-Zentrum für Informatik (2013)
https://doi.org/10.4230/OASIcs.GCB.2013.90

Abstract

Molecular diagnosis or prediction of clinical treatment outcome based on high-throughput genomics data is a modern application of machine learning techniques for clinical problems. In practice, clinical parameters, such as patient health status or toxic reaction to therapy, are often measured on an ordinal scale (e.g. good, fair, poor). Commonly, the prediction of ordinal end-points is treated as a multi-class classification problem, disregarding the ordering information contained in the response. This may result in a loss of prediction accuracy. Classical approaches to model ordinal response directly, including for instance the cumulative logit model, are typically not applicable to high-dimensional data. We present hierarchical twoing (hi2), a novel algorithm for classification of high-dimensional data into ordered categories. hi2 combines the power of well-understood binary classification with ordinal response prediction. A comparison of several approaches for ordinal classification on real world data as well as simulated data shows that classification algorithms especially designed to handle ordered categories fail to improve upon state-of-the-art non-ordinal classification algorithms. In general, the classification performance of an algorithm is dominated by its ability to deal with the high-dimensionality of the data. Only hi2 outperforms its competitors in the case of moderate effects.
Keywords
  • Classification
  • High-Dimensional Data
  • Ordinal Response
  • Expression Data

Metrics

  • Access Statistics
  • Total Accesses (updated on a weekly basis)
    0
    PDF Downloads
Questions / Remarks / Feedback
X

Feedback for Dagstuhl Publishing


Thanks for your feedback!

Feedback submitted

Could not send message

Please try again later or send an E-mail