When quoting this document, please refer to the following
URN: urn:nbn:de:0030-drops-19344
Go to the corresponding Portal

Seidl, Thomas ; Müller, Emmanuel ; Assent, Ira ; Steinhausen, Uwe

Outlier detection and ranking based on subspace clustering

08421.SeidlThomas.Paper.1934.pdf (0.1 MB)


Detecting outliers is an important task for many applications including fraud detection or consistency validation in real world data. Particularly in the presence of uncertain data or imprecise data, similar objects regularly deviate in their attribute values. The notion of outliers has thus to be defined carefully. When considering outlier detection as a task which is complementary to clustering, binary decisions whether an object is regarded to be an outlier or not seem to be near at hand. For high-dimensional data, however, objects may belong to different clusters in different subspaces. More fine-grained concepts to define outliers are therefore demanded. By our new OutRank approach, we address outlier detection in heterogeneous high dimensional data and propose a novel scoring function that provides a consistent model for ranking outliers in the presence of different attribute types. Preliminary experiments demonstrate the potential for successful detection and reasonable ranking of outliers in high dimensional data sets.

BibTeX - Entry

  author =	{Thomas Seidl and Emmanuel M{\"u}ller and Ira Assent and Uwe Steinhausen},
  title =	{Outlier detection and ranking based on subspace clustering},
  booktitle =	{Uncertainty Management in Information Systems },
  year =	{2009},
  editor =	{Christoph Koch and Birgitta K{\"o}nig-Ries and Volker Markl and Maurice van Keulen},
  number =	{08421},
  series =	{Dagstuhl Seminar Proceedings},
  ISSN =	{1862-4405},
  publisher =	{Schloss Dagstuhl - Leibniz-Zentrum fuer Informatik, Germany},
  address =	{Dagstuhl, Germany},
  URL =		{},
  annote =	{Keywords: Outlier detection, outlier ranking, subspace clustering, data mining}

Keywords: Outlier detection, outlier ranking, subspace clustering, data mining
Collection: 08421 - Uncertainty Management in Information Systems
Issue Date: 2009
Date of publication: 24.03.2009

DROPS-Home | Fulltext Search | Imprint | Privacy Published by LZI