1 Search Results for "Stutzki, Jan"


Document
Multilingual Trend Detection in the Web

Authors: Jan Stutzki

Published in: OASIcs, Volume 37, 4th Student Conference on Operational Research (2014)


Abstract
This paper represents results from our ongoing research project in the foresight area. The goal of the project is to develop web based tools which automatically detect activity and trends regarding given keywords. This knowledge can be used to enable decision makers to react proactively to arising challenges. As for now we can detect trends worldwide in more than 60 languages and assign these trends accordingly to over 100 national states. To reach this goal we utilize the big search engines as their core competence is to determine the relevance of a document regarding the search query. The search engines allow slicing of the results by language and country. In the next step we download some of the proposed documents for analysis. Because of the amount of information required we reach the field of Big Data. Therefore an extra effort is made to ensure scalability of the application. We introduce a new approach to activity and trend detection by combining the data collection and detection methods. To finally detect trends in the gathered data we use data mining methods which allow us to be independent from the language a document is written in. The input of these methods is the text data of the downloaded documents and a specially prepared index structure containing meta data and various other information which accumulate during the collection of the documents. We show that we can reliably detect trends and activities in highly active topics and discuss future research.

Cite as

Jan Stutzki. Multilingual Trend Detection in the Web. In 4th Student Conference on Operational Research. Open Access Series in Informatics (OASIcs), Volume 37, pp. 16-24, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2014)


Copy BibTex To Clipboard

@InProceedings{stutzki:OASIcs.SCOR.2014.16,
  author =	{Stutzki, Jan},
  title =	{{Multilingual Trend Detection in the Web}},
  booktitle =	{4th Student Conference on Operational Research},
  pages =	{16--24},
  series =	{Open Access Series in Informatics (OASIcs)},
  ISBN =	{978-3-939897-67-5},
  ISSN =	{2190-6807},
  year =	{2014},
  volume =	{37},
  editor =	{Crespo Del Granado, Pedro and Joyce-Moniz, Martim and Ravizza, Stefan},
  publisher =	{Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{https://drops-dev.dagstuhl.de/entities/document/10.4230/OASIcs.SCOR.2014.16},
  URN =		{urn:nbn:de:0030-drops-46663},
  doi =		{10.4230/OASIcs.SCOR.2014.16},
  annote =	{Keywords: Information Retrieval, Web Mining, Trend Detection}
}
  • Refine by Author
  • 1 Stutzki, Jan

  • Refine by Classification

  • Refine by Keyword
  • 1 Information Retrieval
  • 1 Trend Detection
  • 1 Web Mining

  • Refine by Type
  • 1 document

  • Refine by Publication Year
  • 1 2014

Questions / Remarks / Feedback
X

Feedback for Dagstuhl Publishing


Thanks for your feedback!

Feedback submitted

Could not send message

Please try again later or send an E-mail