1 Search Results for "Prado, Simone G. D."


Document
Mining Scientific Articles Powered by Machine Learning Techniques

Authors: Carlos A. S. J. Gulo, Thiago R. P. M. Rúbio, Shazia Tabassum, and Simone G. D. Prado

Published in: OASIcs, Volume 49, 2015 Imperial College Computing Student Workshop (ICCSW 2015)


Abstract
Literature review is one of the most important phases of research. Scientists must identify the gaps and challenges about certain area and the scientific literature, as a result of the accumulation of knowledge, should provide enough information. The problem is where to find the best and most important articles that guarantees to ascertain the state of the art on that specific domain. A feasible literature review consists on locating, appraising, and synthesising the best empirical evidences in the pool of available publications, guided by one or more research questions. Nevertheless, it is not assured that searching interesting articles in electronic databases will retrieve the most relevant content. Indeed, the existent search engines try to recommend articles by only looking for the occurrences of given keywords. In fact, the relevance of a paper should depend on many other factors as adequacy to the theme, specific tools used or even the test strategy, making automatic recommendation of articles a challenging problem. Our approach allows researchers to browse huge article collections and quickly find the appropriate publications of particular interest by using machine learning techniques. The proposed solution automatically classifies and prioritises the relevance of scientific papers. Using previous samples manually classified by domain experts, we apply a Naive Bayes Classifier to get predicted articles from real world journal repositories such as IEEE Xplore or ACM Digital. Results suggest that our model can substantially recommend, classify and rank the most relevant articles of a particular scientific field of interest. In our experiments, we achieved 98.22% of accuracy in recommending articles that are present in an expert classification list, indicating a good prediction of relevance. The recommended papers worth, at least, the reading. We envisage to expand our model in order to accept user’s filters and other inputs to improve predictions.

Cite as

Carlos A. S. J. Gulo, Thiago R. P. M. Rúbio, Shazia Tabassum, and Simone G. D. Prado. Mining Scientific Articles Powered by Machine Learning Techniques. In 2015 Imperial College Computing Student Workshop (ICCSW 2015). Open Access Series in Informatics (OASIcs), Volume 49, pp. 21-28, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2015)


Copy BibTex To Clipboard

@InProceedings{gulo_et_al:OASIcs.ICCSW.2015.21,
  author =	{Gulo, Carlos A. S. J. and R\'{u}bio, Thiago R. P. M. and Tabassum, Shazia and Prado, Simone G. D.},
  title =	{{Mining Scientific Articles Powered by Machine Learning Techniques}},
  booktitle =	{2015 Imperial College Computing Student Workshop (ICCSW 2015)},
  pages =	{21--28},
  series =	{Open Access Series in Informatics (OASIcs)},
  ISBN =	{978-3-95977-000-2},
  ISSN =	{2190-6807},
  year =	{2015},
  volume =	{49},
  editor =	{Schulz, Claudia and Liew, Daniel},
  publisher =	{Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{https://drops-dev.dagstuhl.de/entities/document/10.4230/OASIcs.ICCSW.2015.21},
  URN =		{urn:nbn:de:0030-drops-54776},
  doi =		{10.4230/OASIcs.ICCSW.2015.21},
  annote =	{Keywords: Machine Learning, Text Categorisation, Text Classification, Ranking, Systematic Literature Review}
}
  • Refine by Author
  • 1 Gulo, Carlos A. S. J.
  • 1 Prado, Simone G. D.
  • 1 Rúbio, Thiago R. P. M.
  • 1 Tabassum, Shazia

  • Refine by Classification

  • Refine by Keyword
  • 1 Machine Learning
  • 1 Ranking
  • 1 Systematic Literature Review
  • 1 Text Categorisation
  • 1 Text Classification

  • Refine by Type
  • 1 document

  • Refine by Publication Year
  • 1 2015

Questions / Remarks / Feedback
X

Feedback for Dagstuhl Publishing


Thanks for your feedback!

Feedback submitted

Could not send message

Please try again later or send an E-mail