Snap ML - Accelerated Machine Learning for Big Data (Keynote Abstract)

Author Haris Pozidis

Thumbnail PDF


  • Filesize: 158 kB
  • 1 pages

Document Identifiers

Author Details

Haris Pozidis
  • IBM Research Zurich, Switzerland

Cite AsGet BibTex

Haris Pozidis. Snap ML - Accelerated Machine Learning for Big Data (Keynote Abstract). In 23rd International Conference on Principles of Distributed Systems (OPODIS 2019). Leibniz International Proceedings in Informatics (LIPIcs), Volume 153, p. 3:1, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2020)


Snap Machine Learning (Snap ML) is a new software library for training popular machine learning models, characterized by very high performance, scalability to TB-scale datasets and high resource efficiency. It continuously evolves and currently supports generalized linear models, decision trees, random forests and gradient boosting machines. Snap ML has been built to address the needs of business applications, which often have to deal with high-volume data, react fast to changing environments, and use resources efficiently to drive down cost. The high efficiency of Snap ML, in particular in dealing with big data, comes from innovations in distributed optimization, among other things. This talk will review the principles of the Snap ML library, explain how it achieves high speed and scalability, and present several cases of business workloads that demonstrate the benefits offered by Snap ML. Haris Pozidis manages the Cloud Storage and Analytics group at IBM Research in Zurich, Switzerland. He was with Philips Research, Eindhoven, The Netherlands, before joining IBM. He has worked on read channel design for DVD and Blu-ray Disc at Philips, and played a key role in developing the first scanning probe-based data storage system at IBM, the “Millipede”. His current focus is on the development of Flash memory controllers for all-flash arrays, on phase change memory technology and system solutions, and on accelerated software libraries for machine learning. He holds over 120 US patents, has co-authored more than 120 publications, is an IBM Principal Research Scientist and Master Inventor, and a Senior Member of the IEEE.

Subject Classification

ACM Subject Classification
  • Computing methodologies → Machine learning
  • Software and its engineering → Software libraries and repositories
  • Machine Learning
  • Big Data


  • Access Statistics
  • Total Accesses (updated on a weekly basis)
    PDF Downloads