Stochastic gradient descent‐based support vector machines training optimization on Big Data and HPC frameworks

Abeykoon, Vibhatha; Fox, Geoffrey; Kim, Minje; Ekanayake, Saliya; Kamburugamuve, Supun; Govindarajan, Kannan; Wickramasinghe, Pulasthi; Perera, Niranda; Widanage, Chathura; Uyar, Ahmet; Gunduz, Gurhan; Akkas, Selahatin

Published in

Wiley, Concurrency and Computation: Practice and Experience, 8(34), 2021

DOI: 10.1002/cpe.6292

Tools

Export citation

Search in Google Scholar

Stochastic gradient descent‐based support vector machines training optimization on Big Data and HPC frameworks

Journal article published in 2021 by Vibhatha Abeykoon

, Geoffrey Fox

, Minje Kim, Saliya Ekanayake, Supun Kamburugamuve

, Kannan Govindarajan, Pulasthi Wickramasinghe, Niranda Perera, Chathura Widanage, Ahmet Uyar, Gurhan Gunduz

, Selahatin Akkas

This paper was not found in any repository, but could be made available legally by the author.

Full text: Unavailable

Preprint: archiving allowed

Upload

Postprint: archiving restricted

Upload

Published version: archiving forbidden

Policy details

Data provided by

Abstract

SummarySupport vector machines (SVM) is a widely used machine learning algorithm. With the increasing amount of research data nowadays, understanding how to do efficient training is more important than ever. This article discusses the performance optimizations and benchmarks related to providing high‐performance support for SVM training. In this research, we have focused on a highly scalable gradient descent‐based approach to implementing the core SVM algorithm. In providing a scalable solution, we have designed optimized high‐performance computing and dataflow‐oriented SVM implementations. A high‐performance computing approach means the algorithm is implemented with the bulk synchronous parallel (BSP) model. In addition, we analyzed the language level optimizations and math kernel optimizations on a prominent HPC modeling programming language (C++) and dataflow modeling programming language (Java). In the experiments, we compared the performance of classic HPC models, classic dataflow models, and hybrid models designed on classic HPC and dataflow programming models. Our research illustrates a scientific approach in designing the SVM algorithm at scale in classic HPC, dataflow, and hybrid systems.

Published in

Links

Tools

Stochastic gradient descent‐based support vector machines training optimization on Big Data and HPC frameworks

Abstract