Maggy: open-source asynchronous distributed hyperparameter optimization based on Apache Spark

Meister, Moritz Johannes (2019). Maggy: open-source asynchronous distributed hyperparameter optimization based on Apache Spark. Thesis (Master thesis), E.T.S. de Ingenieros Informáticos (UPM).

Description

Title: Maggy: open-source asynchronous distributed hyperparameter optimization based on Apache Spark
Author/s:
  • Meister, Moritz Johannes
Contributor/s:
  • Mozo Velasco, Alberto
  • Downling, Jim
Item Type: Thesis (Master thesis)
Masters title: Data Science
Date: July 2019
Subjects:
Faculty: E.T.S. de Ingenieros Informáticos (UPM)
Department: Lenguajes y Sistemas Informáticos e Ingeniería del Software
Creative Commons Licenses: Recognition - No derivative works - Non commercial

Full text

[img]
Preview
PDF - Requires a PDF viewer, such as GSview, Xpdf or Adobe Acrobat Reader
Download (1MB) | Preview

Abstract

For the past two years, Hopsworks, an open-source machine learning platform, has used Apache Spark to distribute hyperparameter optimization tasks in machine learning. Hopsworks provides some basic optimizers (grid-search, random-search, differential evolution) to propose combinations of hyperparameters (trials) that are run synchronously in parallel. However, many such trials perform poorly, and waste a lot of hardware accelerator cycles on trials that could be stopped early, freeing up resources for other trials. In this thesis, the work on Maggy is presented, an open-source asynchronous and fault-tolerant hyperparameter optimization framework built on Spark. Maggy transparently schedules and manages hyperparameter trials, enabling state-of-the-art asynchronous optimization algorithms, thereby increasing resource utilization and increasing the number of trials that can be performed in a given period of time up to 30% on a fixed amount of resources. Early stopping is found to perform best when the model is sensitive, in terms of generalization performance, to the hyperparameter configurations.

More information

Item ID: 56977
DC Identifier: http://oa.upm.es/56977/
OAI Identifier: oai:oa.upm.es:56977
Deposited by: Biblioteca Facultad de Informatica
Deposited on: 21 Oct 2019 14:22
Last Modified: 21 Oct 2019 14:22
  • Logo InvestigaM (UPM)
  • Logo GEOUP4
  • Logo Open Access
  • Open Access
  • Logo Sherpa/Romeo
    Check whether the anglo-saxon journal in which you have published an article allows you to also publish it under open access.
  • Logo Dulcinea
    Check whether the spanish journal in which you have published an article allows you to also publish it under open access.
  • Logo de Recolecta
  • Logo del Observatorio I+D+i UPM
  • Logo de OpenCourseWare UPM