Abstract
Consider a device connected to an edge processor via a communication channel. The device holds local data that is to be offloaded to the edge processor in order to train a machine learning model, e.g., for regression or classification. Transmission of the data to the learning processor, as well as training based on Stochastic Gradient Descent (SGD), must both be completed within a time limit. Assuming that communication and computation can be pipelined, this letter investigates the optimal choice of the packet payload size, given the overhead of each data packet transmission and the ratio between the computation and communication rates. This amounts to a tradeoff between bias and variance: communicating the entire data set first reduces the bias of the training process, but it may not leave sufficient time for learning. Analytical bounds on the expected optimality gap are derived to enable an effective optimization, which is validated in numerical results.
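The bias-variance tradeoff described in the abstract can be illustrated with a toy simulation. The sketch below is not the paper's model: all parameters (per-packet overhead, communication and computation rates, deadline, learning rate) and the linear-regression setup are illustrative assumptions. Data arrives packet by packet while SGD runs in a pipelined fashion on whatever samples have been received so far; a tiny payload wastes time on per-packet overhead and delivers few samples (high bias), while a larger payload delivers more of the data set within the deadline.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy linear-regression data set (assumed, not from the paper).
N, d = 1000, 20
X = rng.normal(size=(N, d))
w_true = rng.normal(size=d)
y = X @ w_true + 0.1 * rng.normal(size=N)

def run(payload, T=1.0, overhead=0.05, comm_rate=2000.0,
        comp_rate=5000.0, lr=0.01):
    """Pipelined transmission + SGD within deadline T (all units illustrative).

    payload:   samples per packet; each packet takes overhead + payload/comm_rate s.
    comp_rate: SGD steps per second; each step uses one already-received sample.
    Returns the mean squared error of the learned model on the full data set.
    """
    w = np.zeros(d)
    t_pkt = overhead + payload / comm_rate   # time to deliver one packet
    steps = int(T * comp_rate)               # SGD steps that fit in the deadline
    for k in range(steps):
        t = (k + 1) / comp_rate              # wall-clock time of this step
        n_avail = min(N, int(t / t_pkt) * payload)  # samples delivered so far
        if n_avail == 0:
            continue                         # nothing received yet; step is idle
        i = rng.integers(n_avail)            # sample uniformly from received data
        g = (X[i] @ w - y[i]) * X[i]         # stochastic gradient of squared loss
        w -= lr * g
    return float(np.mean((X @ w - y) ** 2))
```

Under these assumed parameters, a payload of 1 sample per packet delivers only a handful of samples before the deadline (the overhead dominates), so the trained model is heavily biased, whereas a moderate payload delivers most of the data set while still leaving time for SGD steps.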
| Original language | English |
|---|---|
| Article number | 8736251 |
| Pages (from-to) | 1542-1546 |
| Number of pages | 5 |
| Journal | IEEE COMMUNICATIONS LETTERS |
| Volume | 23 |
| Issue number | 9 |
| Early online date | 13 Jun 2019 |
| DOIs | |
| Publication status | Published - 1 Sept 2019 |
Keywords
- Machine learning
- mobile edge computing
- stochastic gradient descent