Skip to main navigation Skip to search Skip to main content

Time estimation for deep learning model’s inference in distributed processing units

  • Ernesto Portugal
  • , Angel Ayala
  • , Francisco Cruz
  • , Bruno Fernandes
  • , Sergio Murilo

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

One problem with cloud computing is that it may fail to meet the desired time limits for real-time applications. In this regard, fog computing paradigm has gained ground as it complements the cloud by providing nodes with processing and storage capabilities closer to the data generation level. However, this level of the architecture has limited resources, making it necessary to efficiently distribute the workload involved in applications, especially when employing deep learning models. One technique to achieve this is task offloading, which involves distributing inference tasks throughout the architecture. Nevertheless, it is also important to know the time required for these tasks to be carried out within the network in order to obtain the desired response. In this work, we propose a queue-based convolutional neural network that allows estimating the response time for a deep learning inference task. Preliminary results demonstrate a good fit to the behavior of the datasets used in the experiment.

Original languageEnglish
Title of host publication2023 IEEE Latin American Conference on Computational Intelligence, LA-CCI 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350348071
DOIs
StatePublished - 2023
Event2023 IEEE Latin American Conference on Computational Intelligence, LA-CCI 2023 - Recife-Pe, Brazil
Duration: 29 Oct 20231 Nov 2023

Publication series

Name2023 IEEE Latin American Conference on Computational Intelligence, LA-CCI 2023

Conference

Conference2023 IEEE Latin American Conference on Computational Intelligence, LA-CCI 2023
Country/TerritoryBrazil
CityRecife-Pe
Period29/10/231/11/23

Keywords

  • convolution time series
  • deep learning
  • fog computing
  • time estimation

Fingerprint

Dive into the research topics of 'Time estimation for deep learning model’s inference in distributed processing units'. Together they form a unique fingerprint.

Cite this