Eyal Estrin

Posted on Apr 17, 2023 • Originally published at eyal-estrin.Medium

Introduction to deep learning hardware in the cloud

#aws #azure #gcp #dataops

For more than a decade, organizations have been using machine learning for use cases such as prediction, decision support, and more.

Because these workloads demand substantial compute and, in many cases, expensive hardware, the public cloud has become one of the better places to run machine learning and deep learning processes.

Terminology

Before we dive into the topic of this post, let us begin with some terminology:

Artificial Intelligence – "The ability of a computer program or a machine to think and learn", Wikipedia
Machine Learning – "The task of making computers more intelligent without explicitly teaching them how to behave", Bill Brock, VP of Engineering at Very
Deep Learning – "A branch of machine learning that uses neural networks with many layers. A deep neural network analyzes data with learned representations like the way a person would look at a problem", Bill Brock, VP of Engineering at Very

Source: https://www.simplilearn.com/tutorials/artificial-intelligence-tutorial/ai-vs-machine-learning-vs-deep-learning

Public use cases of deep learning

In this blog post, I will focus on deep learning and on the hardware available in the cloud for running it.

Deep Learning workflow

The deep learning process consists of the following steps:

Prepare – Store data in a repository (such as object storage or a database)
Build – Choose a machine learning framework (such as TensorFlow, PyTorch, Apache MXNet, etc.)
Train – Choose hardware (compute, network, storage) to train the model you have built ("learn" and optimize the model from data)
Inference – Use the trained model (at large scale) to make predictions
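To make the four steps concrete, here is a minimal sketch in plain Python. It is an illustration only, not a real deep learning setup: the synthetic dataset, the one-weight linear "model", and the function names (`prepare_data`, `build_model`, `train`, `predict`) are all assumptions made for this example. In practice the Build and Train steps would use a framework such as TensorFlow or PyTorch running on the cloud hardware discussed in this post.

```python
import random


def prepare_data(n_samples=200, seed=0):
    """Prepare: stand-in for loading data from object storage or a database.

    Generates a synthetic dataset for the hypothetical target y = 3x + 0.5.
    """
    rng = random.Random(seed)
    data = []
    for _ in range(n_samples):
        x = rng.uniform(-1.0, 1.0)
        y = 3.0 * x + 0.5 + rng.gauss(0.0, 0.01)  # small noise term
        data.append((x, y))
    return data


def build_model():
    """Build: stand-in for defining a network in a framework.

    The 'model' here is just one weight and one bias.
    """
    return {"w": 0.0, "b": 0.0}


def train(model, data, epochs=500, lr=0.1):
    """Train: full-batch gradient descent on mean squared error."""
    n = len(data)
    for _ in range(epochs):
        grad_w = 0.0
        grad_b = 0.0
        for x, y in data:
            error = (model["w"] * x + model["b"]) - y
            grad_w += 2.0 * error * x / n
            grad_b += 2.0 * error / n
        model["w"] -= lr * grad_w
        model["b"] -= lr * grad_b
    return model


def predict(model, x):
    """Inference: apply the trained parameters to a new input."""
    return model["w"] * x + model["b"]


data = prepare_data()
model = train(build_model(), data)
print("prediction for x=0.5:", predict(model, 0.5))
```

The training loop is where almost all of the computation happens, which is why the choice of hardware matters most for the Train step, while `predict` is cheap per call but may be invoked at very large scale.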

Deep Learning processors comparison (Training phase)

Below is a comparison table for the various processors available in the public cloud, dedicated to the deep learning training phase:

Additional References

Deep Learning processors comparison (Inference phase)

Below is a comparison table for the various processors available in the public cloud, dedicated to the deep learning inference phase:

Additional References

Summary

In this blog post, I have shared information about the various alternatives for using hardware available in the public cloud to run deep learning processes.

I recommend that you keep reading and expand your knowledge of both machine learning and deep learning, of the services available in the cloud, and of the use cases where deep learning can deliver results.

Additional References

About the Author

Eyal Estrin is a cloud and information security architect, the owner of the blog Security & Cloud 24/7 and the author of the book Cloud Security Handbook, with more than 20 years in the IT industry.

Eyal has been an AWS Community Builder since 2020.

You can connect with him on Twitter and LinkedIn.
