Introduction
Getting started with Microsoft Azure Cobalt 100, Azure Linux 3.0, and Apache Spark
Create an Azure Cobalt 100 Arm64 virtual machine
Set up an Azure Linux 3.0 environment
Install Apache Spark on Azure Cobalt 100 processors
Validate Apache Spark on Azure Cobalt 100 Arm64 VMs
Benchmark Apache Spark
Next Steps
You can deploy your Spark workload either in an Azure Linux 3.0 Docker container or on a virtual machine created from a custom Azure Linux 3.0 image.
The Azure Linux Container Host is an operating system image optimized for running container workloads on Azure Kubernetes Service (AKS). Microsoft maintains the Azure Linux Container Host, which is based on CBL-Mariner, an open-source Linux distribution created by Microsoft.
To learn more, see What is Azure Linux Container Host for AKS?
Azure Linux 3.0 supports AArch64. However, a standalone virtual machine image for Azure Linux 3.0 or CBL Mariner 3.0 is not yet available for Arm. To use the default Microsoft software stack, you can run a Docker container with Azure Linux 3.0 as the base image and run your Spark application inside the container.
The Microsoft Artifact Registry offers updated Docker images for Azure Linux 3.0.
To run a Docker container with Azure Linux 3.0, install Docker and run:
sudo docker run -it --rm mcr.microsoft.com/azurelinux/base/core:3.0
The default container starts with a Bash shell. Both tdnf
and dnf
are available as package managers inside the container.
Currently, the Azure Marketplace offers official virtual machine images of Azure Linux 3.0 only for x86_64
architectures, published by Ntegral Inc. While native Arm64 (AArch64) images are not yet available, you can create your own custom Azure Linux 3.0 virtual machine image for AArch64 using the
AArch64 ISO for Azure Linux 3.0
.
For detailed steps, see Create an Azure Linux 3.0 virtual machine with Cobalt 100 processors .
Whether you use an Azure Linux 3.0 Docker container or a virtual machine created from a custom image, the Spark deployment and benchmarking steps in the following sections remain the same.
Once the setup is complete, continue to the next section to install and run Spark.