Benchmark Rust performance using Criterion

Deploy Rust on Google Cloud C4A (Arm-based Axion VMs)

Overview

This section shows how to benchmark Rust code with cargo bench and the Criterion library, measuring execution time and run-to-run consistency on Arm64 hardware.

Create a benchmark project

Create a new Rust project specifically for benchmarking:

cargo new rust-benchmark
cd rust-benchmark

Configure Criterion as a dependency

Criterion is a widely used, statistics-driven benchmarking crate for Rust. Open the Cargo.toml file in your project root directory, keep the [package] section that cargo new generated, and replace the empty [dependencies] section with:

[dependencies]
criterion = "0.5"

[[bench]]
name = "my_benchmark"
harness = false

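This configuration adds Criterion to the project and registers a benchmark target named my_benchmark. Setting harness = false disables Cargo's built-in benchmark harness for that target so that Criterion's criterion_main! macro can supply the main function instead.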

Create the benchmark directory and file

Create the benchmark structure that Cargo expects:

mkdir benches

Create a new file named my_benchmark.rs in the benches/ directory and add the following benchmark code to measure Fibonacci number calculation performance:

use criterion::{black_box, Criterion, criterion_group, criterion_main};

// Example benchmark function
fn fibonacci(n: u64) -> u64 {
    match n {
        0 => 0,
        1 => 1,
        n => fibonacci(n - 1) + fibonacci(n - 2),
    }
}

fn benchmark_fibonacci(c: &mut Criterion) {
    c.bench_function("fibonacci 20", |b| b.iter(|| fibonacci(black_box(20))));
}

criterion_group!(benches, benchmark_fibonacci);
criterion_main!(benches);

This code implements a recursive Fibonacci function and measures how long Rust takes to compute the 20th Fibonacci number. The black_box function hides the constant argument from the optimizer, so the compiler cannot precompute the result at compile time or remove the call altogether.
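Before relying on the timings, it can help to confirm that the function under test returns the expected value. The following is a minimal, standard-library-only sketch (no Criterion), assuming a toolchain recent enough to provide std::hint::black_box. The helper rough_time_per_call_ns is a hypothetical name introduced here for illustration; it gives only a coarse average and is not a substitute for Criterion's statistical analysis.

```rust
use std::hint::black_box;
use std::time::Instant;

// Same recursive definition as in benches/my_benchmark.rs.
fn fibonacci(n: u64) -> u64 {
    match n {
        0 => 0,
        1 => 1,
        n => fibonacci(n - 1) + fibonacci(n - 2),
    }
}

// Hypothetical helper: average wall-clock nanoseconds per call over `iters` calls.
fn rough_time_per_call_ns(iters: u32) -> f64 {
    let start = Instant::now();
    for _ in 0..iters {
        // black_box on the input hides the constant from the optimizer;
        // black_box on the result keeps the call from being discarded.
        black_box(fibonacci(black_box(20)));
    }
    start.elapsed().as_nanos() as f64 / f64::from(iters)
}

fn main() {
    // Correctness check: with F(0) = 0 and F(1) = 1, F(20) is 6765.
    assert_eq!(fibonacci(20), 6765);

    let ns = rough_time_per_call_ns(1_000);
    println!("fibonacci(20) = {}, roughly {:.0} ns per call", fibonacci(20), ns);
}
```

Build it with optimizations (for example, cargo run --release) if you want the rough timing to be in the same range as the Criterion result.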

Run the benchmark

Execute the benchmark using Cargo:

cargo bench

Cargo compiles your code with optimizations enabled (the bench profile) and runs the Criterion benchmarks, which warm up, collect a sample of measurements, and print a statistical summary.

The output is similar to:

Running benches/my_benchmark.rs (target/release/deps/my_benchmark-f40a307ef9cad515)
Gnuplot not found, using plotters backend
fibonacci 20            time:   [12.026 µs 12.028 µs 12.030 µs]
Found 1 outliers among 100 measurements (1.00%)
  1 (1.00%) low mild

Performance summary

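The benchmark output provides several key metrics. The three values in time: [12.026 µs 12.028 µs 12.030 µs] are the lower bound, the best estimate, and the upper bound of Criterion's confidence interval (95% by default) for the time per iteration; they are not the minimum and maximum of individual runs. The outliers line counts measurements that fall unusually far from the rest of the sample; here one of 100 measurements was classified as low mild, meaning it was slightly faster than typical. The Gnuplot message indicates that Criterion used its plotters backend to draw the report charts because Gnuplot wasn't found on the system.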
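To make the outlier labels more concrete, here is a rough sketch of a Tukey-style classification, which is the approach Criterion's documentation describes for its outlier report. It assumes fences at 1.5 and 3 times the interquartile range and takes the quartiles as given inputs; the function name, the enum, and the sample values are hypothetical and are not taken from Criterion's source code or from a real run.

```rust
// Labels mirror the wording used in Criterion's outlier report.
#[derive(Debug, PartialEq)]
enum OutlierLabel {
    LowSevere,
    LowMild,
    NotOutlier,
    HighMild,
    HighSevere,
}

// Hypothetical helper: classify one measurement against Tukey-style fences.
// Assumption: mild fences sit 1.5 * IQR beyond the quartiles, severe fences 3 * IQR.
fn classify(sample: f64, q1: f64, q3: f64) -> OutlierLabel {
    let iqr = q3 - q1;
    if sample < q1 - 3.0 * iqr {
        OutlierLabel::LowSevere
    } else if sample < q1 - 1.5 * iqr {
        OutlierLabel::LowMild
    } else if sample > q3 + 3.0 * iqr {
        OutlierLabel::HighSevere
    } else if sample > q3 + 1.5 * iqr {
        OutlierLabel::HighMild
    } else {
        OutlierLabel::NotOutlier
    }
}

fn main() {
    // Hypothetical quartiles in microseconds, chosen only for illustration.
    let (q1, q3) = (12.027, 12.029);
    for sample in [12.010, 12.023, 12.028, 12.033, 12.050] {
        println!("{:.3} µs -> {:?}", sample, classify(sample, q1, q3));
    }
}
```

A low mild outlier, as in the sample output, is a measurement that finished somewhat faster than the bulk of the sample, which is usually harmless.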

The following table shows results from running the benchmark on a c4a-standard-4 (4 vCPU, 16 GB memory) Arm64 VM in GCP using SUSE:

Benchmark	Estimate (µs)	Lower bound (µs)	Upper bound (µs)	Outliers	Remarks
fibonacci 20	12.028	12.026	12.030	1 of 100 (1.00%)	Narrow confidence interval with minimal variation

The Fibonacci benchmark shows consistent performance on this Arm64 VM. Criterion estimates about 12.028 µs per call to fibonacci(20), the confidence interval spans only 0.004 µs, and only one of 100 measurements was flagged as an outlier. This low variance suggests that timings on this instance are stable enough for comparing code changes, although a single micro-benchmark does not characterize Rust's performance on Arm64 in general.
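One way to quantify how tight the result is: Criterion reports the three values in time: [...] as the lower bound, estimate, and upper bound of a confidence interval, so the width of that interval relative to the estimate gives a simple stability figure. The sketch below does this arithmetic for the sample output; relative_ci_width_percent is a hypothetical helper name, not part of Criterion.

```rust
// Hypothetical helper: width of the confidence interval as a percentage of the estimate.
fn relative_ci_width_percent(lower: f64, estimate: f64, upper: f64) -> f64 {
    (upper - lower) / estimate * 100.0
}

fn main() {
    // Values copied from the sample output, in microseconds.
    let width = relative_ci_width_percent(12.026, 12.028, 12.030);
    println!("interval width: {:.3}% of the estimate", width);
}
```

For the sample run this works out to roughly 0.03% of the estimate, so the interval is very narrow compared with the measured time.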
