Deploy Arcee AFM-4.5B on Arm-based Google Cloud Axion with Llama.cpp: Next Steps

Continue Learning

Read related resources

Find more information about the topics in this Learning Path:

Arcee AI
Announcing the Arcee Foundation Model family
Deep Dive - AFM-4.5B, the first Arcee Foundation Model
Google Cloud Axion instances
Google Cloud Compute Engine Documentation

Join the Arm Developer Program

Connect, upskill, and build with the Arm Developer Community. Join today for hands-on technical resources and education materials, along with the support of Arm engineers and the broader ecosystem.
