What you've learned

You should now know how to:

  • Download and build llama.cpp on your Arm server
  • Download a pre-quantized Llama 2 model from Hugging Face
  • Re-quantize the model weights to take advantage of Arm improvements
  • Compare the performance of the pre-quantized Llama 2 model weights with that of the re-quantized weights on your Arm CPU (recapped in the command sketch below)
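
As a quick recap, the steps above map onto commands like the following. This is a sketch rather than a verbatim copy of the Learning Path: the binary names (`llama-quantize`, `llama-bench`), the Arm-optimized quantization type `Q4_0_4_8`, and the Hugging Face repository `TheBloke/Llama-2-7B-GGUF` are assumptions that can vary with your llama.cpp release and the model you chose.

```bash
# Sketch only: adjust paths, binary names, and quantization types to
# match your llama.cpp version and the model you downloaded.

# 1. Download and build llama.cpp (CPU-only; no GPU toolkit required)
git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp
cmake -B build
cmake --build build --config Release -j $(nproc)

# 2. Download a pre-quantized Llama 2 model in GGUF format from Hugging Face
#    (TheBloke/Llama-2-7B-GGUF is one commonly used source)
wget https://huggingface.co/TheBloke/Llama-2-7B-GGUF/resolve/main/llama-2-7b.Q4_0.gguf

# 3. Re-quantize the Q4_0 weights into an Arm-optimized layout
#    (types such as Q4_0_4_8 existed in older releases; newer builds
#    instead repack plain Q4_0 weights for Arm kernels at load time)
./build/bin/llama-quantize --allow-requantize \
    llama-2-7b.Q4_0.gguf llama-2-7b.Q4_0_4_8.gguf Q4_0_4_8

# 4. Compare throughput of the original and re-quantized weights on the CPU
./build/bin/llama-bench -m llama-2-7b.Q4_0.gguf
./build/bin/llama-bench -m llama-2-7b.Q4_0_4_8.gguf
```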

Knowledge Check

Can you run LLMs on Arm CPUs?

Can llama.cpp be built and run on CPU only?

Can you profile the time the model takes to generate output, up to the end-of-text token?
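
On the profiling question: llama.cpp prints a timing summary at the end of each run, covering model load time, prompt evaluation, and token generation speed, so you can measure generation time from the command line without extra tooling. A minimal sketch, assuming the `llama-cli` binary name used by recent builds (older releases call it `main`) and the re-quantized model file from the sketch above:

```bash
# Generate up to 64 tokens (the run stops earlier if the model emits the
# end-of-text token) and read the timing summary printed at the end.
./build/bin/llama-cli -m llama-2-7b.Q4_0_4_8.gguf \
    -p "Explain quantization in one paragraph." -n 64
```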

