Who is this for?
This is an introductory topic for software developers interested in building an on-device customer support chatbot for Android using Meta's Llama models and the ExecuTorch runtime.
What will you learn?
Upon completion of this Learning Path, you will be able to:
- Set up a development environment for building and deploying ExecuTorch-based apps on Android
- Describe how ExecuTorch uses KleidiAI kernels to accelerate performance on Arm-based platforms
- Export a Llama 3.2 model to .pte format optimized for on-device inference
- Run a Llama model on an Arm-powered Android phone and verify inference performance
- Build and run an Android chat app configured as a customer support assistant
Prerequisites
Before starting, you will need the following:
- An Apple M1/M2/M3 development machine, or a Linux machine with at least 16GB of RAM
- An Arm-powered smartphone with the i8mm feature running Android, with 16GB of RAM
- A USB cable to connect your smartphone to your development machine
- Android Debug Bridge (adb) installed. Follow the steps in
adb
to install Android SDK Platform Tools
- Java 17 JDK. Follow the steps in
Java SE 17 Archive Downloads
to download and install JDK for your host
- Python 3.10 or later
- A
Hugging Face
account with access to Meta Llama models