Voice-based LLM applications often rely primarily on transcribed text from speech input, such as in interactions with non-player characters in games or voice assistants. This approach can overlook vocal cues—like tone, pitch, and emotion—present in a speaker’s voice. As a result, responses may feel less natural and may not fully capture the user’s underlying intent.
To address this, voice-based sentiment classification analyzes the audio input to determine the user's emotional state, which is then incorporated into the LLM prompt to enable more context-aware responses. In this Learning Path, you'll build a sentiment-aware voice assistant that runs entirely on-device. The application records audio, transcribes the speech to text using Whisper, classifies sentiment directly from the voice signal, and combines the transcript with the voice-based sentiment to guide responses from a local LLM running with llama.cpp.
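To make the flow concrete, here is a minimal sketch of the record-transcribe-respond loop. It assumes the sounddevice, openai-whisper, and llama-cpp-python packages; the recording length, Whisper model size, and GGUF model path (model.gguf) are placeholder choices rather than values fixed by this Learning Path:

```python
# Minimal baseline sketch: record audio, transcribe with Whisper,
# and respond with a local LLM via llama.cpp.
# Assumes: pip install sounddevice scipy openai-whisper llama-cpp-python
import sounddevice as sd
from scipy.io import wavfile
import whisper
from llama_cpp import Llama

SAMPLE_RATE = 16000  # Whisper expects 16 kHz mono audio

def record(seconds: float, path: str = "input.wav") -> str:
    """Record from the default microphone and save as a WAV file."""
    audio = sd.rec(int(seconds * SAMPLE_RATE), samplerate=SAMPLE_RATE,
                   channels=1, dtype="int16")
    sd.wait()  # block until the recording finishes
    wavfile.write(path, SAMPLE_RATE, audio)
    return path

# Transcribe the recording; "base" is a placeholder model size
stt = whisper.load_model("base")
transcript = stt.transcribe(record(5.0))["text"].strip()

# Generate a response with a local LLM ("model.gguf" is a placeholder path)
llm = Llama(model_path="model.gguf")
reply = llm.create_chat_completion(
    messages=[
        {"role": "system", "content": "You are a helpful voice assistant."},
        {"role": "user", "content": transcript},
    ],
)
print(reply["choices"][0]["message"]["content"])
```

This baseline responds to the words alone; the rest of the Learning Path adds the voice-based sentiment signal on top of it.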
Voice sentiment classification pipeline
You'll start by building a baseline voice-to-LLM pipeline: capturing audio, transcribing it to text, and passing the transcript to an LLM to generate responses. You'll then extend this pipeline with a voice-based sentiment classification model, as sketched below. This involves training the model, optimizing it for efficient on-device inference, and integrating it into a unified application.
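As an illustration of the extension step, the sketch below classifies sentiment from the recorded audio and folds the result into the LLM system prompt. It assumes a hypothetical classifier trained on MFCC features and exported to ONNX as sentiment.onnx, with an illustrative three-class label set; the actual model, features, and labels come from the training steps later in this Learning Path:

```python
# Sketch of the sentiment extension. "sentiment.onnx" and the label set
# are assumptions for illustration, not artifacts shipped with this guide.
# Assumes: pip install numpy librosa onnxruntime
import numpy as np
import librosa
import onnxruntime as ort

LABELS = ["negative", "neutral", "positive"]  # assumed label order
session = ort.InferenceSession("sentiment.onnx")  # hypothetical trained model

def classify_sentiment(wav_path: str) -> str:
    """Classify sentiment directly from the voice signal."""
    y, sr = librosa.load(wav_path, sr=16000)
    # Summarize the clip as the mean of 40 MFCC coefficients over time
    mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=40).mean(axis=1)
    input_name = session.get_inputs()[0].name
    logits = session.run(None, {input_name: mfcc[np.newaxis, :].astype(np.float32)})[0]
    return LABELS[int(np.argmax(logits))]

# Combine the voice-based sentiment with the transcript in the LLM prompt
sentiment = classify_sentiment("input.wav")
system_prompt = (
    "You are a helpful voice assistant. "
    f"The user's tone of voice sounds {sentiment}; respond accordingly."
)
```

Passing the sentiment through the system prompt, rather than appending it to the user's words, keeps the transcript intact while still letting the LLM adapt its tone to how the user sounded.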