Google Launched Gemini 3.1 Flash Live - Here's What It Means for Your AU Sales Voice Stack
Gemini 3.1 Flash Live cuts AI voice agent costs by ~90%. Here is what changed, what it costs per call and the right entry point for AU B2B founders.
On March 25, Google launched Gemini 3.1 Flash Live via the Gemini Live API. For anyone building voice AI into their sales stack, the cost numbers are worth stopping for.
We are talking about a ~90% drop in the infrastructure cost of running an AI voice agent.
What Gemini 3.1 Flash Live actually does differently
The old approach runs four separate steps: voice activity detection, speech-to-text, the language model, then text-to-speech. Each step adds latency. Each handoff adds cost.
Gemini 3.1 Flash Live removes the whole chain. Native audio in, native audio out, using bi-directional WebSocket streaming. Sub-second latency. 90+ languages. Built-in noise filtering. And live tool-calling during active calls.
The cost numbers that change the conversation
A detailed breakdown shared on r/B2BSaaS on March 28 put the combined audio cost at roughly $0.023 per minute. About $0.005/min for audio input and $0.018/min for output.
For a typical AU B2B service business receiving 80 inbound enquiries per month (5-minute average call), your AI infrastructure cost is under $10/month. Even modelled conservatively at 3x that, you are under $25/month to qualify every inbound lead via voice.
The feature that matters most for sales
Live tool-calling during active calls. The AI can check your CRM, look up calendar availability and book a meeting - all without the caller noticing a pause.
The model also outperforms previous Gemini 2.5 Flash on multi-step function calling by 19%. That is directly relevant to complex sales qualification flows.
The right entry point for AU founders
The temptation is to think about cold calling. That is not where to start.
The right entry point is inbound. The lead has already raised their hand. And right now, most businesses are letting them hit a voicemail or wait 24 hours for a callback.
That gap is where AI voice earns its keep. At under $25/month for qualification, the case for fixing it is very hard to argue against.
We use voice agents in our AI SDR stack at Njin. For how this fits into a broader voice setup, see the complete guide to voice AI for sales. For the speed-to-lead data, see the 60-second vs 60-minute difference.
If you are thinking about outbound voice, read the compliance guide first. DNCR rules apply to automated voice calls.
The Njin voice AI solution shows what a live inbound qualification flow looks like.