Fri Jan 16, 2026 7:08pm PST

Why is nobody using this? Full-duplex voice streaming with Gemini Live in React

I built a React hook for real-time voice conversations with Google's Gemini Live API and I'm confused why more people aren't doing this.

Gemini Live is capable: it offers full-duplex audio (you can interrupt the AI mid-sentence), screen sharing so the AI sees what you're looking at, tool calling, and built-in voice activity detection (VAD). But using it from a browser is painful:

- Browser audio is 48kHz, Gemini wants 16kHz in and sends 24kHz out
- PCM16 endianness conversions
- Buffer management to avoid clicks and gaps
- Keeping your API key out of client code
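To give a feel for the first two bullets, here is a minimal sketch of the conversions involved. It is not the code from the repo: the function names are made up for illustration, the downsampler is a crude block average rather than a proper resampling filter, and it assumes mono audio and little-endian PCM16 on the wire.

```typescript
// Hypothetical helpers for illustration only; not the library's actual API.

// Downsample mono Float32 audio by averaging each run of input samples
// that maps onto one output sample (crude low-pass plus decimation).
function downsample(input: Float32Array, fromRate: number, toRate: number): Float32Array {
  if (toRate > fromRate) throw new Error("downsample only reduces the sample rate");
  if (toRate === fromRate) return input.slice();
  const ratio = fromRate / toRate;
  const outLength = Math.floor(input.length / ratio);
  const output = new Float32Array(outLength);
  for (let i = 0; i < outLength; i++) {
    const start = Math.floor(i * ratio);
    const end = Math.min(input.length, Math.floor((i + 1) * ratio));
    let sum = 0;
    for (let j = start; j < end; j++) sum += input[j];
    output[i] = sum / (end - start);
  }
  return output;
}

// Convert Float32 samples in [-1, 1] to 16-bit signed PCM.
// DataView lets us force little-endian regardless of the host CPU.
function floatToPcm16LE(samples: Float32Array): ArrayBuffer {
  const buffer = new ArrayBuffer(samples.length * 2);
  const view = new DataView(buffer);
  for (let i = 0; i < samples.length; i++) {
    const s = Math.max(-1, Math.min(1, samples[i])); // clamp to avoid wraparound
    view.setInt16(i * 2, Math.round(s < 0 ? s * 0x8000 : s * 0x7fff), true);
  }
  return buffer;
}

// Decode little-endian PCM16 (e.g. a 24kHz response chunk) back to Float32
// so it can be copied into a Web Audio AudioBuffer for playback.
function pcm16LEToFloat(buffer: ArrayBuffer): Float32Array {
  const view = new DataView(buffer);
  const out = new Float32Array(Math.floor(buffer.byteLength / 2));
  for (let i = 0; i < out.length; i++) {
    out[i] = view.getInt16(i * 2, true) / 0x8000;
  }
  return out;
}
```

In a browser, the microphone path would be roughly `floatToPcm16LE(downsample(chunk, 48000, 16000))`. Going through `DataView` with an explicit endianness flag, instead of writing into an `Int16Array`, is what keeps the byte order independent of the platform.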

So I wrapped it into a single hook:

    const { connect, transcripts, isConnected } = useGeminiLive({
      proxyUrl: 'wss://your-project.supabase.co/functions/v1/gemini-live-proxy'
    });

It includes a Supabase Edge Function proxy, screen sharing, auto-reconnection, real-time transcription, and full TypeScript support.
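On auto-reconnection: the post doesn't spell out the retry policy, so the following is only a sketch of one common approach, exponential backoff with jitter. The name `backoffDelayMs` and the default values are assumptions for illustration, not something taken from the repo.

```typescript
// Hypothetical helper; the hook's real reconnection logic may differ.
// Returns how long to wait before reconnect attempt number `attempt` (0-based).
function backoffDelayMs(
  attempt: number,
  baseMs: number = 500,
  maxMs: number = 30000,
  random: () => number = Math.random // injectable so the jitter can be tested
): number {
  // Double the delay on every attempt, but never exceed maxMs.
  const capped = Math.min(maxMs, baseMs * 2 ** attempt);
  // "Equal jitter": half the delay is fixed, half is randomized, so many
  // clients dropped at the same moment don't all reconnect in lockstep.
  return capped / 2 + random() * (capped / 2);
}
```

A reconnect loop would call this after each failed WebSocket connection, wait that long, and reset `attempt` to 0 once a connection succeeds.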

GitHub: https://github.com/loffloff/gemini-live-react

Everyone builds voice AI with OpenAI's Realtime API, but Gemini Live is cheaper and screen sharing is underrated. Am I missing something?
