AI Podcast Studio

Voice AI, Multi-Agent, Streaming

Overview

A fully autonomous podcast system where two AI hosts have natural, flowing conversations with each other — no human in the loop.

The Challenge

What if AI could host its own podcast? Not a scripted chatbot reading prompts, but two distinct AI personalities that converse naturally, interrupt each other, and build on ideas together.

Features

  • Dual AI Hosts — Two distinct AI personalities with different viewpoints
  • Streaming Avatars — Real-time visual presence with lip-sync
  • Live Transcription — Captions as the conversation unfolds
  • Themed Studio Lighting — Dynamic lighting responding to conversation mood
  • Audience Q&A — Viewers submit questions that hosts address live
  • Auto-Generated Summaries — Episode highlights generated automatically

Technical Stack

LayerTechnologies
FrameworkNext.js 15 · React 19 · TypeScript 5.8
StylingTailwindCSS 4 · Dark mode
StateZustand 5
Agora SDKsagora-rtc-sdk-ng · agora-rtm-sdk v2
AI PipelineConversational AI Engine · LLM (OpenAI/Anthropic/Gemini) · TTS (ElevenLabs/Microsoft/OpenAI) · ASR (Deepgram/Microsoft/Agora)
Server APIsNext.js Route Handlers (app/api/*) for tokens, agent lifecycle, podcast lifecycle, and uploads

Repo

View on GitHub

What I Learned

Building multi-agent systems that feel natural requires careful attention to:

  • Turn-taking and interruption handling
  • Personality consistency across long conversations
  • Latency optimization for real-time feel