youtube-transcript.ai

Stanford MS&E435 Economics of the AI Supercycle | Spring 2026 | Applications, Applied AI

Founders, engineers, and investors interested in the economics and infrastructure of AI development and deployment.

TL;DR

This video features Tuhin, CEO of Baseten, discussing the exponential growth of AI inference and the critical role of infrastructure in powering AI companies. He explains how Baseten optimizes inference for custom AI models, enabling businesses like Wispr Flow and Abridge to achieve better performance and reliability than they would get by relying solely on frontier models or generic cloud providers.

In This Video

  1. 00:09 Inference Is About to Explode

    The host predicts a billion-fold increase in inference and introduces the guest, whose company is helping drive that growth.

  2. 00:31 Tuhin's Entrepreneurial Journey

    Tuhin, CEO of Baseten, shares his winding path from finance to machine learning research and startups.

  3. 02:07 Falling for Early-Stage Tech

    After moving to San Francisco, Tuhin fell in love with building products in small teams at early-stage tech companies.

  4. 02:57 Founding Baseten

    Tuhin and his co-founders started Baseten in 2019 to build infrastructure for the rapidly growing machine learning field.

  5. 03:56 Baseten's Customer Examples

    Baseten powers companies like Wispr Flow (speech-to-text) and Abridge (healthcare scribe) with optimized inference.

  6. 06:16 The Future of AI Inference

    The thesis is that AI will be massive and that inference is where its value gets delivered. Custom models are crucial for profitability.

  7. 07:28 Why Choose Baseten?

    Baseten offers strong performance, reliability, and a developer platform, addressing the main pain points of the inference stack.

Questions & Answers

What is the main focus of Baseten?
Baseten focuses on production inference, powering the fastest-growing AI companies by providing optimized infrastructure for machine learning models.
What is Wispr Flow and what does Baseten do for them?
Wispr Flow is a speech-to-text app. Baseten runs the optimizations and infrastructure behind Wispr Flow, ensuring low latency for its voice typing experience.
What is Abridge and how does Baseten support it?
Abridge is an ambient scribe for healthcare, integrated with electronic medical records (EMRs). Baseten runs dozens of its models, including speech-to-text and models that generate clinical notes, ensuring reliability and speed.
Why do companies choose Baseten over cloud providers like AWS or GCP?
Companies choose Baseten for performance optimizations, reliability across clouds, and a developer platform that offers flexibility and security, all of which are difficult to achieve independently on general cloud providers.
What is the economic advantage of using open-source models compared to frontier models?
Open-source models are about 90 days behind frontier models but can be run 70-90% cheaper, making them a cost-effective option for many AI applications.
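
To make the 70-90% figure concrete, here is a minimal Python sketch of the arithmetic. The function name `open_source_cost` and the $100,000 monthly baseline are hypothetical, chosen only for illustration. Neither comes from the video, and real savings depend on the workload.

```python
def open_source_cost(frontier_cost: float, savings: float) -> float:
    """Return the cost of a workload after applying a fractional saving.

    `savings` is a fraction between 0 and 1 (0.70 means 70% cheaper).
    """
    if not 0.0 <= savings <= 1.0:
        raise ValueError("savings must be between 0 and 1")
    return frontier_cost * (1.0 - savings)


# Hypothetical monthly spend on a frontier-model API (assumed, not from the video).
frontier_monthly = 100_000.0

# The 70% and 90% bounds are the range quoted in the answer above.
upper = open_source_cost(frontier_monthly, 0.70)  # smaller saving, higher cost
lower = open_source_cost(frontier_monthly, 0.90)  # larger saving, lower cost

print(f"Frontier model:    ${frontier_monthly:,.0f} per month")
print(f"Open-source range: ${lower:,.0f} to ${upper:,.0f} per month")
```

Under that assumed baseline, the same workload would cost roughly $10,000 to $30,000 per month on an open-source model.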

Source

YouTube video. Original: https://www.youtube.com/watch?v=Qh7Oxvo5sJI
Transcript captured and processed by youtube-transcript.ai on 2026-06-08.