Speechlab and Realtime Translation with Ivan Galea

Speech technology has been around for a long time, but in the last 12 months it has taken a major leap forward. New speech synthesis models can produce speech that is often indistinguishable from a real human voice. I’m sure many listeners have heard deepfakes in which computer-generated speech perfectly mimics the voices of famous actors or public figures. A major factor driving these ongoing advances is generative AI.

Speechlab is at the forefront of using new AI techniques for real-time dubbing, which is the process of converting speech from one language into another. For the interested listener, we recommend hearing the examples of President Obama speaking Spanish or Elon Musk speaking Japanese in this YouTube video. Ivan Galea is the Co-founder and President at Speechlab, and he joins the show to talk about how we’re on the cusp of reaching the holy grail of speech technology, real-time dubbing, and how this will erase barriers to communication and likely transform the world.

This episode is hosted by Lee Atchison. Lee Atchison is a software architect, author, and thought leader on cloud computing and application modernization. His best-selling book, Architecting for Scale (O’Reilly Media), is an essential resource for technical teams looking to maintain high availability and manage risk in their cloud environments.
Lee is the host of Modern Digital Business, an engaging and informative podcast for people looking to build and grow their digital business with the help of modern applications and processes developed for today’s fast-moving business environment. Listen at mdb.fm. Follow Lee at softwarearchitectureinsights.com, and see all his content at leeatchison.com.

Sponsors

When did scraping public web data become so hard?

Well, it just got a lot easier with the tools from Bright Data, designed specifically for scraping at scale.

Bright Data has a new headful browser to easily make API calls and fetch any number of browser sessions, then interact with them using Puppeteer, Playwright, or even Selenium. Scale up to as many browsers as you need, and host them all on Bright Data’s award-winning proxy network.

Bright Data also unblocks sites for you with its CAPTCHA solvers and fingerprints.

Listeners of Software Engineering Daily get $25 of free credit on Bright Data’s platform. Go to softwareengineeringdaily.com/brightdata, or check the show notes for the link.

WorkOS is a developer platform to make your app enterprise-ready. With a few simple APIs, you can immediately add common enterprise features like Single Sign-On, SAML, SCIM user provisioning, and more. Developers will find beautiful docs and SDKs that make integration a breeze. WorkOS is kind of like “Stripe for enterprise features.” WorkOS powers apps like Webflow, Hopin, Vercel, and more than 100 others. The platform is rock solid, fully SOC-2 compliant, and ready for even the largest enterprise environments. So what are you waiting for? Integrate WorkOS today and make your app enterprise-ready. To learn more and get started, go to softwareengineeringdaily.com/workos

Today’s podcast is brought to you by DoiT. An award-winning strategic partner of AWS, Google Cloud and Microsoft Azure, DoiT works alongside more than 3,000 customers to save them time and money. Combining intelligent software with expert consultancy and unlimited support, DoiT delivers the true promise of the cloud at peak efficiency with ease, not cost. Turbocharge your growth and optimize your cloud investment so you can concentrate on what you’re good at: quickly growing your business. Learn more at doit.com

Software Daily

Subscribe to Software Daily, a curated newsletter featuring the best and newest from the software engineering community.