Real World Speech Interfaces: What Builders Are Seeing

by Jaim Zuber | at Minnebar20

Are you building with speech, or thinking about it? Let’s compare notes.

Speech interfaces (e.g. speech-to-text, text-to-speech) have become one of the most practical (and widespread) ways AI shows up in real products.

2025 was the year of AI meeting note takers. Transcription apps are everywhere (and seemingly popped up out of nowhere).

2026 is about dictation. Wispr Flow and Willow are changing how we interact with computers. Prompting Claude from a keyboard is sooo 2025.

Yet, it feels like we’ve only scratched the surface of what these technologies can do.

Building in the space? Let’s talk shop and make wild guesses about where this is headed.

Jaim Zuber

Apple Platforms Engineering and Leadership. Mobile and desktop.

Hacking on speech-to-text (ASR) systems while he searches for the next gig.

He likes hockey, BBQ, and making noise with a modest array of instruments… sometimes in public.

Links:
- jaimzuber.com
- BlueSky
- Mastodon
- GitHub

