Writing

Articles

Thinking out loud about product management, AI agents, and building with LLMs.

March 22, 2026/5 min read

Does the inference platform matter?

I deployed Whisper Large v3 on four inference platforms and got up to 67 percentage points of WER divergence on the same audio file. Same model, same input, different output.

Speech-to-TextInference PlatformsEvaluation

March 15, 2026/6 min read

Evaluating speech-to-text models for Indian banking

How do you evaluate a model that has no system prompt? I tested three ASR providers on code-mixed banking conversations and found that my measurement was more broken than the models.

Speech-to-TextASREvaluation

March 1, 2026/3 min read

What I learned building a speech-to-text app from scratch

Why do dictated words just 'appear' and why pay for Wispr Flow when open-source models exist? I built a local STT app to find out.

Speech-to-TextAI ProductsmacOS

February 22, 2026/2 min read

Revisiting the questions AI asked me: An ode to the AskUserQuestion tool

The QnA with Claude are the best part of my AI sessions. So I built a tool to resurface them.

Claude CodeAI ToolsReflection

February 15, 2026/4 min read

Keeping context fresh for PM worklfows

How I leverage Claude Code with Claude in Chrome to keep PM context fresh and automated across recurring data workflows

Claude CodeClaude in ChromeCustom Skills

February 9, 2025/4 min read

Agent Teams for Product Managers

Can AI agents that argue with each other help a PM stress-test a product hypothesis? I tested Anthropic's Agent Teams feature to find out

Claude CodeAgent Teams