Welcome to my blog: News and thoughts

I write about everything that I learn, join me on my journey.

· 6 min read

Create Video Receipts for AI Agents with Playwright Screencast API

Playwright v1.59.0 ships the Screencast API, letting AI agents produce verifiable video evidence of their work. Engineers can replay agent actions with chapter markers and action annotations—no manual test replay required. Setup is three lines: start the screencast, run your agent logic, stop and save. This is the observability layer agentic workflows have been missing.

Create Video Receipts for AI Agents with Playwright Screencast API

· 7 min read

I Ate My Own Dog Food: How I Benchmarked AI Skills and Proved Eval-Driven Development Works

I Ate My Own Dog Food: How I Benchmarked AI Skills and Proved Eval-Driven Development Works I built a tool to test AI skills. Then I used it on my own project. The benchmarks shocked even me. Anton Gulin is an AI QA Architect — the first person to claim this title on LinkedIn. He builds AI-powered test automation systems where AI agents and human engineers collaborate on quality. Former Apple SDET, current Lead Software Engineer in Test at CooperVision. Find him at anton.qa or on LinkedIn.

I Ate My Own Dog Food: How I Benchmarked AI Skills and Proved Eval-Driven Development Works

Subscribe

Get notified when I publish something new, and unsubscribe at any time.