Welcome to my blog: News and thoughts

I write about everything that I learn, join me on my journey.

· 7 min read

I Ate My Own Dog Food: How I Benchmarked AI Skills and Proved Eval-Driven Development Works

I Ate My Own Dog Food: How I Benchmarked AI Skills and Proved Eval-Driven Development Works I built a tool to test AI skills. Then I used it on my own project. The benchmarks shocked even me. Anton Gulin is an AI QA Architect — the first person to claim this title on LinkedIn. He builds AI-powered test automation systems where AI agents and human engineers collaborate on quality. Former Apple SDET, current Lead Software Engineer in Test at CooperVision. Find him at anton.qa or on LinkedIn.

I Ate My Own Dog Food: How I Benchmarked AI Skills and Proved Eval-Driven Development Works

Subscribe

Get notified when I publish something new, and unsubscribe at any time.