· 4 min read
Eval-Driven Development for AI Agent Skills
Why skills need testing, not just writing — and how to do it systematically.
I write about everything that I learn, join me on my journey.
· 4 min read
Why skills need testing, not just writing — and how to do it systematically.
· 6 min read
Technical decisions and lessons learned from rewriting a Python CLI tool as an OpenCode plugin.
· 7 min read
I Ate My Own Dog Food: How I Benchmarked AI Skills and Proved Eval-Driven Development Works I built a tool to test AI skills. Then I used it on my own project. The benchmarks shocked even me. Anton Gulin is an AI QA Architect — the first person to claim this title on LinkedIn. He builds AI-powered test automation systems where AI agents and human engineers collaborate on quality. Former Apple SDET, current Lead Software Engineer in Test at CooperVision. Find him at anton.qa or on LinkedIn.
· 6 min read
A practical walkthrough for creating, testing, and installing production-grade OpenCode skills.
· 2 min read
Master Page Object Model in Playwright & TypeScript (2026). Structure scalable tests, copy real-world architecture patterns, and speed up testing.
· 4 min read
A practical, step-by-step guide to migrating your Selenium test suite to Playwright. Includes code comparison, common pitfalls, and a migration strategy that won't disrupt your team.
· 5 min read
Flaky tests destroy team confidence and slow deployment. Here are 10 proven strategies to eliminate flaky Playwright tests — from someone who's fixed thousands of them.
· 4 min read
Learn Playwright from scratch. This step-by-step tutorial will have you writing your first automated test in under 10 minutes — no prior automation experience required.
· 4 min read
Which test automation framework should you choose in 2026? A side-by-side comparison of Playwright, Cypress, and Selenium based on real-world experience testing for Apple, Fortune 500 companies, and enterprise teams.
Get notified when I publish something new, and unsubscribe at any time.