I actually wrote 52 assertions in total, but featured only the top 10 in the article for clarity. On that first run, 3 of those featured tests passed and 7 failed. The state-management issues were triggered by specific sequences the AI discovered through path exploration. It wasn't just a linear script, but an agent finding edge cases by interacting with the UI in ways I hadn't anticipated.
