Hi, I liked your approach and it inspired me to do something similar for the evaluation of my chatbot. It's a slightly different approach but uses the same principle of decomposing the evaluation approach.
Thanks for writing this piece!
If you want to see how I used your approach, check it out here: sebastianpdw.medium[.]com/evaluating-ai-chatbots-ai-engineering-in-action-b8cfd0351635