Theresa Fruhwuerth
Breaking problems down to first principle - then building back up with caffeine.
Introduction I have been tinkering with LLMs at work and outside now for quite a while and one of the most pressing issues compared to traditional machine learning is the unsolved problem of how to evaluate them. Evaluating LLM outputs is exponential...
llmshowto.com14 min read
Sebastian
Hi, I liked your approach and it inspired me to do something similar for the evaluation of my chatbot. It's a slightly different approach but uses the same principle of decomposing the evaluation approach.
Thanks for writing this piece!
If you want to see how I used your approach, check it out here: sebastianpdw.medium[.]com/evaluating-ai-chatbots-ai-engineering-in-action-b8cfd0351635