Really impressive pipeline design. The 5-stage approach with MediaPipe for keypoint extraction instead of raw video processing is smart — it solves the latency problem that kills most real-time CV projects. I've worked on automation pipelines where similar dimensional reduction tricks made the difference between a demo and a production system. Curious about Part 2 — how do you handle the keypoint normalization across different camera distances and angles? That's usually where things get tricky in real deployments.