+1 to this :) Feels like we’ve moved from “prompt engineering” to “system engineering”. Most issues in my opinion come from context drift or state mismatches, not the model. I’ve been playing with setups where the agent is more tightly connected to the workspace (instead of just chat), and the difference is pretty noticeable. Even small things like file awareness and history make a big impact. A lot of products are working on this problem-solving so there is a choice there :D