Interesting, thanx, i use models in security, looks like a costy model Mr Oleh Kem
Today I'm using Opus 4.8, and it's working amazingly.
I am using it since yesterday and its truly great model many more to come up next.
Opus 4.8 works well, i use it from morning. Is more agentic, but think a little longer. It have more thinking options when i click Effort. now when i want tu understand my codebase faster by archtocode diagram tool Opus 4.8 create diagrams more advanced
Alessandro Pieraccini
The feeling is that the most relevant point is no longer raw benchmarks or a few percentage points of difference between models, but the progressive increase in operational reliability within complex agentic workflows.
Aspects such as self-correction, tool orchestration, coherent handling of multi-step contexts, and the ability to challenge unsound plans are probably becoming more important than pure generative capability itself.
The more autonomous these systems become, the more the bottleneck shifts away from writing code and toward supervision, input quality, and real understanding of the application domain.