Writing smaller code snippets would not be the proper evaluation. The best way to evaluate would be do a hobby project using Co-pilot. I saw some articles where they used Copilot in projects and their impression was not good at all. It's still in very early stages so the performance is understandable. The so called 'perfect pair programmer' is more distant than you expect.