This is great. After changing one model, consider testing different models with the following criteria:
cost, speed of response, quality of response, match with intended use case.
For example, Qwen coder might be better at coding. Additionally, the example chat application may not be able to handle some models due to the differences in the response. It would be great to hear about these. For example, I tested some reasoning models and they didn't work.