This is very insightful to get a better understanding of the underlying model. It would be interesting to try this with ChatGLM, it's a Chinese (yet bilingual) model. It would be interesting to see if there are any differences based on what country the LLM is made in. Especially since a lot of websites are banned in China. Although, I'm not sure if that affected the data collection process.