Subword Tokenization
Why ChatGPT Can Understand “rizz” But Your Model Can’t
Anuja Gadde
4 min read·Just now
The hidden reason language models either break or thrive — and it’s not what you think.
Last week, I wrote about preprocessing — cleaning HTML, handling emojis,...
subword-tokenization.hashnode.dev