Evaluating SotA LLMs on a net-new LeetCode-style puzzle
I am sure a lot of you have seen this particular meme template. It has given rise to an entire genre of TikToks in which girls are amazed at how much calculation guys do to pick a stall in a crowded row of urinals.
I actually even made ...
arnav.tech · 20 min read
Luv Singh
Question everything
I guess one of the reasons for o1 performing better could be its better distribution of training data, especially for reasoning tasks, compared to DeepSeek (as these two are primarily reasoning models). These LLMs mostly approximate their training-data distribution; since o1 presumably has better (and more) data, I guess that's why it did well (though inherently none of them can reason like we do).