LLM for Coding Benchmarks and Datasets
Benchmark/DatasetYear CreatedInstitutionTime SpanDataset SizeData FormatLLM Testing Capability
LiveCodeBench2024UC San Diego, Microsoft Research2024-ongoing (continuously updated)Several hundred problemsProgramming contest problems with test cases...
huanganni.hashnode.dev1 min read