SantaCoder: don't reach for the stars!
Compact code LLMs and dataset stewardship Framing and aims At first glance the project sets out a pragmatic ambition: build a compact, effective code model and carefully vet the training corpus. The team focused on the Santa models trained at 1.1B pa...
paperium.hashnode.dev4 min read