Scraping 10k YouTube transcripts for LLM training data
I was building a training set from AI podcast transcripts and hit the YouTube Data API's quota of 10,000 units per day almost immediately. Here is how I worked around it.
The problem
YouTube Data API v3 has a 10,000 unit daily quota per project. A transcript pull costs about 1-3 u...
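As a rough sketch of the quota arithmetic: the 10,000-unit daily quota is the figure quoted above, but the per-transcript cost is left as a parameter, since the exact number depends on which API calls you make. The helper name `days_needed` is hypothetical and only here for illustration.

```python
import math

DAILY_QUOTA_UNITS = 10_000  # daily quota per project, as quoted above


def days_needed(num_transcripts: int, units_per_transcript: int,
                daily_quota: int = DAILY_QUOTA_UNITS) -> int:
    """Estimate how many days of quota a transcript job consumes.

    Hypothetical helper: units_per_transcript is an assumption you
    supply yourself, not a value taken from the API.
    """
    if num_transcripts < 0 or units_per_transcript <= 0 or daily_quota <= 0:
        raise ValueError("counts must be non-negative and costs positive")
    total_units = num_transcripts * units_per_transcript
    # Round up: a partially used day still counts as a day of waiting.
    return math.ceil(total_units / daily_quota)
```

Calling `days_needed(10_000, units_per_transcript=3)` shows how quickly a 10k-transcript job spills over a single day's budget once each pull costs more than one unit.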
theaientrepreneur.hashnode.dev (2 min read)