The Billion-Row Problem: How Big Companies Find Duplicate Customers Using ML (Record Linkage ML Pipeline)
Feb 20 · 17 min read · Here is a scenario that should make any data engineer sweat a little. :) A large telecom company runs a loyalty program. Over a decade of mergers, app migrations, and manual data entry, their customer
Join discussion

