Emulates the methods the US Census Bureau uses to link people across multiple data sources, using open-source software (Splink) and simulated data (from pseudopeople).
data-science
spark
record-linkage
entity-resolution
fuzzy-matching
dask
census-bureau
data-matching
splink
-
Updated
Jun 24, 2024 - HTML