28 seeds, one corroborated lead: an Epstein-network investigation in public data
What an entity-resolution pipeline finds (and misses) when pointed at 28 publicly-sourced seeds from the Epstein corporate-network reporting.
About
Software engineer building open-source tools that make data quality accessible to every team.
I created the Golden Suite — a collection of Python libraries for checking, transforming, matching, and orchestrating data. Each tool works standalone or as part of the pipeline.
Open Source Data Tools
Building tools that check, transform, match, and map data. All open source. All production-grade.
What an entity-resolution pipeline finds (and misses) when pointed at 28 publicly-sourced seeds from the Epstein corporate-network reporting.
A 9-member ICIJ cluster, 100% GLEIF-anchored, walked from source rows through GoldenMatch dedupe to a finished provenance report.
Ingesting ICIJ + GLEIF + OpenSanctions + UK PSC into one unified company table, then deduping it with GoldenMatch on a 24-vCPU Railway service.