Open Source Data Tools

Ben Severn

Building tools that check, transform, match, and map data. All open source. All production-grade.

GitHub Stars
Monthly Downloads
1,500
Tests Passing

Projects

5 packages on PyPI

GoldenCheck

Validate & profile data quality

GoldenFlow

Transform & standardize data

GoldenMatch

Deduplicate & match records

GoldenPipe

Orchestrate the full pipeline

infermap

Map messy columns to target schemas

Pipeline

01 GoldenCheck: Scan
02 GoldenFlow: Transform
03 GoldenMatch: Deduplicate

GoldenPipe orchestrates the full pipeline.
infermap maps schemas before data enters.
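
The flow above can be illustrated with a minimal sketch in plain Python. It does not use the packages themselves: every function name here (`map_columns`, `scan`, `transform`, `deduplicate`, `run_pipeline`) is a hypothetical stand-in, and the fuzzy column matching via the standard library's `difflib` is an assumption chosen for illustration, not how infermap is known to work.

```python
from difflib import get_close_matches

# Hypothetical stand-ins for each stage; none of these are the packages' real APIs.

def map_columns(rows, target_fields):
    """Schema mapping: rename each source column to its closest target field."""
    def rename(col):
        match = get_close_matches(col.strip().lower(), target_fields, n=1, cutoff=0.6)
        return match[0] if match else col
    return [{rename(k): v for k, v in row.items()} for row in rows]

def scan(rows, required):
    """Scan: list (row index, field) pairs where a required field is missing or blank."""
    return [(i, f) for i, row in enumerate(rows) for f in required
            if not str(row.get(f, "")).strip()]

def transform(rows):
    """Transform: trim whitespace and lowercase every string value."""
    return [{k: v.strip().lower() if isinstance(v, str) else v for k, v in row.items()}
            for row in rows]

def deduplicate(rows, key):
    """Deduplicate: keep the first row seen for each value of the key field."""
    seen, unique = set(), []
    for row in rows:
        if row[key] not in seen:
            seen.add(row[key])
            unique.append(row)
    return unique

def run_pipeline(rows, target_fields, key):
    """Orchestrate: map columns, scan, transform, then deduplicate."""
    mapped = map_columns(rows, target_fields)
    issues = scan(mapped, target_fields)
    clean = deduplicate(transform(mapped), key)
    return clean, issues

raw = [
    {"Email ": "Ann@Example.com ", "Name": "Ann"},
    {"Email ": "ann@example.com", "Name": "Ann"},
    {"Email ": "bob@example.com", "Name": ""},
]
records, issues = run_pipeline(raw, ["email", "name"], key="email")
```

Here the scan step only reports problems rather than dropping rows, so the caller decides what to do with flagged records. Exact-match deduplication after normalization is the simplest possible choice; real record matching typically needs fuzzier comparison.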