Tools
Reference index for all five Golden Suite tools: GoldenCheck, GoldenMatch, GoldenPipe, GoldenFlow, and infermap. Covers how each fits a full pipeline.
The Golden Suite provides 5 tools that cover the full data management lifecycle — from profiling raw data to producing golden records.
Note: These five packages are the engine. bensevern.dev is the hosted funnel that wraps them — connectors on the way in, a review queue in the middle, and destination push on the way out. The packages are MIT-licensed on PyPI, so you can self-host the engine and skip the SaaS, or let the funnel run them for you. This section documents the engine; the Workbench and API sections cover the hosted layer.
Tool Overview
| Tool | Category | What it does |
|---|---|---|
| GoldenCheck | Quality | Profile and validate data quality |
| GoldenMatch | Matching | Probabilistic record linkage and dedup |
| GoldenPipe | Transform | Data transformation pipelines |
| GoldenFlow | Orchestration | Workflow orchestration and scheduling |
| infermap | Schema | Automatic schema mapping |
How They Work Together
A typical workflow: GoldenCheck profiles your source data → infermap maps columns to your target schema → GoldenPipe transforms and cleans the data → GoldenMatch deduplicates and links records → GoldenFlow orchestrates the entire pipeline.
Inside the hosted funnel, this same sequence sits between your sources (the connectors that pull SaaS data in) and your destinations (the warehouse / cloud targets the golden records get pushed to). Self-hosting the engine, you wire that ingest + push yourself; the SaaS does it for you.