Tools

Reference index for all five Golden Suite tools: GoldenCheck, GoldenMatch, GoldenPipe, GoldenFlow, and infermap. Covers how each fits a full pipeline.

The Golden Suite provides five tools that cover the full data management lifecycle, from profiling raw data to producing golden records.

Note: These five packages are the engine. bensevern.dev is the hosted funnel that wraps them — connectors on the way in, a review queue in the middle, and destination push on the way out. The packages are MIT-licensed on PyPI, so you can self-host the engine and skip the SaaS, or let the funnel run them for you. This section documents the engine; the Workbench and API sections cover the hosted layer.

Tool Overview

| Tool        | Category      | What it does                               |
|-------------|---------------|--------------------------------------------|
| GoldenCheck | Quality       | Profile and validate data quality          |
| GoldenMatch | Matching      | Probabilistic record linkage and dedup     |
| GoldenPipe  | Transform     | Data transformation pipelines              |
| GoldenFlow  | Orchestration | Workflow orchestration and scheduling      |
| infermap    | Schema        | Automatic schema mapping                   |

How They Work Together

A typical workflow: GoldenCheck profiles your source data → infermap maps columns to your target schema → GoldenPipe transforms and cleans the data → GoldenMatch deduplicates and links records. GoldenFlow orchestrates and schedules the entire pipeline.
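The ordering above can be sketched in plain Python. This is only an illustration of how the stages hand data to each other: every function here (`profile`, `map_schema`, `transform`, `deduplicate`, `run_pipeline`) is a hypothetical stand-in written for this example, not the actual API of any Golden Suite package, and the column mapping and sample records are made up.

```python
# Hypothetical stand-ins for each tool's role. None of these functions
# are the real Golden Suite APIs; they only illustrate the stage order.


def profile(records: list) -> list:
    """GoldenCheck's role: inspect quality. Here: count empty values."""
    empty = sum(1 for r in records for v in r.values() if not v)
    print(f"profile: {len(records)} records, {empty} empty values")
    return records


def map_schema(records: list) -> list:
    """infermap's role: rename source columns to the target schema."""
    mapping = {"Full Name": "name", "E-mail": "email"}  # assumed mapping
    return [{mapping.get(k, k): v for k, v in r.items()} for r in records]


def transform(records: list) -> list:
    """GoldenPipe's role: clean values. Here: trim and lowercase."""
    return [{k: v.strip().lower() for k, v in r.items()} for r in records]


def deduplicate(records: list) -> list:
    """GoldenMatch's role: link duplicates. Here: exact match on email,
    which is far simpler than probabilistic record linkage."""
    seen = {}
    for r in records:
        seen.setdefault(r["email"], r)  # keep the first record per email
    return list(seen.values())


def run_pipeline(records: list, stages: list) -> list:
    """GoldenFlow's role: run the stages in order."""
    for stage in stages:
        records = stage(records)
    return records


STAGES = [profile, map_schema, transform, deduplicate]

raw = [
    {"Full Name": "Ada Lovelace ", "E-mail": "ADA@example.com"},
    {"Full Name": "ada lovelace", "E-mail": "ada@example.com "},
    {"Full Name": "Alan Turing", "E-mail": "alan@example.com"},
]
golden = run_pipeline(raw, STAGES)
```

Profiling runs first so it sees the data as it arrived, and deduplication runs last so it compares values that have already been mapped and cleaned.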

Inside the hosted funnel, this same sequence sits between your sources (the connectors that pull SaaS data in) and your destinations (the warehouse or cloud targets that the golden records are pushed to). If you self-host the engine, you wire up ingest and push yourself; the SaaS does both for you.
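When self-hosting, the ingest and push ends might look roughly like the sketch below, which uses only the Python standard library. The CSV source, the SQLite destination, the `golden_records` table, and the `run_engine` placeholder are all assumptions made for illustration; your real sources and destinations will differ, and `run_engine` stands in for whatever combination of the packages you run.

```python
import csv
import io
import sqlite3


def ingest(csv_text: str) -> list:
    """Ingest side: read rows from a CSV export into dicts."""
    return list(csv.DictReader(io.StringIO(csv_text)))


def run_engine(records: list) -> list:
    """Placeholder for the engine stages (hypothetical, does nothing here)."""
    return records


def push(records: list, conn: sqlite3.Connection) -> int:
    """Push side: write golden records to a destination table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS golden_records (name TEXT, email TEXT)"
    )
    conn.executemany(
        "INSERT INTO golden_records (name, email) VALUES (:name, :email)",
        records,
    )
    conn.commit()
    return conn.execute("SELECT COUNT(*) FROM golden_records").fetchone()[0]


source = "name,email\nada lovelace,ada@example.com\nalan turing,alan@example.com\n"
conn = sqlite3.connect(":memory:")  # stand-in for a warehouse target
row_count = push(run_engine(ingest(source)), conn)
```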
