Getting Started

Install the Golden Suite and run your first dedup in under 5 minutes.

The Golden Suite is a collection of open-source Python tools for data quality, matching, transformation, and schema mapping.

What's in the Suite

ToolPurpose
GoldenMatchProbabilistic record linkage and deduplication
GoldenPipeData transformation pipelines
GoldenCheckData quality profiling and validation
GoldenFlowWorkflow orchestration
infermapAutomatic schema mapping

Next Steps

Start with Installation to set up the tools, then follow the Quickstart to run your first dedup.