Open source software components

OpenSanctions releases both its data pipeline and entity matching service as free software that can be extended, adapted and verified by its users.

We're making available the following software components for anyone's use under MIT license terms:

  • yente - an entity matching API that can be used to run queries against the data produced by OpenSanctions. It is used to run self-hosted screening applications.
  • zavod - the toolkit used to run the OpenSanctions data pipeline, which manages how we ingest, normalise, and publish data repeatably from many varied sources.
  • nomenklatura - the data integration system used to store entity data with full lineage and to integrate entity profiles across multiple sources.
  • We also make available the code for all of our data crawlers in our main GitHub repository. Please note our FAQ item on replicating the OpenSanctions data from source.