Using the bulk data

The datasets published by OpenSanctions are made available in multiple formats, suitable for different purposes.

Please also refer to the entity structure definition and the data dictionary. Advanced users may want to learn about the statement-based data model.

Data formats

Bulk data is made available in the following formats for each data source and collection we maintain. Besides these export formats, it's also worth understanding the dataset metadata that describes the contents and update cycle of each source.

- FollowTheMoney-based JSON (recommended): the native format for OpenSanctions data is a graph of JSON objects.
  - FollowTheMoney Delta Updates provide version-to-version updates of the FollowTheMoney entities in the database.
- Simplified CSV (comma-separated values) tables: useful for working with the data in spreadsheet programs like Excel.
- Names-only text files: useful for very simple cross-referencing and text search.
- Statement-based CSV exports: identify the source, language and freshness of each claim (property value) about each entity.
- Securities-centric CSV exports: a custom export of information about public companies and the financial instruments they issue.
- Senzing entity data: data exports generated for use with the Senzing entity resolution engine.
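
As a rough illustration of working with the recommended JSON format, the sketch below reads entities and builds a simple name lookup, similar in spirit to the cross-referencing use case of the names-only files. It assumes the export contains one JSON object per line, each with `id`, `schema` and a `properties` mapping of multi-valued lists; check the entity structure definition for the authoritative layout. The sample records and the helper names (`iter_entities`, `build_name_index`) are hypothetical and used only for illustration.

```python
import io
import json
from typing import Dict, Iterable, Iterator, List

# Hypothetical sample standing in for a downloaded export file.
# Assumption: one entity per line, with "id", "schema" and "properties" keys.
SAMPLE = "\n".join([
    json.dumps({
        "id": "ex-1",
        "schema": "Person",
        "properties": {"name": ["Jane Doe", "J. Doe"], "country": ["us"]},
    }),
    json.dumps({
        "id": "ex-2",
        "schema": "Company",
        "properties": {"name": ["Example Holdings Ltd"]},
    }),
])


def iter_entities(lines: Iterable[str]) -> Iterator[dict]:
    """Yield one parsed entity per non-empty line."""
    for line in lines:
        line = line.strip()
        if line:
            yield json.loads(line)


def build_name_index(entities: Iterable[dict]) -> Dict[str, List[str]]:
    """Map each case-folded name to the IDs of the entities that carry it."""
    index: Dict[str, List[str]] = {}
    for entity in entities:
        for name in entity.get("properties", {}).get("name", []):
            index.setdefault(name.casefold(), []).append(entity["id"])
    return index


# With a real file, replace io.StringIO(SAMPLE) with open(path, encoding="utf-8").
index = build_name_index(iter_entities(io.StringIO(SAMPLE)))
matches = index.get("Jane Doe".casefold(), [])
print(matches)
```

Exact, case-insensitive lookup like this only catches identical spellings; it is a starting point for simple cross-referencing, not a substitute for proper name matching.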