Frequently asked questions

#165: How do I use only PEP data from the database?

Category: Data structure

The full database combines two kinds of data: individuals exposed to sanctions or similar risk indicators, and a global dataset of political office-holders (Politically Exposed Persons, PEPs).

If PEP data is the only part of our database relevant to your use case, it may be tempting to download the smaller data collection, which combines data from PEP-related sources only. Unfortunately, doing so yields an incomplete dataset. While the smaller collection includes a basic profile for each political office-holder, it omits highly relevant details generated through data enrichment, including:

  • Records related to relatives and close associates of PEPs.
  • Detailed profile information on PEPs, including name aliases in various alphabets, and biographic details.
  • Annotations describing the branch and level of government with which the PEP is associated.

Instead, we strongly recommend that users interested in PEP data download the default dataset and then filter it upon import, keeping only persons whose properties include the role.pep or role.rca risk topic. This "fetch everything, filter later" approach produces a much richer dataset, one that matches what you would expect PEP data to include.
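The filtering step could look roughly like the sketch below. It makes several assumptions that are not stated in this answer: that the default dataset is available as line-delimited JSON with one entity per line, that each entity has a "schema" field and a "properties" object whose "topics" key holds a list of risk topics, and that "persons" corresponds to the schema value "Person". The function names and the sample records are hypothetical and are only there for illustration. The sketch keeps a person if it carries either topic.

```python
import json
from typing import Iterable, Iterator

# Risk topics named in this FAQ entry: PEPs and their relatives/close associates.
PEP_TOPICS = {"role.pep", "role.rca"}


def is_pep_related(entity: dict) -> bool:
    """Return True for persons tagged with at least one PEP-related topic.

    Assumed record shape (not confirmed by this FAQ):
    {"id": ..., "schema": "Person", "properties": {"topics": [...]}}
    """
    topics = entity.get("properties", {}).get("topics", [])
    return entity.get("schema") == "Person" and not PEP_TOPICS.isdisjoint(topics)


def filter_pep_entities(lines: Iterable[str]) -> Iterator[dict]:
    """Parse line-delimited JSON and yield only the PEP-related persons."""
    for line in lines:
        line = line.strip()
        if not line:
            continue  # tolerate blank lines in the export
        entity = json.loads(line)
        if is_pep_related(entity):
            yield entity


# Hypothetical sample records standing in for lines of the downloaded file.
sample_lines = [
    '{"id": "p1", "schema": "Person", "properties": {"topics": ["role.pep"]}}',
    '{"id": "p2", "schema": "Person", "properties": {"topics": ["role.rca", "sanction"]}}',
    '{"id": "p3", "schema": "Person", "properties": {"topics": ["sanction"]}}',
    '{"id": "c1", "schema": "Company", "properties": {"topics": ["role.pep"]}}',
    '{"id": "p4", "schema": "Person", "properties": {}}',
]

kept_ids = [entity["id"] for entity in filter_pep_entities(sample_lines)]
```

With a real export, the same generator can be fed an open file handle instead of the sample list, so the full dataset is streamed line by line rather than loaded into memory at once.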
