RSSChangelog

See updates across the data model, metadata structure, and API of our service. Breaking changes that require updates to data consumer applications are announced prior to their implementation.

Filter by components: Hosted API (3) · yente (4) · Bulk data delivery (2) · Data model (9) · Export formats (6) · Datasets (4) · Metadata (1)

#26 Migrating Chinese Unified Social Credit Identifiers (USCI) to a custom property

Effective date:
Components affected:Data modelExport formats
Announcement:

All LegalEntities in the data model now have a uscCode property, which is designed to store Unified Social Credit Identifiers for Chinese companies and individuals. In order to ease adoption of this new property, we are currently emitting USCI in both the new uscCode property, and in the more generic registrationNumber property that has been available previously. As of December 1, 2025, we will stop including USCI in registrationNumber, and make the identifiers available in the more specific field only.

#27 Removing unused source identifiers in `referents`

Effective date:
Components affected:Data modelExport formats
Announcement:

Starting October 15, 2025, we are planning to remove out-of-use entity identifiers from the referents list after a grace period. At present, the referents associated with entities include all IDs of source entities that have been used to describe the entity in the past, including those no longer present in our dataset. This has lead to over 100,000 entity IDs mentioned in referents which are no longer associated with any source data. In the future, we plan to remove ("garbage collect") unused referents after ca. 6 months.

Users of the data may need to update monitoring systems in which referents is used to suppress repeat alarms for a matched entity in a screening system. When an entity ID is changed and moved to referents, consumer systems should update their internal reference to the main (canonical) entity ID when drift is observed.

#28 New program_ids field in targets.simple.csv

Effective date:took effect on
Components affected:Export formats
Announcement:

The targets.simple.csv bulk data format now contains a new program_ids column that contains a list of program IDs that an entity is sanctioned under. This is part of our broader effort to link entities to sanctions policy programs under which they’ve been designated. More information on what sanctions programs are and how we represent them in our data model can be found in our FAQ. For more context, see our recently-published blog post.

#22 Case-Insensitive Name/Alias Collapsing

Effective date:took effect on
Components affected:Export formats
Announcement:

We are starting deduplicate name variants that differ only in letter-case. If an entity lists several names whose only variation is capitalization (e.g. VLADIMIR PUTIN, Vladimir Putin, vladimir putin), we will keep just the variant closest to title-case (Vladimir Putin) and omit the others.

Why? Reduces noise and file size, and makes downstream matching more intuitive. Scope affected: all exports (CSV, JSON, Senzing) and API responses; you will see fewer aliases. Compatibility: no schema changes, only redundant values are removed. If you relied on the presence of case-only variants, review your unit tests.

We are also removing some invalid name values from the dataset, including names that consist of single-character, non-letter names.

#18 Notice: do not use the `all` dataset

Effective date:took effect on
Components affected:DatasetsExport formats
Announcement:

We'd like to strongly advise all customers that are using the all dataset to use default instead.

The all dataset is an internal artifact of our data infrastructure. Other than having a pleasing name, it is a strictly inferior data product. As of February 2025, we've reduced the update frequency of all to monthly updates - meaning that users of the all scope will use outdated data in their systems.

all also includes data meant for internal verification/testing purposes which should not be included in production systems (unless you have a regulatory requirement to screen for 1990 "Die Hard" movie villain, John Gruber).

#13 Generating target files from relevant topics

Effective date:took effect on
Components affected:Data modelExport formats
Announcement:

We're phasing out the use of the target flag throughout the system, and switching the export formats that are based on target to use a defined list of topics as their source of truth.

A binary flag (target) is an insufficient method to describe what entities are associated with risk. For the past few months, we've been recommending the use of topics to decide if a match is relevant (e.g. as a PEP, sanctioned entity). However, some export formats - such as targets.nested.json and targets.simple.csv are still using targets to decide which entities to include.

On January 15, we will switch these two export formats (targets.nested.json and targets.simple.csv) to include any entities tagged with one the topics listed below. This is guaranteed to include all current targets, but will bring in additional entities that have topics assigned, but are not marked as targets. In short: the new exports will be more correct, and a bit larger.

This will result in the targets.nested.json export of the default dataset becoming equivalent to the topics.nested.json export of the same collection. This export can be used for testing until the change becomes effective on January 15, 2025. We will eventually remove the topics.nested.json export format on February 15, 2025, and only generated the file named targets.nested.json going forward.

Topics included in new target definition:

  • corp.disqual
  • crime.boss
  • crime.fin
  • crime.fraud
  • crime.terror
  • crime.theft
  • crime.traffick
  • crime.war
  • crime
  • debarment
  • export.control
  • export.risk
  • poi
  • reg.action
  • reg.warn
  • role.oligarch
  • role.pep
  • role.rca
  • sanction.counter
  • sanction.linked
  • sanction
  • wanted