Contents

Downloads

Links to the most recent files we've exported:

What is this data?

Each file contains a complete snapshot of all of the data in this register. That includes data from:

The raw data from each of these sources has been imported, the companies reconciled with OpenCorporates and - where possible - de-duplicated based on OpenCorporates results and our own de-duplication algorithms. It has also been periodically refreshed with reconciliations with OpenCorporates database to collect company name changes, dissolutions, etc.

The data is formatted in version 0.2 of Open Ownership's Beneficial Ownership Data Standard (BODS). All source data has been standardised on export, attempting to keep as much source-specific information as possible, but prioritising consistency across sources. Data from individual source registries mapped to v0.2 can be downloaded and reused in a variety of formats via our BODS data analysis tools.

We have extensive documentation on the standard, in particular we recommend understanding the key concepts in the data model and then referring to the specifics using the interactive schema browser.

The file is in JSONLines format, meaning each BODS statement is a JSON object on a single line of the file. It has been compressed with Gzip, which is usually already available on MacOS and Linux systems, and is easily readable with a variety of compression programs on Windows.

Our regular data updates at least every month, and we will make a new export after each update. This page will be updated as soon as that happens, but you can also rely on the "latest" link at the top of the page to always point to the most recent data.

What is this data not?

100% accurate, up-to-date, or lawful for every use case. As per our terms and conditions we cannot guarantee the accuracy of the data, nor that your use of it is lawful. Due to the nature of our periodic update and refresh process, we also can't guarantee the freshness of the data. Refer to the information quality section of our terms for more details.

A comprehensive dataset of companies in the jurisdictions we have data for. It only contains those people and companies who've declared beneficial ownership information. For example, the UK PSC and DK CVR registers are broadly intended to cover all companies in their respective countries, but there will always be companies who don't comply. In addition, some jurisdictions (e.g. Slovakia) only require declarations from specific sectors or types of companies.

Data on company directors, officials or shareholders. Whilst individuals in these roles may also be beneficial owners, the inverse is not always true. The people in our data are only those who meet the definition of a beneficial owner in their respective jurisdiction. There are often thresholds of shareholding (for example) which must be exceeded to qualify. Conversely, shareholding is not the only way one becomes a beneficial owner.

The Register is a work-in-progress and there are a number of issues that we are working to resolve. Please check the list of issues on Github, report any additional issues or visit our project board.

How do I use it?

While we suggest you read the Beneficial Ownership Data Standard documentation, here's a very brief overview of how the data is structured and how we suggest you can use it:

Format & Structure

As mentioned above, the file is a single JSONLines format file, with one JSON statement per line. Lines are separated by Unix line endings (\n). The file has been compressed with Gzip, so expect the uncompressed version to be roughly 10x bigger than the compressed one.

Each statement concerns a person, entity or relationship between them. They are in order such that, if you process the file sequentially from first line to last, you will see the people or entities involved in any relationships before you see the relationships. Note that this means if you process the file in a parallel or distributed fashion, you will need a means of revisiting statements which you see out of order.

Example data

Find example data in the BODS documentation.

Identifying and linking entities

The primary identifier in BODS is the statementID. This uniquely identifies a particular statement about a person, legal entity or relationship between them. You may also want to link our data to other datasets that have company numbers or other identifiers in them.

The identifiers field is the way you can do this.

The BODS schema allows for an array of identifiers to help users of the data link people and companies to other data sources. We make use of this feature to provide several of our own identifiers as well as transferring as much information from the original source data as we can, so that you can link the records back to their original data.

Here are some examples of identifiers you might see and an explanation of how to interpret them:

An official identifier using an org-id scheme

1
2
3
4
5
{
  "scheme": "GB-COH",
  "schemeName": "Companies House",
  "id": "0123456"
}

We will give these identifiers when we're sure that the id we have is from the org-id.guide scheme given. For example, we'll give company numbers under the GB-COH scheme for companies declaring their beneficial owners in the UK People with significant control (PSC) Register.

If entities have identifiers from a register that we matched in org-id, you should be able to uniquely identify them and connect them to other data sources. For example, you can match companies to a national register or match them to other datasets like those from the Open Contracting Partnership which use the same identifiers.

All of the identifiers we currently match to org-id schemes are for company numbers. Org-id has many other identifier schemes available, such as for charities or educational institutions, however none of our data sources have verified data which we can link to them, so we do not use them. Similarly, BODS allows for schemes to declare official identifiers for people, such as tax IDs or passport numbers, but none of our data sources report these either.

Unofficial identifiers

1
2
3
4
{
  "schemeName": "GB Persons Of Significant Control Register",
  "id": "0123456"
}
1
2
3
4
{
  "schemeName": "GB Persons Of Significant Control Register",
  "id": "/company/0123456/persons-with-significant-control/individual/hijklmn12343"
}
1
2
3
4
{
  "schemeName": "UA Edinyy Derzhavnyj Reestr",
  "id": "12345-Test Person"
}

In cases where we can't give an official org-id scheme, we'll give an identifier with just a schemeName. These names will always start with the ISO-3166-1 alpha2 country code for the country the register is in, followed by the original language name for the register. For example: UK People with significant control (PSC) Register.

The intention is that these identifiers can be used to uniquely identify a person/company record within that data source, though not necessarily outside of it. Likewise, they will identify the record but may not uniquely identify the entity. For example, for beneficial owners from the UK Persons with significant control Register we give the value of the 'self link', a path-like string which uniquely identifies a beneficial owner record, but doesn't identify a specific person or company across records.

The values in the id field will vary depending on the source. As shown above, we use whatever field(s) in the source can uniquely identify the record. In some cases this means combining multiple fields (with a hyphen -). Currently, the fields are:

  • DK Centrale Virksomhedsregister: enhedsNummer
  • SK Register Partnerov Verejného Sektora: KonecniUzivateliaVyhod.Id
  • GB Persons Of Significant Control Register: beneficial owner "self links"
  • GB Persons Of Significant Control Register - Registration numbers: beneficial owner "registration numbers". i.e. unverified company registration numbers.

An OpenCorporates identifier

1
2
3
4
5
{
  "schemeName": "OpenCorporates",
  "id": "https://opencorporates.com/companies/gb/0123456",
  "uri": "https://opencorporates.com/companies/gb/0123456"
}

For companies we've successfully matched to OpenCorporates, we include the url to that company on opencorporates.com. Note that in some rare occasions we will have matched a company to more than one OpenCorporates record. For completeness, we're including the same value in both the id and the uri fields.

A register identifier

1
2
3
4
5
{
  "schemeName": "OpenOwnership Register",
  "id": "https://register.openownership.org/entities/abcdefg12345",
  "uri": "https://register.openownership.org/entities/abcdefg12345"
}

Every single known person and company will have an identifier that gives their URL on the register. Unknown companies are declared explicitly in BODS, but don't have a corresponding page in the register so we cannot give a URL for them. Again the value is in both the id and uri.

License

Please see our terms and conditions.

Further information

Contact us if you have questions about using this data which aren't answered here.


Latest data

Exported: 2024-03-03

Download  

Changelog

For more information on our iterative improvements to this data, see our changelog of major changes.