Contents

Downloads

Links to the most recent files we've exported:

What is this data?

Each file contains a complete snapshot of all of the data in this register. That includes data from:

The raw data from each of these sources has been imported, the companies reconciled with OpenCorporates and - where possible - de-duplicated based on OpenCorporates results and our own de-duplication algorithms. It has also been periodically refreshed with reconciliations with OpenCorporates database to collect company name changes, dissolutions, etc.

The data is formatted in version 0.1 of OpenOwnership's Beneficial Ownership Data Standard. All source data has been standardised on export, attempting to keep as much source-specific information as possible, but prioritising consistency across sources.

We have extensive documentation on the standard, in particular we recommend understanding the key concepts in the data model and then referring to the specifics using the interactive schema browser.

The file is in JSONLines format, meaning each BODS statement is a JSON object on a single line of the file. It has been compressed with Gzip, which is usually already available on MacOS and Linux systems, and is easily readable with a variety of compression programs on Windows.

Our regular data updates occur every three weeks, and we will endeavour to make a new export after each update. This page will be updated as soon as that happens, but you can also rely on the "latest" link at the top of the page to always point to the most recent data.

What is this data not?

100% accurate, up-to-date, or lawful for every use case. As per our terms and conditions we cannot guarantee the accuracy of the data, nor that your use of it is lawful. Due to the nature of our periodic update and refresh process, we also can't guarantee the freshness of the data. Refer to the information quality section of our terms for more details.

A comprehensive dataset of companies in the jurisdictions we have data for. It only contains those people and companies who've declared beneficial ownership information. For example, the UK PSC and DK CVR registers are broadly intended to cover all companies in their respective countries, but there will always be companies who don't comply. In addition, some jurisdictions (e.g. Slovakia) only require declarations from specific sectors or types of companies.

Data on company directors, officials or shareholders. Whilst individuals in these roles may also be beneficial owners, the inverse is not always true. The people in our data are only those who meet the definition of a beneficial owner in their respective jurisdiction. There are often thresholds of share-holding (for example) which must be exceeded to qualify. Conversely, share-holding is not the only way one becomes a beneficial owner.

A complete BODS representation of the Register's dataset. As you'll see from the documentation, BODS' schema provides fields for lots of data. We try to map the Register's data to this model as fully as possible, but there is data we don't have because it's not provided in our sources and data we haven't mapped fully yet.

In particular, we're aware of the following omissions and issues:

  • The data is in v0.1 of the data standard, not the most recent v0.2. We're working on updating this.
  • We're not making incremental updates and marking old statements via replacesStatements fields. Each update is a standalone snapshot of the entire data set, although our statementIDs should be consistent where data doesn't change.
  • We're not reporting identifiers (such as company numbers) where we don't trust them to be official in the original source. This means companies from submissions, third party collected data like the EITI pilot, or companies outside of the source's jurisdiction (e.g. non-UK companies in PSC data) may not have any identifiers.
  • Not all sources provide identifiers for people. Where they do, we include them, but often they're missing.
  • We're not marking what type of address a person's address is, as this information is not available in our sources.
  • We're not marking which specific Interests constitute beneficial ownership and which don't. We need to do more analysis on our sources to understand when it's possible to determine this.
  • We only set source fields to report provenance on ownershipOrControlStatements, not entityStatements or personStatements. This is because of how, historically, the register has stored provenance information, but we're working on it.
  • We don't set statementDate, because we don't have a dates available across our sources that is consistent with BODS' meaning of this field. We need to do more work to understand when we can report this accurately.
  • We currently report all entities as registeredEntity even when they're obviously not, such as the state which owns state-owned entities.
  • We don't have any data to be able to report placeOfBirth, placeOfResidence, pepStatus or deathDate for people, nor alternateNames for companies.

How do I use it?

While we suggest you read the Beneficial Ownership Data Standard documentation, here's a very brief overview of how the data is structured and how we suggest you can use it:

Format & Structure

As mentioned above, the file is a single JSONLines format file, with one JSON statement per line. Lines are separated by Unix line endings (\n). The file has been compressed with Gzip, so expect the uncompressed version to be roughly 10x bigger than the compressed one.

Each statement concerns a person, entity or relationship between them. They are in order such that, if you process the file sequentially from first line to last, you will see the people or entities involved in any relationships before you see the relationships. Note that this means if you process the file in a parallel or distributed fashion, you will need a means of revisiting statements which you see out of order.

Example data

An entity statement for an example company, registered with the UK's Companies House register:

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
{
  "statementID": "1dc0e987-5c57-4a1c-b3ad-61353b66a9b7",
  "statementType": "entityStatement",
  "entityType": "registeredEntity",
  "name": "EXAMPLE LTD",
  "foundingDate": "2019-10-01",
  "identifiers": [
    {
      "scheme": "GB-COH",
      "id": "0123456"
    }
  ],
  "incorporatedInJurisdiction": {
    "code": "GB",
    "name": "United Kingdom"
  },
  "addresses": [
    {
      "type": "registered",
      "address": "Example street, London, SW1A 1AA",
      "country": "GB"
    }
  ]
}

An example person who will own the example company:

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
{
  "statementID": "019a93f1-e470-42e9-957b-03559861b2e2",
  "statementType": "personStatement",
  "statementDate": "2019-10-01",
  "personType": "knownPerson",
  "nationalities": [
    {
      "code": "GB",
      "name": "United Kingdom"
    }
  ],
  "names": [
    {
      "type": "individual",
      "fullName": "Jane Smith"
    }
  ],
  "birthDate": "1973-01",
  "addresses": [
    {
      "address": "Example street, London, SW1A 1AA",
      "country": "GB"
    }
  ]
}

Their relationship (Jane owns all the shares in Example Ltd):

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
{
  "statementID": "fbfd0547-d0c6-4a00-b559-5c5e91c34f5c",
  "statementType": "ownershipOrControlStatement",
  "statementDate": "2019-10-01",
  "subject": {
    "describedByEntityStatement": "1dc0e987-5c57-4a1c-b3ad-61353b66a9b7"
  },
  "interestedParty": {
    "describedByPersonStatement": "019a93f1-e470-42e9-957b-03559861b2e2"
  },
  "interests": [
    {
      "type": "shareholding",
      "startDate": "2019-10-01",
      "share": {
        "exact": 100
      }
    }
  ]
}

License

Please see our terms and conditions.

Further information

Contact us if you have questions about using this data which aren't answered here.


Latest data

Exported: 2019-11-07

Download