Download our data
Contents
Downloads
Links to the most recent files we've exported:
What is this data?
Each file contains a complete snapshot of all of the data in this register. That includes data from:
- UK People with significant control (PSC) Register (regularly updated)
- Denmark Central Business Register (CVR) (regularly updated)
- Slovakia Public Sector Partners Register (regularly updated)
The raw data from each of these sources has been imported, the companies reconciled with OpenCorporates and - where possible - de-duplicated based on OpenCorporates results and our own de-duplication algorithms. It has also been periodically refreshed with reconciliations with OpenCorporates database to collect company name changes, dissolutions, etc.
The data is formatted in version 0.2 of Open Ownership's Beneficial Ownership Data Standard (BODS). All source data has been standardised on export, attempting to keep as much source-specific information as possible, but prioritising consistency across sources. Data from individual source registries mapped to v0.2 can be downloaded and reused in a variety of formats via our BODS data analysis tools.
We have extensive documentation on the standard, in particular we recommend understanding the key concepts in the data model and then referring to the specifics using the interactive schema browser.
The file is in JSONLines format,
meaning each BODS statement
is a JSON object on a
single line of the file. It has been compressed with
Gzip, which is
usually already available on MacOS and Linux systems, and is easily
readable with a variety of compression programs on Windows.
Our regular data updates at least every month, and we will make a new export after each update. This page will be updated as soon as that happens, but you can also rely on the "latest" link at the top of the page to always point to the most recent data.
What is this data not?
100% accurate, up-to-date, or lawful for every use case. As per our terms and conditions we cannot guarantee the accuracy of the data, nor that your use of it is lawful. Due to the nature of our periodic update and refresh process, we also can't guarantee the freshness of the data. Refer to the information quality section of our terms for more details.
A comprehensive dataset of companies in the jurisdictions we have data for. It only contains those people and companies who've declared beneficial ownership information. For example, the UK PSC and DK CVR registers are broadly intended to cover all companies in their respective countries, but there will always be companies who don't comply. In addition, some jurisdictions (e.g. Slovakia) only require declarations from specific sectors or types of companies.
Data on company directors, officials or shareholders. Whilst individuals in these roles may also be beneficial owners, the inverse is not always true. The people in our data are only those who meet the definition of a beneficial owner in their respective jurisdiction. There are often thresholds of shareholding (for example) which must be exceeded to qualify. Conversely, shareholding is not the only way one becomes a beneficial owner.
The Register is a work-in-progress and there are a number of issues that we are working to resolve. Please check the list of issues on Github, report any additional issues or visit our project board.
How do I use it?
While we suggest you read the Beneficial Ownership Data Standard documentation, here's a very brief overview of how the data is structured and how we suggest you can use it:
Format & Structure
As mentioned above, the file is a single JSONLines format file,
with one JSON statement
per line. Lines are
separated by Unix line endings (\n
). The file has been
compressed with Gzip, so expect the uncompressed version to be
roughly 10x bigger than the compressed one.
Each statement concerns a person, entity or relationship between them. They are in order such that, if you process the file sequentially from first line to last, you will see the people or entities involved in any relationships before you see the relationships. Note that this means if you process the file in a parallel or distributed fashion, you will need a means of revisiting statements which you see out of order.
Example data
Find example data in the BODS documentation.
Identifying and linking entities
The primary identifier in BODS is the statementID
. This uniquely
identifies a particular statement about a person, legal entity or
relationship between them. You may also want to link our data to other
datasets that have company numbers or other identifiers in them.
The identifiers
field is the way you can do this.
The BODS schema allows for an array of identifiers to help users of the data link people and companies to other data sources. We make use of this feature to provide several of our own identifiers as well as transferring as much information from the original source data as we can, so that you can link the records back to their original data.
Here are some examples of identifiers you might see and an explanation of how to interpret them:
An official identifier using an org-id scheme
We will give these identifiers when we're sure that the id we have is
from the org-id.guide
scheme
given. For example, we'll give
company numbers under the GB-COH
scheme for
companies declaring their beneficial owners in the UK People with
significant control (PSC) Register.
If entities have identifiers from a register that we matched in org-id, you should be able to uniquely identify them and connect them to other data sources. For example, you can match companies to a national register or match them to other datasets like those from the Open Contracting Partnership which use the same identifiers.
All of the identifiers we currently match to org-id schemes are for company numbers. Org-id has many other identifier schemes available, such as for charities or educational institutions, however none of our data sources have verified data which we can link to them, so we do not use them. Similarly, BODS allows for schemes to declare official identifiers for people, such as tax IDs or passport numbers, but none of our data sources report these either.
Unofficial identifiers
In cases where we can't give an official org-id scheme
,
we'll give an identifier with just a schemeName
.
These names will always start with the ISO-3166-1 alpha2
country code for the country the register is in, followed by the
original language name for the register. For example: UK People with
significant control (PSC) Register.
The intention is that these identifiers can be used to uniquely identify a person/company record within that data source, though not necessarily outside of it. Likewise, they will identify the record but may not uniquely identify the entity. For example, for beneficial owners from the UK Persons with significant control Register we give the value of the 'self link', a path-like string which uniquely identifies a beneficial owner record, but doesn't identify a specific person or company across records.
The values in the id
field will vary depending on the
source. As shown above, we use whatever field(s) in the source can
uniquely identify the record. In some cases this means combining
multiple fields (with a hyphen -
). Currently, the fields are:
- DK Centrale Virksomhedsregister:
enhedsNummer
- SK Register Partnerov Verejného Sektora:
KonecniUzivateliaVyhod.Id
- GB Persons Of Significant Control Register: beneficial owner "self links"
- GB Persons Of Significant Control Register - Registration numbers: beneficial owner "registration numbers". i.e. unverified company registration numbers.
An OpenCorporates identifier
For companies we've successfully matched to OpenCorporates, we
include the url to that company on opencorporates.com. Note that in
some rare occasions we will have matched a company to more than one
OpenCorporates record. For completeness, we're including the same
value in both the id
and the uri
fields.
A register identifier
Every single known person and company will have an identifier that
gives their URL on the register. Unknown companies are declared
explicitly in BODS, but don't have a corresponding page in the
register so we cannot give a URL for them. Again the value is in
both the id
and uri
.
License
Please see our terms and conditions.
Further information
Contact us if you have questions about using this data which aren't answered here.
Changelog
For more information on our iterative improvements to this data, see our changelog of major changes.