458. Having good data on the location of profit and economic activity of multinational enterprises (MNEs) is key to assessing the implications of international corporate tax reforms, such as the Pillar One and Pillar Two proposals currently being discussed by the OECD/G20 Inclusive Framework on Base Erosion and Profit Shifting (BEPS). However, while a range of data sources provide valuable insights on the profit and activities of MNEs, no existing data source is sufficiently comprehensive in its geographic coverage and in terms of variables available to be used in isolation for a comprehensive reform impact assessment covering all 137 jurisdictions in the OECD/G20 Inclusive Framework on BEPS (Inclusive Framework).
459. Against this background, the OECD Secretariat has undertaken to combine a range of existing data sources into a consistent framework, which serves as a central instrument supporting the impact assessment analysis in this report. The framework consists of a set of four matrices: a profit matrix, focusing on the location of the profit of MNEs across jurisdictions, and three matrices focusing on indicators of the economic activity of MNEs (turnover, tangible assets and payroll). Each matrix contains data spanning more than 200 jurisdictions (each jurisdiction corresponding to a matrix row) and broken down across more than 200 jurisdictions of ultimate parent of the MNE considered (each jurisdiction of ultimate parent being a matrix column). Each matrix therefore takes the form of a square table with more than 200 rows and more than 200 columns. For example, the France-United States cell in the profit matrix would represent the profit of US MNEs (i.e. MNEs with an ultimate parent in the United States) in France.
460. The matrices combine data from a range of sources, and build on earlier efforts to map the profit and activity of MNEs for the analysis of profit shifting (Tørsløv, Wier and Zucman, 2018[1]) and the study of global value chains (GVCs) (Cadestin et al., 2018[2]). A primary source of data used in the matrices is the newly available anonymised and aggregated Country-by-Country Report (CbCR) data,1 which have been collected as a result of the implementation of the 2015 BEPS Action Plan and were published for the first time by the OECD in July 2020 (OECD, 2020[3]). Other sources include the ORBIS database of firm-level financial accounts (in jurisdictions where ORBIS coverage is good), the OECD AMNE database (which includes data from the Eurostat FATS database and from the US Bureau of Economic Analysis) and the OECD Analytical AMNE database (Cadestin et al., 2018[2]), which builds upon and complements the OECD AMNE database. The data considered focus essentially on year 2016, which is the latest available year across all the data sources used.
461. These various data sources complement each other as they have different geographic coverage and include different variables, meaning that the combined dataset is richer than any data source taken individually. These sources also have substantial overlap in their coverage. This overlap is used to benchmark sources against each other, in order to address the limitations of individual data sources and ensure the overall consistency of the approach, as further discussed below. The methodology aims to make data across the four matrices as comparable as possible, so as to enable the joint use of the matrices (e.g. using simultaneously the profit and turnover matrices to compute average profitability). To this end, efforts have been made to rely as much as possible on comparable data sources for the same cell across the different matrices. For example, if a cell is filled with CbCR data in the profit matrix (e.g. profit of US MNEs in France), the aim has been to use CbCR data to fill the corresponding cell in the other matrices (e.g. turnover of US MNEs in France).
462. In matrix cells where no source of ‘hard’ data is available, estimates are based on extrapolations relying on macroeconomic data (e.g. FDI data, GDP, GDP per capita). The extrapolation methodology builds on the information contained in the matrix cells filled with hard data, which aims to ensure consistency within each matrix. The extrapolation methodology is also designed to make the data across the four matrices as comparable with each other as possible. For example, extrapolations in the tangible assets and payroll matrices are based on data from the turnover matrix.
463. Among the four variables considered in the matrices, profit is arguably the most difficult to extrapolate when it is not observed in hard data, because the profit of MNEs may not always be located in the same jurisdiction as their economic activity. To overcome this issue, a sophisticated extrapolation methodology based on foreign direct investment (FDI) data has been developed. This methodology, inspired by Damgaard and Elkjaer (2017[4]) and Casella (2019[5]), involves various steps to identify the ultimate foreign investor into a jurisdiction, based on successive iterations on the existing data on ‘immediate’ foreign investors, and to eliminate ‘pass-through FDI’ from the data. One of the intermediate outputs of this procedure is a full matrix of FDI by jurisdiction of ultimate investor, which is interesting in its own right.
464. Overall, the various extrapolations ensure that all of the cells in the matrices can be filled, which makes it much easier to use the matrices for economic analysis. Extrapolated data are more fragile than hard data, but extrapolations represent a moderate share of the total amounts in the matrices (on average 25% across the four matrices), meaning that the information in the matrices is based predominantly on hard data. There are important geographic differences in the share of extrapolated values. This share is relatively low in high-income jurisdictions, higher in middle-income jurisdictions, and very high in low-income jurisdictions.2 In investment hubs, the share of extrapolated values, while substantial (e.g. close to 40% in the profit matrix), is much lower than it would have been in the absence of the CbCR data, highlighting the importance of CbCR as a key new source of data on the amount of profit in investment hubs.
465. The various data sources mobilised to build the matrices have limitations, as is the case for any source of economic data. More specifically, CbCR data on profit have issues related to ambiguities in the treatment of intra-company dividends as well as ‘stateless’ entities, these ambiguities being related to the fact that 2016 was the first year in which the data was collected (OECD, 2020[3]). This may give rise to cases of double-counting of profit and revenues.3 Another limitation of the data sources is that ORBIS unconsolidated account data has uneven coverage across jurisdictions. Reflecting this, ORBIS is used to fill the matrices only in jurisdictions where coverage is deemed sufficiently good, but even in these jurisdictions, coverage is not always exhaustive. A limitation of the OECD Analytical AMNE database is that some values are based on imputations and alternative sources to fill coverage gaps in the underlying data (Cadestin et al., 2018[2]). Finally, a limitation of the OECD AMNE database is that it does not include the financial sector in its data on inward investment in European jurisdictions.
466. To assess the implications of these limitations, improve data quality and ensure consistency across the various data used in the matrices,4 extensive benchmarking and quality checks have been undertaken in this chapter. The benchmarking primarily takes advantage of the fact that, in many matrix cells, several data sources are simultaneously available, making it possible to assess their consistency. Data in the matrices have also been cross-checked against other relevant sources, including tax or financial account data shared with the OECD Secretariat by jurisdiction representatives. Overall, the consistency checks reveal some inconsistencies, but suggest good overall data comparability across sources. For example, the correlation between CbCR data and estimates based on ORBIS, computed across the matrix cells where both of these sources are available, exceeds 90% in the profit and turnover matrices, and the correlation of estimates based on extrapolations with those from hard data ranges between 64% and 96% across the four matrices and the various hard data sources considered.
467. The four matrices have been used extensively by the OECD Secretariat in its assessment of the estimated effect of Pillar One and Pillar Two on tax revenues (Chapters 2 and 3 of this report) and MNE investment behaviour (Chapter 4). In the case of Pillar One, the profit and turnover matrices were used primarily to assess the location of the residual profit of MNE groups (in the form of a ‘residual profit matrix’), so as to identify jurisdictions that would provide double tax relief, i.e. from which residual profit would be reallocated under Pillar One (see Chapter 2).
468. In the case of Pillar Two, the profit matrix was used, in combination with data on effective tax rates, to assess the amount and the location of the ‘low-taxed’ profit of MNEs (i.e. profit that is currently taxed at an effective rate below the potential minimum tax rate). The profit and turnover matrices have also been used to assess the extent of MNE profit shifting and how profit shifting could be reduced by the introduction of Pillar Two (the tangible assets and payroll matrices have also been used instead of the turnover matrix for the purpose of robustness checks), and, in turn, how this could affect tax revenues across jurisdictions (see Chapter 3). In addition, the turnover matrix has been used to proxy where some of the revenues from the minimum tax would accrue (here as well, the tangible assets and payroll matrices have been used for the purpose of robustness checks).5 Finally, the tangible assets and payroll matrices were used to model the implications of potential ‘carve-outs’ to the minimum tax based on economic substance.6
469. In the investment impact analysis, the matrices were used to calibrate the framework used to assess the impact of Pillar One and Pillar Two on forward-looking effective tax rates (see Chapter 4 and Hanappi and González Cabral (2020[6])).
470. This chapter contains a preliminary version of the four matrices, presented at a certain level of aggregation (i.e. by income groups and broad geographic regions). After extensive consultation with members of the Inclusive Framework, there was no consensus over whether or not jurisdiction-specific data in the four matrices should be publicly released as part of the economic impact assessment. In view of this lack of consensus, no jurisdiction-specific data are included in this chapter.
471. Looking ahead, the matrices presented in this chapter could be used in the future for a range of other purposes, in the area of tax policy analysis and beyond, as discussed in the conclusion of this chapter.