Regional Entrepreneurship Capital and Firm Production

Certain theories suggest that the capacity of a region to generate new firms, called entrepreneurship capital, has positive spillover effects on the production of the firms in that region. Evidence generated with aggregated data at the regional level supports this prediction. This paper argues that, using aggregated data at the regional level, entrepreneurship capital could be correlated with regional production even if entrepreneurship capital has no spillover effects on firm production. This will not be the case when data at the firm level are used. This paper provides evidence from a sample of 11,276 Spanish firms during the 2004–2012 period. Positive spillovers are estimated in between effects models, but such spillovers are only found in technological firms when within effects models have been estimated. Thus, the regional entrepreneurship capital spillovers are unclear when data at the firm level are used. Plausible interpretations and implications are discussed.


Introduction
The Knowledge Spillover Theory of Entrepreneurship (KSTE) suggests that entrepreneurship capital in a certain region has positive spillovers that increase the production of firms in the region Keilbach 2005, 2007;Acs et al. 2009;Audretsch and Lehmann 2016). From this starting point, a growing amount of literature is estimating the effects of regional entrepreneurship capital on a region's production using aggregated data at the regional level (Audretsch and Keilbach 2004a,b,c, 2005Mueller 2006Mueller , 2007Bönte et al. 2008;Cravo et al. 2010;Stough et al. 2008;Chang et al. 2012;Hafer 2013;Laborda et al. 2011;Carree et al. 2014;Mendonça and Grimpe 2015). In practically all the cases, these studies report the positive and significant effects of regional entrepreneurship capital on regional production, which are interpreted as supporting the existence of positive spillovers.
The measurement and concept of entrepreneurship capital has been discussed, unlike the measurements of other kinds of inputs (Erikson 2002;Audretsch and Keilbach 2004a;Bönte et al. 2008). The empirical applications work with different measures that range from the stock of new firms in the region (Audretsch and Keilbach 2004a,b,c, 2005 to the entry rate of firms in key industries (Chang et al. 2012). Most of the entrepreneurship capital measures used 1 are part of (and can therefore be related to) the number of firms in the region (Henrekson and Sanandaji 2014).
This paper argues that the regional entrepreneurship capital could affect the regional aggregated production by at least two means: by affecting the number of firms in the region and/or by affecting the spillovers in firm production. The next section of this paper discusses why using data aggregated at the regional level makes it very difficult to distinguish between these two effects. Therefore, data need to be gathered at the firm level to properly test whether entrepreneurship capital has positive spillovers on a regional level that increase the production of firms in this region, as suggested by the Knowledge Spillover Theory of Entrepreneurship and other theoretical arguments such as the increase of competition suggested by Geroski (1989) or Roberts and Tybout (1996). 1 Other authors have used the annual average of new firms per 1,000 workers created in a three-year period, such as Keilbach (2004a,b, 2008). Mueller (2006Mueller ( , 2007 uses this indicator in addition to the number of new firms created in one year. Sutter and Stough (2009) use the average number of technological and innovative firms created in the last five years, while Bönte et al. (2008), Salas-Fumás and Sánchez-Asín (2013a,b) and Stough et al. (2008) use the self-employment rate on a regional level. Erken et. al (2016) use the business ownership rate (number of business owners per workforce) corrected for the level of economic development (GDP per capita) to evaluate the relation between entrepreneurship and total factor productivity. Acs et al. (2016) argues "that spatial externalities of various forms constitute serious market failures that require intervention" (p. 36). Thus, it is important to properly measure the existence and amount of such externalities to justify government intervention and the quantity of resources devoted to such an intervention. Therefore, we have collected information about the entrepreneurship capital in Spanish Autonomous Communities (NUTS 2 in accordance with Eurostat) from INE 2 . From the Spanish Technological Innovation Panel 3 , we construct an unbalanced data panel of 11,276 Spanish firms during the 2004-12 period with information about the production obtained, the capital and the labour used by each firm. As far as we know, this paper is the first to have used data at the firm level to provide evidence for the effect of spillovers of regional entrepreneurship capital on existent firms' production. In addition to a more proper measure of the regional entrepreneurship capital spillovers, data at the firm level have many other advantages over data aggregated at the regional level. For example, the data allowed us to work with an exponentially higher number of observations and to analyse which kind of firms benefits the most from regional entrepreneurship capital.
The paper is organized as follows. In Section 2, the previous literature methods are summarized and their limitations discussed. Section 3 discusses the methodological approach used in this paper and states the hypotheses. In Section 4, we present the data. Section 5 presents the estimations of firms' production functions. Section 6 concludes the paper by discussing its implications.

A main shortcoming of the previous literature
Most of the literature cited in the introduction has estimated the impact of regional entrepreneurship capital on the production for region i at period t, Yi,t. The usual method is to act in accordance with that used by Solow (1956). The regional output is obtained as a combination of the sum of inputs purchased by the firms of the region and other regional inputs. Cobb-Douglas (1928) functions are usually estimated, and the inputs considered are labour (Li,t), (physical) capital (Ki,t), regional knowledge (Ri,t) and regional entrepreneurship capital (Ei,t): ln , = ln , + ln , + ln , + ln , + , , where , captures the usual error terms. According to the study, the error terms may include time and/or regional fixed effects. Usually, except for specific cases (see Carre et al. 2014), regional cross effects are not considered. Hence, the main parameters estimated are production elasticity with respect to labour (β), capital (), regional knowledge (μ) and regional entrepreneurship capital (δ). The estimations of the elasticity of production with respect to regional knowledge and regional entrepreneurship capital are usually positive and statistically significant.
The literature has interpreted the positive relationship between regional entrepreneurship capital and regional production as evidence of the regional entrepreneurship capital spillovers on firms' production. This interpretation is not free of assumptions. To make such assumptions explicit, we will formalize our arguments. This formalization is based on the previous analyses of the limitations related to the use of regional aggregated production functions (see Fisher 1969Fisher , 2005 for further details).
In each period, a set of inhabitants of the region decide to be (or continue to be) entrepreneurs, thus determining the number of firms in the region, , . These firms will contract for a set of inputs to produce outputs in accordance with the production function described by the following equation: . If all the firms in the region have the same size ( , = , , ), it is easy to show that the regional production is , = , , , = , + , (1− −s) , , , where = 1 − − measures the returns to scale at the regional aggregated level, while the returns to scale at the firm level are + . Those returns differ when the average size of the firms in the region has spillovers on the production of the firms in this region, s≠0.
It is easy to verify that in those particular cases, when the regional aggregated level returns to scale are constant ( = 1 − − = 0) or s=-, the regional production is , = , , , = , , , , = , , , , . Taking logarithms and adding error terms, this equation is equal to Equation 1, which is the one used in the literature. In short, these are the main assumptions although only Mueller (2006) reports a test of their significance. Thus, it is difficult to accept the assumption that, at the regional aggregated level, returns to scale are constant.
Therefore, the alternative is to assume that the average size of the firms , is a regional input with an elasticity equal to = − = −(1 − − ). The problem with aggregated data at the regional level is that we cannot provide evidence about the value of s. When + > 0, the previous literature has omitted a relevant variable, the number of firms in the region. This finding is important, because it appears reasonable to expect that the measures of entrepreneurship capital For illustrative purposes, let us assume that the correlation between entrepreneurship capital and the number of firms is one, and specifically that , = , . Let us call ̂ the estimated coefficient of the entrepreneurship capital using Equation [1]. In accordance with Equation [3], ̂= + + ; therefore, the bias will depend on the presence of returns to scale and on the spillovers of the regional average size of the firms on the production of the firms in the region, s. In fact, we cannot exclude the possibility that the entrepreneurship capital does not affect the production at the firm level (=0), although there is evidence showing a positive effect on the production aggregated at the regional level (̂>0 since + > 0). This finding is an important shortcoming of the previous literature, since it casts doubt on their main interpretation of the evidence generated, that the regional entrepreneurship capital has positive spillovers on the production of the firms in the region. This bias disappears using data at the firm level to estimate Equation [2].
Data at the firm level also enables us to address new issues other than those in the cited literature.
Some of the papers revised seek to identify the kind of entrepreneurship capital that generates more spillovers. For example, Keilbach (2004a,b,c, 2008) classified entrepreneurship capital on the basis of the technological intensity of the sectors: high technology, ICTs, and other sectors. The researchers find that other sectors that are less technological generate more spillovers. In terms of geographical location, urban zones generate higher spillovers than rural ones (Audretsch and Keilbach 2005). Data at the firm level allows us to extend those analyses and estimate the elasticities of production with respect to entrepreneurship capital for different groups of firms. Consequently, we can identify the groups of firms that benefit the most from entrepreneurship capital spillovers (i.e., which receive more externalities). The KSTE justifies those spillovers by the role of entrepreneurship capital in the diffusion of knowledge.
The evidence can therefore be interpreted in terms of the differences in the absorptive capacity of knowledge between firms.

The methodological approach
We use the number of firms per inhabitant in the region as the measure of regional entrepreneurship capital 4 , , = , , . Implicitly, we assume that the number of firms in the region ( , ) is determined by the population of the region ( , ) and the entrepreneurship capital in the region ( , ). The focus of our analyses, and that of the previous literature, is not to quantify the effects of entrepreneurship capital on regional GDP via the increases in the number of firms, ); instead, it is to measure the spillovers of regional entrepreneurship capital on firms' production. In other words, the focus is to test whether , , depends on , . The previous section has noted the problems in disentangling both effects using data aggregated at the regional level. Thus, we use data at the firm level to estimate the firms' production function (Equation [2] in logarithmic terms), which is defined in the section above: ln , , = ln , , + ln , , + ln , + ln , + ln , + , , . [4] Given the purpose of this paper, in accordance with the previous literature measuring only the entrepreneurship capital spillovers in the firms of the same region, we do not consider cross-border effects. Apart from the inclusion of entrepreneurship capital (and the average size of the firms in the region), these kinds of equations have been extensively estimated in other contexts (see Syverson 2011 for further discussion of their limitations). The following discussion is focused on the proposed measure of entrepreneurship capital: , = , , .
Obviously, we will not argue that everyone in the population has the same probability of being an entrepreneur nor that all types of entrepreneurs provoke the same spillovers. For example, the persistence of the decision of being an entrepreneur is well documented (for a recent discussion, see Fritsch and Wyrwich 2014). Consequently, the probability of being an entrepreneur is higher for those who were entrepreneurs the previous year (persistent entrepreneurs) than for the remainder of the population (entrant entrepreneurs). Certain authors claim (Congregado et al. 2012) that, among the persistent entrepreneurs, there will be a relatively higher presence of entrepreneurs who play the risk-bearing arbitrageur role, emphasized by the writings of Knight (1921), Say (1803) and Kirzner (1979), than the role of innovative entrepreneurs, highlighted by Schumpeter (1950).
In accordance with the KSTE, innovative entrepreneurs will cause relatively more spillovers and appear to be more present among the entrant entrepreneurs. Therefore, Keilbach (2004a,b, 2008) use the annual average of new firms per 1,000 inhabitants created in a three-year period as the measure of entrepreneurship capital. This ratio can be related with the probability of an inhabitant of the region creating a new firm. Unfortunately, we do not have data regarding the regional startups for each year; therefore, we cannot provide empirical evidence analysing the differences among both measures. Previous evidence suggests that those measures are highly correlated (Fritsch and Wyrwich 2014). This finding is consistent with recent theoretical approaches (Van den Steen 2010; Gutiérrez and Ortín-Ángel 2016) arguing that, in any case, an important role of entrepreneurs is to establish a use of productive factors that otherwise would not exist.
Nevertheless, we can test whether the entrepreneurship capital in a region affects the production ( > 0) of those firms located in the region.
Hypothesis 1: The production of firms in a region is positively related to regional entrepreneurship capital.
As previously noted, certain authors use the average of the entrepreneurship capital during certain time periods because, as is argued, the entrepreneurship capital is a latent variable. To focus our argument, let us assume that the measure used is related to the entrepreneurship capital in the following manner: , = , , = ,0 ∏ ∆ , +(1− ) , 1 . The error measurements are captured by , , where t is independent random variables with a mean equal to zero. The relative importance of the measurement errors with respect to the annual variation in entrepreneurship capital ∆ , is measured by the parameter ∈ [0,1]. When =1, , is the most accurate measure of the entrepreneurship capital, and when = 0, the most accurate measure is the average (of logarithms) of , during the period analysed. Other approaches use time series techniques. For an example 5 , refer to Congregado et al. (2012).
Our approach utilizes the fact that we have an unbalanced data panel referring to 11,276 firms (j) distributed throughout 18 regions (i) with an average of 7.9 observed years per firm (t); thus, there are ultimately 89,370 observations. We can estimate the elasticity of production with respect to the measure of entrepreneurship using the fixed effects (or within) model ( ). Furthermore, disaggregated data at the firm level enable us to estimate the between effects model (and the correspondent elasticity ) with 11,276 observations instead of the 18 observations that would be used for data aggregated at the regional level.
Assuming that entrepreneurship capital is related to the production of the firms in accordance with Equation [2], and the annual variation in entrepreneurship capital ∆ , is cyclical =ln ,0 , where T is the number of periods observed), we expect that = .
Therefore, the elasticities estimated will be the same when =1 and differ otherwise ( As discussed in the section above, data at the firm level also allow us to identify those firms that benefit more from entrepreneurship capital spillovers. In accordance with the KSTE, entrepreneurship is a facilitator of knowledge dissemination. Consequently, it is expected that 5 According to Congregado et al. (2012) terminology, ,0 is the non-stationary natural rate component, and ∆ , is the stationary cyclical component.
entrepreneurship capital spillovers will be higher in those firms with lower current levels of knowledge and in firms with a higher capacity to learn, or absorptive capacities (Cohen and Levinthal 1990;Qian and Acs 2013). As a proxy of these two concepts, we use the size of the firm and its technological intensity. The technological intensity has been related to firms' higher absorptive capacities from the outset (Cohen and Levinthal 1990). Furthermore, we postulate that small firms have fewer resources; therefore, on average, we expect that they have accumulated lower levels of knowledge and thus have more to learn. In short, we propose the following hypothesis: Hypothesis 2: The benefits from entrepreneurship capital spillovers decrease with the size of the firm and increase with its technological intensity.
As in much of the literature on firms' productivity (Syverson 2011 for a summary), Equation [1] and [2] are interpreted in terms of how the production is organized, without considering that there is a process of input accumulation that occurs nearly simultaneously. Keilbach (2004a,c, 2008) argue that estimations based only on Equation where are the z possible determinants of entrepreneurship capital, are the parameters to be estimated and , are the usual error terms. The estimator of the relationship between firms' production and entrepreneurship capital that provides the simultaneous equation estimation is a full information instrumental variable (Hausman 1978) in that it considers the possible correlation between error terms and regressors and between the error terms of the two equations.
To consider the persistence of the regional levels of entrepreneurship, we will introduce the entrepreneurship capital of the last year as an independent variable in Equation [5]. Although the 6 The next considerations regarding entrepreneurship capital can be extended to the remainder of inputs in Equation [2]. As the focus of the paper is based on the entrepreneurship capital, we omit such analyses.
persistence is important, the decision of being an entrepreneur may evolve over time due to changes in their financial situation, economic perspectives, regional government policies and other personal situations.
Therefore, we have collected information concerning wealth, the aggregated value added and the population density, as well as the economic perspectives of the region and the annual increase in the aggregated value added. The existing evidence (see Koellinger and Thurik 2012 for a discussion) is ambiguous about the relationship between the previous general economic conditions of the inhabitants of a region and the current entrepreneurship capital of the region.
An analysis of each regional government policy is beyond our research purpose. One of the main instruments for such policies is tax. We have collected information about the tax pressure in each region. Consistent with the evidence available (for further discussion see Gentry and Hubbard 2004), we expect a positive relationship between the previous tax pressure and the current entrepreneurship capital.
Finally, one of the most dramatic changes in the personal situation is to become unemployed. We include information about the regional unemployment rates. Consistent with the existing evidence (for a more detailed discussion see Fairlie 2013), we expect that entrepreneurship capital will increase in those regions with previously higher unemployment rates.

Data
The firms' data used in this study originate from the Spanish Technological Innovation Panel (PITEC) and refer to the 2004-2012 period. This is an unbalanced data panel (each year some firms join and others leave the panel). We use the information corresponding to the firms' research activity and location of headquarters to allocate the firms to a specific Spanish region (further details about the allocation process are provided in Appendix 1). Those R&D activities are expected to play a key role in the diffusion/absorption of knowledge. On average, we have 7.9 observations per firm and 11,276 firms; therefore, we use a total of 89,370 observations. Next, we define the variables collected at the firm level.  Table 2).
Insert Table 2 about here In accordance with the EUROSTAT classification of enterprises, firms can be classified into three categories according to the number of employees: small (1-49 employees), medium (50-249 employees) and large firms. Table 3 shows the distribution of firms by technological intensity and size categories. Therefore, a maximum of nine categories can be used in the analyses. For simplicity purposes, in this paper, we present the analyses that best summarize the results. We use two dummies, one for size (50 or more employees) and another for technological intensity (includes HT and MT firms) 7 . The category omitted in the analyses is firms with less than 50 employees and of low technological intensity.
Insert Table 3 about here For each region, in our case Autonomous Community (i= 1,…,18), we have collected the following aggregated information.
Regional Knowledge (Ri,t) is measured by the number of patents filed each year based on the Spanish Patents and Trademarks Office (Bönte et al. 2008). Unfortunately, we do not have access to data to estimate other proxies used in previous studies, for example, the total number of people 7 All the analyses that are cited but not provided in the text are available on request from the authors. employed in private companies or universities in areas related to R&D (Mueller 2007) or the annual regional costs of R&D (Griliches 1998).
The information on the stock of firms (ni,t) in each region is available in the Central Business Register (DIRCE) database. The entrepreneurship capital is measured by the ratio between the stock of firms and the regional population 8 obtained from the INE, , = , / , . The average size of firms in the region is measured by the ratio between the regional labour force obtained from the INE and the stock of firms, , = , / , . Table 4 presents a summary of those variables.
Insert Table 4 about here Regarding Equation [5], the independent variables are the aggregated value added for each region ( , −1 ), the population density (number of inhabitants per square kilometre; DENi,t-1), the annual growth of the regional output (gi,t-1 = ln[Yi,t-1/Yi,t-2]), the tax pressure (ratio between taxes and GDP; TAXi,t-1 ) and the regional unemployment (UNEMi,t-1). We have measured such determinants using information from the INE. In the estimations, these variables have been included one period lagged; therefore, we also collect regional data for 2003.

Results
Regarding Hypothesis 1, Table 5 shows different estimations of Equation [2] using the between and the within effects models, in this last case providing the clustered (by firm) robust variance estimators. Breush and Pagan's (1979) and Hausman's (1978) tests indicate that the fixed effects model is the most appropriate for modelling the non-observable heterogeneity among firms. The difference between the first and second pairs of columns is the inclusion of the variables measured at the regional level (in the best of cases, the increase in the explanatory power of the model, R 2 , is 0.0024). The last two columns show the joint estimation with Equation [5].
Insert Table 5 about here 8 There is also information available about the economically active population. The estimations presented in the next section have been replicated using the ratio between the stock of the firms and the economically active population as the measure of entrepreneurship capital. The conclusions are very similar but, in this case, the models have a lower explanatory capacity, R square, than those of the estimations presented in the text.
The elasticity of production with respect to labour () takes values of between 0.7382 and 0.8914, the elasticity with respect to capital (α) takes values of between 0.1680 and 0.2198, and the elasticity of production with respect to private knowledge (ρ) takes values of between 0.0051 and 0.0067. All of those parameters are statistically significant at the 1% level. The null hypothesis of the constant returns to scale is rejected in both models. In short, the elasticities of production with respect to the inputs purchased by the firms are very stable among the different models estimated.
This finding does not apply to the parameters associated with regional inputs (last four columns in Table 5). The coefficients associated with the regional entrepreneurship capital and knowledge cases, but not statistically significant; therefore, we cannot reject that s = 0. The average size of the firms in the region has no spillover on firms' productivity.
Regarding the determinants of the entrepreneurship capital (Equation [5]), we find evidence of their persistence and their positive correlation with the personal wealth of the inhabitants of the region, the regional output growth, the tax pressure and the unemployment ratio.
Regarding Hypothesis 2, Table 6 shows the estimations of the above models allowing for differences in the elasticities of production with respect to the entrepreneurship capital depending on the size and the technological intensity of the firm. Hypothesis 2 is supported in the within effects model and partially supported in the between effects model. The spillovers decrease with the size of the firm (statistically significant in both models) and increase with the technology intensity (only statistically significant in the within effects model). It is important to note that the elasticities of production with respect to the entrepreneurship capital for technological firms with less than 50 employees are very similar using the between effects model (approximately 1.46) and the within effects model (1.3057 and 1.4624; all of these values are statistically significant).
This finding does not apply to the other types of firms. The values of the remaining parameters are consistent with the previous estimations in Table 5.

Conclusions
Spatial externalities have been considered a main argument for justifying the existence of policies stimulating entrepreneurship among the population (Acs et al. 2016). This paper uses data at the firm level to test whether the regional entrepreneurship capital has positive spillovers on the production of the firms in the region. Previous evidence is based on aggregated data at the regional level. The theoretical discussion in this paper suggests that those studies may be overestimating the spillovers, particularly when the production functions present decreasing returns to scale, and the regional average size of the firms has no spillovers on the production of the firms of the region.
Our evidence suggests that both features are present in the production functions estimated (at least in the within effects model); therefore, it is important to provide evidence at the firm level to confirm the presence of regional entrepreneurship capital spillovers. It is important to note that we do not provide evidence for the sources of the entrepreneurship capital spillovers (for such kind of evidence see Carree et al. 2014). Therefore, we cannot guarantee that the source of those spillovers is the KSTE argument, and we cannot reject the possibility of other sources of positive (or negative) spillovers being present at the same time.
For the overall sample of firms, we find that the regional entrepreneurship capital has positive spillovers on the production of the firms in the region; these are statistically significant in the between effects but not in the within effects model. Consequently, one could argue that we do not find clear support for such spillovers; however, those results could have different interpretations.
In accordance with Congregado et al. (2012), the ratio of the stock of firms and the population is a measure of the latent variable, entrepreneurship capital. The annual variation in the measure is an imperfect indicator of the annual variation in the latent variable. According to Congregado et al. (2012) and our estimations, approximately 20-35% of the variation in the measure is related to a variation in the latent variable.
Another interpretation is that we have omitted time invariant variables correlated with entrepreneurship capital. If this is the case, it appears that those variables are particularly relevant for non-technological firms. It is important to note that, for technological firms, the elasticities estimated are positive, statistically significant and with similar values in both models (between and within effects). We also find that, if there is a benefit, small firms benefit more from the regional entrepreneurship capital spillovers.
In short, we only detect generally extended positive spillovers in between effects models, while in the within effects models, significant positive spillovers are only found for technological firms.
Our results need to be confirmed in other contexts: different geographical areas, different definitions of region and different periods of time. The evidence presented opens interesting theoretical and empirical questions among which are the omitted time invariant variables correlated with the regional entrepreneurship capital and the production of the firms in the region.      *: Significant at the 0.10 level. **: Significant at the 0.05 level. ***: Significant at the 0.01 level. Robust standard errors are in brackets.