Optimal Bank Capital with Costly Recapitalization*

Samu Peura  

Sampo Bank

Jussi Keppo  

University of Michigan

We study optimal bank capital choice as a dynamic trade‐off between the opportunity cost of equity, the loss of franchise value following a regulatory minimum capital violation, and the cost of recapitalization. We introduce a recapitalization delay that results in a strictly positive probability of capital adequacy violation. We calibrate the model to bank accounting return data and evaluate the model’s ability to explain observed bank capital ratios. Differences in return volatility explain a significant fraction of the cross‐sectional variation in bank capital ratios. Differences in the level of capital market imperfections across banks constitute a secondary explanation.

I. Introduction

 

A general risk management lesson from models with frictions is that, in the absence of explicit risk management tools, such as financial derivatives, firms will choose to hold buffer stocks of liquid assets and capital as hedges against liquidity and earnings risks. The argument for the buffer stock role of liquid assets has been theoretically presented and empirically verified, for example, by Kim, Mauer, and Sherman (1998) and Opler et al. (1999), who find that firm liquid asset holdings are positively related to cash flow risks. The buffer stock role of equity capital, on the other hand, is supported by many empirical studies on capital structure (see, e.g., Titman and Wessels 1988; Harris and Raviv 1990; Booth et al. 2001), which find that firm leverage is negatively related to earnings volatility. In other words, we observe risk management considerations to influence both corporate investment decisions and corporate financing decisions.1

The capital structure decision in banks is, in its very essence, a risk management decision. A bank practitioner views bank capital not primarily as a form of financing but as a buffer against asset risks that need to be managed so that the bank can satisfy its minimum capital requirement even under relatively adverse future scenarios.2 It is implicit in this view that the violation of the minimum capital requirement is costly for the bank and that the bank faces costs and/or constraints associated with portfolio adjustment and recapitalization. Subject to these conditions, the role of buffer capital as a hedging mechanism against minimum capital violation is well founded.

Much of academic banking theory has been built on a quite different view on bank capital. A sizable literature has studied conditions or regulatory setups that could eliminate the bank owners’ asset substitution moral hazard problem.3 This literature concentrates on incentives in asset risk choice while taking bank capital as exogenous and abstracting from dividend and recapitalization choices. Therefore, the literature is ill suited for explaining banks’ actual capital choices.4 However, the two points of view on bank capital are complementary. The literature that builds on risk‐shifting incentives provides a rationale for banking regulation in general and for minimum capital regulation in particular. A bank manager thinking on bank capitalization takes the minimum capital constraint (the prevailing form of banking regulation), as well as the consequences of minimum capital violation, as additional constraints to her choice. In a minimum capital regime, such as the current Basel regime, a bank’s capital choice is really a choice of the capital buffer to be held in excess of the minimum requirement. Asset risks, recapitalization constraints, and the penalty from regulatory capital violation are the key determinants of this choice.

In this article, we adopt the bank manager’s point of view on bank capital and test whether a simple optimizing model built on this view is capable of explaining the observed patterns of bank capital holdings. Our model is one of a value‐maximizing bank and prescribes an illiquid bank portfolio, imperfections in capital raising transactions, and loss of franchise value associated with the violation of the minimum capital requirement. Our model does not display any risk‐shifting opportunities, since the bank portfolio is assumed to be illiquid. The model predicts strictly positive levels of buffer capital. We estimate bank return parameters from bank‐level accounting data and compare the implied bank capital ratios with actual bank capital ratios. We are not aware of a similar calibration exercise performed in the existing banking literature.

The idea to model banks’ capitalization decision based on the buffer stock role of bank capital is not new. Our model builds on the basic continuous‐time model of a capital‐constrained firm presented in Milne and Robertson (1996). Hojgaard and Taksar (1999) have extended the basic model into an insurance company setting by assuming that risk reduction at a proportional cost is available, which is interpreted as cheap reinsurance. Milne and Whalley (2001) have extended the model to allow for a recapitalization option. Peura (2002) has analyzed the effects of an equity issuance option that is subject to a proportional cost and an upper bound on the rate at which external capital can be raised. Our main modeling innovation is to impose a delay on capital issues.5 We suggest that this is a natural stylized assumption that proxies for the fact that capital raising transactions in reality require heavy preparatory work. It turns out that the delay has a significant effect on the qualitative nature of the resulting capital raising policy. In the presence of a delay, bank liquidation probability is also strictly positive, unlike in a model where recapitalization can be implemented instantaneously.

The determination of bank capital from a trade‐off perspective has also been studied in discrete time by Froot and Stein (1998), Furfine (2001), and Estrella (2004). In the context of a model with costly external capital, Froot and Stein (1998) demonstrate that a bank investing in illiquid products may adjust its capital structure in order to accommodate the illiquid risks it chooses to bear. Estrella (2004) uses a variant of the classical inventory or cash management models to study cyclicality of bank capital. In his model, the objective is to minimize the combined costs from over‐ and undercapitalization, as well as those from adjustment of capital. Furfine (2001) presents a model of a value‐maximizing bank and calibrates the model to panel data in an attempt to explain banks’ portfolio shifts following the introduction of the Basel Accord in 1989.

Our calibration exercise is based on a sample of U.S. commercial banks in S&P’s Compustat database. The average total capital ratio over the 1993–2002 period in the sample is 13.0%. The minimum requirement imposed by the Basel Accord is 8%, implying that the average capital buffer is 5% of risk‐weighted assets, or over 50% of the minimum requirement. Our model, implemented with the sample means and volatilities of bank returns, only yields an average capital buffer of 2.4%. The model capital ratio is highly sensitive to the level of return volatility, however, and much of the shortfall between the model and actual capital ratios is attributable to banks with below average return volatilities. There is reason to suspect that these estimates suffer from a peso problem, due to insufficient length of the data history. When applied to banks with above‐average credit‐loss provisions over the sample period, the model generates an average capital buffer of 3.5% and explains some 40% ( ) of the variation in capital levels across banks. We find that forward‐looking volatility estimates could improve model fit, both in terms of average levels and cross‐sectionally. Our model can replicate observed capital ratios exactly if volatility is made an implied parameter, analogous to how the Black‐Scholes model is used in practice. The average implied volatility is higher with small banks, even after acknowledging the likely differences in capital market imperfections.

The rest of this article is organized as follows. Section II presents our model of capital control and shows that the nonhomogeneous problem of capital control can be transformed into a homogenous problem of capital ratio control. The solution is characterized in terms of a set of variational inequalities. Optimal policies and the value function, also in the limiting cases of the model, are derived in Section III. Section IV illustrates optimal policies through numerical examples. Model parameters are calibrated and the comparison against actual capital ratios is presented in Section V. Section VI concludes.

II. The Model

 

A. A Model of Bank Capital

We imagine a bank whose portfolio size, measured by its regulatory risk‐weighted assets R,6 grows at a constant positive rate r, so that for some initial positive R0. The growth rate r is assumed to coincide with the risk‐free rate. The bank’s relative profitability remains unchanged over time, so that the scale of the bank’s profits also grows at the rate r. Cumulative bank profit Yt therefore satisfies where is a standard Wiener process.7 The parameter μ is the expected return on (risk‐weighted) assets, and σ is the asset return volatility. These are both assumed to be positive constants. This implies that expected instantaneous bank profit is proportional to bank portfolio size (measured by risk‐weighted assets) and that instantaneous profit is stochastic and nonpredictable.

Owners control bank capital through dividend payments and issues of new capital. Dividends payments can be implemented instantaneously, but capital issuance is associated with a delay of length Δ and with a cost that is a fixed proportion K of the size of the bank (as measured by R). Formally, a capital control policy is a collection , where is a nondecreasing process representing the cumulative amount of dividends paid under policy , is an increasing sequence of order times of new capital issues, and are the amounts of capital raised at each issue of capital. We denote by the class of admissible policies that satisfy the following: is a nondecreasing right‐continuous process adapted to Ft such that ; each is a stopping time of the filtration Ft; each is measurable with respect to The measurability of si with respect to means that owners may decide on the exact amount of capital to be raised at time based on all then‐available information. They do not need to precommit to any quantity of capital at time ti when they order the capital issue. Additionally, admissible controls satisfy Condition (2i) states that a new issue may not be ordered while a previously ordered issue is waiting to be completed. Condition (2ii) states that dividends may not be paid between the ordering of a capital issue and the actual capital collection. The condition has important technical merit but also an economic justification, in that ruling out simultaneous capital issues and dividend payments is likely to reduce conflicts of incentives between existing and new equity holders. The potential incentive conflicts are not explicitly present in our model, and we do not analyze the division of bank value between existing and new equity holders. We simply think of constraint (2ii) as a restriction set by the capital markets.

Bank capital stock as a function of policy is denoted and satisfies the integral dynamics where I{·} is the indicator function of the event defined in the parentheses. This implies that cumulative profits and new issues of capital feed to the capital stock, while dividend payments represent a leakage from the capital stock. Bank capital also earns the risk‐free rate.8

The minimum capital requirement under the current Basel Accord states that bank capital must at all times exceed 8% of the bank’s risk‐weighted assets. We assume that the corrective action from violation of the minimum capital requirement will be liquidation.9 The model bank therefore only operates up to the liquidation time The value of the bank under policy to its owners, given the initial level of capital , is the expected discounted present value of dividends less capital issues until liquidation: where ρ is a positive constant representing the wedge between debt and equity finance due to capital market frictions such as taxation and agency costs of equity and K is a nonnegative constant representing the cost of capital issuance.10 Equation (5) implies that the cost from capital issuance is proportional to bank size, as measured by risk‐weighted assets. The capital control problem is to identify the value of an optimally managed bank and an admissible policy that achieves this value. The model has six parameters in total: μ and σ characterize bank returns, r is the risk‐free rate (and also the implicit cost of debt [see n. 8]), ρ is the wedge between debt and equity finance, and Δ and K are the capital raising delay and cost, respectively.

B. A Normalized Model of Bank Capital Ratio

The capital dynamics defined in (3) is not time homogenous, which makes direct solution of the problem (6) difficult. The problem of capital control can, however, be transformed into a time‐homogenous problem of capital ratio control through a simple normalization. The normalized state variable, the bank capital ratio X, is defined as The following proposition presents the capital ratio control problem and shows its connection to the capital control problem (6).

Proposition 1. Given a policy π ∈ Π, let bank capital ratio satisfy and define the first time of capital ratio violation by Define bank equity value, given policy π, by where the expectation is conditional on the capital ratio dynamics (8i), and let the value function be Then (6) can be expressed in terms of (8iv) as Moreover, let be the policy that achieves the optimum in (8iv). Then the policy that achieves the optimum in (6), , can be expressed in terms of by The key to this result is to understand that when π and are related through (10i)–(10iii), then the capital ratio process given by (8i) is exactly the process , derived from (1) and (3) with the help of Ito’s lemma, given that the initial values satisfy . The complete proof of the proposition is available from the authors upon request.

Equation (9) implies that the objective function of the capital ratio control problem, (8iii), can be interpreted as the value of bank equity as a percentage of risk‐weighted assets. Moreover, since the capital ratio dynamics in (8i) do not depend on the level of the capital ratio, we may, without loss of generality, normalize the default point in (8ii) to 0 and interpret X as the excess capital ratio, that is, the capital ratio in excess of 8%. The relation (9) between the solution to the original capital control problem and the solution to the capital ratio control problem is preserved once we interpret , correspondingly, as excess capital, that is, capital stock in excess of 8% of risk‐weighted assets. We will use this reinterpretation in the remainder of the article.

C. Characterization of Optimum

We characterize the value function (8iv) through a set of variational inequalities. For this purpose, we define two operators. Let D be the set of real‐valued functions on R+. We define the operator M:DD by where XΔ is the value at time Δ of X defined by τ0 is the first hitting time of 0 of X, and the expectation is conditioned on Operator M can be interpreted as the expected value of the decision to order new capital immediately, given that the “continuing value” of the problem is f. Also, we define the infinitesimal generator A by for all sufficiently regular f. This may be interpreted as the expected instantaneous change in the value of the function f, given that no immediate controls are undertaken.

Now the following characterization of optimum can be established using standard arguments (see, e.g., Fleming and Soner 1993; Hojgaard and Taksar 1999).

Proposition 2. Assume that the value function (8iv) satisfies Ito’s formula. Then it satisfies the following set of inequalities for all : Inequalities (13i)(13v) are first‐order conditions to our problem that follow from standard dynamic programming arguments applied to the Bellman equation. With the exception of (13i), they must be understood as functional (in)equalities, that is, to hold for all positive (the domain of the state variable in our problem). Equality (13i) follows from 0 capital buffer (remember our reinterpretation of the state variable as excess capital) being an absorbing state in our model. Inequality (13ii) holds since the value of immediate order of new capital can never exceed the value function by definition of the value function. Inequality (13iii) holds since applying no control to the capital stock is always an admissible policy. Inequality (13iv) must hold since paying dividends is an admissible policy, and (13v) states that, in an optimum, one of the inequalities must be tight. That is, for all x, either taking no action or taking some of the admissible actions must represent the optimal policy.

We note that proposition 2 does not assume that the value function is twice continuously differentiable everywhere. It is well known that the second derivative of a value function in general exhibits a discontinuity at the boundary of the region where impulse control actions are optimal (see Dumas 1991). This does not prevent Ito’s formula from applying, but the differential generator in inequality (13iii) is to be interpreted in terms of left or right derivatives.11

III. Solutions

 

Constructing a solution to (13i)–(13v) requires a guess on the form of the solution, that is, on the order of the “optimality regions” for each of the policies. Our assumption on the form of the solution in the general case (i.e., under parameter combinations such that capital issues are optimally undertaken) is the following: (1) for , it is optimal to immediately order new capital, (2) for it is optimal neither to order new capital nor to pay dividends, and (3) for , it is optimal to pay dividends. Furthermore, we expect to have and u2 finite; u1 may be 0 when capital market imperfections are prohibitively high. Figure 1 illustrates the model with this form of solution.

Fig. 1.— Illustration of model structure: x is the current level of the capital buffer, u1 denotes the capital issue barrier, and u2 denotes the dividend barrier; Δ is the delay (in years) between the decision to issue capital and the actual issue. All of these are as a percentage of risk‐weighted assets.

Open New Window

According to our initial guess, we look for a function that solves (13ii) with equality for solves (13iii) with equality for and solves (13iv) with equality for Such function also solves (13i) and (13v). The function is to be continuously differentiable at the impulse control barrier u1 and twice continuously differentiable at the singular control barrier u2.

A. General Case

We define the following functions: where and where In (14), Φ is the cumulative standard normal distribution and φ is the density of the standard normal distribution. The following result gives the value function (8iv) in terms of these functions, as well as a sufficient condition on the problem parameters for the existence of the general solution.

Proposition 3. Let where M and f1 are given by (14) and (15) and Then there exists a solution to the set of algebraic equations satisfying and such that for all In terms of the solution to (18i)(18ii), the value function (8iv) is where M is given by (14), f1 is given by (15), and f2 is given by (16). The algebraic system (18i)–(18ii) is nonlinear, but standard numerical optimization procedures (secant method, Newton’s method) converge well, given reasonable initial values.12

The interpretation of the solution (19) is simple. The value function coincides with the function M in the region where immediate ordering of capital issues is optimal, with the function f1 in the “wait and see” region and with the linear function f2 in the dividend payment region. Figure 2 illustrates the solution in this general case. Smooth pasting (up to second derivatives) of f1 and f2 takes place at u2, the dividend payment barrier. The solution to (18i)–(18ii), in turn, imposes smooth pasting (up to first derivatives) of M and the capital issue order barrier. The function M is more concave at u1 than is f1, and the second derivative of the value function at this point experiences a discontinuity.

Fig. 2.— Construction of the value function. The figure shows the components of the value function (19). Parameter values: and . The optimal capital issue barrier u1 is 0.90%, and the optimal dividend barrier u2 is 3.66%. These are marked with vertical dotted lines. The optimal dividend barrier in the absence of the capital issue option, u0, is 3.80%.

Open New Window

B. Limiting Cases

The limiting cases of the model emerge as either the capital issue cost or the capital issue delay approaches zero, or as either of these exceeds a critical value so that capital issues are no longer optimal. We find the limiting cases interesting, since they help to understand the comparative statics of the general model and show exactly which of the capital market imperfections drive the qualitative results.

Case 1: K or Δ above a Critical Value When Δ or K are above their critical values (which depend on other problem parameters) such that capital issuance is no longer optimal, the optimal policy and the value function reduce to those of the Milne and Robertson (1996) model without the capital issue option. This value function is a special case of (19), obtained by setting and , as given in (17). The value function takes the form where a1, a2, d1+, and d1− are as in (15).

Case 2: In the absence of any capital raising delay, new capital can be issued instantaneously and there is perfect control on the minimum level of capital. Given the opportunity cost of capital, it is clearly optimal to wait until the buffer capital falls arbitrarily close to zero before issuing new capital. As zero is an absorbing boundary, however, new issues would have to be implemented before buffer capital actually hits zero. Not surprisingly, an optimal policy in the model without delays does not exist. Yet ε‐optimal policies can be constructed that set the capital issue barrier arbitrarily close to 0. One may think of the value function in the case as the limit of the values associated with such ε‐optimal policies.

Taking the limit as Δ approaches 0, the function M in (14) simplifies to for all Taking the limit of this as we let the capital issue point x approach zero, we obtain the boundary condition satisfied by the limiting value function in the case: This condition now replaces (13i). By the previous reasoning, the capital issue barrier in the limiting case is located at zero, so that the solution only has one free barrier, the dividend barrier. As in the general case, the dividend barrier is also the level up to which the capital ratio is replenished each time a new capital issue is implemented. The solution is given in the following proposition, proven in the appendix.

Proposition 4. If where u0 is given by (17), the value function is where is the unique positive solution for u2 in the equation Otherwise, the value function and the barrier are identical to (20) and (17). The parametric form of (22) is the same as that of (20), the difference between the two value functions being the location of the barriers u0 and . When the condition of proposition 4 holds, and in this case (22) is a left‐shifted version of (20).

If both Δ and K are equal to 0, then (23) is solved by This limiting case represents perfect market conditions. In perfect markets, no buffer stocks of capital are held and all profits are immediately paid out as dividends. When losses are realized, the capital to cover the losses is instantaneously raised from capital markets. The controlled capital ratio would be a constant equal to the 8% minimum.

Case 3: We can say little more about this limiting case than about the general case. The limit does not involve a degeneracy as the limit does. This is evident from the formula (14) for the function M, where setting K to zero does not influence the qualitative properties of M.

This observation implies that the presence of delay drives the qualitative nature of the solution, in particular the existence of the nonzero capital issue barrier. The sole presence of the fixed cost does not generate a positive capital issue barrier and will not result in a positive probability of liquidation. We find these observations to support the presence of a capital issue delay, which is precisely the additional ingredient in our modeling strategy relative to earlier contributions in the banking literature.

IV. Comparative Statics

 

In this section, we illustrate the effects of the capital issue cost and delay on optimal capital issue and dividend policies. Further, we analyze the value of the capital issue option.

A. Optimal Dividend and Capital Raising Policies

Figure 3 shows the response of the barriers u2 and u1 to the parameters Δ and K determining the degree of capital market imperfections. We have drawn the figure with μ, σ, and ρ fixed at representative values. We will return to the estimation of these parameters in the following section.

Fig. 3.— Optimal capital issue and dividend barriers. The upper picture shows the optimal dividend barrier u2 as a function of capital market imperfections and K. The lower picture shows the optimal capital issue barrier u1. Fixed parameter values: and

Open New Window

The optimal dividend barrier in the top picture of figure 3 is nondecreasing with respect to K and Δ. The optimal choice of the dividend barrier balances the expected cost of new capital issues, as well as the expected loss of continuing value from liquidation, against the time value of delayed dividends. The dividend barrier is nondecreasing in the capital issue cost, since the latter increases expected capital raising costs. The dividend barrier is nondecreasing in the length of the delay, since a longer delay, ceteris paribus, implies a higher probability of liquidation. Also, the dividend barrier is quite sensitive to the introduction of small costs of, and delays in, capital issuance, given that these are initially at a low level. When the cost or the delay is already sizable, the dividend barrier is relatively insensitive to small increases in them.

The optimal capital issue barrier in the bottom picture of figure 3 is nonincreasing in the capital issue cost K. The higher the capital issue cost, the higher risk of liquidation the owners of the bank are willing to take in order to avoid the capital issue cost. However, the capital issue barrier does not behave monotonically with respect to the delay Δ. When the delay is relatively short, the optimal response to an increase in the delay is an increase in the capital issue barrier. In this case, a longer delay induces earlier (in time) ordering of new capital. When the delay is relatively long, on the other hand, the optimal response to an increase in the delay is a decrease in the capital issue barrier. This happens because the value of ordering a new capital issue is affected two ways by changes in the delay. First, an increase in the delay increases the probability of liquidation during the delay, ceteris paribus, inducing an increase in the capital issue barrier. Second, the model forbids dividend payments during delay, so that an increase in the delay defers potential dividend payments further into the future, should a capital issue be ordered now, suggesting a decrease in the capital issue barrier. It turns out that, for short delays, the former effect dominates, while for sufficiently long delays, the latter effect dominates. Figure 3 also suggests that the point where the positive response of the capital issue barrier with respect to the delay turns negative is the lower, the higher is the cost of capital issuance. Finally, consistent with proposition 4, the capital issue barrier converges to zero as the delay approaches zero.

Comparing the dividend and capital issue barriers in figure 3, we observe that the expected size of a new issue (which is well approximated by the difference between the dividend barrier and the capital issue barrier) increases with the capital raising cost K. The owners optimally issue new equity in larger quantities but less frequently when the cost of issuance is high, relative to the case where the cost of issuance is low. We will use this observation in the next section to identify plausible values for the parameters Δ and K describing capital market imperfections.

B. Value of the Capital Issue Option

The opportunity to issue new equity, being an option in our model, cannot reduce bank value.13 The value of the capital issue option in the model is the difference between the value functions (19) and (20), for a given initial capital ratio.

Figure 4 shows the value of the capital issue option as a function of the capital buffer. We observe that the value of the option is monotonically declining in the capital issue cost K (and also in the delay Δ, although this is not shown in fig. 4). This is unsurprising, since both parameters are business constraints. As a function of the capital buffer, the value of the capital issue option displays a humped‐shaped behavior. The option value is at its highest when the capital buffer is somewhat below the optimal capital issue barrier (marked by a rectangle in fig. 4). The value of the capital issue option decreases as the capital buffer falls sufficiently below the optimal capital issue barrier. Here it becomes increasingly unlikely that the bank will survive until the end of the delay. As the capital buffer approaches zero, the value of the capital issue option approaches zero as well. The value of the capital issue option also goes down when the capital buffer increases above the capital issue barrier and above the dividend barrier (marked with a cross). Here the probability of capital shortages in the near future is low. The value of the option ultimately levels off to a constant. This happens somewhat above the dividend barrier (this is easily grasped from [19] and [20]). To conclude, the option to issue new capital is most valuable to banks that are in the neighborhood of the optimal capital issue barrier but that are still a reasonable distance above their minimum capital requirement.

Fig. 4.— Value of the capital issue option. Value of the capital issue option as a percentage of risk‐weighted assets, as a function of the capital buffer. Fixed parameter values: The rectangle (cross) along each line indicates the location of the capital issue barrier u1 (the dividend barrier u2).

Open New Window

It turns out that the value of the capital issue option, as a percentage of the total (cum‐option) value of the bank, may be substantial for troubled banks. In terms of the example in figure 4, when the capital issue cost equals 0.25% of risk‐weighted assets and the capital issue delay is 0.5 years, the value of the capital issue option is over 15% of the value of a troubled bank, that is, one whose capital buffer approaches zero. However, the option value is less than 2% of bank value for an otherwise identical bank that is optimally capitalized, that is, one whose capital buffer equals the dividend barrier.

The recapitalization option may also have value despite substantial capital market imperfections. Figure 4 is based on an annual expected bank income (μ) of 1.0% of the bank’s risk‐weighted assets. The option to issue capital still has value when the cost of a capital issue is 2% of risk‐weighted assets, that is, 2 years worth of expected profit. This suggests that we should observe bank owners optimally recapitalizing even when this means paying out several years worth of expected earnings in capital issue–related expenses.

V. Calibration and Empirical Tests

 

Our purpose is to provide a crude assessment of the potential of the model to explain observed bank capitalizations, both the average level of bank capital and the variation in capital levels across banks. In our tentative calibration, only μ and σ are estimated at the bank level from accounting data. Other parameters are treated as common to all banks. The Δ and K are fixed based on their implied capital issuance frequency and cost, while ρ is fixed based on its implied bank valuation multiple. We will later evaluate the cross‐sectional variation in bank capital ratios that can be generated by variations in Δ and K.

A. Calibration of Model Parameters

The accounting identity that governs the evolution of bank equity is of the form where Ct is bank equity at time t, NIt is net income over period ( ), Dt is dividends over period ( ), and St is equity issuance over period Consistent with our model, we decompose net income as where ROAt stands for return on (risk‐weighted) assets over period and rt is the risk‐free rate over period R is risk‐weighted assets and X is bank capital ratio as before. The first summand in (25) is the stochastic return on bank asset portfolio. The second summand is the return on bank equity, where equity is assumed to be invested at the risk‐free rate. Also consistent with our modeling assumptions, we impose the condition that bank risk‐weighted assets grow at the risk‐free rate Combining (24)(26), an approximate expression for the discrete dynamics of the bank capital ratio Xt in terms of the accounting variables is where, according to (25), ROAt has the expression A comparison of the model capital ratio dynamics (8i) and the discrete dynamics (27) then suggests that we should interpret the model parameters μ and σ in terms of accounting data as We use annual Compustat data on actual capital and accounting returns for a sample of U.S. national commercial banks over the period 1983–2002. We qualify all banks with at least 15 years of data. We drop one additional bank from the sample because its μ estimate is negative (in the model, such a bank would be liquidated at once), ending up with a sample of 61 banks. For each bank, we estimate μ and σ from the time series of the bank’s ROA, calculated according to (28).14 The risk‐free rate in this calculation is taken to be the prevailing federal funds rate. Table 1 summarizes the parameter estimates.

Table 1
Table 1 Bank Return Parameters (%)

Open New Window

Strikingly, the correlation between the μ and σ estimates is significantly negative, −0.37 (bottom part of table 1). Ex ante, a positive correlation between risk and return would be expected. The negative correlation suggests that the sample period is of insufficient length to properly capture the risk‐return trade-off in banking and that some banks’ estimates may suffer from a peso problem, that is, the nonexistence in a finite sample of a crisis that takes place with small probability. The worst loss years in the data are 1987, 1990, and 1991. Banks that experienced losses (as measured by net income) or high credit loss provisions during these years have systematically below‐average mean returns and above‐average volatilities of returns over the sample. Many banks that did not experience such loss episodes, on the other hand, display both a high mean return and a low volatility of return.

In order to separate the banks that have undergone a loss episode from those that have not, we split the sample into two groups based on the average level of loan‐loss provisions over the sample period. Table 1 shows the parameter estimates for both groups. We observe that banks (29 in total) with higher‐than‐average loan loss provisions have (1) on average, lower μ estimates, (2) on average, higher σ estimates, and (3) a positive and highly significant correlation between capital levels and σ estimates. The last observation is important, since the σ estimate should be the key parameter driving capital levels in the model. Moreover, the correlation between capital levels and the μ estimates is of the correct sign (negative, although not significant) in the group of banks with high loan loss provisions. None of the correlations are significant in the other group of banks, which supports our claim that the μ and σ estimates in that group are likely to suffer from the peso problem.

As for the other parameters, ρ has an alternative interpretation within our model as the earnings‐to‐price ratio of an optimally capitalized bank with stationary growth. This is due to the result obtained from formulas (16) and (19).15 We use a common estimate for ρ equal to 4% across all banks, implying a maximal price‐to‐earnings ratio of 25.

For Δ and K, we identify common estimates across banks such that their implied capital raising cost and frequency are of plausible magnitude. In this exercise, we set μ and σ close to their average values in the bank‐level data. Table 2 shows proportional capital raising cost and expected capital raising frequency subject to different combinations of Δ and K. The table implies that Δ has only a second‐order effect on the proportional cost of capital issues. This is because the delay has only a modest impact on the difference , which, in turn, determines the average size of a capital issue. The difference is driven by K, and hence we may choose K rather independently from Δ, so as to yield a desired proportional cost of capital issuance. The empirical evidence on capital issuance costs (see, e.g., Lee et al. 1996; Bajaj, Mazumdar, and Sarin 2002) suggests that direct costs of equity issuance can be as high as 10% of the issue size. There are likely to be some indirect costs as well, such as the cost of the bank’s own effort, so that direct and indirect costs together could be more than 10% of the proceeds of the issue. Table 2 implies that a K of 0.25% (of risk‐weighted assets) achieves proportional capital issuance costs in the 10%–12% range. We treat this as our best estimate for K.

Table 2
Table 2 Impact of Delay and Cost on the Frequency and Size of Capital Issues

Open New Window

Given K fixed at 0.25%, table 2 shows that a Δ equal to 0.08/0.5/1.0 years implies an expected capital issuance frequency of 18/28/44 years. We do not possess comprehensive data on bank capital issues to evaluate these values empirically. However, 18 years would imply an emergency recapitalization in every second or third recession, which, to our intuition, is more frequent than most banks are likely, or expect, to experience. We find 0.5 years a plausible estimate for Δ, yielding an average recapitalization frequency of 28 years.

B. Cross‐Sectional Variation and Level of Capital Ratios

We begin by illustrating the performance of our model in explaining the cross‐sectional variation in capital ratios. Figure 5 plots actual capital buffer against the model predicted capital level, the latter taken to be the dividend barrier (u2). The left picture uses the average over 1993–2002 as the measure of actual capital for each bank, while the right picture uses the maximum capital over the same period. These are both shown since it is not immediately obvious which are the relevant quantities to be compared against each other.16

Fig. 5.— Actual versus model capital ratios for the sample banks. The left picture shows average actual capital buffer (over 1993–2002) plotted against model dividend barrier (u2). The right picture shows maximum actual capital buffer (over 1993–2002) plotted against model dividend barrier (u2). The filled (empty) squares are banks with above‐average (below‐average) loan loss provisions. A linear least squares fit is drawn into the two pictures for both groups of banks.

Open New Window

Figure 5 shows that the model has predictive power only within the group of banks with higher‐than‐average loan loss provisions. Among this group of banks, the model dividend barrier explains over 30% of the variation in both average capital and maximum capital (correlations 0.57 and 0.60 against the respective variables). The slope coefficient in the linear least squares fit of actual capital on model dividend barrier is positive, yet significantly less than one, and the intercept in the same fit is significantly positive. This suggests that there could be a component in actual capital buffers that is unrelated to the volatility of bank returns.

As for the group of banks with lower‐than‐average loan loss provisions, the R2’s are virtually zero, and the slope coefficients of actual capital on model capital, if nonzero, are negative. However, this is consistent with the hypothesis that mean returns are overestimated and volatilities of returns are underestimated within this group of banks. Both these biases lower the model dividend barrier, which systematically (with only one exception!) underestimates actual capital buffer.

Figure 6 illustrates the behavior of the model dividend barrier as a function of the μ and σ estimates. The left picture of figure 6 shows that the dividend barrier is effectively driven by the σ estimate, the correlation between the two being 0.99. This picture also sorts out the two groups of banks, showing that a significant number of the banks with lower‐than‐average loan loss provisions have σ estimates that are no higher than 0.5%. These banks systematically have model dividend barriers less than 2%, and they constitute most of the observations in the utmost‐left region in figure 5. The right picture in figure 6, in turn, verifies that μ is negatively correlated (−0.48) with the dividend barrier. The effect of μ on the dividend barrier is also nonlinear and depends on the level of volatility.17 This is highlighted in the right picture by separating the observations into four groups according to the level of the σ estimate.

Fig. 6.— Drivers of the model capital buffer. The left picture shows model dividend barrier (u2) plotted against σ estimates for the sample of 61 banks. The banks that have higher‐than‐average (lower‐than‐average) loan loss provisions have been marked with squares (crosses). The right picture shows u2 plotted against the corresponding μ estimates. The observations have been marked depending on the level of the σ estimate: 0–25 percentile (cross); 26–50 percentile (triangle); 51–75 percentile (circle); 76–100 percentile (square). The correlation in the left (right) picture is 99% (−48%). The correlations among the subsamples in the right picture are −28% (cross), −79% (triangle), −41% (circle), and −6% (square).

Open New Window

The model falls somewhat short of explaining the level of capital held by banks. In the aggregate sample, the average of model dividend barriers (u2) is 2.4%, while the average of (time series averages of) actual capital is 5.0%. The difference is even wider if the comparison is made against maximum capital (7.2%); however, we do not endorse this comparison mainly because discreteness of dividend payments and temporary investment considerations are likely to distort the maximum measure. Among the group of banks with higher‐than‐average loan loss provisions, the difference between average actual capital (4.6%) and average model dividend barrier (3.2%) is narrower but still large in proportional terms. The averages also cannot be reconciled through an adjustment of the capital market imperfections. Even in the absence of the capital issue option, the average of model dividend barriers (u0) is only 2.7% in the aggregate sample and 3.8% among banks with higher‐than‐average loan loss provisions. Lowering the remaining model parameter ρ down to 2% would generate a 0.35% higher average dividend barrier (0.6% higher among banks with higher‐than‐average loan loss provisions), not sufficient to match the average capital buffer in the sample. Also, the implied price‐earnings ratio would no longer be realistic (a ρ of 2% implies a price‐earnings ratio of 50 for a well‐capitalized bank).

It is possible that our model underestimates actual capital because we have misspecified the minimum capital requirement. This is suggested by the significantly positive intercepts in the regressions of actual capital on model capital in figure 5. The model restricts the capital buffer of a zero volatility bank to zero. A positive volatility intercept could mean that the effective minimum capital requirement faced by banks is higher than 8%, perhaps closer to the 10% “well capitalized” requirement imposed by the Federal Deposit Insurance Corporation Improvement Act (FDICIA; we will discuss alternative regulatory requirements in Sec. VI). Interestingly, no bank in the sample has a time series average capital buffer less than 2%, that is, a total capital ratio less than 10%.

Table 3 shows correlations between model capital and the μ and σ estimates, as well as some possible explanatory factors that are not present in our model. The latter include bank asset size, asset growth rate, and the level of loan loss provisions (this has been used as a grouping variable). Also shown are correlations between the model residual (the difference between actual capital buffer and the model proxy, either u2 or u0) and the explanatory variables. These can provide signals concerning possible model misspecification. Concentrating on the restricted sample in the lower part of table 3, we observe that the volatility estimate (σ) is significantly negatively correlated with the model residual. This is not surprising, given the significantly positive intercept of actual capital against model capital (fig. 5). There is also a negative correlation between average loan loss provisions and the model residual, suggesting that the time series average may underestimate actual capital with those banks that have experienced large credit losses. On the positive side, we observe that asset growth rate is not significantly correlated with the model residual, suggesting that our constant growth rate (across banks) assumption does not affect the cross‐sectional performance of the model.

Table 3
Table 3 Correlations between Model Capital and Explanatory Factors

Open New Window

Finally, there is, in table 3, a material negative correlation between bank asset size and actual capital (−0.43 in the restricted sample). The correlation between bank asset size and the model residual is lower in absolute value but still negative, suggesting that larger banks hold less capital than small banks, even after controlling for possible differences in their μ and σ estimates. So far we have not varied Δ and K across banks, but it is quite plausible that these are inversely related to bank size. A simple way to test this is to apply our estimated Δ and K to large banks only (those with higher‐than‐average assets) and assume that capital issues are prohibitively costly for small banks. Therefore u2 (u0) is treated as the model capital buffer for large (small) banks. The right‐most column in table 3 confirms that this reduces the correlation between bank size and model residual close to zero. Moreover, the R2 in a regression of average (maximum) actual capital on model dividend barrier is now 39.7% (42.9%), suggesting that differences in capital market imperfections across banks can explain an additional 10% of the variation in capital levels across banks.

Continuing with the idea that only large banks possess the recapitalization option, figure 7 shows the implied volatilities that replicate the average level of the capital buffer among large and small banks within the restricted sample. For large banks (average assets over 40 billion US$), the average capital buffer of 3.8% implies a volatility of 1.06%. For small banks, the average capital buffer of 5.3% implies a volatility of 1.19%. The average σ estimate is 0.82% for large banks and 1.06% for small banks, implying that the required volatility adjustment is, in fact, higher for large banks, after controlling for likely differences in capital market imperfections.

Fig. 7.— Implied volatilities for large and small banks. Dividend barrier (u2) plotted against volatility (σ) for large banks ( ) and small banks ( capital markets prohibitively high). The dotted lines indicate the implied volatilities, 1.06% and 1.19%, corresponding to the actual capital ratios of large and small banks, 3.81% and 5.26%, respectively. The average σ estimate for large (small) banks is 0.82% (1.06%). All estimates are based on the subsample of banks with higher‐than‐average loan loss provisions (29 banks in total).

Open New Window

VI. Discussion and Conclusions

 

The banking literature so far lacks good attempts to explain observed bank capital holding behavior with optimizing models of capital choice. In this article, we have expanded existing trade‐off models of bank capital choice by introducing a recapitalization delay. This has intuitive appeal, since in the presence of a delay, banks face liquidation risk despite their option to recapitalize and are forced to initiate recapitalization at positive buffer capital levels. We have also illustrated the option value accruing from the recapitalization option and the nature of the optimal capital raising policies and dividend policies. Finally, we have calibrated our model to bank‐level accounting return data and tested its ability to replicate actual bank capital ratios.

In an “unconditional” sample, the model’s predictive power is low, mainly because the correlation between bank‐level volatility estimates and actual capital is low. Once we restrict examination to banks with above‐average levels of loan loss provisions over the sample, however, the model explains some 40% of the variation in capital ratios across banks and 70% of the average level of the capital buffer. The variation in model capital ratios is mainly driven by the variation in the volatility estimates. Our restricting to the more ex post risky banks is grounded by the belief that these banks’ ex post measure of volatility is close to the ex ante uncertainty faced by the banks. That there is a peso problem in our bank return sample is supported by the fact that, in the unconditional sample, the correlation between average bank return and volatility of bank return is significantly negative.

Our calibration exercise has been somewhat arbitrary. We specified a model, estimated some of its parameters from bank‐level accounting returns, imposed common values for other parameters (mainly guided by high level economic intuition), and evaluated the correlation between model output and actual capital buffers. It is the task of future research to test alternative model specifications. A possible source of model misspecification is the nature of the minimum capital requirement. In addition to the Basel Committee’s minimum solvency standard, the FDICIA imposes minimum leverage requirements. These should not be binding over risk‐based capital requirements with banks that have relatively risky portfolios, but this need not be the case with banks that have a significant portion of their portfolios invested, for example, in bank assets. There are also soft capital ratio targets set by the regulators, such as the FDICIA “well capitalized” category, which requires a 10% total capital ratio. This is soft regulation, since violating the 10% limit does not imply prompt corrective action or significant restrictions on the bank’s activities. The FDICIA first stipulates corrective regulatory intervention once the 8% limit is breached.18 In light of the FDICIA stipulations, our assumption concerning the penalty from regulatory capital violation is extreme. The FDICIA contains five categories of capitalization, a lower category always implying more restrictions on bank behavior. A model in which the penalty gradually increases as the capital ratio deteriorates would therefore be more realistic.

In addition to regulators, market discipline could dictate an implicit minimum capital requirement. This could be pressure coming from rating agencies, competitors and peers (swap market participants), or customers. Our model abstracts from market interactions. Whether these omissions matter reduces to the issue of which constituency provides the binding constraint on bank capitalization. Banking theory traditionally holds that customers, to the extent that they enjoy the security of deposit insurance, are not motivated in monitoring banks. Rating agencies’ view is that of debt holders. Jokivuolle and Peura (2004) suggest that the regulatory capital buffer (that is what we measure) dominates the economic capital constraint that measures bank riskiness from the debtors’ perspective. However, there is some evidence that competitor reactions, in particular the access to swap markets, could constitute a binding constraint on bank capital (Jackson, Perraudin, and Saporta 2002).

Our model assumes normally distributed bank returns, while bank portfolio returns, and credit returns in particular, are known to be negatively skewed and positively serially correlated. This is evident from bottom‐up portfolio models such as CreditMetrics (JP Morgan 1997), where a positive correlation between counterparty credit standings invariably generates a skewed bank portfolio return distribution. The Compustat bank return data over 1983–2002 indicates that there is negative skewness, as well as positive serial correlation, in the sample banks’ accounting returns. It is not clear how our volatility estimate captures these effects. Incorporating nonnormality or serial correlation into the model is likely to destroy the analytic tractability of the current model.

Accounting conventions could bias the volatility estimates as well. Banks in most jurisdictions have some options for income smoothing in the form of discretionary loan loss provisions (see, e.g., Pain [2003] on banks’ provisioning behavior). Discretionary provisioning allows banks to distribute credit losses, which are typically realized only during a fraction of quarters over each “credit cycle,” more evenly over time.19 This will cause banks’ accounting returns to be less volatile than their actual portfolio (cash) returns. Of course, bank capital requirements apply to an accounting measure of capital; hence, the dynamics of accounting measures could be all that matter.

Appendix
Proofs

 

Proof of proposition 3. We construct a function that solves (13ii) with equality for solves (13iii) with equality for and solves (13iv) with equality for . The function should be continuously differentiable at the impulse control barrier u1 and twice continuously differentiable at the singular control barrier u2.

We first construct the solutions to (13ii)(13iv) when these hold as equalities. Denoting our candidate solution by f1, (13iii) becomes The general solution to this is the exponential function The general solution to (13iv), denoted f2, is Twice continuous differentiability at u2 therefore requires that and that Imposing these on (A2) yields Substituting and into (A1) gives Subject to this boundary condition, (A3) becomes Now assume that (13ii) holds with equality when applied to some concave function f that satisfies The supremum in (11) is then achieved by . Also using (A5), (11) simplifies to Let which measures the net benefits from new issues of equity. Equation (A7) further simplifies to where and is the density of the absorbed process that starts at The first equality in (A8) is due to the spatial homogeneity of arithmetic Brownian motion; the second equality follows since the values of X outside the event defined in the indicator function do not affect the expectation; and the third equality is due to the fact that, for the absorbed process , the event in the indicator function is exactly the event that the process has not been absorbed by time Δ. Using the Reflection Principle (see, e.g., Borodin and Salminen 1997), the density can be written as where denotes the density of a normal distribution with mean μ and variance σ2, that is, Substituting this into (A8) and integrating, we get where Φ(y) and φ(y) are the cumulative standard normal distribution and its density. Direct substitution shows that

Value matching and smooth pasting conditions at u1 can now be formulated as This nonlinear system of equations is quite complicated algebraically, and closed‐form solutions for u1 and u2 do not exist. A sufficient condition for the existence of the solution is provided by lemma 4. The condition is expressed in terms of a positive barrier u0, defined by which can be solved for Lemma 5 then shows that function (19) constructed in terms of the solution to (A10iA10ii) is concave and satisfies (13i)–(13v), and, hence, by a sufficiency theorem, coincides with the value function (8iv). This completes the proof of proposition 3. QED

Lemma 1. This will be needed in the proofs of lemmas 2 and 3. where

Proof. The first equality follows from the linearity of the expectation, the second from the law of total probability, and the third from the Strong Markov property of arithmetic Brownian motion; the fourth just rearranges, and the fifth follows from integration by parts. QED

Lemma 2. If then and for all

Proof. This lemma is important because it is easy to show that capital issuance cannot be optimal unless Let us rewrite the function M starting from (A7) as follows: where, again, and the third equality uses lemma 1. We know that p satisfies the Kolmogorov backward equation and that its partial derivatives satisfy for all We differentiate the final expression in (A12) once and twice with respect to x and obtain where the inequalities hold for all nonnegative β because of the signs of the partials of QED

Lemma 3. For all

Proof. Beginning with (A7), we get where is as defined in lemma 1. The third equality utilizes lemma 1, the first inequality is due to setting K to 0, the second inequality is because (this is due to [A5] and [13iv]), and the third inequality is because QED

Lemma 4. If then there exists a solution to (A10iA10ii) satisfying such that for all

Proof. Suppose that the condition holds.

(i) In (A11), we have defined u0 such that Therefore, The condition implies that there is a positive x within (0, u0) such that However, by lemma 3, This implies that must cross from above within the interval

(ii) From (A4), we get that so that for By lemma 3, we also have for From (A4) and (A9), we get (given positive Δ) A necessary condition for the general solution is that Then, by lemma 2, is increasing and concave in x, while given by (A4) is also increasing in x. Combined with the previous inequality, these imply that one can always find a positive u2 in the interval such that for

(iii) Following from i and ii, by the continuity of and with respect to x and u2, there will exist a u2 in the interval such that for some while for all But at this choice of continuous differentiability of and with respect to x implies that This is because two continuously differentiable functions that coincide in the interior of their domain but do not cross must possess equal derivatives at the point where the functions coincide. QED

Lemma 5. Assume that a solution to (A10iA10ii) as described in lemma 4 exists and that V is defined by (19). Then V is a concave solution to (13i13v) and satisfies Ito’s formula.

Proof. V is concave: By construction for Differentiating f1 given by (15) three times shows that on Therefore, f1 has an increasing second derivative on which, combined with the fact that implies that f1, and therefore V, is concave on We also know from lemma 2 that is globally concave with respect to x, so that V is concave on Equality of first derivatives of M and f1 at u1 then implies that V is globally concave.

V satisfies Ito’s formula because each of the component solutions is twice continuously differentiable, and V satisfies the smooth pasting conditions at the barriers u1 and u2.

V solves (13i)–(13v):

(13i): because

(13ii): for by construction. By lemma 4, for and, by lemma 3, for

(13iii): For we have and we get from Itô’s formula where τ is a stopping time defined by for some ε > 0, such that and Taking expectations and noting that the last term is a martingale because M(x;u2) is concave, we obtain where is the value of waiting until τ prior to ordering a new capital issue. Because immediate ordering of capital is the optimal action at we have Combining the last two equations and the fact that for we get A limit operation then gives us for For we have by construction. For we have so that for all

(13iv): for by construction. That for follows from the concavity of V (proved above).

(13v): Follows directly from construction. QED

Proof of proposition 4. The first‐order condition is as in the general case, but with (13i) and (13ii) replaced by (21). Inequalities (13iii) and (13iv) are solved as in the general case, and smooth pasting at u2 is enforced, as previously. This yields formulas (A4) and (A6). The barrier u2 is solved by substituting (A4) into the boundary condition (21). Assuming that the maximum in (21) is achieved by the first term, u2 is determined from the equation defined in (A13) has the following properties: (i) (ii) (iii) as ; (iv) and (v) The convergence in iii is exponential, and g is concave by iv and v. It follows that, when K is positive, we have and because the right‐hand side of (A13) is linear in u2, (A13) is solved by a unique

For the maximum in (21) to be achieved by the first term, the solution to (A13) must satisfy that is, must be positive at the point of intersection with the function . Setting , we get that where u0 is given by (17). Then a necessary and sufficient condition for (A14) to hold is that , or equivalently, that which is the condition in the proposition. In this case, . QED.

References

 
  • Alvarez, Luis H. R., and Jussi Keppo. 2002. The impact on delivery lags on irreversible investment under uncertainty. European Journal of Operational Research 136:173–80.
  • Bajaj, Mukesh, Sumon Mazumdar, and Atulya Sarin. 2002. Cost of issuing preferred stock: An empirical analysis. Journal of Financial Research 30:577–92.
  • Bar‐Ilan, Avner, and William C. Strange. 1996. Investment Lags. American Economic Review 86:610–22.
  • Bhat, Vasanthakumar N. 1996. Banks and income smoothing: An empirical analysis. Applied Financial Economics 6:505–10.
  • Bhattacharya, Sudibto, Manfred Plank, Gunter Strobl, and Josef Zechner. 2002. Bank capital regulation with random audits. Journal of Economic Dynamics and Control 26:1301–21.
  • Booth, Laurence, Varouj Aivazian, Asli Demirguc‐Kunt, and Vojislav Maksimovic. 2001. Capital structures in developing countries. Journal of Finance 56:87–130.
  • Borodin, Andrei, and Paavo Salminen. 1996. Handbook of Brownian motion: Facts and formulae. Basel: BirkHäuser.
  • Crouhy, Michel, and Dan Galai. 1991. A contingent claim analysis of a regulated depository institution. Journal of Banking and Finance 15:73–90.
  • Dumas, Bernard. 1991. Super contact and related optimality conditions. Journal of Economic Dynamics and Control 15:675–85.
  • Estrella, Arturo. 2004. The cyclical behavior of optimal bank capital. Journal of Banking and Finance 28:1469–98.
  • Fleming, Wendell H., and H. Mete Soner. 1993. Controlled Markov processes and viscosity solutions. New York: Springer Verlag.
  • Fries, Steven, Pierre Mella‐Barral, and William Perraudin. 1997. Optimal bank reorganization and the fair pricing of deposit guarantees. Journal of Banking and Finance 21:441–68.
  • Froot, Kenneth A., David S. Scharfstein, and Jeremy C. Stein. 1993. Risk management: Coordinating corporate investment and financing policies. Journal of Finance 48:1629–58.
  • Froot, Kenneth A., and Jeremy C. Stein. 1998. Risk management, capital budgeting, and capital structure policy for financial institutions: An integrated approach. Journal of Financial Economics 47:55–82.
  • Furfine, Craig. 2001. Bank portfolio allocation: The impact of capital requirements, regulatory monitoring, and economic conditions. Journal of Financial Services Research 20:33–56.
  • Green, Richard C. 1984. Investment incentives, debt and warrants. Journal of Financial Economics 13:115–36.
  • Harris, Milton, and Artur Raviv. 1990. Capital structure and the information role of debt. Journal of Finance 45:321–49.
  • Hojgaard, Bjarne, and Michael Taksar. 1999. Controlling risk exposure and dividends payout schemes: Insurance company example. Mathematical Finance 9:153–82.
  • Jackson, Patricia, William Perraudin, and Victoria Saporta. 2002. Regulatory and economic solvency standards for internationally active banks. Journal of Banking and Finance 26:953–976.
  • Jokivuolle, Esa, and Samu Peura. 2004. Stress tests of banks’ regulatory capital adequacy: Application to tier‐1 capital. In The Basel handbook: A guide for financial practitioners, ed. Michael Ong. London: RISK.
  • JP Morgan. 1997. CreditMetrics: Technical Document. New York: CreditMetrics.
  • Kahane, Yehuda. 1977. Capital adequacy and the regulation of financial intermediaries. Journal of Banking and Finance 1:207–18.
  • Kim, Chang‐Soo, David C. Mauer, and Ann E. Sherman. 1998. The determinants of corporate liquidity: Theory and evidence. Journal of Financial and Quantitative Analysis 33:335–59.
  • Koehn, Michael, and Anthony M. Santomero. 1980. Regulation of bank capital and portfolio risk. Journal of Finance 35:1235–44.
  • Lee, In‐Moo, Scott Lochhead, Jay Ritter, and Quanshui Zhao. 1996. The costs of raising capital. Journal of Financial Research 19:59–74.
  • Leland, Hayne E. 1998. Agency costs, risk management, and capital structure. Journal of Finance 53:1213–43.
  • Lobo, Gerald J., and Dong‐Hoon Yang. 2001. Bank managers’ heterogeneous decisions on discretionary loan loss provisions. Review of Quantitative Finance and Accounting 16:223–50.
  • Mello, Antonio S., and John E. Parsons. 2000. Hedging and liquidity. Review of Financial Studies 13:127–53.
  • Merton, Robert C. 1977. An analytic derivation of the cost of deposit insurance and loan guarantees: An application of modern option pricing theory. Journal of Banking and Finance 1:3–11.
  • Milne, Alistair. 2002. Bank capital regulation as an incentive mechanism: Implications for portfolio choice. Journal of Banking and Finance 26:1–23.
  • Milne, Alistair, and Donald Robertson. 1996. Firm behavior under the threat of liquidation. Journal of Economic Dynamics and Control 20:1427–49.
  • Milne, Alistair, and Elizabeth Whalley. 2001. Bank capital regulation and incentives for risk‐taking. Unpublished manuscript, Finance Department, Cass Business School, London.
  • Myers, Stewart C., and Nicholas S. Majluf. 1984. Corporate financing and investment decisions when firms have information that investors do not have. Journal of Financial Economics 13:187–221.
  • Opler, Tim, Lee Pinkowitz, Rene Stulz, and Rohan Williamson. 1999. The determinants and implications of corporate cash holdings. Journal of Financial Economics 52:3–46.
  • Pain, Darren. 2003. The provisioning experience of the major UK banks: a small panel investigation. Working paper no. 177, Bank of England, London.
  • Peek, Joe, and Eric S. Rosengren. 1997. How well capitalized are well‐capitalized banks? New England Economic Review (September/October): 41–50.
  • Peura, Samu. 2002. Dividends, costly external capital, and firm value: The case of constant scale. Discussion Paper 536, Department of Economics, University of Helsinki.
  • Rochet, Jean‐Charles. 1992. Capital requirements and the behavior of commercial banks. European Economic Review 36:1137–70.
  • Subramanian, Ajay, and Robert A. Jarrow. 2001. The liquidity discount. Mathematical Finance 11:447–74.
  • Titman, Sheridan, and Robert Wessels. 1988. The determinants of capital structure choice. Journal of Finance 43:1–19.
  • * This article was formerly titled “Optimal capital management with fixed costs and implementation delays.” The comments of an anonymous referee are greatly appreciated. We would also like to thank Vesa Kanniainen, Luis H. R. Alvarez, Sophie Shive, Ingyu Chiou, Stephen Schaefer, and conference participants at the 2003 Midwest Finance Association annual meeting, the 2003 European Financial Management Association annual meeting, the 2003 INFORMS annual meeting, and the 2005 American Finance Association annual meeting, and the seminar participants at New York University, the University of Michigan, and the University of Helsinki for helpful comments. All errors are ours. Contact the corresponding author, Jussi Keppo, at .

  • 1. By now there is a large literature on the interactions between financing, investment, and risk management in the presence of capital market frictions (e.g., Froot, Scharfstein, and Stein 1993; Froot and Stein 1998; Leland 1998; Mello and Parsons 2000).

  • 2. Our use of the term bank capital refers to book equity. This is the relevant measure of capital in an analysis of bank capital adequacy since minimum capital requirements under the Basel Accord apply to book equity.

  • 3. This literature includes Kahane (1977), Merton (1977), Koehn and Santomero (1980), Green (1984), Crouhy and Galai (1991), Rochet (1992), Fries, Mella‐Barral, and Perraudin (1997), and Bhattacharya et al. (2002).

  • 4. This point has been emphasized by Milne (2002).

  • 5. The effects of delays on irreversible investment decisions have been studied in Bar‐Ilan and Strange (1996) and Alvarez and Keppo (2002), and also by Subramanian and Jarrow (2001) in connection with optimal liquidation.

  • 6. Risk‐weighted assets, under the current Basel Accord from 1988, are calculated as a weighted sum of a bank’s nominal exposures, where the weights depend on product type and counterparty sector. For large banks, risk‐weighted assets are typically between 65% and 70% of total assets.

  • 7. The Wiener process is defined on an underlying probability space (Ω, F, P), where we take F to be the standard filtration generated by the Wiener process.

  • 8. This assumption can be justified in several ways. We could assume that bank capital is explicitly invested in a risk‐free asset. Alternatively, we could postulate that any capital the bank has replaces an equivalent amount of borrowing/deposit funding and that the effective cost of borrowing/deposits to the bank equals the risk‐free rate. The latter assumption, in turn, could be justified by the presence of deposit insurance.

  • 9. In practice, a violation of the minimum capital requirement will not result in immediate liquidation but will generate additional costs and constraints to the bank, due to increased regulatory surveillance. (Peek and Rosengren [1997] list the provisions for Prompt Corrective Action specified in the FDICIA.) Also, the bank’s competitive position is likely to be affected. Therefore, the bank’s owners are likely to lose a significant amount of the bank’s economic rent.

  • 10. As pointed out by a referee, ρ should not be interpreted as an equity risk premium, since it is assumed to be constant and does not depend on bank leverage. This suggests that our modeling framework is risk neutral and that the parameter μ under a risk‐neutral measure need not coincide with its observed value. In particular, since uncertainty in our model is driven by a Brownian motion, a change of measure would influence the drift, but not the volatility, of the earnings process. We ignore this potential source of error in estimating μ. This should not be crucial since, as we will demonstrate in Sec. V, the drift term only has a secondary effect on model capital ratios.

  • 11. We have presented no proof of sufficiency of the first‐order conditions ([13i][13v]). Subject to general restrictions on the form of the solution, such proof of sufficiency can be formulated and can be obtained from the authors upon request.

  • 12. We have done the optimization with the Solver add‐in in Microsoft Excel.

  • 13. This is not generally true in models with asymmetric information, such as Myers and Majluf (1984).

  • 14. Our estimate for μ is the time series average; the estimate of σ is the time series standard deviation. Risk‐weighted assets, and therefore capital ratios, do not exist in the data prior to 1993. In calculating (28) prior to 1993, we estimate each bank’s risk‐weighted assets based on the average post‐1993 risk‐weighted assets‐to‐total assets ratio of the bank.

  • 15. Remember that μ denotes expected earnings, while V(u2) is bank value when capital ratio is at the optimal dividend barrier, both evaluated as a percentage of risk‐weighted assets. Hence, the ratio of μ to V(u2) should proxy for the average earnings‐to‐price ratio of a well‐capitalized bank.

  • 16. In the continuous‐time model, the dividend barrier is the highest capital ratio that would ever be observed, so that a time average of capital ratios within the model would invariably be below u2. Given annual dividend payments, we would expect banks to deviate upward from the u2 level between annual dividend payments, even considerably so in years with unexpectedly high profits. Yet banks that have experienced large unexpected losses should have average capital ratios well below u2.

  • 17. Milne and Whalley (2001) have observed the nonlinearity of the relationship between μ and u2.

  • 18. Peek and Rosengren (1997) contains a description of the FDICIA from 1991 containing the well‐capitalized category.

  • 19. Statistical tests that affirm banks’ income‐smoothing behavior have been provided, e.g., by Bhat (1996) and Lobo and Yang (2001).

© 2006 by The University of Chicago. All rights reserved.