Estimating Structural Bond Pricing Models*
A difficulty that arises when implementing structural bond pricing models is the estimation of the value and risk of the firm's assets, neither of which is directly observable. We perform a simulation experiment to evaluate a maximum likelihood method applicable to this problem. Contrasting the performance of the maximum likelihood estimators to that of estimators traditionally used in academia and industry, we find strong support for the maximum likelihood approach. In fact, the inefficiency of the traditional estimator may help explain the failure of past attempts to implement structural bond pricing models.
I. Introduction
Corporate bond markets have more than doubled in size over the last 10 years, to reach a size exceeding that of the Treasury markets. The growth of the corporate debt sector to a dominant source of finance for U.S. corporations underlines, by itself, the importance of accurate bond pricing models.1 In addition, the market for credit derivatives is growing rapidly and accurate risk management and valuation tools will become necessary. Moreover, banks and regulators have recently taken a marked interest in credit risk modeling for risk management purposes. An important issue in this context is whether banks should be permitted to use in‐house credit risk management models to determine capital requirements. A number of different approaches have been suggested, among them KMV Corporation's PortfolioManager, which is based on a structural bond pricing model following Merton (1974).
The objective of this paper is to perform a simulation study to evaluate two distinct approaches to estimating structural bond pricing models. The performance of the currently most popular method is contrasted to a maximum likelihood approach developed by Duan (1994), which to date has been largely ignored in the credit risk literature.
The traditional approach to implementing structural models has been to solve a system of equations that match the observed stock price and estimated stock volatility with model outputs (see Ronn and Verma 1986). However, as pointed out by Duan, in theory, one of the equations is redundant and no unique solution exists, unless, as in practice, the model is misspecified. Nevertheless, the approach is simple to implement and may have merit from a practical perspective, if it provides sufficiently precise estimates. It has been applied in academic studies, adapted for commercial purposes by the KMV Corporation, and is often the only estimation approach considered in major finance textbooks (such as Hull 2002). However, we demonstrate that the maximum likelihood approach, which circumvents the theoretical problem, exhibits markedly superior performance.2
Structural bond pricing models value debt as a contingent claim on the firm's assets. This approach was pioneered by Black and Scholes (1973) and Merton (1974) and has since drawn considerable attention from practitioners and academics alike. An important feature of structural bond pricing models is that, since all securities of a firm are treated as derivatives on the firm's assets, it is possible to use price information for one class of securities (typically equity) to infer the value of another (typically debt).
Perhaps as a result of the failure of initial attempts to implement structural bond pricing models (see Jones, Mason, and Rosenfeld 1984 and Ogden 1987), little progress was made in the empirical validation of the contingent claims approach. During the 1990s, a number of stylized facts were incorporated into models, among them violations of the absolute priority rule in bankruptcy, taxes, costly financial distress, debt renegotiation, and stochastic interest rates.3 The more‐recent models are often better able than their predecessors to generate prices in line with market quotes with reasonable inputs. However, this alone does not guarantee that they will actually do well on market data, given that the problem remains of estimating the unobserved asset value and its volatility.
Our evaluation is based on three theoretical bond pricing frameworks: the classic Black and Scholes (1973)/Merton (1974) model, the Briys and de Varenne (1997) model, and the Leland and Toft (1996) model. The Black and Scholes/Merton model is the first, simplest, and best known of the structural models. It has also been implemented recently in the academic literature.4 The Briys and de Varenne model is similar to the Longstaff and Schwartz (1995) and Nielsen, Saá‐Requejo, and Santa‐Clara (1993) models, in that it extends the Black and Scholes/Merton model to allow for stochastic interest rates and the possibility of default prior to debt maturity. Leland and Toft retain a constant term structure but, by incorporating taxes and distress costs and endogenizing the default decision, are able to study the link between optimal capital structure and the cost of debt financing. All these models share a common (and for our purposes necessary) feature in that they provide a value for the firm's equity.5
We first examine to which degree estimators for asset risk, firm value, and bond prices are unbiased and efficient. Second, we investigate whether the asymptotic distributions of estimators carry over to small samples. The maximum likelihood approach is then contrasted to the traditional method of estimating structural bond pricing models.
To evaluate the performance of the two methods, we perform a series of Monte Carlo experiments. We simulate sample paths for the asset value of firms that differ along the dimensions of operating risk and financial leverage. The corresponding stock price paths are then used to estimate, using both methods, the prices and credit spreads of different corporate bonds. A similar set of experiments was carried out by Lo (1986) to study the performance of maximum likelihood estimators of option prices. In that study, the state variable (the underlying stock price) is directly observable, whereas in what follows, we use stock prices to estimate the level and parameters of our state variable, the firm's asset value. Many of our results are directly related to this added complexity.
We demonstrate that the maximum likelihood approach clearly outperforms the traditional method in terms of both lack of bias and efficiency. The errors of the latter approach are of a magnitude that can help explain the consistent failure of attempts to implement structural bond pricing models. In contrast, maximum likelihood bond price estimators are unbiased and efficient, even for very risky bonds. The performance of the traditionally used approach, on the other hand, deteriorates as the spread increases. Moreover, the asymptotic distributions of the maximum likelihood estimators turn out to provide useful approximations in small samples.
The structure of the paper is as follows. The next section provides a brief overview of the theoretical models. Section III reviews the maximum likelihood as well as the traditional estimation approach, and section IV describes the simulation experiment. Section V reports and discusses the results, and section VI concludes.
II. Structural Bond Pricing Models
In this section, we review the three bond pricing models: the Black and Scholes/Merton (BSM) model, the Briys and de Varenne (BV) model, and the Leland and Toft (LT) model. Since the focus of this paper is on the performance of an estimation approach rather than on the theoretical properties of any given model, we provide only brief descriptions of the models.
In all three models, the same fundamental assumptions are made regarding financial markets. Arbitrage opportunities are ruled out and investors are price takers. Furthermore, for at least some large investors, there are no restrictions on short‐selling stocks or risk‐free bonds, and these can be traded costlessly and continuously in time. In addition, when we analyze the BV model, we consider the special case where the term structure is driven by a Vasicek model for the short rate rt under the risk‐adjusted pricing measure
where a denotes the mean reversion speed,
is the mean reversion level of the short rate, and γ is its standard deviation.6 The variable
is a Wiener process. The other two models are based on a constant interest rate r.
Furthermore, we assume that at least one class of the firm's securities, such as common stock, is traded and consequently completes the market; we do not need to assume that the assets of the firm are traded.7
The value of assets is denoted ωt, and its changes are taken to obey a geometric Brownian motion. In the LT model, assets generate revenue at a rate β, which is not reinvested. This cash flow is used to service debt before being paid out as dividends to shareholders. Therefore, the evolution of the asset value can be described by
The term
is the expected return from holding the firm's assets, including accumulating the cash flow βω. The growth rate of the assets is
. The parameter σ is the volatility of the asset value, and λ can be interpreted as the market price of risk associated with the operations of the firm. Finally,
is the Wiener process that generates asset value uncertainty. When interest rates are stochastic, we denote by ρ the instantaneous correlation between
and
.
Next, consider the firm's securities. In particular, we need a formula for the stock price to estimate the asset value and volatility, as well as a formula for the bond we ultimately want to price. We distinguish between two “layers” of debt: the firm's total debt (the sum of bank loans, bonds, accounts payable, salaries due, accrued taxes, etc.) and the specific bond we are interested in. We simply refer to the former as debt (Δ) and the latter as the bond (ℬ). For future reference, denote the value of the corresponding risk‐free debt with D. The value of the firm (Φ) is the sum of the value of debt and equity (ℰ).
The three models differ in how the capital structure is set up. In the BSM and BV models, the firm issues a single discount bond, which therefore also constitutes the firm's total debt. In the LT model, the firm continuously issues and redeems bonds of a given maturity. The bonds are serviced by a continuous coupon stream until the principal repayment. The firm's debt is made up of all previously issued but unredeemed bonds, and total debt service is consequently the sum of payments to all those bonds. The coupon and the principal of the bonds are designed to establish a constant aggregate debt service flow, which provides the basis for a closed‐form solution for equity. Thus, the model elegantly combines finite maturity debt with a tractable pricing function for the firm's stock.
Furthermore, there are differences across models in the way that financial distress is triggered. In the BSM model, default occurs at the maturity of the single bond issue if the value of the assets is insufficient to repay the principal amount. In such a case, the creditors take over the firm and recover the value of the assets.
In the other two models, default can occur at any time. In the LT model, financial distress is triggered when shareholders no longer find it profitable, given the revenue produced by the assets, to continue servicing debt. Finally, in the BV model, default occurs when the asset value crosses an exogenous lower threshold.
We now turn to a more detailed description of the valuation formulae implied by the models (a list of notation is provided in table 1).
A. The Black and Scholes/Merton Model
The value of equity in the Black and Scholes (1973)/Merton (1974) (BSM) model is computed using the standard call option formula, with the exercise price set equal to the nominal amount (N ) of the discount bond with maturity T:
where φ [·]represents the standard normal distribution function with d1 and d2 given in the appendix. All revenue generated by the assets is reinvested; therefore,
in (1). The value of the bond is, of course, equal to the value of the assets less the value of equity:
Since there are no taxes nor bankruptcy costs in this model, the value of the assets equals the value of the firm. Note that
represents the risk‐adjusted probability of default up to date T.
B. The Briys and de Varenne Model
The Briys and de Varenne (1997) (BV) model differs from the BSM model in two respects: Default can occur prior to the maturity of the single bond issue and interest rates are stochastic. More precisely, default occurs if the value of the firm's assets at any time falls below
where
and P(t, T) is the value at t of a unit of risk‐free bond maturing at T. The value of the unit bond was derived by Vasicek (1977) and is reported in the appendix. In case of default prior to maturity, bondholders receive
and equity holders
; in case of default at maturity, they receive
and
, respectively. The value of the bond is
where PE, d1 − d6, l0, and q0 are given in the appendix. The bond expression consists of five terms. The first captures the value of an otherwise identical credit‐risk‐free bond. The second reflects the loss in value at the maturity of the bond, corresponding to the short put present in the BSM model. The third captures the recovery in case of a default prior to maturity, and the last two capture the fact that sharing of any surplus in financial distress is assumed to deviate from the absolute priority rule (
would correspond to the case of no such deviations).
Given the assumed absence of bankruptcy costs or taxes, equity is, as in the BSM model, simply the residual claim to the firm's assets:
The model is a direct extension of Nielsen et al. (1993), which differs from the model of Longstaff and Schwartz (1995) only by assuming that the default threshold, rather than being a constant, is linked to the value of a risk‐free bond. In contrast to these two models, the BV model readily allows the derivation of an equity formula for a firm with discount debt.
C. The Leland and Toft Model
Leland and Toft (1996) allow for taxes and bankruptcy costs. The continuously paid coupons C are tax deductible at a rate τ and the realized costs of financial distress amount to a fraction k of the value of the assets in default. In this setting, the value of the firm is equal to the value of assets plus the tax shield less the costs of financial distress:
We let L denote the default barrier, that is, the critical asset value at which the equity holders voluntarily declare bankruptcy. The formulae for L and x are given in the appendix.
Shareholders are residual claimants to the value of the firm and so
where the value of debt is given by
The formulae for the functions I(ωt) and J(ωt), as well as for i(ωt, s) and j(ωt, s) used later, are given in the appendix. Note that the value of equity, the firm, and debt are independent of time.
Bonds are issued with maturity T, principal
and coupon
. This particular choice of bond principal and coupon is required to derive the preceding tractable debt formula. The value of a bond with remaining maturity
is
III. Estimation
In this section, we turn to a description of the two empirical approaches we wish to evaluate: first, the maximum likelihood approach developed by Duan (1994, 2000), then the traditional method used by Ronn and Verma (1986) and others.
We do not consider estimation of the short‐rate process in the Briys and de Varenne model. When implementing a structural bond pricing model with stochastic interest rates, the risk‐free‐term structure model is generally calibrated using Treasuries, before turning to the credit‐risk model itself. Indeed, in most applications to date, the traditional method has been applied only to the estimation of the asset value and volatility.8 A notable exception is Duan, Moreau, and Sealey (1995), who extend the traditional method to a three‐equation system in order to estimate the interest rate elasticity of (bank) equity in a model of deposit insurance with stochastic interest rates.
A. Maximum‐Likelihood Estimation
The problem at hand is the maximum likelihood estimation of the unobserved asset value process (1). This problem was first studied by Duan (1994) in the context of deposit insurance. The estimation is carried out using a time series of stock prices,
. We use subscript i to index daily observations, in contrast to subscript t, which refers to a point in time in years.
We need the likelihood function of the observed price variable. Defining f(·) as the conditional density for
gives us the following log‐likelihood function for the observed equity vector Eobs:
whereθ is a vector of parameters to be estimated. The choice of individual parameters to include in θ depends on the model and the data set.
To derive the density function for equity, a change of variables is made. This allows us to work with the well‐known density function g for a normally distributed variable, the log of the asset value:
The (1‐period) conditional moments of the asset value distribution, mi and si, for each model, are given in the appendix.
The change of variables results in
where
is the subset of the parameter vector necessary to price equity. Note that EiEi(ωi, ti; ·) refers to the equity formula, whereas
denotes an observed market value. The function transforming equity to asset value is defined as
, the inverse of the equity value function. Hence, there is a one‐to‐one correspondence between the stock price
and the asset value ωi (given
). By inserting (10) into (8), we obtain the log‐likelihood of the vector Eobs for a given choice of θ as
Differentiating the equity formulae (2), (5), and (6) with respect to the (log‐) asset value yields the desired results. The parameter vector,
, is estimated by maximizing eq. (11) with respect to θ. The market price of risk is estimated as a consequence of the chosen estimation method, even though it is not relevant for pricing. Finally, an estimate of the value of assets is simply obtained using the inverse equity function:
.
Once we have obtained the pair
, we can compute the corresponding bond prices and credit spreads, (B, Σ). Following Lo (1986), we can calculate the asymptotic distributions of these estimators. For any differentiable function of a variable, it holds that the maximum‐likelihood (ML) estimator of that function is the function evaluated at the ML estimator of the variable. In this case, it also holds that the estimators are asymptotically normally distributed:
where
is the asymptotic standard deviation of
.
B. The Volatility Restriction Method
We now review the traditional method of estimating the model from stock prices (see, for example, Jones et al. 1984; Ronn and Verma 1986; Ogden 1987; Delianedis and Geske 1999; and Hull 2002). For reasons that will become clear, we refer to this as the volatility restriction (VR) method. Only two unknowns are estimated: the current asset value, ωn, and its volatility, σ. The following steps are typically carried out:
| 1. | The stock price volatility σE is estimated using historical data. We denote this estimate by | ||||
| 2. | The asset value and volatility are estimated by solving the following system of equations: | ||||
This method has several theoretical problems, as pointed out by Duan (1994). The stock price volatility
typically is estimated assuming that it is constant,9 even though it is a known function of ω and t. Furthermore, the first equation is redundant, since it is used to derive the equity price formula in the second equation.10 Another disadvantage of this approach is that it does not allow the straightforward calculation of the distributions of the estimators for ω and σ.
IV. The Simulation Experiment
Implementing structural models involves estimating the unobservable asset value and volatility. Although the maximum‐likelihood approach readily allows for the estimation of other parameters, such as the earnings rate, β, we estimate only asset value and volatility for ease of comparison with the volatility restriction approach.
A. Experiment Design
To measure the performance of the two estimation approaches, we implement them on simulated data for various firm scenarios. Firms are defined along two dimensions, financial risk and business risk. Following Merton (1974), we measure financial risk with the quasi‐debt ratio,
, the ratio of risk‐free debt to the asset value at the beginning of the sample period. Business risk is measured by the instantaneous volatility of the asset value, σ.
Four base‐case scenarios are created by setting financial and business risk to be either “high” or “low.” A firm is considered to have high financial risk if its quasi‐debt ratio is 1 and low if it is 0.75. Scenarios are constructed by fixing ω1 and changing N, the nominal amount of debt. Business risk is deemed high if
and low if
. A complete list of the parameters used to construct the base scenarios in the three models can be found in table 1. The values are chosen so as to be representative of values used in previous studies and available empirical evidence.11
The four base‐case scenarios imply values for firms' leverage, equity volatility, and spreads. For example, a firm with high financial but low business risk in the Briys and de Varenne model displays a leverage of 82% and a current equity volatility of 59%, whereas a low financial but high business risk firm has a leverage of 62% and an equity volatility of 76%. The resulting spreads are 270 and 351 basis points, respectively. Overall, spreads range from 104 to over 600 basis points. This ensures that our study covers a wide array of bonds, ranging from investment grade to speculative grade. We chose relatively high levels of financial and business risk. We expect estimation to be more difficult in such conditions and therefore an evaluation of estimator performance to be more informative. Intuitively, the higher is the uncertainty, the more difficult it should be to indirectly “observe” asset value using equity values.
Fixing an initial asset value ω1 and interest rate ri, we simulate 2,000 paths for the asset value (and, if applicable, also the interest rate). In the case of a constant interest rate, we evolve the following dynamics
where
is a standard normal variable. When both asset value and interest rates are random, the following set of dynamics is used:12
where
is another standard normal variable. The variables
and
have correlation ρ.
For each asset value path, we first compute the corresponding stock price path. Second, we use this equity path to estimate the current asset value and the parameter vector θ. Finally, we use the estimates
to price the bond (using eqq. [3], [4], or [7]) and compute the standard error of the estimates (using eq. [12]). These steps are repeated for each sample path to assess the sampling distribution for a given model and scenario.
B. Output
In this section, we discuss the output produced and the tests performed. Since the ultimate use of an implemented model is to price bonds, the first question to address is whether the price and spread estimators are unbiased in small samples. The metric reported in the tables is the mean error of an estimator. Since the true value of the asset value, the spread and the price, is different for each path, we report relative errors for these variables. Second, to measure efficiency, we report the corresponding standard deviation and mean absolute error of the estimators.
A third issue is whether the asymptotic distributions are useful approximations in small samples. We measure the skewness (Sk.) and kurtosis (Ku.) of the sampled distributions and perform Jarque‐Bera (JB) tests for normality.13 Even if the normality of a particular estimator can be rejected, its estimated standard deviation might be useful for hypothesis testing and to compute confidence intervals in small samples. Therefore, we report the mean estimated standard deviation (mean estimated std.) and, as a measure of its efficiency, its standard deviation (std. of estimated std.).14
To further pursue this issue, we carry out size tests ; that is, we count how often the true value of an estimated value falls outside the confidence interval, calculated using the estimated standard deviation (i.e. the population size). If its size is close to the chosen nominal size (we use 1%, 5%, and 10%), one may conclude that the asymptotic distribution is useful for calculating confidence intervals and conducting t‐tests. An asterisk indicates that we can not reject the null hypothesis of the population size equaling the nominal size.15
The price estimates ultimately depend on the estimates of the volatility and value of the assets. To help understand the results, we therefore report output for
as well.16
No asymptotic distribution has been suggested for the volatility restriction method. Therefore, only bias and efficiency are reported for this approach.
V. Results
With three models and four scenarios, we ran 12 basic Monte Carlo experiments, each with a distinct set of asset values and, if applicable, interest rate paths. To save space we do not report the results for all these experiments individually.17 Instead, table 2 summarizes the results: panels A‐D show results organized by scenario, averaged over models; and panel E reports average results across models and scenarios. Although the economic interpretation of these averages may be somewhat ambiguous, it nevertheless provides an accessible overview of the overall performance of the volatility restriction and maximum‐likelihood methods. We do not, however, test the size for the summary tables.
We also present results for selected individual scenarios: Table 3 uses a low risk Leland and Toft scenario to illustrate why the traditional technique fails, and table 4 uses a high risk Black and Scholes/Merton scenario to show the effect on estimators of varying sample size.
A. Result Overview
First, we note, in table 2, that the bias of the maximum‐likelihood approach is negligible for practical purposes. This result holds for all models and scenarios. In contrast, the volatility‐restriction approach exhibits an average spread error of 23% (panel E). However, the bias varies across models and scenarios, which will be discussed in detail later.
It is also evident from table 2 that the ML approach is much more efficient than the VR approach; the standard deviation and mean absolute error of the latter estimators are several times higher (e.g., the standard deviation of the ML spread error is 13%, whereas that of the VR estimator is 54%). Again, this is true for all models and scenarios. Figure 1 provides a visual summary of the relative efficiency of the two empirical approaches for the three different models. The ML approach clearly dominates, although not as strikingly for the BSM model.
Fig. 1.— Kernel density plots for credit spread errors. The solid line represents the density of the relative spread errors,
, using the ML approach and the dotted line using the VR approach. The three panels plot the density pairs for the BSM, BV, and LT models, respectively.
The explanation for the failure of the VR approach is intuitive. A highly volatile historical stock price series translates into a high estimated asset volatility and vice versa. This is a direct effect of solving the system of equations. However, high stock volatility is not necessarily the result of high asset volatility, it could be the result of a historically high leverage. Therefore, in a situation where asset value and hence stock prices have risen over the sample period, leverage and stock volatility have fallen. Historical stock volatility, computed as the average of realized volatilities, therefore is higher than the current level. This translates into an excessive asset volatility estimate and thus a low bond price estimate. In sum, the described volatility restriction effect implies that increasing stock prices result in underpriced bonds, whereas decreasing stock prices produce overpriced bonds.
The reasons the VR approach tends to underprice are two: first, because stock prices increase on average in our simulations, and second, because the positive impact of leverage on stock volatility is more pronounced at high leverages. This effect can also explain why estimation is less successful when the financial risk is high (table 2, panels A and B). The higher is the leverage, the more pronounced the effect on stock volatility. In a low‐leverage firm, the assumption of constant equity volatility is less severe.
The volatility restriction effect is analyzed in table 3 with the LT model in the low business‐ and financial‐risk scenario. The upper panel displays the result for all 2,000 asset value paths, and the two lower panels for the split sample: the middle panel contains the results for increasing and the lower panel for decreasing paths. The resulting average stock volatilities are 51%, 48%, and 68%, respectively. As can be seen, increasing paths lead to too‐high spread estimates and vice versa. Most paths are increasing and hence VR overpredicts spreads on average.
It is also evident from table 3 that the problem just described is not present when applying the maximum‐likelihood approach. As illustrated by figure 2, the ML estimators are able to disentangle the effect of leverage on stock volatility from the impact of business risk. The ML relative spread errors (crosses) are evenly scattered around zero, regardless of terminal asset value. In contrast, the VR errors (circles) are higher for firms that ended up more valuable at the end of the estimation period. These are the firms for which leverage has fallen and where the VR method tends to overestimate asset risk and thus also bond yield spreads.
Fig. 2.— Credit spread estimation and historical leverage. The y axis represents the percentage spread error,
, and the x axis, the actual asset value at the end of the estimation period in the low financial‐ and asset‐risks scenario for the LT model. The crosses and circles represent pairs of terminal asset values and relative spread errors for the ML and VR approaches, respectively.
We now turn to the small‐sample distributions of the maximum‐likelihood estimators. Comparing columns Standard Deviation and Mean Estimated Std. in any table, the standard deviation estimator is unbiased except for the spread error. The reason for this result is that the spread is sometimes close to zero and, therefore, very small errors produce huge relative errors; the standard deviation estimator of the basis point error is unbiased. The variability of the estimated standard deviation of asset risk (Std. of Estimated Std.) is about half the mean estimate (see table 2, panel E). Again, the absolute estimator is much more precise.
However, by the Jarque‐Bera test, we often reject the hypothesis of normally distributed estimators. Yet, the size tests are quite successful, in the sense that the population size is close to the nominal size. Therefore, it appears that one can still use the estimated standard deviations to construct reliable confidence intervals and t‐tests. In this sense, the estimators are “sufficiently normal.”
B. Sample Size
In this section, we examine how the distributions of the maximum‐likelihood estimators depend on the length of the equity price sample (n). We investigate four sample sizes: 90, 250, 500, and 750 days. Results are displayed in table 4 for the Black and Scholes/Merton model; results are similar for all models.
Using a 3‐month sample size, the ML estimators are quite inefficient. Increasing the sample size to 250 days leads to improved efficiency and normality. As noted previously, estimators are unbiased and efficiency is improved by roughly one third. Moreover, the Jarque‐Bera test statistic has decreased; we are less confident in rejecting normality for the Black and Scholes/Merton estimators. The size tests leave a mixed impression; most population sizes approach the nominal ones, but in some cases, the convergence does not appear to be monotonic (e.g., the 1% size seems to oscillate). We cannot detect any statistically significant deviation from the nominal sizes 5% and 10%.
Turning to longer samples still, the same pattern can be observed. Efficiency (and the efficiency of the efficiency estimate, the Std. of Estimated Std.) improves monotonically, whereas the effect of the measures of normality is less clear cut.18 However, overall, the small sample distributions of estimators become more normal as the sample size increases.
C. Further Results
As noted previously, the bias of the volatility‐restriction approach varies across models and scenarios. In fact, implementation of the Leland and Toft model accounts for most of the bias; the average bias of the asset volatility is −0.1% in the BSM model, 0.4% in the BV model, and 5.3% in the LT model. The reason is that the (instantaneous) stock volatility in this model becomes extreme as asset values approach the barrier. This accentuates the volatility restriction effect. In the low business‐risk and high financial‐risk scenario, for example, the distress barrier is 687, which, when assets are worth 1,000, leads to a stock volatility of 95%. When assets are down to 800, volatility is 246%; at 750, it is 439%; and at 700, stock volatility reaches a staggering 2,046%. Thus, asset value paths that, at some point during the sample period were near the barrier, lead to a severe overestimation of the current stock volatility and badly underpriced bonds. In a scenario with a very low barrier, very few or no paths would pass close to the barrier, few extreme stock volatilities would be observed, and the effect just described would be mitigated.
This problem for the VR approach with the LT model arises because equity value drops to zero on default, and this causes extreme volatilities. The BV model incorporates deviations from the absolute priority rule, which tends to reduce volatility at the brink of bankruptcy. Shareholders do not face an all‐or‐nothing situation, which, in turn, prevents stock volatility from attaining extreme values.19 As a result, the VR approach is more successful with the BV model.
Of the three theoretical models, the implementation of the Briys and de Varenne model is the most efficient for the maximum‐likelihood as well as the volatility‐restriction approach. The mean spread error standard deviations are 7.5% and 17.3% for the two approaches with this model, whereas they are 17.7% and 28.9% with the BSM model. Recall that the two models have the same capital structure, but in addition, the BV model allows for stochastic interest rates and the possibility of early default. The explanation for estimator performance in this model is again related to the default barrier, although not through its effect in stock volatility.20 In the BV model, the barrier L is an exogenous fraction (δ) of risk‐free debt. The recovery to bondholders in financial distress is therefore also exogenous:
. This implies that one crucial component of the bond price, the payoff in financial distress, is fixed and independent of the estimated value and risk of assets. Naturally, this benefits both estimation approaches, as evidenced by the low standard deviations. A closely related feature of the BV model is that the performance of some estimators actually improves as the risk and hence the spread increases; the riskier is the scenario, the more important the (exogenously specified) distress component of the bond price.21
The errors of the maximum‐likelihood approach are, as previously noted, independent of historical leverage. Errors do, however, depend on the realized volatility of the asset value path: An unusually high asset volatility results in an unusually high stock volatility which, in turn, translates into an excessive asset volatility estimate. After controlling for realized asset volatility, no further variables describing the realized stock value path have any explanatory power on the errors.
The maximum‐likelihood approach works well also for very risky firms, with spreads exceeding 1,000 basis points. Results, not reported here, show that the approach stays unbiased for all models, although efficiency may decrease somewhat. The performance of the VR approach, on the other hand, tends to deteriorate drastically as the spread increases.
VI. Concluding Remarks
We evaluated a maximum‐likelihood approach, originally developed by Duan (1994) for implementing structural bond pricing models. We ran Monte Carlo simulations for three different models (the classical Black and Scholes/Merton model, the Briys and de Varenne model, and the Leland and Toft model) to gauge the small‐sample properties of the estimators and contrasted the method to the traditional “volatility‐restriction” approach.
The studied maximum‐likelihood approach has several advantages over the volatility‐restriction approach beyond avoiding theoretical inconsistencies. First, it allows the straightforward derivation of the distributions of estimators and, thus, bounds around estimates of bond prices, spreads, and potentially, any other metric that can be inferred from the model. Notably, it allows the computation of confidence intervals for default probabilities, which would clearly be useful in credit‐risk‐management applications. Second, it easily allows for the estimation of several model parameters.
We demonstrated that the maximum‐likelihood approach clearly dominates the traditionally used alternative. In fact, the latter performs so poorly that it may help explain the failure of attempts to implement structural bond pricing models in the past. No matter how satisfactory the theoretical features of a model, its empirical use may have been limited by the chosen implementation method. The maximum‐likelihood approach analyzed in this paper, on the other hand, appears well suited for model testing.
The maximum likelihood bond price and spread estimators are unbiased and relatively efficient. In many instances, we can reject the hypothesis that the asymptotic (normal) distributions of estimators are carried over to small (250‐day) samples. Nevertheless, we showed that standard deviations of estimators are often useful for calculating confidence bounds and conducting hypothesis tests. Thus, even if the estimators are generally nonnormal, they are “sufficiently normal” to be useful in applied work.
Appendix
For notational convenience, we assume all pricing takes place at
and drop related subscripts (e.g., we use N to denote N0).
The Black and Scholes Model
The integration limits are standard:
The Briys and de Varenne Model
The integration limits are
with
and
Finally, the volatility term in the Vasicek case is given by
The value of a risk‐free unit bond is
with
The Leland and Toft Model
The default barrier is
where
and n[·] is the standard normal density function.
The components of the debt and bond formulae are
and
Finally,
and
Estimating with Maximum Likelihood
The maximum‐likelihood function (11) depends on the model in two ways: first, through the derivative of the equity function, and second, through the conditional moments of the asset value distribution. The former are straightforward to calculate from the respective equity formulae (2), (5), and (6). The latter, for the three constant interest rate models, are as follows:
For the Briys and de Varenne model, the conditional moments are
where B(0, Δt) and Σ(Δt) were given in appendix B.
References
- Anderson, R., and S. Sundaresan. 1996. Design and valuation of debt contracts. Review of Financial Studies 9:37–68.
- Black, F., and M. S. Scholes. 1973. The pricing of options and corporate liabilities. Journal of Political Economy 7:637–54.
- Briys, E., and F. de Varenne. 1997. Valuing risky fixed rate debt: An extension. Journal of Financial and Quantitative Analysis 32:239–248.
- Collin‐Dufresne, P., R. Goldstein, and S. Martin. 2001. The determinants of credit spread changes. Journal of Finance 56:2177–2207.
- Delianedis, G., and R. Geske. 1999. Credit risk and risk neutral default probabilities: Information about rating migrations and defaults. Working paper, UCLA.
- Duan, J.‐C. 1994. Maximum likelihood estimation using price data of the derivative contract. Mathematical Finance 4:155–67.
- ———. 2000. Correction: Maximum likelihood estimation using price data of the derivative contract. Mathematical Finance 10, no. 4:461–62.
- Duan, J.‐C., A. F. Moreau, and C. Sealey. 1995. Deposit insurance and bank interest risk: Pricing and regulatory implications. Journal of Banking and Finance 19:1091–1108.
- Duan, J.‐C., and J.‐G. Simonato. 2002. Maximum likelihood estimation of deposit insurance value with interest rate risk. Journal of Empirical Finance 9:109–32.
- Duffie, D., and D. Lando. 2000. Term structures of credit spreads with incomplete accounting information. Econometrica 69:633–64.
- Eom, Y. H., J. Helwege, and J.‐Z. Huang. 2004. Structural models of corporate bond pricing: An empirical analysis. Review of Financial Studies, 17:499–544.
- Ericsson, J., and J. Reneby. 1998. A framework for valuing corporate securities. Applied Mathematical Finance 5:143–63.
- ———. 2004. A note on contingent claims pricing with non‐traded assets. vol 2, no. 3.
- Fan, H., and H. Sundaresan. 2000. Debt valuation, renegotiations and optimal dividend policy. Review of Financial Studies 13:1057–99.
- Hull, J. 2002. Options, futures and other derivative securities. Saddle River, NJ: Prentice Hall.
- Jones, E., S. Mason, and E. Rosenfeld. 1984. Contingent claims analysis of corporate capital structures: An empirical investigation. Journal of Finance 39:611–27.
- Kim, I., K. Ramaswamy, and S. Sundaresan. 1993. Does default risk in coupons affect the valuation of corporate bonds? A contingent claims model. Financial Management, Special Issue on Financial Distress.
- Leland, H. 1994. Risky debt, bond covenants and optimal capital structure. Journal of Finance 49:1213–52.
- Leland, H., and K. B. Toft. 1996. Optimal capital structure, endogenous bankruptcy and the term structure of credit spreads. Journal of Finance 51:987–1019.
- Lo, A. W. 1986. Statistical tests of contingent‐claims asset‐pricing models. Journal of Financial Economics 17:143–73.
- Longstaff, F. A., and E. S. Schwartz. 1995. A simple approach to valuing risky fixed and floating rate debt. Journal of Finance 50:789–819.
- Mella‐Barral, P. 1999. The dynamics of default and debt reorganization. Review of Financial Studies 12:535–78.
- Mella‐Barral, P., and W. Perraudin. 1997. Strategic debt service. Journal of Finance 52:531–56.
- Merton, R. C. 1974. On the pricing of corporate debt: The risk structure of interest rates. Journal of Finance 29:449–470.
- ———. 1980. On estimating the expected return on the market: An exploratory investigation. Journal of Financial Economics 8:323–62.
- Nielsen, L. T., J. Saa‐Requejo, and P. Santa‐Clara. 1993. Default risk and interest rate risk: The term structure of default spreads. Working paper, INSEAD.
- Ogden, J. P. 1987. Determinants of the ratings and yields on corporate bonds: Tests of contingent claims model. Journal of Financial Research 10.
- Ronn, E. I., and A. K. Verma. 1986. Pricing risk‐adjusted deposit insurance: An option‐based model. Journal of Finance 41:871–95.
- Vasicek, O. 1977. An equilibrium characterisation of the term structure. Journal of Financial Economics 5:177–88.
-
* We are grateful to Dietmar Leisen, Spencer Martin, Paul Sφderlind, and an anonymous referee for important comments. We are indebted to Sune Karlsson for advice on Gauss and other issues. Any remaining errors are, of course, our own. Contact the corresponding author, Jan Ericsson, at jan.ericsson@mcgill.ca.
-
1 During the period 1997–2000, about 70% of new capital raised by U.S. corporations was in the form of debt. As of the third quarter of 2003, approximately $4.3 trillion worth of corporate debt was outstanding (source: the Bond Market Association).
-
2. In a recent paper, Duan and Simonato (2002) apply the two empirical methods to a model of deposit insurance to market data and find that the maximum likelihood approach yields higher estimates of the insurance premium than the traditional approach. Using a Monte Carlo experiment, they demonstrate that the maximum likelihood approach is unbiased.
-
3. Models of corporate debt along these lines include Kim, Ramaswamy, and Sundaresan 1993; Nielsen, Saá‐Requejo, and Santa‐Clara 1993; Leland 1994; Longstaff and Schwartz 1995; Anderson and Sundaresan 1996; Leland and Toft 1996; Briys and de Varenne 1997; Mella‐Barral and Perraudin 1997; Ericsson and Reneby 1998; Mella‐Barral 1999; Fan and Sundaresan 2000; Duffie and Lando 2000; and Collin‐Dufresne, Goldstein, and Martin 2001.
-
4. For example, in Delianedis and Geske (1999) and Eom, Helwege, and Huang (2002).
-
5. In contrast, for some structural models such as Longstaff and Schwartz (1995) and Nielsen et al. (1993), it is not apparent how to value the firm's equity.
-
6. The model in Briys and de Varenne (1997) allows for time‐dependent parameters but retains the Gaussian framework.
-
7. See Ericsson and Reneby (1999) for a discussion of this issue.
-
8. By not considering the interplay between the estimation of interest rate and asset value parameters, the standard errors of the latter may be biased, as pointed out by Duan and Simonato (2002), who provide an interesting discussion of the issue. However, it is not clear that this bias affects the two empirical approaches differently and influences our results on their relative efficiency. For simplicity, we refrain from analyzing these effects.
-
9. The method used in the cited studies is identical to the estimation of the instantaneous (and constant) stock return volatility, assuming the stock obeys a geometric Brownian motion. We follow this approach when we implement this method in our Monte Carlo study.
-
10. It is interesting to note that this implies that, if the estimation of σε would produce the correct estimate,
, one of the equations would be redundant. Therefore, the first theoretical inconsistency (assuming constant stock price volatility) is necessary to find a unique solution to a system of equations that, otherwise, have an infinite number of solutions. -
11. Note that nominal debt is lower in the LT model than in the other two, as their debt pays coupons.
-
12. These dynamics are derived by writing down the continuous time equations for
and ln
in integral form before discretizing. -
13. The null hypothesis of a normally distributed estimator can be rejected with 5% significance if the JB statistic exceeds 6.
-
14. When using the term estimate, we refer to the estimate for a particular sample path. The expected value of an estimate is calculated as the mean of estimates across generated sample paths. The estimated standard deviation is a relative value, (estimated std. of spread)/(estimated spread), consistent with the true standard deviation.
-
15. The standard error of the population size p, given 2,000 replications, is
. This yields relatively wide confidence bounds. Suppose that the population sizes are 1%, 5%, and 10%, respectively, then standard errors are approximately 0.2%, 0.5%, and 0.7%. -
16. For brevity, we do not report results for the estimation of the market price of risk, λ, since the estimates are too weak to be useful; this is related to the result in that it is next to impossible to estimate the expected return of an asset whose dynamics can be described by an Itô process (see Merton 1980). Fortunately, we do not need the market price of risk to price bonds.
-
17. These results, and other unreported results discussed later, are available on request.
-
19. Undisplayed results show that, with a model similar to LT but allowing for a 5% payout to the shareholders in distress, the volatility restriction approach does not fail as dramatically.
-
20. The perfomance is not related to the presence of stochastic interest rates. Unreported results for the BV model with constant interest rates show that the performance of the empirical approaches is similar.
-
21. In practice, of course, the advantage may not be important, due to uncertainty about recovery rates.







