T W O T O P I C S IN F I N A N C E : 1. W E L F A R E A S P E C T S O F A N A S Y M M E T R I C I N F O R M A T I O N RATIONAL EXPECTATIONS M O D E L 2. B O N D O P T I O N PRICING, E M P I R I C A L E V I D E N C E By BRUCE JOHN DIETRICH-CAMPBELL B.Sc, The University of British Columbia, M B A , The University of British Columbia, 1977 1981 A THESIS S U B M I T T E D IN P A R T I A L F U L F I L L M E N T O F T H E REQUIREMENTS FOR T H E D E G R E E OF D O C T O R OF PHILOSOPHY in T H E F A C U L T Y O F G R A D U A T E STUDIES F A C U L T Y O F C O M M E R C E A N D BUSINESS A D M I N I S T R A T I O N We accept this thesis as conforming to the required standard T H E U N I V E R S I T Y O F BRITISH C O L U M B I A August 1985 © Bruce John Dietrich-Campbell, 1985 'K In presenting t h i s thesis i n p a r t i a l f u l f i l m e n t of the requirements for an advanced degree at the University of B r i t i s h Columbia, I agree that the Library s h a l l make i t f r e e l y available for reference and study. I further agree that permission for extensive copying of t h i s thesis for scholarly purposes may be granted by the head of my department or by h i s or her representatives. It i s understood that copying or publication of t h i s thesis for f i n a n c i a l gain s h a l l not be allowed without my written permission. Department The University of B r i t i s h Columbia 1956 Main Mall Vancouver, Canada V6T 1Y3 Date /RI ^ t°> <W«j S ^ S T ABSTRACT Welfare Aspects of an Asymmetric Information Rational Expectations Model In part 1 of this study I examine several models of competitive markets in which a group of uninformed traders uses the equilibrium price of a traded asset as an indirect source of information known to a group of informed traders. Four different models are compared in two homogeneous information cases plus one asymmetric information case, revealing a) an allocative efficiency benefit resulting from the opportunity to trade current consumption for future consumption, b) a 'dealer' benefit accruing to traders who are able to observe and act on demand fluctuations not apparent to other traders, c) a 'hedging' benefit accruing to all traders, and d) a loss of hedging benefits due to information dissemination before hedge trading can take place. The effect of an increase in precision of information given to informed traders is calculated for the above factors and for net welfare. Bond Option Pricing, Empirical Evidence In part 2, a two-factor model using the instantaneous rate of interest and the return on a consol bond to describe the term structure of interest rates - the Brennan-Schwartz model - is used to derive theoretical prices for American call and put options on U.S. government bonds and treasury bills. These model prices are then compared with market prices. The theoretical model used to value the debt options also provides hedge ratios which may be used to construct zeroinvestment portfolios which, in theory, are perfectly riskless. Several trading strategies based on these 'riskless' portfolios are examined. ii T A B L E OF C O N T E N T S Abstract List of Tables List of Figures ii vi viii Overview 1 Welfare Aspects of an A s y m m e t r i c Information R a t i o n a l Expectations M o d e l Part A . A.l Introduction A.2 Model Description 2 . . 3 5 A.2.1 Utility Functions A.2.2 Aggregation and Trading Volume A.2.3 Endowments A.2.3.a Grossman and Stiglitz Model A.2.3.b Efficient Market Model A.2.3.C Hedging Model A.2.4 The Riskless Technology A.2.5 Random Elements A.2.6 Timing of Events 5 6 7 8 10 13 15 16 17 A.3 Model Derivation A.3.1 Initial and Terminal Wealth A.3.2 'Post-info' Expected Utility A.3.2.a Probability Distribution of Future Consumption A.3.2.b Optimal Investment and Consumption A.3.2.C 'Post-info' Expected Utility Functions A.3.3 - Market Clearing ' .= A.3.3.a Generalized Model A.3.3.b Market Clearing Price A.3.4 'Pre-info* Expected Utility A.3.4.a Uninformed Trader A.3.4.b Informed Trader 21 21 22 23 25 27 30 32 33 36 38 40 A.4 Model Interpretation A.4.1 Case 1: Homogeneous Information, Non-Random Supply 44 46 iii . . . A.4.2 Case 2: Homogeneous Information, Random Supply A.4.2.a Efficient Market Model A.4.2.b Hedging Model A.4.3 Case 3: Asymmetric Information, Randomness Present A.4.3.a Standard Model A.4.3.a.i Efficient Market Model A.4.3.a.ii Hedging Model A.4.3.b Extended Model A.4.3.b.i Efficient Market Model A.4.3.b.ii Hedging Model . . . . 51 55 59 65 66 68 74 75 76 79 A.5 Summary and Conclusions 81 A.6 References 87 A.7 Appendices A.7.1 Appendix 1. Case 3, Standard Model: Derivatives A.7.2 Appendix 2. Case 3, Standard Model: Differences A.7.3 Appendix 3. Case 3, Extended Model: Derivatives 88 88 92 94 B o n d O p t i o n P r i c i n g , E m p i r i c a l Evidence Part B B.l . 101 Introduction 102 B.2 Pricing Theory B.2.1 Asset Pricing Theory B.2.1.a The Brennan-Schwartz Model B.2.1.b The Black-Scholes Model B.2.2 Arbitrage Portfolios 104 105 109 114 115 B.3 Data Description B.3.1 Parameter Estimation Data B.3.1.a Short Rate Series B.3.1.b Consol Rate Series B.3.l.c Market Price of Short Rate Risk Parameters B.3.2 Test Period Data B.3.2.a Bond Option Data B.3.2.b Treasury Bill Option Data B.3.2.C Arbitrage Portfolio Data 118 119 120 120 121 122 123 125 127 B.4 130 Numerical Solution of the Asset Pricing P D E iv B.4.1 B.4.2 The Alternating Direction Method Stability of the Solution Method 133 139 B.5 Parameter Estimation B.5.1 The Simple Linearization Method B.5.1.a Minimum Distance Estimator B.5.Lb A One-Dimensional Example of Simple Linearization . . B.5.2 Brennan-Schwartz Parameter Estimates B.5.2.a Choice of the Brennan-Schwartz Joint Process Form . . . B.5.2.b Brennan-Schwartz and Simple Linearization B.5.3 Estimation of the Price of Risk and Reversion Coefficient a . B.5.3.a Minimum Distance Estimator B.5.3.a.i Portfolio Formation Schemes B.5.3.a.ii Covariance Matrix Assumptions B.5.3.b Parameter Estimates v 142 143 144 145 147 148 150 153 154 155 156 157 B.6 Pricing Model Errors B.6.1 Linear Interpolation B.6.2 Cubic Spline Interpolation B.6.3 Quadratic Interpolation B.6.4 Bond Option Pricing B.6.4.a Choice of Interpolation Method B.6.4.b Bond Option Pricing Errors B.6.5 Treasury Bill Option Pricing B.6.5.a Treasury Bill Option Settlement Adjustment B.6.5.b Treasury Bill Option Pricing Errors 161 161 163 164 166 167 171 173 174 176 B.7 Arbitrage Tests : B.7.1 Bond Option Arbitrage Tests B.7.2 Treasury Bill Option Arbitrage Tests 178 180 182 B.8 Suggestions for Further Research B.8.1 Analytic Solution of a Schaefer-Schwartz Stochastic Process B.8.2 Simultaneous Solution of First Partial.Derivatives B.8.3 Asset Pricing by Risk-Adjusted Expectation . . B.8.4 Testing for Instability B.8.5 Grid Spacing and Boundary Conditions B.9 B.10 Summary and Conclusions . 184 184 187 190 195 196 198 References 201 v LIST OF T A B L E S Welfare Aspects of an Asymmetric Information Rational Expectations Model Part A Table A . l Table A.2 97 97 98 Bond Option Pricing, Empirical Evidence Part B Table B . l Table B.2 Table B.3 Table B.4 Table B.5 Table B.6 Table B.7 Table B.8 Table B.9 Table B.10 Table B . l l Table B.12 Table B.13 Table B.14 Table B.15 Table B.16 Table B.17 Table B.18 Table B.19 Table B.20 Table B.21 Table B.22 Table B.23 Table B.24 Table B.25 Table B.26 Table B.27 Table B.28 Table B.29 Table B.30 Table B.31 : vi 203 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 Table B.32 Table B.33 Table B.34 Table B.35 Table B.36 Table B.37 Table B.38 Table B.39 Table B.40 Table B.41 Table B.42 Table B.43 Table B.44 IL 236 U „ 8 „Q 240 241 242 7 • • ; ; ; ; ; 2 4 3 244 245 246 vii LIST OF FIGURES Welfare Aspects of an A s y m m e t r i c Information R a t i o n a l Expectations M o d e l Part A . 99 Figure A . l 99 viii OVERVIEW This thesis presents two unrelated studies in finance. To aid the reader, part A Welfare Aspects of an Asymmetric Information, Rational Expectations Model - is presented in its entirety before part B - Bond Option Pricing, Empirical Evidence. This means that the references, appendices, tables and figures for part A have been placed before the start of part B. Only the references and tables for part B will be found at the end of the thesis. 1 PART A: Welfare Aspects of an A s y m m e t r i c Information R a t i o n a l Expectations M o d e l A.l INTRODUCTION This study deals with models of rational expectations markets, that is, with markets in which traders look to the equilibrium price of an asset as a source of information on which to base their own trading decisions. It is true that prices reflect information in Walrasian models, with the difference being that Walrasian traders do not attempt to use the information that is present in prices. To the extent that traders actually do try to use market prices as a indirect source of other traders' information, modelling trading activity using a rational expectations model such as those used here should be a more accurate representation of reality. At the least, we may be able to identify qualitative aspects of rational expectations models not present in their Walrasian counterparts. Models of Walrasian markets have been analyzed extensively in order to isolate and identify the various dynamics underlying individual welfare changes. The same is not true of the rational expectations models that have appeared in the finance literature. The reason is undoubtedly the complexity of the formulae describing the expected utility of market participants. In this study, I have hopefully illuminated the inner workings of several asymmetric information, rational expectations models and helped somewhat to fill this gap in our knowledge. The basic features of the models used in this study are the same as used by Grossman and Stiglitz (1980), namely, negative exponential utility functions, normally distributed random variables and the division of traders into two groups: informed 3 traders and uninformed traders. The informed traders all receive a piece of information before trading begins, 1 and the uninformed traders try to use the price of the risky asset as an indirect source of the information that informed traders have received. In several points, however, my models do differ from the Grossman and Stiglitz model. Unlike their model, the individual traders in my models do not have the choice of being informed or uninformed. The groups are predetermined and traders do not have the option of moving from one group to the other. In addition, the Grossman and Stiglitz model allows consumption to occur only at one point in time. I improve slightly on this by allowing consumption to occur both at the beginning and end of the period which is modelled. M y main concern in this study has been to provide an understanding of the welfare changes which result in these rational expectations models when the quality of information given to the informed trader group is increased. 2 As there are five welfare effects which come to light in this study, all of them requiring reasonably lengthy discussion, I will not attempt to outline the results at this point, and trust that the sections that follow will succeed in that regard. 1 2 A l l informed traders receive the same piece of information. B y increasing the quality of information, I mean that the correlation between the information and the future payoff on a risky asset is increased. 4 A.2 MODEL DESCRIPTION The two elements that are crucial to an analytic solution of the problem addressed here are first of all the negative exponential form of the utility function used and second the assumption that all random variables are joint normally distributed. The reason that these two rather restrictive assumptions are needed has to do with the nature of the problem being solved, namely, its rational expectations character. Simply stated, the market participants condition their actions on an observable indicator - in this case a market price - but this same indicator is endogenously determined by the actions of all the participants in the model. That is, we must solve a 'chicken and the egg' problem or, in more technical terms, a fixed point problem. As a result, the analytic solution found in this study is very sensitive to some of the assumptions, especially the two mentioned above. A.2.1 U t i l i t y Functions As was mentioned above, it is quite important that the utility function used is negative exponential. This utility function paired with joint normally distributed random variables combine to allow the only known analytically solvable rational expectations model. 3 Even when these two assumptions are made, the solution This is not true if one allows random variables having discrete probability distributions. See Kraus and Sick (1979) for an example using the power utility 5 is still quite difficult to obtain. So difficult, in fact, that the models to date have been single period models with consumption allowed only at the end of period. The utility function used in the literature up to this point has been U = -/? e x p ( - a C ) , a > 0, 1 > /? > 0, 'standard' model, where C is consumption at the end of period. Since this has been the standard in the literature to this point, I will call this the 'standard' model. In addition, I analyze a more complex model allowing consumption at two points in time, namely, both the beginning and end of period. The utility function used for this extension of the standard model is V where C 0 4 — — exp(—aC0 ) — /3 exp(—aCi), 'extended' model, is the beginning of period consumption and C\ is consumption at the end of period. Because this model is an extension of the standard one, I call it the 'extended' model. A.2.2 Aggregation and Trading Volume Unlike the Verrecchia (1982) model where each individual receives an independent piece of information, the Grossman and Stiglitz formulation has all informed 1 function. Note that a is the same for both the beginning and end of period terms in the 'extended' model utility function, and the same for all all traders. The variable /? is also the same for all traders. 6 traders receiving the same piece of information. Because of this, we make the assumption that we can aggregate over all the informed traders, replacing them with a representative informed trader. Similarly, we use a representative uninformed trader. Apparently this is an innocuous assumption, though there is one aspect of it which makes model interpretation difficult. The one problem which arises is with regard to trading volume. Since we have replaced groups of individuals with representatives, the model trading volumes represent only the trading that occurs between the two groups. They do not include trading which occurs between members of the groups. That is, representative trader trading volume includes only mtergroup trading, not tniragroup trading. 5 Other than this, there should be no other side effect of group aggregation. A.2.3 Endowments Previous to this study, there was no question what the endowments of the market participants were. There were only two assets - the riskless technology and the risky asset - and trader t was given an initial endowment of each: m,- of the riskless technology and /,• of the risky asset. Aggregate supply constraints were 5 Of course, in the particular framework of this study, since all traders have the same utility function there is no intragroup trading, only intergroup trading. Intragroup trading would occur in a model in which traders had different tastes. 7 imposed to give 1 1 n - £ m , = m, n "£/.=/• With this setup, trader t's initial wealth is Woi = mi + fip, where p is the price of the risky asset. A.S.S.a GROSSMAN 6 AND STIGLITZ MODEL In the interests of simplicity, Grossman and Stiglitz (1980) limited the initial endowments of traders to. the two mentioned above, namely, endowments of the riskless technology and risky asset. However, since participants in the model are given perfect information about the entire structure of the economy - the utility functions of all traders and the distribution functions of all random variables are common knowledge - Grossman and Stiglitz were able to show that without some obscuring 'noise' in the model the result is a fully revealing risky asset price. That is, when there is no 'noise' the uninformed traders have enough knowledge of the structure of the economy to enable them to figure out exactly what the informed traders' demand for the risky asset is as a function of the information The price of the riskless (constant returns to scale) technology is 1. Prices are in terms of units of a good which may either be consumed or invested, with the only restriction being that consumption cannot occur at the beginning of period in the standard model. 8 they receive. Since no trading takes place in this rational expectations market except at the equilibrium price, we assume that the equilibrium price is part of the information set of the uninformed. However, because they have exact knowledge of the informed traders' demand function, the uninformed traders can use the equilibrium price to figure out exactly what information the informed traders have received. Unless we add some obscuring 'noise', the equilibrium price is a sufficient statistic for the information. Grossman and Stiglitz chose to introduce the needed 'noise' by making the aggregate supply of the risky asset an unobservable random variable. With this change, the equilibrium price reveals to the uninformed traders only a linear combination of aggregate supply noise and the informed traders' information. Using this linear combination, uninformed traders are not able to exactly invert the pricing function to find what information was received by the informed traders. They are only able to calculate a probability distribution for this information. Since the price no longer fully reveals information it is called a partially revealing equilibrium price. 9 A.2.3.b EFFICIENT MARKET MODEL There is an alternative way to introduce noise into the supply of the risky asset. 7 We can retain the assumption of a random aggregate supply of the risky asset, and still allow the beginning of period holdings to be non-random. This leads us into a small difficulty of interpretation, since the sum of the initial endowments of risky asset does not equal the aggregate supply. That is, i=i where s is the per trader supply of risky asset. Apparently, by making this assumption the model has been opened. That is, an exogenously determined element has been added. One characteristic of this exogenously determined supply component is that its size is independent of the price of the risky asset. It is a perfectly inelastic supply component. This is in stark contrast to the demand/supply functions of our rational expectations traders. These traders condition their demands on the market price of the risky asset. Since this additional supply component is exogenous and perfectly price inelastic, we can infer that another - unpredictable - group of traders has been added to the model, and that they are not rational expectations traders. The interpretation that I like to put on the model is the following. If we are This model is labelled the 'efficient market model' not because the market modelled here is actually efficient - it is not - but because a group of traders assumes, or acts as if, it were efficient. 10 attempting to model a real market by using rational expectations traders, we most likely don't really want to model a situation where every trader is a rational expectations trader. In order to draw information from the price at which an asset is trading, a trader has to be intimately familiar with the market he is trading in. If the price acts in a peculiar way, he must first be able to identify peculiar price behaviour when it occurs, and second be able to interpret what any given peculiar price behaviour means. The ordinary trader does not follow the market enough to be able to identify peculiar price behaviour, or to interpret it if he could identify it. The rational expectations traders, therefore, form a central group of traders closely following the price behaviour of an asset. Normally, when we think of an efficient market we think of the price of the asset reflecting all known information about the asset. The asset price reflects all this information because a small group of traders closely follow the asset's price movements and step in whenever the price departs even slightly from what it 'should be'. They keep the asset price where it 'should be'. The ordinary trader, of course, benefits from this, too. He is assured that the equilibrium price at which an asset trades in an efficient market is the correct price at which to buy the asset. That is, since the asset is presumably always correctly priced, the price the ordinary trader has to pay doesn't matter. It's always guaranteed by the actions of the rational expectations traders to be the right price. If the market is efficient, a 'naive' trader can safely ignore the fact that an asset price conveys information. This, then, is a justification of the exogenous, perfectly inelastic supply of the risky asset. We c a n think of it as due to the demand/supply generated by 'naive', 11 efficient market traders. The reason that they are modelled as a random supply 8 element is that they are unpredictable as far as the rational expectations traders are concerned. When the rational expectations traders invert the pricing function, the result is a probability distribution for supply/demand aggregated over all the rational expectations traders. This group of traders forms a small, stable core of predictable traders, whereas little is known about the much larger group of 'naive', efficient market traders. Because little is known about how they form their demands, they appear to trade randomly. 9 Given that the beginning of period endowments of risky asset are now free of the additional role of introducing randomness into the model, I decided to set the rational expectations traders' initial endowments equal to the Hakansson, Kunkel and Ohlson (1982) 'no-information' endowments. That is, I set the endowments to what the traders' equilibrium holdings would be if all rational expectations traders were uninformed and we allowed them to trade to equilibrium before the beginning of the p e r i o d . 10 Since we are dealing with the simplified case where all Although I label these traders 'efficient market traders', I do not mean to imply that the market modelled here is actually efficient. What is meant is that these traders assume or act as if it were efficient. As is shown in later sections, this turns out to be an erroneous assumption. The market is not efficient in the model I have labelled the 'efficient market model'. Note that there must be some correlation between the individual demands of these • 'naive' traders, assuming that there is a large number of traders in this group. If there were no correlation, or 'fads', then aggregation of their demands - each of which is assumed normally distributed with a mean of zero - would result in an aggregate demand of zero, with no variance. That is, in the limit as we aggregate over an infinite number of 'naive' traders, without 'fads' there would be no random supply component. Note that these Hakansson, Kunkel and Ohlson endowments are only applicable in a single-period model. If we wished to view this as a model of one period taken from a multi-period framework, then this additional round of trading can be eliminated. It is not required, as all of the possible information effects that may arise are discussed in this study. Eliminating this round of trading would not 12 traders have the same absolute risk aversion, this means that all traders receive the same no-information initial endowment of risky asset, namely, the expected beginning of period per trader supply, / . A.2.8.C HEDGING MODEL Consumption based trading is an important use of 'real' markets, but we know that it is not the only use. People also trade in the market in order to hedge their positions in other non-tradable assets, or assets which might be traded if one was willing to pay high transaction costs. The reason for hedge based trading is that exposure to risk from holding an asset - which can be traded only by incurring high transaction costs - can be reduced by trading in another asset with transaction costs which are relatively low. For simplicity, I will assume here that the assets being hedged are non-tradable assets. Specifically, I assume that traders have an endowment of an additional, nontradable asset. This endowment, hi, has a payoff which is equal to the payoff on the risky asset. This, of course, covers all possible cases, since an asset with a payoff which is partially correlated with the payoff on the risky asset can be thought of as a combination of two assets: one having a payoff perfectly uncorrelated with the risky asset payoff and the other perfectly correlated. 11 Trader t's initial wealth, change any of the conclusions of this study. I do not explicitly model the asset having a payoff perfectly uncorrelated with the 13 therefore, is Woi = mi + {fi + hi )p, where p is, as before, the price of the risky asset. 1 Randomness is introduced into the model by assuming that the endowments of this non-traded asset are random. That is, at the beginning of the period each trader receives income in the form of non-tradable asset, hi. The amount of income received by one individual is unknown to the rest of the traders in the market, that is, it appears random to t h e m . 13 As proposed by Hellwig (1980), we also assume that no trader can use his own income, hi, to determine anything about the aggregate quantity, h. To prevent this, Hellwig assumes that there is a very large number of traders in the market, so that each trader's contribution to this aggregate is infinitesimal and, consequently, the correlation between any particular hi and the aggregate h is also infinitesimal. There are two pleasing aspects to this model. First, we have avoided the GrossmanStiglitz assumption of random beginning of period holdings of a traded asset, somerisky asset payoff, as there are no effects of information on this asset's contribution to utility. That is, explicitly adding this asset would merely shift the utility function, with the amount of the shift being unaffected by anything else in the model. Since the payoff on the risky asset and hi are equal, the shadow price of the non-tradable asset hi will equal the market price of the risky asset. I assume that the trader knows his own non-tradable income. It is sufficient to assume that the rest of the traders are ignorant of it. Note that this model is conceptually extendible to a multiperiod framework, whereas the Grossman and Stiglitz model is not (even though the two are mathematically identical). 14 thing which is difficult to justify if we axe thinking in terms of financial assets such as stocks. Second, we have expanded the model to include not only consumption based trading, but also hedge based trading. Notice, however, that since the risky asset and non-tradable asset are perfect substitutes, this model is mathematically identical to the Grossman-Stiglitz model. It provides another way to interpret the Grossman-Stiglitz framework. A.2.4 T h e Riskless Technology Besides the risky asset, there is also a riskless technology in this economy. The riskless technology can be consumed either now or at the end of the period in the 'extended' model, but only at the end of the period in the 'standard' m o d e l . 14 The payoff per unit of the riskless, constant returns to scale technology, R, is assumed to be exogenously given. R = 1 + r, r exogenous This corresponds to a rate of return, r, which is totally insensitive to supply or demand. That is, the supply of riskless technology is perfectly elastic at this rate of return. This assumption is needed in order to keep the model relatively simple. As is 1 The risky asset cannot be consumed, it must be held until the end of the period. At that time it produces a payoff which is consumable. 15 easily seen, if we were instead to allow the riskless technology price to be sensitive to aggregate demand for the riskless technology, then its price would convey information about that demand. Of course, demand for the riskless technology is a function of the information that informed traders have, so that the uninformed traders in the model could potentially use the price of the riskless technology as an indirect source for this information. 15 To prevent the price of the riskless technology from fully revealing this information, we would be forced to introduce randomness into the supply of the riskless technology, thus complicating the model needlessly. Therefore, in order to keep the model relatively simple, we want to restrict the function of information transmission to the price of the risky asset only, which forces us to make the supply of the riskless technology perfectly elastic. A.2.5 R a n d o m Elements Up to this point, we have already mentioned several random variables in the models. In the section on endowments, section A.2.3, when defining the random initial endowments of /,• in the Grossman and Stiglitz model, the random aggregate supply of the risky asset, a, in the efficient market model, or the random nontradable endowments, hi, of the hedging model, we defined the random elements uing normal probability distributions. This is, as mentioned previously, required for the analytic solution of our rational expectations model. A similar argument holds if the riskless technology price is sensitive to the price of the risky asset. 16 I have mentioned in several places that there is a risky asset in the models, but have never explicitly defined it. I have also mentioned that the informed group of traders receives information correlated with the future payoff on the risky asset. More precisely stated, l e t 16 17 where x is the end of period payoff per unit of the risky asset and e is the information received by the informed group of traders. The correlation between these two variable, p, can be thought of as the 'informativeness' of the information. Note also that the end of period payoff on the non-tradable asset endowment or income hi is equal to the payoff on the risky asset. 18 As a last note, all the random elements are taken to be independently distributed, except for x and € which, as stated, have a correlation of p. A.2.6 T i m i n g of Events The exact timing of events may be confusing at first, so I have provided a time line in Figure A . l . The first event to occur is the receipt of the risky and riskless Recall also that all informed traders receive the same piece of information. Since the definition of information is somewhat arbitrary, I have chosen to let the variance of e be equal to the variance of x, and its mean be equal to zero. This is justifiable,' as the information contained in the random variable e is exactly the same as the information contained in an arbitrary linear combination a + be. See section A.2.3. 17 endowments. We also supply the rational expectations traders with their common knowledge regarding the utility functions of the other participants. We do not, however, provide them with the knowledge that there will be an asymmetric information situation in the future. Next, we allow the rational expectations traders to trade to a Hakansson, Kunkel and Ohlson (1982) 'no-information' equilibrium. 2 0 19 Once the Hakansson, Kunkel and Ohlson equilibrium has been reached, we sup- ply traders with their endowments of non-tradable asset (in the hedging model), and with the rest of their common knowledge, namely, who will be members of the informed group plus the distribution functions for all the random variables of the model. Following this, we supply traders with their additional non-tradable endowments (in the hedging model only). We can then calculate the 'pre-info' expected utility of wealth for the informed and uninformed traders in the model. That is, we calculate their expected utility before receipt of information by the informed traders. Of course, the 'post-info' expected utility of wealth is a function of the actual signal received, so it cannot be used as a basis for conclusions regarding an individual's welfare. Otherwise, our conclusion might also be a function of the actual signal received. It seems more reasonable to base any conclusions on the expected utility of individuals at a point in time where a signal is expected to be but has not yet been received, that is, on the 'pre-info' expected utility, which is calculated in expectation of information receipt. That is, the rational expectations traders trade without knowing that there will be an asymmetric information situation arising in the future. Note that this round of trading is not present in the Grossman and Stiglitz model, only in the efficient market and hedging models. As mentioned in a previous footnote (section A.2.3.b), this round of trading is not essential in the efficient market model nor the hedging model. 18 Once a signal has been received by the informed trader group, trading (and consumption in the 'extended' model) may occur. The beginning of period market clearing price of the risky asset is determined by equilibrium conditions and is used by the uninformed traders as an imperfect signal of the information received by the informed traders. It is at this point that we can calculate the 'post-info' expected utility of traders. Nothing further happens in the model until the end of period, at which time the riskless technology and risky asset generate their payoffs, and final consumption of end of period wealth occurs. One potential problem with the sequence of events as depicted in Figure A . l is the lack of an additional round of trading just before the point where a signal is received by the informed trader group. Presumably, the uninformed traders could somehow insure themselves against potential exploitation by the better informed traders if only they had the opportunity of doing so before the information was received. In fact, it is not even necessary to introduce an insurance market into the model.. Simply allowing an additional round of trading in the risky asset just before information was received, by the informed traders would protect the uninformed traders against any potential exploitation. The effect of an additional round of trading before information receipt would be to reveal some otherwise unknown random variable to all traders. For example, in the Grossman and Stiglitz model the only reason the market clearing price of the 19 risky asset does not fully reveal to uninformed traders the information received by informed traders is that the price is a function of two random variables - the aggregate supply of risky asset and the information received by informed traders both of which are unknown to the uninformed traders. If there were an additional round of trading prior to receipt of information, then the clearing price at that point would be a function of only the aggregate supply of risky asset. That is, the price would reveal the value of the aggregate supply to uninformed traders. When the next round of trading occurred, after information was received by the informed traders, the market clearing price would still be a function of two random variables - aggregate supply and information - but only one would still be unknown to the uninformed traders. The result is that the price would reveal the value of the second random variable to the uninformed traders. That is, it would perfectly reveal the information received by the informed traders. In order to retain a partially revealing market clearing price, we cannot allow another round of trading to occur between receipt of endowments and information. A partially revealing price must be a function of two random variables, both of which are unknown to uninformed traders. 21 In the efficient market model, we can allow an additional round of trading between rational expectations traders without revealing an otherwise unknown random variable to the uninformed. However, we cannot allow a round of trading between rational expectations traders and 'naive' traders if doing so would reveal what the 'naive' trader demand at the next trading point - after information receipt would be. For example, if this round of trading exhausted 'naive' trader demand, then the next round of trading would take place without 'naive' traders and the price of the risky asset would be fully revealing. In the hedging model, an extra round of trading would reveal to the uninformed traders the size of the average endowment of non-tradable asset, h. Consequently, at the next round of trading, the risky asset price would be fully revealing. 20 A.3 MODEL DERIVATION It may be confusing to understand how one can calculate optimal actions using a price function for the risky asset, when that price function is endogenously determined by these actions and has itself not yet been calculated. This, of course, is the essence of the rational expectations problem. The price function is a fixed point solution to the problem. It is that particular price function which, when used to calculate optimal actions, leads to a price function which just happens to be the same as the one we started with. The models that are derived in the sections below ignore the possibility of informed trader cartels. That is, we deal here only with the case of non-schizophrenic traders a la Hellwig (1980). A.3.1 Initial and Terminal Wealth Collecting all that we have said about endowments, the initial wealth of one of our traders, trader t, must be W oi = m,- + (/„• + hi) p. If this trader changes his position so that he holds a total of 2,- units of the risky 21 asset 22 and consumes Co, at the beginning of the p e r i o d , 23 then assuming he invests the balance of his tradable wealth in the riskless technology, his holdings at the beginning of period are Zi units of the risky asset, hi units of non-tradable asset h, n%i + (/,- — Zi) p — Coi units of riskless technology. Since the end of period payoffs on these different assets are, respectively, x, x and R, this individual's end of period wealth would be Wu = [Zi + hi) x + R (m,- + (/,• - Zi) p - Co,-) = R (mi + fip - C ) + Zi(x - Rp) + hiX. 0 t The model ends at the end of period, so this is also the end of period consumption, A.3.2 'Post-info' Expected U t i l i t y The post-info expected utility of our traders is calculated at the beginning of period, at exactly the same time that the market equilibrium price of the risky asset is determined. The reason the two events occur simultaneously is that we cannot calculate an uninformed trader's expected utility until he has his full Unlimited short sales are allowed. Thus, the Grossman-Stiglitz and hedging models are mathematically identical even though the hedging model contains an extra asset. In the 'standard' model, consumption is not allowed at the beginning of period. Just let Co* be zero for this case. Ci in the 'standard' model. 22 information set. However, part of the uninformed trader's information set is the market clearing price itself. He calculates his optimal beginning of period consumption and investment using the market clearing price of the risky asset as a. signal telling him something about the information received by the informed traders. Of course, this is the exact nature of the rational expectations model we are solving. The optimal consumption and investment are fixed point functions. We use the optimal consumption and investment decisions of all individuals to determine the market clearing price, and given that market clearing price people are satisfied that the consumption and investment decisions that they made are actually optimal. That is, nobody wants to change his decision. A.S.2.a PROBABILITY DISTRIBUTION OF FUTURE CONSUMPTION Using the end of period wealth expression derived above, our description of individual t to this point is Ui = -p e x p ( - a C i ) , Ci = R(rrii + fip) +Zi(x- Rp) + hiX, in the 'standard' model and Vi - - exp(aC ) - $ e x p ( - a C ) , 0l lt Cu = R(rrii + /,p - C 0 » ) + z,-(z - Rp) + hiX, 23 in the 'extended' model. As all the variables in the model are normally distributed, 25 and end of period consumption is a simple linear combination of these normal variables, end of period consumption must also be a drawing from a normal distribution. In the 'standard' model, the mean and variance of this distribution are E (C,-| J,-) = R{mi + fiP ) + z,-[E (z|/,) - Rp] + h{ E (z|/,), v a r (C,|/,) = + /i,) var (z|/,), 2 where J, is the information set available to individual t. 26 In the 'extended' model we have E (Cu \Ii) = R(mi + Up- C O , - ) + *,-[E (z|/,) - Rp] + ht E (z|/,-), var (Cu \Ii) = {zi + htfvar (z|/j). In order to calculate the optimal beginning of period consumption (in the 'extended' model) and risky asset investment, we will need to calculate the derivatives of the expressions above with respect to the decision variables Co,- and z,-. For convenience, these will be stated here. For the 'standard' model we have ^ E(C,|/,)=E(z|/,)- Rp, J r Q — v a r (C,|/,) = 2{zt + hi) var (z|/,), aZi I make the assumption here that the price of the risky asset will be a simple linear combination of variables drawn from normal distributions, and will therefore also be a drawing from a normal distribution. We see below that there does exist a fixed point price function satisfying this assumption. The informed trader information set contains the information received by the informed trader group plus the price of the risky asset, while the information set of uninformed traders only contains the price of the risky asset. In addition, all rational expectations traders have complete knowledge about the utility functions of all traders and distributions of the random elements of the economy. 24 and in the 'extended' model d . E(C ,|/,) = E(x|/,)-iZp, azi Q —var \Cu\Ii) = 2{zi + ^ ) v a r ( x | / ) , azi 1 t 9 9 C0 i E(Cu \Ii) = -R, Ol A.S.S.b OPTIMAL INVESTMENT AND CONSUMPTION In this section it becomes clear exactly why we require negative exponential utility functions and normal random variables. The problem is a standard utility maximization problem with two decision variables, z,- and Co,-. In the 'standard' model, individuals solve for J,*, their maximum expected 'post-info' utility, J- {Ii) = Ji(zi; Ii) = max Ji{zi\ Ii), Ji(zi;Ii)=-E(Ui(zi)\Ii), and for Kf in the 'extended' model, K*{Ii) = Ki{zUC*0 i,Ii)= max #,-(*,-,C ,-;/,-), Zij Coi 0 Ki(zi, C ; / , ) = E (Vi(Zi , C , ) | / , ) . 0 I The 0 reason that we can solve this problem analytically is due to the following property of the exponential function: E (exp(—ax)\Ii) = exp ^—aE(x|/,) + ^ a v a r (x|/,)^ , 2 25 for x normally distributed. That is, the expectation of an exponential function is itself exponential, but only if the argument of the exponential function is normally distributed. Now it is clear why these two assumptions are so critical. Actual calculation of individual x's expected utility gives us Ji{zi\ Ii) = ~P exp ^ - a E (C,|/ ) + ^ a v a r (C |/,) f 2 t in the 'standard' model, and Ki{zi>Cu \Ii) = - e x p ( - a C ) -p exp (-aE{Cu \Ii) + ^ o v a p ( C | / , ) 2 0 t l f in the 'extended' model. If we calculate the first order conditions and set them equal to zero we find that the 'standard' model optimal beginning of period investment in the risky asset z*{ satisfies 27 AJ.(0;;J,) = O a^-EiCM-la^vaxiCM) ozi L ozi = 0, => o[B(x|/,-) - Rp] - a (z * + hi)m{x\Ii) = 0, 2 t E(«|/,-)-JZp a v a r (x|i,) In the 'extended' model, the optimal beginning of period investment, z,*, and 2 7 The expressions for the expectations and variances of C,- have been taken from the previous section. 26 consumption, C Q , - , satisfy d dzi ^*,c0v/t) = o 2 ^-[JiCJCJO + expC-aCS,.)! a ^ - E ( C | / ) - Ja Avar(C |/,) lf as,- t 2 as,- lf = 0, E(x\Ij) -Rp avar(z|/,) dQ -^,-,C7*,;/i)=o 0i a exp(-aCo,-) -[^(/i) + exp(-aC*,.)] => a e x p ( - o C ; , ) - [«•,*(/,) + exp(-oCJ,)| {-aR\ = 0," => exp(-oC„\) = - K;(/,). Given these expressions for individual i's optimal beginning of period investment and consumption, we can aggregate the demand for the risky asset and impose an aggregate supply constraint. Notice that the price of the risky asset appears in two places. It appears explicitly as 'p' in the equations, and also implicitly as a part of the information set J,-. A.S.2.C 'POST-INFO' 28 EXPECTED UTILITY FUNCTIONS The final step in calculating the 'post-info' expected utility functions is simply substitution of the expressions for z\ and CQJ back into the expressions for exAs mentioned in a previous footnote, we are assuming that p is a simple linear combination of variables drawn from normal distributions and is therefore also a drawing from a normal distribution. 27 pected utility which were used to derive them. That is, from the previous sections, we have J,(z,;/,) = -p exp (-aE(Ci\Ii) + i a v a r (C,-|/<)) 2 = - 0 exp I - a[R(mi + Up) + z , ( E (x|/,-) - Rp) + A , E (x|/,) + ia tf,(z,-,C ,;/,) 0 2 [(2, + A , ) v a p ( x | / t ) ] j , 2 = - e x p ( - a C ) - p exp ^ - a E (C -|/,-) + i a v a r (C -| J,)^ 0t 2 lt lf = - e x p ( - a C ) - p exp ^ - a[i2(m,- + / ; p - C . ) 0f 0 + z,(E(x|/ )-i?p)-rA E(x|/,)] t t 2 + i a [(z,- + /i -) var(x|/,)] 2 t Substitutirg in the optimal values of z,- and Co,- gives 7;(/ -) = - p exp ^ - a[i2(m,- + /,p) + < ( E (x|/,) t + ia = ~P exp 2 [ ( z ; + ^) var(x|/,-)] 2 - ai2[m,- + (/,- + hi)p\ l[E(x|/,-)-flp] 2 var(x|/i) 28 ; + A , E (*|/,-)] * ; ( / „ • ) = - e x p ( - a C \ ) - p exp ( - a[J2(m,- + Up - C \ ) + < [ E (z|/,•) - £ p ] 0 0 + /i,E(z|J,)] = - exp(-oCo,) - p exp - oJ2[m -.- (/,• + hi)p - C * , - ] f r l[E(z|J,)-£pp + 2 var(z|J ) t The terms in C Q , - can be eliminated by using one of the first order conditions from the previous section, namely, exp(-aCo\) = - (j^z) *7(4), giving -R x exp (- .,K*„,* ,u!l?««-*'f M <>"'•"«>(- -(Mr) 2 var(z|/ ) t ) ri R 1 [E (z|J,) - Rp] 2 - exp - 1 1 + R var(z|/ ) t aR[mi + (/,- + hi)p\ l[E(z|/,)-£pj 2 - 2 var(z|J,) ln(/?J2) - (1 + R) ln 29 ^ R 2 A l l that is needed now is an expression for the market clearing price function p. A.3.3 M a r k e t Clearing Examination of the optimal investment and consumption functions derived in the previous section reveals that the investment decision is independent of the consumption decision. This is the result of using the negative exponential utility function, which has the characteristic that all investment decisions are independent of each o t h e r 30 and of total wealth. This should have become apparent above when it was shown that the optimal investment decision was the same in both the 'standard' and 'extended' models, and contained no references to beginning of period consumption or wealth. Because of this characteristic of the exponential utility function, the equilibrium price of the risky asset may be found without having to worry about simultaneously satisfying a constraint on aggregate consumption. As p is contained in the information set E(x|/,) and var(z|/,) cannot be evaluated until the functional form of p is known. Assuming, of course, that the assets invested in are independent of each other. That is, that they are not partial substitutes or complements. 30 If we let _ — z 0 i Si — Rp "—~2 ' x o _ u — z a(T xi u ~ Rp __2 i aa xu where U \_ ji/ ^ x f i = E(z|e, p), \xu = E(x|p), varfzl/) — i = I xU ~ a v a r if J, is the informed trader information set, if J, is the uninformed trader information set, ( l 'P^' r v a r € ( b)> z ^ ^* 1 3 ^ A ig * m n e ^ o n n e ( l trader information set, uninformed trader information set, then - faj-fc if trader * is informed, \zl-hi if trader x is uninformed. The next step is to sum the individual demands for the risky asset »=i \«e/ ieu / = Az° + (1 - A)z°, - A, where t € / , and i € imply sums over, respectively, all informed and uninformed traders, n j , riu and n are the number of informed, uninformed and total traders, and A is the proportion of the total traders who are informed. 31 To find the market equilibrium, we set this average trader demand equal to the per trader supply, s. Az? + (l-A)*J = s + h 3 1 That is, A = n / / n . 31 A.S.S.a GENERALIZED MODEL In the sections above, I discussed three different models corresponding to three different sets of assumptions about the initial endowments received by traders. The first was the Grossman and Stiglitz model, which had endowments of the riskless technology, m,-, and of the risky asset, /,-. The average endowment of the risky asset was normally distributed and equal to the per trader supply of the risky asset. s~N{ y } \ 5 1 f i = f= N f 1=1 hi = 0, h= • Grossman and Stiglitz 0 The second model was the efficient market model. It also had initial endowments of only the riskless and risky assets. The difference is that the endowment of risky asset is constant and equal to the Hakannson, Kunkel and Ohlson (1982) 'noinformation' endowments. The endowment is n o t equal to the per trader supply. fi = f ^ s, f constant ' • efficient market hi = 0, h= 0 The last model was the hedging model. In this model, iitial endowments of a non-tradable asset, h i , with payoff equal to the payoff on the risky asset are also received. As in the previous model, the endowment of risky asset is constant. In this model, however, the per trader supply is also constant, and equal to the 32 endowment of risky asset. fi = f = s, 1 f constant • hedging model n -'%2.hi = n t=i h~N{ti ,ol} h We can generalize these three models by noting that the important thing in the market equilibrium condition of the previous section was the sum s + h. If we define t then we have { = 3 + h, N{nf,cr'f}, Grossman and Stiglitz model, -^{/> «}> efficient market model, a N{f + A*fc,o-fc}» hedging model, which allows us to express the market clearing condition in a generalized form Az° + (1 - X)»l = t, t~N{n ,a?}, t where fit and o~ depend on the particular model used. In the sections that follow, t as much analysis as possible is done using this generalized model. Following the general analysis, results are analyzed for the particular cases of the efficient market model and hedging model. As the Grossman and Stiglitz and hedging models are mathematically identical, no further reference will be made to the Grossman and Stiglitz m o d e l . 32 I have chosen to use the hedging model interpretation instead of the Grossman and Stiglitz interpretation in order to avoid the assumption of random endowments of a traded asset. This assumption is difficult to justify, especially in a multiperiod situation. 33 A.S.S.b MARKET CLEARING PRICE Up to this point, I have avoided making any assumptions about the market clearing price of the risky asset, except for assuming that it is drawn from a normal distribution. In this section we find a market clearing price function which is 33 normally distributed and satisfies the market clearing condition developed in the previous section. Note that even though we can show existence of this fixed point pricing function - by actually calculating it - we will not have shown uniqueness. There is nothing in the theory which rules out the existence of more than one solution to this fixed point problem. The price function that we find here is a linear combination of random variables. P = Po + Pi« + Pi{t - (h), PO, PI, PI non-stochastic If this pricing function ts a fixed point solution, then we can use it to calculate z® and Zy for use in the market clearing condition. Since x is uncorrected with t, the informed traders' distribution for x, conditional on their information set, i s 2 34 2 x|e,p = x\e ~ N {fix + pe,a x {l - p )} , and x/ = E(x|e,p) = H + p€, x giving o Zl x = R i ~ P ™ 2 o-lj =var(x|e,p) = a {\ 2 x _ A*» + pe - R(p0 2 - p ), + pit + p2 {t - fit )) 2 a*x (l-P ) xI This was needed in section A.3.2 for the calculation of 'post-info' expected utility. Recall that e is the same for all informed traders. 34 Similarly, since for the uninformed traders, we have 2 alv = <r (l - <j>p ), xu = E(x|p) =fix + <fip[e - 9{t - /it)], 2 where P2 Pl^j PM + PW 2 ,_ Pi' <r 2 2 <rl + Po?' This gives ft; - Rp U 2° - fix + <f>p[e - 9{t - fit )] + R[p0 +pie + p (* - fit)] 2 2 avlu acrUl-<t>P ) If we substitute these expressions for z ° and zv into the market clearing equation, we find t = Az° + (1 - A)4 _ ^ /*x + P€ - fl[p + pie + p2 (t - fit)] 0 a<r (l-p ) 2 /*x + X a<7 (l-p ) 2 2 2 M ~ 0(t - fit)] + R[p0 € I + + Pl€ + p (< ~ f^t)] 2 a<T (l-<f>p*) x {{fix - #Po) + v {fix - Rp0 )] [{p-Rp1 )+v{<f>p-Rp1 )]e + {-Rpi) + v{-<f>p9 - Rib)] (t - fH) j , where 35 Equating the constant terms on each side of the equation, those in (t — fit) 35 the terms in e and gives us expressions for the price function parameters. 2 1 / P o = aal{\-p ) \ A ( l + «0 *J * ( ^ - R ITT^J PI = - ~w ^ ) P2 { — A — + V * * ) These parameters can be simplified slightly by recalling the definition of 9, namely, 9 — — p<zjp\. Using the expressions for p\ and p^ we find that ; — 0 , Xp which we can use to express the price parameters as 1 / PI = R 9p \ (JTT; • P2 = - 0P\ • Notice that we can give <f> a natural interpretation as the Mnformativeness of the price system' 3 6 as it is equal to the square of the correlation between the price, p, and the signal, e, received by the informed traders. 2 n 2, [ P 3 5 3 8 _ A ' J " cov (p,€) _ 2 var(p)var( ) ~ C (PI <TI) (PI <T* + PW)°X _ ~* Use fit + {t — Ht) instead of t as the left-hand side of the equation. See Grossman and Stiglitz (1980), p. 399. 36 A.3;4 'Pre-info' Expected U t i l i t y In the sections above, we first derived expressions for traders' 'post-info' expected utility functions assuming only that the price was drawn from a normal distribution. We then found a fixed point market clearing price function which was normally distributed. T h e final step is to substitute the price function into the expressions for 'post-info' expected utility and take expectations over all the random variables which are not known at the 'pre-info' point in time. That i3, we find each trader's expected utility in expectation of the receipt of information. In order to accomplish this, we need to make use of the following properties of the exponential function: E exp -K76) 2 : exp + = e x p \2 u 1+ H -5TW for b ~ 2 + 7 2 i ) ])' l n ( 1+272) N{0,1}, and 2 E exp -[(r'6) + V'b+Q] Vl/+2IT'| = exp (- forfe~ _ I exp Q *'(/ + 2 r r ' ) - ^ - n ^ 1 2 I T ' ) - * + l]n\I+ 1 + 2IT'|]) , N{0,1}. We must reexpress the 'post-info' expected utility functions in terms of standard normal variables and then use the above expressions to find the 'pre-info' expected utility. 37 A.S4.& UNINFORMED TRADER From section A.3.2.C, the 'post-info' expected utility of an uninformed trader in the 'standard' model is J tu = - P and e x P y- L a xU in the 'extended' model 1 {xu - Rpf aR\mi + (/,• + hi)p] + - 'xU - ]n{f3R)-{l + R) Since the only difference between the 'post-info' and 'pre-info' information sets of an uninformed trader is that the 'post-info' set contains the market clearing price of the risky asset, p, while the 'pre-info' set does not, we can find this trader's 'pre-info' expected utility by calculating or ^ = E ( ^ ) tf^ where the expectation is taken over p. First, since p is not a standard normal variable, ~ N{PO,PWJ<(>}, we define a transformation of p which is. \ pi ) °x 38 = E(J5fo), That is, we can substitute p= Po+Pi~j=b into the expressions above, in order to have the 'post-info' expected utility functions expressed in terms of the standard normal variable 6. This is not all we need, however. We also must have an expression for xu in terms of 6. Since we showed in section A.3.3.b that xu = V-x + <M - 0{t - fr)], € 2 alu = <rl{l - <f>p ), it is easily shown that *u = + <rx p\/<j> b. Making the necessary substitutions, we find that Jiu = ~P exp - aR (m{ + (/,• + hi) (p0 + Pi ^ | 6 ^ which is equivalent to ^ where 7 2 + "]), 1 (<f>p - Rprf 2 7 = -/?exp(-[( &) + # = o 2 <p{\ - <pp*) ' + AR{FI+ * ~ 7 ? \—*2(I-*P»)— l (fix - Rpo) U) —2 — ax (l - <pp 2) + aR[mi + (/,- + hi)p0 }. 39 H I ) P I ) Using the formula given in the previous section, we now know that the 'pre-info' expected utility of an uninformed trader in the 'standard' model is + 2 2 i +-ln(l + 2 ) 7 2 where 7, rp and u are as given above. The same procedure can be followed for the 'extended' model, resulting in ^ = E ( ^ ) = -exp - ln{pR) -{1 + R) 1+ R\ 1 2 0 /(l + &Y' 2 l + 2 7 / ( l + #) 2 where 7, rp and ui are the same as given above for the 'standard' model. A.3.4.b INFORMED TRADER From section A.3.2.C, the 'post-info' expected utility of an informed trader in the 'standard' model is J-j = - exp y- aR[mi + (/,- + hi)p] - In/? + \ [ X l _ 2 R P ) 2 axi 2 and in the 'extended' model Kl = - exp [ - 1 + R aR[mi + {fi + hi )p\ + \ 2 [il J V axi _ln(/WZ)-(l + J 2 ) l n ( i ± £ ) 40 ? The difference between the 'post-info' and 'pre-info' information sets of an informed trader is that the 'post-info' set contains the informed trader signal, e, and the market clearing price of the risky asset, p, while the 'pre-info' set does not. If we note that the price of the risky asset is a simple linear combination of e and t, knowing both € and p is equivalent to knowing both e and t. Therefore, the expectation of J*t taken over all e and p is identical to the expectation taken over all e and t. The latter pair of variables will be used, as it is easier to calculate the expectations T u = E(J£) and T iL = E(i^), where the expectation is taken over all e and t. First, since the vector of e and t is not standard normal, (:)-{(:)•(* i)Y we define a transformation which is. >=(o :rG4)~*<°.'> That is, we substitute P = Po + Pi(l,-0) -A. + P . U . - ' j f j °)» = Po+Pi(<r*i-M)6| into the expressions above, in order to have the 'post-info' expected utility functions expressed in terms of the standard normal variable 6. As before, we also must have an expression for xj in terms of b. 41 Using the expressions derived in section A.3.3.b, we can show that xi = Hx + {o- p,0) b. x If we make the necessary substitutions, we find (- aR[mi + (fi + hi)p0 ] -\n/3 + aR(f{ 1 - 1 2<r (l-/> ) + This is equivalent to 2 2 + A,-)pi (a , -a 6) b x t [(it, - Rp0 ) + (<rx (p - Rp^atRpJ) bf J ^ = -/5exp(-[(r'6) +* 6 + n]), 2 , where * r = y/2trl(l - p ) 2 ( °*(p V -Rpi)\ VtRpid ) ' *l( 1 (Mx - -RPO) n =2 2 ^(1 -p ) 2 + aR[mi + (fi + hi)p0 ]. Using the formula given in section A.3.4, we now know that the 'pre-info' expected utility of an informed trader in the 'standard' model is J*u = E ( J £ ) = - exp (- |fl - In/? - \ ¥ ' ( J + 2 I T ' ) - > + \ In\I + 2TT'\ ) , 1 where T , ^ and ft are as given above. 42 The same procedure can be followed for the 'extended' model, resulting in Tu = E( J f c ) = - exp (- ^ (n 2 1 + R\ + i l n / + 2 2 i -(1 + R) hi ( ^ ) + 1 + RJ 1+ R rr' + j? where T, # and ft are the same as given above for the 'standard' model. 43 ) A.4 MODEL INTERPRETATION Each step in the unfolding of a model may be difficult in its own way. The derivation we have just gone through was difficult because the algebraic manipulations were complicated and unenlightening. This next step is also difficult, but in a. different way. We have to use the formulas developed in the previous step to make model predictions or descriptions, and then these predictions or descriptions must be interpreted. We must present an intuitive argument which makes the same predictions as the purely mathematical one and which correctly captures the interactions between elements in the mathematical model. The first requirement, namely, providing an intuitive argument giving the same predictions as the mathematical model is not the difficult part. What is difficult is providing a correct intuitive argument, where by correct I mean not only paralleling some final prediction, but also paralleling the actual dynamics whereby that prediction is produced. In the following sections, I aim to develop an understanding of the dynamics of the models, that is, of the interactions of each of the assumptions making up the models. This understanding will automatically build up to understanding of the more complex model predictions. With this aim in mind, I have divided the interpretation into three cases. In case 1, I present characteristics of the models when all traders are informed (that is, we do not have an asymmetric information model), and there is no random supply (that is, endowments of the non-tradable asset, hi, and risky asset supply are not 44 random). This case introduces the first of the factors which underlie the dynamics of these models: the allocative efficiency benefit. As information quality increases we may find that utility increases due to a more efficient allocation between current and future consumption. As shown in Tables A . l and A.2, this benefit does not arise in the 'standard' model - there is no current consumption in that model. In the 'extended' model, however, this factor can be identified. The second case adds one element of complexity by introducing a random supply. In this case, however, we still do not have an asymmetric information model. This case introduces the rest of the factors needed to understand the model dynamics. An analysis of the efficient market model brings to light the reason why the market modelled is not actually efficient. We see that the relation between the 'naive' efficient market traders and the rational expectations traders is similar to that of an ordinary trader to his dealer. As the dealer has better information than his clients - in particular, knowledge of unexpected demand/supply variantions - he is able to use his own inventory of risky asset to supply unexpectedly high demand and absorb unexpectedly high supply. I name this benefit the rational expectations traders receive in return for this service the dealer benefit (see Table A . l ) . Two additional factors are found in the analysis of the hedging model in this second case. As one of the motivations for trading is the desire to hedge one's position in other non-tradable assets, it is not surprising to find a factor which we can identify as a benefit from the opportunity to hedge. This hedging benefit is shown to be analogous to the dealer benefit which arose in the efficient market model. In addition, a factor is found which reflects the risk of market revaluation 45 of one's endowment of non-tradable asset. After information release, when one does finally have the opportunity to hedge, the resulting benefits are diminished relative to what they would have been had no information been released. This decrease in the benefits from hedging I name revaluation risk. In the third case, the uninformed rational expectations trader is introduced, thus giving us an asymmetric information case. This is the most interesting case, but also the most difficult to analyze. Tables A . l and A.2 summarize the results of the analysis of this case for the efficient market model and hedging model, respectively. A.4.1 C a s e i : Homogeneous Information, N o n - R a n d o m Supply In terms of model variables, this case has no uninformed traders, 37 A = l, and has endowments of only the riskless and risky assets. The endowment of the risky asset is the Hakannson, Kunkel and Ohlson 'no information' endowment. hi = 0 /•• = /, The generalized randomness variable t is also constant and equal to the endowment of risky asset. t = f, H = /, cx = t 0 By non-random supply I mean that the endowments of the non-tradable asset, hi, and the risky asset supply are constant. 46 Several immediate consequences of these assumptions a r e 9 = 2 ao*{l-p*) 38 2 aa x {l-p ) Xp = 1, <t> = - m m if 9p \ 2 i [ / z x - aal(l - p ) /], Po = p_ (1 + <j>v\ Pi V1+ R P2= v J p_ = R' -9pi. It is a. simple matter to substitute the above into the general expressions for the 'pre-info' expected utility of an informed trader to give (<rx {p-Rpx)\ 1 atR V ^<?UI-P U ")\ P ) 2 (fi x - Rp ) 0 2 pi 2 0 (0\ W' = ) ( (Tx (p - °1{1-P ) ft = «*[».,• + (/,- + MPo] + 2 2 2 = aRmi + afiix - \* <T x {1 - p ) f , 2 which, when substituted into the 'pre-info' expected utility function for the 'standard' model, give Ai = ~ ft-ln/? exp ^ ~ + 2VT')- 1^ + ^- ln|/+2rr'| ) 1 = - exp ( - 3 8 - l^'(I Recall that <f> = c o r r ( p , e), so that <f> = 1 means that the price fully reveals the information e. 2 47 Similarly, when substituted into, the 'pre-info' expected utility function for the 'extended' model, we find Ku = - exp n - b ( w (-[r^( 1 v /+ rr 2 1+ R v 1 + 2 = - exp - I + R) ( ^)) $ 1+ R ^(n-mWJO-d + a j i n ^ ) ) 2 - \~ l i TV 1+ R 2 1 a <xx p f 2 2 (1 + R) -exp ( 1 + J J ) I n 2 1 aRrm + afnx -]n((3R) -(1 + R) In 1+ R 2 -^^('-(TT^" )])Now that we have explicit expressions for the 'pre-info' expected utility, we can see what the effects are when we increase the quality or 'informativeness' 3 9 of the signal given to these informed traders. It is easy to see that dp 2 41 = 0, :J* r ' — dp 2 K*u T > 0. - That is, using the terminology of Hakannson, Kunkel and Ohlson leifer (1971), (1982) or Hirsh- the social value of information is always zero in the 'standard' model and positive in the 'extended' m o d e l . 40 The correlation between the payoff on the risky asset and the signal received by the informed traders, p, will be referred to as the quality or 'informativeness' of the signal. This is not the same as the 'informativeness' of the price system, <f>. This result for the 'extended' model was shown by Epstein and Turnbull (1980). 48 This result, of course, already throws into doubt the intuition that better information makes traders better off. As Hirshleifer (1971) pointed out, however, this result should not be too surprising. Information really has no intrinsic worth these traders can't eat it - and only has a derived value when it can have an effect on the allocation of goods in the economy. The flat expected utility curve in the 'standard' model merely points out that our informed traders can't use their information to create a better allocation of their wealth. If they receive information that the future payoff on the risky asset will be poor, their immediate reaction is to sell at the currently high price and buy more riskless technology with the proceeds. But, since all traders are informed, they all want to sell. This depresses the price of the risky asset far enough that everyone decides to retain- their holdings. That explains the flat expected utility curve of the 'standard' model, but the same argument does not appear to apply in the 'extended' model. The argument used to explain the 'standard' model offers only one possible reason for an increase in utility as the quality of information increases. The better information must be allowing a more efficient allocation of wealth. Of course, there is no possibility for a trader to change his holdings of risky asset, since this is a homogeneous information economy, so the effect must be due to the only other investment in the economy: the riskless technology. What is happening is that the allocation of wealth between beginning of period consumption and investment in the riskless technology is more efficient given better information. 41 This is an effect of not fixing the aggregate supply of riskless Returning to Table A . l , the results of this case are shown in the first column. 49 technology. If the supply were fixed, then we would have the same situation as we have with the risky asset, namely, attempts to change holdings of the riskless technology leading only to a price adjustment. The effect, therefore, is not an unrealistic one. We would expect the number of shares of a particular company to be insensitive to demand, which is consistent with the modelling of the risky asset, but the total supply of alternative investments might be sensitive to d e m a n d . 42 We can make an analogy between this and the flexibility of investment plans. We would not expect information to have any value in an economy with totally inflexible investment plans or opportunities. There has to be the possibility of increasing investment in some assets and cutting back investment in others before we would expect information effects in a homogeneous information economy. This possibility is provided by the perfectly elastic supply of the riskless technology, though it is not the fact that the supply is perfectly elastic that is important. What is important is that the supply of riskless technology is not perfectly inelastic. 43 In conclusion, we can expect the 'standard' and 'expected' models to provide different conclusions. In the 'standard' model, because traders consume only at one point in time there is no possibility of trading off current consumption against future consumption. That is, there are no possibilities for increasing allocative efficiency. In the 'extended' model, since consumption occurs at two points in time, This case does not involve dealer benefit. The supply curve for the risky asset, or for a stock, is vertical. The supply curve for total available investments is unlikely to be vertical. That is, what is important is that the supply curve for the riskless asset is not perfectly vertical, not that it is perfectly flat. 50 there is the possibility of foregoing current consumption in return for increased future consumption. At least, the opportunity is present as long as there is at least one investment vehicle which is not in fixed supply. In the 'extended' model this function is provided by the riskless technology. A.4.2 Case 2: Homogeneous Information, R a n d o m Supply In analyzing this case, 44 I first present the mathematical analysis in terms of the generalized model. In separate sections following this, the results are interpreted for the efficient market model and the hedging model. As in the previous case, we still have no uninformed traders, that is, A = l , but the generalized random variable t is no longer constant, 2 t ~ N{(it,a }. The assumptions regarding endowments depend upon the specific model chosen. 1 By random supply, I mean that either the supply of the risky asset is random, as in the efficient market model, or the endowments of the non-tradable asset, h{, are random, as in the hedging model. 51 The immediate consequences of these assumptions are 9 = Xp p - m m . ) 1 Po = Pi = ( °P \ = ^[M*-«^(l-P )Me]i a p_ (1 + <f>v\ _ p_ R\l + u J R' P2 = - d p i . Notice that the price function of the risky asset is identical to what it was in case 1, where we had <r = 0. In the previous case, however, the value of p really t 2 didn't matter, since (t — pit) was constrained to be zero. What we see in this case, therefore, is the addition of a non-zero term to the pricing function. Once again, we must substitute the above into the general expressions for the Unlike the previous case, the price is not fully revealing. That is, 0 < <f> < 1. 52 'pre-info' expected utility of an informed trader. 0 x/2<r (l-p ) V ^2^(1 -pi) V 2 <rt RPl 9 °tRpi9 2 J \i J ~ \a<Tx <rt (l - P ) J 2 ft = « * [ m , + (/, + fc)Po] + \ * l ~ * { P $ = aRmi + apxtfi + hi) 2 2 2 2 + U a x (l-p )[n -2nt (fi + hi)} We can find the expected utility for the 'standard' model by noting that 1 2 *'(/+ 2ir')- * = a a 2 x 2 2 p (fi + hi) + 1 4 ; P ^ ~ 2 ) _ 2 J 2 ) [fH - (fi + hi)} , ft - i * ' ( / + 2rr')- ^ 1 = aRmi + afix (fi + hi) - £ a < r ( / , + A , ) 2 + -a* (l-p){ft -(fi x 2 + hi)} t = aflm - + afix {U + hi) - - a V ( / , - + A , ) 2 t + 2 1/ 2 From this, it is easily seen that b o t h (a ap« V 2 2 ( / X t " ( / « + ^ ' 46 r a. o r r ' W i l A / 1 2 2 a *x (l-p ) \ 2 U + aVMCl-p ); 9 2 / °> efficient market model, I < °» hedging model, = and 9 ap 4 6 2 ln\I + 2rr'| = 1 • ap 2 ln[l + a <r <r (l - p )] < 0. 2 2 The efficient market model has fit — (/,• +fe,-)= 0 53 2 2 Since J u = ~ e x P (- n _ m / g _ I $ ' ( / 2 r r ' ) ^ + Jin|/ + 2rr'| _1 + for the 'standard' model, we have unambiguously 47 J i l < 0. dp2"U Similarly, for the 'extended' model, we have - a W r f U l U (l + i g ) a ^ ( l - p ) 4 2 2 2 and I 21 + n l + # / iE V = j—g 1 + + + * 1 + R) 1+ f a <r (l-p ) Z) + R + Ai) - I (j^g) aV»(/- + A,) ) + 2 2(l i rr*\ 2 . 2 _ aWyi-p )^-^ 2 L + ,, ^ 2 2 From this we can see that dp*\l + R 21 + R\ 1 + 1 + R) 1+ R (1 + R)a?o? X 2((l + R) + a*<rZcT?(l-pZ)¥ 1 R > 0, { < > 0, 47 efficient market model, hedging model, This result and those following are interpreted in the immediately following sections. 54 which may be positive or negative, and In 1 + 2 dp' 4 a TV 1 dp +R 1+ K (T^W -^ 2(1 2 <0. Since the first partial derivative above may be positive or negative, and K ii = - exp ln(/3R)-{l 1 v ( 2 1 + R rr \ V 1 + 2 + R) 1+RI _ 1 * 1+R rr' 1 +R the partial derivative, d —* 3p may likewise be positive or negative A4.2.CL EFFICIENT MARKET 48 MODEL In this model, the endowment of risky asset is constant and equal for all traders.- 49 fi = f = constant Even though the endowments are not random, the needed randomness is provided by a group of non-rational expectations traders outside the model, contributing a This is true for both the efficient market model and the hedging model. The endowment hi is zero. 55 random supply component. The previous section seems quite clear about its prediction for the 'standard' model. There is no possibility of an increase in expected utility for these traders if we increase the quality of their information. From the discussion of the previous case, however, this is not surprising. There is, after all, no possibility of trading off current consumption - none is allowed - against future consumption. If we look at the terms in the expression for expected utility, = ^ (aRtm + a f ix f - 50 ^a ^/ ) 2 2 = 0, In | J + 2IT'| = ^ ln(l + a <r a (l - p )) < 0, 2 2 2 2 we see that there is one term causing a decrease in expected utility as we increase quality of. information. In order to interpret this effect, notice that expected utility increases as we increase 2 the supply variance, a . What we are seeing here is simply due to a difference in outlook. In this model, the 'naive' or efficient market trader has the belief that as long as he is buying at the equilibrium price then the purchase price does not matter. He believes the asset is 'correctly' priced at equilibrium. Substitution of (/4t — (fi + ^ t ) ) — 0 has been made. These expressions are from the previous section. 56 The rational expectations traders have a different view of the situation. In particular, they know that the equilibrium pricing function for the risky asset is P = Po+Pie + P2(t- where p i = 0 when p = 0, increasing to l/R Ht), as p approaches 1. Also, p < 0 2 when p = 0, increasing to p = 0 as p increases to 1. It is clear, then, that the 2 equilibrium price tells the rational expectations trader something that the 'naive' trader doesn't even consider, namely, the aggregate supply of the risky asset. The risky asset, like any other asset, has a price that falls when supply rises and rises when supply falls, something that the rational expectations traders realize. Basically, they sell the risky asset to the 'naive' traders when there is a. small supply of it or buy it when supply is high and make a profit on doing so. 51 They realize that the price fluctuates due to both information and supply effects while the 'naive' investor believes that the price only changes because of new information entering the market. When p equals 1, the rational expectation traders have perfect information about the future payoff on the risky asset, thus making it riskless in their eyes. Because we implicitly assumed competition between traders, they will ensure that this now riskless technology is priced at the riskless technology price. This happens to make the price perfectly insensitive to supply fluctuations, just like the riskless technology price. What is happening, therefore, is that as the risky asset becomes Essentially, the rational expectations traders are acting as dealers. From their position, they can note fluctuations due to 'fads', or other effects, and use their stock of risky asset to satisfy demand when it is abnormally high and absorb abnormally high supply. In return for this service they realize a profit. 57 less risky - in the eyes of the informed rational expectations traders - its price becomes less sensitive to supply variability, which reduces the potential gains that the rational expectations traders stand to make by buying low or selling high to the 'naive' traders. That is, 'pre-info' expected utility decreases as p is increased because the dealer benefit decreases. 52 What we find when we turn to the 'extended' model is not surprising. The equations show that the term tending to increase expected utility due to a more efficient allocation of current and future consumption is once more present, 2 dp \l 1 ++ RR — — 21 21 ++ iR 2 \ ^ + 11 ++ RJ RJ 1 V 11 + R 2 2/2 ~ 2(l + iE)2 a a * J >0, and that we still have the term from the 'standard' model which depresses expected utility as we increase quality of information. dp 1 + 2; 1+ R dh £ ( 1 + 1 I fcb)«'** -' >) (l + JJ)+a»ffl<r;(l-^) The result, therefore, depends on the size of the supply variance, a . 2 For small cr , the benefits due to trading with the 'naive' traders are small to begin with, so 2 Returning to Table A . l , we have now moved from the first column - allocative efficiency - to the second - dealer benefit. To this point, we have discussed only the top half of the column, that is, the 'standard' model. Note that the positive signs shown in this column apply only to case 3, which involves asymmetric information: In the current, homogeneous information case, this column becomes strictly negative. 58 the loss of dealer benefit due to better quality information is small. For <x small 2 enough, increasing the quality of information will cause an increase in expected utility due to a better allocation of current and future consumption. Otherwise, the net effect will be a decrease in utility due to decreased supply sensitivity of the risky asset price, and a. consequent drop in the benefits of trading with the group of 'naive' traders. A.4.2.b HEDGING MODEL The 'standard', hedging model also allows no possibility of an increase in 'pre-info' expected utility if we increase the quality of information. As was discussed in the efficient market model section above, this is due to not having the opportunity to trade off current consumption against future consumption. The reason in this model for a decrease in expected utility given better quality information is that we have replaced the situation we had in case 1, namely, constant endowments of the risky asset, with a situation where the endowments are random. Since endowments are random, trading between rational expectations traders will be necessary to bring about market equilibrium. This is unlike the previous efficient market model case above, where no trading between rational expectations traders took place, only trading between rational expectations traders and 'naive' traders. Along with the need to trade with other rational expectations traders 59 in the hedging model comes vulnerability to information which changes those traders' perceptions of the asset one wishes to trade. No matter what endowment one has, it is certain that one will have to do some trading with other rational expectations traders. Before one has the opportunity to trade, however, information will be disseminated which tells everyone either that the asset you want to trade is desirable or that it is undesirable. One is exposed to the risk of revaluation of one's endowment. As the quality of information given out is increased, the extremes of revaluation become more probable, thus increasing the revaluation risk. This, naturally, is the reason for the decrease in expected utility in the 'standard' m o d e l . 53 In the 'extended' model, we have the same tendency to a decrease in expected utility as we increase the revaluation risk by increasing the 'informativeness' of information. In this model, however, we see a counteracting increase in expected utility. As we identified in the previous case, this potential increase in expected utility is due to the ability to trade off current and future consumption by altering one's investment in the riskless technology. In theory, we would expect an insurance market to arise to allow risk-sharing of this revaluation risk. As noted in a previous footnote (section A.3), however, we cannot allow this insurance trading to take place after traders have received their non-tradable asset endowments. Such a round of insurance trading would reveal the average non-tradable endowment, h, to the uninformed traders. Consequently, at the next round of trading - after receipt of information by the informed traders - the risky asset price would be fully revealing. The only way to allow insurance without destroying the partially revealing price of the risky asset is to allow traders to purchase an insurance contract before endowments of the risky asset have been received. Furthermore, settling up on these insurance contracts could not take place until after the beginning of period trading in the risky asset had taken place. 60 By examining the actual equations, _±_ (JL. dp \l 2 54 IJL-u+tlZLy-i-lJ] _ 21 + J T + R 1+ R 2 (l + R)a a (T 1 2 J 2 x 2 1 + RJ {p H 2((l + R)+a vZ<Tl(l-p ))2 - hi) 2 a\ 2 <> 0, we see that the positive term identified in the previous section as the benefit due to increased allocative efficiency is still present. In addition, we have a second, negative, term dependent on the deviation of one's endowment from the expected endowment. Notice that for a\ small, the negative term of the partial derivative above will likewise be s m a l l . 55 The positive term also changes, but in the limit as a\ approaches zero, the positive term dominates. This holds true even if we add in the negative effect from the following term. dp' In 1+2 TV 1 + R {1 + R) + tfielalil - p*) < 0 This expression also vanishes as a\ approaches zero. In conclusion, we can say that information will have value in the 'extended' model only if the variance of the average non-tradable asset endowment, o\, is not too large. This value arises from the ability to improve one's allocation of current and future consumption. As the endowment variance increases, however, the exposure to revaluation risk increases, decreasing expected utility. This may lead to a For this model, t ~ N{f + fih,^h) and /,• = / , so that (it — (/,• + hi) = p and (Tt = Ch. While varying trjj, we must keep (fih — hi)/(Th constant. 61 h — hi decrease in expected utility as quality of information is increased. In the analysis above, two negative contributions from the partial derivatives were lumped into what I named revaluation risk. But one of those factors, namely, the partial derivative of the logarithmic expression, was also present in the previous efficient market model analysis. In that analysis, the explanation of the negative effect on utility was that the rational expectations traders derived a benefit from trading with the group of 'naive' traders, 57 and that this benefit decreased as information quality increased. There is no outside group of 'naive' traders in this model, but the logarithmic term and its negative partial derivative are still present. Even though there is no outside group of 'naive' traders, the explanation of this term is similar. Simply stated, because every trader's pre-trading shadow price of the risky asset is different, 58 5 9 every trader benefits from the opportunity to trade. Traders with shadow prices lower than the market price benefit by selling risky asset. Traders with shadow prices higher than the market price benefit Returning again to Table A.2, we have moved to the third column describing revaluation risk. This was named the dealer benefit. After trading, of course, the price of the risky asset and each trader's shadow price are equal. The efficient market model may also be interpreted in terms of shadow prices. When the 'naive' trader supply component is positive, we can imagine that the 'naive' trader shadow price of the risky asset is zero. That is, they wish to sell at any price. The price of the risky asset does not fall to zero because the 'naive' trader supply component is finite. That is, the 'naive' trader is never the marginal trader. When the 'naive' trader supply component is negative (ie. they want to buy risky asset), then we can imagine that their shadow price of the risky asset is infinitely positive. That is, they wish to buy at any price. 62 by buying risky asset. The differences in shadow prices, of course, arise from the different trader endowments of the non-tradable asset. That is, this benefit is result of having the opportunity to hedge one's position in the non-tradable asset, h{. As this hedging benefit is the result of differences in pre-trade shadow prices, anything which diminishes these differences reduces the hedging benefit. As we saw in the efficient market model, however, as the quality of information increases, the risky asset becomes more and more riskless. A t the same time, the differences between shadow prices diminish, until in the limit where p = 1 we have all the shadow prices exactly equal to the price of the riskless asset. Therefore, this factor identifies the future benefit to be derived from trading with a market of individuals having varying shadow prices for the r ^ k y asset. As information quality increases, the differences between shadow prices diminish, causing the hedging benefit to decrease. The final term to consider is the negative term from the first partial derivative above. This term is the only element of expected utility which depends on the actual endowment, hi, and represents the exposure to revaluation risk discussed above. One receives an endowment, but information will be revealed causing everyone to revalue the asset you wish to trade. 63 In summary, we have 60 d •g-^K * {I d = ^^-(revaluation risk term) d_ + ^-^(allocation efficiency term) dp Q dp* + -^-(hedging benefit term), where we have shown that exposure to revaluation risk increases as information quality increases, thus depressing expected utility. Q ^—2 (revaluation risk term) < 0 Also, allocative efficiency increases as information quality increases, thus increasing expected utility, Q ^-r-(allocative efficiency term) > 0, dp £ and the hedging benefit decreases as information quality increases. 61 d dp- : (hedging benefit term) < 0 The net effect depends on the size of the endowment variance, <r\, which in turn determines how much hedging will take place in the model. If the market is used mainly as a vehicle for hedging other assets {o\ large), then better quality information may not be desired by traders. The reason is that it exposes them to revaluation risk. If hedging is not a very important use of the market (o\ small), The analysis for the standard model is the same except that the allocative efficiency term is lacking. 'Pre-info' expected utility therefore unambiguously falls in the 'standard' model when information quality increases. This term is shown in Table A.2 with a positive or negative derivative for the informed trader. The positive sign is possible only in an asymmetric information model. It is not present in a homogeneous information case such as this. 64 then better quality information is desirable, as it leads to increased allocative efficiency. Certainly this could have effects on the amount of information produced. If information produced immediately becomes public (this is, after all, a homogeneous information model), then our traders might not produce information to the point where the marginal cost of information equals the marginal benefit due to increased allocative efficiency. Obviously, if hedging is an important use of the market, then we have to also note the effects of information on hedging possibilities. A.4.3 Case 3: A s y m m e t r i c Information, Randomness Present In this last case we finally introduce the uninformed rational expectations trader. The result is that the expressions for 'pre-info' expected utility become very complicated. Luckily, some degree of simplification is possible and allows a. few conclusions to be extracted. Unfortunately, a complete analysis is not possible. As was done in case 2, the analysis will first be presented for the general model. The interpretation of the results will be given in the sections immediately following the general analysis. In addition, the 'standard' model will be fully treated before 65 continuing on to the 'extended' model. A.4.S.a STANDARD MODEL As was shown in section A.3.4.a above, the 'pre-info' expected utility of an uninformed trader in the 'standard' model is where 7 2 = l(^-fl 2 ) P l 2 2 <t>(\-<t>P ) ' 7$ V — « 2 d - ^ ) — It is shown in appendix 1 t h a t d ( 2 dp \ 2 _ 1 rp \ 2 1 + 27 / 2 + a R p i i f i+ h i ) )' 62 f = 0, efficient market model, A > 0, \ < 0, hedging model, A > 0, and — ln(l + 27 ) < 0 , 2 A>0. The second derivative below is negative in both models except for the case p = 1, at which point it is zero. 66 The conclusion we reach is that the 'pre-info' expected utility of the uninformed trader in the 'standard' model unambiguously decreases as we increase the quality of information. 63 6 4 _d_ ^* dp What about the 'pre-info' expected utility of an informed trader in the 'standard' model? To handle this question we can borrow a result from Grossman and Stiglitz (1980, pp. 406-407), namely, 65 which gives us If the added logarithmic term also has a negative partial derivative, then the 'preinfo' expected utility of both informed and uninformed traders decreases as quality of information increases. This is, however, not the case. It can be shown that the partial derivative of this term may be positive or negative (see appendix 1). As a result, the partial derivative of the informed trader's expected utility is, likewise, either positive or negative. dp Jn <> 0 The interpretation of this result is presented in the following sections. Note that the derivative below is zero only in the efficient market model at the point where p = 1. Otherwise it is negative. This esult can be shown to also hold in the general model. 67 A.^.S.a.i Efficient Market Model The previous section is quite definite about the effect of better quality information on the 'pre-info' expected utility of uninformed traders in this asymmetric information, 'standard' model. Their expected utility is unambiguously decreased when the informed traders receive better quality information. This is not surprising, as we saw in case 2 that even when every trader is informed the result is still a decrease in expected utility as the quality of information increases. In fact, if we subtract the utility function terms derived for the homogeneous information case from the terms from the current asymmetric information case (see appendix 2), we can sign the differences as follows: ( -5Tw)"( -i*' w ft (/+2rrr 66 '*) ' =0 ln(l + 2 y ) - l n | / + 2 l T ' | < 0. J As a result, the uninformed trader is unambiguously worse off here than he would be in the homogeneous information case. 67 The reason for the decrease in utility is not that the uninformed have poorer quality information than they would in the homogeneous information case. After all, in the 'standard' model better quality information decreases expected utility. The reason for the decrease in utility lies in a decrease in the dealer benefit that our rational expectations traders receive from trading with the outside group of 'naive' The second difference below is negative for both the efficient market model and the hedging model, except when p = 0 or p = 1, at which points it is zero. Except, of course, when p = 0 or p = 1, at which points he is just as well off.. 68 investors. 68 Note that since this benefit is the only reason that the uninformed traders are trading at all, it is not possible for all of the benefit to disappear. If it did, then the uninformed traders would presumably not trade, there now being no reason for them to do so. Since this situation does not arise, we are assured that the benefit to trading never completely disappears. The dealer benefit decreases because the uninformed traders are not able to distinguish perfectly between above average demand due to 'naive' traders versus above average demand on the part of the informed traders. When they sell to the 'naive' investors they end up better off than if they hadn't; when they sell to the informed traders they end up worse off than if they hadn't. O n the whole, however, they end up better off than if they stopped trading altogether. The benefits that they give away to the informed are less than the benefits that they receive from trading with the 'naive' traders. What about the 'pre-info' expected utility of the informed traders in this asymmetric information model? We know at least one thing, namely, that at the two end points, p = 0 and p = 1, the expected utility of the informed and uninformed converge at the values found in the homogeneous information case. Since the uninformed trader expected utility lies below what it did in the homogeneous information case, we would expect the utility of the informed traders to lie above what we found in the homogeneous information case. Recall from previous sections that the logarithmic term in the expected utility function was identified with the dealer benefit. 69 This turns out to be the case, as can be verified by examining the terms of the informed traders' 'pre-info' expected utility in this case. If we once again subtract the utility terms derived for the homogeneous information case from the terms of this asymmetric information case (see appendix 2), we can again sign the differences. 69 As a result, the informed trader is unambiguously better off here than he would be in the homogeneous information case. 70 Further, we can show that the partial derivative of the logarithmic term is guaranteed to be negative if the quality of information is raised high enough. 71 for This is consistent with a decrease of expected utility to the level of the homogeneous information model as p approaches one. The informed trader 'pre-info' expected utility is higher in this efficient market model than it is in the homogeneous information case because the informed traders are able to capture more of the dealer benefit here than they could when everyone was informed. This is due to the difficulty the uninformed have in distinguishing The first difference is identical to the one for the uninformed trader above. The second difference is positive for both the efficient market model and the hedging model, except when p — 0 or p = 1, at which points it is zero. Except, of course, when p = 0 or p = 1, at which points he is just as well off. Substitution has been made for at = <T». 70 between out of the normal demand due to 'naive' investors versus informed traders. There are, therefore, two effects occurring simultaneously. As p increases from zero, the portion of the dealer benefit captured by the informed traders increases. 72 At the same time, the total dealer benefit decreases. Certainly the informed traders are better off than they would be if everyone were informed, but are they better off than they would be if everyone were uninformed? In the homogeneous information case we saw that better quality information caused a drop in utility, so that if traders had the option of forming an enforceable cartel, they would do so. The cartel would in effect be an agreement not to use the signal, e, thus moving everyone to the higher expected utility point where p = 0. Would the informed traders also want to form such a cartel in this asymmetric information case? This question can be answered by looking at the partial derivative of the informed trader's expected utility function at the point p = 0. If it is possible for it to be positive, then it is possible for the informed traders to reach a higher expected utility than they would have if everyone was uninformed. (In the 'standard' model with homogeneous information this is the highest utility level possible.) Using the expressions derived in appendix 1, it can be shown that the partial derivative of the 'pre-info' expected utility of an informed trader in this asymmet- This occurs as p increases from zero. When p approaches one the opposite occurs: the portion of the benefit captured by informed and uninformed traders once again equalizes. 71 ric information model may be positive or negative at p = 0. The derivatives of the terms contained in the informed trader's utility function are at at 73 p = 0, p = 0. As A approaches zero, the second term also approaches zero. As <rs approaches 2 zero, it approaches —A /2, which is greater than -1/2. The last term, however, is 1/2 when p is zero. Both of these terms are due to the presence of the dealer benefit. The negative term is due to the shrinkage of the total benefit as information quality increases, while the positive term reflects the fact that the informed trader group initially captures a greater portion of the benefit. 74 We see, therefore, that if the benefits to trading with the outside group of 'naive' traders are too great, or there are too many informed traders, then the loss of these benefits due to increased information quality will outweigh the gain in benefits due to being one of the informed. If, however, A and a9 are sufficiently small, then the gain of benefits from being one of the informed traders dominates. In case 2, we saw that when all traders were informed, then increasing the quality Since this is the efficient market model, substitution has been made for p,t — (/,• -f hi) — 0 and <r = cr,. Returning to Table A . l , we have here the explanation for the '+/-' entries in the dealer benefit column for the informed trader. Naturally, in a homogeneous information model the portion of the benefit captured by the informed traders cannot increase - all traders are informed - so the positive term is lacking. That is, in the homogeneous information case the derivative becomes unambiguously negative. t 72 of information caused a decrease in expected utility. Here we see that when not all traders are informed, those that are informed may receive an increase in expected utility when information quality increases. Naturally, as we increase the proportion of informed traders, A, we expect to eventually see this possibility disappear. 75 In fact, we can state that when up to half of the traders are informed, then the informed traders will experience a net increase in expected utility when information quality is increased. This can be seen by adding together the two partial derivatives above. V U U ^ 7 7 ( 1 + 2 7 V"2 > 0, l+a%^ for A < i , at p = 0 Therefore, we see that the informed trader in the asymmetric information model may not only be better off than he would be if he were uninformed, he may be better off than he would be if everyone, including himself, were uninformed. This, of course, is not a situation conducive to the stability of a cartel. As we saw, the negative contribution to the partial derivative of the informed trader expected utility depended on the proportion of traders informed. When that proportion decreases to zero as a result of a cartel being set up, this term disappears. As a result, only the positive contribution to the partial derivative is present when a cartel has been created. There is always a great incentive to be the only informed As A approaches 1, the model approaches the homogeneous information case. In the homogeneous information case we know that better quality information reduces informed traders' expected utility. 73 trader and reap huge benefits. Because all the potentially informed traders have this incentive, we expect the classic solution to such a prisoners' dilemna, namely, that all cartel participants cheat, causing the cartel to fail. A.4-S.a.ii Hedging Model The analysis for the hedging model parallels that of the efficient market model in the previous section. The difference is that in this model there is no outside group of 'naive' traders. Instead, each trader receives an endowment, hi, which he hedges by trading in the risky asset market. As discussed in section A.4.2.b, this leads to a hedging benefit from trading which is analogous to the benefit from trading with a group of 'naive' traders. In addition, however, we introduce the risk of endowment revaluation into the model. 76 The expression describing the change in revaluation risk as information quality is changed is the following partial differential (see appendix 1). for A > 0 The important points to note about this term are that it is negative and that it approaches zero as A or approach zero. This means that all of the conclusions See section A.4.2.b. 74 regarding the uninformed trader which were made in the previous section on the efficient market model, also hold in this model. His expected utility decreases as information quality increases, and always lies below the level we would find if every trader were informed. The conclusions regarding the informed trader are also not changed. If hedging is an unimportant use of the market (<Th small), or there are few informed traders (A small), then the benefit of receiving a greater portion of the hedging benefits due to better quality information outweighs the losses of expected utility due to increased revaluation risk and decreased total hedging benefits. That is, we once again find a situation where the benefit of being the only informed trader is large, thus ruling out the possibility of a cartel. If hedging is an important use of the market (tr^ large), or there are a large number of traders informed (A large), then receiving a larger portion of the hedging benefit does not compensate for the losses resulting from an increase in information quality. A cartel would still not be successful, however, since imposing a cartel immediately sets A equal to zero. As we have seen when A is zero, the negative terms in the partial derivative of expected utility (such as the term above) disappear. This leaves just the positive attraction of receiving a larger portion of the hedging benefit, thus creating a large incentive for all cartel members to cheat and become informed. 75 c A4-S.b EXTENDED MODEL In the previous sections we have been able to do a reasonably thorough analysis of the 'pre-info' expected utility functions for informed and uninformed traders. Once we attempt to pass to the 'extended' model, however, the equations become relatively intractable. As a result, most of the analysis to follow will concentrate on the efficient market model. Because fit — {fi + A,) = 0 in this model, some simplification of the equations is possible. A.4.S.b.i Efficient Market Model The buildup of models looked at up to this point gives us confidence in predicting what to expect when extending the 'standard' model. To explain the dynamics of the efficient market, 'standard' model, we only needed to use two concepts, namely, the loss of total dealer benefit as information quality increased, and the unequal division of dealer benefit, with informed traders receiving more of this benefit than uninformed traders. 77 We also saw that as the proportion of informed traders decreased, the discrepancy As p increases from 0, the division of the dealer benefit becomes more unequal. As p increases to 1, the division once more equalizes. 76 between the amount of dealer benefit received by informed versus uninformed traders increased. This lead to instability of any cartel that might be proposed for informed traders. What difference do the previous models predict when we extend the 'standard' model? From what we have seen, the only effect on the efficient market model is that a new benefit appears. This new benefit arises from the possibility of trading off current consumption against future consumption by trading in the riskless technology. Unlike the 'standard' model, therefore, we should not expect to find an unambiguously negative partial derivative for the uninformed trader expected utility as we vary p . 2 The partial derivative should have the possibility of being posi- tive, to allow for the fact that counteracting the decrease in dealer benefit lost as information quality is increased, we have more efficient allocation of consumption. We can verify this prediction, even though the proof is long (see appendix 3 ) . The prediction, in fact, makes sense intuitively, since we would expect the positive benefit of improved consumption allocation to show up most strongly when the 'informativeness' of the risky asset price, ie. <j>, is highest. Since the price becomes fully revealing when p = 1, it makes sense to find these benefits showing up as we approach a fully revealing price. Naturally, since we previously showed that the homogeneous information case 77 expected utility was higher in the 'extended' model than in the 'standard' model, we know that at p = 1 the uninformed traders must have higher expected utility in this asymmetric information, 'extended' model than they have in the asymmetric information, 'standard' m o d e l . 78 This by itself, however,'will not guarantee that we can find a situation where uninformed trader expected utility increases as information quality increases. In order to have the partial derivative above guaranteed positive at p = 1, we need to have the losses of dealer benefit caused by increased information quality tailing off to zero as information quality increases to 1. This is, in fact, the situation we have, as is shown below (see appendix 3 ) . £ b ( 1 + * + 2 79 ^ = TTiibf5= ' 0 a " = 1 Therefore, at the same time as the loss of dealer benefit decreases to zero, the benefit due to better consumption allocation is in its region of greatest increase, resulting in a guaranteed upswing in uninformed trader expected utility as information quality increases to 1. Unfortunately, the expressions for the 'pre-info' expected utility of the informed trader in this asymmetric information, 'extended' model are not analytically tractable, so no further analysis can be presented here. We would expect, however, that an informed trader in the 'extended' model would have higher expected This is because the risky asset price is fully revealing at p = 1, thus creating a homogeneous information situation. Recall that the logarithmic term has been associated with the dealer benefit in previous sections. 78 utility than the corresponding informed trader in the 'standard' model, due to the added benefit from better allocation of consumption. The reason for expecting this is the fact that the consumption and investment decisions were shown in section A.3.3 to be independent of one another given negative exponential utility. Changing the consumption allocation opportunities should, therefore, not affect the investment decision. Since we are improving the consumption opportunities only, we should expect an increase in utility. This should hold for both the informed and uninformed traders in this model. A.^.S.b.ii Hedging Model Unfortunately, this model which is potentially the most interesting is also the most difficult. The expressions for informed and uninformed trader 'pre-info' expected utility are unmanageable, and leave us no choice but to speculate using the concepts built up in previous sections. As we have seen, the dynamics of this model depend on four factors: the decrease of hedging benefits as information quality increases, increase of revaluation risk as information quality increases, increased efficiency of consumption allocation as information quality increases, and the fact that the hedging benefit realized by 79 an informed trader is higher than the hedging benefit realized by an uninformed trader. When we pass from the efficient market model to the hedging model, the only qualitative change that is expected is due to the addition of revaluation r i s k . 80 We would expect both informed and uninformed trader expected utility to be lower in the hedging model than in the efficient market model, with the divergence between the two models increasing as information quality increases. 81 Because of the multiplicity of effects, further analysis is not possible. The effects due to the hedging benefit are analogous to the effects due to the dealer benefit in the efficient market model. Revaluation risk increases as information quality increases. 80 A.5 S U M M A R Y A N D CONCLUSIONS This study examines in detail several asymmetric information, rational expectations models similar to the Grossman and Stiglitz (1980) model. It provides a detailed discussion of the welfare effects following an increase in the quality of information given to the informed trader group in these models. In case 1 - homogeneous information, non-random supply - an allocative efficiency effect was identified in the 'extended' model but not in the 'standard' model. In the 'extended' model, when traders received better quality information, they were able to allocate their wealth more efficiently between current and future consumption, thereby increasing their welfare. The vehicle allowing this trade-off was shown to be the riskless technology. It was also pointed out that replacing the riskless technology with a fixed supply of riskless asset would prevent such an allocative efficiency effect and make the 'standard' and 'extended' models qualitatively equivalent. In case 2 - homogeneous information, random supply - a dealer benefit was identified in the efficient market model. The price of the risky asset was shown to fall when supply rose and rise when supply fell, something that the rational expectations 'dealers' were aware of, but which was unknown to or unobservable by the 'naive' efficient market traders. Because they were able to observe unusually high demand when it occured, and satisfy that demand from their own stock of risky asset (and, conversely, absorb abnormally high supply of the risky asset), the 81 rational expectations traders made a profit by trading with the 'naive' traders. In the 'extended' model, as the quality of information given to traders increases, the size of this dealer benefit decreases, resulting in an ambiguous net effect on welfare. The net effect was shown to depend on the size of the supply variance due to 'naive' traders, a . If the dealer benefit is small (a small), then the loss 2 2 of benefit is small as information quality increases, and may be outweighed by the increase in allocative efficiency benefits, leading to a net increase in welfare. For a large dealer benefit [a large), however, the loss of benefit is large and may 2 outweigh the increased allocative efficiency effects, leading to a net decrease in welfare. In the hedging model of case 2, a benefit analogous to the dealer benefit of the efficient market model was identified. This benefit is the result of differences between traders' pre-trade shadow prices for the risky asset (due to different endowments of the non-tradable asset). Because trading with a person who has a different shadow price for the risky asset provides both you and the other person with an increase in welfare, this hedging benefit accrues to all parties. Increasing the quality of information given to traders, however, diminishes the differences between pre-trade shadow prices and results in a decrease in this hedging benefit. In addition to the hedging benefit, another factor was identified in the hedging model. Because the endowments held by rational expectations traders differ, trading between rational expectations traders will take place. This is unlike the 82 situation in the efficient market model, where rational expectations traders trade only with the 'naive' trader group, not between themselves. Each trader is thus exposed to the risk that his endowment will be revalued when information is disseminated to the other rational expectations traders. As the quality of information given out increases, the extremes of revaluation become more probable, thereby decreasing expected utility (by Jensen's inequality). The net effect on expected utility in the hedging model following an increase in information quality depends on the size of the endowment variance, crj*, which in turn determines the extent of hedge based trading which occurs in the model. If the market is used extensively for hedging (crj[ large), then even though better quality information results in an allocative efficiency benefit, it will also result in a large decrease in hedge-based usage of the market (ie. a decrease in hedging benefit and increase in revaluation risk). This effect may outweigh the allocative efficiency increase, leading to a net drop in welfare. 82 In case 3, the uninformed rational expectations trader is introduced, resulting in an asymmetric information model. Tables A . l and A.2 summarize the effects of an increase of information quality on the different factors outlined above, and the For example, if market prices from one sector of an economy were used as indicators by another sector of the economy, the government might feel that it would be to the common good to collect and disseminate information. Traders in the different markets would have access to this information and would trade on the basis of it, thus producing prices which reflected a greater amount of information than would otherwise be the case. We can see from the hedging market analysis that traders in some markets could be against such a scheme. Certainly, the result would be prices which convey better information, thereby increasing investment efficiency in the economy, but at the same time hedging opportunities would be decreased. 83 net effect on expected utility. In the 'standard', efficient market model, it was shown that the uninformed trader is unambiguously worse off than he would be in a homogeneous information scenario. The informed investor was shown to be unambiguously better off. The reason for this is that the dealer benefit - which is shared equally in a homogeneous information situation - is unequally distributed in this asymmetric information case, due to the difficulty which uninformed traders have in distinguishing between abnormal demand based on 'naive' trader activity and demand due to receipt of good information by informed traders. It was also shown in the 'standard' model that it is possible to find informed traders enjoying a higher level of utility than they would have if everyone was uninformed (which is the highest utility level possible in a homogeneous information case). This was shown to be possible even when up to half of the traders in the model belonged to the informed group. As the size of the informed trader group decreases, the benefits of being one of the remaining informed traders increases until, at the limit where no trader is informed, we find that becoming the only informed trader is guaranteed to result in an increase in utility. This finding rules out the possibility of the informed traders voluntarily deciding not to receive information. 83 These arguments can be applied to the 'extended', efficient market model by using the fact that, the investment and consumption decisions in these models This argument is only valid if the information to be received is not of perfect quality. Given perfect information, the price of the risky asset becomes perfectly revealing regardless of the fact that only one trader is informed. Given less than perfect information, since there are an infinite number of traders in the model this problem does not arise. 84 are independent of one another. Since the 'extended' model adds a consumption opportunity, while leaving the investment opportunities of the 'standard' model unchanged, we would expect only that the utility curves for both informed and uninformed traders would lie above their counterparts in the 'standard' model, with the separation between these pairs of curves increasing as information quality (and thus allocative efficiency) increases. Turning finally to the hedging model, although we cannot analytically verify that the conclusions reached in the efficient market model also hold here, extension of these conclusions to the hedging model seems intuitively correct. The only aspect which must be taken into account when passing from the efficient market model to the hedging model is the factor of revaluation r i s k . 84 When addressing the question of whether or not becoming the only informed trader would result in an increase in utility, we must look to the effects of revaluation risk on the conclusions reached above. Given that there are an infinite number of traders in the model, we can assume that the informativeness of the price system, <f>, is infinitesimal when only one trader is informed. This, of course, is the reason that being the only informed trader in the efficient market model guaranteed an increase in utility. Since revaluation risk only occurs in situations where other traders are able to obtain information about the risky asset, we see that revaluation risk will also be infinitesimal when only one trader is informed. The reason is that the only source of information The hedging benefit of the hedging model is analogous to the dealer benefit of the efficient market model. 85 that is available to the uninformed traders is the risky asset price, which is only infinitesimally revealing. 86 A.6 REFERENCES Epstein, L . G . and S.M. Turnbull, 1980, Capital asset prices and the temporal resolution of uncertainty, Journal of Finance 35, No. 3, 627-643. Grossman, S.J. and J.E. Stiglitz, 1980, On the impossibility of informationally efficient markets, American Economic Review 70, No. 3, 393-408. Hakansson, N.H., J . G . Kunkel and J.A. Ohlson, 1982, Sufficient and necessary conditions for information to have social value in pure exchange, Journal of Finance 37, No. 5, 1169-1181. Hellwig, M.F., 1980, On the aggregation of information in competitive markets, Journal of Economic Theory 22, 477-498. Hirshleifer, J., 1971, The private and social value of information and the reward to inventive activity, American Economic Review 61, 561-574. Kraus, A. and G.A. Sick, 1979, Communication of aggregate preferences through market prices, Journal of Financial and Quantitative Analysis 14 (Proceedings), No. 4, 695-703. Verrecchia, R.E., 1982, Information acquisition in a noisy rational expectations economy, Econometrica 50, No. 6, 1415-1430. 87 A.7 A.7.1 APPENDICES A p p e n d i x 1. Case 3, Standard M o d e l : Derivatives In this appendix several partial derivatives are calculated for use in sections A.4.3.a, A.4.3.a.i and A.4.3.a.ii. First, we consider terms from the 'pre-info' expected utility function of the uninformed trader in the 'standard' model. Following this, the informed trader utility function is considered. The expressions in section A.4.3.a do not simplify very easily. We can, however, perform some simplification by introducing a new variable, f, t Using this new variable, we find that 4>P - Rpi = - l-p +Ap £' 2 2 2 l(l-p )(l-0p ) 2 Rpo = Hx - aRPl (fi + hi) = l-p 2 + Ap ^ 2 ap(^(l-p )+A Q 2 2 l - p + Ap e 2 88 {fi + hi). Substituting these expressions into those for 7 , rp and u, we find 2 1 AVf(l-^) a 2fll-p» +A p a 0 * 2 7 ._ aAp£(l-p )(l-^ ) *x f V 2 (l-p v^l 2 2 2 + \p S) 2 * , a ^ ( l - P ) + 2A Q U 2 1 - P + Ap e 2 ' w = cD + aRrrti + afit (fi + hi), 2 Q= 2 2 2 _a * x (l-p )(l-+p ) l a ^ ( l - p ^2 (l-p2 2 ) + ( l - ^ ) 2 A p 2 ^ 2 2 Using these expressions, it can be shown that ,+,72,2 ^(W 2 2 + Ap fl 2 2 2 ^ ( i _ p 2+ V 2 2 + AV$ (l-^ ) 0 2 2 2 _ fll - , ) + 2^(1 - p )Ap e + A V^ 2 2 2 ^ ( l - p + Ap 0 2 and £(1 + 2 7 2 ) - ^ 2 2 2 2 2 2 la a (l-p ) (l-<j>p ) - (i-^ 2 + A ^ ) ...a r f I* " (A + M 2 so that ~ 1 u —- 1> 2 2 1 + 27 1 a^f 2 2 2 2 4>(\-p ) (l-<f>p ) 2 [IH ~ (fi + hi)] - (fi + hi)'< <f>(l - p ) + 2^(1 - P ) A p ^ + A p £ 2 2 2 2 89 2 2 2 and a R m i 2 = U ~ i i + 27 +2 + m f i+ ^ " \ ^ a2<r + h *U* *f <f>(\ - p ) + 20(1 - P ) A p £ + A p f 2 2 2 2 2 2 of 2 At this point we would like to take the partial derivative of the above expression with respect to p . First, however, we define yet another variable, a, 2 a = a c- <7 (l-p ), 2 2 2 2 which gives us , _ J V _ _ a(l-p ) + A p ' 2 2 a(l-p ) a(l-p ) + A p ' 2 2 s 2 2 2 and * l-p ) (Wp ) = 2 ( 2 2 - ^ - l T 0(1-p ) +20(1-p )Ap £ + A p £ 2 2 2 2 2 2 ^ + . r l 2 _ A p ( l - p ) [(l - p )(a + A p ) + (a + Ap ) ] [a(l-p ) + A p l 2 2 2 2 2 2 2 giving 2 2 2 2 2 2 0(l-p ) (l-^p ) 2 0(1 - p ) + 20(1 - p )Ap f + A p e (l-p )(cr + A V ) 2 2 2 2 2 2 2 2 2 ( l - p ) ( a + A p ) + (a + Ap ) 2 2 2 2 2 It can be shown tat the partial derivative of the expression above is 2 0 ( i - p ) ( i - 0 p )2 a ap 2 2 <t>{\ 2 - p ) + 20(1 - p2)Ap e + A p f 2 2 2 2 A(q + Ap )[(a + A p ) + (1 - A)a] 2 2 2 [ ( l - p ) ( a + A p ) + (a + Ap )] 2 < 0, 2 for A > 0, 90 2 2 2 2 which tells us that d dp _ 1 ( 2 \ 2 \ f = 0, 21 + 2 7 7 \ < 0, rp 2 efficient market model, A > 0, hedging model, A > 0. Similarly, defining yet another variable, 5, 6 = cc(i-p we can express 7 2 2 2 + 2 2 \p )+\ p , as ^ _ 1(1 - p ) a ( a + 2 2 6 2 The partial derivative of this with respect to p 2 2 ap ' < 0, 5 AV) 2 2 can be shown to b e 85 3 for A > 0 This tells us that dp 2 ln(l + 27 ) < 0. 2 Turning now to the utility of the informed trader, we see from section A.4.3.a that the informed trader 'pre-info' utility for the 'standard' model differs from that of the uninformed trader by the addition of the term »(V#) If we make the necessary substitutions, we find that 1 - <pp 1 8 5 2 -p 2 2 a + \p a(l - 2 2 2 p ) + A p2' The derivative below is negative except for the case p = 1, at which point it is zero. 91 which has a partial derivative which may be either positive or negative. (l-<f>p \ d dp A.7.2 2 _ 2 tt[q(l-/? )-AV] \ 1 - p J - (1 - p»)[a(l - p ) + 2 2 2 AV] a A p p e n d i x 2. Case 3, Standard M o d e l : Differences In this appendix, terms from the 'pre-info' expected utility function in the 'standard' model, homogeneous information case (case 2) are subtracted from the corresponding terms in the asymmetric information utility functions. The results are used in section A.4.3.a.i. From appendix 1, the relevant parts of the uninformed trader expected utility in this asymmetric information case are: 2 U 1 ib ~ 2 1 + 2^2 a R m i + a ^ = fi + 1 ~ 2 a2<T ^*' + k i ) 2 1 + ln(1 + ^ ) = h ( a(a + A V ) 2 (a + A 2 p 2 ) ( l - p 2 ) ( 1 + + [fit — (/„• + a + A/) 2)2 a hj)] 2 2 ^L^±iVl). The corresponding terms from the homogeneous information case of section A.4.2 92 are 86 n - i*'(j+2rr')" * 1 = aRmi + anx {fi -, + 2 + hi) - \ ^ l i f i . W-(fi \l + a) + hi) 2 + hi)}' ° 2t l n | / + 2 r r | = ln(l + o). , If we subtract the terms derived for the homogeneous information case from the terms of the current asymmetric case, we find the differences below 1 / a{a + A p ) 2 a_\ 2 + A p ) ( l - p ) - r ( a + Ap ) 2 V(a 2 2 2 2 aV(l-A) 1 I - [ft + hj)) l + 2 *t 2 2 2 [fH - (fi + hi)} 2 2(1 + a)[(a + A p ) ( l - p ) + (a + Ap ) ] 2 87 2 2 a 2 2 2 efficient market model, hedging model, = 0, > 0, k ^ l + a ( l - p ) ( « + A p )/5 ^ < 2 2 2 2 2 Q Turning now to the informed trader 'pre-info' expected utility, U ~ IT+2^ 2 = a R m i + ° ^ ( / ' 8 6 8 7 k i ) q(a + 1 2(a + \ a2<T*( fi ~ 2 h i 2 2 ? [jn-(/.• +*,-)]* A p ) 2 + A p ) ( l - p ) +{a 2 + + Ap ) 2 2 a 2 <• **>) - * {(*&fM 0 " -^ + These expressions may be pendix 1. The first difference below or p = 1, at which points models except when p = 0 + found using an approach similar to that used in apis positive in the hedging model, except when p = 0 it is zero. The second difference is negative for both or p — 1, at which points it is zero. 93 (1 In the efficient market model, we can ignore the first term, since fit — (fi + hi) = 0. If we subtract the corresponding logarithmic term for the homogeneous case from the one above* it can be shown that, the difference is unambiguously positive. * (Ul-pa) Av) V , // cr + A V 1+ W 2 2 a (l-p )(a + - ) ) ~H l+ a ) ^° AV)\\ + A.7.3 88 A p p e n d i x 3. Case 3, Extended M o d e l : Derivatives In this appendix several partial derivatives are calculated for use in section A.4.3.b.i. The utility function of interest is the 'pre-info' expected utility of the uninformed trader in the efficient market, 'extended' model. From appendix 1, we have 89 2 2 7 1 A V £ ( 1 - <f>p ) 2 l<p(l-p* \PW + rp = aax p\J~4>(\ + 2-y )/, 2 2 (j = Q + aRrrii + af , 2 ^ = - \« °lll ~ *P (1 + 2 7 ) ] / 2 2 2 Except at the points p = 0 and p = 1, where the difference is zero. As this appendix is concerned only with the efficient market model, substitution has been made for fit = /> fi = / , A,- = 0 and a = at . Simplification has been performed. t 94 Using the above, we can easily find that 1 / \ + R\ 1 ^ \ 21 + A + 2 7 / 2 2 l 2 2 af -^a <T x f = aRmi + 2 2 2 1 + 272 +-* * x R<pp ( 2 2 ) f. The first step to finding the effect of an increase in information quality on the uninformed trader expected utility is to find the partial derivative of the above. Using the results shown in appendix 1, we can show that Given the above, we can see that 2 dp 2R<f>p 2 (l + r + 2 7 ) 2 2 d-y dp 2 2 2 1 + 2-y d(4>p ) 2 l + fl + 7 2 dp < > 0. 2 2 For example, at p = 0, d(<f>p )/dp = 0, so that Combining this with ^ Q l n ( l + i2 + 27 )) 2 95 <0, a t p = 0, 2 we have d —* K dp iv < 0, at p = 0. However, at p = 1, we have dy so that d 2 dp 2 [+r ( r r ^ ) J = r b > ' = a t Combined with 3/J 2 Q l n ( l + i2 + 27 )) 2 =0, we have d d p 2 K i u > 0 , 96 atp=l. atp=l, Table A . l . Decomposition of utility functions into component factors for the asymmetric information case of the efficient market model. Entries in the table are the signs 2 of the partial derivative with respect to p of the component factors. The last column shows the sign of the partial derivative of the utility function itself with 2 respect to p . alloc effic 'standard* uninformed** model informed 'extended' uninformed model informed a dealer 6 net c benefit - - +/" +/+/" e + + - ° The allocative efficiency benefit is only present in the 'extended' model. The dealer benefit is only present in the efficient market model. Its analog in the hedging model is the hedging benefit (see Table A . 2 ) . 2 This column shows the net effect on the utility function of an increase in p . Uninformed refers to the representative uninformed trader utility function, informed to the representative informed utility. Note that this ambiguity of sign is present only in the asymmetric information case. In the homogeneous information case, all entries referring to this note are strictly negative. b c d e 97 Table A.2. Decomposition of utility functions into component factors for the asymmetric information case of the hedging model. Entries in the table are the signs of the 2 partial derivative with respect to p of the component factors. The last column shows the sign of the partial derivative of the utility function itself with respect 2 to p . alloc effic 'standard' uninformed model informed 'extended' uninformed model informed 0 hedging* reval benefit risk - 6 + + - - c net d +H +/+/- * The allocative efficiency benefit is only present in the 'extended' model. The hedging benefit is only present in the hedging model. Its analog in the efficient market model is the dealer benefit (see Table A . l ) . Revaluation risk is only present in the hedging model. 2 This column shows the net effect on the utility function of an increase in p . Uninformed refers to the representative uninformed trader utility function, informed to the representative informed utility. Note that this ambiguity of sign is present only in the asymmetric information case. In the homogeneous information case, all entries referring to this note are strictly negative. 6 e d e 98 Figure A . l . The sequence of events taking place in the models. t—5 t—4 t—3 t—2 1 ^0 ^1 Endowments of the risky asset and riskless technology are received at this point. Common knowledge of all traders' utility functions is disseminated. Trading to a Hakansson, Kunkel and Ohlson (1980) 'no-information' equilibrium position is allowed (in the efficient market and hedging models only, not in the Grossman and Stiglitz model). Endowments of the non-tradable asset are received (in the hedging model). Common knowledge about who will be in the informed trader group plus the distribution functions of all random variables is disseminated. Calculation of 'pre-info' expected utility. Receipt of information by the informed trader group. This is the beginning of period. Trading in the risky asset takes place. Consumption takes place (in the 'extended' model). 'Post-info' expected utility is calculated. This is the end of period. The risky asset and riskless technology payoffs are received and consumed. 99 (This page intentionally left blank.) 100 P A R T B: B o n d O p t i o n P r i c i n g , E m p i r i c a l Evidence 101 B.l INTRODUCTION O n October 22, 1982, US government bond, note and treasury bill option contracts began trading on the American Stock Exchange and the Chicago Board Options Exchange. Because these contracts have default-free government instruments as their underlying securities, the Brennan and Schwartz (1983a) two-factor model for pricing default-free, interest rate dependent options was used to compute theoretical prices for comparison with the actual market quotations now available. It is only recently that the Brennan and Schwartz two-factor model was extended (Brennan and Schwartz (1983a)) to the valuation of interest rate dependent options. 1 In addition to this model, two other models - Courtadon (1982) and Ball and Torous (1983) - have been recently proposed for the valuation of interest rate dependent options. These models, however, are not considered in this study for the reasons outlined below. The Ball and Torous model uses the prices of two pure discount bonds as state variables to provide an analytic solution for the value of a European option on a pure discount bond. Although this model provides an analytic solution method, it does not appear extendible to the valuation of American options written on coupon bonds. As the Brennan and Schwartz two-factor model is based on numerical solution procedures for deriving option values, it is not limited in this fashion. The extension of contingent claims theory to interest rate dependent options was preceded by applications in the area of valuation of interest rate dependent claims such as government bonds. See Cox, Ingersoll and Ross (1978), Vasicek (1977), Richard (1976) and Brennan and Schwartz (1977, 1979, 1980, 1982, 1983b). 102 The Courtadon model, like the Brennan and Schwartz model, is also based on numerical solution procedures. In Brennan and Schwartz (1983a) it is shown that the Courtadon single-factor model can be viewed as a special case of the Brennan and Schwartz model. In their comparison, however, Brennan and Schwartz assumed that the stochastic process parameters and market preference parameters were given, and therefore the same for both models. In practice, these parameters are not given, and should be estimated separately for both models. It was felt that the substantial amount of effort required for reestimation of parameters for the Courtadon model would be beyond the scope of this study, and so a direct comparison of the Brennan-Schwartz and Courtadon models has been left as a topic for future research. The tests that were performed in this study examined whether profits could be made by writing options when the Brennan and Schwartz model indicated that they were overvalued and buying them when undervalued. The trading strategy consisted of forming a theoretically riskless, zero-investment arbitrage portfolio, where the proper proportions, or hedge ratios, of assets held in the portfolio were calculated using results from the Brennan and Schwartz theoretical framework. It was found that the trading strategies did generate arbitrage profits, but that these profits were not sufficient to cover reasonable transactions costs that would be incurred if the strategies were actually implemented. The Brennan and Schwartz 2 model prices appear to be sufficiently accurate to justify practical use of the model for valuing interest rate dependent options. 2 As noted in the conclusions to this study, care mst be exercised when interpreting the presence of these apparent before-transactions-costs arbitrage profits. 103 B.2 PRICING THEORY The model presented in this section is similar to the multi-factor model for pricing contingent claims developed by Cox, Ingersoll and Ross (1978). Unlike Cox, Ingersoll and Ross, however, who develop a full general equilibrium model of an economy, the theory presented here relies on arbitrage arguments. The basic assumption of the model is that the underlying uncertainty in the economy can be modelled by a multivariate Wiener process w(<) evolving stochastically through time, 3 4 and that there is an n-vector of state variables x(£) which are related to the Wiener processes by means of the Ito stochastic differential equation dx = /?(x,y,*) dt + n(x,y,t) dw(t), where the Wiener process is characterized by E ( d w ) = 0, dvrdw' = Idt, and the m-vector y(t) of non-stochastic state variables is described by dy = l(y, t) dt. In order to simplify matters I also assume that nn' is of full rank, and without loss of generality also let x and w 3 4 both be n-vectors. The only other critical A very good introductory work on stochastic processes, also providing a review of the literature, is Maliaris and Brock (1982). I also make the standard assumptions that there are no taxes or transactions costs, no restrictions on short sales, and that trading is allowed to place at any point in time, ie. continuously. 104 assumption is that the assets I am pricing do indeed have prices that are functions only of the state variables x, y and t. 5 a = «(x,y,0 For example, Brennan and Schwartz have typically used the instantaneous riskless rate of interest, r, and the yield on a consol bond, /, as their two state variables when pricing default-free government bonds. Hopefully, most of the uncertainty that affects the prices of the wide variety of government bonds is also reflected in the movement of these two state variables. Indeed, the good fit of the Brennan6 Schwartz two-factor model to actual market bond prices shows that this is not such a bad assumption to make, but if they had chosen instead to use the price of gold and the Dow-Jones market index as state variables perhaps their results would have been different. B.2.1 Asset Pricing Theory In the non-stochastic situation where we have z as a function of y and t, we can 5 6 In theory, this is a perfectly valid assumption to make. The problem in practice is to identify exactly what x and y are. Note that using two other factors related to r and / by an invertible function would be equivalent to using r and /. That is, it is not essential that r and / be the correct underlying factors, but just that they are related to the two underlying factors. For example, use of x = r — / and y = ln(/) as two factors would be theoretically equivalent to the use of r and /. 105 find the differential of z by straightforward partial differentiation as follows: dz(y, i) — V z dy + z* dt, (z non-stochastic), y where V'y = (d/dyi,...,d/dym ) the matrix is a vector operator, and V z is shorthand for y ® z. When z is a stochastic variable, however, the situation becomes more complicated and we must use Ito's Lemma for the differential. dz(x,y, t) = V z dx + Vj,z dy + zt dt + ^ ( r / r / ' V ^ V ^ z dt x = (v z(3 + V x f f z 7 + z t + \tv{nr,'Vx V'x )^ dt + V^zrj dw = Z(/x dt + s dw) where t r ^ ' V . ^ ) = £ 2 fiJ ( r ? » 7 % - d /dxidxj w'V.Vj;) (ie. the trace of is a scalar operator acting on all the elements of z, and Z = diag(z,), fi = Z" QtrtwV ' , V )z + V^z /? + V z 7 + z 1 s= Z - 1 x y • t V zr;. x The next few steps are the heart of the pricing theory c<nd involve imposing a 'no arbitrage' rule on the assets. This is nothing new to finance and can be found as far back as Debreu (1959). More recently, we see the arbitrage condition in Black and Scholes' (1973) seminal option pricing paper, and find it forming the central core of Ross' (1976) arbitrage pricing theory. In its usage here we form an arbitrage portfolio with total investment p = S'l, where 6 is a vector of the dollar amounts invested in the different assets. The 7 7 That is, Si is the dollar amount invested in asset t which has the unit price z,-. 106 return on this portfolio is, therefore, l — = <5'Z (dm +cdt) = S'(n + Z~ c)dt + 8'a dw, _1 P where c is a vector of payouts per unit of asset (for example the coupon payment on a bond). Now, if we let S be the subspace spanned by the columns of s, any 8 chosen from the orthogonal complement of S will give 5's = 0, making the portfolio return totally non-stochastic, that is, riskless. By the no arbitrage rule, the return on this portfolio must be exactly the return that one would receive from a riskless investment, that is, 8'1 r dt. Therefore, we must have l V<5e5x. 6'(p + Z- c-rl)=0, This can only be true for all 8 G S" , however, if we have 1 /i+ Z _ 1 c- r l e S, in which case we can state that there is a vector function A ( x , y , t) which satisfies p, + Z _ 1 c- r l = sA. Cox, Ingersoll and Ross (1978) showed that the function A can be interpreted as the vector of prices that the market assigns to the uncertainty in the economy represented by the underlying Wiener processes, or, for short, the 'market price of risk function'. 107 At this point the theory is basically finished, since the arbitrage result above gives us a partial differential equation for any asset price z. Before replacing pi and s in the arbitrage result above, however, since I am mainly interested in assets which have a maturity date, I will replace time t with time left to maturity r. Note that this results in %r = — z t since dr = — dt. Combining this with the definitions of \x and s, the partial differential equation for an individual asset price becomes j t r ( w ' V , V'x )z + Vx z(f3-ri\) + Vy zi + c-rz = zT . The only difficulty now is that we don't have expressions for the functions /?(x,y, t) and rj(x,y,t) arising from the stochastic differential equation for x , l(y,t) from the equation in y, nor for the market price of risk function A(x,y, t). If we knew these functions, then in principle the partial differential equation would be solved. 8 There is nothing else to be done about /?, rj or 7 here, but Brennan and Schwartz (1979) made an important observation about the market price of risk function. As they pointed out, if the price of one of the assets in the economy is known as a function of the state variables and time, then we can very simply identify a linear combination of the risk prices A. That is, given a known price function z we can calculate n and s to produce ft + c/z — r = s A, which is a linear combination of the market prices of risk. If we had a full basis, X{,, of these known asset price functions then we could replace A altogether by A =B^ (/x + Z ^ C f r - r l ) , 1 1 6 In practice, of course, we would still have the problem of estimating the solution. 108 \ and reduce the asset pricing partial differential equation to fi + c/z- r = as^ (n + Z ^ c j , - r l ) . 1 b The only exception to this rule concerns assets which have prices dependent on the instantaneous riskless rate, r. Any asset with a price dependent only on the instantaneous riskless rate must have ft + c/z — r = 0, thus making these assets useless for determining A even if the riskless rate r is one of the state variables. B.S.l.a THE BRENNAN-SCHWARTZ MODEL In this section we look at a special case of the theory which has been proposed for pricing options on default-free bonds, namely, the Brennan-Schwartz model, and see how simplifications were achieved by using several quite reasonable assumptions. First, as mentioned above, Brennan and Schwartz chose as their state variables the instantaneous riskless rate, r, and the yield on a consol bond, / . 9 Already, a very important choice has been made. As pointed out in Brennan and Schwartz (1979), since we know that the price function of a consol bond, V , is V(x,t) = l/l, There are no non-stochastic state variables, y . 109 choosing / as a state variable allows us to simplify the partial differential equation by solving for a linear combination of the price of risk function A. In order to simplify matters even more, Brennan and Schwartz made the further assumption that (3 and rj are time-independent and that the correlation between the stochastic processes for r and / is independent of the levels of r and /, that is, /?(x, *)=/?(*), i7(x,*) = i7(x), where p is a constant correlation coefficient. This allows us to define a. transformation of A, giving Now, when we evaluate the partial differential equation for the known consol bond price function we find that V,V = (0,-1), t r ^ ' V ^ V Vr = 0, = 2^f and so that i t r ( w ' V , V ) V + VX V((3 - <r\ ) + x x c-rV = 7 f - £ ( f t - * a A , ) + (*-r)j = VT =0. This gives us an expression for /32, 2 110 c=l = lV, and allows us to eliminate A( from the partial differential equation. The simplified equation is -0- Z 2 rr + pO-\<72Zrl + ^l^ll + {Pi ~ <7\K)zr + K^Z/? + I - r)z{ + C - TZ = Z . T It might seem that there is little advantage in all of this maneuvering to replace the unknown A/ since / is itself unobservable. There is, after all, no consol bond outstanding in the United States or Canada. In effect the problem boils down to a choice between either (a) using / as a state variable in the theory and then finding some observable proxy to / in order to estimate the covariance function a of the stochastic process of r and / or (b) using the yield of an outstanding bond as a state variable and then estimating the additional price of risk function. The advantage of simplicity seems to lie with the first choice. For example, if we used the yield on an issued bond as a state variable we would have to worry about the changing time left to maturity of the bond as time passed and what effect that change would have on the parameters we are trying to estimate. As is clear from the development of the theory, the resulting partial differential equation applies to all assets with prices which are dependent only on the state variables and time. We use the same differential equation to find the prices of different coupon bonds and also options on these bonds. The difference, therefore, lies in the boundary conditions that we impose on the solution. For a discount bond with price <5(r, /, r), time to maturity r, and a principal value 111 of $100, 10 the boundary condition is simply the payout that the holder receives at maturity of the bond, 6{r,l,0) = 100. When pricing a european call, Cs(r, I, T), or put, Pfi(r, /, r) once again the boundary condition is simply the payout that the holder receives at maturity of the option CE (r, /, 0; K) = max(0, £ ( r , /, r ; c) B K), PE (r, 1,0; K) = max(0, K - B(r, /, r ; c)), s where K is the exercise price of the option and B(r, /, TB\C) is the price of underlying bond having a coupon rate c and Tg time left to maturity as of the date the option matures. The only options on government issues that are currently traded, however, are American options, that is, options that can be exercised at any point in time. Since this is the case, we need an extra boundary condition for an American option preventing its price from falling below what the holder would receive if he exercised the option. That is, at all times up to and including maturity we must have C{r,l,r; K) > max(0, B(r,I,T + T ;C) B P{r,l,T;K) > m a x ( 0 , i i C - B{r,l,r + K), T ;C)), B with equality holding at maturity, r = 0, of the option. The above boundary conditions hold for the usual type of option, namely, an 1 0 In this study I have simplified matters by standardizing all bonds to a face value of $100. 112 option on a specific underlying bond. There are also options being traded where the underlying instrument changes over time. In particular, we need to value options where the security deliverable upon exercise has a fixed time to maturity, r. For these 'fixed maturity' options, we have the following boundary conditions, C(r, /, r; K) > max(0, B(r, I, r; c) - P(r,l,r\K) > max(0,iT - with equality once again holding at maturity. K), £(r,/,r;c)), 11 The only question remaining is how to value a coupon bond. I will follow Brennan and Schwartz in this matter, and assume that we can neglect any tax effects. With this being the case, a coupon bond becomes simply a portfolio of discount bonds with the value B(r, /, T ; cj = where T c < ^ c 6(r, l,r-rc ) + 6{r, I, r) are the times to maturity of the different bond coupon payouts, and c is the coupon rate of the bond. 1 Strictly speaking, the asset pricing theory as developed above does not hold if the underlying asset changes continuously, as it does here. This is because the theoretical arguments are based on forming an arbitrage portfolio and holding it for an instant of time. One of the assets included in this portfolio is supposed to be the asset underlying the option, so an implicit assumption is that the underlying asset can be held for an instant of time. This, of course, is not true if the underlying asset changes continually. If, however, we modify the boundary condition so that it is imposed only at a countable number of instants - that is, exercise is only allowed at a countable number of instants - then the 'underlying asset' at any point in time is the asset which is deliverable at the next permissible exercise time, and the theory is once more valid. 113 B.2.1.b THE BLACK-SCHOLES MODEL Another special case of the general asset pricing model is the Black and Scholes (1973) single-factor pricing model. When Black and Scholes brought this model forward, their main concern was the pricing of derivative assets, so their model contained only a single stochastic state variable z, which was taken to be the price of the underlying asset. This single state variable simplicity has a price, of course, as the random nature of the riskless rate, r, is left und escribed. Since r is present in the asset pricing partial differential equation, we may find that our asset prices are in fact sensitive to unexpected changes in r. The Black-Scholes asset pricing partial differential equation, therefore, is z \^ ^ + nX)zx + izr + c-rz = zT , where z is the price of the asset underlying the derivative asset z, c is the per unit payout on z and dx = (3(x) dt + n(x) dw, dr = 7(7-, t) dt. If we value the underlying asset z itself using this partial differential equation, since xx = 1, xxx = 0, z = 0 and z = 0, we have r r (3 — i]X + cx — rx = 0, where cx is the per unit payout on the asset z. This simplifies the equation to \l Zxx 2 + (rz - cx )zx + izr + c-rz 114 = zT . This is the Black-Scholes pricing equation except for several simplifying assumptions that they made, namely, r rj = c x xi <rx constant, 7 = 0, ie. r constant, c = 0, the payout on z is zero, which-give the classic Black-Scholes equation 2 2 ^<r x x zxx + (rx- cx )zx - rz = zT . In this study I use all of these common assumptions, so that bond option prices are assumed to conform to the differential equation -CT B ZBB 2 3 - [rB 2 12 — C )ZB - rz — B zT , where CB is the instantaneous dollar interest accrual of the underlying bond. Note that only one parameter, <TB, has to be estimated, making this model quite simple when compared to the Brennan-Schwartz model. As with the Brennan-Schwartz model, boundary conditions for American call and put options are C(B, r; K) > max(0, B - P(B,T;K) >mzx{0,K - K)), B)), with equality holding at maturity. The variable denoting the underlying asset has been changed from x to B to stress that the underlying asset is a bond (or treasury bill) in this study. 115 A r b i t r a g e Portfolios B.2.2 As was shown in the theory section above, the entire asset pricing theory rests on a 'no arbitrage' rule which is assumed to hold in an efficient market. If we were able to form the 'arbitrage' portfolios mentioned there, we could test whether or not this no arbitrage condition really holds in the market. The problem is that in order to calculate the amount of each asset to hold in the arbitrage portfolio we need a theory to tell us what the subspace S spanned by the columns of s = Z - 1 V^B n is. As a first step to identifying a suitable arbitrage portfolio, note that if we let T be the subspace spanned by the columns of Z S'Z~ Vx m l n = (<5'Z~ V z)n = 0, l which means that T - — S - and T = S. 1 V5 € x 1 _ 1 V . « , then 3 T L This means that in order to form an arbitrage portfolio we need only find a vector of dollar amounts of each asset, 6 ET. That is, we need only find a 5 orthogonal to Z~ V' l %. x For the case where we have n state variables, if we can find an n-vector of assets, Z b , which form a non-singular basis Z ~ V z&, b 1 r x then this basis spans the subspace S and we can express ( V Z h ) / z h of any asset to be hedged as a linear combination x of the columns of Z ^ " V . Z f e . That is, if 1 a then z& = VhV z x 116 h + u' V z b x b = 0, or where u is a vector of the number of units of each asset in the arbitrage portfolio. Therefore, given an n-vector of assets, Z t , which allows us to span S we can hedge away the risk of any other asset. Since the theory requires the resulting riskless portfolio to return the riskless rate, if we combine the asset positions above with an appropriate position in the riskless asset, that is, invest —<5'1 in therisklessasset, the result is a zero-investment arbitrage portfolio. Because it is a zero-investment portfolio, it should have a zero return. The test of the no arbitrage rule is therefore a type of market efficiency test. Of course, it is really a joint test of market efficiency, the particular models that I use, and the accuracy of the parameters that I estimate in order to derive theoretical option prices, but this is true of all tests of market efficiency. 13 The procedure that will be followed here is to look for discrepancies between the theoretical and market prices of a bond option and then form a hopefully riskless, zero-investment arbitrage position to take advantage of any mispricings that are found. That is, we choose Uh > 0 if i/fi < 0 if ZH < zjf, Zh > zff, buy option when undervalued write option when overvalued where zff is the observed market price of the bond option. That is, if there appear to be arbitrage possibilities, then either the market is inefficient, the models used in this study are incorrect, or the parameter estimates used to derive theoretical prices were inaccurate. 117 B.3 DATA DESCRIPTION The main aim of this study is the comparison of bond option price quotations with theoretical model prices. Naturally, this creates two needs for data. We certainly need to collect the bond option price quotations, but we also need data to help us create the theoretical prices. In the previous section on asset pricing it became clear that before assets can be priced with an asset pricing partial differential equation we must first estimate all the parameters of the underlying stochastic differential equation, and also any parameters involved in the market price of risk function. In this study, I have chosen to collect data from two non-overlapping periods. The first period is the 'estimation period', and consists of monthly observations running from October 1970 through October 1982. Data from this period is used to estimate any needed model parameters. 14 The estimation period is followed by the 'test period', consisting of daily data covering the period from November 1, 1982 through October 31, 1983. Data from this period is used to test the asset pricing models and perform the arbitrage tests. 15 The longer the estimation period is, the greater the number of data points available and hopefully the better my parameter estimates. O n the other hand, the longer the estimation period, the more likely it is that changes occur in the parameters during the period (ie. non-stationarity of parameters). I chose a 12 year estimation period as, hopefully, a good compromise between these two opposing factors. That is, the arbitrage tests done in this study will be testing whether or not there are arbitrage possibilities given past data series on bond and treasury bill prices. If traders in fact form their expectations based on more sources of information, and markets are efficient, then we would not expect to find any arbitrage profits in the tests done in this study. 118 B.3.1 Parameter E s t i m a t i o n D a t a As shown above, the Brennan-Schwartz pricing model produces the asset pricing equation from the stochastic differential equation in r and / (s)-(«w//- + f^ + ^)*+ft.i)«-. «*-(; 0If we were given parameterized forms for Pi, <7i, a2 and Aj we could use the stochastic differential equation, along with data series for r and / from the estimation period, to estimate these parameters, leaving us with only A still unknown. r Since A is a market price of risk function and, as we noted in the previous section, r is the only market risk price that cannot be eliminated from the pricing equation by using an observable asset price, we are forced to estimate this function by comparing theoretical and market prices over the estimation period. That is, we must try various values for the parameters in A and choose those values which r result in the best fit between theoretical prices calculated with the asset pricing partial differential equation and actual market prices. 119 B.S.I.a SHORT RATE SERIES I mentioned above that estimation of the stochastic process parameters requires a time series for the instantaneous riskless rate of interest r. Of course, there is no such series available and we must instead find an acceptable proxy for r. As my proxy I chose the continuously compounded yield on the outstanding treasury bill which was the closest to having 30 days left to maturity. This data series was readily available for each month-end froni the C R S P US government bond tape. I selected monthly proxy values from the estimation period October 1970 through October 1982. This period lies just before the November 1, 1982 start of the testing period containing the bond option price data to be tested. B.S.l.b CONSOL RATE SERIES Just as there is no instantaneous riskless rate series available, there is also no series available for the yield on a consol bond: there is no consol bond outstanding in the United States. We are forced once again to find an acceptable proxy for the unavailable series. The proxy that I chose was the yield on a very long maturity bond, which should provide a good approximation to the yield on a consol. As long as the bond's par value repayment is so far in the future that it is discounted almost to zero, the yields on the long term bond and consol should be quite close. 120 The proxy that I used, therefore, was the continuously compounded yield of the outstanding bond with the longest time left to maturity, under the condition that the bond also be normally taxable. 16 1 7 This series was also collected from the CRSP US government bond tape for each month-end over the period of October 1970 through October 1982, a total of 145 observations. 18 B.SJ.c MARKET PRICE OF SHORT RATE RISK PARAMETERS Once the parameters of the stochastic differential equation in r and / have been estimated, we can proceed to the estimation of the market determined risk price function A . Since the only way to estimate this function is to actually calculate r the theoretical prices of some assets and compare them to the quoted market prices, the asset pricing partial differential equation must be solved repeatedly for different values of the parameters of A until the best fitting parameters are r found. The tax treatment of certain bonds, termed 'flower' bonds, is different from the treatment of most bonds, resulting in prices higher than those on normally taxable bonds. I also chose a second proxy the same as the above but with an additional condition: the bond also had to be trading within a $10 dollar range of par. This condition was added just in case there were tax effects due to differential treatment of coupon payouts and capital gains. As there was no significant difference in the parameter estimates from these two series, the added condition was considered unnecessary and was dropped. Actually, a total of 289 observations covering the period October 1958 through October 1982 were collected for both the r and / series. The first half of the series from October 1958 to October 1970 were used to test how variable the parameter estimates were from one time period to the next. The parameter estimates from this first half of the estimation period were not otherwise used. 121 Once again, the best available source of pertinent market data is the C R S P US government bond tape, from which I obtained month-end market bond prices for the estimation period of October 1970 through October 1982. 19 The price used was whatever was available on the C R S P tape, either an actual sale price, bid price or ask price or, if both bid and ask prices were given, the middle of the bid-ask spread. B.3.2 Test P e r i o d D a t a The option data that is needed for this study was not available in computer readible format, and had to be collected from quotations published in the Wall Street Journal. The test period follows directly after the parameter estimation period, and runs from November 1, 1982 to October 31, 1983. Data were collected for each trading day in this period. Only normally taxable bonds and notes were chosen. 122 B.S.2.a BOND OPTION DATA Throughout the entire year-long test period there were only options outstanding on five notes and three bonds, with all of the note options listed on the American Stock Exchange (AMEX) and all of the bond options listed on the Chicago Board Options Exchange (CBOE). Because these options mature 9 months after their initial listing and are listed each quarter, there may be as many as three options 20 outstanding which differ only by date of maturity. 21 Table B . l shows a summary of the bond and bond option data collected. In all, there were a total of 274 different bond options traded during the one year test period, generating 3793 bond option price observations. As is shown in Table B.2, most of these options traded at prices below $5. 22 Two further breakdowns show that most of the option trades occurred with less than 5 months left to maturity on the option (Table B.3) and that the options trade close to the money (Table B.4). There are several technical details that should be mentioned here. First, the note option contracts traded on A M E X are 'small contracts', that is, the underlying principal amount of the note is $20,000. On the CBOE, bond option contracts Note and bond options mature and are listed at the end of the third Friday of March, June, September and December. Exchange rules actually allow listing of options with up to 15 months to maturity, which would allow trading of up to 5 options differing only by maturity date. Neither exchange has listed options with more than 9 months to maturity. I have standardized all the options so that the underlying bonds have a principal value of $100. 123 were either 'small contracts' or 'large contracts', where a 'large contract' has an underlying principal amount of $100, OOO. 23 Second, the exercise prices shown in Table B . l must be adjusted, just as bond price quotations must be adjusted, to take into account accrued interest. For example, the holder of a call option with an exercise price of $102 per $100 of principal value would have to pay $102 plus the accrued interest on the underlying bond in order to exercise his option. Third, the actual bond option price quotation is shown in the Wall Street Journal as a decimal amount. The decimal portion actually represents 3 2 nd s of a dollar, so that a quotation of $2.10 per $100 principal value is actually a price of $2 10/32. Lastly, there is a delay of two business days between exercise of a bond option and final settlement. For example, if an option is exercised on Thursday, then the exercise price plus accrued interest on the underlying bond up to and including the settlement date must be paid on Monday, the settlement date. As mentioned in a previous footnote, I ignore this aspect and standardize all options to a contract size of $100. 124 B.3.2. b TREASUR Y BILL OPTION DATA The treasury bill option is considerably more complicated than the bond option we have just looked at. Unlike the bond option, the treasury bill option exercise settlement date is not two trading days after exercise, but is instead the Thursday of the week following the week in which the option is exercised. 24 Also, the underlying security which must be supplied on the settlement date is not the same from week to week, as with bond options. The deliverable treasury bill is one which has 13 weeks to maturity as of the settlement date. 25 Naturally the deliverable treasury bill changes every week. As an example, we can suppose that the writer of a call option has his option called on Monday. In order to lock in the value of his settlement date obligations, he purchases a treasury bill which will have 13 weeks left to maturity on the settlement date. Since the Monday exercise date and Thursday settlement date are 10 days apart, on the exercise date he would buy a treasury bill with 14 weeks and 3 days left to maturity. If the exercise instead took place on a Friday, then the settlement date would be only 6 days away and the writer would purchase a treasury bill with 13 weeks and 6 days left to maturity in order to lock in his settlement date obligations. Therefore, the maturity of the underlying treasury bill is, strictly speaking, not 13 weeks, but varies from 13 weeks and 6 days to 14 weeks and 3 days. Or the next trading day following the Thursday if it is a holiday. Exchange rules permit options on 26-week treasury bills also, but these have not been listed. 125 As with bond options, treasury bill options are listed quarterly and initially have 9 months to maturity. 26 The result is that there could be up to 3 options out- standing which differ only by maturity date. Table B.5 shows a summary of the treasury bill option data collected. There were a total of 37 treasury bill op- tions listed during the test period, and a total of 819 treasury bill option price observations were collected. At this point I must mention the method used to adjust the quoted treasury bill option exercise prices. If, for example, the exercise price quoted is k, then the actual price payable on settlement is K = 100 - (100 - k)91/360 dollars per $100 dollars principal value. For k = 90, say, this gives K = $97.4722. The exercise prices used in this study have been converted from the quoted value to the actual dollar amount payable. Several other details should be mentioned concerning the contract size traded and the method of quoting treasury bill option prices. First, treasury bill options have traded only on A M E X and only in contracts with $200,000 underlying principal value. 27 The method of quoting treasury bill option prices is also quite different from bond options. The quoted price is given in decimal form and is in fact a decimal number, but must be adjusted as follows. If p is the quoted treasury bill Treasury bill options mature and are listed at the end of the third Friday in March, June, September and December. This is a 'small contract' for a 13-week treasury bill option. A 'large contract' for a 13-week treasury bill option would have an underlying principal value of $1,000,000. The 'small contract' and 'large contract' sizes for the as yet untraded 26-week treasury bill options are $100,000 and $500,000 respectively. As usual, I ignore contract size and standardize all options to have an underlying pricipal value of $100. 126 option 'premium', then the actual price payable for a 13-week treasury bill option is P = p £| per $100 principal value. 28 Note that the factor for price adjustment, 13/52, is different from the adjustment factor for exercise prices, 91/360. Returning once more to the collected data, we see in Table B.6 that most of the treasury bill options traded at prices under 30 cents. Two further breakdowns show that - as with bond options - most of the trades took place with less than 5 months left to maturity on the option (Table B.7) and that the options trade close to the money (Table B.8). B.S.2.C ARBITRAGE PORTFOLIO DATA The hedging theory section above showed that we could hedge away the risk of any asset in an n-factor model by combining it with the proper holdings of n other assets forming a basis over the n-dimensional risk space. Therefore, when one is dealing with the Black-Scholes model, since the risk space is one-dimensional, the risk of a bond or treasury bill option can be hedged away by holding the correct amount of the underlying bond or treasury bill. 29 If the option had been one on a 26-week treasury bill we would have had P = p |§. In fact, the theory allows one to hedge away the option risk by using any bond or treasury bill, not just the underlying instrument. 127 When dealing with the Brennan-Schwartz model, the risk space is two-dimensional,! so a second hedging asset is needed. I decided to complete the hedge portfolio by using a bond having 5 years to maturity. When combined with the 20 to 30 year maturity underlying bond of a bond option, this 5 year bond would mainly hedge the short rate risk while the position in the underlying bond mainly hedges consol rate risk. With a 13-week treasury bill option the situation is exactly the opposite. The hedge portfolio position in the underlying treasury bill mainly hedges the short rate risk while the 5 year bond position hedges mainly consol rate risk. The 5* year bond is sufficiently different from the underlying bonds and treasury bills to be used in both cases. So that I would be able to form the required zero-investment arbitrage portfolios, I collected from the Wall Street Journal price data for each underlying bond or treasury bill on all of the days that at least one option on the bond or treasury bill traded. Both bid and ask prices were recorded. 30 For the 5 year maturity bond I chose two bonds which appeared to be heavily traded: their bid-ask spreads were narrow. Over the first part of the test period, from November 1, 1982 to April 30, 1983, I used the 12 5/8% bond maturing November 15, 1987, and for the latter part of the test period, from March 1, 1983, 9 7/8% bond maturing May 15, 1988. 31 to October 31, 1983 I chose the Both bid and ask prices were collected In the Wall Street Journal bond pricesth are quoted in decimal format. The decimal portion of the number represents 64 s of a dollar, so that a quotation of $100.4 is really $100 4/64 per $100 principal value. Treasury bill prices are quoted in discount form, that is, if the discount quoted is d, the actual price payable is P = 100 — d^ where n is the number of days to maturity of the treasury bill. For example, the price of a 13-week treasury bill quoted at a discount of 8.68 would be P = 100 - 8.68 ^ = $97.8059 per $100 principal value. There was a period of overlap in the data collected for these two bonds, since an arbitrage portfolio formed before March 1, 1983 would contain the 12 5/8% bond 128 for these two bonds on each trading day of the periods given above. The last data required is a time series for the riskless rate r on each day of the test period. This is needed for two purposes. First, we need r in testing the no arbitrage condition. The zero-investment arbitrage portfolio requires an investment in the riskless asset, so we must know what the riskless rate is before we can test for positive expected arbitrage returns. Second, since the numerical solution of bond and option prices will provide prices as a function of r and /, r is needed in order to calculate the theoretical price of an option on any given day. For this latter reason, I chose the same proxy for r as I chose in the estimation period, namely, the continuously compounded yield on the treasury bill having the closest to 30 days left to maturity. These data were collected from the Wall Street Journal for each trading day of the test period. Both bid and ask prices were collected and transformed from discount form. The yield of the price midway between the bid and ask prices was used to proxy r. maturing November 1987. If there was no period of overlap, then we would run into trouble if the portfolio was held past March 1, 1983, as we would not be able to provide a price for this bond when the portfolio was liquidated. 129 ( B.4 NUMERICAL SOLUTION OF T H EASSET PRICING P D E The Brennan Schwartz asset pricing model results in the partial differential equation Lz = zr , a a 2 z = z(r,l,r) a 2 a 2 with a, 6, c, d, e and / functions of r and / o n l y . a The first step in the solution 32 of this equation is replacing the differential operator L by the finite difference operator L , 3 3 - 2 , H Hi 5P Ar 6f r 2 4A r A/ A/ where the difference operators 5 and # H Hi 2Ar 2A/ r 2 , are defined to have the following effects on a function of r and /: 6?Hr, I) = f(r + Ar, /) - 2/(r, /) + /(r - Ar, /), 6?f(r, I) = /(r, / + A/) - 2/(r, /) + /(r, / - A/), ^r/(r,/) = /(r + A r , / ) - / ( r - A r , / ) , J|/(r,0 = /(r,/ + A0-/(r,/ + A/). The reason for using these particular difference approximations is that they are the ones that follow from the Taylor series expansions of / , so that Lz = Lz + 0{ A r 2 + A/2) = zT + 0 ( A r 2 + A/2 Strictly speaking, the partial differential equation also contains a term due to possible asset payouts, such as the coupon payout on a bond. Since I will only be numerically solving the asset princing partial differential equation for the values of assets which do not make payouts - discount bonds and options - I ignore this additional term. See Varga (1962) for a detailed outline of the methods used in this section. 130 The solution, Z, that we would obtain from solving the mixed difference-differential equation, namely, LZ = ZT is not equal to the real solution, z. If we let v = z — Z be the difference between the approximate and real solutions, we see from the two equations above that L(z -Z) 2 = (z- Z)T + 0{Ar + A / ) , 2 or Lv = vr +e, e(r,/,0) = 0, 2 e(r, I, r) = 0 ( A r + Al ) 2 Since the initial conditions are known at maturity (T = 0), the initial value for e is zero. If we now choose a regular two-dimensional grid of r and / points with a spacing of A r between the / points in the r direction and Al between the J points in the / direction, we can define the vectors '*(ri,/i,r)> < 34 (Z{ru lu ry z(n,li,T) Z(n,lU T) z(ruh,T) Z(ruh,r) z(rI ,lj,r)J K ^(r ,/ ,r)> 1 1 €(r/,/i,T) r A uh,r) Z{ri,lj T)) t The vector Z(T) is defined so that Z(r,-,/y,T), which is the value of Z(r,l,r) at the intersection of the i t h column and j t h row of the grid, is the i + (j — l ) i s t element of the vector Z ( r ) . That is, the first J elements of Z(T) are those in the first row of the grid, namely Z(ri,/i,r) through Z(TI,1\,T). The next / elements are from the second row of the grid, and so on until the J t h row. 131 and replace the difference operator L by the matrix G to give,35 : G z = z + s + e. r GZ = Z + a, T G v = v 4- e, v = z r where the matrix G is defined as below. G = A (MR - 2i + M; +D Ar )(M, 2A/ 2 +E 2Ar +C ( M i - 21 + M|; A/ 2 + F +S 2A/ M r = matrix of all zeros except the first upper diagonal which is all ones, M j = matrix of all zeros except the fi 1 upper diagonal which is all ones, A = diag(dfc), O f + ^ x ) / = a(ry, /y), B = diag(6fc), 6 - (y_!)/ = 6(r,-, /y), C = diag(c ), e«+(,-_i)/ = c(r,-, /y), D = d i a g ( i ), d E = diag(ejfe), «,•+(,•_!)/ = e(r,-, /y), F = diag(/ ), /,•+(,•_!)/ = /(r,-, /y). fc k fc f + _ i+{y x ) x For example, a(r,-,/y) is the = d (rt-, /y), + (j — l ) / t h element along the diagonal of A . The vector 8 is the result of imposing boundary conditions on the edges of the (r, /) grid. For simplicity, it will not be explicitly considered here. The matrix S is also the result of imposing boundary conditions on the edges of the (r, /) grid. 132 The solutions to these mixed difference-differential systems can be shown to be Z(r) = - G V(T) = / Jo - 1 s + exp(rG)[Z(0) + G ~ s ] 1 > exp[(r — i/)G]e(i/) du. The above equation in v(r) can be used to show that if all of the characteristic values of G have negative real parts, then v(r) is bounded as r increases to infinity. 37 If this condition does not hold, then the matrix G is said to be unstable and we cannot guarantee that v remains bounded as r increases to infinity. That is, the finite difference solution, Z , can not be guaranteed to be 'close' to the real solution s unless the characteristic values of G all have negative real parts. B.4.1 38 T h e A l t e r n a t i n g Direction M e t h o d The alternating direction method is based on an approximation to e x p ( A r G ) in the solution to the mixed difference-differential equation found in the previous section. Z(r + A r ) = - G _ 1 8 + exp(ArG)(Z(r) + G ^ s ) Before making this approximation, the key of the alternating direction method is first decomposing G into G r and G / G = G + Gi, r See Varga (1962). If all the characteristic values of G have negative real parts, then the effects of discretization errors dies out exponentially as we solve for larger and larger values of T. If this condition does not hold, then we may find the effects of discretization errors increasing exponentially as we solve for larger and larger values of r. 133 where the exact composition of G r and G( is discussed below. Once this decomposition has been made, we approximate e x p ( A r G ) by T . T = ( l - j A r c / ) " ' ( l - i A r G , ) " ' (l + |A»G,) (l+±ArG r ) AT 2 = I + A r G+ — G = exp(ArG) + 2 + 0(Ar ) 3 0(AT ). 3 This approximation is consistent, 39 as T agrees with e x p ( A r G ) up to the linear term A r G . At first glance, the system of equations that result from discretizing the time dimension using the approximation T seems to be no easier to solve than any other system resulting, from a consistent approximation. ^1 - l -ArG,^ = (i + Z(r + A r ) r ^ArG,) + [(l + = ( i - \ArG ) (i + \ArG ) iArG,) Z(r) r (l + iArG ) r (l - IATG,) (i - ^ArC,)] G ^ s + ^ A r G + i A r G , G ^ Z(r) + A r s 2 r If, however, we follow the Douglas and Rachford (1956) alternating direction format and define Z*(r + A r ) to satisfy (i - \ A r G ^ j 3 9 Z*(r + A r ) = (i + ± A r ( G + G ) ) r Z(r), Any approximation which agrees with the first two power series terms of exp( A r G ) , namely, I + A r G , is a consistent approximation. 134 then the solution Z ( r + A r ) is found f r o m 40 + Ars*, Once we define exactly what G r and G j are, it becomes clear exactly why this is called an alternating direction method. Assuming for the moment that the coefficient b of the cross-derivative term bz \ in the partial differential equation is r zero, we can decompose G as follows: G = G + G , r ( G G, Since each of these matrices is tridiagonal, solution of the two-factor model is reduced to the solution of / tridiagonal systems of order J plus J tridiagonal systems of order J at each time step, where J and J are the number of grid points in the r and / dimensions. First a set of tridiagonal systems is solved to give Z * ( T + A T ) , and this intermediate value is used in solving the next set of tridiagonal systems, producing Z(T + A r ) . The method is equivalent to alternately solving a one-factor model in the / dimension followed by a second one-factor model in the r dimension. Hence the name alternating direction. The original papers on the alternating direction method did not address the adjustment of boundary conditions, that is, the use of s* instead of a. For a discussion of this topic see Fairweather and Mitchell (1967). 135 Given a certain form of G and Gj it is relatively easy to establish the stability of r the Douglas-Rachford alternating direction method. If, that is, G and Gj have r only negative diagonal and non-negative off-diagonal elements, then all of their characteristic values, rjr k and n/*, have negative real parts. 41 Since the solution after n time steps is Z(nAr) + G ^ s = T(Z((n - 1) Ar) + G~ a) l = T (Z(0) + G ^ s ) , n the stability of the system is assured if ||T|| is less than 1. ||T|| = < I ( l - i A r G ) "(lr lAKS.) im r (l + ±ArG ) (l - IArG,) ( l - IATC.) ' ( l + i A r G , ) 1 + iArRe(n tm J max m 1 - iATRe(n ) ( l + '-ArG.) ( l + i A r G ) -1 r 1 + iArRe(n ) max n l-iArRe(n ) rn r n Since all of the characteristic values of G and Gj are assumed to have negative r real parts, ||T|| is in fact less than 1, verifying the stability of the Douglas-Rachford method. Up to this point the analysis has assumed that the coefficient b of the bzr \ term is zero. Without this assumption, we are not able to decompose G into two tridiagonal matrices. There have been several alternative methods proposed for handling cases such as ours, where 6 is not zero. The Douglas and Gunn (1964) method extends the original alternating direction idea one step further by decomposing G into three parts G = G + Gj + G j , r r See Varga (1962) for the properties of positive matrices. 136 where G and G , are the same as defined above, and G j is the result of a finite r r difference approximation to bz i which is slightly different from the one I have r used in previous sections. The approximation to exp(ArG) then becomes exp(ArG) = (i - l -ArG ) r/ (i - ^ A r G ) (i - r ^ATG/) x ( i + ^ A r G , ) ( l + ^ A r G ) (i + ^ A r G , ) + 0 ( A r ) . r 3 r Unfortunately, this method also requires the solution of three sets of tridiagonal systems at each time step instead of the two sets of systems required in the Douglas-Rachford method. 42 The necessity of solving three sets of tridiagonal systems at each time step can be avoided, however, as was shown by McKee and Mitchell (1970). This is made possible by using a. backward difference approximation to the bz i term instead of r the Crank-Nicolson approximation implied in the Douglas-Gunn procedure. That is, they approximate exp(ArG) by p(ArG) = (i - i A r G ) (i - i A r G , ) r x ^ I + ± A r G , ) (l+^ArG ) + ArG r r t + 0(Ar ),' 2 Where G w = ^ ^ ( M r -M' )(M, -M|), r and G and G j are as in the Douglas-Rachford method. This still produces a r consistent approximation. 4 2 Actually, if 6 may be both positive and negative at different points of the grid, the Douglas-Gunn method splits G into four parts, and requires the solution of four sets of tridiagonal systems at each time step. In our problem, however, 6 is either always positive or always negative, depending on the sign of the correlation coefficient p. Because 6 is either positive or negative, but not both, one of the four splittings becomes zero. 137 Since making this substitution for exp(ArG) produces the same system of equations as the Douglas-Rachford method, namely, I-IATG, (I-IATG,) R Z(r + Ar) Z(r) (l + ^ A r G « ) ( l + ^ A r G ) + A r G , r r (i + jArGi) (i + ^ A r G ) + A r G , r r -(l-iArG^I-iArG^G-s = ^1 + i A r G + i A r G , G ^ Z(r) + Are, 2 r using the Douglas-Rachford alternating direction form gives us once again (i - jArG,) Z-(r + Ar) = (i + j A r ( G + Gr)) Z(T), ( I - { A * G , ) I-i ( ATG,) Z(r + Ar) = Z*(r + Ar) - -ArG Z(r) + Ars*, R 8* = 8. The only difference between this and the equivalent systems found using the Douglas-Rachford method is the difference in the decomposition of G . 4 3 One of the drawbacks of the McKee-Mitchell method is the difficulty of showing stability. Because of the unusual form of their approximation to exp(ArG), stability cannot in general be shown. Although in general we cannot show stability, McKee and Mitchell were able to show that their method is stable when applied to the problem 6 < 4ac, azrr + bzr i + cz\\ = z , 2 r a > 0, c > 0, That is, here we have G = G + Gj + G j , whereas in the Douglas-Rachford case with 6 = 0 we had G = G + G , . r r r 138 * = <*(»•» Oi ( .0i c = c(r,/), 6= 6 r a fact which turns out to be useful below. B.4.2 Stability of the Solution M e t h o d The Douglas-Rachford alternating direction method is only guaranteed stable if the characteristic values of both G r and Gj all have negative real parts. Unfor- tunately, the form of G in our particular problem does not tell us whether this condition holds. If we look at the G matrix for the Brennan-Schwartz two-factor model, namely, \Ar + + 2Ar/ f C 1 E\ [AP 2 Al) + Ar \ 2 / M ' + 2 / 2 C (- AP 2 \Ar 2j 1P \ + 2 2Arj / C 1 E \ (SP * 1 Ai) + i ^ ( M , - M ; ) ( M , - M j ) , where A = diag(ajfe), of-+(,•_!)/ = ^ ( r , - , fy), B = diag(fefc), fc,+(y_i)/ = p<7i (r,-, /y)(r (r -, fy), 3 C = diag(cfe), c, y_ + ( D = diag(rf ), 1 ) 7 = ^ ( r , - , /y), = fc Pi(ri,l3 ) E = diag(e ), fi, y_i)/ F = diag(/fc), /,+(y_i)/ = - r y , fc +( t - ^i(r,-,/y)A (r,-,/y,*), r = fj(<rf (r,-,/y)//? + /y - r,), r Mi , since a* and are positive and fk is negative, A > 0, C > 0, P < 0, it is clear that the diagonal of G is made up of negative elements. If the offdiagonal elements were all non-negative, then all of the characteristic values of G r and G( would be guaranteed negative and the stability of our numerical methods would be assured. 44 We see, however, that the bzT \ term produces two negative elements per row of G . As the partial differential equation is elliptic in its space variables, ie. 6 < 4ac, it turns out that these elements do not cause instability. 2 45 There are, however, other potentially negative elements. If either of the following conditions holds then G will have negative off-diagonal elements. 1 d Ofc Ar k 2 Ar 2 or Ck 2 A/ A/ 2 In using the Brennan-Schwartz model, I assume that the variances of r and / are as below. o"x(ry, lj) = cyr,-, ar constant, (7 (ry,/y) = <rj/y, a\ constant. 2 The variance of changes in the short rate is proportional to the short rate r, so that as r approaches zero so does its variance. Since the drift component of r does not approach zero at the same time, we find that some off-diagonal elements of G are negative for small values of r. This raises the possibility of instability. See Varga (1962) for the theory of positive matrices. This can be seen by an analysis similar to the one performed by McKee and Mitchell. 140 For grid points where either r or I Is small, we may find G containing negative off-diagonal elements. As a test of the stability of the method, I numerically solved the Brennan-Schwartz model using both the alternating direction and successive overrelaxation (SOR) methods with the same stochastic process forms and parameter values as were found by Brennan and Schwartz (1982). The solution did indeed show instability, occuring when either the r and / grid was not extended to large enough interest rates, or when too large a spacing was used between grid points. For the parameters and grid dimensions used by Brennan and Schwartz, namely, r running from 0.00 to 0.50 with A r = 0.01, / running from 0.00 to 0.50 with A / = 0.01 and A r = 1/24 the solution was well-behaved. This stability was also present when various other parameter values were tried. As a result, I decided to use these grid dimensions above when solving the partial differential equation in later parts of the study. 46 4 7 The instability of the solution was not the result of using the alternating direction solution method instead of the successive overrelaxation method (SOR) used by Brennan and Schwartz (1982). In a comparison of the alternating direction and SOR methods; I found that the alternating direction method was, in fact, less affected by instability. One unsettling aspect of instability in a system such as this is the possibility that the 'correct' parameter values remain undiscovered because they lie in a region where the solution method is unstable. 141 B.5 PARAMETER ESTIMATION Up until now, the joint stochastic process for r and / has been discussed in quite general terms. In this section we must finally specify particular parameterized forms for the process drift and variance terms, and for the market price of short term risk A r . Brennan and Schwartz (1982) estimated the parameters in parameterized forms of fii, fa, o"i and 0 2 by using a time series of 30 day treasury bill yields and a series of yields on a very long maturity bond to proxy for r and /, respectively. Once these parameters were estimated, they were left with only the market price of short rate risk Ar to estimate. This they estimated by finding the value of Ar which resulted in the best fit between market bond prices and theoretical bond prices. The theoretical bond prices were, of course, found by numerical solution of the asset pricing partial differential equation for various values of A r . In the next section, I first examine the procedure used by Brennan and Schwartz to estimate the parameters of the joint stochastic process for r and /. The conclusion I reach is that for the particular estimates found in this study, no confidence can be placed in the estimates for the parameters of B\ or 82- The estimates of the parameters of <T\ and 0*2 and the correlation p between the two Wiener processes can, however, be well estimated. As a result, unlike Brennan and Schwartz, I only estimate <Xi, a<i and p using the two time series of yields. I then simultaneously estimate Ar and the parameters of B\ by finding the values producing the best 142 fitting theoretical bond values. B.5.1 The Simple Linearization M e t h o d In their first papers on the two-factor model for bond pricing, Brennan and Schwartz (1977, 1979, i980) used joint stochastic processes for r and / which allowed one to solve the forward equation analytically. This basically limited them to processes which were linear, that is, of the form ft) = A ( (') + b ) < f l + c ' i w ' dwdvr' = Idt, where A and C are constant matrices and b is a constant vector. As shown by Phillips (1972) this system can be solved for A r and Al. Unfortunately, the solution of a linear differential system is a special case. It is not possible to find an analytic solution to a general non-linear differential system, such as is used in this study. As an alternative Ananthanarayanan (1978) and Brennan and Schwartz (1982) proposed what they called a simple linearization method. Instead of directly solving the problem, they suggested the following approximation. t+At / /"t+At \dKs)J = I ft+At r s P( ( )>K*)-s)ds + J t+At / 143 r,(r(s),l(s))3 )dv,(S ) [t + At <fo + ij(r(*).'(0.0 J <M«), or 1 1 n- (r(<), /(«), *) ( j) - *' MO. 'W. '(0.0^ + Aw(0, Aw(*) ~ J V { 0 , I A * } . That is, this approximation assumes that both 3 and n are approximately constant over the interval At. Clearly, this result can be used in maximizing the likelihood function T L=p{ , ri /i, n p i=2 ( « > '.•» «.-i .--i»'.--1» r r 1 PMMn-uU-uti-i) Aw,_ i = B.S.l.a T7 (r,-„ i , =^ e x p i, !) MINIMUM DISTANCE / 1A v ' (-- (A/-I!) ,Aw._t\ 'j, -/?(»V-I,'.--I,*.--I)A* ESTIMATOR The section above ended by suggesting that after making the simple linearization approximation the results could be used to maximize a likelihood function. It would indeed be likelihood maximization if the matrix function rj was known, but since n contains unknown parameters the estimates derived from the maximization are only asymptotically maximum likelihood estimators as the number of observations increases to infinity. 48 As is easily seen, maximizing the 'likelihood' See Malinvaud (1966) and Phillips (1972). 144 function given above is equivalent to minimizing the distance function t 1=2 Aw,_i = !7~ (r,-_i,/,-_!, (Afcll) l -0( <-i.'<-i.'<-i)A* r except for the factor P(ri,l\,t%) which I have ignored in this study. B.5.1.6 A ONE-DIMENSIONAL EXAMPLE OF SIMPLE LINEARIZATION The first test of the simple linearization, method proved quite successful. In his thesis Ananthanarayanan (1978) showed that the simple linearization parameter estimates were almost identical to the estimates one would find by analytic solution of a particular one-dimensional stochastic process for the instantaneous riskless rate r. The process examined was dr = m(/i — r) dt + r <j dw, a dw ~ iV{0, dt}, where m, /z, a and a are constants. We can better analyze the situation if we change to a process which is homoscedastic. By Ito's Lemma, dx(r) = x dr + r -x r rr -x (dr) 2 rr It 2ot 2 <7 + x m(fi - r) r 145 dt + x r <7 dw. r a If we choose x = r a 1 a / o " ( l — a), then xr r a = 1 and we have m (/t-r)r — dx = a 1 --ao-r dt + dw. Now, if we make the simple linearization assumption that the drift term changes very little over the interval At, we have Ax,~ r m, . _„ 1 - ( / i - r , ) r . - -oro-r*a - l % a At + Awi, Awi ~ N{0, At}. As can be seen from Tables B.9 and B.10, the assumption that the drift term changes very little over the span of a month is a good one given the specific parameter values found by Ananthanarayanan. For a range of beginning of month values of r ranging from 0.05 to 0.20 we see that the end of period values are practically unchanged. 49 In fact, for the second set of parameter values shown in Table B.10, the end of period range is too small to show up with only three decimal digits. Actually, the way I have presented the simple linearization method is slightly different from the way that it has actually been used by Ananthanarayanan and Brennan and Schwartz. In my example, I first changed variables from r to x in order to end up with a homoscedastic process. The simple linearization procedure used by Ananthanarayanan (1978) and Brennan and Schwartz (1982) would instead use — At + A to,-. ~ — ^ I calculated these values using the simple linearization method. The end of month range of r given in the last column is a range of two standard deviations around the mean - again computed using simple linearization. 146 The agreement between my procedure and the original should be good since we have shown that rf changes little over At. B.5.2 Brennan-Schwartz Parameter Estimates The simple linearization procedure has been shown to work well in the one particular case studied by Ananthanarayanan, but this is not justification for its use in any other case. Justification would only come from showing that the assumptions of the simple linearization method are valid. In the Brennan-Schwartz scenario that I consider here, the first step in verifying these assumptions is to transform the process = 6(r,l) dt + n(r,0 dw(t), dwdw' = Idt, to an equivalent homoscedastic one: dx = 7(x)<ft + dw(t). Since Ito's Lemma gives us if we choose x(r, /) such that V r i x = rj 1 the result will be the homoscedastic stochastic differential system dx = r,- 1 /? + ir7tr(»7r/'V ,V; |/)x r 147 dt + dvr. B.5.2.a CHOICE OF THE BRENNAN-SCHWARTZ JOINT PROCESS FORM The particular Brennan-Schwartz process used in this study was,50 J = pdt + T} <iw, a(l - r) /(of + <r,A, + / where <rr and <r, are constant; The process for r is a mean reverting process, reverting to a changing mean: the consol rate /. The drift term for the / process was derived in the theory section using the fact that the consol bond value is a known function of the consol yield. In addition, I assume that A, is constant. 51 As mentioned in the previous section, if we want to transform this process to an equivalent homoscedastic form, we need to choose alternate variables x which Other processes were tried before this one was decided on. I had to reject using C-0 / (a 4- 6 >.l 2 2 \ + c r) 2 J which is the drift of the process used by Brennan and Schwartz. There was almost perfect correlation between the estimates of a and 61 and between a , 62 and C 2 . This high degree of correlation makes the parameter estimates meaningless, regardless of their standard errors. In order to avoid this correlation between parameter estimates it was necessary to adopt a process such as the one used in this study. As noted in the previous footnote, assuming A, is a linear combination of r and / - which is the next level of sophistication - would result in a problem of almost perfect correlation between parameter estimates. x 148 2 satisfy V r J x = n 1 . The desired alternate variables are which lead to the desired homoscedastic form i, t r(„'V r ,V' P ,)x = i , ( * » J l + ^ W ^ j + (ft) To be complete, of course, we must note that r and / in the above stochastic process for x are to be treated as functions of x. ( D = - ( ( O : H Also note that I have not uniquely determined x. which satisfy '=C !)• GG 149 There are many G matrices but they differ only by a rotation. B.5.2.b BRENNAN-SCHWARTZ AND SIMPLE LINEARIZATION The simple linearization assumption of a locally constant stochastic process drift term was shown by Ananthanarayanan to be a good assumption for the particular case that he was examining. This does not mean that it is a good assumption for all models that might be used. To be sure that the assumption is also good for the Brennan-Schwartz model that we are using in this study, we should redo the calculations that we performed when examining Ananthanarayanan's stochastic process. If we assume for the moment that the simple linearization assumption is valid for the Brennan-Schwartz model used here, then evolution of the homoscedastic system derived in the previous section can be approximated by the following 150 discrete difference equation -t + At i: It dx(s) 'MO) ( *t+At i ) (*(0) 1-1 [t+At ds + 2*' a //(x(Q) _ \ _ 1 ( ^ G _ <rr Vr(x(0) 1 v I<7i + A t+ 7 2 dw(s), Jt \ 2°"' + A, + —(J(x(t)) - r(x(t))) J It Ax(0 A \ r A* + Aw(f), ^(/(x(<))-r(x(i))) > Aw(0 ~ JV{0,IA*}. If we use this approximation and the minimum distance function to estimate the parameters of the process - oy, o\, p, a and of + o-,Aj - the results are the parameters shown in Table B . l l . But is the simple linearization assumption valid? We can make a quick check by taking the parameters estimated for the October 1970 to October 1982 period and repeating the exercise we followed in the one-dimensional example above. There will be a slight difference, however, since we are dealing here with a twodimensional case. If we start at the point r = 0.08 and / = 0.10, our first task is identifying reasonable end of period points. If we let S be / 5 = GAx(<) - a //(x(Q) _ \ _ *' WW) / \ 2 r { fa + A, + -(/(x(<)) - r(x(*)))) 151 ^ then the Mahalanobis distance of any end of period point can be computed by calculating Table B.12 shows the results of this exercise for various points which have a Mahalanobis distance of 2 from the expected end of period point (calculated using the simple linearization assumption). We see tfeat the beginning and end of period drifts are radically different. For example, the first row tells us that the point r = 0.0626, / = 0.0964 is not an unreasonable end of period point. At that point, however, the end of period drift for the first component of x is 3 times what it is at the beginning of the period. For the ending point r = 0.10, / = 0.1083 it is negative 20% of the beginning of period drift value. Apparently, the simple linearization assumption does not hold for the BrennanSchwartz model parameters in Table B . l l . In order to investigate this further, the parameters were reestimated using an assumption similar to the simple linearization assumption. If the process drift is in fact close to constant over the period At, we should see no difference in parameter estimates whether we use the beginning of period drift, end of period drift or some convex combination of the two in our estimation procedure. Ax{t) ~ G At + Aw(t) , §0* /• = (1 - + Aj + - ( / * - r*) f)l(x(t)) + //(*(* + A i ) ) , 0*1 r* = (1 152 f)r(x(t)) + fr(x(t + A*)) That is, we should find the same parameter estimates no matter what value of / between 0 and 1 we use in the estimation procedure above. As might be expected from our analysis, the parameter estimates do in fact change as / is varied. Table B.13 shows, however, that only or and of + O-JAJ are affected. The values of o>, <rj and p remain practically the same for the three values of / zero, one-half and one - that were used. Since the estimation of a is not reliable when using the simple linearization method, I decided to estimate it in the next stage, along with the market price of short rate risk, A R . I did, however, decide that the simple linearization estimates of c , 07 and p were stable enough to justify r their use in later stages of the study. The parameter values for the intermediate case, / = 0.5 were used in the rest of the study. B.5.3 E s t i m a t i o n of the P r i c e of R i s k and Reversion Coefficient a As was illustrated in the previous section, it is not possible to estimate the reversion coefficient a of the stochastic process for the short rate of interest r by analyzing a short and long rate time series. The standard deviations and correlation - oy, a\ and p - for the joint process in r and / appear, however, to be stably estimable from these time series. For these reasons, I decided to estimate the reversion coefficient at the same time as the market price of short rate risk A P , which for simplicity I assume to be constant. 153 B.5.S.a MINIMUM DISTANCE ESTIMATOR The only thing preventing numerical solution of the asset pricing partial differential equation is the lack of values for A r 5 2 and a. All other parameter values have been estimated. The procedure in this section, therefore, is to try various values of A and a with the aim of finding the 'best-fitting' pair of values. For each r pair of values tried the partial differential equation must be numerically solved for discount bond values, and theoretical coupon bond values calculated as portfolios of discount bonds. These theoretical prices are then compared to actual market prices, giving a large vector of bond pricing errors, et , for each month of the estimation period from October 1970 to October 1982. Assuming multivariate normality and lack of serial correlation in bond pricing errors EN = O, E M I = {||; = we would choose the pair of A and ct which give the best fit between theoretical r and actual bond prices by minimizing the distance function 53 T- D = ^e't Set . t=i The 'best' values of A and a are those resulting in the minimum value for this r function. 54 Assumed constant, as mentioned above. As mentioned previously, when S is unknown and is estimated from the sample the distance function is asymptotically maximum likelihood. See Malinvaud (1966) and Phillips (1972). I found that the estimates of A and a were different if serial correlation was or r 154 B.5.S.a.i Portfolio Formation Schemes We cannot, however, estimate the above distance function directly by using individual bond price errors. There are two problems with such an approach. First, since the maturity of each bond decreases as we follow its time series of prices, presumably the variance of its pricing residuals also changes. This would lead to non-stationarity in the covariance matrix S. Second, individual bonds do not have prices spanning the entire sample period, resulting in a missing data problem. Both these problems are ameliorated by combining the bonds into portfolios according to maturity, and using these portfolios to estimate the unknown parameters, A and a. r In this study three different equally-weighted portfolio formation schemes were compared. Scheme number 1 places all bonds with maturities between 0 and 1 years into portfolio 1, between 1 and 2 years into portfolio 2, and so on until the was not deleted from bond pricing errors. As the main purpose of this study is the pricing of options on bonds, not simply the pricing of bonds, I decided not to deleting the serial correlation from bond pricing errors. The reason is that deletion of serial correlation from the pricing errors when estimating bond prices logically requires a similar correction of theoretical bond prices in later stages of the study. For example, if we were testing trading strategies based on differences between theoretical and market prices, as our theoretical price we would use this period's model price corrected for serial correlation by using the previous period's pricing error and an estimated correlation coefficient. In this study, however, we use the theoretical bond prices to form boundary conditions when solving the partial differential equation for option prices. In this context there is no previous period pricing error that can be used to correct the model prices for serial correlation. Since there is no way that this correction can be applied when pricing these options, I decided to ignore the correction when pricing bonds also. As a result, the model bond prices which I obtain are as close as possible to the actual market bond prices. Hopefully this improves the accuracy of the option prices obtained in later stages of the study. 155 tenth portfolio which contains bonds maturing in 9 to 10 years. This is the scheme used by -Brennan and Schwartz and, as can be seen in Table B.14, there are some problems with scarcity of data in portfolios 8, 9 and 10. In addition, bonds with maturities greater than 10 years are totally ignored. It is probably not a good idea to ignore the higher maturity bonds in this study, as these are typically exactly the bonds which options are written on. In order to address this problem an additional portfolio containing bonds of maturities between 10 and 20 years was added in scheme number 2. This still does not correct the scarcity of data in the 7 to 10 year maturity range, however, so in portfolio formation scheme number 3, the three portfolios previously containing the 7 through 10 year maturity bonds were combined into a single portfolio. In addition, because of a relative abundance of data in the 10 to 20 year portfolio in scheme 2, in scheme number 3 it was split into two portfolios. For the reasons given above, I feel that the third portfolio formation scheme is the most desirable. The other two have been included here to examine how sensitive the estimates of A and a are to the particular portfolio scheme used. r B.5.S.a.ii Covariance Matrix Assumptions 156 The formation of bond portfolios avoids the problem of having a non-stationary covariance matrix, but still leaves the problem of actually estimating the covariance matrix. In order to test the sensitivity of the estimates of A and a, once r again three different approaches were taken. The first approach corresponds to an ordinary least-squares regression (OLS) of theoretical portfolio prices on observed prices and is accomplished by assuming the covariance matrix is the identity matrix. (OLS) S = I, The second approach applies an ad hoc heteroscedacity adjustment with the assumption that the variance of a portfolio's pricing errors is proportional to the average maturity of the portfolio. S = H, H = diag(A,), hi = average maturity of portfolio t. (HETERO) Finally, the third approach applies generalized least-squares methodology, approximating the covariance matrix by using the portfolio pricing errors. 1 T (GLS) t=i This last method is asymptotically maximum likelihood. B.5.S.b PARAMETER ESTIMATES Table B.15 shows a selection of the values of values of A and a tested in the r two-dimensional non-linear search for the minimums of the 9 different distance 157 functions. Since there were only two unknowns, the search procedure was begun by selecting a large number of initial (A , a) points with the intention of sketching r out the overall shape of the surface that was being searched. The first thing to note from Table B.15 is that the market price of short term risk is positive, contrary to the expected negative value found by Brennan and Schwartz. This is independent of the portfolio formation scheme and the covariance matrix assumption. 65 This is unexpected, but not impossible. The theory section tells us that the expected return fi of a discount bond 6(r, /, r) in the Brennan-Schwartz model satisfies fi — r = a A = \[Srcr rX r r + £jo-j/Af]. 6 We would expect higher risk to command a higher return, and since the choice of the form of the Brennan-Schwartz model used here links higher interest rates with higher interest rate variances and hence higher risk, we would expect higher returns at higher levels of interest rates. Since we would expect the price of a discount bond to decrease with increasing r, that is, 6r < 0, the only way to have fi increase as r increases is to have A negative. r This argument is quite persuasive, but the numerical solution points out that it is also quite incorrect. The price of a discount bond does not always decrease as we increase the short rate of interest r. In some areas of the r and / grid we find Also independent of whether the serial correlation of bond pricing errors is taken into account or not. 158 SR > 0. The key to understanding why the numerical solution is correct and the argument incorrect is the fact that while r is a yield on a discount bond, / is not. It is a yield on an asset which pays a perpetual continuous coupon. When we take the partial derivative of the discount bond price with respect to r, we are doing so while holding / constant. This is not the same as holding the long rate of interest constant, but should be imagined instead as keeping constant a weighted average of all rates, from the shortest to the longest. As a result, when we increase the short rate the only way to keep this weighted average constant is to decrease some other rate or rates. When we increase the short rate the value of consol coupon payments relatively close to the present decrease, and the value of other payments further in the future must increase to keep the value of the consol constant. But, this means that as we increase the short rate r, some longer term discount bond must increase in value. That is, for shorter maturity discount bonds we, of course, find 6 negative, but as we increase r the time to maturity we eventually find 6 positive. As is clear, if try to use the r argument above to establish the sign of A , this ambiguity in the sign of S leads r r to an ambiguity in the sign of A . r Once the initial sketch of the surface was finished, the most promising areas were investigated for a minimum by assuming that the surface could be treated as locally parabolic. 56 The method consisted of finding the minimum point of this fit- The exact form used was z(r, /) ~ a 0 0 + ai0 r + aQ1 l + a^or where z is the height of the surface. 159 2 + an rl + a02 l 2 ted parabolic surface, evaluating the distance function at this new point, refitting the parabolic surface and so on until convergence was reached. It turned out that the minimum was in a long, very narrow trough running through the surface. 57 As a result, the distance function value for the final parameter estimates differs only slightly from its value farther along the trough for different parameter values. 58 Reporting the standard errors of these estimates would, therefore, be misleading, as the trough indicates a high degree of correlation between the parameter estimates. What is more promising is a comparison of the optimal values of A and a for the r three different portfolio formation schemes and three different covariance matrix assumptions. As shown in Table B.16 the estimates appear relatively insensitive to these different assumptions. The optimal parameter values for all nine of these treatments lie within a factor of two of each other. For the rest of the study, I decided to use the estimates derived from using the third portfolio scheme and the G L S covariance matrix assumption, namely, A = 0.260 and a = 0.558. r This is apparently not uncommon. See Marquardt (1963). This is not the reason for A being positive. The trough lies almost completely in the quadrant where both A and a are positive. r r 160 B.6 PRICING MODEL ERRORS The numerical solution of the asset pricing partial differential equation only supplies asset prices for the actual grid points used. For points lying between these grid points some sort of interpolation must be used. When interpolating bond prices, I used the simplest interpolation method: linear interpolation. This is the same method as was used successfully by Brennan and Schwartz, and is successful because of the small curvatures present in bond prices as a function of r and /. When options are valued, however, the curvatures are much greater, and linear interpolation may produce a poor fit. To investigate this possibility, I compared three different methods of interpolation when pricing options, linear interpolation, cubic spline interpolation and a^form of quadratic interpolation. B.6.1 Linear Interpolation By far the easiest interpolation method is the one used by Brennan and Schwartz to interpolate bond prices: linear interpolation. Quite simply, if we have prices at the four points ( r , / „ ) , m = t, t + 1, n = j,j + 1, r m 161 t + x - r,- = A r , l - - l = 3 +l 3 Al, then we can linearly interpolate the price at any point inside this rectangle as P(r, 0 = (i - /)(i - g)Pij + (i - f)gPi,j+i + / ( i - g)Pi+i,j + fgPi+u+u ri<r<ri+1 , Pit = (,•</</y+i, P[ri,lj), 0 <J / = ^ < l , - Ar ~ o < «y = ^ ' ~ < _ i A/ This is the same as using a bilinear surface 59 . to interpolate over the rectangle. In order to estimate the first partial derivatives at the point (r, /), we need several other data points. P R ( R ' 0 - 2A7 = 2A7 [ ( 1 [ { 1 60 9)Pii " ~ " ~ + ( 1 9){Pi+iJ + / ( l -g)(P 2j i+ " V gPi <i Pi ij) ~ ~ + +1 + / ( 1 ~ d Pi+ l<>- ^(Pi+ i,i+i - Pij) + fg{Pi+,j+ i + ~ p f3 i+w\ Pi-ij+i) - Pi,j+-i)], which simplifies to the desired expression at the nodes (r,-, lj) p (r . i.) ~ J*!Lp.. " ' ^ - 2 A r ~ n i r , J Pi+i,j-Pi-UJ 2Ar The partial derivative in the / direction is approximated similarly. This is the simplest method available, and it will be successful if the curvature of the surface being interpolated is not too great. A bilinear surface satisfies the equation z(x,y) = a0 o + aiox + anxy + a i y and is so named because a cross-section of this surface in either the x or y direction is a straight line. The points in the r direction are assumed to be separated by a distance A r , and those in the / direction by Al. 0 162 B.6.2 Cubic Spline Interpolation Probably the most popular form of curve interpolation is interpolation using cubic splines. Basically, if we start with a one-dimensional grid of points, r,-, and prices at these points, Pi, the interpolation is done by fitting a separate cubic function between each pair of points. That is, between r,- and r , + i we would fit the cubic function 3 2 P{r) = ai + 6,r + c t r + dt r , r, < r < r , + 1 . Actually, the function that is fitted is the cubic spline ^) = , ( r ^ ) % , + 3 ( ^ ) l ^-'»(^r-) '-'' (ir)' + Ar 2 s r r +(Pi+ s +l) I „ *i = -Q~ i s r («')» r « < r < r,+1. By 'fitting' a spline between adjacent points, I mean that the first and second derivatives of the entire curve are made continuous. Pi = Si(n) = Si^(n), S'i(n) = SUin), S'/in) = s^fa). This is accomplished by finding the appropriate values for the second derivatives,5,-', at the grid points. 61 This method is perfect for interpolating between a large number of points on a line, but of course the computation required increases as we increase the number 6 1 See Vemuri and Karplus (1981) regarding the use of cubic splines for interpolation. 163 of points. We could use a two dimensional version of the cubic spline interpolation method and compute the interpolating splines for the entire r, / grid at each time step, but since we need to interpolate at most one point at each time step this would be an expensive proposition. Instead, I decided to use one-dimensional cubic spline interpolation in the r direction followed by one-dimensional cubic spline interpolation in the / dimension. For each interpolation, in order to reduce computational requirements, I would use only a four by four grid of data points. B.6.3 Quadratic Interpolation As an alternative to cubic spline interpolation I wanted to try interpolation with a quadratic curve. In keeping with the cubic method, I also wanted the method to only use a four by four grid of data points. The quadratic interpolating polynomial used was Q(x) = a 0 + ai(x - r) + a (x 2 2 - r) , where what is desired is the interpolation function value and first derivative at i = r, that is, P(r) = Q(r) = a 0 and P'{r) = Q'(r) — o i . I also wanted the 164 quadratic to fit perfectly at the points r,- and r Pi = Q(r.) =a0 f + i. + ai(ri - r) + a (r,- - r ) = a - O i / A r + a / A r 2 2 0 JVn = Q(r»+i) = ao + ai(r,-+i - r) + a (r,+i - r ) 0 1 + a (l-/) Ar 2 2 2 2 2 = a -a (l-/)Ar 2 2 . 2 Ar y These two equations can be solved for the two needed unknowns P(r) = a0 and P'(r) = a x P(r) = a p' ( r ) = leaving us with only a = P ( l - /) + J W 0 t a i = ^±LZ^L_ a 2 - a /(l - / J A r , (i_2/)Ar, to estimate. 2 If the prices at the four points r -_j through r , t quadratic function, then the value of a Pi — Pi+i + P , ) / 4 A r . 2 + 2 2 2 2 = ^P" + 2 really were described by a would be exactly a 2 = (Pi-i — The closer the curve is to being quadratic (ie. having a constant second derivative), the better this approximation will be. Using this estimate of the second derivative gives u s P(r) = Pi(l -f) + Pi+l f P'(r) = Ar P, + 1 - Pi - 62 - i / ( l - / K i ^ i - Pi - 4 - 2/)(P,_ - Pi - P 1 If the curve is linear in this area, then P , _ i — P,- — P I+L t + P I + L + + P i + P, + 2 P, t + 2 + 2 ), ) is zero and the quadratic interpolation value is the same as the linear one. This one-dimensional interpolation method was extended to two dimensions usOnce again notice that the linear interpolation functions P(r) = P , ( l —/) + P , / and P'(r) = (P,-+i — P , ) / A r are contained in these quadratic approximations. As in the cubic spline case, the quadratic interpolation value is the linear interpolation value plus a 'correction' term due to the second derivative. + 1 165 ing a four by four grid of data points in the same manner used for cubic spline interpolation. B.6.4 Bond Option Pricing As mentioned in the pricing theory section, any asset value which is a function of r, / and maturity only will be described by the same asset pricing partial differential equation. The differences between the prices of these assets arises from their different boundary conditions. Since we will be valuing American calls and puts, the boundary conditions that need to be imposed a r e C{r,l,r,K)> max(0, B{r, I, T + 63 T ;C)-K), B with equality at maturity. Of course, in our numerical solution method we cannot impose this boundary condition at all points in time, simply because we are only solving for option values at discrete time points. The best that we can do is to approximate the option holder's right of exercise at any time by the right to exercise at certain discrete points in time, namely, the points where we calculate the partial differential equation solution. ' I only consider a call option on a specific bond here as the treatment of put options is analogous. There are no 'fixed maturity' bond options traded, only 'fixed maturity' treasury bill options. 166 At each time step, therefore, we first solve the partial differential equation ignoring the right to exercise. This gives us a preliminary option value C (TK). + We then allow the option holder to exercise his option, which gives us the option value we desire. + C(r,-, l3 ; r; K) = max[C (r,-, l0 ; r; K), B{r{ , l3 ;T + r ;c)-K} B This means, of course, that we need a complete grid of theoretical bond prices at each time step of the solution. Aside from the interpolation method to be used there are several other details that needed to be addressed. When close to maturity, option price functions have a zone of high curvature in the region where the bond price equals the exercise price (the region where the option trades at the money). In order to hopefully avoid problems arising from too large a spacing between grid points, I decided to reduce the time step size to one day as compared to the two weeks that I used when valuing bonds. In addition, since the option values at extreme points on the grid should be close to either zero or the value of the option when exercised, B(ri,lj,r + TB\C) — K, I tested to see whether the grid's range of interest rates could be reduced somewhat. While the numerical solution was unaffected when I reduced the maximum / value from 0.50 to 0.25, I was not able to do this with the r dimension. The resulting grid of r from 0.00 to 0.50 and / from 0.00 to 0.25 was used in pricing all options. B.6.4.a CHOICE OF INTERPOLATION 167 METHOD How then do we judge whether one interpolation method is better than another? One test would be a comparison of pricing errors for the three different methods, such as is presented in Table B.17. As expected, the linear interpolation method performs poorest, with a root mean square error of $1.70. Both the cubic and quadratic methods do substantially better, ending up with root mean square errors of $0.67 and $0.66, respectively. Actually, the improvement is even better than it would seem from Table B.17, since all three methods should produce essentially the same result when interpolating only a small distance from available data points. For example, if we have data points at r = 0.10 and 0.11, then we would expect the methods to differ more when interpolating to the point point halfway between these points, r = 0.105, than when interpolating to r = 0.101. There is a great potential for improvement in some of the pricing errors, and practically no possibility of improvement in others. Including the cases with little possibility of improvement decreases the apparent reduction in size of interpolation errors. 64 Judging from Table B.17 alone, there does not seem to be any reason to prefer cubic oyer quadratic interpolation. We have been mentioning all along, however, that we also need to be concerned with the first partial derivatives that come from our numerical solution of asset prices. These are needed in order to form arbitrage portfolios. Table B.18 shows For example, imagine the much simplified situation where half of the theoretical prices are not interpolated, but still give an error of $0.50, and the other half are interpolated and result in errors of $1.00 when linear interpolation is used. The average error is then '$0.75. The $0.50 errors are not affected by a change from linear to quadratic interpolation, but say that the $1.00 errors drop to $0.70. This is really a reduction of $0.30, but the average error only drops $0.15 to $0.60. 168 that the partial derivative of option prices with respect to r is not likely to cause much difficulty for interpolation methods. Only the results from the linear interpolation method are given, since the other two methods produce essentially the same results. As the last line of Table B.18 shows, the average change in bond option price from one r grid to the next - a separation of 0.01 - results in an average option price drop of $0,021. The maximum price drop was $0.71 and the maximum rise $0.60. Calls and puts have not been separated, as the results are similar for both. Notice that the derivative can be either positive or negative for both calls and puts whereas we might at first expect only negative slopes for calls and positive slopes for puts. This follows from the discussion above about the change in bond prices when r is changed while holding / constant. 65 As we saw, bond prices could either rise or fall as r increased. Consequently, the same can be said of option values. These figures contrast starkly with those in Tables B.19 and B.20, showing the partial derivative of bond call and put options with respect to / for the three different interpolation methods. The average drop in call option price for a 0.01 increase in / is $4.30 when linear interpolation is used - not anywhere near the $0,021 change that we saw in the r direction - with a maximum drop of $10.80 and a minimum drop of $0.10. The figures are similar for put options, except that prices now rise when / increases. The average rise is $4.50 for a 0.01 increase in /, with a maximum rise of $10.50 and a minimum of $0.20. In general, all three See section B.5.3.b. 169 methods still produce results that differ only slightly. There is one problem, however, which shows up in the first row of both tables. Unlike the effect of changes in r on bond prices, an increase in / while holding r constant should unambiguously cause a decrease in bond prices. This should cause a decrease in call values and an increase in put values. Unfortunately, we see in the first row of Table B.19 that both the cubic and quadratic methods produce an increase in some call option prices when. I increases. The first row of Table B.20 shows the same story for put options. Both the cubic and quadratic methods produce a decrease in some values. As this occurs only when interpolating options with one month or less to maturity, we can guess that the problem lies with the high curvature existing in the at the money region of the option price function. As the maturity of the option decreases, this curvature becomes more and more localized and abrupt, until at maturity we find a discontinuity in the first derivative. Apparently, the abruptness of this at the money region for very short maturity options causes trouble for the cubic and quadratic interpolation methods. In an attempt to correct this problem with the / partial derivative, I decided that if cubic or quadratic interpolation resulted in a option value below or above the two bracketing option values, then linear interpolation would be used. Similarly, if the signs of the partial differential with respect to I differed at the two bracketing points, once again linear interpolation would be used. The results of this adjustment are shown in Tables B.21 and B.22. As can be seen, the adjustment works for quadratic interpolation but not for cubic spline interpolation. 170 In fact, the adjusted partial derivatives for the cubic method are worse than the unadjusted values. For this reason, I decided to use the quadratic interpolation method in this study. B.6.4.b BOND OPTION PRICING ERRORS The net result of the entire process is displayed in Table B.23. As can be seen, on average the Brennan-Schwartz model leads to an average overpricing of $0.33 for the call option sample and $0.30 for the puts. There may be a slight rise in root mean square errors as time to maturity of the options increases, but there does not appear to be any pronounced pattern to the errors. In Table B.24 the same data are shown by the ratio of bond price to option exercise price (in, at or out of the money). The pricing errors seem to be slightly greater for in the money options versus out of the money options, but once again, the pattern is not very pronounced. Since we are pricing options on long term bonds, assuming a constant short term interest rate - as in the Black-Scholes model - may not have much of an effect on option values. At least, we might believe this before looking at the Black-Scholes 171 pricing errors in Tables B.25 and B.26. 66 6 7 Making this assumption increases the average call option pricing error from $0.33 to $0.57, almost double what it was using the Brennan-Schwartz model. 68 Similarly, the average put option pricing error rises to $0.55. As mentioned in the section on parameter estimation, the parameters that we have estimated may be non-stationary. If this is the case, it may be that we are using incorrect parameter estimates to compute theoretical prices. As has been shown many times with options on stocks, option prices are quite sensitive to the variance estimate used. To test this possibility, the variance and covariance parameters oy, cr, and p were estimated for the option testing period, October 1982 to October 1983, and the options repriced with the new estimates. These 'in sample' variance estimates were oy = 0.215, cr, = 0.132 and p = 0.193. Notice that while both <xr and p change from their 'out of sample' estimates of 0.448 and 0.512, respectively, cr, remains about the same. 69 As a result, we see in Tables B.27 and B.28 that the average Brennan-Schartz pricing errors decrease by half to $0.15 for calls and $0.14 for puts. Therefore, a good deal of the pricing error seems to be due to the use of 'out of sample' parameter estimates, not the solution method. The variance rate of the consol rate, cr,, was used as the required Black-Scholes variance estimate as in Brennan and Schwartz (1983a). Theoretically, the correct variance to use would be the variance of the price of the underlying bond. Since bond options are typically written on recently issued bonds, however, in general there is no past time series of prices to use in computing the variance. The method used in this study to solve the Black-Scholes model solves over a grid of r versus bond price. When using a two sample <-test, this difference is highly significant. Since cr, was used as the Black-Scholes variance estimate, no significant change occurred in the Black-Scholes pricing errors when using the 'in sample' variance estimates. 172 B.6.5 Treasury B i l l Option P r i c i n g The procedure for pricing treasury bill options is similar to the procedure used in pricing discount bonds and bond options. Once again, however, the boundary conditions are different. The underlying security of traded treasury bill options is not fixed, as it is for bond options. There, an option was written pn a specific bond outstanding, say, the 12% bond maturing august 15, 2013. With a treasury bill option, the security which must be delivered upon exercise is always a 13-week treasury b i l l . 70 The boundary conditions imposed, therefore, are those where the underlying security is a discount bond with a fixed time to maturity, r, C(r, I, r; K) > max[0,8{r, I, T) - K\, with equality at maturity. As with the bond options, we only impose this boundary condition at certain discrete points in time, namely, the points where we calculate the partial differential equation solution. The above boundary conditions are appropriate as far as theoretical option pricing is concerned, but in order to conform to reality we must make a small adjustment. When a call option is actually exercised, the parties of the contract do not have to settle up until several days after the exercise day. When we were pricing bond options, the slight delay of two business days could be ignored with little impact on option pricing. The reason is that the exercise price of the option includes accrued interest on the bond up to and including the settlement date. Therefore, any interest gathered by putting the exercise price amount in the bank for those This is discussed in section B.3.2.b. 173 few days must be used to pay the additional accrued interest on the b o n d . 71 The situation is different with treasury bill options, however, as is illustrated in the next section. B.6.5.a TREASURY BILL OPTION SETTLEMENT ADJUSTMENT We were able to ignore the delay between exercise and settlement dates when pricing bond options because the option exercise price included accrued interest on the underlying bond. Since interest does not accrue to the holder of a treasury bill, however, we find that a small adjustment must be made to the pricing method when pricing treasury bill options. Consider, for example, what happens when a treasury bill call option is exercised. In order to lock in his settlement date obligations the writer of the option must immediately buy a treasury bill which will have 13 weeks to maturity on the settlement date. His immediate cost at exercise, therefore, is the current cost of the underlying treasury bill, just as it would be if settlement occurred on the same day as exercise. On the other hand, the person exercising the option can place the exercise amount in the bank for those few days and earn the riskless rate on it. The immediate cost to him, therefore, is the discounted exercise amount, discounted by the riskless The only benefit that would result would be from differences in the bank rate and the bond accrual rate. Since options are typically written only on recently issued bonds, this difference sould be minimal. 174 rate of interest r. Since the exercise and settlement dates are between 6 and 10 days apart, the discounted exercise price may be significantly different from the undiscounted price. The market is well aware of this detail, as can be seen from market option prices on the date of maturity. At maturity, we expect the option price to be either zero or the treasury bill price minus exercise price. For example, our sample contains a treasury bill option price on the maturity date of December 17, 1982. The call exercise price was 92, which translates to $97.9778 per $100 principle value, and the option premium was 0.20, which corresponds to a price of $0.05 per $100.00 principle value. O n the same day, the underlying treasury bill was quoted at a bid discount of 7.81 (price of $97.89564) and an ask discount of 7.71 ($97.92258). Using that day's yield on a 27 day treasury bill as a proxy for the riskless rate gives us a bid yield of 0.076256 (bid discount 7.50) and an ask yield of 0.075033 (discount 7.38). There are 6 days between the exercise and settlement dates. We can now calculate the return to the following two arbitrage strategies: (a) Buy one call option, short one underlying treasury bill and exercise the option. Place the discounted exercise amount in an account bearing the riskless rate of interest. O n the settlement date, pay the exercise amount and close out the short position in the treasury bill, (b) Write one call option. Since the option has positive market value, assume that it is exercised, buy one underlying treasury bill and take out a loan at the riskless rate for the discounted exercise amount. On the settlement date, deliver the underlying treasury bill and pay off the loan. 175 The first strategy results in a loss of —(call price) — (disc, exercise price) + (underlying bond price) = -0.05 - 97.97778 x exp(-0.075033 x 6/365) + 97.89564 = -0.01137. Similarly, the second strategy results in a loss of +(call price) + (disc, exercise price) — (underlying bond price) = +0.05 + 97.97778 x exp(-0.076256 x 6/365) + 97.92258 = -0.01754. If we had not discounted the exercise price there would apparently have been arbitrage profits to be made. B.6.5.b TREASURY BILL OPTION PRICING ERRORS Brennan-Schwartz treasury bill pricing errors are shown in Tables B.29 and B.30 by time to maturity and the ratio of treasury bill price to option exercise price, respectively. The model overprices treasury bill options by an average of $0,033 for calls and $0,044 for puts. Once again, there does not seem to be any pattern to the errors. As was mentioned above, it is unfair to use the Black-Scholes method for pricing options on treasury bills. The assumption of a constant short rate of interest might have had empirical validity when pricing options on long term bonds, but is totally incorrect when pricing treasury bills. When we reprice our sample of treasury bill options using 'in sample' variance 176 estimates, as shown in Tables B.31 and B.32, the model no longer overprices options. Instead, it now underprices call options by $0,025 on average and puts by $0,015. 177 B.7 ARBITRAGE TESTS In the pricing theory section we showed that the same model used to price assets also tells us how to form an arbitrage portfolio. 72 If our modelling of reality is correct, then these theoretical arbitrage portfolios should allow us to take advantage of any arbitrage opportunities - assuming any exist. If our modelling is incorrect, then what appear to be arbitrage opportunities will in reality be due to theoretical mispricings, and should not lead to trading profits. The arbitrage procedures followed in this study consist of initially buying or writing one option contract. The option is bought if it is 'underpriced' by the market (i.e. market price less than theoretical price), and written if it is 'overpriced'. That is, if U\ is the number of option contracts bought, V\ = +1 if the option is 'underpriced' and V\ = — 1 if 'overpriced'. We then hedge this position in the option by going long or short in two other assets. The most natural choice for one of these assets is, of course, the option's underlying asset. The other asset chosen was a 5 year b o n d . 73 According to the formula derived in section B.2.2, the quantity of assets that should be bought to hedge the option position is given See section B.2.2. The second asset cannot be too close in characteristics to the first asset. If it is too close, then the hedging procedure may choose to take extreme offsetting positions in the two hedging assets. The five year maturity bond is sufficiently different from both the long term bonds underlying bond options (typically of 20 to 30 year maturity) and the treasury bills underlying treasury bill options so that it can be used in all the hedging portfolios formed in this study. 178 by ^ dl dl J ^ dl J where v\ and z\ are the quantity bought and (theoretical) price of the option, v2 and 22 are quantity bought and (market) price of the underlying asset and uz and z3 are the quantity bought and (market) price of the 5 year b o n d . investment arbitrage portfolio is completed by investing -(viZi 74 The zero- + v2 &2 + V3Z3) in the riskless asset. Two different arbitrage trading strategies were used. In strategy number 1, each of the 248 options is treated separately. The arbitrage portfolio is formed at each available option price observation and the dollar arbitrage returns cumulated for each option. These cumulative dollar arbitrage returns are then averaged over the 248 options. In the second trading strategy returns are not cumulated. The average of the roughly 3500 dollar arbitrage returns is given. The partial derivatives were calculated by fitting a quadratic curve to the price grids given by the numerical solution procedure (see section B.6.3). When pricing bond options, I used the value for / that is given by assuming both that the proxy for r is correct and that the theory correctly prices the underlying bond. The grid of theoretical bond prices is then examined for the value of / which is consistent with both of these assumptions. A similar procedure was used when pricing treasury bill options, except that the / proxy was assumed correct and the r value was found by examining the grid of theoretical treasury bill prices. 179 B.7.1 Bond Option Arbitrage Tests As explained above, trading strategy number 1 calculates cumulative dollar arbitrage profits, cumulated over the life of each option. When applying this strategy, trading was free of any commissions on trading, and bonds were bought and sold at the middle of the bid-ask spread. 75 Over the 248 options in the sample, the av- erage cumulative dollar arbitrage return was $1,297 when the Brennan-Schwartz model was used and $0,790 for the Black-Scholes model. 76 Both of these aver- age returns are significant, with ^-statistics of 9.8 and 6.3, respectively. 77 The difference between the two models, $0.51, is also significant, with a ^-statistic of 2.8. In Table B.33, however, we see that these apparent arbitrage profits are not large enough to cover reasonable transactions costs which would be incurred in implementing the strategy. The first line of the table shows that simply buying and selling bonds at the bid and ask prices instead of at the middle of the bid-ask spread reduces these profits from $1,297 to $0,771 (Brennan-Schwartz model). In order to weed out any incorrect data, I ignored an option price observation if the ratio of the theoretical and market prices differed by a factor of 10. The ratio of 10 was chosen as it seemed unlikely that correct data would result in market and theoretical prices differing by a factor of 10. Only a few observations were eliminated as a result of this test. As the difference between results for calls and puts is slight, only the aggregated results are provided here. To check for outliers, scatter plots of the arbitrage gain versus days to maturity and holding period were made. The plots showed no apparent outliers and also indicated that there was little, if any, relation between the size of the arbitrage return and the number of days to maturity. There was a slight relation between the size of arbitrage return and the length of the holding period, though doubling the holding period did not appear to double the return. 180 Adding bond commissions of 1/4% results in very significant losses from the arbitrage strategy. 78 The picture does not change if we use our 'in sample' variance estimates. Without charging commissions, and buying and selling bonds at the middle of the bid-ask spread we find an apparent cumulative dollar arbitrage return of $1,678, with a ^-statistic of 12.5. Once again, however, Table B.34 shows us that this is not large enough to cover reasonable transaction costs. When we turn to tine second trading strategy - where returns are not cumulated - we see in Table B.35 that once more there appear to be arbitrage opportunities when no transaction costs are charged. The average return in the first line of the table is $0,092, with a highly significant 4-statistic of 11.1. In computing the next lines of the table, we put restrictions on the formation of the arbitrage portfolio. Instead of arbitraging whenever we had an option price observation, we instead only formed the arbitrage portfolio when the difference between the theoretical and market prices of the option differed by at least the amount of the filter. This approach recognizes that no matter what model is used, some level of mispricing is inevitable. Because of this, it is quite possible that small differences between theoretical and market prices do not indicate real arbitrage opportunities, only theoretical mispricings. By imposing a filter, hopefully we improve the The arbitrage profits are more sensitive to bond commissions than option commissions because the arbitrage trading strategy results in much more bond trading than option trading when portfolio rebalancing is needed. 181 chances of recognizing real arbitrage opportunities. As we see from Table B.35, as the filter increases so does the average arbitrage return. The significance level of the returns stays approximately constant, however, due to the decrease of sample size. At all the filter levels the average BrennanSchwartz return is significantly larger than the Black-Scholes return. Once again, using 'in sample' variances increases the Brennan-Schwartz arbitrage returns, as shown in Table B.36. In all three of these models - Brennan-Schwartz, BlackScholes and Brennan-Schwartz ('in sample' variance) - Tables B.37 and B.38 show that these apparent arbitrage profits disappear when we buy and sell bonds at their bid and ask prices and charge reasonable commissions. B.7.2 79 Treasury B i l l Option A r b i t r a g e Tests The same tests were applied to the treasury bill option sample as were used in the previous section. Because of the more limited sample size, however, the results were much less significant. For example, the cumulative dollar return for trading strategy 1 averaged over the 31 available treasury bill options (no commissions charged, treasury bills and bonds bought and sold at the middle of the bid-ask spread) is $0.2070 with a ^-statistic of 1.3 when using the BrennanSchwartz model, increasing to $0.2753 with a ^-statistic of 1.4 when 'in sample' 80 • A filter of zero is used in Tables B.37 and B.38. As mentioned above, the Black-Scholes model is not suited to valuing treasury 182 variances are used. Tables B.39 and B.40 show, however, that these apparent arbitrage returns are not large enough to cover the transaction costs incurred by the arbitrage strategy. We see the same pattern when looking at the results of trading strategy number 2 (returns not cumulated). Tables B.41 and B.42 show that using a filter to restrict formation of the arbitrage portfolio does increase the apparent arbitrage profits. The inclusion of transaction costs in Tables B.43 and B.44, however, shows that these apparent arbitrage profits disappear when reasonable costs must be borne. bill options. The values in Tables B.43 and B.44 are computed using a filter of zero. 183 81 B.8 SUGGESTIONS FOR FURTHER RESEARCH As the involved procedure of the Brennan-Schwartz model was being performed, I noted down possible improvements in the method that might bring about an increase in pricing accuracy. These suggestions cover all aspects of the procedure. B.8.1 A n a l y t i c Solution of a Schaefer-Schwartz Stochastic Process This and the next several suggestions deal with the choice of a joint stochastic process similar in form to the process used by Schaefer and Schwartz (1983). In their paper, Schaefer and Schwartz note that the long rate is empirically uncorrected with the spread between the short and long rates. Because of this, they propose using a joint process which is in terms of the spread, s = r — /, and the consol yield, I. ds = Pi dt + ai dwi dl = p2 dt + o2 dw2 E[dwi] = E[dw2 ] = 0, dwv dxv2 = 0 The advantage of this formulation is that there is no correlation between the processes for s and /, which simplifies the asset pricing partial differential equation by eliminating the cross-derivative term in zt {. 184 The specific joint process form proposed here da — — a) dt + a(n <r dwi, s dl = l(<rf + ciXi — s) dt + <Til du)2, is slightly different from the one used by Schaefer and Schwartz and allows an analytic solution for the transition probability function. If we make a transformation of variables to x = s/cr and y = ln(/)/o"/, the result is the homoscedastic t process dx•= a( — — x) dt + dw\, dy = (ai + Xi -x) dt + d\V2- Notice that the stochastic differential equation for x does not contain any reference to the variable y, and can therefore easily be solved. x{t) = z(0) + [1 - e x p ( - t a ) ] (jf- - i ( 0 ) ^ + J Using this expression for x, we can solve for y . exp[-(< - s)a] dw^s) 8 2 l l (T C r y(i) = y(0) + {(Ti + X )t - — / x{s) ds + dw {s) <*i Jo Jo t 2 We see from the process in x that / x(s)ds= Jo f Jo — ds — — dx(s) + — dwi (s)\ L°"« = JLt (T s <* <* J ~[x(t) - x(o)\ + - / a J or dwM 0 = ±t - l[l - exp(-ta)} (Jf- - x ( 0 ) ) + - / [1 - e x p ( - ( * - s)a)\ dw^s), <* Jo 8 2 I assume that the market price of long term risk function Xi is constant. The analysis is similar if Aj is a linear function of x and y. 185 which allows us to finish our solution for y. y(t) = y(0) + (a\ + A, - —^ t + — [ 1 - exp(-ta)} (^- - x(0) \ ai J aai \as , / act J [1 - exp(-(< - s)a)] dwi(s) + / dw2 (s) J 0 0 We see, then, that the expectations and variances of x and y are analytically solvable. E[z(0] = x(0) + [1 - exp(-ta)] (j± - r(0)) E[y(0] = y(o) + U + A, - \ var[x(£)| = / acr, exp[—2(f — s)a] ds Jo = i + £r(E[x(0] - *(o)) cr, / -^[l-«p(-2ta)] cov[x(t), y(t)] = cr /** —/ exp(-(* - s)a)[l - e x p ( - ( £ - s)a)\ ds OLCl Jo [1 - e x p ( - t a ) ] 2a, a? 2 -t fe) ^ [ 1 - exp(-(< - s)a)] ds + j f ' ds 2 tr[y(01 = 1+ 2 2 =l fe)1 * - h fe) 1 1 - e x *-<«)][3 - With observations spaced A t apart, we have E[3i] = +(1 - £ ) ( / * E[ln(/,)] = l n f t - O + (cr, +<r,A, - JI)A< + 2 var[As] = £ ( l cov[A ,Aln/] = 3 var[A ln l-«5. a {p - s,^), 6% ^ ( l - S ) = (af + 2 , ^ ( 1 - 5)(3 - 6), 186 e « ) ] where 5 = exp(—Ate*). These expressions can be used to maximize the likelihood function T t'=2 _ ( 3{ - E[s,-] \ * " V ln(/ ) - E[ln(/,-)l)' e t _ / var[As] cov[As, A In/] \ ^ ~ V cov[As, A ln J] var[A ln /] ^ to find the 'best-fitting' parameters. B.8.2 SimultaneoTis Solution of F i r s t P a r t i a l Derivatives If in addition to the partial differential equation which we solve for the price we had two other partial differential equations which we solved for the two first partial derivatives, then perhaps accuracy of the interpolation methods used would increase. That is, say that we are using a Schaefer-Schwartz stochastic differential equation in s and / such as the one discussed above. Since we assume that there is zero correlation between the processes for s and /, the cross-derivative term in zs i does not appear in the partial differential equation for z. The partial differential equation, therefore, takes the form 187 Taking the partial derivative of both sides of this partial differential equation with respect to s or I gives us two more partial differential equations, this time in terms of za and zj. —Hz d ds = Ha z d —Hz dl = Hiz + Hzi = (zi)r , <9 3 H, = a — • + b9 — 2 + Hz, = {zs )T , a os d d + da — - 1 os dl 2 dl* + cs — <9 3 d d H( = a , —2 + b{ — + c,— + dt — - 1 ds dl* ds dl 2 2 These three partial differential equations form a system of equations Hz z , H= r / H \ H„ 0 H 0 \ 0 , z= (z z9 ) , with the solution z(r,/, r) = exp(rH)z(r,/,0). If we split the operator H in a manner reminiscent of the Douglas-Rachford procedure H = S + L, S= (S \SS \Si 0 S 0 / L O \ L, \Li L 0 0\ 0 , L= SJ O 0 L with and similar definitions for S9 , Ls , Si and Li, we can replace exp(ArH) = (i - ^Ars) (i - \ArLJ exp(ArH) with * (i + i A T L ) (i + ^Ars) + 0(Ar ). 3 This means that we can approximate the solution z by (l - \ATLJ (l - \Ars) z(r, /, r + Ar) = (i + 188 ±ArLJ (l + ±Ars) z(r, /, r), which can be expressed in Douglas-Rachford alternating direction form as ( l - i A r L ) « * ( r , I , r + Ar) = ( i + i A r ( H + S)) i ( r , /,r) = a(r, J,r), ( i - i ATS) i ( r , /, r + Ar) = z*(r, /, r + Ar) - i ArSs(r, /, r) = flr, /, r). With the system in this form, we can make a final simplification to eliminate some of the partial derivatives in the operators above; For example, in the solution for z*(r, 1,T + Ar) it can be shown that , ( l - ^ A r L ) * ( r , / , r + Ar) Z fl 0 0 1 29 (6$ + d) \ -2'(*^Jr+ & + 5 0 l-20(&j£ + (6i + <O|f + 4) j = r), oc(r,l, where 0 = l + rA*/4' This can easily be converted to the more convenient upper triangular form 1 -0 -9 ° 1 0 °\f( I - -1 A r L 0 1) \ > 2 1 0 = <f>~ \0 0 1 0 1 = -29(b§i \ a<2-9cti | , a - 0c*i 3 189 + d)\ -20u l-20u 23 3 3 (z* where 2 «33 <9 d = bjp +(bt + d-6b) — + {di -9d). This reduces the entire solution of z* to the solution of a differential equation in (1 - 20u )z,* = <f>(az - Octi) 33 Once the tridiagonal system resulting from the finite difference approximation to the above equation in z\ has been solved, the values of z* and z* are simply calculated as z* = <j>ctx + 29 (b^ + d^j z\ z* = <P(a2 - 9cti) + 29u2z z\ Given the solution for z*, we can proceed similarly to solve for z using (I - | A r L ) z = (3. This procedure may result in better estimates of the first partial derivatives of the solution, and improve interpolation accuracy. B.8.3 Asset Pricing by Risk-Adjusted Expectation The amount of memory required by the alternating direction method for a large grid size is quite large, and may exceed that available on, for example, a microcomputer. There is an approximation that can be made to simplify the solution procedure and reduce the amount of memory needed. The question, of course, 190 is whether or not the approximation results in a poor numerical solution to the partial differential equation. As Cox and Ross (1976) have shown, the value of an asset described by the models we are dealing with can be expressed as the expectation of a discounted value z(s, /, r) = E r(u)du J z(s, /, r - A T ) exp , where the expectation is taken with respect to the risk-adjusted stochastic process for s and /. That is, if we were using the stochastic process ds = a(n — dl = s) dt + as dw\, / ( o f + (T1X1 — s) dt + ail dw^, the risk-adjusted process would be ds = [a(fJL - s) - a,\s ] dt + at dwt , dl = / ( o f — s) dt + (T{1 div?. As was shown in a previous section, the transition probability function for this riskadjusted process can be solved analytically. If we change over to a homoscedastic system by using the new variable ln(/) instead of /, then the elements var[As], cov[As, A In/] and var[Aln/] will be independent of s, ln(/) and r. Then, making the approximation 2(3, /, r) ~ exp - ^ A r [r(r) + E(r(r allows us to compute the discount factors 191 - Ar))] j E[z(s, I, r - Ar)] only once for each grid point. The calculation of the expectation E[z(a, /, r - A r ) ] can be considerably simplified, along the lines of Brennan and Schwartz (1978). Basically, that paper illustrated how the transition probability function resulting from, a continuous time stochastic process could be approximated by a set of jump probabilities where the jumps are restricted to be to the grid points available. That is, if we had a mean-zero normal process, x ~ NlOjO .}, instead of computing averages and variances as -2 var we use the discrete jump probabilities p - = pxqx If where p - is the probability of Zf jumping i grid points, that is from 0 to * A x . oo E[/(*)] = £ /(* + iAx)Vxi «=—oo oo var [/(*)!= £ [f(x + iAx)-E(f(x))} 2Pxi 1= —oo In order to be interpreted as probabilities, the sum of the p - must equal 1. In XI addition, we must ensure that the variance of x is, in fact, equal to cr .. These two 2 192 restrictions uniquely determine the two unknowns px and q. oo oo Yl £ t=l \ i=—oo t=—oo oo l q J =Px ( l + 2 . ^ g i Y Pxi=Px var[x] = / l [,-A«] p„- = 2 A x 3 2 ( 1 ^ ) i=l »"= —oo 7 2 - 2 (l + , fvg« qx + 1 = 0 Ax 2 , // Ax \ 2 , 2 ^x Ax 2 / / Crf Since we want qx to be between 0 and 1, we choose Ax 2 / al / When we have two independent mean-zero variables we can approximate the expectation of z(x, y) as + oo E[z(x,y)] = +oo Y Y PxiP y2(x y t'= — oo j——oo 193 + tAx,y4-;'Ay), where pyj - = py qy ' and py and qy are defined similarly to px and qx . If we define the sums + a(x,y) = a {x,y)-z{x,y) + ct-{x,y), +00 +00 ct+{x,y) = £ g ' z ( z , y + ; A y ) , a~{x,y) = £ 3=0 ^ ( z , y - j Ay), 3=0 + 0{x,y) = 0 {x,y) - a{x,y) + / T (z,y), +00 +00 £ ( z , y ) = ^Tqx a(x i=0 + + iAx,y), 0~{x,y) = £ c £ a ( z - i A z , y ) , t'=0 then we can express E[z(z, y)] as E[z(z,y)] = px p 0(x,y). y In addition, there are simple recursion formulae for computing a and 0. + + a (x, y) = z(x, y) + qy a (x, y + Ay) <x~(z,y) = z(z,y) + gj,oT(x,y - A y ) + + 0 {x, y) = a(x, y) + q 0 {x + Ax, y) x P~(x,y) - a{x, y) + q 0~{x - Ax, y) x This general idea might be used for our problem. The key lies in separately handling the expected drift and variance terms in the stochastic process. That is, we approximate E[z(z(r), y(r), r] ~ E [z (E[z(r - Ar)], E[z(r - Ar)], r - A r ) ] , where both E and E are expectations with respect to a homoscedastic stochastic process, but E is an expectation ignoring the drift of the process. That is, we are 194 assuming that the variance and drift terms of x and y are independent and can be considered one after another. First we take account of the drift, acting as if the variance was zero, and then we take account of the variance as if the drift were zero. Because of the simple recursion relations defining a and 6 above, computational requirements of this method are of the same order as numerical solution using the alternating direction method, namely, 0(1 J) for an i x J grid. Similarly, memory requirements are also O(IJ). This is, of course, quite desirable, but even more desirable is the fact that the memory requirements are small when compared with the alternating direction method. This approach requires only four I x J grids, one for the solution at a particular point in time, another for calculating or and 0 and two others for storing the expected drifts in the x and y directions. This method should be compared with a standard partial differential equation solution method to see whether the its approximations affect the solution values to any marked degree. B.8.4 Testing for Instability In this study, I assumed that my numerical solutions were relatively unaffected by instability, even though the conditions for stability could not be shown to hold. One way to test for possible instability in the numerical solution of, say, a bond 195 call option, C(r,/, T ) , would be to also solve for the difference between the bond price and option value, namely, X(r, /, r) = B(r,l,r + Tr?; c) — C(r, /, r; K). If the values of C from these two approaches differ too greatly, then it would have to be due to some instability in the system. Alternatively, a more traditional approach could be used. The more common method is to first numerically solve for C(r,/, r + A r ) using the values from the previous step, C(r, /, r), and a single time step of size A r . Then, we once more solve for C(r,/, r + A r ) , but this time using two time steps of size ^ A r . If the difference between the two values exceeds an allowable error tolerance, the step size is decreased until the tolerance is acceptable. If properly done, these two methods should require the same-amount of memory. The traditional method, however, requires an additional numerical solution per time step. B.8.5 G r i d Spacing and B o u n d a r y Conditions The numerical methods used in this study were all developed assuming that the underlying r, / grid had regular spacings of A r between points in the r direction and A / in the / direction. The accuracy of our procedures could probably be 196 greatly increased by using an irregularly spaced grid instead, placing more grid points in the areas of high curvature of the solution (the at the money region) and allowing larger spacings between points elsewhere. A n attempt in this direction, however, may lead to difficulties. If the spacings are made too large in some parts of the grid in order to allow closer spacing elsewhere we may find instability showing up in the solutions. Perhaps a related problem could be addressed at the same time, namely, the fact that we are not interested in what values the numerical solution takes on in large areas of the grid. As our interest rate data show, r and / are seldom very far apart, which means that only a relatively small region surrounding the diagonal r = / is of interest to us. If we transformed variables in order to exclude some of the uninteresting areas, presumably we would be able to move our grid points closer together. For example, we could use the Schaefer-Schwartz joint process for s and / instead of the Brennan-Schwartz process for r and /. This automatically places the solution grid along the diagonal, r = I. A n alternative to this would be to use the variables which transform the stochastic process used in this study to a homoscedastic f o r m , 83 namely, Since G is only determined to within a rotation, we can choose a G such that the x grid excludes most of the off-diagonal area included in the r, / g r i d . 84 See section B.5.2.a. In this study, on the boundaries where r and / are at their largest values the boundary conditions zrr = 0 and zu = 0, respectively, were used. If these bound197 B.9 SUMMARY AND CONCLUSIONS In this study the Brennan and Schwartz two-factor model, which has previously been used to value bonds, was applied to the valuation of options on US government bonds and treasury bills. The results were promising, and suggest that the model prices for options are accurate enough for practical purposes. Several innovations on the Brennan-Schwartz methodology were introduced in this study. One of these was the due to an examination of the 'simple linearization method' used by Brennan-Schwartz (1983b) and Ananthanarayanan (1978) in estimating the parameters of the joint process in r and /. For the particular data and process forms used in this study, this method was found to be unsuitable for the estimation of stochastic drift parameters, although estimates of the variance and covariance parameters appeared to be well estimated. This problem was addressed by estimating the drift parameter required, a, at the same time as the market price of risk parameter, A . That is, a two-dimensional search was r performed to find the pair of values, (a, A ), which minimized an appropriate r distance function. The theoretical bond values required at each iteration of the non-linear search procedure for finding the best-fitting pair, (a, A ), were calculated by numerically r aries are moved in closer to the diagonal, however, it may be that these boundary conditions become inappropriate. If we are solving a system of equations for the price plus its first partial derivatives, as in section B.8.2, we could easily impose the boundary conditions zrrr = 0, zrr i = 0, zr n = 0 and zm = 0, instead. 198 solving the asset partial differential equation derived in section B.2. A comparison was made between discount function values derived numerically by the successive overrelaxation method (SOR) used by Brennan and Schwartz and the alternating direction method used by Schaefer and Schwartz. For the range of parameter values tested, the S O R and alternating direction methods produced almost identical results. When the results differed, it was the.result of instability of the solution, an occurrence which appeared more often when using the S O R method. Because of its apparent greater stability in the problem at hand and its lower computational cost, the alternating direction method was used in the rest of the study. The estimation procedure for (a, A ) consisted of forming portfolios of bonds to r represent various maturities for each time point available, and then using an estimated covariance matrix of portfolio pricing errors in forming a weighted sum of squared bond pricing errors. Such a distance function was minimized for three different portfolio formation schemes and three different covariance matrix assumptions. The values of (a, A ) were found to be relatively similar for the nine r distance functions that were minimized. Once all needed parameters had been estimated, the numerical solution for bond and treasury bill option values was straightforward. In order to compare the numerical option values with actual option values, however, some form of interpolation was required. Because most of the option price observations collected were for options trading close to the money, the high curvature of the theoretical solution in this region resulted in poor correspondence between actual and theoretical values when using linear interpolation. The other two methods tested 199 - cubic spline and quadratic interpolation - also had some difficulty in the high curvature region, but the quadratic method in general gave reasonable values. Comparison of market and theoretical option prices showed that there were differences between theoretical and market values. When 'in-samp!e' variance estimates were used, the size of these errors decreased considerably suggesting that further research into the optimal length of the parameter estimation,period and possible use of a moving estimation period could result in better pricing results. One possible explanation for the remaining pricing errors is incorrectness of the market data used, and the arbitrage tests at the end of this study were designed to examine this possibility. If arbitrage profits had resulted from the tests, it would not necessarily have indicated that the market data were incorrect due to market inefficiency, although this is one possibility. Another possible interpretation would have been that there were apparent - but not necessarily realizable - market inefficiencies due to such factors as transcription errors in the data, non-simultaneity of bond and option quotations, or use of quotations at which trading could not have occurred due to thin trading. The close correspondence between theoretical and market prices, such that returns from arbitrage, while positive, were insufficient to cover reasonable transactions costs, suggests that the model prices are accurate enough to justify practical use of the model, and that realizable arbitrage possibilities promising positive returns after transactions costs would presumably be detectable. 200 B.10 REFERENCES Ananthanarayanan, A . L . , 1978, A stochastic specification of short term interest rate process and the pricing of extendible and retractable bonds, unpublished P h D dissertation, University of British Columbia. Ball, C . B . and W . N . Torous, 1983, Bond price dynamics and options, Journal of Financial and Quantitative Analysis 18, 517-531. Black, F . and M . Scholes, 1973, The pricing of options and corporate liabilities, Journal of Political Economy 8, 637-654. Brennan, M . J . and E.S. Schwartz, 1977, Savings bonds, retractable bonds and callable bonds, Journal of Financial Economics 5, 67-88. Brennan, M . J . and E.S. Schwartz, 1978, Finite difference methods and jump processes arising in the pricing of contingent claims: a synthesis, Journal of Financial and Quantitative Analysis, September, 461-474. Brennan, M . J . and E.S. Schwartz, 1979, A continuous time approach to the pricing of bonds, Journal of Banking and Finance 3, July, 133-155. Brennan, M . J . '.nd E.S. Schwartz, 1980, Conditional predictions of bond prices and returns, Journal of Finance 35, No. 2, 405-417. Brennan, M . J . and E.S. Schwartz, 1982, A n equilibrium model of bond pricing and a test of market efficiency, Journal of Financial and Quantitative Analysis 17, No. 3, 301-329. Brennan, M . J . and E.S. Schwartz, 1983a, Alternate methods for valuing debt options, Finance 4, No. 2, 119-137. Brennan, M . J . and E.S. Schwartz, 1983b, Duration, bond pricing and portfolio management, in: G . Kaufman, G . Biervag and A . Toevs, eds., Innovations in portfolio management: Duration analysis and immunization (JAI Press). Courtadon, G . , 1982, The pricing of options on default free bonds, Journal of Financial and Quantitative Analysis 17, 75-100. Cox, J . C . , J . E . Ingersoll and S.A. Ross, 1978, A theory of the term structure of interest rates, Research paper No. 468 (Graduate School of Business, Stanford University, Stanford, C A ) . . Cox, J . C . and S.A. Ross, 1976, The valuation of options for alternative stochastic processes, Journal of Financial Economics 3, No. 1/2, 145-166. Debreu, G . , 1959, Theory of Value (New York: John Wiley). 201 Douglas, J. and J.E. Gunn, 1964, A general formulation of alternating direction methods. Part 1. Parabolic and hyperbolic problems, Num. Math. 6, 428453. Douglas, J. and H.H. Rachford, 1956, On the numerical solution of heat conduction problems in two and three space variables, Trans. Am. Math. Soc. 82, 421-439. Fairweather, G. and A.R. Mitchell, 1967, A new computational procedure for A.D.I, methods, SIAM Journal 4, No. 2, 163-170. Maliaris, A . G . and W.A. Brock, 1982, Stochastic methods in economics and finance (Amsterdam: North-Holland). Malinvaud, E., 1966, Statistical methods of econometrics (Amsterdam: NorthHolland). Marquardt, D.W., 1963, An algorithm for least-squares estimation of nonlinear parameters, J. Soc. Indust. Appl. Math. 11, No. 2, 431-441. McKee, S. and A.R. Mitchell, 1970, Alternating direction methods for parabolic equations in two space dimensions with a mixed derivative, The Computer Journal 13, No. 1, 81-86. Merton, R . C , 1973, Theory of rational option pricing, Journal of Finance 29, 449-470. Phillips, P.C.B., 1972, The structural estimation of a stochastic differential equation system, Econometrica 40, 1021-1041. Richard, S., 1978, An arbitrage model of the term structure of interest rates, Journal of Financial Economics 6, No. 1, 33-57. Ross, S.A., 1976, The arbitrage theory of capital asset pricing, Journal of Economic Theory 13, 341-360. Schaefer, S.M. and E.S. Schwartz, 1983, A two factor model of the term structure: an approximate solution, unpublished paper. Varga, R.S., 1962, Matrix iterative analysis (Englewood Cliffs, N.J: PrenticeHall). Vasicek, O., 1977, An equilibrium characterization of the term structure, Journal of Financial Economics 5, 177-178. Vemuri, V . and W.J. Karplus, 1981, Digital computer treatment of partial differential equations (Englewood Cliffs, NJ: Prentice-Hall). 202 Bond and bond option data collected from the Wall Street Journal over the period of November 1, 1982 to October 31, 1983. calls Bb puts 6 bond/note o co 4 min max n I min I max I iV I options obs 13 112 120 265 11 112 120 198 24 463 1 0 i « , 1992 N o v 15n 102 i l 11 96 104 190 12 92 104 117 23 307 1 0 | « , 1993 Feb 15n 94^ 104 i f 8 92 108 84 5 96 104 24 13 108 1 0 i » , 1 9 9 3 M a y 15n 89^ 100£ 13 90 102 77 10 88 100 22 23 99 103|| 3 100 104 14 4 98 104 9 7 23 130if 47 112 132 1003 47 112 132 515 94 1518 38 86 104 780 34 86 104 375 72 1155 12 98 106 101 6 98 104 19 18 120 A u ; 15n , 2011 Nov 15 113 1 0 | « , 2012 Nov 15 I2t , 2013 A u g 15 total options total observations c max \ N 119^ 14* b min 131«, 1992 M a y 15n 111», 1993 a n total total K 99^ 105i| 129 145 2514 274 1279 Bonds followed by the letter V are actually treasury notes. Bond price in dollars per $100 principal value. n denotes the number of options that were written over the period on each bond. K is the option exercise price per $100 principal value. N is the number of option price observations collected for the bond. 3793 I Table B.2. Number of bond option observations for various bond option price levels. N pa a b b . calls puts 795 718 269 309 275 149 86 72 38 28 53 0-1 1-2 2-3 3-4 4-5 5-6 6-7 7-8 432 272 140 97 40 15 8-15 5 Range of bond option price in dollars per $100 principal value. Number of observations. 204 Table B.3. Bond option data by number of months to maturity. calls N a b puts 6 N P P o> 0-1 1-2 464 529 1.29 1.73 1.42 1.59 278 271 1.72 2.82 1.90 2.81 2-3 3-4 4-5 5-6 6-7 7-8 8-3 555 466 252 120 72 43 13 2.06 2.17 2.30 2.22 2.74 3.33 4.00 1.55 1.39 1.42 1.34 1.76 1.93 1.37 249 217 139 60 24 34 7 2.23 3.16 3.78 3.30 3.94 4.63 3.18 1.72 1.87 2.88 1.97 2.43 2.63 1.56 0-9 2514 1.95 1.56 1279 2.72 2.35 Number of 'months' to maturity of the option, where a 'month' is actually 30 days. eg. the first row summarizes all observations with 1 to 30 days to maturity. N denotes number of option observations. p is the average option price in dollars, o-p is the sample standard deviation of option prices. 205 Table B.4. Bond option data for in and out of the money options. calls B/K a 0.85-0.98 0.98-1.00 1.00-1.02 1.02-1.15 a b N P 704 687 637 486 0.73 1.36 2.19 4.24 puts 6 0.66 0.82 0.97 1.30 N P 270 293 331 385 6.08 2.60 1.91 1.16 Bond price, B, divided by option exercise price, K. N denotes number of option observations. p is the average option price in dollars. <7P is the sample standard deviation of option prices. 206 2.55 1.11 1.14 0.93 Table B.5. Treasury bill and treasury bill option data collected from the Wall Street Journal over the period of November 1, 1982 to October 31, 1983. | Ba 1 1 | min | 97.3095 a 6 | | calls" | puts 1 max |n | 97.9845 | 20 min | max | N | n | K min | max ! | | N 97.4722 | 98.2306 | 395 | 17 | 97.4722 | 98.2306 | 424 | total options 37 total obs 819 Treasury bill price in dollars per $100 principal value. n denotes the number of options that were written over the period on each bond. K is the option exercise price per $100 principal value. N is the number of option price observations collected for the bond. 207 Table B.6. Number of treasury bill option observations for various treasury bill option price levels. N p a 0.00-0.10 0.10-0.20 0.20-0.30 0.30-0.40 0.40-0.50 0.50-0.60 a b b calls puts 238 83 187 99 66 46 16 10 56 17 1 0 Range of treasury bill option price in dollars per $100 principal value. Number of observations. 208 Table B.7. Treasury bill option data by number of months to maturity. calls m a 6 a N puts 6 P <Tp N P <x p 0-1 1-2 2-3 3-4 4-5 5-6 6-7 7-8 8-9 62 79 80 84 39 16 11 16 8 0.11 0.09 0.10 0.12 0.12 0.14 0.18 0.14 0.17 0.10 0.08 0.09 0.10 0.08 0.12 0.11 0.08 0.08 93 77 77 86 30 22 15 14 10 0.16 0.12 0.13 0.18 0.22 0.21 0.28 0.17 0.21 0.14 0.10 0.11 0.14 0.13 0.12 0.15 0.11 0.17 0-9 395 0.11 0.09 424 0.16 0.13 Number of 'months' to maturity of the option, where a 'month' is actually 30 days. eg. the first row summarizes all observations with 1 to 30 days to maturity. N denotes number of option observations. p is the average option price in dollars. cr is the sample standard deviation of option prices. p 209 Table B.8. Treasury bill option data for in and out of the money options. calls B/K a 0.9915-0.9965 0.9965-0.9980 0.9980-0.9995 0.9995-1.0025 N 77 100 133 85 puts 6 P (Tp 0.03 0.06 0.11 0.25 0.03 0.04 0.04 0.06 N 123 102 119 80 P (T 0.32 0.14 0.10 0.05 0.10 0.07 0.08 0.04 ° Treasury bill price, B, divided by option exercise price, K. b N denotes number of option observations. p is the average option price in dollars. (Tp is the sample standard deviation of option prices. 210 p Table B.9. Analysis of the one-factor stochastic process examined by Ananthanarayanan, a namely, dr = m{fi — r)dt + <rr dw, dw ~ iV{0, dt}. Parameter values are a = 0.5, fx = 0.09517, m = 0.007162, cr = 0.008856, A i = 1/12. r - + At Xt a t 0.050 0.100 0.150 0.200 50.50 71.42 87.47 101.00 b t+At 0.049 0.098 0.148 0.198 T 50.51 71.42 87.46 100.98 t+At 0.051 0.102 0.152 0.202 T ° rt is the beginning of period instantaneous riskless rate. Xt is the corresponding beginning of period value for the homoscedastic process dx = m, -(fi (7 „ r)r" a 1 -aar a-1 - dt ~; dw, L a that is, x = r\ /<r{l — a). xt+At is tbe (simple linearization model) expected value of x at the end of the period, t a xt+At = x + t —{fi - r)rf - -oarrf~ l 2\/A7, At. t+At short rate corresponding to Xt+At — that is, the short rate corresponding to the x which is two (simple linearization model) standard deviations below the mean. t+At * * corresponding to x +At + 2\/A~i. r 13 r i S t n e s n o r r a e t 211 Table B.10. Analysis of the one-factor stochastic process examined by Ananthanarayanan, a namely, dr = m(fi — r)dt + <rr dw, dw ~ N{0, dt}. Parameter values are a = 0.5, H = 0.0012934, m = 0.0025221, a = 0.00083096, A i = 1/12. 7 Zt + At 0.050 0.100 0.150 0.200 538.19 761.11 932.17 1076.38 b 538.13 761.03 932.07 1076.27 t+At 0.050 0.100 0.150 0.200 r r t+At 0.050 0.100 0.150 0.200 rt is the beginning of period instantaneous riskless rate. Xt is the corresponding beginning of period value for the homoscedastic process <*x = m a —(n - r ) r - -cxer a-l dt + dw, a b that is, xt = r\ /<r(l — a). Xt+At is the (simple linearization model) expected value of x at the end of the period, _ m . . _„ 1 _ i At. xt+&t = xt + —{fi - r)r -aar? n t t+At * t corresponding to xt+&t — 2y/At, that is, the short rate corresponding to the x which is two (simple linearization model) standard deviations below the mean. t+At ° r t * corresponding to x +&t + 2\fAt. r 13 r 13 n es n o r s n r a e t 212 Simple linearization method estimates for the Brennan-Schwartz interest rate process parameters. period a a of + criM P total period (Oct 70 - Oct 82) 0.876 (0.266) 0.0546 (0.0373) 0.442 0.142 0.530 first half (Oct 70 - Oct 76) 0.461 (0.298) 0.0276 (0.0554) 0.381 0.141 0.425 second half (Oct 76 - Oct 82) 1.644 (0.467) 0.0792 (0.0481) 0.495 0.142 0.631 * Standard errors in parentheses. Table B.12. Analysis of the two-factor Brennan-Schwartz process. Parameter values are a = 0.876, o f + cr,A, = 0.0546, <J = 0.442, a = 0.142, p = 0.530, A t = r t+At r a 6 a x h+At Plt+At/Plt 02t + 1/12. At/02t b 0.0626 0.0964 3.1 1.2 0.0650 0.0934 2.3 1.1 0.0750 0.0929 0.9 1.0 0.0900 0.0962 -0.3 0.8 0.1000 0.1003 -0.8 0.7 0.1044 0.1050 -0.8 0.7 0.1000 0.1083 -0.2 0.8 0.0900 0.1091 0.7 1.0 0.0750 0.1061 2.2 1.2 0.0650 0.1005 3.1 1.2 (rt+At> h+At) is an end of period point with a (simple linearization model) M a halanobis distance of 2 from the (simple linearization model) expected end of period point. The expected end point and the Mahalanobis distance were calculated using the homoscedastic transformation of the process in r and /. The starting point was r = 0.08, / = 0.10. 0u and Pu+At are the beginning and end of period drifts in the first component of the homoscedastic system Pit = —(ft - n) - \a . r Pit and Pit+At are similar values for the second component: p2t = -<?i + A/ + i '-. 2 ai These columns show the ratios of the end of period to beginning of period drifts. 214 Table B.13. Simple linearization method estimates of the Brennan-Schwartz interest rate process parameters for three different drift assumptions. of + (TlXl ay 0.876 (0.266) 0.0546 (0.0373) 0.442 0.142 0.530 0.5 0.317 (0.276) 0.0259 (0.0377) 0.448 0.142 0.512 1.0 -0.257 (0.269) -0.0021 (0.0378) 0.444 0.143 0.508 r 0.0 a 6 a h P Indicates the simple linearization drift used: / = 0 for the beginning of period drift, / = 1 for end of period drift, and / = 0.5 for an average of the two. Standard errors in parentheses. 215 Table B.14. Patterns of missing data for 3 different portfolio formation schemes. scheme # 1 * a 6 scheme # 2 N/n y n 144 16.1 0-1 1-2 144 16.6 3 2-3 144 4 3-4 5 4-5 6 5-6 7 6-7 8 7-8 9 8-9 10 9-10 108 11 n/a n/a scheme # 3 N/n y 144 16.1 0-1 144 16.1 1-2 144 16.6 1-2 144 16.6 9.5 2-3 144 9.5 2-3 144 9.5 144 7.9 3-4 144 7.9 3-4 144 7.9 144 5.1 4-5 144 5.1 4-5 144 5.1 144 3.1 5-6 144 3.1 5-6 144 3.1 142 2.9 6-7 142 2.9 6-7 142 2.9 88 1.8 7-8 88 1.8 7-10 136 4.1 96 1.8 8-9 96 1.8 10-15 132* 5.3 2.1 9-10 108 2.1 15-20 117 3.2 n/a 10-20 132 8.1 n/a n/a n/a y n i 0-1 2 n N/n Portfolio number in the given portfolio formation scheme. Portfolio formation scheme. y denotes the number of years to maturity, ie. '0-1' indicates that all bonds with maturities from 0 to 1 year are put into the portfolio. n is the number of non-missing portfolio observations, ie. months in which there was at least one bond observation in the portfolio, n = 144 indicates no missing portfolio observations. N is the total number of bond observations for this portfolio over the entire test period. Therefore, N/n is the average number of bond observations per each portfolio observation. 216 Table B.15. Distance function values for various values of the market price of short rate risk, A , and the reversion coefficient, or, from the stochastic process for the short rate, r. Estimated using month-end data, from the period of October 1970 to October 1982. r parameters a -0.250 0.000 0.250 0.450 0.000 0.000 0.000 0.000 0.260 0.200 0.240 0.300 a b scheme #1" OLS HET scheme #2 GLS OLS HET scheme #3 GLS 0.800 39,100 248.0 26,300 54,400 291.0 47,700 0.800 9680 59.9 5710 16,000 77.4 11,100 0.800 2940* 27.6 2430 5040" 33.5 3550 0.800 8970 81.1 3860 9920 83.7 4180 0.600 7550 49.4 4800 13,100 64.8 9680 0.500 6570 46.4 4300 11,600 60.3 8890 0.400 6050 48.3 3790 10,300 60.3 8240 0.300 7040 62.5 4290 10,600 72.3 122,000 0.558 7560 68.9 3200 8660 71.9 39006 0.667 3530 33.56 2110" 5520 39.0 2840 0.840 3000 26.7 2160 5320 33.2* 3080 0.926 3070 28.4 2140 5180 34.3 2910 OLS HET GLS 56,900 275.0 21,100 19,400 80.6 9750 7420 38.8 4350 10,500 82.5 3170 16,600 69.5 9650 15,200 65.8 8890 13,900 66.5 7480 14,100 78.7 4930 9730 72.5 3030* 7670 43.6 3350 77006 38.2* 3770 7400 39.0 3520 Three different portfolio formation schemes. O L S , H E T ( = H E T E R O ) and G L S are the three covariance matrix assumptions. Minimum value of the distance function. 217 Table B.16. Minimum distance estimates of A and a for three portfolio formation schemes and three covariance matrix assumptions. r scheme #1" parameters a scheme #2 scheme #3 OLS HET GLS OLS HET GLS OLS HET GLS 0.250 0.240 0.200 0.250 0.240 0.200 0.300 0.240 0.260 0.800 0.840 0.667 0.800 0.840 0.667 0.926 0.840 0.558 Three different portfolio formation schemes. OLS, HET(=HETERO) and GLS are the three covariance matrix assumptions. 218 Table B.17. Bond option pricing errors (calculated for the three different interpolation methods) by number of months to maturity. linear* m a 0-1 1-2 2-3 3-4 4-5 5-6 6-7 •7-88-9 0-9 N 742 800 804 683 391 180 96 77 e c 0.47 0.86 0.69 0.61 0.74 0.72 0.46 cubic CRMS 20 0.49 0.32 1.72 1.84 1.73 1.53 1.77 1.51 1.47 1.61 1.90 3793 0.66 1.70 e 0.10 0.27 SRMS quadratic e CRMS 0.41 0.40 0.41 0.48 0.45 0.05 0.94 0.53 0.56 0.60 0.74 0.87 0.70 1.05 0.07 0.25 0.39 0.38 0.39 0.46 0.44 0.53 0.55 0.58 0.73 0.86 0.69 1.04 0.90 1.34 0.90 1.34 0.32 0.67 0.04 0.92 0.30 0.66 * m is the number of 'months' to maturity of the option, where a 'month' is actually 30 days, eg. the first row summarizes all observations with 1 to 30 days to maturity. N is the number of option observations. The three different interpolation methods. e is the average pricing error. ZRMS is the root mean square pricing error. b c 219 Table B.18. First derivative of bond option prices with respect to r (calculated using linear interpolation) by number of months to maturity. linear* ymin v ymax ma N 0-1 1-2 2-3 742 800 804 683 391 180 96 77 -67 -71 -62 -60 -45 -32 -38 -1.5 -3.0 -3.1 -1.6 -2.5 -0.6 2.5 60 45 33 -43, 20 -32 3.4 2.4 16 19 3793 -71 -2.1 60 3-4 4-5 5-6 6-7 7-8 8-9 0-9 r V r ' r 28 19 14 16 m is the number of 'months' to maturity of the option, where a 'month' is actually 30 days, eg. the first row summarizes all observations with 1 to 30 days to maturity. N is the number of option observations. Linear interpolation method used. c v , V and V are the minimum, average and maximum values, respectively, of the partial derivative of option price with respect to r. a 6 mia r r r m a x 220 Table B.19. First derivative of bond call option price with respect to / (for the three different interpolation methods) by number of months to maturity. linear m a N cubic 6 Ci quadrat ic C ' Ci m n Ci 0- 1 464 -1040 -440 -10 -1160 -420 60 -1140 -420 50 1- 2 529 -1080 -450 -20 -1120 -430 -7 -1110 -430 -2 2- 3 555 -1000 -440 -20 -1040 -430 -20 -1030 -430 -20 3- 4 466 -870 -430 -50 -890 -410 -30 -890 -420 -30 4- 5 252 -990 -410 -80 -1000 -390 -70 -1000 -390 -70 5- 6 120 -810 -390 -140 -810 -380 -110 -810 -380 -120 6- 7 72 -850 -420 -110 -860 -400 -100 -850 -400 -110 7- 8 43 -830 -410 -150 -830 -400 -130 -830 -400 -130 8- 9 13 -730 -550 -320 -730 -540 -300 -710 -530 -310 0-9 2514 -1080 -430 -10 -1160 -420 60 -1140 -420 50 ' m is the number of 'months' to maturity of the option, where a ''month' is actually 30 days, eg. the first row summarizes all observations with 1 to 30 days to maturity. N is the number of option observations. The three different interpolation methods. C i , Ci and C j are the minimum, average and maximum values, respectively, or the partial derivative of call option price with respect to /. m m m a x 221 Table B.20. First derivative of bond put option price with respect to / (for the three different interpolation methods) by number of months to maturity. linear* m a N pmin c cubic r Pi pmax l pmin Pi quadratic r r pmax pmia Pi l l r p max l 0- 1 1- 2 2- 3 3- 4 4- 5 5- 6 6- 7 7- 8 8- 9 278 271 249 217 139 60 24 34 7 20 70 90 130 120 140 160 220 220 460 470 420 460 460 410 430 430 440 1050 1000 960 930 930 780 810 800 530 -10 30 80 120 110 130 160 220 220 470 480 430 470 470 420 440 430 450 1180 1010 990 970 950 810 830 810 550 -50 40 70 130 110 130 150 220 220 470 480 430 470 470 420 440 430 440 1150 1010 980 960 950 800 830 820 540 0-9 1279 20 450 1050 -10 460 1180 -50 460 1150 m is the number of 'months' to maturity of the option, where a 'month' is actually 30 days, eg. the first row summarizes all observations with 1 to 30 days to maturity. N is the number of option observations. The three different interpolation methods. c p m i a ) pt a n ( j pmax a r e j n j m u n i ) average and maximum values, respectively, m ol the partial derivative of put option price with respect to /. a b 222 Table B.21. Adjusted first derivative of bond call option price with respect to / (for the three different interpolation methods) by number of months to maturity. linear m a N Qjaiu cubic 6 6 Qjma Ci Ci quadratic qmax Ci 0- 1 1- 2 2- 3 3- 4 4- 5 5- 6 6- 7 7- 8 8- 9 464 529 555 466 252 120 72 43 13 -1040 -1080 -1000 -870 -990 -810 -850 -830 -730 -440 -450 -440 -430 -410 -390 -420 -410 -550 -10 -20 -20 -50 -80 -140 -110 -150 -320 -1150 -1120 -1040 -890 -1000 -810 -860 -830 -730 -410 -420 -430 -410 -390 -380 -400 -400 -540 0.7 1.1 -20 -30 -70 -110 -100 -130 -300 -1140 -1110 -1030 -890 -1000 -810 -850 -830 -710 -420 -430 -430 -420 -390 -380 -400 -400 -530 -1.6 -10 -20 -30 -70 -120 -110 -130 -310 0-9 2514 -1080 -430 -10 -1150 -410 1.1 -1140 -420 -1.6 ° m is the number of 'months' to maturity of the option, where a 'month' is actually 30 days, eg. the first row summarizes all observations with 1 to 30 days to maturity. N is the number of option observations. * The three different interpolation methods. C. , Ci and C , are the minimum, average and maximum values, respectively, of the partial derivative of call option price with respect to /. e m m m a x 223 Table B.22. Adjusted first derivative of bond put option price with respect to / (for the three different interpolation methods) by number of months to maturity. 6 linear pmin N b c •max Pi r l r Pi l quadratic pmax M pmin r l Pi pmax M 0- 1 278 20 460 1050 -3.7 360 1010 10 470 1110 1- 2 271 70 470 1000 -10 400 1010 40 480 1010 2- 3 90 420 960 80 -10 410 860 70 430 980 3- 4 130 460 930 120 -20 470 970 130 470 960 4- 5 139 120 460 930 -30 440 890 110 470 940 5- 6 60 140 410 780 130 420 810 130 420 800 6- 7 24 160 430 810 160 440 830 150 440 830 7- 8 34 229 430 800 -30 410 810 220 430 820 8- 9 7 220 440 530 220 450 550 220 440 540 1279 20 450 1050 -30 410 1010 10 460 1010 0-9 a c cubic m is the number of 'months' to maturity of the option, where a 'month' is actually 30 days, eg. thefirstrow summarizes all observations with 1 to 30 days to maturity. N is the number of option observations. The three different interpolation methods. pmin^ pmaxa r e ^ e j j average and maximum values, respectively, of the partial derivative of put option price with respect to /. m n m u n i j 224 Table B.23. Brennan-Schwartz bond option pricing errors by number of months to maturity. calls m a 6 a N 0-1 1-2 2-3 3-4 4-5 5-6 6-7 7-8 464 529 555 466 252 120 72 43 8-9 0-9 e puts 6 &RMS N 0.49 0.57 0.63 0.75 0.81 0.71 1.08 0.90 13 0.09 0.31 0.43 0.40 0.41 0.51 0.38 0.00 0.71 0.94 278 271 249 217 139 60 24 34 7 2514 0.33 0.67 1279 e 0.14 0.20 0.36 0.40 CRMS 1.34 0.60 0.54 0.52 0.73 0.98 0.68 0.95 0.90 1.93 0.30 0.68 0.42 0.41 0.66 0.11 Number of 'months' to maturity of the option, where a 'month' is actually 30 days. eg. The first row summarizes all observations with 1 to 30 days to maturity. N denotes the number of option price observations, e is the average pricing error in dollars. CRMs is the root mean square pricing error. 225 Table B.24. Brennan-Schwartz bond option pricing errors for in and out of the money options. calls* B/K a 0.85-0.98 0.98-1.00 1.00-1.02 1.02-1.15 a b N 704 687 637 486 e 0.18 0.36 0.44 0.38 puts CRMS 0.55 0.64 0>7 0.71 N e 270 293 331 385 0.38 0.36 0.31 0.18 Bond price, B, divided by option exercise price, K. N denotes the number of option price observations. e is the average pricing error in dollars. RMS is the root mean square pricing error. S 226 CRMS 0.79 0.68 0.69 0.59 Table B.25. Black-Scholes bond option pricing errors by number of months to maturity. calls m a b e CRMS N- e CRMS 555 466 252 120 0.15 0.51 0.71 0.70 0.75 0.90 0.50 0.74 0.88 1.19 1.06 1.05 278 271 249 217 0.16 0.37 0.59 0.75 139 60 0.83 0.94 0.68 0.43 0.90 1.27 0.95 24 34 8-9 72 43 13 1.12 7 1.21 0.88 1.72 0.60 0.66 0.72 0.96 1.19 1.14 1.33 1.28 2.07 0-9 2514 0.57 0.86 1279 0.55 0.86 0-1 1-2 2-3 3-4 4-5 5-6 6-7 7-8 a N puts 6 464 529 Number of 'months' to maturity of the option, where a 'month' is actually 30 days. eg. The first row summarizes all observations with 1 to 30 days to maturity. N denotes the number of option price observations, e is the average pricing error in dollars. CRMS is the root mean square pricing error. 227 T a b l e B.26. Black-Scholes b o n d option pricing errors for in a n d out of the money options. calls B/K a 6 a N e 0.85-0.98 704 0.37 0.98-1.00 687 1.00-1.02 1.02-1.15 puts 6 N e 0.69 270 0.53 0.89 0.60 0.84 293 0.60 0.87 637 0.75 1.00 331 0.60 0.88 486 0.59 0.88 385 0.48 0.82 CRMS Bond price, B, divided by option exercise price, K. N denotes the number of option price observations. e is the average pricing error in dollars. CRMS is the root mean square pricing error. 228 CRMS Table B.27. Brennan-Schwartz bond option pricing errors (using 'in-sample' variance estimates) by number of months to maturity. calls m a 6 a e N 0-1 1-2 2-3 3-4 4-5 5-6 6-7 464 529 555 466 252 120 72 7-8 8-9 43 0-9 puts 6 CRMS 0.48 0.49 0.50 0.65 0.71 0.56 0.99 13 0.02 0.16 0.23 0.17 0.15 0.23 0.11 -0.31 0.39 2514 0.15 N e CRMS 1.00 0.72 278 271 249 217 139 60 24 34 7 0.08 0.07 0.19 0.19 0.18 0.15 0.39 -0.20 1.05 0.59 0.50 0.42 0.65 0.91 0.57 0.80 0.91 1.75 0.58 1279 0.14 0.62 Number of 'months' to maturity of the option, where a 'month' is actually 30 days. eg. The first row summarizes all observations with 1 to 30 days to maturity. N denotes the number of option price observations. e is the average pricing error in dollars. ZRMS is the- root mean square pricing error. 229 Table B.28. Brennan-Schwartz bond option pricing errors (using 'in-sample' variance esti' mates) for in and out of the money options. calls* B/K a 0.85-0.98 0.98-1.00 1.00-1.02 1.02-1.15 N e 704 687 637 486 0.03 0.16 0.22 0.20 puts CRMS 0.50 0.54 0.67 0.62 N e 270 293 331 385 0.26 0.20 0.13 0.01 Bond price, B, divided by option exercise price, K. * N denotes the number of option price observations. e is the average pricing error in dollars. CRMS is the root mean square pricing error. a 230 CRMS 0.72 0.60 0.63 0.56 Table B.29. Brennan-Schwartz treasury bill option pricing errors by number of months to maturity. calls puts 6 e CRMS N 62 79 80 84 39 16 11 16 0.004 0.029 0.050 0.051 0.072 0.078 0.047 0.072 8 0.065 0.022 0.044 0.068 0.063 0.086 0.097 0.057 0.085 0.089 93 77 77 86 30 22 15 14 10 0.043 0.062 m° N 0-1 1-2, 2-3 3-4 4-5 5-6 6-7 7-8 8-9 0-9 395 424 e CRMS 0.013 0.037 0.042 0.033 0.024 0.038 0.001 0.054 0.057 0.023 0.049 0.053 0.061 0.061 0.058 0.091 0.082 0.065 0.031 0.053 * Number of 'months' to maturity of the option, where a 'month' is actually 30 days. eg. Thefirstrow summarizes all observations with 1 to 30 days to maturity. 6 N denotes the number of option price observations. e is the average pricing error in dollars. CRMS is the root mean square pricing error. 231 T a b l e B.30. Brennan-Schwartz treasury bill option pricing errors for in a n d out of the money options. calls B/K a N puts 6 e CRMS N e CRMS 0.9915-0.9965 77 0.024 0.041 123 0.009 0.042 0.9965-0.9980 100 0.057 0.072 102 0.040 0.057 0.9980-0.9995 133 0.052 0.071 119 0.037 0.059 0.9995-1.0025 85 0.033 0.050 80 0.044 0.054 ° Treasury bill price, B, divided by option exercise price, K. 6 N denotes the number of option price observations. e is the average pricing error in dollars. CRMS is the root mean square pricing error. 232 Table B.31. Brennan-Schwartz treasury bill option pricing errors (using 'in-sample variance estimates) by number of months to maturity. 5 calls a e CRMS N -0.020 -0.019 -0.020 -0.024 -0.028 -0.025 -0.065 0.027 0.030 0.040 3-4 4-5 5-6 6-7 62 79 80 84 39 16 11 7-8 8-9 16 8 -0.052 -0.067 0.037 0.044 0.047 0.073 0.067 0.087 0-9 395 -0.025 0.041 m 0-1 1-2 2-3 a 6 puts 6 N e CRMS 93 77 77 86 30 22 15 -0.002 0.000 -0.010 -0.021 -0.040 -0.031 -0.074 0.015 0.033 0.032 0.052 0.065 0.053 0.113 14 10 -0.035 -0.029 0.068 0.034 424 -0.015 0.045 Number of 'months' to maturity of the option, where a 'month' is actually 30 days. eg. The first row summarizes all observations with 1 to 30 days to maturity. N denotes the number of option price observations, e is the average pricing error in dollars. CRMS is the root mean square pricing error. 233 Table B.32. Brennan-Schwartz treasury bill option pricing errors (using 'in-sample' variance estimates) for in and out of the money options. calls N e 0.9915-0.9965 0.9965-0.9980 77 100 0.9980-0.9995 0.9995-1.0025 133 85 -0.024 -0.028 -0.028 -0.026 B/K a b a puts 6 CRMS 0.036 0.037 0.049 0.034 N e 123 102 119 80 -0.009 -0.009 -0.023 -0.022 Treasury bill price, B, divided by option exercise price, K. N denotes the number of option price observations. e is the average pricing error in dollars. CRMS is the root mean square pricing error. 234 CRMS 0.045 0.042 0.054 0.033 Table B.33. Bond option arbitrage strategy number 1. Comparison of Brennan-Schwartz and Black-Scholes average cumulative dollar returns for different levels of commissions. commission* option 0% 3 0 3 ° b 0 bond 0% 0 1/4 1/4 Brennan-Schwartz n 248 248 248 248 Black-Scholes 6 difference* 1 da td n dc td dc 0.771 0.266 -0.578 6.6 2.5 -5.2 -8.8 248 248 248 248 0.447 0.022 -0.409 -0.834 3.9 0.2 -3.9 -7.8 0.32 0.24 -0.17 -1.083 -0.25 td 2.0 1.6 -1.1 -1.5 Commissions charged for trading in options and bonds to rebalance the zeroinvestment arbitrage portfolio. Note that bonds are bought at the ask price quoted in the Wall Street Journal and sold at the bid price. n denotes the number of options. dc is the average cumulative dollar arbitrage return. td is the i-statistic of dc. Difference between Brennan-Schwartz and Black-Scholes cumulative returns. 235 Table B.34. Bond option arbitrage strategy number 1. Brennan-Schwartz (using 'in-sample' variances) average cumulative dollar returns for different levels of commissions. Brennan-Schwartz commission a 6 0 6 ('in-sample' variance) option bond n dc 0% 3 0 3 0% 0 1/4 1/4 248 248 248 248 1.113 0.577 -0.356 -0.892 U 9.5 5.5 -0.3 -7.3 Commissions charged for trading in options and bonds to rebalance the zeroinvestment arbitrage portfolio. Note that bonds are bought at the ask price quoted in the Wall Street Journal and sold at the bid price. n denotes the number of options. dc is the average cumulative dollar arbitrage return. td is the f-statistic of dc . 236 Table B.35. Bond option arbitrage strategy number 2. Comparison of Brennan-Schwartz and Black-Scholes average (non-cumulative) dollar returns for various filter levels. Brennan- Schwart z r 0.00 0.30 0.50 0.70 1.00 a b e b Black-Scholes difference 0 N d td N d td d td 3510 2140 1467 959 425 0.092 0.139 0.181 0.237 0.379 11.1 12.2 11.9 11.3 9.4 3504 2552 2015 1550 928 0.056 0.080 0.093 0.103 0.154 6.7 7.8 8.0 7.3 7.6 0.04 0.06 0.09 3.0 3.9 4.6 4.1 5.0 0.13 0.22 Filter level in dollars. The zero-investment arbitrage portfolio was only formed if theoretical and market option prices differed by at least the filter amount. Note that no commissions are charged and bonds are bought and sold at the middle of the bid-ask spread quoted in the Wall Street Journal. N denotes the number of observations. d is the average (non-cumulative) dollar arbitrage return. td is the i-statistic for d. Difference between the Brennan-Schwartz and Black-Scholes returns. 237 Table B.36. Bond option arbitrage strategy number 2. Brennan-Schwartz (using 'in-sample' variances) average (non-cumulative) dollar returns for various filter levels. Brennan-Schwartz 6 ('in-sample' variance) r 0.00 0.30 0.50 0.70 1.00 ° 6 N d 3511 1826 1159 695 285 0.119 0.191 0.240 0.342 0.565 td 14.6 14.9 13.2 12.5 10.2 Filter level in dollars. The zero-investment arbitrage portfolio was only formed if theoretical and market option prices differed by at least the filter amount. Note that no commissions are charged and bonds are bought and sold at the middle of the bid-ask spread quoted in the Wall Street Journal. N denotes the number of observations. d is the average (non-cumulative) dollar arbitrage return. td is the i-statistic for d. 238 Table B.37. Bond option arbitrage strategy number 2. Comparison of Brennan-Schwartz and Black-Scholes average (non-cumulative) dollar returns for different levels of commissions. commission" Brennan-Schwartz Black-Scholes 6 option bond N d 0% 3 0 3 0% 0 1/4 1/4 3510 3510 3510 3510 0.059 0.030 -0.022 -0.051 td 7.5 4.0 -2.9 -6.6 N d 3504 3504 3504 3504 0.035 0.012 -0.016 -0.039 td 4.4 1.5 -2.0 -5.1 difference 11 d. 0.02 0.02 -0.01 -0.01 td 2.1 1.7 -0.6 -1.1 ° Commissions charged for trading in options and bonds to rebalance the zeroinvestment arbitrage portfolio. Note that bonds are bought at the ask price quoted in the Wall Street Journal and sold at the bid price. 6 N denotes the number of observations. d is the average (non-cumulative) dollar arbitrage return. td is the i-statistic for d. Difference between the Brennan-Schwartz and Black-Scholes returns. e 239 Table B.38. Bond option arbitrage strategy number 2. Brennan-Schwartz (using 'in-sample' variances) average (non-cumulative) dollar returns for different levels of commissions. Brennan-Schwartz commission" option 0% 3 0 3 a b bond 6 ('in-sample' variance) N d td 0 3511 3511 10.7 7.0 1/4 1/4 3511 3511 0.083 0.052 -0.007 -0.038 0% -0.9 -5.0 Commissions charged for trading in options and bonds to rebalance the zeroinvestment arbitrage portfolio. Note that bonds are bought at the ask price quoted in the Wall Street Journal and sold at the bid price. N denotes the number of observations. d is the average (non-cumulative) dollar arbitrage return. td is the ^-statistic for d. 240 Table B.39. Treasury bill option arbitrage strategy number 1. Brennan-Schwartz average cumulative dollar returns for different levels of commissions. commission" option 6 n dc 6 td 0% 0% 31 0.1521 1.0 3 0 31 0.1103 0.7 0 1/4 31 -0.8370 -3.4 1/4 31 -0.8789 -3.4 3 ° tbill Brennan-Schwartz Commissions charged for trading in options and treasury bills to rebalance the zero-investment arbitrage portfolio. Note that treasury bills are bought at the ask price quoted in the Wall Street Journal and sold at the bid price. n denotes the number of options. dc is the average cumulative dollar arbitrage return. td is the ^-statistic of dc . 241 Table B.40. Treasury bill option arbitrage strategy number 1. Brennan-Schwartz (using 'insample' variances) average cumulative dollar returns for different levels of commissions. Brennan-Schwartz* commission option 0% 3 0 3 0 tbill 0% 0 1/4 1/4 ('in-sample' variance) n 30 30 30 30 dc 0.2027 0.1572 -1.1399 -1.1854 td 1.1 0.8 -3.7 -3.8 ° Commissions charged for trading in options and treasury bills to rebalance the zero-investment arbitrage portfolio. Note that treasury bills are bought at the ask price quoted in the Wall Street Journal and sold at the bid price. * n denotes the number of options. is the average cumulative td is the J-statistic of de. dc dollar arbitrage return. 242 Table B.41. Treasury bill option arbitrage strategy number 2. Brennan-Schwartz average (noncumulative) dollar returns for various filter levels. Brennan- Schwartz /• 0.00 0.03 0.05 0.07 N d td 777 477 301 176 0.0083 0.0166 0.0281 0.0465 2.1 3.0 3.5 4.1 6 ° Filter level in dollars. The zero-investment arbitrage portfolio was only formed if theoretical and market option prices differed by at least the filter amount. Note that no commissions are charged and treasury bills and bonds are bought and sold at the middle of the bid-ask spread quoted in the Wall Street Journal. b N denotes the number of observations. d is the average (non-cumulative) dollar arbitrage return. tj, is the t-statistic for d. 243 Table B.42. Treasury bill option arbitrage strategy number 2. Brennan-Schwartz ('in-sample' variance) average (non-cumulative) dollar returns for various filter levels. Brennan-Schwartz* ('in-sample' variance) r 0.00 0.03 0.05 0.07 N d 738 274 122 79 0.0112 0.0220 2.4 2.2 0.0485 0.0788 2.4 2.7 td Filter level in dollars. The zero-investment arbitrage portfolio was only formed if theoretical and market option prices differed by at least the filter amount. Note that no commissions are charged and treasury bills and bonds are bought and sold at the middle of the bid-ask spread quoted in the Wall Street Journal. * N denotes the number of observations. d is the average (non-cumulative) dollar arbitrage return. td is the ^-statistic for d. a 244 Table B.43. Treasury bill option arbitrage strategy number 2. Brennan-Schwartz average (non cumulative) dollar returns for different levels of commissions. commisions* option 0% 3 0 3 a 6 tbill 0% 0 1/4 1/4 Brennan-Schwartz 6 N d td 777 777 777 777 0.0063 0.0048 -0.0297 -0.0311 1.6 1.2 -6.3 -6.5 Commissions charged for trading in options and treasury bills to rebalance the zero-investment arbitrage portfolio. Note that treasury bills are bought at the ask price quoted in the Wall Street Journal and sold at the bid price, N denotes the number of observations. d is the average (non-cumulative) dollar arbitrage return. td is the i-statistic for d. 245 Table B.44. Treasury bill option arbitrage strategy number 2. Brennan-Schwartz ('in-sample' variance) average (non-cumulative) dollar returns for different levels of commissions. Brennan-Schwartz* commisions option 0% 3 0 3 a 6 a ('in-sample' variance) tbill N d td 0% 0 1/4 1/4 738 0.0085 1.8 738 738 738 0.0069 -0.0422 -0.0438 1.5 -7.5 -7.6 Commissions charged for trading in options and treasury bills to rebalance the zero-investment arbitrage portfolio. Note that treasury bills are bought at the ask price quoted in the Wall Street Journal and sold at the bid price. N denotes the number of observations. d is the average (non-cumulative) dollar arbitrage return. td is the ^-statistic for d. 246
- Library Home /
- Search Collections /
- Open Collections /
- Browse Collections /
- UBC Theses and Dissertations /
- Two topics in Finance: 1. Welfare aspects of an asymmetric...
Open Collections
UBC Theses and Dissertations
Featured Collection
UBC Theses and Dissertations
Two topics in Finance: 1. Welfare aspects of an asymmetric information rational expectations model :… Dietrich-Campbell, Bruce John 1985
pdf
Page Metadata
Item Metadata
Title | Two topics in Finance: 1. Welfare aspects of an asymmetric information rational expectations model : 2. Bond option pricing, empirical evidence |
Creator |
Dietrich-Campbell, Bruce John |
Publisher | University of British Columbia |
Date Issued | 1985 |
Description | In part 1 of this study I examine several models of competitive markets in which a group of uninformed traders uses the equilibrium price of a traded asset as an indirect source of information known to a group of informed traders. Four different models are compared in two homogeneous information cases plus one asymmetric information case, revealing a) an allocative efficiency benefit resulting from the opportunity to trade current consumption for future consumption, b) a 'dealer' benefit accruing to traders who are able to observe and act on demand fluctuations not apparent to other traders, c) a 'hedging' benefit accruing to all traders, and d) a loss of hedging benefits due to information dissemination before hedge trading can take place. The effect of an increase in precision of information given to informed traders is calculated for the above factors and for net welfare. In part 2, a two-factor model using the instantaneous rate of interest and the return on a consol bond to describe the term structure of interest rates - the Brennan-Schwartz model - is used to derive theoretical prices for American call and put options on U.S. government bonds and treasury bills. These model prices are then compared with market prices. The theoretical model used to value the debt options also provides hedge ratios which may be used to construct zero-investment portfolios which, in theory, are perfectly riskless. Several trading strategies based on these 'riskless' portfolios are examined. |
Subject |
Securities Government securities -- United States Options (Finance) -- United States |
Genre |
Thesis/Dissertation |
Type |
Text |
Language | eng |
Date Available | 2010-06-11 |
Provider | Vancouver : University of British Columbia Library |
Rights | For non-commercial purposes only, such as research, private study and education. Additional conditions apply, see Terms of Use https://open.library.ubc.ca/terms_of_use. |
DOI | 10.14288/1.0096540 |
URI | http://hdl.handle.net/2429/25565 |
Degree |
Doctor of Philosophy - PhD |
Program |
Business Administration |
Affiliation |
Business, Sauder School of |
Degree Grantor | University of British Columbia |
Campus |
UBCV |
Scholarly Level | Graduate |
Aggregated Source Repository | DSpace |
Download
- Media
- 831-UBC_1985_A1 D53.pdf [ 10.6MB ]
- Metadata
- JSON: 831-1.0096540.json
- JSON-LD: 831-1.0096540-ld.json
- RDF/XML (Pretty): 831-1.0096540-rdf.xml
- RDF/JSON: 831-1.0096540-rdf.json
- Turtle: 831-1.0096540-turtle.txt
- N-Triples: 831-1.0096540-rdf-ntriples.txt
- Original Record: 831-1.0096540-source.json
- Full Text
- 831-1.0096540-fulltext.txt
- Citation
- 831-1.0096540.ris
Full Text
Cite
Citation Scheme:
Usage Statistics
Share
Embed
Customize your widget with the following options, then copy and paste the code below into the HTML
of your page to embed this item in your website.
<div id="ubcOpenCollectionsWidgetDisplay">
<script id="ubcOpenCollectionsWidget"
src="{[{embed.src}]}"
data-item="{[{embed.item}]}"
data-collection="{[{embed.collection}]}"
data-metadata="{[{embed.showMetadata}]}"
data-width="{[{embed.width}]}"
async >
</script>
</div>
Our image viewer uses the IIIF 2.0 standard.
To load this item in other compatible viewers, use this url:
http://iiif.library.ubc.ca/presentation/dsp.831.1-0096540/manifest