Open Collections

UBC Theses and Dissertations

UBC Theses Logo

UBC Theses and Dissertations

Comparison of fitting techniques for systems of forestry equations LeMay, Valerie 1988

Your browser doesn't seem to have a PDF viewer, please download the PDF to view this item.

Item Metadata

Download

Media
831-UBC_1989_A1 L45.pdf [ 9.2MB ]
Metadata
JSON: 831-1.0075305.json
JSON-LD: 831-1.0075305-ld.json
RDF/XML (Pretty): 831-1.0075305-rdf.xml
RDF/JSON: 831-1.0075305-rdf.json
Turtle: 831-1.0075305-turtle.txt
N-Triples: 831-1.0075305-rdf-ntriples.txt
Original Record: 831-1.0075305-source.json
Full Text
831-1.0075305-fulltext.txt
Citation
831-1.0075305.ris

Full Text

C O M P A R I S O N OF FITTING TECHNIQUES F O R SYSTEMS OF FORESTRY  EQUATIONS By  Valerie LeMay B. Sc. (Forestry). University of Alberta, 1981 M . Sc. (Forestry). University of Alberta,  1982  A THESIS SUBMITTED IN PARTIAL FULFILLMENT OF THE REQUIREMENTS  FOR THE DEGREE OF  DOCTOR OF PHILOSOPHY  in THE FACULTY OF GRADUATE STUDIES DEPARTMENT OF FORESTRY  We accept this thesis as conforming to the required standard  THE UNIVERSITY OF BRITISH COLUMBIA December  1988  © Valerie LeMay, 1988  in  presenting  degree freely  at  this  the  available  copying  of  department publication  of  in  partial  fulfilment  of  the  University  of  British  Columbia,  I  agree  for  this or  thesis  reference  thesis by  this  for  his thesis  and  scholarly  or for  her  of  T h e U n i v e r s i t y of British Vancouver, Canada  Date  DE-6 (2/88)  Columbia  I  further  purposes  gain  shall  that  agree  may  representatives.  financial  permission.  Department  study.  requirements  It not  be  that  the  Library  permission  granted  is  by  understood be  for  allowed  an  advanced  shall for  the that  without  make  it  extensive  head  of  my  copying  or  my  written  Abstract  In order to describe forestry problems, a system of equations is commonly used. The chosen system may be simultaneous, i n that a variable which appears on the left hand side of an equation also appears on the right hand side of another equation in the system. Also, the error terms among equations of the system may be contemporaneously correlated, and error terms within individual equations may be non-iid in that they may be dependent (serially correlated) or not identically distributed (heteroskedastic) or both. Ideally, the fitting technique used to fit systems of equations should be simple; estimates of coefficients and their associated variances should be unbiased, or at least consistent, and efficient: small and large sample properties of the estimates should be known; and logical compatibility should be present in the fitted system. The first objective of this research was to find a fitting technique from the literature which meets the desired criteria for simultaneous, contemporaneously correlated systems of equations, in which the error terms for individual equations are non-iid. This objective was not met i n that no technique was found in the literature which satisfies the desired criteria for a system of equations with this error structure. However, information from the literature was used to derive a new fitting technique as part of this research project, and labelled multistage least squares ( M S L S ) . The M S L S technique is an extension of three stage least squares from econometrics research, and can be used to find consistent and asymptotically efficient estimates of coefficients, and confidence limits can also be calculated for large sample sizes.  For small sample sizes, an iterative routine labelled  iterated multistage least squares ( I M S L S ) was derived. The second objective was to compare this technique to the commonly used techniques  ii  of using ordinary least squares (simple or multiple linear regression and nonlinear least squares regresion), and of substituting all of the equations into a composite model and using ordinary least squares to fit the composite model.  The three techniques were  applied to three forestry problems for which a system of equations is used. The criteria for comparing the results included comparing goodness-of-fit measures (Fit Index, Mean Absolute Deviation, Mean Deviation), comparing the traces of the estimated coefficient co variance matrices, and calculating a summed rank, based on the presence or absence of desired properties of the estimates. The comparison indicated that O L S results i n the best goodness-of-fit measures for all three forestry- problems; however, estimates of coefficients are biased and inconsistent for simultaneous systems. Also, the estimated coefficient covariance matrix cannot be used to calculate confidence intervals for the true parameters, or to test hypothesis statements. Finally, compatibility among equations is not assured. The fit.of the composite model was attractive for the systems tested; however, only one left hand side variable was estimated, and, for larger systems with more variables and more equations, this technique may not be appropriate. The M S L S technique resulted in goodness-of-fit measures which were close to the O L S goodness-of-fit measures. Of most importance, however, is that the M S L S fit ensures compatibility among equations, estimates of coefficients and their variances are consistent, estimates are asymptotically efficient, and confidence limits can be calculated for large sample sizes using the estimated variances and probabilities from the normal distribution.  Also, the number and difficulty of steps required for the M S L S technique  were similar to the O L S fit of individual equations.  The main disadvantage to using  the M S L S technique is that a large amount of computer memory is required; for some forestry problems with very large sample sizes, the use of a subsample or the exclusion of the final step of the M S L S fit were suggested. This would result i n some loss of efficiency, but estimated coefficients and their variances would be consistent. iii  Table of Contents  Abstract  ii  List of Tables  ix  List of Figures  xi  Acknowledgement  xii  1  Introduction  1  2  Previous Attempts at Fitting Forestry Equation Systems  6  2.1  Ordinary Least Squares for Individual Equations  6  2.2  Compatible Systems using Substitution  15  2.3  M i n i m u m Loss Function  17  2.4  Econometric Methods Based on the Assumption of i i d Error Terms  2.5  Econometric Methods without the Assumption of i i d Error Terms  2.6  Discussion of Previous Approaches  3  . . .  19  . . . .  25 29  A n Alternative Simultaneous Fitting Procedure  33  3.1  Extension of Econometric Methods to Multistage Least Squares  33  3.2  Estimation of the Error Covariance M a t r i x  41  3.2.1  Autocorrelation and Contemporaneous Correlation  41  3.2.2  Heteroskedasticity and Contemporaneous Correlation  47  iv  3.2.3  Autocorrelation, Heteroskedasticity, and Contemporaneous Correlation  3.3  4  5  50  Confidence Limits, Hypothesis Testing, and Prediction  54  Procedures for Comparison of Fitting Techniques  59  4.1  Selection of Equations for the Systems  .  4.2  Obtaining the O L S , Composite, and M S L S Fits  60  4.3  Tests for Heteroskedasticity and Serial Correlation  62  4.4  Criteria for Comparison of the Three Techniques  64  4.4.1  Goodness-of-fit Measures  64  4.4.2  Trace of the Estimated Coefficient Covariance M a t r i x  66  4.4.3  Table of Estimated Coefficients and Standard Deviations  67  4.4.4  Ranking for Other Features  : . . . .  59  67  Application 1: Tree Volume Estimation  69  5.1  Introduction  69  5.2  Preparation of D a t a  69  5.3  Model Selection  72  5.4  Ordinary Least Squares Fit  80  5.4.1  Unweighted Simple or Multiple Linear Regression  80  5.4.2  Testing for i i d Error Terms  80  5.4.3  Estimating the Error Covariance M a t r i x of Each Equation  5.4.4  Appropriate O L S Fit Based on Error Structure  5.5  . . . .  82 84  Composite Model Fit  84  5.5.1  Derivation of the Composite Model  84  5.5.2  Unweighted Regression of Composite Model  85  5.5.3  Testing for iid Error Terms  86 v  5.6  5.7  5.8 6  5.5.4  Estimating the Variance of the Error Terms  86  5.5.5  Weighted Regression of the Composite M o d e l  87  M S L S Fit  88  5.6.1  First Stage Equations  88  5.6.2  Second Stage Equations  89  5.6.3  Testing for i i d Error Terms  89  5.6.4  Estimation of the Error Covariance M a t r i x  5.6.5  E G L S to Fit the System of Equations  .  90 92  Comparison of the Three F i t t i n g Techniques  93  5.7.1  Goodness-of-fit Measures  93  5.7.2  Relative Variances  97  5.7.3  Table of Estimated Coefficients and Standard Deviations  98  5.7.4  Ranking for Other Features  100  Conclusion  102  Application 2: Estimation of Tree Diameter Distribution  105  6.1  Introduction  105  6.2  Preparation of Data  107  6.3  Model Selection  110  6.4  Ordinary Least Squares F i t  6.5  . .  113  6.4.1  Unweighted Simple or Multiple Linear Regression  113  6.4.2  Testing for i i d Error Terms  113  6.4.3  Estimating the Error Covariance M a t r i x of Each Equation  6.4.4  Appropriate O L S Fit Based on Error Structure  . . . .  114 116  Composite Model Fit  116  6.5.1  116  Derivation of the Composite Model  vi  6.6  6.7  6.8 7  6.5.2  Unweighted Regression of Composite Model  117  6.5.3  Testing for i i d Error Terms and Weighted Regression  118  M S L S Fit  119  6.6.1  First Stage Equations  119  6.6.2  Second Stage Equations  119  6.6.3  Testing for i i d Error Terms  120  6.6.4  Estimation of the Error Covariance M a t r i x  121  6.6.5  E G L S to Fit the System of Equations  123  Comparison of the Three Fitting Techniques  124  6.7.1  Goodness-of-fit Measures  124  6.7.2  Relative Variances  129  6.7.3  Table of Estimated Coefficients and Standard Deviations  130  6.7.4  Ranking for Other Features  132  Conclusion  133  Application 3: Volume G r o w t h and Yield  136  7.1  Introduction  136  7.2  Data Preparation  137  7.3  Model Selection  138  7.4  Ordinary Least Squares Fit  139  7.4.1  Unweighted Simple or Multiple Linear Regression  139  7.4.2  Testing for i i d Error Terms  139  7.4.3  Appropriate O L S Fit Based on Error Structure  140  7.5  Composite Model Fit  .  140  7.5.1  Derivation of the Composite Model  140  7.5.2  Unweighted Regression of the Composite Model  141  vii  7.6  7.7  7.8  7.5.3  Testing for i i d Error Terms  141  7.5.4  Weighted F i t of the Composite M o d e l  142  M S L S Fit  142  7.6.1  First Stage Equations  142  7.6.2  Second Stage Equations  143  7.6.3  Testing for i i d Error Terms  143  7.6.4  Estimation of the Error Covariance M a t r i x  144  7.6.5  E G L S to F i t the System of Equations  144  Comparison of the Three Fitting Techniques  145  7.7.1  Goodness-of-fit Measures  145  7.7.2  Relative Variances  148  7.7.3  Table of Estimated Coefficients and Standard Deviations  150  7.7.4  Ranking for the Three Techniques  150  Conclusion  152  8  Overall Discussion and Conclusions  155  9  References Cited  160  A  Glossary of Terms and Abbreviations  170  vm  List of Tables  2.1  Simultaneous Estimation Assuming i i d Error Terms  21  5.2  Distribution of Selected Trees b}' Height and D b h Classes  71  5.3  D i s t r i b u t i o n of 100 S a m p l e Trees  73  5.4  F i t Indices for O L S and M S L S Fits of the Volume Equation System . . .  93  5.5  M . A . D . for Five Classes for the O L S F i t of the Volume Equation System  94  5.6  M . A . D . for Five Classes for the M S L S F i t of the Volume Equation System  95  5.7  M . A . D . for Merchantable Volume  95  5.8  M . D . for Five Classes for the O L S F i t of the Volume Equation System  .  96  5.9  M . D . for Five Classes for the M S L S F i t of the Volume Equation System .  96  5.10 M . D . for Merchantable Volume  97  5.11 Trace of the Coefficient Covariance M a t r i x for Each Equation of the Volume Equation System  98  5.12 Estimated Coefficients and Standard Deviations of Coefficients for the Volume Equation System  99  5.13 Ranks for the Three Techniques  101  6.14 Distribution of Selected Psps by Age and Stems per Hectare Classes . . .  Ill  6.15 Fit Indices for O L S and M S L S Fits of the D b h Distribution System  . . .  124  6.16 M . A . D . for Five Classes for the O L S F i t of the D b h Distribution System  126  6.17 M . A . D . for Five Classes for the M S L S F i t of the D b h Distribution System  126  6.18 M . D . for Five Classes for the O L S F i t of the D b h Distribution System . .  128  6.19 M . D . for Five Classes for the M S L S F i t of the D b h Distribution System .  128  ix  6.20 Trace of the Coefficient Covariance M a t r i x for Each Equation of the D b h Distribution System  130  6.21 Estimated Coefficients and Standard Deviations of Coefficients for the D b h Distribution System  131  6.22 Ranks for the Three Techniques for the F i t of the D b h Distribution Systeml32 7.23 Distribution of Selected Psps for the T h i r d Application  137  7.24 Fit Indices for O L S , Composite Model, M S L S and M S L S Fits of the Yield Equation System  145  7.25 M . A . D . for Five Classes for the O L S F i t of the Yield Equation System 7.26  .  146  M . A . D . for Five Classes for the M S L S F i t of the Yield Equation System .  146  7.27 M . A . D . for Five Classes for the Yield Composite Model F i t  147  7.28  147  M . D . for Five Classes for the O L S Fit of the Yield Equation System . . .  7.29 M . D . for Five Classes for the M S L S Fit of the Yield Equation System . .  148  7.30 M . D . for Five Classes for the Y i e l d Composite Model Fit  148  7.31  Trace of the Coefficient Covariance M a t r i x for Each Equation of the Yield Equation System  149  7.32 Estimated Coefficients and Standard Deviations of Coefficients for the Yield Equation System  150  7.33 Ranks for the Three Techniques  151  x  List of Figures  Extension of Econometric Least Squares Methods for Fitting Systems to MSLS  xi  Acknowledgement  M y greatest thanks go to the professors at University of British Columbia. In particular, Dr. A . Kozak, as well as the other members of my thesis committee, Drs. J . H . G . Smith, M . Bonner, P. Marshall, and D. T a i l , of the Faculty of Forestry, and Dr. J . G . Cragg of the Department of Economics provided great assistance in completing my research. In addition, I would like to thank Dr. J . Wilson and M s . N . Bertrams for their efforts in securing funding and for processing the necessary information. D a t a were provided by the Timber Management Branch of the Alberta Forest Service. Special thanks go to M r . D. J . Morgan, Forester-in-charge, Forest Measurement Section, and his staff members for their prompt response to my data needs and questions. I would also like to thank Patrick Pelletier, my husband, for his patience and encouragement .  xii  Chapter 1  Introduction  In order to describe forestry problems, a system of equations is frequently used. For instance, for the prediction of tree volume, tree height is first estimated, and then tree volume is estimated from measured diameter outside bark at breast height (dbh) and the estimated tree height. The desired parameter, tree volume, is therefore estimated through the use of a system of two equations. The error structure for a system of equations will affect the results of any technique used to fit the system. The error terms among equations of the system may be correlated (contemporaneous  correlation) or variables appearing as dependent variables in one 1  equation may appear as an independent variable in another equation of the system. For example, the system of equations for the prediction of tree volume may be chosen as follows: height  =  /3Q + flidbh + ei  log volume  =  log 02 + 83 log dbh + 8 log height + e  (1.1)  2  4  2  (1.2)  where volume is the volume of the main bole of the tree from ground to tree top; height is the height of the main stem from ground to tree top; dbh is the diameter outside bark measured at breast height, 1.3 metres above ground; log is the logarithm, base 10 or base e; •"•For definitions of terms a n d abbreviations used i n this thesis, see A p p e n d i x A .  1  Chapter  1.  2  Introduction  0o, 0x, 0 , 03, 04 axe coefficients to be estimated; 2  ei, £2 are error terms. Height is a stochastic variable which occurs as both a dependent variable on the left hand side (LHS) of an equation and also as an independent variable on the right hand side (RHS) of an equation. Also, a measure of tree taper such as form factor has been excluded from the system. Taper would likely affect both height and volume, hence both Ci and  include the error due to the exclusion of a measure of taper, and are therefore  probably correlated (contemporaneous correlation). Further complications  arise if the  variances of the error terms for any of the equations in the system vary over the range of independent variables (heteroskedasticity ) , or if the error terms for a given equation 2  are correlated with the previous error terms (serially correlation). The error structure for a system of equations can therefore have any or all of the following characteristics. 1. Dependent variables may appear on the LHS of an equation in the system and also as a RHS variable in another equation so that the OLS assumption that the RHS variables are uncorrelated with the error term is not met for every equation. Systems with this characteristic are termed simultaneous equations. 2. The error terms among equations may be correlated indicating contemporaneous correlation. 3. Within individual equations, the error terms may be serially correlated (not independent), or the variances of the error terms may be heterogenous (not identically distributed), or both (neither independent nor identically distributed (non-iid)). The fitting approach for systems of equations for forestry applications should ideally meet all of the following criteria. 2  T h i s may also be spelled as heteroscedasiicity.  See McCulloch (1985) for discussion.  Chapter  1.  3  Introduction  1. The routine should be simple i n that few fitting steps are required. 2. Estimates of the coefficients and their associated variances should be unbiased or at least consistent. The estimates should have low variance (high efficiency). 3. Reported information on asymptotic and small sample properties of the estimated coefficients and their variances should be available. 4. Estimates should result in a compatible system of equations in that logical relationships among variables i n the system should be maintained in the fitted system. Criteria one through four should be met regardless of the error structure of the systems of equations (characteristics  1, 2, and 3, page 2).  The main objectives of this research were as follows: 1. To review forestry, econometrics, biometrics, and statistics literature and to choose, from this literature, a technique which satisfies all of the above criteria for  fitting  simultaneous, contemporaneously correlated systems of forestry equations, i n which the error terms of individual equations are non-iid. 2. To compare this alternative technique to the most commonly used methods of (1) an appropriate O L S fit to each of the equations and (2) an appropriate O L S fit to a composite model created by substituting the equations of the system into one composite equation. The central hypotheses of this thesis are: first, a fitting technique exists which satisfies the desired criteria for simultaneous, contemporaneously correlated systems of equations in which individual equations have non-iid error terms; and second, that any additional computational burden i n using the technique is compensated by the benefits of meeting the desired criteria.  Chapter  1.  Introduction  4  To meet the first objective, a literature search was conducted, first by examining forestry literature, and then by extending the search to econometrics, biometrics, and statistics literature. In order to restrict the scope of this thesis, only techniques based on least squares methodology were considered. T h e main alternative to the least squares approach is a maximum likelihood approach; however, the maximum likelihood approach requires that an assumption about the distribution of the dependent variables be made, and is more difficult to calculate. Also, the m a x i m u m likelihood approach is more sensitive to model specification error, to the presence of outliers, and to the presence of multicollinearity (Cragg, 1967; Summers, 1965). Least squares methodology was therefore selected as a more desirable method. Objective one was not met, because the search of the literature failed to provide a fitting technique which satisfies all of the desirable properties for fitting systems with all three of the characteristics listed for the error structure.  However, the information i n  the econometrics literature was used as the basis for the derivation of a new technique as part of this research. Results from several authors were combined into a comprehensive technique and labelled multistage least squares ( M S L S ) for this thesis.  3  The M S L S  technique is restricted to fitting systems of equations i n which one sample set is used to fit the equations and, therefore, the number of samples is the same for every equation of the system. A n iterated procedure was also derived and labelled iterated multistage least squares ( I M S L S ) . To meet the second objective, the derived M S L S technique, and the two most commonly used techniques were used to fit a system of equations for each the following forestry problems. T h e use of the terms multistage least squares in this thesis refers only to the technique derived by this author. Pienaar and Shiver (1986) use the term multistage least squares in the summary of their paper; however, it is likely that they are referring to established econometric techniques for systems of equations where error terms for individual equations are iid, which can be considered to be a subset of the MSLS comprehensive technique described in this thesis. 3  Chapter  1.  Introduction  5  1. The estimation of tree volume. 2. The estimation of diameter distributions from stand measurements, such as stand age and number of stems per unit area. 3. The estimation of volume and basal area growth and yield. The systems of equations for these problems were expected to have different error structures. For the first problem, the error terms were expected to be heteroskedastic within equations and contemporaneously correlated among equations. The second problem i n volved the prediction of parameters of a probability distribution; each equation of the system estimated one of the parameters.  For the third problem, the error terms were  expected to be serially correlated and perhaps heteroskedastic within equations, and contemporaneously correlated among equations. These estimation problems are commonly encountered in forest management and endogenous variables tend to be estimates which are later expanded. For instance, the predicted volume on a given tree may be expanded to a per hectare estimate of volume. The evaluation of the three fitting procedures, O L S , composite model, and M S L S , was based on comparing goodness-of fit measures, comparing the traces of the estimated coefficient covariance matrices, and calculating a summed rank, based on the presence or absence of desired properties of the estimates. In order to distinguish between variable types, the terms used i n econometrics  are  used. Stochastic variables which appear on the L H S of equations and may also appear on the R H S are termed endogenous variables derived from a Greek word meaning "generated from the inside" ( H u , 1973, page 121). termed exogenous  Variables which appear only on the R H S are  variables meaning "generated from the outside".  These exogenous  variables were assumed to have very little error and could be considered to be fixed variables (nonstochastic).  Chapter 2  Previous Attempts at Fitting Forestry Equation Systems  2.1  Ordinary Least Squares for Individual Equations  For systems of equations i n forestry, the most common fitting method has been the independent fitting of each equation i n the system using an appropriate least squares procedure, such as simple linear regression, multiple linear regression, weighted regression, or nonlinear least squares (Burkhart, 1986: Furnival and W i l s o n , Jr., 1971). The O L S approach has appeal i n that the method is well known and calculations to obtain estimates of coefficients and variances of coefficients and of dependent variables are relatively simple. The standard O L S procedure of simple or multiple linear regression applied to individual equations yields estimates of coefficients which are best linear unbiased estimates ( B L U E ) for linear equations if the following assumptions are met. 1. The error terms for an equation are i i d ; serial correlation and heteroskedasticity are not present within individual equations. 2. The variables on the R H S of each equation are uncorrelated with the error term of the equation (nonstochastic). 3. The error terms among equations are not correlated, meaning that contemporaneous correlation is not present.  6  Chapter  2. Previous  Attempts  ai Fitting  Forestry  Equation  Systems  A system of three equations appears as follows: (2.3)  y  2  = X /3 + £2 2  (2.4)  2  y3 = X3/33 — £3  (2.5)  where y; is an n by 1 matrix of the sample values for the i  th  endogenous variable in the  system of equations; X - is an n by k{ matrix of the sample values for all of the exogenous variables 2  which affect the i  th  endogenous variable in this equation of the system;  fli is a ki by 1 matrix of the true coefficients associated with the exogenous variables of this equation; €i is an n by 1 matrix of the error terms associated with each sample of the endogenous variable; n is the number of samples. These equations can also be expressed as follows:  y =  where 0  n  yi  x  On  On  A  y2  On  x  On  ft  ys  0  On  x  03  n  2  3  +  £2  = XB + E  (2.6)  £3  is an n by n submatrix of zeros.  If all of these assumptions are met for the system of equations, the covariance matrix of the error terms of the system, is a diagonal matrix as follows:  fl =  2T  On  On  On  2T  On  o  n  On  C/  3n A  where ft is the error covariance matrix for the system of equations;  (2.7)  Chapter  2.  Previous  Attempts  at Fitting  Forestry  Equation  Systems  8  erf In is a diagonal submatrix representing iid error terms within the equation i. Each of the diagonal elements is equal to erf for the i  equation. I is  th  n  the identity matrix of size n by n; 0  n  is an n by n submatrix of zeros, indicating no contemporaneous correlation.  The covariance matrix then "falls apart" into separate matrices for each equation and simple or multiple linear regression can be used to fit each equation separately.  The  estimated coefficients will be B L U E , and also maximum likelihood estimates (MLE) if the error terms are normally distributed. The estimated coefficient covariance matrix also will be unbiased. For individual equations, if the error terms are not identically distributed, in that their variances are not homogeneous, the error covariance matrix remains a diagonal matrix, but the diagonal terms for each equation are unequal, as shown below for three equations and three samples.  0  0  0  0  0  0  0  0  0  1 2  0  0  0  0  0  0  0  0  0  0  0  0  0  2 \  0  0  0  0  0  0  0  0  0  0  0  0  0  0  „2 a  0  0  0  0  0  0  0  0  0  0  0  0  0  0  _2 ^23  0  0  0  0  0  0  _2  0  0  0  0  0  0  0  0  0  0  0  0  0  0  where tr?. is the variance for the i  th  „2 a  equation and the j  th  „ 2 a  Z 2  0  (2.8)  0 _2 °33  sample.  Again, the covariance matrix can be divided into a separate matrix for each equation of the system. In this case, a specialized OLS technique such as a weighted regression  Chapter  2. Previous  Attempts  at Fitting  Forestry  Equation  9  Systems  procedure or another transformation of the variables in each equation can be used so that the variance of the transformed regression is as the matrix, in equation 2.7 (iid error terms), and a least squares procedure can be used to obtain estimates which are asymptotically BLUE for the untransformed variables (Judge et al., 1985, pages 420 and 421). Alternatively, the elements of the error covariance matrix can be estimated, and an estimated generalized least squares method (EGLS) could be used to fit each equation separately as follows: y;  fit GLS  =  =  Xifc  (2.9)  + ei  (X^.nx^'x^riy,  where y; is an n by 1 matrix of the sample values for the i  th  (2.10) endogenous variable in the  system of equations; Xj is an n by k{ matrix of the sample values for all of the exogenous variables which affect the i  th  endogenous variable in this equation of the system;  fii is a ki by 1 matrix of the true coefficients associated with the exogenous variables of this equation; ej is an n by 1 matrix of the error terms associated with each sample of the endogenous variable; fii GLS is an estimate of the fii matrix; <£; is the estimated error covariance matrix for i  th  equation of the system.  Generally, for EGLS, if the estimated error covariance matrix is consistent, the estimated coefficients and variances of these coefficients are consistent, and the distribution of each of the estimated coefficients is asymptotically normal. Several estimators for the error covariance matrix were presented by Judge and others (1985, pages 419 to 464) depending on the assumptions made concerning the heteroskedasticity. The procedure may be iterated by using the results of the EGLS fit to obtain a new estimate of the  Chapter  2. Previous  Attempts  at Fitting  Forestry  Equation  Systems  10  error covariance matrix and from this obtain a new 0i GLS , and so on until the estimated coefficients converge.  These iterations will result in what Malinvaud (1980. page 285)  termed a quasi-maximum  likelihood  estimator of the coefficients and the covariance ma-  trix. If normality of the error terms is assumed, then the estimators become M L E and the asymptotic covariance matrix will reach the Cramer-Rao lower bound for efficiency (minimum variance). If serial correlation is present, the covariance matrix is no longer diagonal, because relationships within the sample appear. In this case, if the relationship can be defined such as a first order autoregressive process, in which the correlation between error terms declines geometrically as the time between disturbances increases, a term may be added to the regression equation or a transformation of the variables may be used (Cochrane and Orcutt, 1949; Kadiyala, 1968). This technique was used by Monserud (1984) for a single equation to describe height growth using stem analysis data. Alternatively, as with the case of heteroskedasticity, a consistent estimator of the error covariance matrix may be found, and used in an EGLS procedure (Judge et al, 1985, pages 283 to 286). Gregoire (1987) demonstrated this procedure for a single equation where permanent sample plots were correlated over time. If endogenous variables appear on the RHS of one equation of the system and on the LHS of another equation of the system (simultaneous equations), the assumption that the variables on the RHS of each equation are not correlated with the error is not met. The independent OLS fit using simple or multiple linear regression will result in biased estimates of coefficients. To prove that this bias exists, equation 2.3 can be extended to include endogenous variables on the right hand side as follows: yi = Y  i 7 i  + Xif3i + e  (2.11)  z  where y; is an n by 1 matrix of the sample values for the i h endogenous variable in the t  Chapter  2.  Previous  Attempts  at Fitting  Forestry  Equation  11  Systems  system of equations; Yj is an n by g — 1 matrix of the sample values for all of the endogenous variables excluding the i  tn  endogenous variable;  7i is a g — 1 by 1 matrix of coefficients associated with the endogenous variables on the RHS of the equation. Coefficients are set to zero if the associated endogenous variables do not affect the  endogenous variable of the system;  Xj is an n by k matrix of the sample values for all of the exogenous variables of the system; /3j is a k by 1 matrix of coefficients associated with the exogenous variables. Coefficients are set to zero if the associated exogenous variables do not affect the i  tn  endogenous variable of the system;  6j is an n by 1 matrix of the errors associated with the i  tn  endogenous variable of  the system. The RHS variables and coefficients may be combined as follows:  + £j = Zj£j + £j  Yi = [Yj Xj]  (2.12)  Pi  For the g equations of the system, using g — 3 equations, On  On  o  z  2  o  On  0„  Z  yi  =  n  " s' 1  +  n  3  (2.13)  s  3  which may be restated as follows: y'= Z A + E where y is a gn by 1 matrix of n samples for each of g endogenous variables; Z is a gn by g({g — 1) + k) matrix of the RHS variables;  (2.14)  Chapter  2. Previous  Attempts  at Fitting  Forestry  Equation  12  Systems  A is a g([g — 1) - f k) by 1 matrix of the coefficients associated with the RHS variables; E is a gn by 1 matrix of the residual error for n samples for each of g endogenous variables. For a single equation, the OLS estimator of 5,; is therefore: 5i  = ( z ^ r z; 1  Yt  (2.15)  The expectation of the OLS estimator is: (2.16) (2.17) The second term does not disappear as the Zj are not independent of the error term and so the OLS fit is biased (Judge et al, 1985, page 571). This simultaneity not disappear if the sample size is increased.  bias does  The estimates are also inconsistent (see  proof in Judge et ai, 1985, page 571). Also, since the estimated coefficients are biased and inconsistent, the estimated coefficient covariance matrix cannot be used to calculate confidence limits for the true coefficients. If confidence limits are calculated using these estimated variances of the coefficients, the limits would be incorrectly narrow. If endogenous variables do not appear on the RHS, but contemporaneous correlation is present, the resulting OLS estimates of coefficients will not be most efficient as this correlation among equations is not included in the OLS fit of single equations. Zellner (1962) demonstrated this problem and called the system of equations with this contemporaneous correlation, seemingly unrelated regression (SUR) equations. If endogenous variables appear on the RHS and contemporaneous correlation is present, OLS estimates will again be biased and inconsistent, and, also, a loss in efficiency would result, because the information concerning correlation among equations would not be used in the OLS  Chapter  fit.  2.  Previous  Attempts  at Fitting  Forestry  Equation  13  Systems  Finally, if contemporaneous correlation is present, endogenous variables appear on  the R H S , and the iid assumption is not met for one or more equations, the standard O L S estimates using simple or multiple linear regression for individual equations will result in biased and inconsistent estimates of the coefficients and their variances. In addition, predictions of endogenous variables are not constrained to be logically compatible if an O L S fit is used. The practical implications of using the O L S fitting method for individual equations are as follows: 1. For simultaneous systems, even though any bias between an estimated and the actual coefficient, or between a predicted and the actual value of the dependent variable may be small, these small biases may be magnified over a forest inventory. For instance, if volume per tree was in error by 0.10 cubic metres, and this tree represented 100 trees in the stand, the volume for the stand would be i n error by 10 cubic metres. Also, biases i n certain coefficients may have a dramatic effect in the resulting estimates of dependent variables over the entire range or over partial ranges of the independent variables. 2. For simultaneous systems, estimates of coefficients based on the O L S fit of individual equations are not only biased, they are inconsistent. Consequently, no matter how many samples are collected, there is no assurance that the sample estimates will be close to the population values for coefficients and dependent variables. 3. The estimated coefficient covariance matrix from the O L S fit of simultaneous equations cannot be used to calculate confidence intervals. If these estimated variances are used, the confidence intervals will be incorrectly narrow, and there will be a higher chance that the true coefficients are not i n the confidence interval. hypotheses cannot be tested.  Also,  Chapter  2.  Previous  Attempts  at Fitting  Forestry  Equation  Systems  14  4. If contemporaneous correlation is present, O L S estimates will be less efficient, because the information concerning this correlation is not used in determining the O L S fit of individual equations. The resulting confidence limits will therefore be wider if O L S is used, and more samples will be required to obtain a desired precision. 5. Because the equations are fitted independently, compatibility of estimates is not assured. For instance, one equation may be used to estimate tree height and another equation may be used to estimate site index. If site index is defined as the height of the tree (or average tree) at 50 years measured at breast height, a desirable trait of this system of two equations would be that the height at 50 years from breast height predicted from the first equation, is equal to the site index of that tree as predicted from the second equation. The O L S fit of individual equations does not assure this logical compatibility. The independent O L S approach has been termed the "naive" approach by Intriligator (1978, page 373), because information concerning the error structure of the system of equations is ignored. However, the method is still useful i n that it is the easiest to calculate and computer programs are the most widely available. In addition, for preliminary work to define the system of equations, or where the system of equations is very large such as in systems for forestry growth and yield modelling, an independent O L S fit is probably the most practical method. However, compatibility is not assured, and estimates can be biased, inconsistent and not most efficient, depending on the error structure of the system.  Chapter  2.2  2.  Previous  Attempts  at Fitting  Forestry  Equation  Systems  15  Compatible Systems using Substitution Another frequently used method for fitting systems of forestry equations is to use  differentiation or substitution of variables and equations within the system. For instance, in an attempt to ensure logical compatibility of growth and yield estimates, Clutter (1963) suggested that growth equations should be obtained by taking the derivatives of the yield equations.  Resulting estimates were not considered efficient.  Bailey and Ware (1983)  produced a compatible basal area growth and yield model by using the growth model to calculate yield.  Matney and Sullivan (1982) described a system of substitutions to  obtain compatible stock and stand tables. They estimated the parameters of the Weibull distribution by relating the integration of the distribution for volume and basal area to the predicted volume and basal area, thereby ensuring compatibility. Other applications of the method of substitution include Ramirez-Maldonado and others (1987). who developed a system of equations for predicting height growth and yield. A model to predict height at time one was first fitted, and then used to predict one of the coefficients of the model to estimate height at time two, so that estimates for growth or yield of height are compatible. McTague and Bailey (1987a) developed a compatible system of equations for basal area and diameter distribution by recovering the parameters of a Weibull distribution from predicted stand variables.  First, the 10th, and 63rd percentiles, present and future,  were predicted from site index, age and stems per hectare.  The basal area was then  predicted from these current percentiles. The " a " parameter of the Weibull distribution was predicted from age, number of stems, and the 10th percentile. The 90th percentile was then predicted from the 10th and 63rd percentiles, and site index and age. The " b " and "c" parameters were then calculated mathematically. Substitution of all equations into one composite model has perhaps been even more widely used. Sullivan and Clutter (1972) developed a single linear model by substituting  Chapter  2.  Previous  Attempts  at Fitting  Forestry  Equation  16  Systems  the basal area growth equation into the volume yield equation to obtain a composite model. A maximum likelihood procedure was used to obtain unbiased estimates of the regression coefficients, because of serial correlation. The problem with this approach was that some of the variables from the original models were not significant in the composite model and disappeared from the resulting fit.  The original biologically based models  were lost; the result of changes in the variables was difficult to interpret. For example, the original model for predicting the log of volume at time two was as follows: lnV  where lnV  2  2  = h  (SI,  (2.18)  1/A , lnBA ) 2  2  is the natural logarithm of volume a.t time 2;  SI is the site index; A  is the age at time 2;  2  BA  2  is the basal area at time 2.  A substitution for basal area at time two resulted in the following equation.  '  n  V  i  =  h  (SI  h  %  x  l n B A i  '  1  -1?  I 1  x S I  )  ( 2 I 9 )  where Ai is the age at time 1; BAi is the basal area at time 1. The composite model no longer retained the expected biological relationship between volume and basal area.  Also, even though the authors suggested that the combined  linear model was compatible for prediction of growth and yield, the growth was obtained indirectly by subtracting yield at time two from yield at time one, rather than by an unique equation for growth and for yield. A composite model for height growth based on the Chapman-Richard's model (Pienaar and Turnbull, 1973) is frequently proposed.  The coefficients of the model are es-  timated for each tree or plot and then these coefficients are related to site index or  Chapter  2.  Previous  Attempts  ai Fitting  Forestry  Equation  Systems  17  o t h e r s a m p l e o b s e r v a t i o n s s u c h as h a b i t a t or r e g i o n . T h e r e s u l t i n g e q u a t i o n s to describe changes i n the coefficients are t h e n p l a c e d back i n t o t h e C h a p m a n - R i c h a x d ' s e q u a t i o n to o b t a i n a c o m p o s i t e m o d e l (e.g. B e c k , 1971; G r a n e y a n d B u r k h a r t , 1973; L u n d g r e n a n d D o l i d , 1970; T r o u s d e l l et ai,  1974). A n o t h e r e x a m p l e of the use of a c o m p o s i t e m o d e l is  V a n D e u s e n a n d others (1982) w h o s h o w e d t h a t a s y s t e m of equations for m e r c h a n t a b l e v o l u m e to a n y height or to a n y d i a m e t e r , t o t a l v o l u m e , a n d m e r c h a n t a b l e height c o u l d be m a d e c o m p a t i b l e b y e s t i m a t i n g one of t h e e q u a t i o n s , a n d u s i n g t h i s e s t i m a t e for the other equations.  O n l y one e s t i m a t e d coefficient was r e q u i r e d . M c T a g u e a n d B a i l e y  (1987b) p r o p o s e d a m o r e c o m p l e x c o m p o s i t e e q u a t i o n to estimate m e r c h a n t a b l e v o l u m e . T h e e q u a t i o n c o u l d be r e a r r a n g e d t o estimate t o t a l v o l u m e , a n d for t a p e r . T h e c o m p o s ite e q u a t i o n was f i t , a n d t h e f i t t e d coefficients were t h e n used for each of the equations o b t a i n e d b y r e a r r a n g i n g t h e c o m p o s i t e m o d e l . F o r t a p e r m o d e l l i n g , K o z a k (in press) subs t i t u t e d an e q u a t i o n for t h e d i a m e t e r at t h e i n f l e c t i o n p o i n t , a n d a n o t h e r e q u a t i o n for t h e e x p o n e n t of t h e e q u a t i o n , to o b t a i n a c o m p o s i t e m o d e l w h i c h d e s c r i b e d the d i a m e t e r for a given height above g r o u n d . T h e s u b s t i t u t i o n m e t h o d has been w i d e l y u s e d to ensure c o m p a t i b i l i t y of systems of forestry e q u a t i o n s .  H o w e v e r , t h e o r i g i n a l b i o l o g i c a l l y based m o d e l s m a y be c h a n g e d i n  t h e s u b s t i t u t i o n , a n d n e i t h e r efficiency n o r u n b i a s e d n e s s of the e s t i m a t e d coefficients or t h e i r variances is assured. A l s o , o n l y one v a r i a b l e o n t h e L H S is e s t i m a t e d .  2.3  M i n i m u m Loss Function A s i m u l t a n e o u s fit of t w o e q u a t i o n s was o b t a i n e d b y B u r k h a r t a n d S p r i n z (1984) b y  m i n i m i z i n g a loss f u n c t i o n w h i c h c o m b i n e d s q u a r e d errors for the first e q u a t i o n w i t h t h e s q u a r e d errors for t h e second e q u a t i o n as follows:  F = £ {Vi - V - ) / 4 2  B.) /*B 2  (2-20)  Chapter  2.  Previous  Attempts  at Fitting  Forestry  Equation  18  Systems  where V ; , V; is volume, actual and predicted; Bi, Bi is basal area, actual and predicted; u\ is the mean square error from the O L S fit of the volume equation; <jg is the mean square error from the O L S fit of the basal area equation. The advantage of this approach to fitting a system of equations is that constraints for coefficients  across equations of the system can be introduced.  Burkhart and Sprinz  introduced the coefficients specified by Clutter (1963) to ensure compatibility between the predicted basal area and yield equations.  Goodness-of-fit measures  1  were used to  test the resulting procedures. N o estimate of small or large sample bias or efficiency was made, although Burkhart and Sprinz noted that the resulting estimates gave a higher sum of squared error for the first equation and a lower sum of squared error for the second equation than O L S . They also noted that the m i n i m u m loss function as defined above was lower with this simultaneous fit than with the O L S fit of the equations individually. Reed and Green (1984) used a loss function similar to Burkhart and Sprinz (1984) to simultaneously estimate stem taper and volume coefficients, and constrained the coefficients so that logical relationships were represented by the coefficients.  They tested  four systems for estimating taper and volume, and found that the loss function decreased by 10 to 50 percent from the O L S fit of individual equations.  Byrne and Reed (1986)  extended the same loss function to a system of four equations for taper, total volume, volume ratio to an upper diameter, and volume ratio to an upper height. Reed and others (1986) used a m i n i m u m loss function for fitting four basal area and yield equations based on Sullivan and Clutter (1972). Knoebel and others (1986) also fitted growth and yield equations using a m i n i m u m loss function to minimize the squared 1  T h e t e r m goodness-of-fit  measures is not to be confused w i t h goodness-of-fit  tests, such as the C h i -  Square test for goodness-of-fit. T h e former refers t o measures w h i c h indicate how well the estimated equation fits the sample d a t a , whereas the latter refers to tests of whether the sample d a t a are f r o m a particular hypothesized d i s t r i b u t i o n .  Chapter  2.  Previous  Attempts  at Fitting  Forestry  Equation  19  Systems  error for the system. In all cases, the total sum of squares for the system represented by the loss function was reduced from the O L S fit of individual equations.  Compatibility was introduced  by constraining the coefficients, which may have resulted in biased estimates if these constraints are incorrect. If equations are simultaneous, these estimated coefficients will not only be biased, but also inconsistent as with the O L S fit of simultaneous equations. Also, the resulting estimates may not be most efficient.  2.4  Econometric Methods Based on the Assumption of iid E r r o r Terms The theory for simultaneously fitting a system of equations assuming i i d error terms  for individual equations was developed in the 1940s and early 1950s (Chow, 1983), and widely used in economics. The techniques have also been applied to agricultural problems (Friedman and Foote, 1955). Explanations of simpler techniques can be found in general econometric texts such as those by Gujarti (1978) and Wonnacott and Wonnacott (1979). More detailed explanations, proofs and other techniques can be found i n Intriligator (1978) and Judge and others (1985). The assumption of i i d error terms for individual equations results i n a covariance matrix for the error terms of the system which are block diagonal as follows for a system of three equations.  n= where  °lJ-n  ^12^n  0-2lIn  <T I  tT I  C"3lln  0"32ln  0"33ln  22  & 1 3 l  n  2 3  n  n  is the covariance of the error terms between equations if i ^ j, or is the variance of the error terms if i = j; I  n  is the identity matrix of size n by n ;  n is the number of samples.  ( - ) 2  21  Chapter  2.  Previous  Attempts  at Fitting  Forestry  Equation  20  Systems  W h e n equations are independent, contemporaneous variances among equations are zero. The error covariance matrix becomes a diagonal matrix as in equation 2.7, because the (Tij terms which are not on the diagonal are zero.  The simultaneous estimation of a  system of equations reduces to O L S for each equation when the error covariance matrix is diagonal, and the equations are not simultaneous. A summary of common techniques for estimation of systems of equations based on least squares theory assuming i i d error terms for individual equations is given in Table 2.1. The condition specified for the 2SLS and 3SLS procedures is that the simultaneous system of equations must be identified.  This condition is required so that an equation  can be distinguished from other equations in the system.  For just identified systems,  if algebra is used to change the equations of the system (structural equations), so that the endogenous variables appear only on the L H S (reduced-form equations), coefficients estimated from an O L S fit of the reduced-form equations can be used to recover the coefficients of the structural equations. If the system is underidentified, there is not enough information available from the reduced-form coefficients to solve for the coefficients for the structural model. For overidentified systems, more than enough information is available from the reduced-form coefficients, so that more than one solution for the structural coefficients is possible. If the simultaneous system of equations is underidentified, the statistical properties of 2SLS and 3SLS cannot be assumed. To test for identification, the rank condition must be met for each equation; an order condition can be used to define if the system is just-, over-, or underidentified, once the rank condition is met.  First, a matrix of all of the  coefficients of the system is constructed, with each column representing one variable of the system and each row representing one equation.  To test whether the i  th  equation  satisfies the rank condition, the row corresponding to the coefficients of that equation is deleted, and then the columns corresponding with nonzero coefficients for that equation  Chapter  2.  Previous  Attempts  at Fitting  Forestry  Equation  Systems  Table 2.1: Simultaneous Estimation Assuming i i d Error Terms  Error Structure  Technique  Description  1. A l l assumptions of O L S are met. The error covariance matrix is diagonal (equation 2.7).  OLS  O L S applied to each equation.  2. Contemporaneous correlation due to common absence of important exogenous variables in two or more equations of the system. A l l R H S variables are  Seemingly Unrelated Regressions (SUR); Linear or Nonlinear  Estimate the error matrix and use E G L S or nonlinear equivalent, extended to a system of equations.  3. Endogenous variables on L H S and on R H S of equations in the system (Simultaneous system of equations).  Two stage Least Squares, Linear (2SLS), or Nonlinear (2SNLS)  Stage one: Regress each endogenous variable on all exogenous variables of the system. Stage two: Regress original system equations but replace endogenous variables on R H S with predicted endogenous variables from Stage one.  4. As type 3 above but contemporaneous correlation due to common absence of important exogenous variables from two or more equations of the system.  Three stage least squares, linear (3SLS), or nonlinear (3SNLS)  Stages one and two as 2SLS above. Stage three: Use the estimated error terms from Stage two to obtain an estimate of the error covariance matrix. E G L S for the system.  Chapter  2.  Previous  Attempts  at Fitting  Forestry  Equation  Systems  22  are deleted. The determinate of at least one submatix of order g — 1 from this reduced matrix must be nonzero for the rank condition to be satisfied. T h e order condition can then be checked. If the number of exogenous variables i n the system minus the number of exogenous variables i n the i  th  in the i  th  equation, is equal to the number of endogenous variables  equation minus one, the equation is justidentified. If this difference is greater,  the equation is overidentified. A l l of the equations of the system must be identified for the system to be identified (See G u j a r t i , 1978, pages 353 to 365, for more explanation). The earliest forestry application of econometric techniques for fitting systems of equations is credited to Furnival and Wilson (1971). fitting  They showed that the simultaneous  methods developed for economics could be applied to a system of equations to  estimate growth and separately to a system of equations to estimate yield. They indicated that the advantage of the use of these methods over substitution or the O L S fit of individual equations is that confidence limits for coefficients of the system could be estimated.  They noted that three stage least squares  2  (3SLS) gave unexpectedly small  elements i n the estimated coefficient covariance matrix and so confidence limits would be narrow. Subsequently, the application of econometric theory for simultaneous estimation of coefficients in forest growth and yield equations, assuming that individual equations have i i d error terms, has been demonstrated by several authors including M u r p h y and Sternitzke (1979), Murphy and Beltz (1981), Murphy (1983), Borders and Bailey (1986), and Hans (1986). M u r p h y and Sternitzke (1979) and Murphy and Beltz (1981) used 3SLS to estimate the coefficients of growth and yield models for pine in the West G u l f Region. T h e selected equations were those developed by Clutter (1963) and Sullivan and Clutter (1972). The study by Furnival and Wilson (1971) was cited by them and used as the main reason for T h e terms two stage or three stage least squares are not to be confused w i t h the terms two or three stage sampling. T h e former refers to techniques used to analyze d a t a to estimate coefficients whereas the latter terms refer to s a m p l i n g designs. 2  Chapter  2.  Previous  Attempts  at Fitting  Forestry  Equation  Systems  23  opting to use a simultaneous approach to estimation of coefficients rather than an O L S approach to individual equations.  M u r p h y and Sternitzke stated that no attempt was  made to compare the simultaneous approach to the O L S approach. In both papers, the fit statistics and a discussion on how to use the resulting system were given. N o tests were done to determine if individual equations were i i d even though this is an assumption of the 3SLS procedure. Murphy (1983) chose a system of nonlinear equations to model growth and yield. For each of the dependent variables i n the system, he used nonlinear least squares to select the "best" model. He expected both contemporaneous and serial correlation i n the system and so he chose the seemingly unrelated nonlinear regressions technique to estimate the coefficients of the chosen system of equations, even though S U R techniques assume that individual equations have i i d error terms and are therefore not serially correlated. He then compared the simultaneous fit to the independent fitting of each equation i n the system using nonlinear least squares.  He concluded that the simultaneous fit had no  evident benefit over the individual nonlinear least squares fit for the data and models tested. Borders and Bailey (1986) noted that the advantages of the simultaneous fitting methods were that: 1. Point estimates are consistent and can be also efficient. 2. Compatibility of equations is obtained. 3. Interval estimates can be derived. The work of Clutter (1963) was used as the basis for the establishment of a system of growth and yield equations. They then compared the O L S fit of each of the equations independently to the simultaneous fit of the system of equations.  Three simultaneous  Chapter  2.  Previous  Attempts  fitting procedures were used.  at Fitting  Forestry  Equation  Systems  24  First, a 2SLS procedure was used, followed by a 3SLS  procedure, and then a 3SLS procedure, modified by restricting some of the coefficients, was used. They also estimated the covariance matrices of the estimated coefficients and of the predicted endogenous variables. They concluded that the estimates of the coefficients were much the same for the O L S , 2SLS, 3SLS, and restricted 3SLS techniques; however, the 3SLS fits were more efficient (lower variance). Also, compatibility was obtained with the 2SLS, 3SLS, and restricted 3SLS techniques. Hans (1986) compared the O L S fit to a substitution method, and a S U R fit for Clutter's (1963) yield equation system. Goodness-of-fit measures for the original and an independent data set, were used to compare the alternative methods. S U R gave lower standard errors than O L S , but the goodness-of-fit measures were better for the O L S fit. Hans recognized that an alternative simultaneous fitting technique would have been more appropriate, because endogenous variables i n the system also appear as R H S variables. Amateis and others (1984) demonstrated that forestry applications of simultaneous fitting techniques other than for growth and yield exist. They compared the O L S fit to two different simultaneous fitting techniques for estimates of product yield. Three systems were established, based on differing assumptions of inter-relationships of product yield. T h e first system assumed no inter-relationships among equations (contemporaneous variances were assumed to be zero) and so an O L S fit was appropriate. The second system assumed that relationships proceeded in succession, meaning that the results from the first equation affected the second equation, and the results from the first and second equations affected the third equation and so on. T h e resulting system was therefore identified as a recursive system. Errors between successive equations were also assumed to be independent and so an O L S fit was used to fit each equation of the recursive system. The third system was a simultaneous system of equations with endogenous variables appearing on both the R H S and the L H S and errors between equations were assumed to be  Chapter  2.  Previous  Attempts  at Fitting  Forestry  Equation  Systems  25  correlated. This system was fitted using 3SLS. Goodness-of-fit measures were calculated for each fitted system and compared. A summed rank was calculated by comparing the predicted value from each fitted system, to the observed value. The fitted system was given a rank of one if the predicted value w as closest to the observed value; other fitted r  systems were given a rank of zero. The ranks were summed over all observations; the 3SLS had a higher summed rank. The recursive system which was fitted was considered to be inappropriate as dependencies between different products and different equations appeared to be significant. They concluded by saying that the simultaneous fitting approach did not substantially alter the estimates of coefficients, but the methods were more appealing. Borders and others (1987b) developed a system of 12 equations to predict 12 percentiles of a diameter distribution from other percentiles and stand attributes.  The  system was fitted using S U R , as error terms among equations appeared to be contemporaneously correlated. The system was set up in sequence and so endogenous variables appearing on the R H S of the equation were predicted by a previous equation, justifying the use of the S U R method. The S U R method was also used by Bailey and da Silva (1987) to fit basal area at time two from basal area at time one. Since overlapping measurement intervals were used to obtain coefficients for the model and contemporaneous correlation was expected to be significant, the S U R technique was used to fit coefficients for a system of equations. Each equation represented one measurement length.  2.5  Econometric Methods without the Assumption of iid Error Terms If the assumption that the error terms of individual equations are i i d does not hold,  the error structure is no longer block diagonal as in equation 2.21, and techniques which  Chapter  2.  Previous  Attempts  at Fitting  Forestry  Equation  26  Systems  assume that the error is block diagonal will no longer be asymptotically most efficient, because information concerning the variances not on the diagonal is not included in the resulting fit.  Techniques developed for fitting systems of equations in this case, still  require that assumptions about the error covariance matrix be made, because not all elements of the gn by gn error matrix of the system of equations can be consistently estimated with only n samples (Judge et al, 1985, page 174). Few attempts have been made to obtain an efficient solution for forestry systems which have an non-iid error terms for individual equations.  The first attempt was by  Ferguson and Leech (1978) to predict coefficients of a yield equation from exogenous variables outside of the system, a technique often labelled as parameter  prediction  forestry literature. They noted that much statistical literature was available for  in  fitting  equations in which the coefficients were considered to be random variables, but little work had been done to fit equations in which the coefficients were considered to be random functions of other exogenous variables. To begin, they fitted a yield equation to each of 20 permanent sample plots, with nine observations each, using a standard O L S technique, multiple linear regression, resulting in a set of coefficients for each plot. The DurbinWatson bounds test (Durbin and Watson, 1951) was applied to each plot to test for serial correlation. No significant correlations were noted on most plots; however, they did note that the tables published by D u r b i n and Watson showing the distribution of the test statistic did not include values for the small sample size of nine observations. In addition, variances of the error terms within each plot were considered to be homogenous, and so the mean square error from the standard O L S fit was used in estimating the variance of each coefficient on each plot. These estimated coefficient variances were examined and, using Bartlett's test for equality of variances, it was determined that the variances differed significantly among plots. A system of three equations was established to predict each coefficient using stand variables. Since the three coefficients were related, the error terms  Chapter  2.  Previous  Attempts  at Fitting  Forestry  Equation  27  Systems  for these three equations were considered to be contemporaneously correlated. Multiple linear regression was then used to fit each of the random coefficient functions. A n estimate of the error covariance matrix for the system of parameter prediction functions was found by combining the estimates of contemporaneous variance from this second O L S fit and the estimates of variance for each plot found by the O L S fit to the observations for each plot. The S U R approach was therefore modified by Ferguson and Leech to heteroskedastic, contemporaneously correlated equations.  T h e error covariance matrix was assumed to  have the following form, shown for three equations and three samples. 0111  0  0  0121  0  0  0"131  0  0  0  0"ll2  0  0  0"l22  0  0  0"l32  0  0  0  0"ll3  0  0  0" 123  0  0  0133  °"211  0  0  0 221  0  0  0"231  0  0  0  0"212  0  0  0"222  0  0  0"232  0  0  0  0213  0  0  0223  0  0  0"233  0311  0  0  0"321  0  0  0331  0  0  0  0312  0  0  0322  0  0  0 332  0  0  0  0"313  0  0  0323  0  0  0"333  where 0 \ j is the covariance for the i m  estimated by  and the j  equations and the m  sample,  &ij ; m  the matrix is symmetric. For  the Ferguson and Leech study, the form of the matrix was expanded to include  estimated variances for 20 plots. This estimated error covariance matrix was then used to obtain an E G L S fit, extrapolated to the whole system of equations as follows:  y 3  =  XB-fE  E G L S extended to a system of equations has also been termed joint generalized  3  (2.23) least squares.  Chapter  2.  Previous  Attempts  at Fitting  BEGLS Var(B ) EGLS  Forestry  Equation  =  (X'n-'xy  =  (X'fl-'Xy  1  Systems  X'Q-'y  1  28  (2.24) (2.25)  where y is a gn by 1 matrix of the endogenous variables; X is a gn by gk matrix of the exogenous variables; B is a gk by 1 matrix of the coefficients associated with the exogenous variables. Coefficients were set to zero if the associated exogenous variables did not affect the  endogenous variable;  E is a gn by 1 matrix of the error terms associated with n samples of each of g endogenous variables; BEGLS i  s a n  estimate of the B matrix;  O is a gn by gn matrix of the estimated error covariance matrix. This overall fit accounted for heterogeneity of error within equations, as well as the correlation of error among equations by combining the error from the first and second step.  A l l R H S variables were assumed to be nonstochastic, and serial correlation was  assumed to be absent. Davis and West (1981) and Ferguson and Leech (1981) published notes of correction for the procedure given in Ferguson and Leech (1978). Newberry (1984) and Newberry and Burkhart (1986) used the technique of Ferguson and Leech (1978), corrected by Davis and West (1981), to obtain estimates of coefficients of a taper function, but they expanded the technique to a nonlinear fitting method for the first step.  A taper function was fitted for each tree using nonlinear least squares  and variances for each coefficient were estimated using a Taylor's series and a jackknife approximation. A runs test was then used to test for serial correlation (Lehman, 1975) for each tree. A sign test (Lehman, 1975) w as then performed using the results of the runs r  tests, by counting the number of trees which showed serial correlation (significant results from the runs test), and then by comparing this count to the number of trees expected to  Chapter  2.  Previous  Attempts  at Fitting  Forestry  Equation  Systems  29  show serial correlation if the results were random. They decided that serial correlation was significant but stated that they were unaware of any technique to account for this serial correlation and so it was ignored. They then continued by relating each of the three coefficients to tree and stand characteristics to obtain a system of linear equations. The error covariance matrix accounting for the first (nonlinear) and second step ( O L S on each of the parameter prediction equations) was estimated using the corrected Ferguson and Leech method, and E G L S was used to obtain estimates of coefficients and their associated variances for the system of equations. Some of the second step variance estimates were negative and so were given a value of zero. The authors noted no real improvements using the E G L S approach, extended to a system, to estimate coefficients for the parameter prediction system versus using O L S for each equation separately. The Ferguson and Leech (1978) approach to parameter prediction, followed subsequently by Newberry (1984) and Newberry and Burkhart (1986), introduced a method for fitting a system of equations which accounted for heteroskedasticity (not identically distributed) and contemporaneous correlation. However, all R H S variables were exogenous, and serial correlation was assumed to be absent. Also, the L H S variables of the system were parameters obtained by fitting an equation for each sample unit (plot or tree). The variance of each of these L H S variables was therefore obtained from the initial estimation of the parameters.  This method of determining the variance of each sample  unit could not be applied to another system of equations in which the L H S variables are measured observations of endogenous variables, rather than estimated parameters.  2.6  Discussion of Previous Approaches  None of the methods used previously to fit systems of forestry equations has all of the desirable properties for fitting systems of equations identified i n the introduction. Each  Chapter  2.  Previous  Attempts  a t Fitting  Forestry  Equation  Systems  30  of the procedures lacks one or more of these properties, for a simultaneous, contemporaneously correlated system of equations, in which error terms of individual equations are non-iid. The O L S fit of individual equations results i n estimated coefficients which are biased, and inconsistent if the equations are simultaneous. Confidence limits for the true coefficients cannot be calculated using the estimated coefficient covariance matrix. For nonsimultaneous systems with non-iid error terms, weighted regression or E G L S can be used on each equation; however, if error terms are both non-iid and contemporaneously correlated, the resulting fit will not be most efficient. For simultaneous equations with non-iid error terms, the O L S fit or E G L S fit of individual equations results in biased and inconsistent estimates of the coefficients. Information on small sample properties i n the presence of non-iid errors for simultaneous systems is limited, but information which is available for i i d error terms for simultaneous systems shows that O L S is more biased for small samples than econometric methods, such as 2SLS and 3SLS, if equations are simultaneous (Cragg, 1967; M i k h a i l , 1975; Nagar, 1960; Sawa, 1969; Summers, 1965). Methods to increase the efficiency over the O L S fit, when contemporaneous correlation is significant are well documented (Kmenta and Gilbert, 1968; Zellner, 1962; Zellner and Huang, 1962) even for small sample sizes (Mehta and Swamy, 1976; Revankar, 1974; Zellner, 1963). In the presence of heterogeneity and serial correlation, the O L S fit of individual equations is less efficient relative to the generalized least squares ( G L S ) fit, and has also been shown to be less efficient relative to the estimated generalized least squares ( E G L S ) fit using consistent estimators of the error covariance matrix (Rao and Griliches, 1969). The O L S fit is therefore the simplest approach, thereby satisfying the first criterion, but the estimates can be biased, inconsistent, and not most efficient. In addition, the desired property of compatibility is not assured with the O L S fit. The substitution method has an advantage over the O L S fit i n that compatibility of  Chapter  2.  Previous  Attempts  at Fitting  Forestry  Equation  Systems  31  the system is achieved. However, no further advantages are gained through this method, unless the derived composite model has only exogenous variables on the R H S . The minimum loss function approach has appeal i n that compatibility is achieved, and, i n addition, the mean square error for the system of equations may be reduced over the O L S fit individual equations, if biased estimates are introduced. T h e method is also relatively simple for small systems, but for large systems of many equations and many variables, the search to find the minimum value for the function may be long and difficult. Also, if the error terms are non-iid and contemporaneously correlated, the method is less efficient than alternative methods which make use of this information. If simultaneous equations are represented i n the system, the resulting estimators will remain biased as with the O L S fit. Econometric methods for fitting systems of equations based on the assumption that the error terms for individual equations are i i d have the advantages over O L S of being compatible, consistent and less biased than O L S i n the presence of simultaneity, and more efficient than O L S if contemporaneous correlation is significant. If the error terms of individual equations are non-iid, the methods result i n inconsistent estimates of the coefficient covariance matrix, so that other techniques become more desirable. The non-iid approach illustrated by Ferguson and Leech (1978) and Newberry and Burkhart (1986) for systems of parameter prediction equations are applications of E G L S to systems of equations.  Ferguson and Leech (1978) indicated that their estimator for  the error covariance matrix is consistent, and so the properties of G L S can be assumed for large sample sizes. This non-iid approach, therefore, yields consistent estimates of coefficients which are also asymptotically efficient (Judge et a/., 1985, pages 175 and 176). The simultaneous fitting of the equations does ensure compatibility and the number of steps required is few. However, the technique to estimate the error covariance matrix  Chapter  2.  Previous  Attempts  ai Fitting  Forestry  Equation  Systems  32  used by Ferguson and Leech is only applicable to a system of parameter prediction equations; error terms for an individual parameter prediction equation must not be serially correlated, and all R H S variables must be exogenous. Also, for small sample sizes such as that used by Ferguson and Leech, the level of efficiency is uncertain and variances may even exceed those of the O L S applied to each of the parameter prediction equations separately.  Chapter 3  A n Alternative Simultaneous F i t t i n g Procedure  3.1  Extension of Econometric Methods to Multistage Least Squares  A fitting technique which satisfies all of the desired criteria for simultaneous, contemporaneously correlated systems of equations with non-iid error terms, was not found i n forestry literature, nor i n the subsequent search of econometrics, biometrics, and statistics literature.  Techniques for systems which have non-iid error terms for individual  equations, and are contemporaneously correlated, but not simultaneous, were found for some forms of the non-iid error structure. For instance, the non-iid approach illustrated by Ferguson and Leech (1978) is an extension of the S U R technique for non-iid error terms, but this technique is only useful for a system of parameter prediction equations assuming heteroskedastic, contemporaneously correlated error terms. In econometric literature, techniques for extension of the S U R method to systems with serial correlation and contemporaneous correlation were found (Kmenta and Gilbert, 1970: Parks, 1967). However, extensions to heteroskedastic, contemporaneously correlated systems of equations which are not systems of parameter prediction equations were not found.  Also,  a technique for simultaneous systems in which error terms of individual equations are non-iid was not found. Existing techniques were therefore extended to different error structures and to simultaneous systems as part of this research. This extended technique was labelled multistage least squares ( M S L S ) , because the technique is based on least squares methodology, and  33  Chapter  3.  An Alternative  Simultaneous  Fitting  34  Procedure  many fitting steps (stages) are required to obtain estimated coefficients and the associated covariance matrix. The M S L S technique can be viewed as an extension of S U R techniques for noniid error terms to simultaneous systems, and to other types of non-iid error structures not previously examined for S U R equations.  It can also be viewed as an extension of  the 3SLS technique to systems of simultaneous equations with non-iid error terms for individual equations.  In order to show how the M S L S technique can be derived and  justified from existing techniques, econometric methods assuming i i d or non-iid error terms were combined into a flowchart presented i n Figure 3.1, and described below. The E G L S fitting technique was derived by Aitken (1934-35).  This technique was  extended to a system of equations with iid error terms for individual equations, for which contemporaneous correlation was significant by Zellner (1962) and labelled the S U R fitting method.  Independently, Theil in 1953 (as referenced in Basmann,  1957)  and Basmann (1957) derived a means of "purging" simultaneity bias from simultaneous equation systems called 2SLS. Zellner and Theil (1962) combined S U R and 2SLS for simultaneous, contemporaneously correlated systems, and called the combined technique 3SLS. Because 3SLS is an extension of the S U R technique to simultaneous equations, the error terms for individual equations are assumed to be i i d . The steps for the 3SLS technique are as follows:  1  Stage 1. Simple or multiple linear regression is used to predict endogenous variables from all of the exogenous variables in the system (first stage equations). Standard O L S techniques provide unbiased estimates of the coefficients to predict each of the endogenous variables even if the equations to predict these variables show heteroskedastic or serially correlated error terms. These steps are also given in Table 2.1, but are included here so that the extension to equations with non-iid error terms can be shown. 1  Chapter  3.  An Alternative  Simultaneous  Fitting  35  Procedure  Figure 3.1: Extension of Econometric Least Squares Methods for Fitting Systems to MSLS  Prediction using E G L S (Goldberger, 1962) (Yamamoto, 1979)  Iterations (Dhrymes, 1971) (Magnus, 1978)  S U R with Heteroskedasticity and Autocorrelation modified Parks (1967) EGLS for one equation (Aitken, 1934-35)  Heteroskedasticity, S U R (Ferguson and Leech, 1978) modified Parks (1967) (Duncan, 1983)  Autocorrelation, S U R (Parks, 1967) (Kmenta and Gilbert, 1970)  SUR (Zellner, 1962)  3SLS (Zellner and T h e i l , 1962)  2SLS Theil i n 1953 (B asmann , 1957)  U  Iterated Multistage Least Squares IMSLS  Multistage Least Squares MSLS  Chapter  3.  An Alternative  SimuHa,neous  Fitting  36  Procedure  Stage 2. To remove simultaneity bias, the predicted endogenous variables from stage 1 are substituted for the endogenous variables which appear on the R H S (second stage equations), and a standard O L S technique is applied to each of the modified equations of the system.  2  7t  + e< = ZiSi + e; •  A =  Si  (3.26)  (z;-Z )" Z ; , 2  (3.27)  y  This substitution ensures that all R H S variables are now independent of the error terms of each second stage equation so that the O L S fit for each equation will result in consistent estimates of all coefficients. The 2SLS technique is complete at this point. Stage 3. To gain efficiency, for the 3SLS technique, the residual errors from the O L S fit of the second stage equations are used to obtain estimates of the contemporaneous variances using the estimator by Zellner (1962), as follows:  t J  where  n  n  is the covariance of the i  th  and the j  th  equations if i ^ j, and is the  variance if i = j. These values are the same for all n samples; i{, ij are n by 1 matrices of the error term for the i  th  and the j  3  th  equations  respectively; n is the number of samples; T h e n o t a t i o n used here is described on pages 14 and 15 of this thesis. Z e l l n e r (1962) used the number of samples minus the n u m b e r of exogenous variables i n the system as the denominator. Revankar (1974) and other authors used n. Other estimates of the degrees of freedom are given i n Judge and others (1985), page 600. 2  3  apter 3. An Alternative  Simultaneous  Fitting  37  Procedure  yi, yj are n by 1 matrices of the sample values for the endogenous variables of the i  ih  and the j  t h  equations, respectively;  fi, Yj are n by 1 matrices of the predicted values from the second O L S fit for the endogenous variables of the i  th  and the ^ e q u a t i o n s , respectively.  These variance estimates are then combined to obtain an estimate of the error covariance matrix for the system of equations. This matrix will be a block diagonal matrix with all elements of each diagonal in a block being equal (see equation 2.21). The estimated error covariance matrix is then used to fit the system of equations for which the simultaneity bias was purged, using E G L S for the system of equations. y  EGLS  ZA + E  (3.29)  = (z'n-'zy z'rrV 1  A EGLS Var(A  =  =  )  (Z'n-'Zy  1  (3.30) (3.31)  where y is a gn by 1 matrix of the endogenous variables; Z is a gn by g((g —!) + &) matrix of the R H S variables (endogenous variables substituted by predicted endogenous variables); A is a g((g — 1) + k) by 1 matrix of the coefficients associated with the R H S variables. Coefficients are set to zero if the associated R H S variables do not affect the i  tn  endogenous variable; alternatively, the  corresponding variable and its zero coefficient may be removed from the matrix; E is a gn by 1 matrix of the error terms associated with n samples of each of g endogenous variables;  Chapter  3.  An Alternative  A.EGLS is  a n  Simultaneous  Fitting  Procedure  38  estimate of the A matrix;  fl is a gn by gn matrix of the estimated error covariance matrix. Because the simultaneity bias is removed i n the first stage of 3SLS, estimated coefficients are consistent. These estimates are more efficient than 2SLS if contemporaneous correlation is significant and if the set of the R H S exogenous variables is different for each equation (Srivastava and T i w a r i , 1978;  Zellner and Theil, 1962), even for small  sample sizes (Cragg, 1967). The 3SLS technique has also been shown to be less biased than 2SLS (Cragg, 1967) and therefore is less biased than O L S on individual equations. The estimated coefficients from 3SLS have been shown to be asymptotically normally distributed and asymptotically equivalent to the maximum likelihood estimators for systems of equations (full information maximum likelihood) if the error terms for each equation are considered to be normally distributed (Cragg, 1967). For small samples, the iterated three stage least squares (I3SLS) may be used to obtain estimated coefficients and the error covariance matrix at the .same time ( M i k h a i l , 1975). Iterations to improve E G L S estimates for small samples i n single equations and i n S U R or simultaneous systems have been discussed by Dagenais (1978), Dhrymes (1971), K m e n t a and Gilbert (1968), Magnus (1978), Madansky (1964), and Telser (1964).  Generally,  iterated E G L S is equal to the M L E estimates if error terms are normally distributed. The I3SLS technique is similar to the "zigzag" method proposed by Oberhoffer and Kmenta (1974) to obtain maximum likelihood estimates for single equations with non-iid error terms. The 3SLS and I3SLS techniques were extended to systems with non-iid error terms for individual equations by adding more stages. Techniques from literature to estimate the error covariance matrix for S U R systems which have non-iid error terms for individual equations (Kmenta and Gilbert, 1970; Parks, 1967) were extended to simultaneous  Chapter  3.  An Alternative  Simultaneous  Fitting  Procedure  39  systems in the same manner as S U R estimators were combined with 2SLS methods to obtain 3SLS methods. This extension of the S U R method assuming non-iid error terms to simultaneous systems is the basis for the M S L S technique derived as part of this research. The iterative analog has been termed iterated multistage least squares ( I M S L S ) . The M S L S procedure is therefore as follows: 1. A standard O L S technique is used to obtain predicted endogenous variables using all of the exogenous variables in the system (first stage equations). If the number of exogenous variables is high, some can be removed for easier computation with a loss in efficiency (Brundy and Jorgenson, 1971). 2. A l l endogenous variables on the R H S of equations in the system are replaced by their respective predicted values to purge simultaneity bias from the system. Simple or multiple linear regression is then used to obtain fitted second stage equations.  4  3. A consistent estimate of fl is obtained. The stages required to obtain a consistent estimate vary depending on the error structure of the system (Stages 2 and further). 4. The E G L S technique, extended to the system of equations, is used to obtain an estimate of A as with Stage 3 of 3SLS. 5. To extend M S L S to I M S L S , a new value of fl is computed using the estimated coefficients from the M S L S fit. The fl  new  matrix is restricted in the same way as  the first fl matrix. For instance, if the error covariance matrix for the first M S L S fit was considered to be block diagonal, as with 3SLS i n which individual equations have i i d error terms, the new estimated error covariance matrix would be also block I f error terms are non-iid, E G L S using an estimated error covariance m a t r i x for each equation i n the system can be used to o b t a i n consistent estimates of the coefficients a n d their variances for the second stage equations. T h i s is s i m p l y an extension of 2 S L S to non-iid error terms. F u r t h e r steps of the M S L S technique will only result i n a gain in efficiency. 4  Chapter  3.  An Alternative  Simultaneous  Fitting  40  Procedure  diagonal. The contemporaneous variances for the second iteration of a system of equations with a block diagonal error covariance matrix, are calculated as follows:  0~ij3SLS  =  t'i3SLS£j3SLS  =  (Yi -  Yi3SLs)'  n  where  Y  n  is the covariance of the i  &ij3SLS  fo o o \ {6.61)  ( j ~~ Yj3SLs)  th  and the j  t h  equations if i  j , and  is the variance if i = j . These values are the same for all n samples; £j3SLS  £i3SLS,  are n by 1 matrices of the error terms for the i  th  and the  j  th  equations, respectively, using the estimated coefficients from the 3SLS fit; n is the number of samples; y;, Yj are n by 1 matrices of the sample values for the endogenous variables of the i  th  and the j  Yi3SLS, fj3SLS  a r e  equations, respectively;  th  by 1 matrices of the predicted values using the  n  estimated coefficients from the first 3SLS fit for the endogenous variables of the i  th  and the ^ e q u a t i o n s , respectively.  6. A new E G L S fit for the system of equations is obtained by replacing the previous estimated error covariance matrix by the new Cl. A  = (Z'O^Z)"  1  n  e  w  Z'fi-ly  (3-33)  7. Steps 4 and 5 are then be repeated until convergence occurs. Because of the asymptotic properties of E G L S , this will only lead to improvements if the sample size is small. If the error covariance matrix estimate (step 2) is consistent, the properties of generalized least squares ( G L S ) which uses the true error covariance matrix, can be assumed  Chapter  3.  An Alternative  Simultaneous  Fitting  41  Procedure  for E G L S using large sample sizes (Judge et al, 1985, page 176). The estimated coefficients will be consistent, as will the estimate of the variances of the coefficients. Also, the distribution of the estimated coefficients will be asymptotically normal and confidence limits can be calculated using the normal probabilities (Maddala, 1974). For small sample sizes, the iteration should result i n estimates which are quasi-maximum likelihood, or are M L E under the assumption that the error terms are normally distributed, as with the I3SLS procedure, assuming that the assumptions made concerning the error covariance matrix are correct. However, convergence of estimates by iterations is not assured. To obtain a consistent estimate of the covariance matrix some assumptions about the nature of the covariance matrix must be made as it is not possible to estimate all of the elements of the error covariance matrix consistently (Judge et al, 174).  1985,  page  Also, many stages may be required and the stages required will differ depending  on the assumptions made. For some non-iid error structures, procedures for obtaining consistent estimates of the error covariance matrix were not found i n literature. For these error structures, existing techniques were modified and results published i n literature were used as evidence that these modifications provide consistent estimates of the error covariance matrix.  3.2 3.2.1  Estimation of the E r r o r Covariance M a t r i x Autocorrelation and Contemporaneous Correlation  The error covariance matrix for a serially correlated single equation with homogenous variances (not independent, identically distributed) is the product of a scalar and a matrix given as follows (Judge et al, 1985, page 275): $ = cr * 2  e  where «& is an n by n matrix of the error covariance;  (3.34)  Chapter  3.  An Alternative  Simultaneous  Fiiting  42  Procedure  cr is the scalar multiplier; 2  \& is an n by n matrix. If the error terms for an equation are i i d , ^  becomes the identity matrix I . n  The  assumption made in this thesis is that the relationship of the current error to the previous error remains the same for all of the error terms over the entire sample set (stationary process). For most forest inventory problems, the time from one observation to the next observation is usually quite long. Therefore, if serial correlation is present, the error for one observation is likely correlated with the observation immediately preceeding it and is less correlated w ith the previous observations. This correlation was therefore assumed to r  follow a first order autocorrelation process; the correlation between error terms declines geometrically with the distance between measurements.  The assumption of first order  correlation for most forestry inventory situations is therefore probably correct. The <& matrix assuming a first order autoregressive process for a single equation is as follows:  p  where  p  is the slope of e  m  =  pe -i m  +  v  m  n-l  P  P  1  P  P  1  n-1  p  and  v  m  2  n-3  p  •••  P^ „n-2  (3.35)  J_  are i i d ;  <J is equal to c r / ( l — p ) , where cr is the variance of v . 2  2  2  2  m  The simplification made for this matrix is that there are time lapses of one unit between measurements.  If the time lag differs from this, the elements of the matrix become p  where s is the time between the error terms.  s  5  The O L S estimation of an equation with  this error covariance matrix using simple or multiple linear regression will result i n an 5  S e e Judge and others (1985), pages 275 a n d 276 for more details.  Chapter  3.  An Alternative  Simultaneous  Fitting  43  Procedure  unbiased estimate of the coefficients, but the usual estimate of the covariances of these coefficients will be biased. Also, the estimate of the coefficients will not be most efficient (Judge et al., 1985, page 278). The alternatives for single equations are to transform the equations and use simple or multiple linear regression or to estimate the error covariance matrix and use E G L S . These two methods have been proven equivalent by Jaech (1964). Parks (1967) extended the first order autoregressive model to contemporaneously correlated systems of equations ( S U R ) . In order to obtain an estimate of the error covariance matrix, fi, for the system of equations, he performed the following steps: 1. For each equation, multiple linear regression analysis was performed. The coefficients are unbiased and, therefore, were used to obtain an estimate of the error for each sample for each equation (e; ).  6  m  2. A n estimate of p was obtained for each of the g equations of the system by regressing the current error term against the previous error term, resulting in n — 1 pairs of data, as follows: ,  Pi  =  (3.36)  m=2 ^•im^-irn — 1  -"m=2 tm-l e  where e;  m  is the estimated residual for the i  iim-j is the estimated residual for the i  equation and the m th  observation;  equation and the m — 1  th  observation. 3. The estimated values of p for each equation were used to obtain an estimate of P;, 6  F o r simultaneous systems, this first step would be performed using the second stage equations, and  resulting coefficients would be biased, but consistent.  Chapter  3.  An Alternative  Simultaneous  Fitting  44  Procedure  as follows: ( w n -  1  P. (i -  /  0  2  0  PIY  112  Pi  where P ^  =  (l-Pi)  (3.37)  1  Pi  Pi  Pi  ^/(l-p?);  z refers to the  equation of the system;  pi is the slope of the regression line for Ci with ej _i for the i m  m  th  equation.  The inverse of the P; matrix is as follows: ( l - p f )  1  '  2  -Pi  pr  0  1  0  0  . . .  0  0  1  0  ...  0  0  1  ...  0  0  -Pi  1  - p i  0  0  0  (3.38)  A l l elements of the diagonal are equal except the first element which estimates the relationship of en with e;o which cannot be exactly measured. Parks discussed the use of all diagonal elements equal to one. Maeshiro (1980) stated that the use of all diagonal elements equal to one results i n a substantial reduction i n efficiency, whereas Doran and Griffiths (1983) stated that little efficiency is lost. T h e loss i n efficiency will depend on the how much the error terms of an equation are correlated, and also on the sample size. 1  .  "  4. Each equation was transformed using the estimated P ~ matrix, labelled as R;. t  R y, t  = RiZiSi  + Ri€i  (3.39)  Chapter  3.  An Alternative  Simultaneous  Fitting  45  Procedure  where Z ; is a matrix of predicted endogenous variables and exogenous variables (see equation 3.26). For the Parks study, all R H S variables w ere exogenous: T  8{ is a matrix of coefficients for the Z ; with coefficients set to zero if the associated variables do not affect the i  equation.  th  The transformed regression equation has i i d error terms, for large sample sizes. 5. Multiple regression was applied to each transformed equation and the estimated error terms were used to obtain the estimated contemporaneous variances from equation 3.28 for the following matrix.  £ =  &  u  &  u  <7  13  ...  &  l g  cr  21  cr  2 2  0" 3  ...  cr  2 g  033  • • •  031  0"32  0gl  0"g2  2  0g3  (3.40)  03g  99  6. The estimated error covariance matrix was obtained by the following product.  n = p(s<g)i ) p' n  <xnPiPi  0r..PiP  2  0-13P1P3  ^PlP  g  0-21P2P1  0-22p P  0-23P2P3  ^P P  g  <J3lP Pl  <73 P P'  2  <T33P P  ^ P P;  0"glP Pi  0"g2P P  ^ g ^ g ^ ' z  3  s  2  2  3  2  ff  2  3  2  3  3  •••  3  (3.41)  ^ggPpPg  Parks proved that this estimator of ft is consistent. Kmenta and Gilbert (1970) tested the technique given by Parks (1967) and showed that other techniques are more efficient for small sample sizes. T w o of the techniques tested were nonlinear techniques.  One of the techniques, called Z E F - Z E F is a slight  Chapter  3.  An Alternative  Simultaneous  Fitting  46  Procedure  modification of the Parks method which results i n greater efficiency by obtaining an estimate of all of the pi using a simultaneous fit of a l l of the error prediction equations, rather than an O L S fit to obtain pi for each error prediction equation separately. The technique chosen to estimate the error covariance matrix for the M S L S technique was the Parks (1967) method with an added step from K m e n t a and Gilbert (1970), between steps 2 and 3 as follows: 2a. T h e regression of of e; on ej _i is used to obtain estimates of the following matrix. m  m  =  12e  fl'lle  a  0~ 2U  (T  0"31e  0"32e  13e  •••  °~lgt  23e  •••  0~ 2ge  a  22c  (T  <?33e  •••  &3ge  (3.42)  '99*  aij€  is calculated as follows:  "  (3.43)  (n-1)  £  where &ij is the covariance of the i  and the i  th  th  £  error prediction equations if i ^ j,  and is the variance if i — j . These values are the same for all n samples; vi and ij are n by 1 matrices of the residual error from O L S fit of the error prediction equations for the i  th  and j  equations, respectively.  t h  The error prediction equations are then fit simultaneously using E G L S for the system of error prediction equations where tl  e  =  <S> In) • T h e estimated auto-  correlation coefficients (pi) are then used to obtain the P i matrices. This technique results in a consistent estimate of $7 which is required so that the properties of G L S can be applied to E G L S asymptotical!}'. T h e P ; matrix can also be extended  Chapter  3.  An Alternative  Simultaneous  Fitting  47  Procedure  to higher order correlations if these are found to be significant (see Ameniya, 1985, pages 164 to 170).  3.2.2  Heteroskedasticity and Contemporaneous Correlation  The error covariance matrix for heteroskedastic, contemporaneously correlated systems of equations without serial correlation (independent, not identically distributed) is block diagonal as shown in equation 2.22. shown as $ =  where  The variance for an individual equation can be  is a diagonal matrix with unequal diagonal elements.  To  obtain a consistant estimate of the error covariance matrix, a consistent estimate of a\ and a consistent estimate of ^ are required. A method for estimating the error covariance matrix for heteroskedastic, contemporaneously correlated systems of equations was not found i n literature. Parks (1967) method for autocorrelated systems was therefore extended to heteroskedastic, contemporaneously correlated systems. The following steps were derived to obtain a consistent estimate of the error covariance matrix. 1. For each second stage equation, simple or multiple linear regression is used to obtain consistent estimates of the error terms for each sample as in step 1 for autocorrelated errors. The estimated coefficients will be consistent. If the system of equations is not simultaneous (stage 1 of M S L S procedure is not required), the estimated coefficients will be also unbiased. 2. The error terms, squared, are then graphed against the predicted values for the L H S endogenous variables using coefficients from step 1 for each equation as follows:  Vim  ^im  ^-im  =  (3.44)  ZimCli  Vim  Vim  (3.45)  Chapter  3. An Alternative  where u ; yi  m  Simultaneous  is e; for the i m  Fitting  48  Procedure  equation and the m  observation;  is the predicted endogenous variable from step 1 for the i  m  th  equation and  the m. observation; th  Z;  m  is the vector of predicted R H S endogenous variables and exogenous  variables for the i  th  equation and the m  sample;  th  a; is the coefficient matrix from the step 1 simple or multiple linear regression fit of each equation. 3. Using the graph of estimated error squared versus predicted endogenous variables, an estimated functional form of the variance for each given sample (represented by the squared error) against the expected value of the endogenous variable given the R H S variables is chosen.  «L=/(fcm)  ( - ) 3  46  This equation may be fit using simple or multiple linear regression for each equation, resulting i n a consistent estimate of the variances (Judge et al, 1985, p. 437) or all equations may be fit simultaneously using S U R as with step 2a of the process to obtain the error covariance matrix for autocorrelated systems. 4. T h e fitted equations are then used to obtain crf  m€  = u\ to yield the following m  matrix. Vile  0  0 0  0  0  0  0  . .  0  0  . .  0  .  0  <?t3e •  0  .  (3.47)  Chapter  3. An Alternative  Simultaneous  Fitting  49  Procedure  The inverse of this matrix is as follows: o  o  o  o  i/&  o  o  0  0  l/*  o  o  iMie  w r  5. T h e matrix W  {  1  1  i2e  o  i3e  (3.48)  o  is used to transform each of the g equations of the system resulting  in the transformed equation as follows: (3.49)  W r ' y i r W r ^ + W r ' e ,  where Z j is a matrix of predicted endogenous variables and exogenous variables (see equation (3.26)): Si is a matrix of coefficients for the Z j with coefficients set to zero if the associated variables do not affect the i  th  equation.  The transformed regression equations have i i d error terms, for large sample sizes. 6. Multiple linear regression is applied to each transformed equation and the estimated error terms from the weighted regressions are used to obtain the contemporaneous variances using equation 3.28.  s =  °12  &2l  0"22 0~23 • • • CF  031  032  O-gl 0-  g2  0"l3  • • • &lg  0"ll  0"33  0g  3  •••  0"3p  '99  2g  (3.50)  Chapter  3.  An Alternative  Simultaneous  Fitting  50  Procedure  7. The estimated error covariance matrix, O is obtained by the following product. cTnWxW;  chzWiW'j  <r W W;  <x W W'  2  ^  cr  2  2 1  n = w(s®i ) n  w  3 1  2  w  3  w ;  * iW WJ f l  B  22  3 2  2  w3w  <r W W e 2  g  & WW l3  a  2 3  1  3  W W 2  <T W W 3 3  3  3  3  ^2 W W; 9  < ^ w  2  3  w ;  2  (3.51) T h e estimate of  is the product of consistent estimates and is therefore also a consistent  estimate. The assumption made i n this explanation of how the estimated error covariance matrix is obtained was that the variances of the error terms for individual equations are related to the expected value of the endogenous variable given the exogenous variables. Other assumptions could be made i n estimating the W ; matrix (see Judge et al., 1985, pages 424 and 425). Duncan (1983) provided a proof that for heterogenous, contemporaneously correlated systems of equations, the S U R estimator is not most efficient.  He suggested that an  alternative estimate of the error covariance matrix be used to improve efficiency of the estimated coefficients. T h e modified Parks (1967) technique presented here should result in a improvement i n efficiency for large sample sizes, if the assumptions made concerning the nature of the heteroskedasticity are correct.  3.2.3  Autocorrelation, Heteroskedasticity,  and Contemporaneous Correla-  tion The error covariance matrix for single equations which are heteroskedastic and autocorrelated (neither idependent nor identically distributed) is more complex i n that the correlation of error terms is complicated by unequal variances. This can occur if data are  Chapter  3.  An Alternative  Simultaneous  Fitting  Procedure  51  measured for different units over a period of time (cross sectional, time series data) such as with stem analysis data or with permanent sample plot data for forest inventories, or can occur for one unit measured over a long period of time during which climatic changes have influenced the relationship of error terms among measurements. These two situations present different problems and therefore different solutions were derived.  Cross Sectional, T i m e Series D a t a D a t a which are collected by measuring each unit over a period of time are termed cross sectional, time series data. D a t a can be pooled and one equation can be fit to the pooled observations.  In this case, the error term for a single equation can be divided  into components and variance estimates  can be obtained for (1) the variance for each  unit across the time period (2) the variance for each time across the units and (3) the residual variance. The result is that the intercept of the equation varies over time and over units. The estimation of the error components has been attempted by many authors (e.g. Arora, 1973; Balestra and Nerlove, 1966; Wallace and Hussain, 1969) and is rela,ted to the use of indicator variables to indicate different time periods and units. However, time and unit differences may affect the slopes of the equation as well as the intercept. A n extension to slope coefficients has also been attempted by many authors (e.g. Hildreth and Houck, 1968; Swamy, 1970; Swamy and Mehta, 1977) and has been called random coefficient modelling. However, the estimation of random coefficients for data with unequal numbers of measurements within each unit is difficult. Also, even though estimates of variances can be found, the reasons for the time and cross sectional differences such as changes i n climate, differences i n genetics, and other factors are not explicitly explained by the equation. For forestry problems, random coefficient modelling has been extended to random coefficient functions, in which parameters  are predicted by other exogenous variables  Chapter  3.  An Alternative  Simultaneous  Fitting  52  Procedure  resulting i n a system of parameter prediction functions. First, an equation is fitted to each unit over the time periods.  T h e estimated coefficients are then related to other  exogenous variables using a set of parameter prediction equations.  These parameter  prediction equations are then fitted using simple or multiple linear regression on each equation. Ferguson and Leech (1978) argued that the traditional fitting of the parameter prediction equations did not account for the inter-relationships of the parameters, nor did it account for the error of the coefficients created by fitting one equation for each of the sample units.  They proposed an alternative technique by estimating the error  covariance matrix for the system of parameter prediction equations and by using this estimated matrix to find estimates of the coefficients of the system using E G L S . However, they did not attempt to account for the serial correlation i n fitting an equation to each sample unit, resulting i n a biased estimate of the variance of each coefficient. Also, they assumed that the errors for the parameter prediction equations were correlated, whereas the parameters themselves may be correlated resulting in a simultaneous set of parameter prediction equations. For cross-sectional, time series data for forestry then, the procedure used by Ferguson and Leech (1978) could be modified by (1) accounting for serial correlation when fitting an equation to each sample unit through a transformation of variables or by estimating the error covariance matrix and using E G L S , and (2) identifying the parameter prediction equation as a system of simultaneous equations rather than a system of S U R equations. The M S L S procedure can be modified for a simultaneous, heteroskedastic  system of  parameter prediction equations as follows: 1. First, simple or multiple linear regression is used to fit a chosen equation to each of the sample units across time measurements. serial correlation.  Equations would then be tested for  Chapter  3.  An Alternative  Simultaneous  Fitting  53  Procedure  2. If serial correlation is found to be significant, an appropriate transformation followed by a new simple or multiple linear regression fit, or an estimation of the error covariance matrix and an E G L S fit of each equation separately is made. From this regression, consistent estimates of the coefficients and their associated variances are obtained. 3. The simultaneous system of parameter prediction equations is then chosen by using simple or multiple linear regression on each equation to determine which exogenous variables to include and to decide which parameters are strongly correlated. 4. To remove the simultaneity bias, each parameter is predicted from all of the exogenous variables i n the system of parameter prediction equations.  These predicted  parameters are then substituted for parameters which appear on the R H S of equations i n the system. 5. Multiple linear regression of each equation i n the system is then performed. 6. The method proposed by Ferguson and Leech (1978) is then used to combine the variances from steps 2 and 5 into one matrix fl and E G L S is used to fit the system of equations simultaneously.  Single Unit Over A Long Period Autocorrelation and heteroskedasticity  can also occur i n individual units that are  measured over a long period. In this case, the serial correlation between respective time measurements  changes depending on the pair of measurements.  This could occur in  forestry as data from a permanent sample plot which has been measured many times over a long time period. In this case, climatic changes may have affected the relationships between time periods. In fact, the fitting of an equation to each unit as described  Chapter  3.  An Alternative  Simultaneous  Fitting  54  Procedure  for cross sectional, time series data, may show error terms which are autocorrelated and heterogenous.  Engle (1982) and Cragg (1982) proposed techniques to estimate coeffi-  cients under this type of error structure for individual equations.  As an alternative,  Gregoire (1987) suggested an initial transformation to remove either autocorrelation or heteroskedasticity. For systems of equations which show this type of error structure, the following technique to estimate £1 for step 2 of M S L S is proposed: 1. The transformation suggested by Cochrane and Orcutt (1949) is used to remove serial correlation, assuming first order autocorrelation. The transformation involves taking differences for all variables i n the equation, meaning that one observation is lost. 2. Once serial correlation is removed, the steps given for heteroskedastic systems using the transformed data are followed. The resulting estimated coefficients and their variances will be for the transformed data, but, for this transformation, they will also apply to the untransformed data. because one observation is lost, the M S L S fit will not be asymptotically efficient.  Also, The  loss i n efficiency will depend on the number of samples, and the degree of the serial correlation.  3.3  Confidence Limits, Hypothesis Testing, and Prediction  The M S L S and 1MSLS procedures are extensions to 3SLS, which i n itself is an extension to the E G L S procedure, applied to a system of equations. Because the E G L S procedure produces estimates of coefficients which are asymptotically normally distributed, for large  Chapter  3.  An Alternative  Simultaneous  Fitting  55  Procedure  sample sizes, confidence limits for coefficients from M S L S can be calculated by the following: 4  where  is the r  th  ±  (- )  Za/zcrgir  3  estimated coefficient for the i  th  52  equation;  z /2 is the value from the standard normal probability distribution corresponding a  to the 1 — a j 2 percentile; cr^. is the estimated standard deviation for this coefficient which is the square root of the appropriate variance from the coefficient covariance matrix. The coefficient covariance matrix is calculated as in equation 3.31. For a single coefficient, the following hypothesis may be proposed. H  0  •  #i :  4  = £o  (3-53)  4  + So  (3.54)  where S is the hypthesized value for the population value, Si . 0  T  In order to test this hypothesis, a confidence interval for this coefficient could be calculated, and the hypothesis rejected if the confidence interval does not contain 6 . 0  Alter-  natively, the following test statistic could be used. test statistic  (3.55)  =  Under the assumption that the hypothesis is true, the test statistic is distributed asymptotically as the standard normal distribution. A one-sided hypothesis statement may also be tested. If several coefficients of the system are to be tested as a group, the following hypothesis statement may be established. Ii •  R-A  H, :  RA  0  r e d u c e d  reduced  = r  (3.56)  # r  (3.57)  Chapter  3.  An Alternative  Simultaneous  Fitting  56  Procedure  where R is a q by K matrix; is a matrix of coefficients i n the system. Note that coefficients which have  ^reduced  been constrained to be zero to represent R H S variables which do not affect the i  equation have been removed. T h e size of this matrix has therefore  th  been reduced from g(g — 1 + k) by 1 to K by 1; r is a q by 1 matrix; K is the number of coefficients i n the system which were not constrained to be zero; q is the number of comparisons to be made. Comparisons can be made between coefficients of different equations i n the system. For instance, to test H  0  : 6~n = 8 i or Ho : £n — £21 = 0 , the following matrices would be 2  derived. R^^  reduced  (3.58)  ^  Su  1 0 - 1 0 0  (3.59)  '21  J22  The test statistic for 3SLS and therefore for M S L S is as follows: test statistic  where Var ^A  Teduced  = (RA  ^J  reduced  - rj  (R Var (^A  j  reduced  Ilj  (KA  Teduced  - rj  (3.60)  is Var (A) reduced by eliminating columns and rows which  correspond to coefficients which were constrained to be zero. This test statistic is asymptotically distributed as the % distribution with q degrees of 2  freedom under the assumption that the hypothesis is correct (Judge et al., 1985, page 614).  Chapter  3.  An Alternative  Simultaneous  Fitting  57  Procedure  Given a set of exogenous variables, the mean predicted L H S endogenous variables can be found using the coefficients from the M S L S fit. 1. T h e set of exogenous variables is put into the first stage equations from the first step of the M S L S procedure to obtain predicted endogenous variables and these are substituted for endogenous variables which appear on the R H S of equations in the system.  The set of new exogenous variables and predicted R H S endogenous  variables is used to create the following matrix.  (3.61)  where  is the matrix of R H S variables, new exogenous and predicted R H S endogenous, for the i  ih  equation.  2. T h e mean predicted L H S endogenous variables are then calculated by the following: y, where  2k  = Z A  (3.62)  f c  is the matrix of mean predicted L H S endogenous variables given the set of exogenous variables and predicted R H S endogenous variables.  The variance matrix of the mean predicted L H S endogenous variables can be calculated by the following:  V a r  (y|Z,J = Z„ Var (A) Z  h  where Var (A)  (3.63)  is calculated as i n equation 3.31.  Confidence limits for a mean predicted L H S endogenous variable can be found by: y,z,  ± V z ^ y ^  ( 3  6 4  )  Chapter  3.  An Alternative  Simultaneous  Fitting  Procedure  58  As with the confidence limits for coefficients of the system, the normal distribution is used for for M S L S .  , because the asymptotic properties of E G L S are assumed to be appropriate  Chapter 4  Procedures for Comparison of Fitting Techniques  4.1  Selection of Equations for the Systems In order to compare the alternative technique ( M S L S or I M S L S ) to the common  approaches of (1) O L S applied to each equation and (2) the O L S fit of a composite model, first the equations of the system were chosen.  C o m m o n linear equations were  selected for each of the three forestry problems and, i n situations where several equation models were available, each model was regressed using simple or multiple linear regression and a model was selected based on simplicity and a high coefficient of determination  1  (R  2  value) . Variables within the model were considered to be important to the model if the 2  R  change (the change i n the R  2  2  value due to the addition of that variable) was greater  than 0.005. N o attempt was made to relate this to a level of significance, because the assumption of normally distributed error terms was not made; therefore, the usual tests using the t or F distribution were not appropriate. The choice of equations was restricted to linear models to restrict the scope of this thesis; however, extensions of M S L S and I M S L S to nonlinear equations using generalized nonlinear regression should be possible. Since the simple or multiple linear regression of equations of a simultaneous system of equations produces biased results, the coefficient of d e t e r m i n a t i o n is only useful as a goodness-of-fit measure. 1  T h e n o t a t i o n , R value, w i l l be used i n this thesis to denote the coefficient of d e t e r m i n a t i o n for b o t h simple a n d m u l t i p l e linear regression, a l t h o u g h r is n o r m a l l y used for simple linear regression. 2  2  2  59  Chapter  4.2  4.  Procedures  for Comparison  of Fitting  60  Techniques  Obtaining the O L S , Composite, and M S L S Fits Once the equations of the system were chosen, the residual errors for the simple  or multiple linear regression fit of each equation were graphed and tested for (1) heteroskedasticity and (2) serial correlation. If either or both of these properties were noted for any equation, the O L S fit of this equation was appropriately modified so that the best O L S fit of each single equation of the system was obtained. The results of this fitting were then used as the single equation O L S fit for comparison with other techniques. If possible, the equations of the system were combined into a composite model. O L S was then applied to this composite model and residuals were tested for heteroskedasticity and serial correlation. O L S was then modified to account for the properties of the error term and a new O L S for the composite model was fitted and used as a comparison with other techniques. To obtain the fit for the M S L S method, the technique outlined in Chapter 3 was used as follows: 1. O L S was used to obtain a predicted value for each endogenous variable from all of the exogenous variables of the system (first stage equations).  These predicted  endogenous variables were then substituted for corresponding endogenous variables on the R H S of each of the equations of the system (Stage 1). 2. O L S was again used to fit each of the equations in the system once the predicted endogenous variables were substituted for the R H S endogenous variables (Stage 2, second stage equations). 3. The residuals from the second O L S fit were used to determine if heteroskedasticity or serial correlation were present in the system, because this second fit results i n consistent estimates of the coefficients of the system.  Chapter  4.  Procedures  for Comparison  of Fitting  Techniques  61  4. Once the structure of the error covariance matrix was determined, an appropriate technique for estimating the error covariance matrix, ft, was chosen and used (Stages 3 and on). The number of steps required depended on the characteristics of the error structure. If neither heteroskedasticity nor serial correlation were present in the equations of the system, the error covariance matrix became block diagonal with all elements of the block being equal. M S L S is equal to 3SLS i n this case, and 3SLS for large sample sizes or I3SLS for small sample sizes was used. 5. The estimated error covariance matrix, fl, was then used to fit the system of equations simultaneously using E G L S for the system. 6. For small sample sizes, a new value for fl was then calculated using the estimated coefficients from the E G L S fit, and the process was iterated until convergence occurred.  Convergence was considered to occur if each of the new coefficients was  within 0.00005 of the corresponding previous coefficient as follows: \8u - 6 where 6u is the I  th  a{new)  \  coefficient for the i  tn  < 0.00005  (4.65)  equation.  If convergence did not occur, the first estimate (from step 5) was used. N o attempt was made to try alternative nonlinear fitting algorithms. The results from the M S L S (or I M S L S ) were then used for the comparison with the more commonly used techniques. If a system of equations was found to be both heteroskedastic and autocorrelated, the M S L S steps were modified according to the description given i n Chapter 3. Once the set of coefficients, and their covariances were calculated for each of the techniques ( O L S , composite model, and M S L S or I M S L S ) , the results of the three techniques were compared. Values which were compared among the three techniques included  Chapter  4.  Procedures  for Comparison  of Fitting  62  Techniques  goodness-of-fit measures, the traces of the estimated coefficient covariance matrices, and the estimated coefficients and their standard deviations. In addition, the techniques were ranked for (1) the amount of information about the system produced, (2) consistency of the estimators for the coefficients and for the variance of the coefficients as reported in literature, (3) ability to calculate confidence intervals, (4) asymptotic efficiency as reported in literature, (5) compatibility, and (6) ease of fit in terms of the number and difficulty of the steps required.  4.3  Tests for Heteroskedasticity and Serial Correlation  Although tests for serial correlation i n systems of equations are available (Durbin, 1957; Harvey and Phillips, 1980), because the presence of heteroskedasticity or serial correlation in any equation will affect the error covariance matrix of the system of equations, tests for single equations were chosen. For serial correlation, the error terms were ordered in time sequence by age if available (i.e.  tree or plot age), or by the magnitude of the predicted endogenous variable on  the L H S , from the smallest to largest value.  A graph of the ej  simple linear regression analysis were done.  If the coefficient of determination from  m  with the e; _i m  and  this regression was greater than 0.005, the Durbin and Watson bounds test (1951) was used; Epps and Epps (1977) found this to be reliable even if heteroskedasticity is present. However, the bounds test can only be used if an intercept item is included in the equation. Also, Harvey and Phillips (1980) found that the zone representing inconclusive results is large if the number of predictor variables is large. For forestry problems, the number of predictor variables is generally small. The test statistic for the bounds test is as follows: di  where e ^ , ii -i m  —  (4.66)  are the estimated errors from the simple or multiple linear regression fit  Chapter  4.  Procedures  of the i  for Comparison  of Fitting  63  Techniques  equation of the system, once simultaneity bias is removed.  The tables for the Durbin and Watson bounds test (1951) were extended to more R H S variables and to larger and smaller sample sizes by Savin and W h i t e (1977) and these more extensive tables were used.  However, because the bounds test has a region of  inconclusiveness, and the tables are only available for up to a sample size of 200, a second test used by Newberry and Burkhart (1986), called the runs test, was performed if the results of the bounds test were inconclusive or if the sample size was large. For the runs test, each ordered  efj  m  and i i - \ pair was given a "+" if the difference value is positive. If m  the difference was negative, a "—" is assigned to the pair. The sign test (Conover, 1980) was then used to determine if the number of positive pairs is significantly large (positive serial correlation), or alternatively _ if the number of positive pairs is significantly small (negative serial correlation) using a one-sided test. A two-sided test can be used to test for the presence of serial correlation, either positive or negative simultaneously. To examine the error terms for heteroskedasticity, a graph of ef  m  versus yi  m  and simple  linear regression were done. If the coefficient of determination from this regression was greater than 0.005, further testing was done; otherwise, the error terms were considered homoskedastic.  If a test was deemed necessary and if no serial correlation was found,  the Goldfeld and Quandt test for heteroskedasticity  (1965) was used.  The data were  first ordered by the predicted endogenous variable, from smallest to largest if the variances were considered nondecreasing, and from largest to smallest, if the variances were considered to be nonincreasing. The ordered data were then divided into three groups, and a linear regression was performed for the group with the lower variances and for the group with the higher variances. The ratio of the mean squared error for the regression of data with the higher variances, to the mean squared error for the regression of data with the lower variances was calculated as the test statistic. This test assumes that the variance is monotonically increasing or decreasing, which is likely for forestry situations.  Chapter  4.  Procedures  for Comparison  of Fitting  64  Techniques  If serial correlation was found, the data were first transformed to remove serial correlation (Cochrane and Orcutt, 1949), because the Goldfeld and Quandt test was found to be i n error if serial correlation is present (Epps and Epps, 1977). The Goldfeld and Quandt test was then applied to the transformed data. For all tests, an alpha level of 0.05 was used.  4.4  Criteria for Comparison of the Three Techniques  4.4.1  Goodness-of-fit Measures  For comparison of the fit of the three techniques for the sample data, the goodnessof-fit measures were calculated.  Because the O L S technique minimizes the sum of the  squared error, goodness-of-fit measures based on the square or absolute value of the error terms will be best for the O L S fit. However, these measures are commonly used in forestry to indicate how well the fitted equation relates to the sample data; therefore, this comparison was made.  The goodness-of-fit measures do not indicate whether the  estimated coefficients are unbiased, however. 1. For the O L S and M S L S fitting techniques, the F i t Index (F.I.) was calculated for each of the g equations of the system. F.I*  = 1 -  (4.67)  where n is the number of samples; Vim,Vim  are values for the m  th  sample of the i  equation for the endogenous  th  and predicted endogenous variables, respectively; y is the average of the sample values for the i {  th  endogenous variable.  Chapter  4.  Procedures  for Comparison  of Fitting  65  Techniques  The F.I. is expected to be highest for the O L S fit of each of the equations, because the O L S fit minimizes the unweighted error terms; whereas, the M S L S fit minimizes the weighted error terms. To compare the composite model to the alternative methods for goodness-of-fit, the F.I. was calculated for the composite model and compared to the F.I. calculated for each of the other two techniques for the corresponding endogenous variable. 2. The Mean Absolute Difference was calculated for each equation over the range of the sample data to compare the size of the differences between the O L S and the M S L S fits. The data were evenly divided into five classes based on the value for the endogenous variable and the mean difference was calculated for each class. (4.68) where M.A.D.u the I  th  is the mean absolute deviation for the i  ih  endogenous variable in  class;  c is the number of observations in the I  th  class, approximately equal to n/5  for all equations and all classes. To compare the composite model to the other fitting techniques, the M . A . D . was calculated for each class of the endogenous variable which appears as a L H S variable in the composite model. 3. The Mean Difference ( M . D . ) was calculated for each equation over the range of the sample data to compare the direction of differences between the O L S and the M S L S fits. The data were evenly divided into five classes based on the value for the endogenous variable and the mean difference was calculated for each class.  Chapter  4.  Procedures  where M.D.u  for Comparison  of Fitting  66  Techniques  is the average deviation for the i  th  endogenous variable in the  I  th  class; c is the number of observations i n the I  th  class, approximately equal to n / 5  for all equations and all classes. To compare the composite model to the alternative techniques, the M . D . was calculated for each of the three techniques for the variable which appears as the L H S variable in the composite model.  4.4.2  Trace of the Estimated Coefficient Covariance M a t r i x  The O L S fit of simultaneous equations results in biased and inconsistent estimates of the coefficients. The comparison of the traces of the estimated coefficient covariance matrix (sum of the estimated variances of the coefficients) from the O L S fit to that from the M S L S fit will not, therefore, indicate the relative efficiency. However, because the use of the estimated coefficient covariance matrix from the O L S fit is common, and the differences in the estimated matrices are of interest, a comparison was made. Also, this comparison was used to examine the magnitude of the differences and, therefore, to indicate that the use of the O L S fit for testing hypothesis statements and for calculating confidence limits for simultaneous systems, can be quite incorrect. Initially, the traces for the complete system of equations were compared, and then the traces of each submatrix, corresponding to a single equation, were compared for the two techniques, For comparison of the composite model to the O L S , the trace of the estimated covariance matrix for the fit of the composite model was compared to the trace of the estimated covariance matrix for the O L S fit of the equation having the same endogeneous variable on the L H S as the composite model. Similar trace values were calculated to compare the M S L S fit to the fit of the composite model.  Chapter  4.4.3  4.  Procedures  for Comparison  of Fitting  Techniques  67  Table of Estimated Coefficients and Standard Deviations  Because the trace of a estimated coefficient covariance matrix may mask some of the differences for single coefficients, a table of the estimated coefficients and their standard deviations was compiled. Because the composite model fit does not necessarily have the same variables, only the O L S , M S L S , I M S L S fits were compared.  4.4.4  Ranking for Other Features  A n overall ranking system has been used by several authors including Amateis and others (1984). The ranking system used here is simple i n that a higher rank indicates that the technique has more of the desirable criteria of fitting techniques. The ranks were assigned as follows: 1. The amount of information given. R a n k = l , if the technique estimates one variable only; Rank=2, if all endogenous variables are estimated. 2. The consistency of the coefficient and covariance matrix estimates as reported in literature. R a n k = l , if neither is consistent; Rank=2, if both are consistent. 3. Ability to calculate confidence limits (requires a consistent estimate of the covariance matrix and a known probability distribution). R a n k = l , not able; Rank=2, can be calculated.  Chapter  4.  Procedures  for Comparison  of Fitting  Techniques  68  4. Asymptotic efficiency as reported i n literature. R a n k = l . larger variance; Rank=2, smaller variance. 5. Compatibility across equations. R a n k = l . not compatible; R a n k = 2 , compatible. 6. Ease of fit. R a n k = l . most difficult; Rank=2, medium difficulty; Rank=3, least difficult. The ranks were assigned and summed for each of the three techniques used. For each criterion, equal weight was assigned, except that for the last item three ranks were assigned.  The ranks could have been given other weights depending on the item, but  the weights given are largely a function of the researcher or his organization, and so the weighting of ranks would likely change depending on the user of the information. For this thesis, the equal weighting was chosen; information was provided so that other weights could be assigned and a new overall rank calculated for each fitting technique if desired.  Chapter 5  Application 1: Tree V o l u m e Estimation  5.1  Introduction The estimation of gross tree volume has been well researched and many methods and  models have been employed.  Models have been developed for the estimation of total  volume, which includes volume of the main bole of the tree from ground to tree tip, and for merchantable volume which includes volume for the merchantable part of the main bole only. T w o main types of models have been developed. The first type uses a system of equations to estimate total volume, and to estimate the ratio of merchantable volume to total volume (volume ratio), and then merchantable volume is calculated from the volume ratio and the total volume. The second type of model for estimating tree volume uses a mathematical equation to represent the shape of the tree bole (taper), and then integration is used to estimate volume for the whole bole or for any merchantable part. To limit the scope of this thesis, the examination of tree volume estimation was restricted to the first type of model, although the second type of volume estimation models using a taper function can also be considered a system of equations (Reed and Green, 1984).  5.2  Preparation of D a t a  Sectioned tree data for 1818 pine trees i n Alberta were obtained from the Alberta Forest Service ( A F S ) . Trees were sampled by A F S personnel by selecting particular individuals with desired traits, or by selecting plots within stands and sectioning all of the trees in  69  Chapter  5.  Application  1: Tree Volume  Estimation  70  the selected plot. Trees were sectioned at 0.3 metres above ground (stump height), 1.3 metres above ground (breast height), 2.8 metres above ground (2.5 metres above stump height), and subsequently at 2.5 metre intervals. Sections were further divided if decay was found within the section. For each section, the diameter inside and outside bark at the top of the section were measured and the section lengths were recorded. The number of tree rings and the dimensions of any decay were also measured at the top of each section. More details about the tree sectioning technique can be found i n Alberta 3 Forest  Inventory:  Tree sectioning  manual  Phase  ( A n o n . , 1985).  D a t a were collected throughout the province. To limit these data to lodgepole pine (Pmus  contorta  var. latifolia  Engelm.), data from Northeastern Alberta Alberta were  removed, because most of the trees in this region are considered to be jack pine nus banksiana  (Pi-  Lamb.). Also, trees which were forked or had broken tops were deleted.  Table 5.2 is a distribution of the remaining 1097 trees. For each tree, total volume from ground to tip was calculated by assuming that the first section has a cylindrical shape (from ground to 0.3 m above ground), the top section has a conical shape, and the remaining sections have a paraboloid frustum shape. Merchantable volume was also calculated for each tree from a 0.3 metre stump height above ground, to a 7.0, 10.0, 13.0, and 15.0 centimetre top diameter inside bark (top dib). In addition, tree height (sum of the section lengths), merchantable length (the length of the merchantable part of the stem), d b h , and stump dib (diameter inside bark at the top of the first section) were calculated. Each tree therefore represented four merchantability standards with total volume included as another merchantability standard (volume from a 0.00 metre stump height to 0.0 cm top dib). Because these five merchantable volumes per tree do not represent independent data, only one of the five merchantable volumes was retained for each tree.  To determine which of the five merchantable volumes to  retain on a tree, a systematic selection was performed by selecting the first merchantable  Chapter  5.  Apphcation  1: Tree Volume  Estimation  Table 5.2: Distribution of Selected Trees by Height and D b h Classes height i n metres dbh in cm 1.1 to 5.0 5.1 to 10.0 10.1 to 15.0 15.1 to 20.0 20.1 to 25.0 25.1 to 30.0 30.1 to 35.0 35.1 to 40.0 40.1 to 45.0 45.1 to 50.0 50.1 to 55.0 55.1 to 65.0 Total  1.30 to 5.00  5.01 to 10.00  10.01 to 15.00  15.01 to 20.00  20.01 to 25.00  25.01 to 30.00  30.01 to 35.00  35.01 to 40.00  4  4 21  10  15  104  20  7  89  152  22  1  25  138  73  3  3  48  99  29  1  180  9  59  56  3  127  3  21  40  6  70  1  4  12  1  19  1  6  5  12  1  1  2  '  1  31 139  1 4  Total  44  233  270 240  1 X  371  279  148  17  1  3  1  1097  Chapter  5.  Application  1: Tree Volume  72  Estimation  volume (0.00 stump height to 0.0 cm top dib) for the first tree i n the sample, the second merchantable volume (0.30 stump height and 7.0 cm top dib) for the second tree in the sample, and so on.  The result was that approximately one fifth of the sample trees  represented each of the five merchantability classes. Because some of the sectioned trees were sampled from plots, the tree data were not considered independent. Also, if all 1097 sample trees were used in the analysis of systems of equations, the size of the error covariance matrix would be g x 1097 by g X 1097.  The  inverse of this error covariance matrix must be calculated for the M S L S procedure, and the calculation would be difficult for an array of this size. To reduce the dependence of trees sampled within plots, and also to reduce the size of the error covariance matrix, a random sample was selected from the 1097 trees. Initially, 500 trees were selected, but, because only seven megabytes of computer memory for running computer programs were available, difficulties with inverting the g X 500 by g x 500 error covariance matrix were encountered. Therefore, a second sample of 100 trees was selected for the analysis. The sample data were graphed and no outliers were found. sample trees is presented in Table 5.3.  5.3  The distribution of these 100  1  M o d e l Selection The equation to estimate total volume was restricted to standard volume functions  which predict total volume as a function of total height and dbh. The models selected for possible inclusion into the system of equations were as follows: 1. A nonlinear model proposed by Schumacher and Hall (1933) was selected for testing. total volume  = 8Q dbh  131  height  132  errori  (5.70)  A copy of the d a t a selected for each application presented i n this thesis can be obtained on diskette or tape by contacting the author. 1  Chapter  5.  Application  1: Tree Volume  Estimation  Table 5.3: Distribution of 100 Sample Trees  dbh in cm  1.30 to 5.00  1.1 to 5.0 5.1 to 10.0 10.1 to 15.0 15.1 to 20.0 20.1 to 25.0 25.1 to 30.0 30.1 to 35.0 35.1 to 40.0 40.1 to 45.0  5.01 to 10.00  height i n metres 10.01 15.01 20.01 25.01 to to to to 15.00 20.00 25.00 30.00  35.01 to 40.00  Total  0 2  1  1  11  1  7  14  5  2  14  7  1  24  4  10  3  17  7  2  1  2  3  2  2  3 13 26  45.1 to 50.0 50.1 to 55.0 55.1 to 65.0 Total  30.01 to 35.00  2  11  1  1  0 0 0  3  21  33  30  10  3  0  100  Chapter  5.  Application  1: Tree Volume  where total volume  74  Estimation  is the volume of the main bole of the tree from ground  to tree tip; dbh is the diameter inside bark measured at 1.3 metres, called diameter at breast height; is the total tree height from ground to tree tip;  height  Bo, B\, Q are coefficients to be estimated; 2  error  x  is the error term.  Using logarithms, the model can be transformed to a linear model, and then estimated coefficients can be obtained using multiple linear regression.  This model  was found to be the most applicable to data from Alberta (LeMay, 1982). However, results are in terms of logarithms and must be converted to the original units. 2. A simple linear model proposed by Spurr (1952) was shown by L e M a y (1982) to be also quite accurate for Alberta data. total volume  = 8  3  + (3 dbh  2  4  height  -f error  2  (5.71)  where Q , 3 are coefficients to be estimated; 3  error  A  2  is the error term.  Spurr's model is based on the mathematical relationship of area at the base and height to volume. 3. Honer (1965) proposed a transformation of total volume resulting i n the following model. dbh total  2  volume  06 = 05 + 1 height  error  3  (5.72)  Chapter  5.  Application  1: Tree Volume  75  Estimation  where 05, 06 are coefficients to be estimated; error  3  is the error term.  4. Another simple linear model was identified as follows: Total  Volume  where 07, 0 , 0 , /3 8  error  9  4  10  = j3 -j- fl^dbh + figdbh 7  2  + (3 height w  (5.73)  -f error  4  are coefficients to be estimated;  is the error term.  This model is simple in that total volume, total height and dbh are all untransformed. Generally, the variance of the error terms for total volume linear models are heteroskedastic.  The logarithmic transformation required to linearize the first model removes this  heteroskedasticity. For each of these models, simple or multiple linear regression was used to estimate coefficients. Spurr's model was selected for the system of equations to calculate volume, because the model is simple, logically based on the calculation of volume using geometric formulae, and the fit statistics were high (coefficient of determination (R ) 2  0.9881).  2  value of  The Schumacher and H a l l , and the Honer models require a transformation of  the total volume variable. Since transformations of the endogenous variables appearing on the L H S of equations in the systems might cause unnecessary complications, these models were rejected. the R  2  The remaining linear model (equation 5.73) was not selected as  value for this model (0.9827) was less than that of the Spurr model.  For the prediction of merchantable  volume, two equations are used.  A n equation  to estimate the volume ratio is paired with an equation which calculates  merchantable  2  C o m p a r i s o n of the coefficient  of determination as a. goodness-of-fit  measure was made;  however,  for the Schumacher a n d H a l l , and the Honer models, this coefficient of determination is based on the transformation of volume.  Chapter  5. Application  1: Tree Volume  76  Estimation  volume from the estimated total volume and volume ratio. Honer (1964, 1967) proposed several equations to estimate volume ratio. „, „  =  VR T  ,„  ~  n  „  VR  VR  =  hm  1 2  „  ——+/?  height  „  =  -rrr.  „  /? +/3  ^  17  + As  f  hm  —  „  2  <+/3ls  _  \height  f top dib\  ft (^ )  3  (  1 3  \  ,  2  J  + error  5.74  B  ftopdib\  i  <"5)  + f t e ( ^ r )  r  hs  1 + T—TT  \ ftop  height J \  dib\  2  /  ( (  -777- + & 9 " dbh J  1  \\  '  hs  \ (top W  height J \  ^  dib  x  dbh  (5.76)  -\-erroTj v  R  2"^ ^  - ^ ^ M  i  - ^ )  +  m  "  -  (5 77)  where VR is the volume ratio; hm is merchantable height, or the merchantable length plus the stump height; top dib is the top diameter inside bark for the merchantability standard; hs is the stump height, 0.0 for the first merchantability standard (total volume), and 0.30 for the remaining four merchantability standards; Bn through /?22 are coefficients to be estimated; errors  through error  8  are the error terms.  Honer (1964) used d b h measured inside bark, whereas Honer (1967) used d b h measured outside bark. For this thesis, dbh measured outside bark was used as this measure can be easily obtained on standing trees. Each model was fitted using multiple linear regression. The highest R value (0.9966) 2  was obtained for Honer's fourth model, but this model is based on merchantable height and does not reflect changes i n stump height.  For the first of Honer's models, the R  2  value was 0.9962, and for the second model, the R value was 0.8934. A s with the fourth 2  model, the first and second models do not reflect changes i n stump height. T h e remaining model, equation 5.76, does reflect changes i n both stump height and i n the top dib, but  Chapter  the R  2  5.  Application  1: Tree Volume  77  Estimation  value for this model was only 0.8943, somewhat lower than that for the fourth  model. To reflect the changes i n stump height, and to retain the high fit statistics of the fourth model, the following model was derived. (5.78) where ml is the merchantable length,- which is the merchantable height minus the stump height; 023) 024 error  9  a r e  coefficients to be estimated;  is the error term.  The top dib/dbh  ratio was removed from the model as the inclusion of this ratio to the  model resulted i n a change of the R  2  value of only 0.0006. T h e simple linear regression  fit of this simple model produced a high R value (0.9960) and was chosen for the system 2  of equations to estimate volume. The total volume equation model relies on the measurement of tree dbh and height. Often height is not measured and rather is predicted from dbh. In British Columbia, the B . C . Ministry of Forests and Lands has approved the following height models (Watts, 1983) for use.  height  where 025 through 0 error  w  025 + /3 edbh + /3 dbh 2  2  27  + error  w  (5.79)  height  (5.80)  height  (5.81)  height  (5.82)  3 4  are coefficients to be estimated;  through errori  3  are error terms.  Chapter  5.  AppHcaiion  1: Tree Volume  78  Estimation  The equation used in Alberta is as follows (Edwards. 1987):  height  where /3 , /3 35  error  36  1 4  =  /3  35  e  dlh  (5.83)  error  u  are coefficients to be estimated; is the error term.  This model can be transformed to a linear model using logarithms. Multiple linear regression was used to fit each of these equations.  The first model,  the paraboloid model, was selected as the height model to be included in the system of equations based on simplicity and an R  2  value of 0.7699.  The second and fourth  models were rejected as the intercepts are conditioned to 1.3 metres, and the effect of this conditioning, if the true intercept is not 1.3, would be that estimated coefficients are biased. The fifth model, the A l b e r t a model, was rejected as a transformation of height is required to obtain a linear model and this complication was deemed unnecessary as results from other nontransformed models were superior. hyperbola (equation 5.81), was rejected in that the R  2  The remaining model, the  value was 0.7636, slightly lower  than the chosen model. The selected volume ratio model requires the ratio of merchantable length to total height (height ratio).  Again, since the height and merchantable length are likely not  measured, a model for the estimation of the height ratio was required.  Several linear  models were fitted based on transformations of top dib, dbh, and stump height.  The  following model was selected. (5.84) where HR is the height ratio; 037 > 038) 039 error i  5  a r e  coefficients to be estimated;  is the error term.  Chapter  5.  Application  1: Tree Volume  79  Estimation  Because top dib ranged from 0.00 through 15.0 centimetres, for small trees, the specified top dib was sometimes larger than dbh. To restrict the top dib/dbh allowed a m a x i m u m value of stump bark measured at stump height.  dib/'dbh  The R  2  where stump  ratio, the ratio was  dib is the diameter inside  value for this model was 0.9722.  stepwise regression procedure, other transformations of the top dib/dbh  Using a  ratio, such as the  first and third power, entered into the equation before hs; however, when these other transformations of the diameter ratio were included in the equation, the inclusion of hs in the model resulted in an R  2  value change of less than 0.005. Since the effect of changes  in the stump height on the height ratio was considered important, the equation presented above was selected for the system of equations. The chosen system of equations for estimating tree volume was the following: height  =  Su + 8i dbh  + 8 dbh  2  iz  2  + e  x  (5.85) (5.86) (5.87)  merch. total  volume  (5.88)  volume merch.  volume ml  merch.  height  = hm  =  total volume  —  height  =  ml + hs  x  HR  x  VR  (5.89) (5.90) (5.91)  The system is therefore composed of seven equations, and estimates of coefficients are needed for four of these equations.  Chapter  5.4  5. Application  1: Tree Volume  80  Esiimaiion  Ordinary Least Squares F i t  5.4.1  Unweighted Simple or Multiple Linear Regression  The estimated coefficients for the chosen equations using unweighted multiple linear regression to fit each equation were as follows: pred. height pred. HR  =  0.584579 + 1.071239<M - 0.009644 dbh  (5.92)  =  1.000000 - 0.912563 (  (5.93)  2  y  pred. total volume  pred.VR  t o p d l b  )  dbh J  _ 0.226424 hs  =  0.011944 + 3.55499 x 10~ dbh height  (5.94)  =  0.990691 - 0.986292(1 - HR)  (5.95)  5  2  2  The estimated error terms using these unweighted fitted equations were used to test for serial correlation and heteroskedasticity.  5.4.2  Testing for iid E r r o r Terms  To check for serial correlation in each equation, the estimated error terms were sorted by the number of annual rings counted at stump height on each tree (age), and a g r a p h of the current error term versus the previous error term was obtained.  3  Simple linear  regression was performed for the current error term with the previous error term, and all four R values (one for each of the four equations) were less than 0.005. Also, for an 2  alpha level of 0.10 (alpha of 0.05 for positive correlation and an alpha of 0.05 for negative correlation), the Durbin and Watson (1951) statistic was not significant for any of the equations and so the error terms were considered to be independent for each of the four equations of the system. For the height model, the graph of the estimated error versus the predicted height G r a p h s used i n this research are not presented i n this thesis, because of their large number; graphs are o n file and copies can be made available. 3  Chapter  5. Application  1: Tree Volume  81  Estimation  did not indicate heteroskedasticity, but a simple linear regression of the estimated error squared with predicted height resulted i n an R  value of 0.01158.  2  T h e Goldfeld and  Quandt (1965) test for heteroskedasticity was therefore used to test the height model. The estimated error terms were ordered by the predicted values of the height variable from smallest to largest, because the variances were considered to be nondecreasing. Using the ordered data, multiple linear regression was used to fit the height model using the first 40 samples only (small estimated error terms) and then using the last 40 samples (large estimated error terms) only. The 20 samples representing the center of the ordered estimated error terms were not included i n either regression.  T h e test statistic was  calculated as follows: test statistic where mean squared  error  =  mean  squared  error  2  mean  squared  error  1  2, mean  —  squared  error  =  7.32302 5.60853  = 1.3057  (5.96)  1 are the mean squared errors for the  regression of the larger error terms and for the regression of the smaller error terms, respectively. The degrees of freedom for both the numerator and the denominator were calculated as follows: n - k - 2m - 2 _ 100 - 20 - 2(2) - 2 _ 2  _  ~  ^ - S< Q  (5.97)  where n is the total number of samples; k is the number of samples not included i n either regression; m is the number of R H S variables i n the equation, (does not include the intercept). The F value for a significance level of 0.05 and for 37 degrees of freedom for both the numerator and the denominator is 1.69 (actually for degrees of freedom equal to 40), and so the height model was considered to be homoskedastic (identically distributed). The estimated error versus predicted height ratio graph indicated that the selected model was somewhat i n error; the graph of the weighted error terms versus the weighted  Chapter  5.  Application  1: Tree Volume  82  Estimation  predicted height showed a somewhat linear trend for a few of the sample trees with small height ratios.  The heteroskedasticity graph of estimated error squared with predicted  height ratio showed that higher squared errors were associated with small predicted values; after examination, this trend was considered to be associated with lack-of-fit of the model rather than with heteroskedasticity.  The Goldfeld and Quandt (1965) test  was attempted and later discarded as the test is affected by lack-of-fit. The error terms for the height equation were therefore considered to be homoskedastic for the remaining analysis. The estimated error versus predicted total volume graph indicated some heteroskedasticity, with larger variances for larger predicted values. The R value for the simple linear 2  regression of estimated error squared with predicted volume was 0.34276. The Goldfeld and Quandt (1965) test statistic was calculated and found to be 0.00495/0.00015 or 33.000. The degrees of freedom were 38, as m equals 1. Using the same F value as for the height model with degrees of freedom of 40 for the numerator and the denominator, the total volume model was found to be heteroskedastic. The heteroskedasticity graph for the volume ratio model did not indicate heteroskedasticity. Also, a simple linear regression of the estimated error squared with the predicted volume ratio had an R  2  value of less than 0.005, and so the error terms were considered  to be homoskedastic.  5.4.3  Estimating the E r r o r Covariance M a t r i x of Each Equation  The error terms for all models were considered to be i i d , except for the total volume equation which were found to be heteroskedastic. of the heteroskedasticity were examined.  Several assumptions about the nature  Chapter  5. Application  1: Tree Volume  83  Estimation  1. The variance of the error is a linear function of the predicted total volume. u\  m  where u |  m  = e\  m  —a  0  -f- cxi pred. total volume  + error  (5.98)  1 6  = ^im * the estimated error squared for the total volume equation (the s  third i n the system) for the m  th  sample, based on the unweighted model;  pred. total volume is based on the unweighted model; a , cti are coefficients to be estimated. 0  2. The variance of the error is proportional to a power of the predicted total volume. lm  u  =  ^3m  =  a  3 (  total volume)  ai  errorn  (5.99)  where a , and a are coefficients to be estimated. 3  4  3. The variance of the error is a linear function of exogenous variables i n the system.  «L = & » = / ( * )  (5-100)  where X is an n by k matrix of all of the exogenous variables i n the system. Each of these assumptions was assessed by using multiple linear regression to calculate coefficients. The estimated a coefficients were then used to calculate the inverse of the square root of the estimated u\ , and these values were used to in a weighted regression of m  the total volume equation. The residual graph of the estimated error terms and predicted values of the weighted model were examined for a trend showing heteroskedasticity. None of these assumptions for heteroskedasticity resulted i n a residual graph which indicated that the weighted error terms were identically distributed. Also, many negative variances were predicted using these equations for heteroskedasticity.  A s an alternative to these  heteroskedasticity models, then, the variance of the error terms was calculated by dividing the range of the total volume into classes, and then the average squared error was  Chapter  5.  Application  1: Tree Volume  84  Estimation  calculated for each of these classes. T h e residual graph of the weighted error terms with weighted predicted total volume, using the inverse of the square root of these estimated variances did not appear to be heteroskedastic.  Also, the regression of the estimated  weighted error terms squared versus the predicted weighted total volume resulted in an R  value of 0.05733, much lower than the R  2  2  value of 0.34276 for the unweighted error  terms. T h e estimated variances of the error calculated for each total volume class were therefore used to weight the total volume regression.  5.4.4  Appropriate O L S Fit Based on E r r o r Structure  The coefficients presented for the unweighted fit of the height, height ratio, and volume ratio models were considered appropriate as each of these equations have error terms which are i i d . For the volume equation, E G L S using the estimated error covariance matrix from the previous section was used as an appropriate O L S fit. The estimated coefficients using the appropriate O L S fit for the system of equations were as follows: pred. height  =  0.584579 + 1.071239<M - 0.009644 dbh  pred. HR  =  1.000000 - 0.912563 (  y  t o  _ 0.226424  P^ \ dbh J l b  pred. total volume  =  0.011615 + 3.552805 x 1 0  pred. VR  =  0.990691 - 0.986292(1 - HR)  - 5  2  dbh height 2  2  (5.101) (5.102) (5.103) (5.104)  These fitted equations were used to compare the O L S method to other methods.  5.5 5.5.1  Composite M o d e l Fit Derivation of the Composite M o d e l  To obtain a composite model for comparison to the other methods for fitting equations, the equations of the system were combined into one equation to estimate merchantable  Chapter  5.  Application  1: Tree Volume  85  Estimation  volume by performing the following steps. 1. Height was replaced i n the total volume equation by the height estimation function, resulting i n the following equation. total  = r +  volume  0  -f T dbh  Tidbh  2  2  + T dbh  3  3  + error  4  (5.105)  1 8  This equation is simply a local volume function which is a polynomial. 2. The squared term of the volume ratio equation was expanded, and the height ratio was replaced by the height ratio equation. T  r  „  [top  -\-T hs  + error  2  9  dib\  ,  2  dib\^  Itop  (top  dib\  ,  2  (5.106)  1 9  3. The expanded total volume equation and the expanded volume ratio equation were then multiplied to obtain an equation to estimate merchantable volume. merch.  -f  7 5  /i5  2  / top dib \  =  volume  -f "fedbh  7  o  +  7  l  +  l  2  -f 7 t o p di6 -f *y dbh hs 2  2  +  hs  -f  2  s  7  (top  2  ^ - ^ - J  7 9  7  +f dbh n  hs + ii dbh 2  + i 6 i o p dib dbh 2  7  +f top 21  5.5.2  ,  dib  4  dib  + fndbh  hs  hs  3  + -) &bh top 22  -f f^top  2  2  dib hs 2  ,  dbh  +  7l4  +  4  + -y dbh hs 23  4  7l9  dbh  + 4  J  *fiot°P  dib hs 2  £op cfo6 ——— ao/i  4  7 l 5  2  e  2  j fc-  7  d 6 / i f o p difc +  +  2  / top dib \  4  2  o d f c / i /is +  -f fwdbh  J  ) iop difc +  (—^ \  ,  dib \  s  2  *y dbh 20  hs  4  (5.107)  5  Unweighted Regression of Composite M o d e l  The multiple linear regression of this large model resulted i n the following fitted model. pred  merch.  volume  =  0.046284 + 2.69205 x 1 0 " -3.19287 x 1 0  - 7  dbh hs A  -  5  dbh  3  0.095285 ( \  t o  P  d l h  dbh  (5.108)  \ I  Chapter  5.  Apphcation  1: Tree Volume  86  Estimation  The addition of other variables produced an R  2  change of less than 0.001. The R  2  value  for this composite model was 0.9682, with the first variable, dbh , responsible for a 0.9572 3  R  2  change.  Because all of the R H S variables are exogenous, the estimated coefficients  are unbiased.  5.5.3  Testing for iid Error Terms  D a t a were ordered by age measured on each tree at stump height and a graph of the estimated error term with the previous estimated error term was obtained. The graph did not indicate that the serial correlation was significant, and also, the linear regression of the estimated error term with the previous error term resulted i n an R  2  value of 0.00508.  The D u r b i n and Watson (1951) test statistic was also not significant. The error terms for the composite model were therefore independent. A graph of the estimated error terms with the predicted merchantable volume from the unweighted linear model indicated that the variance of the error was heteroskedastic. Also, the linear regression of the estimated error terms squared with the predicted merchantable volume had an R  2  value of 0.13863. The test statistic for the Goldfeld and  Quandt (1965) test was 16.4415, and since the critical value from the F distribution for a equal to 0.05, and 36 degrees of freedom (i.e. for n = 100, k = 20, m — 3) for the numerator and the denominator is 1.69 (actually for 40 degrees of freedom), the composite model was considered heteroskedastic.  5.5.4  Estimating the Variance of the E r r o r Terms  The following model was selected for estimating the variance of the error terms of the composite model. u  2 m  =  = 0:5 -f a pred. 6  merch.  volume  + error i 2  (5.109)  Chapter  5.  where u  2  Application  1: Tree Volume  — e is the estimated error term for the m 2  m  87  Estimation  sample based on  th  m  the unweighted fit of the composite model; u  2 m  estimates cr^, the variance of the error term for the m  sample;  th  volume is predicted merchantable volume from the unweighted  pred. merch.  fit of the composite model; a  5  and a  are coefficients to be estimated.  6  The fitted equation was as follows: j =u 2  <  m  2 m  = -0.00023 + 0.01376pred. merch.  (5.110)  volume  For a predicted merchantable volume of 0.01671 or less, the estimated variance is negative, and so the estimated variance of the error term was reset to 0.672 x 1 0 ~ , the value of 7  this equation for predicted merchantable volume of 0.01672. A graph of the estimated weighted error versus the predicted weighted merchantable volume indicated that the variances of the weighted error terms were homogenous.  Also, the linear regression of  the estimated weighted error squared with the predicted weighted merchantable volume resulted i n an R value of 0.003. T h e model chosen to estimate the variance of the error 2  terms was therefore considered appropriate.  5.5.5  Weighted Regression of the Composite M o d e l  The estimated coefficients from weighted regression were as follows:  pred. merch.  volume  =  -0.053441 + 3.34751 x 10~  5  dbh  3  -6.46642 x 1 0 ~ dbh hs + 0.014345 ( 7  4  y  t o  P^  l b  dbh  The results from this weighted fit were used to compare with other methods.  \ )  (5.m)  Chapter  5.6  5.  1: Tree Volume  Application  88  Estimation  MSLS Fit  5.6.1  First Stage Equations  The chosen system of equations includes four equations which require estimation of the coefficients. Of these four equations, two equations, total volume and merchantable ratio, have endogenous variables appearing on the R H S . T h e total volume function is based on dbh height,  and because height  2  is an endogenous variable, this R H S variable  is also endogenous. Similarly, the (1 — HR)  2  variable which appears on the R H S of the  volume ratio equation is endogenous as HR is endogenous. T h e O L S fit of each of these two equations therefore results in biased estimates of the coefficients.  The error terms  are likely correlated among equations, because a measure of taper over the stem was not included in the equations. Taper likely affects all four of the equations, resulting i n correlation of the error terms. T h e system of equations met the rank and order conditions for identification. The first step to obtain the M S L S fit was to fit the endogenous variables which appear on the R H S , height  and HR, as a function of all of the exogenous variables i n  the system of equations. Multiple linear regression was used to obtain the following first stage equations. pred. height  let  pred. HR  ut  =  =  0.623596 + 1.092101 dbh - 0.009839 dbh  (  2  top dib \  „,  dbh  J  2  - 2.568663 hs  (5.112)  0.979888 + 0.001844 dbh - 3.77511 x 10~ dbh 5  -0.911111  ti^P^t] y dbh  )  + 0.227930 hs  2  (5.113)  Some loss i n efficiency is expected, because the HR variable appears as a quadratic term on the R H S of the system (nonlinear variable) and as a linear variable on the L H S of the system; the structural equations cannot be manipulated to obtain linear reduced-form  Chapter  5. AppHcation  equations.  1: Tree Volume  89  Estimation  Since the first stage equations predict the linear term, some efficiency was  lost. The M S L S fit using these first stage equations will not, therefore, be asymptotically efficient.  5.6.2  Second Stage Equations  The second step of the M S L S procedure was to replace the height  and HR variables  appearing on the R H S of the volume and volume ratio equations by pred. height pred. HR . lst  1 H  and  Each second stage equation was then fitted using multiple linear regression.  Because the height and height ratio equations do not have endogenous variables on the R H S , the final O L S results were used as the second stage regressions of these two models. For the volume and volume ratio models, the following second stage equations were obtained.  pred. total volume  2nd  pred.VR  2nd  5.6.3  =  0.009677 + 3.57295 x 10^ dbh pred. height  =  0.982007 - 0.934982(1 -pred.  2  HR )  2  ut  l a t  (5.114)  (5.115)  Testing for iid Error Terms  The height and height ratio second stage equations are the same as the O L S equations and these equations were shown to have i i d error terms. For the total volume and volume ratio equations, because the equations were purged of simultaneity bias, the coefficients of the second stage total volume and volume ratio equations are consistent, and so the residuals can be used as estimates of the error terms. To test each equation for serial correlation, the estimated error terms for these two second stage equations were first ordered by the age of tree taken at stump height, and graphs of the estimated error term versus the previous estimated error term were obtained. Neither equation appeared to be serially correlated and the D u r b i n and Watson  Chapter  5.  Apphcation  1: Tree Volume  Estimation  90  (1951) test statistic was not significant for an alpha, level of 0.05 for positive serial correlation and for an alpha level of 0.05 for negative correlation (alpha of 0.10 for a two sided test). Errors terms for both equations were therefore considered independent. To check for heteroskedasticity in the second stage total volume model, a graph of the estimated error versus the predicted total volume was examined. The graph indicated increasing variance of the error term with increasing predicted total volume and the R  2  value for the linear regression of the estimated error squared with the predicted total volume was 0.20678.  The test statistic for the Goldfeld and Quandt (1965) test was  22.11 which is significant for alpha equal to 0.05. total volume equation were therefore considered  The error terms for the second stage heteroskedastic.  The graph of the estimated error with the predicted volume ratio indicated a lack-offit for a few samples with small volume ratios. The Goldfeld and Quandt (1965) test was calculated, but later discounted as the. test is inconclusive if the model indicates lack-offit. No further testing or model development was done as this lack-of-fit is likely due to the height ratio model, and a more intensive examination of the height ratio model was considered to be beyond the scope of this research.  5.6.4  Estimation of the E r r o r Covariance M a t r i x  A l l of the equations of the system which require estimation of coefficients have i i d error terms except the total volume equation.  The estimation of the error covariance  matrix, then, first required the estimation of the variances of the error terms for the total volume equation.  However, as with the estimation of the variances of the error terms  for the total volume equation using the O L S method, none of the models proposed for heteroskedasticity  resulted in weighted error terms which were i i d . For this reason, and  also to maintain a parallel fit for comparison to the O L S fitting technique, the range of the total volume was divided into classes, and the average squared estimated error was  Chapter  5.  1; Tree Volume  Application  calculated for each class.  91  Estimation  These averages were used as the estimated variances for the  error terms. Secondly, to estimate the error covariance matrix, E G L S was used to obtain estimates of coefficients for the weighted total volume model. Contemporaneous variances were then calculated using the estimated error terms from the following equations. pred. height pred.  0.584579 + 1.071239 dbh - 0.009644 dbh  2 n d  2  , top dib  1.000000 - 0.912563 '  HR  2nd  *  dbh  (5.117)  -0.226424 hs pred. total volume  2 n d  0.004272 wt  wted  +3.590292 x 1 0 " dbh 5  pred.VR  (5.116)  =  2nd  2  utwted  (5.118)  HR f  (5.119)  pred. height  0.982007 - 0.934982(1 -pred.  lst  where wt is the inverse of the square root of the estimated variance of the error term; pred. total volume d ted 2n  w  *  s  total volume times the weight;  .  is dbh squared times predicted height (from first  dbh pred. height 2  lBtwted  stage equations) times the weight. Contemporaneous variances were calculated using equation 3.28, resulting i n the following S matrix. r  6.6046  -0.0022  2.0391  -0.0089  -0.0022  0.0019  0.0098  0.0019  2.0391  0.0098  0.9606  0.0028  -0.0089  0.0019  0.0028  0.0034  (5.120)  where diagonal elements are the variances for an equation and off-diagonal elements are covariances between equations. Based on equation 3.51, the estimated error covariance matrix was therefore the following  Chapter 5. Apphcation  1: Tree Volume  92  Estimation  matrix.  fl  6.6046W!Wi  -0.0022W W  2  2.0391W W  3  -0.0022W W;  0.0019W W  2  0.0098W W  3  o.ooi9W w;  2.0391W W;  0.0098W W  2  0.9606W W  3  0.0028W W  4  -0.0089W W;  0.0019W W  2  0.0028W W  3  0.0034W W  4  1  2  2  3  3  4  4  1  2  3  4  -0.0089W!W  4  2  3  4  (5.121)  Because the height, height ratio, and volume ratio equations have iid error terms, W j , W , and W 2  4  are equal to the identity matrix of size n by n. The W  3  matrix is an n by n  matrix with the diagonal elements equal to the square root of the estimated variances of the error terms for the second stage total volume model. The simplified error covariance matrix was therefore as follows:  fl  6.6046I  n  -0.0022I  n  2.0391W  3  -0.0089I  n  -0.0022I  n  2.0391W  3  0.0019I„  0.0098W  3  0.0098W  3  0.9606W W  o.ooigin  3  3  0.0028W  3  -0.0089I  n  O.OOigin 0.0028W  3  0.0034I  n  (5.122)  The negative contemporaneous variances (covariances between equations) indicate that the height equation was negatively correlated with the height ratio and the volume ratio equations. A l l other equations appear positively correlated. These estimated covariances appear large relative to the estimated variances.  5.6.5  E G L S to Fit the System of Equations  To obtain the M S L S fit, E G L S was then used to fit the system of equations simultaneously for the estimated error covariance matrix. The resulting M S L S fit was as follows: pred. height  MSLS  =  -0.054629 + 1.100452<M -0.009568<M  2  (5.123)  Chapter  5.  Apphcation  1: Tree Volume  93  Estimation  ( -0.217870 pred.total  volume  MSLS  =  topF dib\  j  2  ta  0.003757 + 3.738763 x 10~ dbh pred. height  (5.124)  5  (5.125)  2  pred. VRJMSLS  =  lst  0.985298 - 0.947024 (1 - pred.  H R  U  T  )  2  (5.126)  The I M S L S technique was not used, because the number of samples was considered large enough to assume that the asymptotic properties of M S L S apply.  5.7  Comparison of the Three Fitting Techniques  5.7.1  Goodness-of-fit Measures  Fit Index The F i t Indices for each of the four equations for the O L S and the M S L S fits are presented i n Table 5.4.  T h e F i t Indices for the M S L S fit were marginally lower than  those for the O L S fit.  Table 5.4: F i t Indices for O L S and M S L S Fits of the Volume Equation System L H S Variable  height  HR  total volume  VR  OLS MSLS  0.7699 0.7612  0.9722 0.9629  0.9881 0.9629  0.9960 0.9314  For the composite model, the F i t Index calculated for the merchantable volume was 0.9471.  T h e corresponding value for the O L S fit, calculated by combining the total  volume and volume ratio equations was 0.9885.  Similarly, for the M S L S fit, the F i t  Chapter  5.  Application  1: Tree Volume  94  Estimation  Index for merchantable volume was 0.9629. The composite model created by combining equations therefore resulted i n the lowest F i t Index; the O L S F i t Index was the highest, but was only marginally higher than that for the M S L S fit.  M e a n Absolute Deviation The mean absolute deviations ( M . A . D . ) were calculated by class for each of the L H S endogenous variables. The classes were created by sorting the 100 samples by the endogenous variable, and then dividing the sorted data into five equal classes of 20 samples each. T h e M . A . D . values for the O L S fit and for the M S L S fit are presented i n Tahle 5.5 and Table 5.6. T h e M . A . D . values are slightly lower for O L S than for M S L S . T h e trends across the five classes were basically the same for the O L S and for the M S L S fits.  Table 5.5: M . A . D . for Five Classes for the O L S F i t of the Volume Equation System Classes, from Low to High Values Endogenous Variable height HR total volume VR  1  2  3  4  5  2.19 0.053  1.52 0.036  2.22  2.05  2.20  0.026  0.016  0.0  0.009 0.012  0.013  0.026 0.013  0.027  0.064  0.010  0.009  0.013  For the composite model, the M . A . D . values were calculated for merchantable volume, and compared to those for the O L S and M S L S fits, created by combining the total volume and volume ratio equations (Table 5.7). As with F i t Indices, the O L S fit gave the lowest M . A . D . values, followed by the M S L S fit, and lastly by the composite model fit.  Chapter  5.  Application  1: Tree Volume  95  Estimation  Table 5.6: M . A . D . for Five Classes for the M S L S F i t of the Volume Equation System Classes, from Low to High Values Endogenous Variable height HR total volume VR  1  2  3  4  5  2.00 0.047  1.42 0.038  2.19 0.034  2.33 0.042  2.33 0.030  0.011 0.078  0.029 0.020  0.043 0.009  0.058 0.010  0.112 0.015  Table 5.7: M . A . D . for Merchantable Volume Classes, from Low to High Values Endogenous Variable OLS Composite MSLS  1  2  3  4  5  0.005  0.012  0.020  0.027  0.062  0.039 0.013  0.050 0.026  0.068 0.044  0.079 0.054  0.123 0.112  Chapter  5.  Apphcation  1: Tree Volume  96  Estimation  M e a n Deviation The mean deviations ( M . D . ) were calculated for each equation for the same five classes as for M . A . D . Results for the O L S fit are presented i n Table 5.8, and for the M S L S fit in Table 5.9.  T h e M . D . values were sometimes lower for M S L S than for O L S . For the  Table 5.8: M . D . for Five Classes for the O L S F i t of the Volume Equation System Classes, from Low to High Values Endogenous Variable height HR total volume VR  1  2  3  4  5  -1.41 -0.013  -0.49 -0.007  -0.85 0.005  0.73 0.015  2.01 0.000  -0.008 0.001  0.008  0.009 -0.009  -0.005  0.000  0.000  0.009  -0.001  Table 5.9: M . D . for Five Classes for the M S L S F i t of the Volume Equation System Classes, from Low to High Values Endogenous Variable height HR total volume VR  1  2  3  4  5  -0.683 0.005  0.079 0.016  -0.428 0.030  1.05 0.042  2.13 0.030  -0.009 -0.021  -0.003  -0.002  -0.018  -0.043  -0.006  -0.002  0.006  0.015  height model, the M . D . values were lower for smaller height values using M S L S than using O L S , but for higher height values, the O L S fit resulted i n lower M . D . values. T h e M S L S fit of the height ratio model resulted i n an underestimation of the height ratio for  Chapter  5.  Application  1: Tree Volume  97  Estimation  all classes of the sample data; whereas, the O L S fit resulted i n an overestimation of the lower classes and an underestimation of the higher classes.  T h e M . D . values for total  volume showed an overestimation of total volume using M S L S . T h e O L S fit showed small differences which were not consistently low or high across the classes. Finally, for the volume ratio model, the M S L S fit showed an overestimation for the lower classes, and an underestimation for the upper classes.  T h e O L S fit showed underestimation, then  overestimation and then underestimation. For the composite model, the M . D . values were calculated for merchantable volume, and compared to those for the O L S and M S L S fits, created by combining the total volume and volume ratio equations (Table 5.10). T h e composite model underestimated  Table 5.10: M . D . for Merchantable Volume  Technique  Classes, from Low to High Values 1 2 3 4 5  OLS  -0.004  0.007  Composite MSLS  0.029 -0.008  0.040 0.000  0.005 0.044  -0.002  -0.001  0.024  0.006  -0.016  -0.045 -0.039  merchantable volume for both smaller and larger volumes. T h e O L S and M S L S fits had similar trends with an overestimate of volume, then an underestimate, and finally an overestimate. T h e O L S values were the lowest.  5.7.2  Relative Variances  The trace of the estimated coefficient covariance matrix for the M S L S fit was 1.775283; whereas, the trace of the O L S fit was 3.467678 for the system of equations, Even though the M S L S appears more efficient, the trace of the estimated coefficient covariance matrix for the O L S fit cannot be used to calculate relative efficiency. T h e M S L S fit is expected  Chapter  5.  Application  1: Tree Volume  98  Estimation  to be more efficient and i n fact reaches the Cramer-Rao lower bound asymptotically, but in this case, because of the first stage equations used in this M S L S fit of the volume equation system, there was some loss i n efficiency. The traces for the submatrices of the estimated coefficient covariance matrix corresponding to each individual equation are given in Table 5.11. The trace for the O L S fit  Table 5.11: Trace of the Coefficient Covariance M a t r i x for Each Equation of the Volume Equation System Endogenous Variable OLS MSLS  height  HR  3.465530 1.773650  2.097202 x 10~ 9.787514 x 10~  total volume 3 4  9.954486 x 1 0 " 1.483118 x 10~  VR 6 5  4.125564 x 1 0 " 6.084494 x 10~  5 4  of the height equation was larger than the M S L S fit, and this difference accounted for the difference in the overall traces of coefficient covariance matrix for the system of equations'. The lower trace for the M S L S fit occurred, because contemporaneous correlation is accounted for in the M S L S fit. A similar result occurred for the height ratio equation. For the total volume and volume ratio equations, the trace for the O L S fit was somewhat lower. However, since the O L S estimates of the coefficients are biased and inconsistent, confidence intervals cannot be calculated, and hypotheses cannot be tested. Since the composite model has merchantable volume as the L H S variable and a different functional form, a comparison of the trace of the estimated coefficient covariance matrix for the composite model to the other fitting techniques was not. possible.  5.7.3  Table of Estimated Coefficients and Standard Deviations  A summary of the estimated coefficients and their associated standard deviations from the O L S and M S L S fits is shown in Table 5.12.  Chapter  5.  Application  1: Tree Volume  99  Estimation  Table 5.12: Estimated Coefficients and Standard Deviations of Coefficients for the Volume Equation System Endogenous Variable height °n 013  Coefficients OLS  Standard Deviations OLS MSLS  MSLS  0.584579 1.071239 -0.009644  -0.054629 1.100452 -0.009568  1.855147 0.154753 0.003040  1.328270 0.096674 0.001719  1.000000 -0.912563 -0.226424  0.970531 -0.898900 -0.217870  0.007978 0.021266 0.039766  0.006091 0.018179 0.024722  0.0116150 3.552805 x I O "  0.003757 3.738763 x 10~  0.003155 4.505443 x 10~  0.003851 5.887156 x I O "  0.990691 -0.986292  0.985298 -0.947024  0.001643 0.006209  0.006660 0.023751  HR 021 <!>22 <$23  total volume 831 832  5  VR <5  41  842  5  7  7  Chapter  5.  Application  1: Tree Volume  100  Estimation  The estimated coefficients for the M S L S and for the O L S fits were very similar, except for the intercepts of the height and volume equations, and the slope of the volume ratio equation. The standard deviations for the estimated coefficients of the height model were lower from the M S L S fit than from the O L S fit. This was reflected i n the trace calculated for this equation, and indicates that the inclusion of the contemporaneous variances in the system fit using M S L S resulted in a lower variance. Results were similar for the fits of the height ratio equation. For the volume and volume ratio equations, the standard deviations for the coefficients were higher for the M S L S fit; again, this was reflected in the trace values for these equations, and indicates that O L S underestimates the confidence interval for the true value, because of simultaneity bias. If the O L S estimates for the volume and volume ratio equations were used to test hypothesis statements, results would not be correct. For instance, <5> would be expected 4 1  to be equal to 1.0, so that a, volume ratio of one is predicted, if the merchantable length is equal to the total height. Using an alpha level of 0.01, the confidence limits using the biased estimates from O L S would be 0.94923 to 0.98646.  Using the consistent  MSLS  estimates, the confidence limits would be 0.96830 to 1.00230. The hypothesis that this coefficient is equal to 1.0 would be rejected with the O L S estimates, but not rejected with the M S L S consistent estimates.  5.7.4  R a n k i n g for Other Features  The ranks assigned to each fitting technique are shown in Table 5.13.  4  The composite  model was assigned a rank of one for information as only one endogenous variable is estimated. For consistency, both the M S L S and the composite model for this system of equations provided consistent estimates. 4  The fit of the composite model was unbiased  See Chapter 4 for an explanation of the ranking system used.  Chapter  5.  Application  1: Tree Volume  101  Estimation  Table 5.13: Ranks for the Three Techniques Feature Information Consistency Confidence Limits Asymptotic Efficiency Compatibility Ease of fit Total  as all R H S variables were exogenous.  OLS 2 1  Composite 1 2  MSLS 2 2  1  2  2  1 1 2 8  1 2 3 11  2 2 1 11  Confidence limits for both the M S L S fit and the  fit of the composite model can be calculated. M S L S was considered to have the highest asymptotic efficiency as the equations appear to be correlated. the composite model fits provide compatible estimates.  B o t h the M S L S and  The number of steps required  to obtain the O L S fit was four for the unweighted -fit, eight to check for i i d errors, one to estimate the error covariance matrix, and one to obtain the weighted fit of the total volume model, for a total of 14'step's. For the composite model, three steps were required to derive the model, two steps to check for i i d error terms, and three steps to obtain the unweighted fit, the estimated error covariance matrix, and the weighted fit, for a total of eight steps.  For the M S L S fit, two steps were required to fit the first stage models,  four steps for the second stage models (actually used O L S estimates for two of the second stage models), eight to check for i i d errors (actually only four as results from O L S used for two of the models), one to estimate the weights for the volume equation, one to estimate the covariances among equations, and one to fit the system of equations, simultaneously, for a total of 17 steps.  Chapter 5.  Application  1: Tree Volume  Estimation  102  5.8 Conclusion The chosen system of equations to estimate tree volume was composed of seven equations. Four of these equations had coefficients which were to be estimated. Using the O L S technique, two of the equations were found to have i i d error terms, and were fitted using multiple linear regression. The error terms for the height ratio equation were considered i i d , although a lack-of-fit was noted. heteroskedastic error terms. found to be inadequate.  The total volume equation had  M a n y models of this heteroskedasticity were tested and  The variances of the error terms were therefore estimated by  calculating the average of the estimated error terms, squared, from the unweighted fit, for classes of the predicted total volume values, also from the unweighted fit. The total volume equation was then refitted using E G L S . A composite model was derived by combining the seven equations into one to estimate merchantable volume. The unweighted fit, using a criterion of a m i n i m u m R  2  change of  0.005, included only three of the 23 terms of the composite model. Error terms were found to be heteroskedastic and so the variances of the error terms were estimated.  Multiple  linear regression was then used to fit the weighted equation. The first stage equations for the M S L S fit of the volume equation did not reflect the quadratic nature of the height ratio variable; M S L S estimates were therefore not aymptotically efficient for this system. The error terms for the second stage equations of the M S L S fit were found to be similar to the error terms from the unweighted O L S fit.  However, a lack-of-fit was noted for the estimated error terms of the volume ratio  equation; no attempt was made to explain the cause of this lack-of-fit. A second set of second stage equations were therefore derived in which the second stage total volume equation was weighted and fit using E G L S . M S L S was used to fit the system using the estimated error covariance matrix. I M S L S was not used to fit the equations, because the  Chapter  5.  Application  1: Tree Volume  Estimation  103  number of samples was large. The best goodness-of-fit measures (high F.I. and low M . A . D . and M . D . values) were obtained for the O L S fit. However, the goodness-of-fit measures for the M S L S fit were very similar. The composite model resulted in the worst goodness-of-fit measures, likely because some of the important variables were excluded from the composite model fit. The trace of the estimated coefficient covariance matrix was lower for M S L S than for O L S . The trace values of the submatrices of this matrix, corresponding to individual equations, showed that the O L S fit resulted i n lower trace values than the M S L S fit for the total volume and volume ratio equations. Each of these equations had R H S endogenous variables. The trace values for the O L S fit were higher for the height and height ratio equations than for the M S L S fit; these equations did not have R H S endogenous variables. The M S L S fit therefore resulted in an increase i n efficiency as shown by the lower trace values for the submatrices estimated for the height and height ratio equations. Higher trace values for the submatrices of the total volume and volume ratio equations resulted from the M S L S fit.  However, the O L S estimates of the variances cannot be  used to calculate confidence intervals or test hypotheses concerning the true coefficients. The M S L S fit results in a consistent estimate of the coefficients and of the coefficient covariance matrix, and, therefore, confidence limits for true coefficients can be calculated. Hypotheses can also be tested using the M S L S fit. The table of estimated 'coefficients and their standard deviations further supported the evidence given by the traces. The O L S fit results i n larger standard deviations than the M S L S fit for the height and height ratio equations, and results in underestimates of confidence limits for the volume and volume ratio equations. The summed rank for other features was the same for the M S L S fit as for the composite model fit. The M S L S technique has the advantage of all of the endogenous variables being predicted, whereas the composite model fit is easier to compute.  The O L S fit  Chapter  5.  Application  1: Tree Volume  had the lowest summed rank, largely because estimates and inconsistent.  104  Estimation  of the coefficients are biased  Also, the O L S fit is a single equation approach which does not re-  sult in compatible estimates, and efficiency is lost because the information concerning contemporaneous correlation is not utilized in the O L S fit. The M S L S technique is therefore preferable for this application, because the goodnessof-fit measures are close to the O L S fit and higher than the composite model fit. Estimates of the coefficients and their standard deviations are consistent, and more efficient. This was indicated by a high summed rank for other features. The composite model fit resulted in unbiased estimates of the coefficients and their standard deviations; however, only one variable was estimated.  In addition, the M S L S fit required only three more steps than  the O L S fit. The main disadvantage of the M S L S fit was that the number of samples had to be reduced from 500 to 100.  This was due to a limitation of the computer memory to  seven megabytes for this research.  If a very large number of samples were used, and  larger computer memory was not available, a subsample could be used, as was used for this apphcation. Alternatively, the M S L S fit could be modified by eliminating the last step, which is E G L S applied to the system using the estimated error covariance matrix. Instead, consistent estimates of the coefficients could be obtained by using the weighted second stage coefficients (equations 5.116, 5.117, 5.118, and 5.119). The gain i n efficiency by accounting for the contemporaneous variances in the final system fit would be lost, but simultaneity bias would be removed, which is an improvement over the O L S fit.  Chapter 6  Application 2: Estimation of Tree Diameter Distribution  6.1  Introduction Diameter distribution information is required in order to choose stands for harvest  and to assess the expected financial return.  The estimation of diameter distributions  from stand measures can be done by first selecting a known probability density function (pdf), finding the parameters, and then by relating the parameters of the selected pdf to current stand measures. These equations can then be combined with a stand level growth model where stand attributes are first predicted and parameters of the pdf are then "recovered" from these predicted stand attributes.  Hyink and Moser (1983) referred to  this diameter distribution modelling technique as the parameter  recovery  method.  Stand  attributes from forest inventories can also be used as inputs to predict the parameters of the selected pdf, once the relationships between each parameter and the stand attributes are established. The alternative to estimating the parameters of a known pdf is to relate the stand attributes (measured or predicted) to percentiles of the diameter distribution (Anon., 1987; Bailey et al., 1981; Borders et al., 1987b). To limit the scope of this thesis, the estimation of diameter distribution was restricted to the first method. The most commonly selected pdf is the Weibull distribution (Clutter et ai,  1983).  The probability density function of this distribution is as follows: (a < X  im = o  <  co) (6.127)  otherwise  105  Chapter  6. Application  2: Estimation  of Tree Diameter  Distribution  106  where b> 0  c > 0  The parameter a is the location parameter, b is the scale parameter, and c is the shape parameter (Clutter et al., 1983). Although a is sometimes negative for the distribution, a must be nonnegative for diameter distributions. The Weibull distribution is attractive for representing diameter distributions i n that the equation for the pdf is relatively simple with only three parameters, and the shape of the distribution is flexible (Bailey and Dell, 1973). The cumulative form of the Weibull distribution, the cumulative density function (cdf), is as follows: F(X.)  =  l - [ ( ^ ) ]  F(X)  =  0  _  (a<X<oo)  C  e  (6.128)  otherwise  A system of parameter prediction equations is used to predict the parameters of the Weibull distribution, which represents the distribution of diameters. (6.129)  a  — fi (stand  attributes)  b  =  (stand  attributes  )  (6.130)  c  — f  (stand  attributes  )  (6.131)  f  2  3  If the parameters of the Weibull distribution are considered interdependent, the system of parameter prediction equations becomes the following: a  — j4 (stand  attributes,  b.c)  (6.132)  b  — /  5  (stand  attributes,  a, c )  (6.133)  c  — f  6  (stand  attributes,  a, b)  (6.134)  Chapter  6.  Application  2: Estimation  of Tree Diameter  Distribution  107  A change in b, for example, is expected to change the values of a and c and these changes will only be reflected if all three parameters are included in each equation of the system. This second system is simultaneous i n that the endogenous variables (the parameters) on the L H S also appear on the R H S . The first step i n estimating the diameter distribution using the Weibull pdf is to estimate the parameters of the pdf for each stand or plot. Estimates can be obtained by using nonlinear least squares to fit the cdf, and variances can be estimated by using Jacknife approximations or by using a Taylor series expansion. Variances of the parameters from this nonlinear fit of each plot are required and then the method of fitting systems of parameter prediction equations proposed by Ferguson and Leech (1978), modified for simultaneous systems of equations, could be followed. However, the error terms for each plot or stand may be heterogenous or serially correlated and a transformation or generalized nonlinear least squares would be required to obtain a consistent estimate of the coefficient covariance matrix. The alternative to nonlinear least squares is to replace the relative cumulative frequency observations i n each plot or stand with three percentiles only and to calculate the a, b, and c parameters from these three percentiles. The three parameters are considered to be without error with this method. This second approach was selected for this thesis; all system fitting problems were restricted to linear equations.  6.2  Preparation of D a t a Summarized data for permanent sample plots (psp) were obtained from the Alberta  Forest Service ( A F S ) . Psp information was selected over temporary sample plot data because the psp data are fixed area plots, whereas most of the temporary sample plot data collected in Alberta are variable radius plots which are weighted toward selection of larger diameter trees. The summarized data included the location of the psp, the plot  Chapter  6.  Application  2: Estimation  of Tree Diameter  108  Distribution  size, the average age at breast height by species, the average age at stump height by species, the site index for a reference age of 50 years at breast height by species, the quadratic mean diameter (diameter of a tree with mean basal area) by species for live trees, and the top height (height of the 100 largest trees by dbh per hectare) by species for live trees. Also included were live stems per hectare by species and diameter class, and volume from ground to tree tip (total tree volume) per hectare by species and diameter class for live trees. The diameter classes had a width of 4.0 centimeters, beginning with 1.1 centimeters.  A description of the data collection procedures currently used by the  A F S for psp data can be found i n the Alberta field procedures  manual  Forest  Service:  Permanent  sample  plot  (Anon., 1986).  Plots with more than 80 percent pine volume were selected from the data. Also, as with the section tree data for the first apphcation, data from pine stands in Northeastern Alberta, were removed and therefore only lodgepole pine plots remained. Prior to 1982, a cluster of four plots was established by A F S at each psp location. To remove possible dependencies of plots within these clusters, only one plot was retained from each location. The psp data represented measurements from several time periods; to remove correlation between measurements of a given plot over time, the first measurement alone was retained. Also, plots which had been treated, for instance thinned plots, were removed from the data. The relative cumulative frequencies by diameter class (diameter distribution) for the remaining plots were graphed. Plots which had multimodal diameter distributions were removed, because the Weibull distribution is suitable only for unimodal distributions. The 24th, 63rd, and 93rd percentiles were calculated for each psp, using linear interpolation as shown below for the 24th percentile. ratio ratio  (k) — 0.24  (k) — ratio (k — 1)  endpoint endpoint  (k) —  (k) — endpoint  Dl (k — 1)  (6.135)  Chapter  6.  Application  2: Estimation  of Tree Diameter  109  Distribution  where ratio (fc) is the ratio of the cumulative number of stems up to and including the k  th  diameter class, over the total number of stems per hectare; fc — 1 is  the previous diameter class; the desired percentile lies between the two classes; endpoint  (fc) and endpoint (fc — 1) are the endpoints of the k  th  and fc — 1  th  diameter  classes, respectively; DI is the diameter corresponding to the 0.24 ratio. Dubey (1967) showed that the 24  and 9 3  th  rd  percentiles are most efficient relative to  the maximum likelihood method, if the a parameter is equal to zero. Bailey and others (1981) included the 6 3  rd  percentile to estimate the Weibull distribution if a is unknown.  The a, b and c parameters of the Weibull distribution were then calculated from these percentiles by using the following equations. DI  =  a + b[-ln(0.76)]  D2  =  a + b[-ln(0.37)]  D3  =  a + b[-ln(0.07)]  (6.136)  1/c  =a  l/c  + b  (6.137) (6.138)  1/c  where D I , D2, and D3 are the diameter limits for the 0.24, 0.63, and 0.93 cumulative probabilities, respectively; Zn is the natural log, base e. These equations can be simplified to the following: _  D1-D2 D3-D2  Hn  (0.76)]  1 / c  -l  [_  ( .07)]  1 / c  -l  Zn  0  . 1  j  D1-D2 b  a  =  =  . T7  [-/n(0.76)] D2-b  1/c  - 1  .  6.140 ^ (6.141)  V  A n iterative procedure is required to solve for c, and then b and a are calculated from the estimated c value. The Nonlinear Function Optimization ( N L P ) package at the the  Chapter  6.  Application  2: Estimation  of Tree Diameter  Distribution  110  University of British Columbia (Vaessen, 1984) was used to solve for the c parameter by minimimizing the absolute difference between the right and left hand sides of equation 6.139. A tolerance limit of 1.0 x 10~  12  was used. For three of the unimodal psps, several  different starting points were attempted, but no solution was found within the tolerance limit specified. In all of these three plots, the trees were i n the two smallest diameter classes, only, resulting i n an abrupt peak at the beginning of the graph of the distribution, followed by a sharp decline. The Weibull distribution is probably inappropriate for these plots, so the plots were deleted from further analysis. For two of the fitted plots, the solution for the a parameter was negative.  Since this negative value was close to zero,  no attempt was made to refit these two plots. The distribution of the remaining 121 psps by age and stems per hectare is presented in Table 6.14.  Over half of the psps did not have age at breast height on file. These  data were missing as the ages which had been previously recorded were considered to be i n error by A F S personnel; new a.ges were taken, but these had not yet been added to the data used i n this thesis. Site index was also unavailable for these plots. D a t a were checked for outliers by graphing measurements for pairs of variables. No outliers were found.  6.3  Model Selection  M u l t i p l e linear regression was used to fit each of the parameters as a function of the other parameters and stand measures. The stand measures initially included in the regression were site index for pine, age at breast height for pine, stems per hectare for all species, quadratic mean diameter for all species, top height for all species, total volume per. hectare for all species. These variables were chosen based on the results i n literature including a study by the A F S for predicting percentiles of diameter distributions for stands in A l b e r t a  Chapter  6.  AppHcation  2: Estimation  ol' Tree Diameter  111  Distribution  Table 6.14: Distribution of Selected Psps by Age and Stems per Hectare Classes  Stems per Hectare 0 to 1000  no age  1 to 20  21 to 40  4  at Breast Height 61 81 101 121 to to to to 80 100 120 140  141 to 160  161 to 180  1  1001 to 2000  18  2001 to 3000  23  1  7  3001 4000 4001 5000 5001 6000 6001 7000 7001 8000 8001 9000  Age 41 to 60  Total  5 5  1  5  3  1  3  4  2  1  6  2  2  2  9  1  3  1  1  1  1  3  28 33  to to  1  1  19  •2  14  1  11  to to 1  5  to to  2  2  1  3  9001 to 10000  0  10000 to 11000  0  11000 to 12000 Total  1 72  0  11  13  13  3  7  0  1  1  1 121  Chapter  6. AppEcation  2: Estimation  of Tree Diameter  112  Distribution  (Anon., 1987). Only 49 of the 121 plots were used for this initial analysis, as age and site index were missing from the other 72 plots. Because the error terms were not assumed to be normally distributed or i i d , the partial F test used i n stepwise regression was not appropriate. Instead, all variables were forced into the regression. T h e addition of age and site index to the regression resulted i n an R change of less than 0.005; therefore, the 2  multiple linear regression of all of the 121 plots was fitted, using all of the stand variables except age and site index. Based on a m i n i m u m of 0.005 change i n R value, the system 2  of parameter prediction equations was selected as follows: a = Sn -\- 8\2 topht -\- 8\3 qdiam  -\- 8i stems  -f  totvol  -\- <5ig b + £ 1 7 c +e i  b = 821 -f 8  + 8  + #25 totvol  + 8 e a + ^27 c + e  22  topht -f 8 s qdiam 2  4  24  c = 631 + £32 topht + 833 qdiam  stems  + 8 stems 34  2  (6.142) (6.143)  2  (6.144)  + 8 a + 8 b+ e 3B  36  3  where topht is the top height or the average height of the 100 largest trees by dbh per hectare; totvol is the total volume from ground to tree tip per hectare; stems is the number of stems per hectare; qdiam 8  U  is the mean dbh for the tree of average basal area;  through 8  3e  are coefficients to be estimated.  Since the equations for the a and b parameter are the same using this 0.005 R criterion, 2  these two equations cannot be distinguished from each other. A new criterion of a minimum R change of 0.02 was set. The selected system of parameter prediction equations 2  was therefore as follows: a  —  8  b  =  £21 + £22 qdiam  c  =  £31 + £32 topht + 833 a + 834 b •+ e  n  + £12 topht + 8-13 stems + 8 stems 23  + 8 totvol X4  + 824 totvol  3  + £  (6.145)  b + t\  15  + 8 $ a -f e 2  2  (6.146) (6.147)  Chapter  6.  Application  2: Estimation  of Tree Diameter  113  Distribution  Each of the equations is unique using this criterion for retaining variables. T h e system is simultaneous i n that the endogenous variables, the parameters to be predicted, appear on both the R H S and the L H S of equations i n the system. This second system of equations was used i n the subsequent analyses.  6.4  Ordinary Least Squares Fit  6.4.1  Unweighted Simple or Multiple Linear Regression  The estimated coefficients for the chosen parameter prediction equations using unweighted multiple linear regression for each equation were as follows: pred. a  =  7.214783 -f 0.260097 topht - 9.558752 x 10~  4  stems  +0.026935 totvol - 1.225793 b pred. b  pred.c  =  =  3.639838 + 0.159000 qdiam - 1.96650 x 1 0 "  (6.148) 4  stems  +0.007499 totvol - 0.209573 a  (6.149)  4.436816 - 0.349270 topht + 0.237814 a + 0.540540 b  (6.150)  The estimated error terms using these unweighted fitted equations were used to test for serial correlation and heteroskedasticity.  6.4.2  Testing for iid Error Terms  To check for serial correlation i n each equation, the estimated error terms were sorted by the predicted L H S endogenous variable using the unweighted fit, and a graph of the current error term versus the previous error term was obtained. Simple linear regression was used to fit the regression of the current error with the previous error term for each of the three equations i n the system. The R value for the equation with a as the L H S 2  endogenous variable was 0.01937^ for b the R was 0.00041, and for c the R was 0.01379. 2  Chapter  6.  Apphcation  2: Estimation  of Tree Diameter  114  Distribution  The D u r b i n and Watson (1951) test statistic was not significant for any of the equations using an alpha level of 0.10 (0.05 for negative serial correlation and 0.05 for positive serial correlation). The error terms were therefore considered to be independent for each of the three equations. A graph of the estimated error terms versus the predicted L H S endogenous variables was examined for heteroskedasticity. The simple linear regression of the estimated error squared with the predicted L H S endogenous variable resulted in an R  2  value of 0.0030  for the a parameter, 0.1327 for the b parameter, and 0.02366 for the c parameter.  The  error terms for the first equation, with the a parameter as the L H S endogenous variable, were considered to be homoskedastic. The Goldfeld and Quandt (1965) test was used to test for heteroskedasticity for the b and c parameter equations. D a t a were ordered by the predicted L H S endogenous variable, and linear regressions were performed for the first 40 and for the last 40 observations. The middle 41 observations were excluded from both regressions.  The test statistic was found to be significant for each of these equations.  The error terms for these two equations were therefore considered to be heteroskedastic.  6.4.3  Estimating the Error Covariance M a t r i x of Each Equation  The error terms for the first equation with parameter a as the L H S endogenous variable were i i d . For the remaining two equations of the system, the error terms were considered heterogenous. For the b parameter equation, the variances of the error terms were estimated using the following model. u\  m  where u\  m  = 0.120250 x pred. b  (6.151)  is the estimated variance of the error for the second equation of the system  and the m  th  sample. The u^m values are equal to the e  2m  values from the  Chapter  6.  Application  2: Estimation  of Tree Diameter  115  Distribution  unweighted O L S fit; pred. b is the predicted b from the unweighted regression. The pred. b variable was reset to 1.0 x 1 0  - 7  if the value was less than or equal to zero  resulting i n only positive estimated variances. A linear model to estimate variances which included an intercept was also fitted, but the intercept was negative resulting in more of the estimated error terms as negative values. The model with an intercept was discarded and the zero intercept model was chosen. Using the inverse of the square root of these estimated variances as weights, a weighted regression of the b parameter equation w as r  performed.  The estimated error terms from this weighted model were graphed against  the predicted weighted b parameter.  This graph showed some heteroskedasticity, and  the regression of the estimated error terms squared, from the weighted model, versus the predicted weighted b parameter resulted i n an R  2  value was somewhat lower than the R  2  value of 0.09354.  However, this R  2  value of 0.1327 for the unweighted model; these  estimated variances were selected as weights for the b parameter equation. The c parameter equation also appeared to have non-iid error terms, although the R  2  value for the regression of the estimated error terms squared with the predicted c  values was only 0.02366. The average of the estimated error terms squared, for classes of the predicted c parameter indicated that the variance of the error terms was large for small values of the c parameter, small for medium values of the c parameter, and again large for large values of the c parameter. The Goldfeld and Quandt (1965) test for data ordered by the predicted c parameter was therefore considered to be misleading as the variances of the error did not increase with the predicted value. The average squared error terms by classes of the predicted c parameter were then used to obtain a weighted regression by using the inverse of the square root of the average estimated error squared as weights. The R of the regression of the estimated weighted error terms, squared, with 2  the predicted weighted c parameter was 0.05860. However, since the heteroskedasticity  Chapter  6.  Application  2: Estimation  of Tree Diameter  116  Distribution  was likely the result of a lack-of-fit, the error terms were considered i i d for this analysis.  6.4.4  Appropriate OLS Fit Based on Error Structure  The coefficients presented for the unweighted fit of the a and c parameter equations were considered appropriate as error terms were iid. For the b parameter equation, E G L S was used to estimate the parameters, using the estimated variances of the error terms. The estimated coefficients using the appropriate O L S fit for the system of equations were as follows: pred. a  =  7.214783 + 0.260097 topht - 9.558752 x 10~  4  stems  +0.026935 totvol - 1.225793 6 pred. b  =  pred.c  =  (6.152)  3.574736 + 0.162501 qdiam - 1.795836 x 1 0  - 4  stems  +0.007432 totvol - 0.212128 a  (6.153)  4.436816 - 0.349270 topht + 0.237814 a + 0.540540 b  (6.154)  These fitted equations were used to compare the O L S method to other methods.  6.5  Composite Model Fit  6.5.1  Derivation of the Composite Model  To obtain a composite model, the equations for the a, b and c parameters were substituted into the cumulative form of the Weibull distribution function by performing the following steps. 1. The b parameter which appears on the R H S of the a parameter equation was replaced by the b parameter equation to obtain the following equation, after simplification. a = a  n  + OL12 topht + a  1 3  qdiam  + a  1 4  stems  + « i totvol 5  + error  a  (6.155)  6. Application 2: Estimation  Chapter  of Tree Diameter  117  Distiibution  2. The a parameter which appears on the R H S of the b parameter equation was replaced by the a parameter equation to obtain the following equation. b = OL2\  -+-  a2 2  topht  +  CK23  qdiam  +  0:24  + a 5 totvol + error  stems  2  2  (6.156)  3. The equations from above, for the a and b parameters as functions of stand variables onhy, were substituted for the a and b parameters of the c parameter equation. c  — m a  +  ^32  topht  +  CK33  qdiam  + a  3 4  stems  -f  0:35  totvol  + error  3  (6.157)  These substitutions resulted in equations to predict each of the parameters of the Weibull distribution as a function of the stand variables only. These three equations were then substituted into the cumulative form of the Weibull distribution to obtain the following equation. F(X)  =  1 - e H - v ^ J  J  +  £ 4  (6.158)  where a eqn, b eqn. and c eqn are the equations given above, excluding the error terms.  6.5.2  Unweighted Regression of Composite M o d e l  The composite equation was fitted using a nonlinear optimization routine (Vaessen, 1984) to minimize the sum of squared deviations between the actual and predicted values for the three percentiles of the 121 plots, simultaneously. Starting points for the nonlinear fit were the estimated coefficients obtained from the multiple linear regression of each of the three parameters as a function of the stand variables only. For each iteration, the estimated coefficients for the composite model were used to calculate estimated a, 6, and c parameters.  Because the b and c parameters of the Weibull equation must be greater  than zero, and the a parameter must be nonnegative, the sum of squared deviations was calculated by taking the absolute value of each negative parameter. The fitted function  Chapter  6. Application  2: Estimation  of Tree Diameter  Distribution  118  from the last iteration (minimum sum of squared error) resulted i n seven of the 121 plots having an estimated b parameter which was negative.  Since the b parameter is  the denominator of a ratio and the numerator of this ratio was positive, the ratio was negative when b was negative.  Since this ratio is raised to the c power, a negative b  parameter resulted i n a mathematical error. To attempt to restrict the resulting fit for a nonnegative a parameter, and b and c parameters greater than zero, one was added to the absolute value of negative parameters.  T h e fitted function resulted i n estimated b and  c parameters which were greater than zero for all 121 plots. Some of the a parameters were negative; since these values were close to zero and the a parameter is the location parameter, no further attempts were made to restrict the fitted function. The fitted composite model, partitioned into the aeqn, the beqn and the ceqn was as follows: pred. a eqn =  5.051160 - 0.377366 topht + 0.002504 qdiam +0.000263 stems + 0.092034 totvol  pred. b eqn =  0.787755 + 0.559818 topht + 0.004303 qdiam -0.000433 stems + 0.275654 totvol  pred. c eqn =  (6.160)  7.468789 - 0.065044 topht + 0.014784 qdiam -0.000122 stems - 0.2UU1  6.5.3  (6.159)  totvol  (6.161)  Testing for iid E r r o r Terms and Weighted Regression  Because the composite model is a nonlinear model, no testing of the error terms for serial correlation or heteroskedasticity was performed. T h e unweighted fit was therefore used to compare to the other two methods. Comparison was restricted to goodness-of-fit measures only.  Chapter  6.6  6. Apphcation  2: Estimation  of Tree Diameter  119  Distribution  MSLS Fit  6.6.1  First Stage Equations  Each of the parameter prediction equations has endogenous variables on the R H S . T h e system of equations was identified. The first step to obtaining a.n M S L S fit was to fit each of the R H S endogenous variables, parameters a and b as a function of all of the exogenous variables i n the system. Multiple linear regression was used to obtain the following first stage equations. pred. a  =  l e t  6.014739 + 0.054645 topht - 0.066945  qdiam  -9.60807 x 1 0 " stems + 0.024632 totvol  (6.162)  4  pred. b  =  Ui  1.476369 + 0.057970 topht + 0.156755 qdiam +4.26610 x 1 0  - 5  stems + 0.001511 totvol  (6.163)  A first stage equation was not required for parameter c, because c appears only on the L H S side of the system of equations.  6.6.2  Second Stage Equations  The second stage equations were derived by replacing the endogenous variables, a and b which appear on the right hand side of equations i n the parameter prediction system with the predicted endogenous variables from the first stage equations. Multiple linear regression was then used to fit each of the second stage equations. pred. a  =  2 n d  6.645248 + 0.079403 tophi  - 9.42588 x 1 0  - 4  stems  +0.025278 totvol - 0.427068 pred. b  ut  pred.b d 2n  =  (6.164)  ~  4.904405 + 0.227774 qdiam + 0.001062 stems -0.024620 totvol + 1.060856 pred. a  u t  (6.165)  Chapter  6. Application  pred.c  2nd  2: Estimation  =  of Tree Diameter  120  Distribution  5.177297 - 0.300508 topht + 0.331522  pred.a  lst  +0.064493 pred. b  (6.166)  lst  where pred.a  ut  and pred.b  lsi  are the predicted values for a, and b, respectively, using  the first stage equations.  6.6.3  Testing for iid Error Terms  Each of the second stage equations was purged of the simultaneity bias by using predicted values from the first stage equations to replace R H S endogenous  variables.  The residuals from the second stage equations were therefore used as estimates of the error terms, because estimated coefficients are consistent. To test for serially correlated error terms, the estimated error terms for each equation were ordered by the predicted L H S endogenous variable from the second stage equations, and graphs of the estimated error term versus the previous error term were examined. The R value for the regression of the estimated error term with the previous error term 2  was less than 0.005 for the b parameter equation. R  2  For the a parameter equation, the  value was 0.04099, T h e Durbin and Watson (1951) test was inconclusive for positive  serial correlation (alpha of 0.05), and was not significant for negative serial correlation (alpha of 0.05). T h e runs test was therefore used to test for serial correlation, positive or negative, using an alpha level of 0.10. T h e test statistic was not significant. For the c parameter equation, the R value was 0.01215; the D u r b i n and Watson test was 2  not significant for positive or negative serial correlation. The error terms were therefore  r considered to be independent for all three equations. To check for heteroskedasticity, the regression of the estimated error squared with the predicted L H S endogenous variable was performed.  T h e R value for the a parameter 2  equation was 0.0020, for the b parameter equation was 0.1137, and for the c parameter  Chapter  6.  Apphcation  2: Estimation  of Tree Diameter  121  Distribution  equation was 0.0147. T h e a parameter equation was therefore considered homoskedastic. The Goldfeld and Quandt (1965) test for the b parameter equation was significant (test statistic of 3.04149) for an alpha level of 0.05, excluding the 41 observations representing middle values of the predicted b parameter. The b parameter was therefore considered to have heteroskedastic error terms. A n examination of the estimated error terms for the c parameter equation indicated that the variances of the error were likely not monotonic, as with the O L S fit, since this may be associated with a lack of fit, this equation was considered to be homoskedastic.  6.6,4  Estimation of the Error Covariance Matrix  The estimation of the error covariance matrix first required the estimation of the variances of the error terms for the 6 parameter equation. The fitted equation for the estimated variances of the error term was as follows: u\  m  where u\  m  = 0.164154 x pred.b  (6.167)  2nd  estimated variance of the error term for the second equation and the  m  th  sample; pred.b  2nd  is the predicted b parameter from the second stage equations.  The predicted b parameter was reset to 1.0 X 1 0  - 7  if the value was zero or negative, so  that only positive values were predicted for the variance of the error. The R  2  value for  the regression of the estimated error terms, squared, with the predicted b parameters from the weighted second stage model was 0.07497, which is somehat less than the R  2  value of 0.1137 from the fit of the unweighted second stage model. These estimated variances of the error terms were used to obtain an E G L S fit of the b parameter equation.  Contemporaneous variances were then calculated using the  Chapter  6. Application  2: Estimation  of Tree Diameter  122  Distribution  estimated error terms from the following equations. pred. a  2 n d  =  6.645248 + 0.079403 topht - 9.42588 x 1 0 "  +0.025278 totvol - 0.427068 pred. b '  =  pred.b  2ndwted  -6.591194u>* + 0.242275 qdiam  wted  -0.030403 totvol =  pred.c  2nd  stems  (6.168)  ut  + 0.001292  + 1.302598 pred. a  wted  4  +0.064493 pred. b  wted  (6.169)  u t w t e d  5.177297 - 0.300508 i o p f a + 0.331522 pred. a  stems  l r t  (6.170)  lst  where wt is the inverse of the square root of the estimated variance of the error term; pred.b  is the b parameter times the weight;  2ndwied  is quadratic mean diameter times the weight;  qdiam  wted  stems  w t e d  totvol  w U d  pred. a  is stems per hectare times the weight; is volume per hectare times the weight;  l B t w l e d  is the predicted a parameter times the weight.  Contemporaneous variances were calculated using equation 3.28, resulting i n the following S matrix. 5.2812  -1.1863  0.4059  -1.1863  0.8688  0.2898  0.4059  0.2898  2.8225  (6.171)  where diagonal elements are the variances for an equation and off-diagonal elements are covariances between equations. The error terms between the a and b parmeter equations appeared negatively correlated, whereas the c parameter Avas positively correlated with the other two equations of the system. Based on equation 3.51, the estimated error covariance matrix was therefore the  Chapter  6. Apphcation  123  2: Estimation  of Tree Diameter  Distribution  5.2812W Wi  -1.1863W!W  2  0.4059W!W  -1.1863W Wi  0.8688W W  2  0.4059W W;  0.2898W W  2  following matrix. 1  fi =  2  2  3  3  0.2898W W 2  3  (6.172)  3  2.8225W W 3  3  Because the a and b parameter equations have i i d error terms, W j and W the identity matrix of size n by n. The W  2  3  are equal to  matrix is an n by n matrix with the diagonal  elements equal to the square root of the estimated variances of the error terms for the second stage b parameter equation. The simplified error covariance matrix was therefore as follows: r  fi =  5.2812I  n  -1.1863W  2  0.4059I 6.6.5  n  -1.1863W . 2  0.8688W W 2  2  0.2898W  2  0.4059I  n  0.2898W  2  (6.173)  2.8225L,  E G L S t o F i t t h e S y s t e m of E q u a t i o n s  The last step of the M S L S fitting technique is to use the estimated error covariance matrix, and obtain an E G L S fit of the system of equations simultaneously. The resulting M S L S fit was as follows: pred.a MSLS  =  6.572920 + 0.076069 topht - 9.476689 x 1 0  - 4  stems  +0.025394 totvol - 0.403223 pred. b  (6.174)  lst  pred. b MSLS  =  -6.095781 + 0.235510 qdiam  + 0.001234 stems  -0.028857 totvol + 1.235273pred. a pred. c MSLS  =  (6.175)  u t  5.204059 - 0.299683 topht + 0.330314pred. a  u t  +0.058220 pred. b  ut  (6.176)  The I M S L S technique was not used, because the number of samples was considered large enough for the asymptotic properties of the M S L S technique to be assumed.  Chapter  6.7  6.  Application  2: Estimation  of Tree Diameter  124  Distribution  Comparison of the Three Fitting Techniques  6.7.1  Goodness-of-fit Measures  Fit Index The F i t Indices for each of the four equations for the O L S and the M S L S fits are presented i n Table 6.15. The F i t Indices for the M S L S fit were lower than those of the  Table 6.15: F i t Indices for O L S and M S L S Fits of the D b h Distribution System Endogenous Variable  a parameter  b parameter  c parameter  OLS MSLS  0.7383 0.6398  0.6744 0.5383  0.1563 0.1057  O L S fit. The result was expected as the O L S fit minimizes the squared difference between the endogenous variable and the corresponding predicted value, whereas the M S L S fit minimizes the squared difference between weighted values. T h e differences between the O L S and the M S L S F i t Indices are larger for this second application than the differences shown for Application 1. For the composite model, the F i t Index was calculated using the following equation. F  I  =  ffUi  S^ where F (DI),  (0-24 - F (DI))  + (0.63 - F (D2))  2  = 1  + (0.93 - F  2  (DZ))  (0.24 - 0.3333) + (0.63 - 0.3333) + (0.93 - 0.3333) 2  2  2  2  F (D2), and F (DZ) are the estimated cumulative probabilities from  the estimated Weibull distribution up to the Dl,  D2, and D 3 diameter limits,  respectively; 0.3333 is the mean of all three percentiles for the 121 plots; 0.24, 0.63, and 0.93 were the percentiles selected to represent each plot.  Chapter  6.  Apphcation  2: Estimation  In order to calculate F(D1),  F(D2),  of Tree Diameter  and F(D3),  Distribution  125  the a parameter was reset to 0.0 if a  negative value was predicted. The F i t Index using this equation for the composite model fit was 0.6442. To compare to the O L S technique, the predicted values for the a, b, and c parameter using the final O L S equations were used to obtain the estimated cumulative probabilities.  One of the plots had an estimated a parameter which was reset to 0.0,  another had a negative b parameter which was reset to 1.0 X 1 0 , and another had a - 7  negative c parameter which was not reset. The overall F i t Index for the O L S technique, using the above equation, was 0.6679.  Similarly, an overall F i t Index for the M S L S  technique was calculated using predicted a, b, and c parmeters from the final M S L S fit. For one of the 121 plots, the predicted value of the a parameter was negative, and was reset to 0.0. The overall F i t Index for the M S L S technique was 0.7125. The overall F i t Index was lowest for the composite model fit which may have resulted because a local rather than a global m i n i m u m may have been found using N L P . Also, in the composite model, the parameters of the Weibull distribution are a function of the stand parameters alone and are not a function of the other parameters of the Weibull distribution. The M S L S overall F i t Index was slightly higher than the overall O L S F i t Index, which may be due the simultaneous fit of the three parameters using the M S L S technique. The presence of negative b and c parameters from the O L S fit may also have caused the lower overall Fit Index.  M e a n Absolute Deviation The mean absolute deviations ( M . A . D . ) were calculated by class for each of the L H S endogenous variables. The classes were created in a similar manner as for Apphcation 1, with each class having 24 observations except the last class which had 25 observations. The M . A . D . values for the O L S fit and for the M S L S fit are presented in Table 6.16 and Table 6.17.  As with the Application 1, the O L S technique resulted in lower M . A . D .  Chapter  6.  Application  2: Estimation  of Tree Diameter  Distribution  126  Table 6.16: M . A . D . for Five Classes for the O L S F i t of the D b h Distribution System Classes, from Low to High Values Endogenous Variable  1  2  3  4  5  a parameter b parameter c parameter  1.884 0.407 1.174  0.706 0.489 0.924  0.841 0.344 0.551  1.083 0.521 0.356  1.951 0.910 2.445  Table 6.17: M . A . D . for Five Classes for the M S L S F i t of the D b h Distribution System Classes, from Low to High Values Endogenous Variable  1  2  3  4  5  a parameter b parameter c parameter  2.901 0.557 1.349  1.257 0.524 1.009  0.701 0.395 0.617  0.798 0.638 0.431  2.221 1.149 2.466  Chapter  6.  Application  2: Estimation  of Tree Diameter  Distribution  127  values across the range of the endogenous variables, although the differences between the O L S and M S L S M . A . D . values by class, were small. For the composite model fit, an M . A . D . value was calculated for each percentile as shown below for the first percentile. M.A.D.  5&  =  = 1  10.24(01)  (6.178)  n  Negative values for estimated parameters were reset as for the calculation of F i t Index. The M . A . D . for the 24 percentile  was 0.190, for the 6 3  th  the 9 3  rd  percentile was 0.075.  to obtain F(D1), the 6 3  rd  F(D2),  the 9 3  rd  th  and F(D3),  the M . A . D . for the 24 rd  th  percentile was 0.236, for  percentile was 0.067. For the M S L S fit,  percentile was 0.220, for the 6 3  percentile was 0.070.  percentile was 0.276, and for  Using the estimated parameters from the final O L S fit  percentile was 0.239, and for the 9 3  the M . A . D . for the 24  rd  rd  percentile was 0.223, and for  The M . A . D . values were much the same for the three  percentiles.  M e a n Deviation The mean deviations ( M . D . ) were calculated for the same classes as with the M . A . D . values. The results for the O L S fit are presented i n Table 6.18, and for the M S L S fit in Table 6.19.  As with the M . A . D . values by class, the M . D . values by class were lower with  the O L S fit; however, the differences between the O L S and the M S L S fits are generally small. Also, the trends of over- and underestimation across the range of the endogenous variables are similar with the two fits. For the composite model fit, an M . D . value was calculated for each percentile as shown for the first percentile. M.D.  =  E»  = 1  (0.24-Fpl)) n  (6.179)  Chapter  6.  Application  2: Estimation  of Tree Diameter  Distribution  128  Table 6.18: M . D . for Five Classes for the O L S F i t of the D b h Distribution System Classes, from Low to High Values Endogenous Variable a parameter b parameter c parameter  1  2  0 o  4  5  -0.727 -0.293 -1.079  -0.110 -0.448  -0.457 -0.268 -0.531  -0.219 0.224  1.452 0.752  0.073  2.335  -0.896  Table 6.19: M . D . for Five Classes for the M S L S F i t of the D b h Distribution System Classes, from Low to High Values Endogenous Variable  1  2  3  4  5  a parameter b parameter c parameter  -1.421 -0.529 -1.349  -0.324 -0.484 -0.844  -0.420 -0.329 -0.536  0.262 0.242 0.159  1.832 1.064 2.466  Chapter  6.  Application  2: Estimation  of Tree Diameter  129  Distribution  Negative estimated parameters were reset as for the calculation of F i t Index. The M . D . for the 24  th  percentile was 0.175, for the 6 3  percentile was -0.030. M . D . for the 2 4  th  percentile was 0.185, for the 6 3 th  rd  percentile was 0.197, and for the 9 3  For the O L S fit, using the estimated Weibull parameters,  percentile was -0.054. The 24 the 9 3  rd  and 6 3  rd  rd  the  percentile was 0.033, and for the 9 3  rd  rd  percentiles were therefore underestimated and  percentile was slightly overestimated. For the M S L S fit, the M . D . for the 24  percentile was 0.182, for the 6 3  rd  percentile was 0.035, and for the 9 3  rd  th  percentile was  -0.064, much the same as for the O L S fit.  6.7.2  Relative Variances  The trace of the estimated coefficient covariance matrix for the M S L S fit was 49.64200, whereas the trace of the O L S fit was 4.7411 for the system of equations.  Because the  system of parameter equations was simultaneous, the O L S estimate of the coefficient covariance matrix cannot be used to calculate confidence limits for the true parameter. However, because the simultaneity bias was removed i n the first step of the M S L S technique, the estimated coefficient covariance matrix from the M S L S fit is consistent. Confidence limits using these O L S variance estimates would appear, incorrectly, to be much narrower than the those using the consistently estimated coefficient covariance matrix from the M S L S fit. The traces for the submatrices of the estimated coefficient covariance matrix corresponding to each individual equation are given i n Table 6.20. Unlike Application 1, all of the equations of the dbh distribution system have endogenous variables on the R H S . The O L S estimates of the variance of the coefficients appear lower for all of the equations of the system, similar to the trace for the overall matrix. The comparison of the coefficient covariance matrix for the composite model with those from the other fitting techniques was not possible, because the L H S variable of the  Chapter  6.  Apphcation  2: Estimation  of Tree Diameter  Distribution  130  Table 6.20: Trace of the Coefficient Covariance Ma.trix for Each Equation of the D b h Distribution System Endogenous Variable OLS MSLS  a parameter  b parameter  c parameter  3.690011 5.056920  0.170987 42.410241  0.880147 1.174840  composite model does not appear on the L H S of equations of the system fitted by the O L S or the M S L S procedure. Also, the variances were not calculated for the composite model.  6.7.3  Table of Estimated Coefficients and Standard Deviations  A summary of the estimated coefficients and their associated standard deviations from the O L S and M S L S fits is shown i n Table 6.21. The estimated standard deviations for the coefficients were higher for the M S L S fit than for the O L S fit, but the O L S estimates of the coefficients are inconsistent, and so these standard deviations would result in underestimated confidence intervals for all of the coefficients i n the system. The coefficients for the a parameter and c parameter equations were similar for the two techniques, but, for the b parameter equation, the coefficients were quite different. Since the M S L S estimated coefficients are similar to the unweighted multiple least squares fit of each of the second stage equations (equations 6.164, 6.165, and 6.166), the difference in coefficients must be due to simultaneity bias. Because the O L S results in biased coefficients, hypothesis statements tested using the results from the O L S fit will be incorrect. For instance, using a value of 1.96 from the normal distribution, confidence intervals for the <5 coefficient which was associated 15  with the b parameter in the first equation, were calculated as -1.58437 to -0.86721 from  Chapter  6.  Apphcation  2: Estimation  of Tree Diameter  Distribution  131  Table 6.21: Estimated Coefficients and Standard Deviations of Coefficients for the D b h Distribution System Endogenous Variable a parameter on 012 013 014 015  Coefficients  Standard Deviations OLS MSLS  OLS  MSLS  7.214783 0\260097 -0.000956 0.026935 -1.225793  6.572920 0.076069 -0.000948 0.025394 -0.403223  1.908552 0.118161 0.000133 0.003106 0.182949  2.199947 0.160974 0.000152 0.003645 0.437296  3.574736 0.162501 -0.000180 0.007432 -0.212128  -6.095781 0.235510 0.001234 -0.028857 1.235273  0.411770 0.020156 0.000047 0.001329 0.031998  6.516691 0.057110 0.000960 0.024576 0.969079  4.436816 -0.349270 0.237814 0.540540  5.204059 -0.299683 0.330314 0.058220  0.917986 0.090431 0.056247 0.161578  1.027642 0.118933 0.088472 0.311160  b parmeter 021 £22 023 024 <?>25  c parameter 031 032 033 0~34  Chapter  6.  Apphcation  2: Estimation  of Tree Diameter  132  Distribution  the O L S fit, and as -1.26032 to 0.45388 from the M S L S fit. Using the results from the M S L S fit, this coefficient would be considered to be zero, whereas for the O L S fit, the coefficient is nonzero. Similar differences i n hypothesis testing were noted for the 8 5 and 2  834 coefficients.  6.7.4  R a n k i n g for Other Features  The ranks assigned to each fitting technique are shown i n Table 6.22. For the Infor-  Table 6.22: Ranks for the Three Techniques for the F i t of the D b h Distribution System Feature Information Consistency Confidence Limits Asymptotic Efficiency Compatibility Ease of fit Total  OLS 2 1  Composite 1 2  MSLS 2 2  1  2  2  1 1 3 9  .1 2 1 9  -  2 2 2 12  mation feature, the composite model was assigned a rank of one as only one endogenous variable was estimated.  T h e only consistent estimates were the M S L S coefficient esti-  mates, as the nonlinear least squares procedure used to fit the composite model was restricted to obtain positive values for the predicted b and for the c parameter values. Confidence limits can be calculated for M S L S as the sample size was quite large and the estimated coefficients are asymptotically normally distributed. For the composite model, confidence limits could be calculated if generalized nonlinear least squares were used to fit the model, because estimates of the variances of the coefficients could be obtained. In terms of efficiency, the M S L S approach should be more efficient than the O L S approach;  Chapter  6.  Application  2: Estimation  of Tree Diameter  Distribution  133  however, since the O L S estimates of all of the coefficients of this system were biased and inconsistent, this expected increase in efficiency using the M S L S approach was difficult to witness. The M S L S and composite model fits both result in compatibility, whereas the O L S fit does not. The number of steps for the M S L S fit is greater than that for the O L S fit; for the composite model, the number of steps was not calculated as no attempt was made to estimate the variances of the coefficients of the nonlinear composite model. Because the fit of the nonlinear composite model would require simultaneous estimation of the error covariance matrix and the coefficients if the error terms were non-iid, the composite model was considered to be the most difficult to fit. Also, the number of coefficients to be estimated in the nonlinear composite model was 15; the chance of obtaining a local minimum rather than a global minimum is probably high with this large number. The composite model was therefore given a low "ease of fit" ranking. The M S L S technique was therefore assigned the highest sum of ranks for this second application.  6.8  Conclusion  The chosen system of parameter prediction equations was simultaneous. However, the L H S variable of the third equation, the c parameter, did not appear on the R H S . One of the equations, the 6 parameter equation, was found to have heteroskedastic error terms using the O L S technique.  The error terms for the c parameter equation  indicated some lack-of-fit, but the error terms for this equation were assumed to be iid, as were the error terms for the a parameter equation.  E G L S was used to fit the  b parameter equation, whereas multiple linear regression was used to fit the other two equations. For the composite model, a nonlinear model was derived with 15 coefficients to be  Chapter  6.  Application  2: Estimation  of Tree Diameter  Distribution  134  estimated. The equation was fitted using restricted nonlinear least squares. No attempt was made to check for serial correlation or heteroskedasticity. The unweighted fit of the second stage equations, for the M S L S fit, indicated that the characteristics of the error terms were similar to those of the error terms from the unweighted fit for the O L S technique. The variances of the error terms for the second stage b parameter equation were estimated and the equation was refitted using E G L S . The estimated error covariance matrix was then used to obtain the final M S L S fit. The goodness-of-fit measures for the individual parameter prediction equations were best for the O L S fit, as expected. Unlike Application 1, the goodness-of-fit measures for the M S L S fit were somewhat worse than for the O L S fit. The goodness-of-fit measures for the composite model were compared to the simulated composite model using the O L S fit, and, separately, the M S L S fit. The goodness-of-fit measures, in this case, were generally best for the M S L S fit, except for the M . D . values; however, since the O L S fit resulted in one negative predicted b parameter and one negative predicted c parameter, the goodness-of-fit measures may be somewhat misleading for the simulated composite model. The goodness-of-fit measures for the composite model were the worst. The trace of the estimated coefficient covariance matrix was lowest with the O L S fit. The trace values for the submatrices of this matrix, corresponding to each equation of the system, were lower for O L S than for M S L S . Since all three of the parameter prediction equations have R H S endogenous variables, these results are similar to those found for the total volume and volume ratio equation of Application 1 which also had R H S endogenous variables.  Since the O L S fit coefficients are biased and inconsistent, these estimated  variances can not be used to obtain confidence limits for the true coefficients. The summed rank for other features was highest for the M S L S fit. The composite model was given a low ease-of-fit rank because the simultaneous estimation of the 15 coefficients was difficult, and if estimates of the variances of these coefficients is also  Chapter  6.  Application  2: Estimation  of Tree Diameter  Distribution  135  desired, the estimation would be even more difficult. The O L S fit resulted in biased and inconsistent estimates as all of the equations have R H S endogenous variables. In terms of goodness-of-fit, the O L S fit was better; if a fit of the sample data were required and this fit was not to be used for other samples of the population, the O L S fit would be appropriate. However, the O L S fit results i n inconsistent estimates of the coefficients, and the results could not be used to test hypothesis statements and should not be used for other sample data. The composite model fit would require a difficult fitting procedure to obtain estimates of the coefficient covariance matrix, and the goodness-offit measures were lower than those of the M S L S fit for the simulated composite model. Because tests of hypothesis statements are useful, and the fitted equation may be used for other samples of the population, the M S L S approach was considered the best technique for fitting this system of equations; the coefficients and the estimated variances of the coefficients are consistent.  Chapter 7  Application 3: Volume G r o w t h and Y i e l d  7.1  Introduction The management of forest resources requires accurate information about the current  and future wood supply. Systems of equations have been widely used to represent forest growth and yield.  Simultaneous fitting techniques such as 2SLS, 3SLS, restricted  3SLS, and minimizing loss functions for the system have been used to fit these equations (Borders and Bailey, 1986; Burkhart and Sprinz, 1984; Furnival and W i l s o n , Jr., 1971; Hans, 1986; M u r p h y and Beltz, 1981; M u r p h y and Sternitzke, 1979; Reed et al, 1986). However, for each of these studies, the assumption was made that individual equations of the system have i i d error terms. Permanent sample plots are often used i n fitting growth and yield systems. Because psps are measured repeatedly over time, error terms may be serially correlated, and also, the variance of the error terms may differ among plots.  The presence of serial  correlation between error terms will depend on the length of time between measurements and whether overlapping intervals are used i n estimation (Borders et al, 1987a). The use of simultaneous fitting techniques for systems of growth and yield equations assuming that the error terms are i i d may therefore be less efficient than the M S L S technique, depending on the degree of serial correlation or heteroskedasticity of the error terms.  136  Chapter  7.2  7. Apphcation  3: Volume  Growth  and  137  Yield  D a t a Preparation The psp data used for Application 2 were also selected for this apphcation. Plots  with more than 80 percent pine by volume, breast height age recorded, and no treatment applied were selected. D a t a from Northeastern Alberta were deleted so that only lodgepole pine was represented i n the data.  One plot was selected from each cluster  of four plots at a location. Summary information from the establishment measurement and the subsequent measurement was selected; any plot having only the establishment measurement was deleted from the data. D a t a were graphed and no outliers were noted. The distribution of-the remaining 28 plots is given i n Table 7.23.  Table 7.23: Distribution of Selected Psps for the T h i r d Application Age at Breast Height Stems per Hectare 0 to 1000 1001 to 2000 2001 to 3000 3001 to 4000 4001 to 5000 5001 to 6000 6001 to 7000 Total  21 to 40  41 to 60  61 to 80  81 to 10.0  1 4 1  3  3  1  2  101 to 120  121 to 140  Total  1  1  3  4  1  11  1  6  1 1  5  1  1  1  1  1 7  1 5  5  2  6  3  28  Chapter  7.3  7.  Apphcation  3: Volume  Growth  and  Yield  138  Model Selection  T w o equations of the growth and yield system developed by Clutter (1963) were selected for analysis. These equations were as follows: lnBA  =  2  lnV  A  =  2  where InBAi  InBA^+aAl-^f] 0o+  and lnBA  V  2  AJ  +a (l-^f] 2  \  2  0i SI + / 3 - J - + 0 lnBA 2  3  A1  error  SI+  (7.180)  1  2  + error  (7.181)  2  2  are the natural logarithms of the basal area per hectare  2  measured at times 1 and 2, respectively; Ai and A  are the number of years counted at breast height  2  at times 1 and 2, respectively; SI is the site index for a reference age of 50 years measured at breast height; In V is the natural logarithm of volume per hectare at time 2; 2  ai and c t , and 0 2  errori  and error  O  through 0  3  are coefficients to be estimated;  are the error terms.  2  These equations were fitted simultaneously by Burkhart and Sprinz (1984) using a mini m u m loss function for the two equations. Borders and Bailey (1986), M u r p h y and Sternitzke (1979), and Hans (1986) used simultaneous fitting techniques from econometrics, assuming that error terms of individual equations are i i d , to fit modifications of these equations. The equations can be rearranged as follows: (in BA  2  - In BA  X  ^) lnV  2  + 6  (l - ^ )  =  *  =  $21 + 8 22 SI + 8  u  12  23  + S  13  —  (l - ^ )  + 8  24  In BA  2  SI + e + e  2  1  (7.182) (7.183)  ^2 The term InBAij^  is a combined term where BAi  is a lagged endogenous variable  which can be treated as a predetermined variable, and the term j+  is an exogenous  Chapter  7. Apphcation  3: Volume  Growth  and  139  Yield  variable. This combined term can be treated as a constant which is subtracted from the endogenous variable, lnBA , 2  area equation above.  to obtain the L H S endogenous variable shown i n the basal  A n intercept coefficient was added to this basal area equation,  although this coefficient is expected to be close to zero.  7.4  O r d i n a r y Least Squares Fit  7.4.1  Unweighted Simple or Multiple Linear Regression  The estimated coefficients using multiple linear regression to fit each equation of the system individually were as follows: pred.  (in BA  2  =  - In BAj  -0.017785 + 4.458907 ( l -0.011270 ( l - ^ )  pred. In V  2  =  Si"  (7.184)  2.315771 -f 0.066766 S J - 47.758153-^jA  -(-1.014053 In BA  2  2  (7.185)  The estimated error terms from the unweighted fit were then used to test for serial correlation and heteroskedasticity.  7.4.2  Testing for iid E r r o r Terms  To check for serial correlation i n each equation, the estimated error terms were sorted by the predicted endogenous variables, and a graph of the current error term versus the previous error term was obtained. T h e R  2  value for the simple linear regression of the  current error term with the previous error term was 0.00965 for the basal area equation and 0.011511 for the volume equation. The D u r b i n and Watson (1951) test statistic for each model was not significant for either positive or negative serial correlation using an  Chapter  7.  Application  3:  Volume  Growth  and  140  Yield  alpha of 0.10 (0.05 for positive and 0.05 for negative serial correlation).  Each equation  of the system therefore had independent error terms. To test for heteroskedasticity,  a graph of the estimated error versus the predicted  L H S endogenous variable was done.  The regression of the estimated error, squared,  with the predicted variable on the L H S of the basal area equation resulted in an  R  2  value of 0.00270. The error terms for this equation were therefore considered indentically distributed. The regression of the estimated error, squared, with the predicted logarithm of volume from the second equation, resulted i n an R value of 0.01425. The test statistic 2  for the Goldfeld and Quandt (1965) test was 1.9149 for nonincreasing variance with increasing predicted logarithm of volume. The critical F value for 6 degrees of freedom for the numerator and the denominator, and an alpha level of 0.05 was 4.28. The error terms of the second equation of the system were therefore considered identically distributed.  7.4.3  Appropriate O L S Fit Based on E r r o r Structure  Because each of the equations of the system had iid error terms, the unweighted O L S fit was considered appropriate. This unweighted fit was therefore used i n the comparison with other methods.  7.5  Composite M o d e l  7.5.1  Fit  Derivation of the Composite M o d e l  A composite model was derived by Sullivan and Clutter (1972), by substituting the equation for the logarithm of basal area at time 2 into the equation to estimate yield at time two. The resulting equation was as follows: lnV  2  =  7o + 7 i SI + 7  2  +^/ lnBA 3  1  ^  + 7 4 (l  -  Chapter  7.  Application  3: Volume  Growth  +75 ( l ~ ^ )  and  SI + e  141  Yield  (7.186)  3  where 7 through 75 are coefficients to be estimated; 0  e is the error term. 3  7.5.2  Unweighted Regression of the Composite Model  A l l of the R H S variables of the composite model can be considered predetermined variables and are uncorrelated with the error term. T h e fit using multiple linear regression results i n unbiased estimates of the coefficients and their variances. T h e fitted equation was as follows: pred. In V  2  =  2.513266 + 0.063926 SI - 39.294313 -j- + 0.961204 A-i  In BA-i ^  7.5.3  + 3.655157 (  x  ~ ^ )  ~ 0.003875 ( l - ^  5/(7.187)  Testing for iid Error Terms  D a t a were ordered by the predicted logarithm of volume using the unweighted fit of the composite model, and a graph of the estimated error with the previous estimated error term was examined for serial correlation. T h e regression of the current error term with the previous error term resulted in an R  2  value of 0.0082, and the D u r b i n and Watson  test statistics for positive and for negative serial correlation were not significant for an alpha of 0.10 for both tests. T h e error terms for the composite model were therefore independent. To test for heteroskedasticity. a graph of the estimated error term with the predicted logarithm of volume was obtained, and the simple linear regression of the estimated error term, squared, with the predicted logarithm of volume was performed.  The R  2  value was 0.03734. T h e Goldfeld and Quandt (1965) test statistic was 1.0567 assuming  Chapter  7.  Application  3: Volume  Growth  and  142  Yield  that the variance is nonincreasing with increasing predicted logarithm of volume. This test statistic was less than the critical F value of 5.05 for 5 degrees of freedom for the numerator and for the denominator, and for an alpha of 0.05.  The error terms were  therefore considered homoskedastic.  7.5.4  Weighted Fit of the Composite Model  The error terms for the composite model were found to be iid. The unweighted fit was therefore used to compare to other fitting techniques.  7.6 7.6.1  MSLS Fit First Stage Equations  The two equation of the system met the rank and order conditions and were identified. For the second equation of the system, In BA  appears on the R H S and is the part of  2  the complex variable which appears as the endogenous variable on the L H S of the first equation.  To remove the simultaneity bias for the second equation, the following first  stage model was obtained. pred.  (lnBA  V  2  - lnBA  1  ^)  Aj  = l  B  0.163469 - 0.013701 SI - 1.026070 4~  A  t  + 0.026270 ( l - -^j  (l - -^j  12 / The predicted In BA  2  V  + 4.071904 2  (7.188)  SI  -12 <  values were then recovered from this first stage equation using the  following equation. pred.lnBA  2lst  =  /  pred. [InBA  2  - I n B A  4 \ 1  ^ )  +lnBA -± 1  A  (7.189)  Chapter  7.6.2  7. Application 3: Volume Growth and  143  Yield  Second Stage Equations  T h e second stage equations were derived by substituting the predicted value for the logarithm of basal area at time 2 from the first stage equation into the R H S of the In V equation. Because the second stage equation for lnBA  2  ^  — InBAi  2  has only exogenous  variables on the R H S , the unweighted O L S fit of this equation was the same as the fit of the second stage equation. pred.  (lnBA  2  ^  - lnBA  1  =  ^f]  -0.017785 + 4.458907 ( l -  ^ 2 / 2nd  ^  -0.011270 ( l pred.lnV  =  22nd  1  (7.190)  SI  2.550358 + 0.066408 5 / - 47.37629 - j A  +0.9503109 pred. lnBA  7.6.3  A_  2  (7.191)  2ut  Testing for iid E r r o r Terms  The basal area second stage equation was the same as the basal area equation for the unweighted O L S fit; therefore, the error terms for this second stage equation were i i d . To test for serial correlation in the second stage volume model, the estimated error terms were ordered by the predicted logarithm of volume, and a graph of the estimated error term with the previous error term was obtained.  The R  2  value for the simple  linear regression of the current with the previous error term was 0.01856. The D u r b i n and Watson (1951) test statistics for positive and negative serial correlation were not significant for an alpha of 0.10 for the two sided test. The error terms were therefore considered independent for the second stage logarithm of volume equation. To test for heteroskedasticity,  a graph of the estimated error with the predicted log-  arithm of volume using the second stage model was obtained.  The regression of the  estimated error, squared, with predicted logarithm of volume resulted i n an R  2  value  Chapter  7. Application  of 0.0950.  3: Volume  Growth  and  144  Yield  T h e Goldfeld and Quandt (1965) test for an alpha of 0.05 was not signifi-  cant. The error terms of the second stage logarithm of volume equation were therefore identically distributed.  E s t i m a t i o n of the E r r o r C o v a r i a n c e M a t r i x  7.6.4  Because the equations of the system have i i d error terms, the M S L S procedure simplifies to the 3SLS procedure. Estimates of contemporaneous variances using equation 3.28 were calculated from the estimated error terms of the second stage models, resulting i n the following S matrix. r  0.0067  0.0060  0.0060  0.0078  (7.192)  The error covariance matrix for the system was therefore as follows:  fi  0.0067I  0.0060I  n  0.0060L,  n  (7,193)  0.00781,,  The covariance of the error terms between equations 0.0060 was quite high relative to the variance of the error terms within each equation which were 0.0067 and 0.0078. T h e equations therefore appear to be correlated.  7.6.5  E G L S to F i t the S y s t e m of E q u a t i o n s  The final step of the M S L S fit was to fit the system simultaneously using the estimated error covariance matrix, fl.  The resulting M S L S fit of the system of equations was as  follows:  pred.  (lnBA V  2  - I n B A ^ ) A/ 2  MSLS  =  -0.0480136 + 4.588017 f l \  -0.008839  -  5"!  ^ ) AJ 2  (7.194)  Chapter  7.  Apphcation  3: Volume  pred. In V  Growth  2 M S L S  =  and  145  Yield  2.264741 + 0.072275 SI - 47.47299 — ^±2  + lMA2Upred.lnBA  (7.195)  2ut  The I M S L S technique i n this case is the same as the I3SLS technique. The estimated coefficients from the M S L S fit were used to estimate new values for the contemporaneous correlation by using equation 3.32, and a new M S L S fit was calculated. This process was repeated for eight iterations, until the criterion specified i n equation 4.65 was met. Coefficients were similar to those for the M S L S fit.  7.7  Comparison of the Three F i t t i n g Techniques  7.7.1  Goodness-of-fit Measures  F i t Index The F i t Indices for each of the two equations for the O L S , M S L S , and I M S L S fits are presented i n Table 7.24. F i t Indices for the composite model fit are presented for the logarithm of volume only. T h e F i t Indices for the M S L S fit were marginally lower than those for the O L S fit, for the basal area equation. For the volume equation, the F i t Index for the O L S fit was the highest, followed by the composite model, and the lowest F i t Index was for the I M S L S fit:  Table 7.24: F i t Indices for O L S , Composite M o d e l , M S L S and I M S L S Fits of the Y i e l d Equation System Endogenous Variable In BA  2  In V  2  - In BAi  MSLS  IMSLS  0.9536  Composite Model none  0.9519  0.9496  0.9802  0.9129  0.9024  0.8974  OLS £  Chapter  7. Application  3: Volume  Growth  and  146  Yield  M e a n Absolute Deviation The mean absolute deviations ( M A . D . ) were calculated by class for each of the L H S endogenous variables. T h e classes were created by sorting the 28 samples by the endogenous variable, and then dividing the sorted data into four classes of six samples each, with the fifth class having only four samples. T h e M . A . D . values for the O L S fit, for the M S L S fit, and for the composite model fit for the logarithm of volume only, are presented in Table 7.25, Table  7.26, and Table 7.27. N o values are shown for the I M S L S fit; the  values were within 0.005 units from the M S L S fit.  Table 7.25: M . A . D . for Five Classes for the O L S F i t of the Y i e l d Equation System Classes, from Low to High Values Endogenous Variable In BA lnV  2  - In BA  %  1  2  1  2  3  4  5  0.043  0.081  0.074  0.035  0.044  0.043  0.037  0.032  0.021  0.035  Table 7.26: M . A . D . for Five Classes for the M S L S F i t of the Y i e l d Equation System Classes, from Low to High Values Endogenous Variable lnBA  2  lnV  2  -  lnBA  1  1  2  3  4  5  0.039  0.087  0.078  0.034  0.046  0.090  0.070  0.052  0.078  0.054  For the basal area equation, the M . A . D . values were only marginally lower for the O L S fit compared to the M S L S fit. For the volume equation, the M . A . D . values were somewhat higher for the M S L S and composite model fits, relative to the O L S fit. T h e  Chapter  7. Application  3: Volume  Growth  147  and Yield  Table 7.27: M . A . D . for Five Classes for the Yield Composite Model F i t Classes, from Low to High Values Endogenous Variable In Vi  1  2  3  4  5  0.089  0.073  0.039  0.080  0.057  composite model fit had similar M . A . D . values to the M S L S fit.  M e a n Deviation The mean deviations ( M . D . ) were calculated for each equation for the same five classes as for M . A . D . Results for the O L S fit are presented i n Table 7.28, for the M S L S fit i n Table 7.29, and for the Composite Model fit i n Table 7.30.  T h e M . D . values for the  I M S L S fit were similar to the M S L S fit for the volume equation.  For the basal area  equation, values for the I M S L S fit were close the those for the O L S fit.  Table 7.28: M . D . for Five Classes for the O L S F i t of the Y i e l d Equation System Classes, from Low to High Values Endogenous Variable In BA lnV  2  2  - In BA  X  ^  1  2  3  4  5  -0.024  -0.010  0.008  -0.004  0.044  -0.003  -0.017  0.003  0.002  0.023  For the basal area equation, the M . D . values were similar for the O L S and M S L S fits. The directions of the deviations between the observed and predicted values were similar also. For the volume equation, the M . D . values were krwer for the O L S fit than for the M S L S fit. The M . D . values for the composite model fit were the highest.  Chapter  7, Apphcation  3: Volume  Growth  148  and Yield  Table 7.29: M . D . for Five Classes for the M S L S F i t of the Yield Equation System Classes, from Low to High Values Endogenous Variable In BA  2  -  In BA  1  In V  2  ^  1  2  3  4  5  -0.006  0.001  0.009  -0.015  0.019  0.006  -0.070  0.004  0.023  0.054  Table 7.30: M . D . for Five Classes for the Y i e l d Composite M o d e l F i t Classes, from Low to High Values Endogenous Variable In V  2  7.7.2  l . |  2  -0.007 | -0.068  3  4  5  0.008  0.030  0.057  Relative Variances  The trace of the estimated coefficient covariance matrix for the M S L S fit was 12.35046 whereas the trace of the O L S fit was 7.500340 for the system of equations. However, the O L S fit results i n coefficients which are inconsistent, because the endogenous variable on the L H S of the first equation was modified and included as an endogenous variable on the R H S of the second equation of the system. The traces for the submatrices of the estimated coefficient covariance matrix corresponding to each individual equation using the O L S and M S L S fitting techniques are given i n Table 7.31.  T h e trace of the coefficient covariance matrix for the composite  model fit is shown i n the table; however, the composite model had six coefficients i n the volume equation, whereas for the O L S and M S L S fit of the system of equations, only four coefficients were estimated for the volume equation.  Chapter  7.  3: Volume Growth  Application  and  149  Yield  Table 7.31: Trace of the Coefficient Covaria,nce M a t r i x for Each Equation of the Y i e l d Equation System Endogenous Variable lnBA  -  2  In V  2  InBA^  OLS  MSLS  0.59786  0.21083  Composite Model none  6.9025  12.1396  98.1090  The high trace value for the composite model was attributable mostly to the high variance of the coefficient associated with the - ~ variable. Also, the mean squared error for the composite model was 0.00933, whereas for the O L S fit of the volume equation, the mean squared error was only 0.00194.  The higher mean squared error and larger  number of variables likely caused the high value for the trace of the estimated coefficient covariance matrix from the composite model. The O L S fit of the basal area equation resulted in unbiased estimates of the coefficients and their variances, because all R H S variables of this equation were exogenous.  The  M S L S fit resulted i n a lower trace value for the estimated coefficient covariance matrix, which was the result of using the information from the volume equation in fitting the basal area equation (contemporaneous variances).  The O L S fit of the volume equation  results in biased and inconsistent coefficients, because basal area appears as a L H S and as a R H S variable in the system of equations.  The trace of the estimated coefficient  covariance matrix was therefore lower than for the M S L S fit. Because the number of samples used to fit this system of equations was small, the large sample properties of consistency and asymptotic efficiency may not apply for the 3SLS fit. The O L S fit has been shown to be more biased than 3SLS, and since the M S L S fit for this system was simply a 3SLS fit, the O L S results may be more biased than those of the M S L S fit, for the volume equation.  Chapter  7.  7.7.3  Apphcation  3: Volume  Growth  150  and Yield  Table of Estimated Coefficients and Standard Deviations  A summary of the estimated coefficients and their associated standard deviations from the O L S and M S L S fits is shown i n Table 7.32.  Table 7.32: Estimated Coefficients and Standard Deviations of Coefficients for the Yield Equation System Endogenous Variable  OLS  Coefficients MSLS  Standard Deviation IMSLS  OLS  MSLS  IMSLS  -0.048014 4/588017 -0.008839  -0.059548 4.561855 -0.003256  0.048789 0.770864 0.035310  0.034473 0.453306 0.022136  0.032591 0.394548 0.019709  2.264741 2.315771 0.072275 0.066766 -47.758153 .-47.47299 1.004244 1.104053  2.202168 0.074283 -46.69858 1.010136  0.134221 0.005269 2.623611 0.033243  0.163504 0.006985 3.480158 0.036704  0.145804 0.006259 3.119119 0.031243  biBA lnBA,± 2  on 012 013  lnV  -0.017785 4.458907 -0.011270  2  8 21  $22 023 023  The estimated coefficients were similar for the three techniques. Because the I M S L S technique was simply I3SLS for this problem, and the I3SLS should converge to M L E if the error terms are normally distributed, the I3SLS estimates of the coefficients may be more appealing.  7.7.4  Ranking for the Three Techniques  The ranks assigned to each fitting technique are shown i n Table 7.33.  Because of  the small number of samples, the assumption that the M S L S estimates are normally distributed may not be applicable: the calculation of confidence limits using the results from this small number of samples would likely be incorrect. T h e M S L S estimators are  Chapter  7.  Application  3: Volume  Growth  151  and Yield  Table 7.33: Ranks for the Three Techniques Feature Information Consistency Confidence Limits Asymptotic Efficiency Compatibility Ease of fit Total  OLS 2 1  Composite 1 2  MSLS 2 2  1  2  1  1 1 2 8  1 2 3 11  2 2 1 10  consistent, but, because of the small number of samples, the estimates obtained may be quite different from the true values. The summed rank for these features was highest for the composite model, because estimates of the coefficients are unbiased and so are the estimated variances of these coefficients. However, the composite model only estimates the coefficients for the endogenous variable of the second equation of the system. For the first equation, the basal area equation, the coefficients from the fitted volume equation may be used, as demonstrated by Sullivan and Clutter (1972), but these estimated coefficients are not necessarily unbiased or consistent estimates. The O L S fit resulted in inconsistent estimates of the coefficients for the volume model, as one of the R H S variables is endogenous. Also, the ease-of-fit rank was lower than for the composite model fit, as only one equation is fitted for the composite model. Finally, the O L S fit does not result in compatible equations.  Chapter  7.  Application  3: Volume  Growth  and  152  Yield  7.8 Conclusion T h e system of equations proposed by Clutter (1963) was selected for this application. This system is simultaneous, although the first equation has only exogenous variables on the R H S . The estimated error terms from the unweighted O L S fit were i i d , even though two measurements from psps were used to analyze the two equation yield system, and serial correlation was expected to be significant. This may have resulted because the plot data were pooled and serial correlation within measurements of a plot was masked by this grouping of data.  Also, the measurement period between the first and second varied  from five to 14 years, and the longer periods result i n a reduction i n the correlation between measurements.  The unweighted fit was used for comparison with the other  fitting techniques. T h e composite model was derived by Sullivan and Clutter (1972).  The estimated  error terms from the multiple linear regression of the composite model were i i d . The estimated error terms from the unweighted fit of the second stage equations for the M S L S fit, were iid. T h e M S L S technique therefore was reduced to the 3SLS technique. The estimated contemporaneous variance between the two equations was high relative to the estimated variances with each equation and the error terms between equations appeared to be highly correlated. The goodness-of-fit measures were generally best for the O L S fit.  For the basal  area equation, the M S L S goodness-of-fit measures were close to those of the O L S fit, and sometimes better.  For the volume equations, the goodness-of-fit measures for the  composite model fit and for the M S L S fit were similar, and somewhat worse than those of the O L S fit. The trace of the estimated coefficient covariance matrix was lower for the O L S fit than  Chapter  7.  Apphcation  3: Volume  Growth  and  153  Yield  for the M S L S fit. A n examination of the trace values for the submatrices indicated that, like Apphcation 1, the trace value for the equation with no R H S endogenous variables (basal area equation) was lower for the M S L S fit than for the O L S indicating an increase in efficiency by accounting for contemporaneous correlation. Like Applications 1 and 2, the trace value for the submatrix of the estimated coefficient covariance matrix for the equation with R H S endogenous variables, the volume equation, was lower for the O L S fit. If the variances of the coefficients from the O L S fit were used to calculate confidence limits, the confidence interval would be underestimated. The composite model fit was given the highest summed rank for other features. This was largely due to the small number of samples used to test this application. The M S L S technique was given a lower rank because some of the properties of E G L S could not be applied to the M S L S fit for only 28 samples. If the number of samples was larger, the summed rank for the M S L S fit would be equal to that of the composite model fit. The composite model only estimated the L H S variable; however, unbiased estimates of the coefficients were obtained. In terms of goodness-of-fit measures, the O L S fit was best, but the estimates of the coefficients of the volume model were biased and inconsistent. Also, information concerning the contemporaneous correlation was not used for the O L S fit, and so the estimates are less efficient than with the M S L S technique.  The M S L S  fit was not necessarily the best for this apphcation, because of the small number of samples used. The large sample properties of E G L S , applicable to M S L S could not be assumed. If more two-measurement psps were available, the large sample properties of the M S L S technique could be assumed and so this technique would be the most favorable. Alternatively, all possible pairs of the measurements from the selected psps could have been used to fit the model, but serial correlation between the measurements of each plot may have been significant. Using all possible pairs of measurements, the error structure would likely be more complex than that given for the serially and contemporaneously  Chapter  7.  Application  3: Volume  Growth  and  Yield  154  correlated systems of equations i n Chapter 3 of this thesis. If compatibilitjr and estimates of each of the L H S variables of this system were required, the I M S L S procedure would be the most appropriate, because these estimates woidd be equal to the M L E estimates if error terms were normally distributed.  The  standard deviations shown for the I M S L S fit were the lowest for the basal area equation, and were lower than those of the M S L S fit for the volume equation.  Chapter 8  Overall Discussion and Conclusions  The first hypothesis of this thesis was that a fitting technique exists which satisfies the desired criteria for simultaneous, contemporaneously correlated systems of equations, i n which individual equations have non-iid error terms. The second hypothesis was that any additional computational burden i n using the technique is compensated by the benefits of meeting the desired criteria. The first objective of this research, related to the first hypothesis, was to find a technique from the literature which meets the desired criteria for simultaneous, contemporaneously correlated systems of equations, i n which the error terms for individual equations are non-iid. This objective was not met, because no technique was found which satisfied these criteria for systems of equations with this error structure. However, information from the literature was used to derive a new fitting technique, labelled multistage least squares ( M S L S ) , which is an extension of the 3SLS technique to systems in which the error terms of individual equations are non-iid. The estimated coefficients from the M S L S technique are consistent and asymptotically efficient, if the estimated error covariance matrix is consistent and the error structure has been correctly determined for the system. Confidence limits can be calculated for large sample sizes, and compatibility is maintained. The second objective, related to the second hypothesis, was to compare the chosen technique to the common techniques of O L S applied to each equation, and O L S applied  155  Chapter  8.  Overall  Discussion  and  156  Conclusions  to a composite model. Since no technique was found in literature, the M S L S technique was used for this comparison. The three techniques were applied to three forestry problems for which systems of equations are used.  The criteria for examining the results  of the three techniques included the comparison of goodness-of-fit measures (Fit Index, Mean Absolute Deviation, Mean Deviation), the comparison of the trace of the estimated coefficient covariance matrix, and the calculation of a summed rank based on the amount of information given, the consistency of estimates using information from literature to assess the system, the ability to calculate confidence intervals, the efficiency using i n formation from literature to assess the system, the compatibility, and the ease of fit i n terms of the number and difficulty of steps required. The O L S fit of individual equations is simple to calculate and algorithms are readily available.  The O L S fit of each of the systems of equations for the three applications,  resulted in better goodness-of-fit measures than did the M S L S fit, as expected, because the O L S fit minimizes the sum of squared differences.  Also, the estimated coefficients  from the O L S fit were generally close to those from the M S L S fit. The O L S fit requires less computer memory than the M S L S fit; large forestry problems with many equations, variables, and samples can be fitted. However, for simultaneous systems of equations, the estimated coefficients are biased and inconsistent. The estimates do not converge to the true estimates, with increasing sample size. Also, confidence limits cannot be calculated and compatibility within the system is not assured. The O L S fit of a composite model, created by substituting all of the equations into one equation, was simple to perform for the applications tested.  Also, for the two lin-  ear composite models, the estimated coefficients were unbiased, because all of the R H S variables were exogenous. For the nonlinear composite model, coefficients were restricted and are therefore likely biased. For the first apphcation, the volume system, the derived  Chapter  8.  Overall  Discussion  and  Conclusions  157  composite model did not appear to have all of the important variables. For the dbh distribution, the composite model did not show the relationships among the parameters of the Weibull distribution. Because the composite model derived for the growth and yield system is useful for predicting only the volume yield, the basal area yield was not predicted. The technique of deriving and fitting a composite model meets all of the desired criteria for some systems of forestry equations. However, if endogenous variables remain on the R H S , estimated, coefficients are biased and consistent. Since only one endogenous variable is- predicted, the composite model fit may be undesirable.  Also, the original  biological relationships may be lost, and important variables may not be retained in the derived model. Finally, for large problems with many variables and many equations, this technique is impractical. For all three of the applications tested, the goodness-of-fit measures for the M S L S fit were close to those for the O L S fit, and were sometimes better than the composite •model fit. The number of steps required was similar to the O L S fit, also. In addition, the estimated coefficients of the M S L S fit were consistent and asymptotically efficient, except for the first application for which some efficiency was lost. Compatibility was also obtained with the M S L S fit. Hypothesis statements can also be tested. For the first and second applications, the use of the O L S fit to incorrectly test hypotheses about coefficients resulted i n different conclusions than if the consistent estimates from the M S L S fit were used.  The selected applications did not demonstrate the use of the M S L S technique  for serially correlated error terms in individual equations, or for heteroskedastic, serially correlated error terms. However, the desired criteria would still be met for these error structures, and the difficulty in obtaining the M S L S fit would likely be similar to the applications tested in this thesis. The main disadvantage of the M S L S fit was that more computer memory is required,  Chapter  8.  Overall  Discussion  and  Conclusions  158  than for the O L S or composite model fits. For large forestry problems with many equations, variables, and samples, a more efficient computer program or enough computer memory would be required.  1  Alternatively, a modified M S L S technique could be used,  in which the final step of the M S L S technique is not performed. This modified M S L S technique is simply an extension of the 2SLS technique to non-iid error terms.  Esti-  mates of coefficients and their variances would remain consistent for this modified M S L S technique. A loss in efficiency would be incurred, because the information about contemporaneous correlation used in the last step of the M S L S technique would not be utilized. The final alternative, which was used for Apphcation 1, is to fit the system using a subsample of the data. Again, a loss of efficiency would result. Another disadvantage of the M S L S technique is that estimates are not unbiased. For small samples, a Monte Carlo study to examine the degree of bias for small sample sizes for the M S L S and I M S L S techniques should be conducted. The contemporaneous correlation, correlation of the R H S variables with the error term, serial correlation, and heteroskedasticity should be varied to examine the effects on bias. However, studies using 3SLS for small samples indicated that the bias is often less for this technique than for the O L S technique. Similar results may occur for the M S L S technique. In summary, the first hypothesis of this thesis was refuted, because no technique was found i n literature which meets the desired criteria for the error structure described. However, a fitting technique was derived as part of this research. In terms of the number of steps required, the second hypothesis was met using M S L S for the applications tested. But in terms of the computer memory required, the M S L S technique results i n an additional computational burden which could limit the use. A modified M S L S technique or a subsample of the data followed by the M S L S technique are proposed as alternatives A F O R T R A N p r o g r a m w i t h I n t e r n a t i o n a l M a t h a n d S t a t i s t i c a l L i b r a r y ( I M S L ) subroutines, Version 1.0, was used to o b t a i n the final M S L S fits for the three applications presented i n this thesis. 1  Chapter  8.  Overall  Discussion  and  Conclusions  159  for large forestry problems. Also, since the computer program used in this research may not be most efficient, the computational burden may be reduced by creating an efficient routine.  Chapter 9  References C i t e d  Aitken, A . C . 1934-35. O n least squares and linear combinations of observations. Royal Society of E d i n b u r g h . 55:> 42-48. Amateis, R. L . , H . E . Burkhart, B . J . Greber, and E . E . Watson. 1984. A comparison of approaches for predicting multiple- product yields from weight-scaling data. Forest Sci. 30: 991-998. Ameniya, T . 1985. Advanced econometrics. 164-170.  Harvard University Press. Cambridge, pp.  A n o n . . 1985. A l b e r t a Phase 3 Forest Inventory: Tree sectioning manual. Alberta Energy and Natural Resources. E N R Report N o . Dept. 56. 80 p. A n o n . . 1986. Alberta Forest Service: Permanent sample plot field procedures manual. Forest Measurement Section, Timber Management Branch. Alberta Forest Service. Edmonton. F M O P C 83-03. 68 p. A n o n . , 1987.  Diameter distribution modeling from plot and stand data in Alberta.  Alberta Forestry, Lands and Wildlife. P u b . N o . T/156. Edmonton, Alberta. 81 p. Arora, S. S. 1973. Error components regression models and their applications. Annals of Economic and Social Measurement. 2: 451-461. Bailey, R. L . , N . C . Abernethy, and E . P. Jones, Jr. 1981. Diameter distribution models for repeatedly thinned slash pine plantations. In Proc. First Bienn. South Silvic. Res. Conf., J . P. Barnett, ed., U . S . D . A . Forest Serv. Tech. Rep. SO-34. pp 115-126. Bailey, R. L . and T . R. Dell. 1973. Quantifying diameter distributions with the Weibull function. Forest Sci. 19: 97-104.  160  Chapter  9.  References  Cited  161  Bailey. R. L . and J . A . A . da Silva. 1987. Compatible models for survival, basal area growth, and diameter distributions of fertilized slash pine plantations. In Forest Growth Modelling and Prediction, Volume 1, Proceedings of the I U F R O conference, August 23-27, 1987, Minneapolis, Minnesota. S A F P u b l . N o . SAF-87.12. pp. 538-546. Bailey, R.. L . and K . D . Ware. 1983. Compatible basal-area growth and yield model for thinned and unthinned stands. C a n . J . For. Res. 13: 563-571. Balestra, P. and M . Nerlove. 1966. Pooling cross section and time series data i n the estimation of a dynamic model. Econometrica 34: 585-612. Basmann, R. L . 1957. A generalized classical method of linear estimation of coefficients in a structural equation. Econometrica 25: 77-83. Beck, D . E . 1971. Polymorphic site index curves for white pine i n the Southern A p palachians. U . S . D . A . For. Serv. R.es. P a p . SE-80. 8 p. Borders, B . E . and R. L . Bailey. 1986. A compatible system of growth and yield equations for slash pine fitted with restricted three-stage least squares. Forest Sci. 32: 185-201. Borders, B . E . , R. L . Bailey, and M . L . Clutter. 1987a. Forest growth models: parameter estimation using real growth series. In Forest Growth Modelling and Prediction, Volume 2. Proceedings of the I U F R O Conference, August 23-27, 1987, Minneapohs, Minnesota. S A F P u b l . No. SAF-87.12. pp. 660-667. Borders, B . E . , R. A . Souter, R. L . Bailey, and K . D . Ware. 1987b. Percentile-based diameter distributions characterize forest stand tables. Forest Sci. 33: 570576. Brundy, J . M . and D . W . Jorgenson. 1971. Efficient estimation of simultaneous equations by instrumental variables. Review of Economics and Statistics 53:107-224. Burkhart, H . E . 1986. Fitting analystically related models to forestry data. In Proceedings of the XIII International Biometric Conference, J u l y 27-August 1, 1986. 15 p.  Seattle, Washington.  Chapter  9.  References  162  Cited  Burkhart, H . E . and P . T . Sprinz. 1984. Compatible cubic volume and basal area projection equations for thinned old-field loblolly pine plantations. Forest Sci. 30: 86-93. Byrne, J . C . and D . D . Reed. 1986. Complex compatible taper and volume estimation systems for red and loblolly pine. Forest Sci. 32: 423-443. Chow, G . C . 1983. 77-252.  Econometrics.  M c G r a w - H i l l Book Company, Inc., Toronto,  pp.  Clutter, J . L . 1963. Compatible growth and yield models for loblolly pine. Forest Sci. 9: 354-371. Clutter, J . L . , J . C . Fortson, L . V . Pienaar, G . H . Brister, and R. L . Bailey. 1983. Timber management: A quantitative approach. - John W i l e y &: Sons, Inc., Toronto. 333 p. Cochrane, D . and G . H . Orcutt. 1949. Application of least squares regression to relationships containing auto-correlated error terms. J . Amer. Stat. Assn. 44: 32-61. Conover, W . J . 1980. Practical nonparametric statistics. Toronto, pp. 122-129.  J o h n W i l e y & Sons, Inc.,  Cragg, J . G . 1967. O n the relative small-sample properties of several structural equation estimators. Econometrica 35: 89-110. Cragg, J . G . 1982. Estimation and testing i n time-series regression models with heteroscedastic disturbances. J . of Econometrics 20: 135-157. Dagenais, M . G . 1978. The computation of F I M L estimates as iterative generalized least squares estimates in linear and nonlinear simultaneous equations models. Econometrica 46: 1351-1362. Davis, A . W . and P. W . West. 1981. Remarks on ''Generalized least squares estimation of yield functions" by I.S. Ferguson and J.W . Leech. Forest Sci. 27: 233-239. T  Dhrymes. P. J . 1971. Equivalence of iterative Aiken and maximum likelihood estimators for a system of regression equations. Australian Economic Papers. 10: 20-24.  Chapter  9.  References  163  Cited  Doran, H . E . and W . E . Griffiths. 1983. O n the relative efficiency of estimators which include the initial observations i n the estimation of seemingly unrelated regressions with first-order autoregressive disturbances. J . Econometrics 23: 165-191. Dubey, S. D . 1967. Some percentile estimators for Weibull parameters. 9: 119-129. Duncan, G . M . 1983. Estimation and inference for heteroscedastic Int. Econ. Rev. 24: 559-566.  Technometrics  systems of equations.  Durbin, J . 1957. Testing for serial correlation i n systems of simultaneous regression equations. Biometrika 44: 370-377. D u r b i n , J . and G . S. Watson. 1951. Testing for serial correlation in least squares regression II. Biometrika 38: 159-178. Edwards, D . 1987. pers. comm. Forest Measurement Section, Timber Management Branch. Alberta Forest Service. Edmonton. Engle, R. F. 1982. Autoregressive conditional heteroscedasticity with estimates of the variances of United K i n g d o m Inflations. Econometrica 50: 987-1007. Epps, T . W . and M . L . Epps. 1977. The robustness of some standard tests for autocorrelation and heteroskedasticity when both problems are present. Econometrica 45: 745-753. Ferguson, I. S. and J . W . Leech. 1978. Generalized least squares estimation of yield functions. Forest Sci. 24: 27-42. Ferguson, I. S. and J . W . Leech.  1981.  Reply to remarks by A . W . Davis and P. W .  West on "Generalized least squares estimation of yield functions." Forest Sci 27: 589-591. Friedman, J . and R. J . Foote. 1955. Computational methods for handling systems of simultaneous equations, with applications to agriculture. U . S . D . A . Agriculture Handbook N o . 94. 109 p. Furnival, G . M . and R. W . Wilson, Jr. 1971. Systems of equations for predicting forest  Chapter  9.  References  164  Cited  growth and yield. In Statistical Ecol. 3: 43-57. G . P. P a t i l , E . C . Pielou, W . E . Waters, eds. Penn. State Univ. Press, University Park. Goldberger, A . S. 1962. Best linear unbiased prediction in the generalized linear regression model. J . Amer. Stat. Assn. 57: 369-375. Goldfeld. S. M . and R. E . Quandt.  1965.  Some tests for homoscedasticity.  J . Amer.  1973.  Polymorphic site index curves for shortleaf  Stat. Assn. 60: 539-547. Graney, D . L . and H . E . Burkhart.  pine i n the Ouachita Mountains. U . S . D . A . For. Serv. Res. P a p . SO-85.  14  PGregoire, T. G . 1987. Generalized error structure for forestry yield models. Forest Sci. 33: 423-444, Gujarti, D . 1978. Basic econometrics. M c - G r a w H i l l . Inc. Toronto, pp. 333-387. Hans, R. P. 1986. Estimating the coefficients i n a system of compatible growth and yield equations for loblolly pine. M.Sc. Thesis. V a . Polytech. Inst, and State Univ., Blacksburg, Virginia. 45 p. Harvey, A . C . and G . D . A . Phillips. 1980. Testing for serial correlation in simultaneous equation models. Econometrica 48: 747-759. Henderson, C . R., Jr. 1971. Comment on "The use of error components models i n combining cross section with time series data". Econometrica 39: 397-401. Hildreth, C. and J . P. Houck. 1968. Some estimators for a linear model with random coefficients. J . Amer. Stat. Assn. 63: 584-595. Honer, T . G . 1964. The use of height and squared diameter ratios for the estimation of merchantable cubic foot volume. Forestry Chron. 40: 325-331. Honer, T . G . 1965. A new total cubic foot volume function. Forestry Chron. 41: 476-493. Honer, T . G . 1967.  Standard volume tables and merchantable conversion factors for  the commercial tree species of central and eastern Canada. Canada Dept. of Forestry and R u r a l Development Information Report. F M R - X - 5 . 14 p.  .Chapter  9.  References  165  Cited  H u , T . 1973. Econometrics, an introductory analysis. University Park Press, Baltimore, pp. 120-167. Hyink, D . M . and J . W . Moser, Jr. 1983. A generalized framework for projecting forest yield and stand structure using diameter distributions. Forest Sci. 29: 85-95. Intriligator, M . D . 1978.  Econometric models, techniques, and applications.  Prentice-  H a l l , Inc., Toronto, pp. 336-428. Jaech, J . L . 1964.  A note on the equivalence of two methods of fitting a straight line  through cumulative data. J . Amer. Stat. Assn. 59: 863-866. Judge, G . C . , W . E . Griffiths, R. C . H i l l , H . Liitkepohl, and T . Lee. 1985. The theory and practise of econometrics. 2nd. ed. John W i l e y & Sons, Inc., Toronto. 1019 p. Kadiyala, K . P. 1968. A transformation used to circumvent the problem of autocorrelation. Econometrica. 36: 93-96. Kmenta, J . and R. F . Gilbert. 1968. Small sample properties of alternative estimators of seemingly unrelated regressions. J . Amer. Stat. Assn. 63: 1180-1200. Kmenta, J . and R. F . Gilbert. 1970. Estimation of seemingly unrelated regressions with autoregressive disturbances. J . Amer. Stat. Assn. 65: 186-197. Knoebel, B . R., H . E . Burkhart, and D . E . Beck. 1986. A growth and yield model for thinned stands of yellow-poplar. Forest Sci. -Monograph 27. i + 62 p. Kozak, A . in press.  A variable-exponent taper equation. C a n . J . For. Res.  Lehman, E . L . 1975. Nonparametrics: statistical methods based on ranks. Holden-Day, Inc., Toronto, pp. 313-315. LeMay, V . 1982. Estimating total, merchantable, and defect volumes of individual trees for four regions of Alberta. M . S c . Thesis.  The University of Alberta. E d -  monton. 110 p. Lundgren, A . L . and W . A . Dolid. 1970. Biological growth functions describe published site index curves for Lake States timber species. U . S . D . A . For. Serv. Res.  Chapter  9.  References  166  Cited  Pap. N C - 3 6 . 8 p. Maclanskjr, A . 1964. O n the efficiency of three-stage least squares estimation. Econometrica 32: 51-56. M a d d a l a , G . S. 1974. Some small sample evidence on tests of significance i n simultaneous equation models. Econometrica 42: 841-851. Maeshiro, A . 1980. New evidence on the small properties of estimators of S U R models with autocorrelated disturbances. J . Econometrics. 12: 177-187. Magnus, J . F. 1978. M a x i m u m likelihood estimation of the G L S model with unknown paxameters i n the disturbance covariance matrix. J . Econometrics 7: 281-312. Malinvaud, E. 1980.  Statistical methods of econometrics.  North-Holland Publishing  Company, New York. p. 285. Matney, T. G . and A . D . Sullivan. 1982. Compatible stand and stock tables for thinned and unthinned loblolly pine stands. Forest Sci. 28: 161-171. M c C u l l o c h , J . H . 1985. O n heteroskedasticity. Econometrica 53:483. McTague, J . P. and R. L . Bailey. 1987a.. Compatible basal area and diameter distribution models for thinned loblolly pine plantations in Santa Catarina, Brazil. Forest Sci. 33: 43-51. McTague, J . P. and R. L . Bailey. 1987b. Simultaneous total and merchantable volume equations and a compatible taper function for loblolly pine. C a n . J . of For. Res. 17: 87-92. Mehta, J . S. and P. A . V . B . Swamy. 1976. Further evidence on the relative efficiencies of Zellner's seemingly unrelated regressions estimator. J . Amer. Stat. Assn. 71: 634-639. M i k h a i l , W . M . 1975. A comparative Monte Carlo study of the properties of econometric estimators. J . Amer. Stat. Assn. 70: 94-104. Monserud, R. A . 1984. Height growth and site index curves for inland Douglas-fir based on stem analysis data and forest habitat type. Forest Sci. 30: 943-965.  Chapter  9.  References  167  Cited  Murphy. P a u l A . 1983. A nonlinear timber yield equation system for loblolly pine. Forest Sci. 29: 582-591. Murphy, P a u l A . and Roy C . Beltz. 1981. Growth and yield of shortleaf pine in the West G u l f Region. U . S . D . A . For. Ser. Res. P a p . SO-169. 7 p. Murphy, P a u l A . and Herbert S. Sternitzke.  1979.  Growth and yield estimation for  loblolly pine in the West Gulf. U . S . D . A . For. Ser. Res. P a p . SO-154. 8 p. Nagar, A . L . 1960. A Monte Carlo study of alternative simultaneous equation estimators. Econometrica 28: 573-590. Newberry, J . D . 1984. Methods for modeling whole stem diameter growth and taper. P h . D . Thesis. V a . Polytech. Inst, and State U n i v . , Blacksburg, Virginia. 107 p. Newberry, J . D . and H . E . Burkhart. 1986. Variable-form stem profile models for loblolly pine. C a n . J . of For. Res. 16: 109-114. Oberhofffer, W . and J . Kmenta. 1974. A general procedure to obtaining m a x i m u m likelihood estimates in generalized regression models. Econometrica 42: 579590. Parks, R. W . 1967. Efficient estimation of a system of regression equations when disturbances are both serially and contemporaneously correlated. J . Amer. Stat. Assn. 62: 500-509. Pienaar, L . V . and B . D . Shiver. 1986. Basal area prediction and projection equations for pine plantations. Forest Sci. 32:626-633. Pienaar, L . V . and K . J . Turnbull. 1973. The Chapman-Richards generalization of von Bertalanffy's growth model for basal area growth and yield i n even-aged stands. Forest Sci. 19: 2-22. Ramirez-Maldonado, H . , R. L . Bailey, and B . E . Borders. 1987. Some implications of the algebraic difference approach for developing growth models. In Forest Growth Modelling and Prediction, Volume 2, Proceedings of the I U F R O conference, August 23-27, 1987, Minneapolis, Minnesota. S A F P u b l . N o . SAF-87.12. pp. 731-738.  Chapter  9.  References  Cited  168  Rao, P. and Z. Griliches. 1969. Small-sample properties of several two-stage regression methods in the context of autocorrelated errors. J . Amer. Stat. Assn. 64: 253-272. Reed, D . D . and E . J . Green. 1984. Compatible stem taper and volume ratio equations. Forest Sci. 30: 977-990. Reed, D . D . , E . A . Jones, T . R. Bottenfield, and C . C . Tretin. 1986. Compatible cubic volume and basal area equations for red pine plantations. C a n . J . For. Res. 16: 416-419. Revankar, N . S. 1974. Some finite sample results in the context of two seemingly unrelated regression equations. J . Amer. Stat. Assn. 69: 187-190. Savin, N . E . and K . J . White. 1977. The Durbin-Watson test for serial correlation with extreme sample sizes or many regressors. Econometrica 45: 1989-1996. Sawa, T. 1969. The exact sampling distribution of ordinary least squares and two-stage least squares estimators. J . Amer. Stat. Assn. 64: 923-937. Schumacher. F . X . and Hall, F . S. 1933. Logarithmic expression of timber-tree volume. Jour. Agric. Res. 47: 719-734. Spurr, S. 1952. Forest inventory. The Ronald Press Company, New York. p. 94. Srivastava, V . K . and R. Tiwari. 1978. Efficiency of two-stage and three-stage least squares estimators. Econometrica 46: 1495-1498. Sullivan, A . D . and J . L . Clutter. 1972. A simultaneous growth and yield model for loblolly pine. Forest Sci. 18: 76-86. Summers, R. 1965. A capital intensive approach to the small sample properties of various simultaneous equation estimators. Econometrica 33: 1-41. Swamy, P. A . V . B . 1970.  Efficient inference in a random coefficient regression model.  Econometrica 38: 311-323. Sw amy, P. A . V . B . and J . S. Mehta. 1977. Estimation of linear models with time and cross-sectionally varying coefficients. J . Amer. Stat. Assn. 72: 890-898. T  Chapter  9.  References  169  Cited  Telser, L . G . 1964. Iterative estimation of a set of linear regression equations. J . Amer. Stat. Assn. 59: 845-62. Trousdell, K . B . , D . E . Beck, and F . T . Lloyd. 1974. Site index for loblolly pine i n the A t l a n t a Coastal P l a i n of the Carolinas and Virginia. U . S . D . A . for. Serv. Res. Pap. SE-115. 11 p. Vaessen, W . 1984. U B C N L P : Nonlinear function optimization. University of British Columbia, Vancouver. 167 p.  Computing Centre,  VanDeusen, P. C , T . G . Matney, and A . D . Sullivan. 1982. A compatible system for predicting the volume and diameter of sweetgum trees to any height. South. J . A p p l . For 6: 159-163. Wallace, T . D . and A . Hussain. 1969. The use of error components models in combining cross section with time series data. Econometrica 37: 55-72. Watts, S. B . , ed. 1983. Forestry handbook for British Columbia. 4th ed. The Forestry Undergraduate Society, Vancouver, pp. 417-420. Wonnacott, Ronald J . and Thomas H . Wonnacott. Sons, Inc., Toronto, pp. 251-572.  1979. Econometrics. John W i l e y &  Yamamoto, T . 1979. O n the prediction efficiency of the generalized least squares model with an estimated variance covariance matrix. Int. Econ. Rev. 20: 693-705. Zellner, A . 1962. A n efficient method of estimating seemingly unrelated regressions and tests for aggregation bias. J . Amer. Stat. Assoc. 57: 348-368. Zellner, A . 1963. Estimators of seemingly unrelated regressions: some exact finite sample results. J . Amer. Stat. Assn. 58: 977-992. Zellner, A . and D . S. Huang. 1962. Further properties of efficient estimators for seemingly unrelated regression equations. Int. Econ. Rev. 3: 300-313. Zellner, A . and H . Theil.  1962.  Three-stage least squares:  simultaneous equations. Econometrica 30: 54-78.  simultaneous estimation of  Appendix A Glossary of Terms and Abbreviations  2SLS Two Stage Least Squares; a technique to fit systems of equations which are simultaneous but not contemporaneously correlated. 3SLS Three Stage Least Squares; a technique to fit systems of equations which are simultaneous and contemporaneously correlated. AFS  Alberta Forest Service  B L U E Best Linear Unbiased Estimator compatibility Logical relationships between equations of a system are retained i n the fitted system of equations. composite A l l equations of the system are combined into one equation. consistent The probability that the estimate is within a small deviation from the population value approaches one as the number of samples is increased; the estimate converges to the true value as the sample size is increased. contemporaneous correlation Correlation of error terms between equations of a system. contemporaneous variances The variances corresponding to contemporaneous correlation. dbh diameter at breast height (1.3 metres above ground) efficient The Cramer-Rao lower bound for efficiency is met. More efficient means that the variance of the estimate is lower whereas less efficient means that the variance of the estimate is higher. E G L S Estimated Generalized Least Squares; a technique developed for single equations to fit a linear model using an estimated error covariance matrix. endogenous variables Variables generated by the system of equations, stochastic.  170  Appendix  A.  Glossary  of Terms  and  Abbreviations  171  exogenous variables Variables generated outside the system of equations, nonstochastic. F.I. F i t Index GLS Generalized Least Squares; a technique developed for single equations to fit a linear model using a known error covariance matrix. heteroskedasticity Variances of the error terms are not equal across the range of the sample data; error terms are not identically distributed. homoskedasticity Variances of the error terms are equal across the range of the sample data; error terms are identically distributed. I3SLS Iterated Three Stage Least Squares; the estimated error covariance matrix from the first 3SLS fit is used to fit the system of equations again. The process is repeated until convergence occurs. iid independent and identically distributed  IMSLS Iterated Multistage Least Squares; the estimated error covariance matrix from the first M S L S fit is used to fit the system of equations again. repeated until convergence occurs.  T h e process is  inconsistent T h e probability that the estimate is within a small deviation from the population value does not approach one as the number of samples is increased. LHS Left Hand Side M . A . D . Mean Absolute Deviation M.D.  Mean Deviation  M L E M a x i m u m Likelihood Estimator MSLS Multistage Least Squares; derived i n this thesis for fitting systems of equations which are simultaneous and contemporaneously correlated and have with non-iid error terms. non-iid  Either not independent or not identically distibuted or both.  OLS Ordinary Least Squares; techniques include simple linear regression, multiple linear regression, nonlinear least squares, and regression of weighted models. One of the assumptions for O L S is that the error terms are. i i d .  Appendix  A.  Glossary  of Terms and  Abbreviations  172  psps permanent sample plots; plots that are established and marked so that future measurements from the same trees can be taken. RHS  Right Hand Side  serial c o r r e l a t i o n  Dependence between the current error term and the previous one(s).  s i m u l t a n e i t y bias The bias in the estimated coefficients which is results if an equation with endogenous variables on the R H S is fit using O L S , G L S , or E G L S . s i m u l t a n e o u s e q u a t i o n s A system of equations wherein the L H S variable of one or more equations also appears on the R H S of one or more equations in the system. SUR  Seemingly Unrelated Regression; a technique to fit systems of equations which are contemporaneously correlated but not simultaneous.  

Cite

Citation Scheme:

        

Citations by CSL (citeproc-js)

Usage Statistics

Share

Embed

Customize your widget with the following options, then copy and paste the code below into the HTML of your page to embed this item in your website.
                        
                            <div id="ubcOpenCollectionsWidgetDisplay">
                            <script id="ubcOpenCollectionsWidget"
                            src="{[{embed.src}]}"
                            data-item="{[{embed.item}]}"
                            data-collection="{[{embed.collection}]}"
                            data-metadata="{[{embed.showMetadata}]}"
                            data-width="{[{embed.width}]}"
                            async >
                            </script>
                            </div>
                        
                    
IIIF logo Our image viewer uses the IIIF 2.0 standard. To load this item in other compatible viewers, use this url:
http://iiif.library.ubc.ca/presentation/dsp.831.1-0075305/manifest

Comment

Related Items