Continue With High Hessian Convergence Criterion

Biostatistics. 2014 Oct; 15(4): 745–756.

Almost efficient estimation of relative risk regression

Atul A. Gawande

Brigham and Women's Hospital, Boston, MA 02115, USA

Ariadne Labs, Boston, MA 02115, USA

Received 2013 Jul 18; Revised 2014 Feb 27; Accepted 2014 Mar 4.

Abstract

Relative risks (RRs) are often considered the preferred measures of association in prospective studies, especially when the binary outcome of interest is common. In particular, many researchers regard RRs to be more intuitively interpretable than odds ratios. Although RR regression is a special case of generalized linear models, specifically with a log link function for the binomial (or Bernoulli) outcome, the resulting log-binomial regression does not respect the natural parameter constraints. Because log-binomial regression does not ensure that predicted probabilities are mapped to the [0,1] range, maximum likelihood (ML) estimation is often subject to numerical instability that leads to convergence problems. To circumvent these problems, a number of alternative approaches for estimating RR regression parameters have been proposed. One approach that has been widely studied is the use of Poisson regression estimating equations. The estimating equations for Poisson regression yield consistent, albeit inefficient, estimators of the RR regression parameters. We consider the relative efficiency of the Poisson regression estimator and develop an alternative, almost efficient estimator for the RR regression parameters. The proposed method uses near-optimal weights based on a Maclaurin series (Taylor series expanded around zero) approximation to the true Bernoulli or binomial weight function. This yields an almost efficient estimator while avoiding convergence problems. We examine the asymptotic relative efficiency of the proposed estimator for an increase in the number of terms in the series. Using simulations, we demonstrate the potential for convergence problems with standard ML estimation of the log-binomial regression model and illustrate how this is overcome using the proposed estimator. We apply the proposed estimator to a study of predictors of pre-operative use of beta blockers among patients undergoing colorectal surgery after diagnosis of colon cancer.

Keywords: Bernoulli likelihood, Convergence problems, Maclaurin series, Poisson regression, Quasi-likelihood

1. Introduction

We consider prospective study designs where it is of scientific interest to estimate relative risks (RRs) conditional on covariates. Interestingly, in many studies where RRs are the parameters of primary scientific interest, odds ratios (ORs) are reported instead. This can be explained in part by the technical advantages of logistic regression (e.g. no constraints on the regression parameters) and the widespread availability of appropriate software. Although RR regression is a special case of generalized linear models, specifically with a log link function for the binomial (or Bernoulli) outcome, the resulting log-binomial regression does not respect the natural parameter constraints. Because log-binomial regression does not ensure that predicted probabilities are mapped to the [0,1] range, maximum likelihood (ML) estimation is often subject to numerical instability that leads to convergence problems. It has been noted by several authors that convergence problems are especially likely to arise when the predicted probabilities are close to 1 (Wacholder, 1986; Lu and Tilley, 2001; Zou, 2004; Carter and others, 2005). Several approaches have been proposed to circumvent the convergence problems associated with ML estimation of log-binomial regression. These include: (1) directly estimating RR using a constrained ML method that truncates the range of the probabilities (Wacholder, 1986); (2) adding a small constant to each subject's Bernoulli outcome in the sample (Clogg and others, 1991; Deddens and others, 2003); (3) indirectly estimating RR using the mathematical relationship between OR and RR for a single binary covariate (Zhang and Yu, 1998); and (4) quasi-likelihood method of moments techniques (Traissac and others, 1999; McNutt and others, 2003; Zou, 2004; Carter and others, 2005).

One approach that has been widely studied is the use of Poisson regression estimating equations (Traissac and others, 1999; McNutt and others, 2003; Zou, 2004; Carter and others, 2005). That is, the Poisson likelihood equations are used to estimate the RR regression parameters without constraints. The estimating equations for Poisson regression yield consistent, asymptotically normal (CAN) estimators of the RR regression parameters. Moreover, Carter and others (2005) found that they alleviate convergence problems associated with ML estimation of log-binomial regression parameters. Although estimating equations for Poisson regression yield consistent estimators of the RR regression parameters, they are inefficient because the "weight function" is misspecified as Poisson instead of bionomial or Bernoulli. As we demonstrate later, the loss of efficiency tends to be greatest in the very setting where their use is required, i.e. when predicted probabilities are far from zero.

We consider the relative efficiency of the Poisson regression estimator and develop an alternative, almost efficient estimator for the RR regression parameters. The proposed method uses near-optimal weights based on a Maclaurin series (Taylor series expanded around 0) approximation to the true Bernoulli weight function. If the Maclaurin series is truncated at its first term, this yields the Poisson regression estimator. Truncation at higher terms in the series yields near-optimal weights and almost efficient estimators. Using method-of-moments, assuming the RR regression model is correctly specified, the estimators are consistent for any given truncation of the Maclaurin series approximation to the optimal weight function. We examine the asymptotic relative efficiency (ARE) for an increase in the number of terms in the series. We also make recommendations for choice of the number of terms to avoid similar finite sample convergence problems as with ML estimation of log-binomial regression parameters. We present results of a simulation study that highlight the potential gains in efficiency in finite samples.

The proposed method is motivated by a study of best practices for patients undergoing surgery for colorectal cancer (Arriaga and others, 2009). A panel of colorectal and general surgeons was assembled to ascertain a set of 37 evidence-based practices that they considered to be the most pertinent to the evaluation and management of a patient undergoing colorectal surgery after diagnosis of colon cancer. Patients with known heart disease who are given beta blockers prior to surgery have been found to have a significantly reduced risk of post-operative death (Poldermans and others, 1999). Thus, one of the key practices is giving beta blockers when indications (heart disease) are present. In this study, the medical records of 210 cancer patients with cardiac conditions from three hospitals were reviewed (due to confidentiality, hospital names must remain anonymous). Here the binary outcome of interest is whether the patient was given beta blockers (yes, no) prior to surgery. Upon review of the medical records, it was found that beta blockers were given to 130 out of the 210 equation M1 patients, indicating that many doctors are not meeting the best practices guideline for almost equation M2 of the patients. The goal of this study is to determine predictors of pre-operative use of beta blockers in these patients. The main predictors of interest are the patient's age (in years), race (categorized as white versus other), gender, number of comorbidities, and "American Society of Anesthesiologists (ASA) score." The ASA score is a global assessment of the physical status of the patient (Owens and others, 1978) and yields a two-level indicator of a patient's pre-operative disease status at diagnosis ( equation M3 disease, equation M4 -threatening disease). Table 1 presents the overall distributions of beta blocker use, age, race, gender, number of comorbidities, and ASA score. Although all of these cardiac patients should receive pre-operative beta blockers, it is of interest to explore whether pre-operative disease status (ASA score), age, race, gender, and number of comorbidities are predictive of those patients who were given beta blockers. The study investigators conjectured that patients with higher risks of complications, i.e. patients who are older, with worse ASA score and more comorbidities, are more likely to receive pre-operative beta blockers. It is also of interest to examine whether there are any differences by race and gender.

Table 1.

Descriptive statistics for patient characteristics from study of pre-operative use of beta blockers among patients undergoing colorectal surgery

In Section 2, we describe the RR regression model, the corresponding Bernoulli likelihood, and the proposed estimating equations for the RR regression parameters. In Section 3, we present results of a study examining the ARE of the proposed estimator for increasing number of terms in the series. In Section 4, we present results of a simulation study that demonstrate the potential gains in efficiency in finite samples.

2. RR regression model

Let equation M21 denote the binary response (success or failure) for subject equation M22 , equation M23 , where equation M24 is the number of independent subjects. Then equation M25 is the success probability, where equation M26 is a equation M27 vector of covariates. In RR regression, the success probability is modeled using the log link,

equation M28

or, equivalently, equation M29 , where equation M30 is the vector of regression parameters. One can easily show that the elements of equation M31 (with the exception of the intercept) have interpretation as log-RRs (see, for example, Jewell, 2003). For the remainder of this paper, we assume that the main interest centers on estimating the regression parameter vector equation M32 .

The Bernoulli likelihood is

equation M33

(2.1)

The ML estimating equations for equation M34 are equation M35 , where

equation M36

(2.2)

The MLE is the asymptotically efficient estimate. However, when the success probability approaches 1, the denominator of (2.2) approaches 0, resulting in convergence problems. Wacholder (1986) constrained the likelihood to prevent equation M37 from approaching 0; however, these modifications are still subject to convergence problems (Baumgarten and others, 1989).

In general, to estimate equation M38 one can use estimating equations (or quasi-likelihood equations; Wedderburn, 1974) of the form equation M39 , where

equation M40

(2.3)

with equation M41 being a "weight" function of equation M42 Assuming that the regression model for equation M43 has been correctly specified, i.e. equation M44 these estimating equations yield CAN estimators of equation M45 for any bounded weight function equation M46 (see, for example, Rotnitzky, 2009). Specifically, it can be shown that the asymptotic distribution of equation M47 , the estimator for equation M48 with a particular choice of equation M49 , satisfies

equation M50

(2.4)

where

equation M51

and

equation M52

Note that the asymptotically efficient estimate is the MLE, with weight function equation M53 and asymptotic covariance determined by equation M54 . Consistent estimators of the asymptotic covariance of the estimated regression parameters can be obtained using the empirical estimator of equation M55 proposed by Huber (1967), White (1982), and Royall (1986). The empirical variance estimator is obtained by evaluating equation M56 at equation M57 and substituting equation M58 for equation M59 ; this is widely known as the sandwich variance estimator.

The use of Poisson regression for estimating equation M60 has received much attention recently (e.g. Traissac and others, 1999; McNutt and others, 2003; Zou, 2004; Carter and others, 2005). The Poisson regression estimating equations are equation M61 , where

equation M62

(2.5)

It is apparent that the Poisson regression estimating equations are simply a special case of (2.3) with equation M63 Thus, although the Poisson regression estimating equations produce consistent estimators of equation M64 , they can be quite inefficient because equation M65 is not the optimal or asymptotically efficient weight function. In general, weight functions closer to equation M66 will have higher efficiency. Thus, the goal of this paper is to choose a equation M67 close to equation M68 but one that also avoids the convergence problems associated with ML. Lumley and others (2006) considered the weights equation M69 equation M70 and equation M71 and estimated their relative efficiency (relative to the MLE). Lumley and others found that with equation M72 close to 1 these three estimating equations can give inefficient estimates; this is due to the fact that the three weight functions considered do not closely approximate the optimal weight function.

To develop more efficient estimating equations, we first note that the Maclaurin series (Abramowitz and Stegun, 1970) of equation M73 is

equation M74

This series converges for equation M75 . For our proposed estimating equations for equation M76 , we consider using weight functions of the form

equation M77

(2.6)

for different finite values of equation M78 . Note that the Poisson regression estimating equations can be considered the Maclaurin series truncated at equation M79 (i.e. constant weights). Higher values of equation M80 will more closely approximate the optimal weights associated with the MLE and should yield more efficient estimates.

For any finite value of equation M81 the proposed weight function can be implemented in standard statistical software for generalized linear models that allows user-defined variance functions (e.g. PROC GLIMMIX in SAS); there is negligible increase in computational time for larger values of equation M82 However, some care must be exercised in the choice of value for equation M83 ; for very large values of equation M84 equation M85 will be very close to equation M86 and the resulting estimating equations will be unstable when equation M87 is close to 1. Recall that convergence problems with ML estimation arise when equation M88 is close to 1 and the equation M89 th observation receives excessively large weight in the estimating equations. We note here that, because equation M90 equation M91 will always be positive. Also, for success probabilities close to 1 the weights are bounded, with largest equation M92 , so that convergence problems are less likely unless very large values of equation M93 are chosen. In the next section, we examine the ARE for increasing number of terms ( equation M94 ) in the series and make recommendations for the choice of the number of terms to avoid similar finite sample convergence problems as with ML estimation. The challenge is to find some minimum value of equation M95 that provides near-optimal weights but essentially bounds the largest weights when equation M96 is close to 1.

3. Asymptotic relative efficiency

The goal is to find a weight function that approximates the optimal weight function, equation M97 , but avoids assigning extremely large weights when equation M98 is close to 1. Using the truncated Maclaurin series of equation M99 , equation M100 , we examine the ARE for increasing finite values of equation M101 . For this study of ARE, we consider a log RR regression model with a single covariate,

equation M102

where, for simplicity, we let equation M103 have a discrete uniform distribution on the set equation M104 . We let equation M105 be negative, so that equation M106 is the maximum value for any equation M107 ; in contrast, equation M108 is the minimum value for any equation M109 . For the study of ARE, we first specify the maximum, equation M110 , and minimum, equation M111 , values for equation M112 , which in turn fully specifies the parameters equation M113 and equation M114 Specification of the model in this way allows us to explore properties of the estimators as equation M115 approaches 1. For different choices of values of equation M116 and equation M117 , we examine the ARE of the estimator of equation M118 based on equation M119 for increasing values of equation M120 ranging from 0 to 100. Recall that, when equation M121 the weights are constant and equivalent to the Poisson regression estimator. Fixing equation M122 we let equation M123 range from 0.2 to 0.8.

Given a discrete uniform distribution for equation M124 , and a Bernoulli distribution for equation M125 given equation M126 , the asymptotic variance of equation M127 can be obtained from (2.4) by simply considering an artificial sample comprised of one properly weighted observation for each possible realization of ( equation M128 ). The weights are determined by the respective joint probabilities of the given realizations. Following Rotnitzky and Wypij (1994), the asymptotic variance of equation M129 can be ascertained by weighting each contribution to equation M130 and equation M131 by its respective probability. That is, we take the expectations of equation M132 and equation M133 , the components of equation M134 in (2.4), by summing all of the possible realizations weighted by their respective probabilities.

A plot of the ARE for increasing values of equation M135 is given in Figure 1. The four panels of Figure 1 display the AREs for equation M136 , respectively. As equation M137 increases, the concentration of success probabilities that are close to 1 increases as the median of the probabilities increases from 0.44 when equation M138 to 0.89 when equation M139 . In the four panels of Figure 1, the ARE for the Poisson regression estimator ( equation M140 ) is in the 60–70% range. This highlights the loss of efficiency associated with the use of constant weights when the probabilities are not small. These results are in close agreement with those reported by Lumley and others (2006). As anticipated, the ARE increases monotonically with increasing values of equation M141 . AREs of approximately 95–97% are obtained for the proposed estimator when equation M142 is between 40 and 60. We note that, for equation M143 close to 1, equation M144 and equation M145 bound the maximum weights at approximately 41 and 61, respectively. For larger values of equation M146 , there appears to be diminishing returns in terms of increases in efficiency. More importantly, however, larger values of equation M147 seem far more likely to produce problems with convergence in finite samples due to excessive weight assigned to observations with equation M148 close to 1. In the next section, we examine the finite sample performance of the proposed estimator when equation M149 and equation M150 .

An external file that holds a picture, illustration, etc. Object name is kxu01201.jpg

ARE for increasing number of terms ( equation M151 ) in Maclaurin series expansion of the optimal Bernoulli weight function.

4. Simulation study

In this section, we consider the finite sample properties of the proposed estimator. For the simulations, we used a similar configuration as in the study of ARE presented in Section 3:

equation M152

where, for simplicity, we let equation M153 have a uniform (0,1) distribution. For the different values of equation M154 and equation M155 (or equivalently, equation M156 and equation M157 ), we conducted simulations for equation M158 with 2500 simulation replications performed for each configuration. We performed simulations fixing equation M159 and letting equation M160 range from 0.2 to 0.8 in 0.2 unit increments. The simulations were used to compare the MLE (unconstrained), the Poisson regression estimator ( equation M161 ), and the proposed estimator based on a Maclaurin series approximation with equation M162 and equation M163 .

Table 2 presents the relative bias, defined as equation M164 the root mean square error, the coverage probabilities of 95% Wald confidence intervals for equation M165 , as well as the percentage of simulation replications in which the convergence criterion was met. From Table 2, we see that the percent convergence for ML is less than equation M166 for all configurations; for ML, we report the results only for those simulations that converged for ML; the latter results can be considered "conditional on the likelihood convergence criterion". In contrast, there were no convergence problems for any of the estimating equations approaches. In terms of bias, standard ML has non-negligible relative bias, and the relative bias increases as equation M167 increases. The relative bias varies from approximately 8% to 30%. All other approaches have negligible ( equation M168 5%) relative biases. For the estimating equations approaches, the relative efficiencies can be estimated as the square of the ratio of the root mean square errors. The relative efficiencies of Poisson regression versus the Maclaurin series with equation M169 is between equation M170 and equation M171 increasing from equation M172 when equation M173 to equation M174 when equation M175 For these simulations, the relative efficiency of the Maclaurin series with equation M176 versus the Maclaurin series with equation M177 is above equation M178 for all configurations ( equation M179 when equation M180 and equation M181 when equation M182 Also, the relative efficiency of the Maclaurin series with equation M183 versus the Maclaurin series with equation M184 is approximately equation M185 for all configurations. Interestingly, for these simulation configurations, even use of equation M186 terms in the Maclaurin series yields high efficiency.

5. Application to study of pre-operative use of beta blockers in patients with colon cancer

We apply the proposed methodology to the analysis of pre-operative use of beta blockers (yes/no) among patient undergoing colorectal surgery after diagnosis of colon cancer (Arriaga and others, 2009). To examine the relationship between the binary outcome and the five patient-level predictors of interest, we fit the following RR regression model:

equation M218

(5.1)

where equation M219 is the conditional probability that the equation M220 th patient receives pre-operative beta blockers; equation M221 is 1 if life-threatening disease and 0 otherwise; equation M222 is the number of comorbidities; equation M223 is 1 if male and 0 if female; equation M224 is 1 if White race and 0 if otherwise; and equation M225 is age in years (although the reported effect of age is multiplied by 10 for easier comparison of results in Table 3).

Table 3.

Comparison of (log) RR regression estimates for the probability of pre-operative use of beta blockers among patients undergoing colorectal surgery

Table 3 presents the estimates of equation M294 obtained using standard ML (as implemented in SAS PROC GENMOD), the Poisson quasi-likelihood approach (McNutt and others, 2003; Zou, 2004; Carter and others, 2005), and the Maclaurin series approach with equation M295 equation M296 and equation M297 Of note, there were convergence problems with ML but not with any of the other approaches. In particular, for ML SAS PROC GENMOD produced a warning message that "The relative Hessian convergence criterion of 0.0198000361 is greater than the limit of 0.0001. The convergence is questionable".

As conjectured by the study investigators, the results in Table 3 indicate that older patients and patients with more comorbidities are significantly more likely to receive pre-operative beta blockers. From the results presented in Table 3, it is also apparent that the quantitative variable age and the ordinal variable "number of comorbidities" show the largest estimated efficiency gains when comparing the Poisson ( equation M298 ) to the Maclaurin series estimators. The relative efficiencies can be estimated as the square of the ratio of the estimated standard errors. For the covariate "number of comorbidities", the estimated relative efficiency of Poisson regression versus the Maclaurin series with equation M299 is approximately equation M300 For the covariate age, the estimated relative efficiency of Poisson regression versus the Maclaurin series with equation M301 is approximately equation M302 Thus, in this particular application, Poisson regression appears to be quite inefficient compared with the proposed Maclaurin series approach.

For the covariate "number of comorbidities", the estimated relative efficiency of Maclaurin series with equation M303 versus the Maclaurin series with equation M304 is greater than equation M305 similarly, for the covariate age, it is above equation M306 Thus, in this applications, there appears to be discernible gains from using a Maclaurin series with equation M307 instead of equation M308 However, there is no appreciable difference between Maclaurin series with equation M309 and equation M310 This reinforces the results from Sections 3 and 4 where it was found that there are diminishing returns in terms of increases in efficiency when equation M311 is greater than 40–60.

6. Conclusion

To circumvent the usual convergence problems associated with the ML estimator, we propose estimators that approximate the optimal weight function based on the truncated Maclaurin series of equation M312 . This use of a near-optimal weight function that bounds the largest weights yields estimators with relatively high efficiency that also avoid convergence problems. In our study of asymptotic efficiency, the proposed estimator with weight function based on 40–60 terms from the Maclaurin series was 95–97% efficient relative to the MLE. This compares favorably to the Poisson regression estimator that was found to be only 60–70% efficient. In simulations with samples of size 50 (data not shown) and 100, we found similar gains in relative efficiency and no convergence problems with the proposed estimator based on 40–60 terms. In addition, the proposed estimator, using any finite number of terms, is straightforward to implement in standard statistical software for generalized linear models that allows user-defined variance functions (e.g. PROC GLIMMIX in SAS). Finally, we note that estimators that approximate the optimal weight function based on truncated series expansions may also be useful for other generalized linear models in which the link function does not respect the natural parameter constraints (e.g. linear or linear-expit models for binary data (Kovalchik and others, 2013)).

In general, RR regression is most useful when the scientific goal is to estimate the association between an exposure or intervention and a commonly occurring binary outcome, with appropriate adjustment for additional covariates. However, we note that although the estimating equations (2.3) yield consistent estimators of equation M313 for any choice of weight function, including the optimal weight function and the approximation to it proposed in this paper, the estimators are not constrained to produce estimated equation M314 . As a result, RR regression should be avoided altogether when the scientific goal is to make predictions; when the goal is prediction, models where the constraints on the probabilities are automatically satisfied (e.g. logistic regression) should be adopted instead.

Funding

We are grateful for the support provided by grants MH054693; and CA160679 from the United States National Institutes of Health (NIH).

Acknowledgements

Conflict of Interest: None declared.

References

Abramowitz M., Stegun I. A. Handbook of Mathematical Functions with Formulas, Graphs, and Mathematical Tables. New York: Dover Publications; 1970. 9th printing. [Google Scholar]
Arriaga A. F., Lancaster R. T., Berry W. R., Regenbogen S. E., Lipsitz S. R., Kaafarani H. M., Elbardissi A. W., Desai P., Ferzoco S. J., Bleday R., others The better colectomy project: association of evidence-based best-practice adherence rates to outcomes in colorectal surgery. Annals of Surgery. 2009;250:507–513. [PubMed] [Google Scholar]
Baumgarten M., Seliske P., Goldberg M. S. Warning regarding the use of GLIM macros for the estimation of risk ratios. American Journal of Epidemiology. 1989;130:1065. [PubMed] [Google Scholar]
Carter R. E., Lipsitz S. R., Tilley B. C. Quasi-likelihood estimation for relative risk regression models. Biostatistics. 2005;6:39–44. [PubMed] [Google Scholar]
Clogg C. C., Rubin D. B., Schenker N., Schultz B., Weidman L. Multiple imputation of industry and occupation codes in census public-use samples using Bayesian logistic regression. Journal of the American Statistical Association. 1991;86:68–78. [Google Scholar]
Deddens J. A., Petersen M. R., Lei X. Proceeding of the 28th Annual SAS Users Group International Conference. Cary, NC: SAS Institute Inc.; 2003. Estimation of prevalence ratios when PROC GENMOD does not converge; pp. 270–278. [Google Scholar]
Huber P. J. Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability. Berkeley: University of California Press; 1967. The behavior of maximum likelihood estimates under nonstandard conditions; pp. 221–233. Volume 1. [Google Scholar]
Jewell N. P. Statistics for Epidemiology. New York: Chapman & Hall/CRC Press; 2003. [Google Scholar]
Kovalchik S. A., Varadhan R., Fetterman B., Poitras N. E., Wacholder S., Katki H. A. A general binomial regression model to estimate standardized risk differences from binary response data. Statistics in Medicine. 2013;32:808–821. [PMC free article] [PubMed] [Google Scholar]
Lu M., Tilley B. C. Use of odds ratio or relative risk to measure a treatment effect in clinical trials with multiple correlated binary outcomes: data from the NINDS t-PA stroke trial. Statistics in Medicine. 2001;20:1891–1901. [PubMed] [Google Scholar]
Lumley T., Kronmal R., Ma S. Relative risk regression in medical research: models, contrasts, estimators, and algorithms. UW Biostatistics Working Paper Series. 2006 Working Paper 293. http://biostats.bepress.com/uwbiostat/paper293 . [Google Scholar]
McNutt L. A., Wu C., Xue X., Hafner J. P. Estimating relative risk in cohort studies and clinical trials of common events. American Journal of Epidemiology. 2003;157:940–943. [PubMed] [Google Scholar]
Owens W. D., Felts J. A., Spitznagel E. L., Jr ASA physical status classifications: a study of consistency of ratings. Anesthesiology. 1978;49:239–243. [PubMed] [Google Scholar]
Poldermans D., Boersma E., Bax J. J., Thomson I. R., van de Ven L. L., Blankensteijn J. D., Baars H. F., Yo T. I., Trocino G., Vigna C., Roelandt J. R., van Urk H. The effect of bisoprolol on perioperative mortality and myocardial infarction in high-risk patients undergoing vascular surgery. The New England Journal of Medicine. 1999;341:1789–1794. [PubMed] [Google Scholar]
Rotnitzky A. Inverse probability weighted methods. In: Fitzmaurice G., Davidian M., Verbeke G., Molenberghs G., editors. Longitudinal Data Analysis. Boca Raton: Chapman & Hall/CRC Press; 2009. pp. 453–476. [Google Scholar]
Rotnitzky A., Wypij D. A note on the bias of estimators with missing data. Biometrics. 1994;50:1163–1170. [PubMed] [Google Scholar]
Royall R. M. Model robust confidence intervals using maximum likelihood estimators. International Statistical Review. 1986;54:221–226. [Google Scholar]
Traissac P., Martin-Prevel Y., Delpeuch F., Maire B. Regression logistique vs autres modeles lineaires generalises pour l'estimation de rapports de prevalences. Rev Epidemiol Sante Publique. 1999;47:593–604. [PubMed] [Google Scholar]
Wacholder S. Binomial regression in GLIM: estimating risk ratios and risk differences. American Journal of Epidemiology. 1986;123:174–184. [PubMed] [Google Scholar]
Wedderburn R. W. M. Quasi-likelihood functions, generalized linear models, and the gauss newton method. Biometrika. 1974;61:439–447. [Google Scholar]
White H. Maximum likelihood estimation under mis-specified models. Econometrica. 1982;50:1–26. [Google Scholar]
Zhang J., Yu K. F. What's the relative risk?: a method of correcting the odds ratio in cohort studies of common outcomes. Journal of the American Medical Association. 1998;280:1690–1691. [PubMed] [Google Scholar]
Zou G. A modified Poisson regression approach to prospective studies with binary data. American Journal of Epidemiology. 2004;159:702–706. [PubMed] [Google Scholar]

Articles from Biostatistics (Oxford, England) are provided here courtesy of Oxford University Press

clarkanking.blogspot.com

Source: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4168315/