“Best-Guess” MRAD Provides Robust Evidence for a Limit to Human Lifespan: Reply to de Grey (Rejuvenation Res. 2017;20:261–262)

Abstract

Using segmented linear regression to reanalyze the “best-guess” maximum reported age at death data supplied in Aubrey de Grey's editorial, we find compelling evidence for a breakpoint in the mid-1990s, with a positive slope before the breakpoint and a flat or slightly negative slope after it. This result confirms our earlier findings, and we also modeled the data using exponential regression. Both the segmented and exponential models were superior to a simple linear regression, providing a better fit for the data even after taking into account their greater number of parameters. These findings are highly robust to the removal of several points from the data and bolster the existing evidence for a limit to human lifespan. Taken in light of both our original analysis and its confirmation by several independent groups, this latest result provides yet more evidence that human lifespan has reached its limit under the current technological paradigm. However, we cannot discount the possibility that novel innovations could propel human lifespan beyond the limit we have identified, if they can overcome the considerable challenges facing them.

In his editorial,¹ Aubrey de Grey criticizes both the data dissemination practices of the Gerontology Research Group (GRG) and the way we handled their data on maximum reported ages at death (MRAD) in our recent article.² We leave it to the GRG to respond to what Dr. de Grey calls his “diatribe” against them, but we would like to take the opportunity to address what we felt were unfounded criticisms of our article.

First, de Grey says that we selected “only a small subset of the available data.” This statement is not true: we were not selective with the GRG data at all and used the entirety of the MRAD values. Later, de Grey also denigrates the GRG data used in our article as being only a few dozen data points; that is true, but this criticism only applies to Extended Data Figure 6 of our article. The two main figures and the other five Extended Data figures incorporate a large volume of data from both the Human Mortality Database and the International Database on Longevity, filling >200 separate graphs. “Dozens” is not sufficient to describe the number of plots in our article, much less the number of data points.

Next, drawing on uncited news articles, de Grey implies that we think our results should be accepted on the basis of visual inspection alone. Although it is not clear that de Grey disagrees with us—he concedes that our graphs are “inarguable”—we would like to emphasize that we provided in our article a variety of statistics (such as p-values and r values to accompany calculated regressions) to back up any of our hypotheses prompted by visual inspection of the data. We would also like to add that when evaluating the scientific literature, primacy should be given to the articles under scrutiny and one should be wary of out-of-context quotes. Or should we take as definitive de Grey's earlier comment³ that our results are “absolutely correct”?

After briefly outlining the shortcomings of the GRG data, de Grey reveals a graph of his “best-guess” MRAD data, consisting of the GRG data, revised using unstated criteria and unsourced information known, apparently, only to de Grey himself; an explicit enumeration of his methodology would be useful for many researchers, so it is a pity that Dr. de Grey did not include it in his article. Furthermore, de Grey expresses his doubts about the points from 1997, 1977, and 1999, calling the first a “once-in-a-century deviation” and the latter two “once-in-multiple-decades deviations” from a putative trend of ever-increasing MRAD. Looking at the plot of “best-guess” MRAD values, de Grey says that they suggest a “very different conclusion” to that of our original article. But do they really? In a word: No.

First, using all of the “best-guess” MRAD data (and performing a segmented regression using the segmented package⁴ for the R programming language⁵ to avoid any accusations of arbitrarily selecting our breakpoint), a breakpoint was identified in 1997, with a slope before the breakpoint of 0.16 and a slope after the breakpoint of −0.04 (Fig. 1A). These results are essentially the same as those presented in our original article: a breakpoint in the mid-1990s (in fact, within one standard error, 4.88, of our original breakpoint), with an increasing MRAD before the breakpoint and a slightly decreasing MRAD after it.
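To illustrate the kind of model being fitted, the following is a minimal Python sketch of a two-segment regression; it is not the code used for the analyses reported here. It assumes a continuous piecewise-linear model y = a + b1·x + b2·max(0, x − c) and locates the breakpoint c by a plain grid search, whereas the segmented package estimates it iteratively. The function names and the noiseless synthetic series (rising by 0.15 per year until 1995, flat afterward) are hypothetical and are not the “best-guess” MRAD data.

```python
def fit_hinge(xs, ys, c):
    """Least-squares fit of y = a + b1*x + b2*max(0, x - c) for a fixed breakpoint c.

    Returns (rss, a, b1, b2), or None if the two predictors are collinear.
    """
    n = len(xs)
    hs = [max(0.0, x - c) for x in xs]  # hinge term, zero before the breakpoint
    mx, mh, my = sum(xs) / n, sum(hs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    shh = sum((h - mh) ** 2 for h in hs)
    sxh = sum((x - mx) * (h - mh) for x, h in zip(xs, hs))
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    shy = sum((h - mh) * (y - my) for h, y in zip(hs, ys))
    det = sxx * shh - sxh ** 2
    if shh == 0.0 or det <= 1e-12 * sxx * shh:
        return None
    # Closed-form solution of the normal equations for two centered predictors.
    b1 = (shh * sxy - sxh * shy) / det
    b2 = (sxx * shy - sxh * sxy) / det
    a = my - b1 * mx - b2 * mh
    rss = sum((y - (a + b1 * x + b2 * h)) ** 2 for x, h, y in zip(xs, hs, ys))
    return rss, a, b1, b2


def fit_segmented(xs, ys, candidates):
    """Grid search: keep the candidate breakpoint with the smallest residual sum of squares.

    Returns (rss, breakpoint, slope_before, slope_after).
    """
    best = None
    for c in candidates:
        fit = fit_hinge(xs, ys, c)
        if fit is not None and (best is None or fit[0] < best[0]):
            rss, _, b1, b2 = fit
            best = (rss, c, b1, b1 + b2)  # slope after the break is b1 + b2
    return best


# Hypothetical synthetic series, NOT the MRAD data.
years = [float(y) for y in range(1970, 2016)]
ages = [110.0 + 0.15 * (min(y, 1995.0) - 1970.0) for y in years]
candidates = [float(c) for c in range(1972, 2014)]

rss, bp, slope_before, slope_after = fit_segmented(years, ages, candidates)
print(bp, round(slope_before, 3), round(slope_after, 3))
```

A grid search is used here only because it is easy to verify by hand; on noisy data it returns a breakpoint restricted to the candidate values and gives no standard error, which is one reason a dedicated estimation procedure is preferable in practice.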

FIG. 1.

Reanalyses of the “best-guess” MRAD data. (A) Segmented regression of the “best-guess” MRAD data identifies a breakpoint at 1997.2 (standard error 4.8) with a slope of 0.16 before the breakpoint (blue) and a slope of −0.04 after (orange). (B) Reanalysis as in (A), but with the point for 1997 removed. The breakpoint at 1998.004 (standard error 6.2) has a slope of 0.126 before and −9 × 10⁻⁵ after. (C) With the data point for 1977 also removed, the breakpoint occurs at 1998.001 (standard error 4.8) with a slope of 0.137 before and −0.001 after. (D) A recreation of Extended Data Figure 6 of our original article², using the “best-guess” MRAD data with the points for 1977, 1997, and 1999 removed. Segmented regression identified a breakpoint at 1995.06 (standard error 6.3). A linear regression of the data before 1995 yields a slope of 0.13 (standard error = 0.02, r ² = 0.62, p = 6.78 × 10⁻⁷). Linear regression of the data from 1995 and after yields a slope of 0.05 (standard error = 0.039, r ² = 0.04, p = 0.2). The light blue and orange regions indicate 95% confidence intervals for the regressions. MRAD, maximum reported age at death.

The criteria for further discounting individual data points are unclear. Although the 1997 data point, corresponding to world record holder Jeanne Calment, is considered by many to be exceptional, the 1977 and 1999 data points have attracted far less notoriety. Cook's distance, a measure introduced by R. Dennis Cook to identify influential points in regression studies,⁶ does not exceed the threshold of 1, suggested by Cook himself,⁷ for any of the data points. Using a lower threshold⁸ of 4/n flags the points from 1997 and 1977, but not the point from 1999.
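For readers unfamiliar with the measure, Cook's distance for a simple linear regression can be sketched as below. This is a hedged illustration under stated assumptions, not the analysis code: it uses the textbook formula D_i = e_i² / (p·s²) · h_i / (1 − h_i)² with p = 2 coefficients, and the function name and the synthetic series (a linear trend with one point raised by 3) are hypothetical, not the MRAD data.

```python
def cooks_distances(xs, ys):
    """Cook's distance of each observation in a simple linear regression of ys on xs."""
    n = len(xs)
    p = 2  # fitted coefficients: intercept and slope
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    slope = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx
    intercept = my - slope * mx
    residuals = [y - (intercept + slope * x) for x, y in zip(xs, ys)]
    s2 = sum(e * e for e in residuals) / (n - p)  # residual variance estimate
    leverages = [1.0 / n + (x - mx) ** 2 / sxx for x in xs]
    return [
        (e * e / (p * s2)) * h / (1.0 - h) ** 2
        for e, h in zip(residuals, leverages)
    ]


# Hypothetical synthetic series, NOT the MRAD data: a trend with a small
# alternating wiggle and one unusually high point at index 10.
xs = [float(i) for i in range(20)]
ys = [100.0 + 0.1 * x + (0.2 if i % 2 == 0 else -0.2) for i, x in enumerate(xs)]
ys[10] += 3.0

distances = cooks_distances(xs, ys)
n = len(xs)
over_one = [i for i, d in enumerate(distances) if d > 1.0]        # threshold of 1
over_4_over_n = [i for i, d in enumerate(distances) if d > 4.0 / n]  # threshold of 4/n
print(over_one, over_4_over_n)
```

In this toy series the raised point is flagged by the 4/n threshold but not by the threshold of 1, which shows how the two conventions can disagree about the same observation.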

Removing the data point for 1997 and rerunning the segmented regression leads to a breakpoint in 1998, with a slope of 0.126 before the breakpoint and a slope very slightly below zero (−9 × 10⁻⁵) after it (Fig. 1B). Removing the data point for 1977 as well yields almost identical results: a breakpoint in 1998, with a slope of 0.137 before the breakpoint and −0.001 after (Fig. 1C). Finally, if we entertain Dr. de Grey's suggestion that the data point from 1999 should be removed as well, a suggestion made without clear justification, we still arrive at results that are strikingly similar to those presented in our original article. A segmented regression identifies a breakpoint in 1995, the very same year in which we located the breakpoint in our initial analysis (Fig. 1D). In each case, the second, flatter segment occurs at around 115 years of age, the limit we identified in our original article.

If we had performed our original analysis using de Grey's data instead of the unrevised GRG data, we would have presented results consisting of a pre-breakpoint increase with a slope of 0.135 (r² = 0.62, p = 6.7 × 10⁻⁷) and a post-breakpoint slope of 0.052 (r² = 0.04, p = 0.20). The much lower slope, much lower r² value, and much higher p-value all indicate that the post-breakpoint trend is not significant and cannot be statistically distinguished from a plateau. The upper end of the 95% confidence interval for the predicted MRAD in 2016 is 115.53, well within the 95% confidence interval (113.1–116.7) we reported in our article. The 95% confidence interval is also sufficiently low and narrow to exclude the possibility that the deviation from the pre-1995 trajectory is due to chance (Fig. 1D). Overall, these results do not come as a surprise. As we have previously explained, our results are not due to an outlier⁹ or even several outliers,¹⁰ and the deviation from a continued increase is too large and long lasting to be due to chance.¹¹

Another way to model the change in MRAD over time is with an exponential regression of the form $y = L\left(1 - e^{kt}\right) + b$. The results of this regression are virtually identical, regardless of the inclusion or exclusion of the aforementioned data points: a plateau at roughly 115 years (Fig. 2). Using the Akaike information criterion (AIC)¹² to assess the relative parsimony and fit of each model, the exponential model is consistently superior to a simple linear model. However, the segmented model is consistently superior to the exponential model (Table 1). This last result is somewhat surprising in light of the fact that the dynamics of MRAD reaching its limit are likely to be that of a gradual approach, rather than a sharp break. The most likely explanation is that the smooth course of the MRAD's saturation diverges from an exponential curve in a way that is better approximated by the segmented model, which, as indicated by its superior AIC, makes good use of its additional parameter to fit the change in MRAD better while remaining parsimonious. A logistic regression might be the best way to model the MRAD's increase and stagnation, but accurately fitting one would require MRAD data from before the industrial revolution. However, this is a relatively minor issue. What is clear is that both models that predict a limit are superior to the model that predicts a continued increase, providing decisive evidence for a limit to human lifespan.
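The plateau implied by the exponential form can be made concrete with a short sketch. For k < 0 the term e^{kt} vanishes as t grows, so y approaches L + b. The Python code below is a minimal illustration under stated assumptions, not the fitting procedure used for Fig. 2: it exploits the fact that, for a fixed k, the model is linear in L and b, and profiles k over a grid. The function name, the grid, the choice of t as years since an arbitrary origin, and the noiseless synthetic series (L = 6, k = −0.08, b = 109) are all hypothetical.

```python
import math


def fit_saturating(ts, ys, k_grid):
    """Fit y = L*(1 - exp(k*t)) + b by profiling k over a grid.

    For each fixed k the model is a simple linear regression of y on
    u = 1 - exp(k*t), so L and b have closed-form least-squares estimates.
    Returns (rss, L, k, b) for the grid value of k with the smallest RSS.
    """
    n = len(ts)
    my = sum(ys) / n
    best = None
    for k in k_grid:
        us = [1.0 - math.exp(k * t) for t in ts]
        mu = sum(us) / n
        suu = sum((u - mu) ** 2 for u in us)
        if suu == 0.0:
            continue  # k = 0 makes u constant; skip it
        L = sum((u - mu) * (y - my) for u, y in zip(us, ys)) / suu
        b = my - L * mu
        rss = sum((y - (L * u + b)) ** 2 for u, y in zip(us, ys))
        if best is None or rss < best[0]:
            best = (rss, L, k, b)
    return best


# Hypothetical synthetic series, NOT the MRAD data; t counts years from an
# arbitrary origin so that exp(k*t) stays well scaled.
ts = [float(i) for i in range(46)]
ys = [6.0 * (1.0 - math.exp(-0.08 * t)) + 109.0 for t in ts]
k_grid = [-i / 100 for i in range(1, 31)]  # -0.01, -0.02, ..., -0.30

rss, L_hat, k_hat, b_hat = fit_saturating(ts, ys, k_grid)
plateau = L_hat + b_hat  # limiting value of y as t grows, valid for k < 0
print(round(k_hat, 2), round(plateau, 3))
```

Profiling over k keeps the example free of a nonlinear optimizer; a general-purpose nonlinear least-squares routine would normally be used instead and would not restrict k to grid values.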

FIG. 2.

Exponential regression on the “best-guess” MRAD data. Regardless of whether the regression is performed on all the data points or with the points for 1997, 1977, and 1999 successively removed, the curves are almost identical.

Table 1.

Relative Performance of Linear, Segmented, and Exponential Models

	r² (linear)	r² (segmented)	r² (exponential)	AIC (linear)	AIC (segmented)	AIC (exponential)
All data points	0.3576	0.4286	NA	193.8945	190.0233	192.635
1997 removed	0.4669	0.4977	NA	165.5798	164.5883	164.7242
1997 and 1977 removed	0.5669	0.6163	NA	150.6427	146.821	148.965
1997, 1977, and 1999 removed	0.6439	0.6716	NA	130.4268	128.5846	129.0562

The performance of the three models is measured using r ² and the AIC on four versions of the “best-guess” MRAD data: first, with all of the data points, and then with the points for 1997, 1977, and 1999 successively removed. In each case, the segmented model has the best performance, and the exponential model has the second-best performance. The r ² is not calculated for the exponential model as it is not an appropriate metric for nonlinear models. The r ² for the segmented model has been adjusted for the number of parameters; had it not been, the difference between it and the linear model would have been even greater.

AIC, Akaike information criterion; MRAD, maximum reported age at death.
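The two metrics in Table 1 can be sketched as follows. This is a hedged illustration, not the code that produced the table: it assumes least-squares fits with Gaussian errors, counts the error variance as one extra parameter in the AIC (the convention R uses for linear models), and uses the usual adjusted r² formula. The function names and the residual sums of squares in the example are hypothetical.

```python
import math


def gaussian_aic(rss, n, n_coef):
    """AIC of a least-squares fit with Gaussian errors.

    rss    : residual sum of squares
    n      : number of observations
    n_coef : number of fitted model parameters (variance is counted separately)
    """
    k = n_coef + 1  # add one for the estimated error variance
    log_lik = -0.5 * n * (math.log(2.0 * math.pi) + math.log(rss / n) + 1.0)
    return 2.0 * k - 2.0 * log_lik


def adjusted_r2(r2, n, n_predictors):
    """r² penalized for the number of predictors (intercept not counted)."""
    return 1.0 - (1.0 - r2) * (n - 1) / (n - n_predictors - 1)


# Hypothetical numbers, NOT taken from Table 1.
n = 40
aic_linear = gaussian_aic(rss=60.0, n=n, n_coef=2)     # intercept, slope
aic_segmented = gaussian_aic(rss=50.0, n=n, n_coef=4)  # intercept, two slopes, breakpoint
print(aic_linear > aic_segmented)  # lower AIC is preferred
print(adjusted_r2(0.5, n, 1))
```

Because each additional parameter adds 2 to the AIC, a more flexible model is preferred only if it lowers the residual sum of squares enough to offset that penalty, which is the sense in which the comparison in Table 1 accounts for model complexity.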

If we count our original analysis in the Nature article and the analyses presented here, they add up to nine separate analyses of the GRG data, and not a single one provides any evidence for a continued increase in the MRAD. At a certain point, we will all have to concede that, no matter how we analyze the data, no matter how we test it, twist it, trim it, or tweak it, the MRAD remains stubbornly stationary. Indeed, many other scientists have come to the same conclusions as us. Analyses using extreme value theory that find a limit to human lifespan within the same range we identified have been performed by three¹³ separate¹⁴ groups,¹⁵ and even one of our most vocal critics has, in an article¹⁶ with similar methodology to ours, concluded that centenarian survival has reached a “plateau” and that “the maximum lifespan, measured as the age of the oldest person to die, is currently not increasing.”

Finally, we would like to state that our results were never intended as an attack on antiaging research, merely an indication that the current approach to lifespan extension has run its course. In fact, those who bother to read our two-page article in its entirety will see that we conclude it by speculating about the technologies that could extend human lifespan past the limit we found. Instead of blaming the bearers of bad news, researchers interested in extending the human lifespan should appreciate the unvarnished assessment of the challenges ahead and be motivated to create the ground-breaking innovations that will surpass all previous advances.

Footnotes

Acknowledgment

We thank Aubrey de Grey for statistical advice. Code to reproduce the analyses presented here can be found at https://github.com/BXQ/MRAD_RR_2017

Author Disclosure Statement

X.D. and J.V. are cofounders of SingulOmics Corp.

References

1. de Grey ADNJ. Deficient data dissemination does damage. Rejuvenation Res, 2017; 20:261–262.

2. Dong, Milholland, Vijg. Evidence for a limit to human lifespan. Nature, 2016; 538:257–259.

3. Geddes. Human age limit claim sparks debate. Nat News DOI:10.1038/nature.2016.20750

4. Muggeo VMR. Package “segmented”: Regression Models with Break-Points/Change-Points Estimation. 2017. http://202.90.158.4/pub/pub/R/web/packages/segmented/segmented.pdf (accessed October 12, 2017).

5. R Core Team. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing, 2014.

6. Cook. Detection of influential observation in linear regression. Technometrics, 1977; 19:15–18.

7. Cook, Weisberg. Residuals and Influence in Regression. London, UK: Chapman & Hall, 1982.

8. Fox, Long. Modern Methods of Data Analysis. Newbury Park, Calif.: Sage Publications, 1990.

9. Dong, Milholland, Vijg J. Dong, et al. reply. Nature, 2017; 546:E7.

10. Dong, Milholland, Vijg. Reply to Kashnitsky. Soc Sci Res Netw. 2016. https://papers.ssrn.com/sol3/papers.cfm?abstract_id=2890500 (accessed October 12, 2017).

11. Dong, Milholland, Vijg J. Dong, et al. reply. Nature, 2017; 546:E9–E10.

12. Akaike. A new look at the statistical model identification. IEEE Trans Autom Control, 1974; 19:716–723.

13. Feifel, Genz, Pauly. Who wants to live forever? An analysis of the maximum lifespan in the US. 2017. www.ifa-ulm.de/fileadmin/user_upload/download/forschung/2017_ifa_Feifel-etal_Who-wants-to-live-forever-An-analysis-of-the-maximum-lifespan-in-the-US.pdf (accessed October 12, 2017).

14. The oldest human does not get any older. Tilburg University. Available at: https://www.tilburguniversity.edu/current/news/press-release-oldest-human/ (accessed September 4, 2017).

15. Gbari, Poulain, Dal, Denuit. Extreme value analysis of mortality at the oldest ages: A case study based on individual ages at death. North Am Actuar J, 2017; 0:1–20.

16. Modig, Andersson, Vaupel, Rau, Ahlbom. How long do centenarians survive? Life expectancy and maximum lifespan. J Intern Med, 2017; 282:156–163.
