In case you sometimes read an alarming headline about biodiversity loss, like Global wildlife populations have declined by 69% since 1970 or Researchers Report a Staggering Decline in Wildlife, there is a decent chance it is based on a metric called The Living Planet Index (LPI). My colleagues at CTS looked into how is LPI calculated, and show that current version of LPI is probably best understood as a cautionary example of effects of mathematical biases, but poor measure of biodiversity. [1] I do expect this result will get way less media coverage than the alarmist headlines; also it is methodologically interesting, so posting a short summary.
The paper has a helpful diagram:
So, the whole process is
There isn't anything obviously fishy about this, and if you try to invent an index tracking population growth or decline from a scratch, you would probably make many similar choices: geometric averages are natural way to track population growth, some smoothing of noisy population estimates is reasonable, and weighted averaging over different ecosystems also makes sense, because of uneven sampling across taxa and regions.
Devil is in the detail.
Imagine you're trying to track a population of rabbits in your backyard, but as they are moving, you have some trouble counting them exactly. You count 5 one day, and 3 the next, and 1 the next one. Have the rabbits really decreased or increased? The problem, in particular with small populations, is, you like have an arithmetic noise in your measurements.
Unfortunately the symmetrical counting noise on the arithmetic scale gets transformed to negative growth after the log transform. For the given case LPI in the depicted case decreases from 1 to ~0.75. Note that it does not matter whether the two records occur in subsequent years or they are more distant in time.
This is somewhat longer to explain, so read in the original paper if interested. First problem is the index is sensitive to initial population declines early on, and can hardly recover. And this is amplified by the hierarchical weighted averaging procedure: imagine if you have a sequence of increasingly general buckets, where all the time there is just a single species in the increasingly general categories. If population of such species, it gets unreasonably amplified.
The paper gives an example of the herptiles in the Palearctic region, represented by only one (declining) population of viper Vipera berus for the period 1974-1977. Hierarchical averaging across taxa and biogeographical regions leads to the situation in which these four records of the viper population cause an 89.5% greater decrease (the index changes from the original value of 0.826 to 1.721 after removing these four records) in the final state of the LPI for the Palearctic realm and a 3.3% greater decrease in the LPI for the whole terrestrial system in comparison to the LPI without these four records
With data series about populations count, do you expect zeroes more at the beginning or at the ends? Clearly at the end: people usually start studying the species and recording the time series when the species is present somewhere. Imagine what would a time-series with leading 15 zeroes imply: someone diligently recording, year after year, "we haven't seen this animal". On year 16, it is finally observed for the first time. The symmetrical case, when the species disappears, is more likely.
Discussion
The authors conclude
Due to the sensitivity of the LPI to subjective decisions and to specific problems with the LPI calculation, the LPI does not seem to accurately represent biodiversity trends. An indicator of the global state of nature should not be sensitive to the fact that 50 years ago one population of viper did not thrive well, and should not be affected by the particular way population sizes were measured and how population absence was treated in the end or the beginning of the time series. Similarly, a universal index of population change should not be sensitive to particular grouping to taxa and biogeographical realms if its aim is to provide a rigorous, repeatable indicator with a straightforward interpretation. These issues deserve particular attention if the LPI is calculated for individual regions or countries, in which the effect of these biases may be even stronger than in the case of the global data.
and again note that LPI not being an accurate representation of biodiversity does not imply the situation isn't bad.
Note that this does not imply much about actual biodiversity loss.