Information updated: 11-05-2020
From the very beginning of the COVID-19 disease, several map browsers have appeared to illustrate the spread of the disease all around the world. COVID-19 is a complex phenomenon in space and time. Only by approaching it through several facets, from biology to geographical aspects, we will advance in a consistent way. This humble web page has the following features to contribute to the inspiration of researchers and people wishing to complete a little bit the synoptic view that humankind needs:
Please click at the name of the institutions at the top of the page, or below.
We assume that the following variables are available every day:
With these 3 variables we automatically generate the active cases (reported) by subtracting the dead and the recovered cases form the confirmed cases. This worked in the early days of the pandemic but soon many countries discontinued reporting recovered cases (e.g. UK) or are clearly underreporting them (e.g. USA). We opted for a very simplified model to estimate active cases: Based on the information about the desease behaviour, we know that in most of the cases, confirmed cases are fully in 15 days. This way we can assume that, for most people, the confirmed cases 15 days ago are not active today anymore. Accepting this assumption, active cases can be estimated by the accumulated confirmed cases today minus the accumulated confirmed cases 15 days ago. We have compared this estimation with countries that are still providing reliable information on recovered cases (e.g. Austria), and the results are similar (sometimes after a pick, this model underestimates a bit the result). This data is maked as "active estimated" (to make a clear diference with the "active reported"). This simple model has de advantage of only needing accumulated confirmed cases, so it can be applied to regions that only provides this parameter (e.g. Catalonia). When there is inforamtion about deaths, estimated recovery is optained by substracting the death and the estimated active cases from the accumulated confirmed cases.
Rt represents the effective reproduction rate of the virus calculated for each location. It lets us estimate how many secondary infections are likely to occur from a single infection in a specific area. Values over 1.0 means we should expect more cases in that area, values under 1.0 mean we should expect fewer. Values over 2.5 means that no measures have been adopted or the measures are not working. There are several models to calculate and none of them is perfect. We wanted to have a simple very simple estimation. Based on the information about the desease behaviour (e.g. https://f1000research.com/articles/9-868), we know that the incubation period for the disease is 5 days and the "natural" reproduction rate is 2.5. Since current policy in most countries forces confirmed cases to stay at home, we can consider that active confirmed cases only transmit the disease during the previous five days of the detection (declaration of active case). The result of this transmission will translate into the active confirmed cases five days into the future. In other words, if we divide the active cases today by the ones we had 5 days ago, we will have a reasonable estimation of the reproduction rate. At the beginning of the pandemic this assumption does not work, as many cases where undetected but, after two months into the first upbreak, the assumption works well, and the results are comparable with the result of much more sophisticated models.
During the 6 months after the begining of the pandemic, we have been deriving 3 indices from each of the 4 variables (confirmed, deaths, recovered and active). To explain these indices, let's adopt the following conventions:
The derived indices use the following expressions:
Differences are very sensible to noise caused by unwanted factors (e.g. weekend anomalies, spurious difficulties in getting the data on time for accounting it in the present day report). To filter out the noise, the values used in t2, t1 and t0 are averages of five days (from two days before to two days after). This is not possible for today and yesterday (obviously, the date for tomorrow and the day after have not been released, as they are in the future) and no average value is used in this case. Please note that values of the accumulate cases presented are not averaged: only the derived indicators are affected by averages.
If you are curious about how this is done in practice, if you call the values used in the averages of the time t4 considered (t2, t3, t4, t5 and t6), the values used in the day before (t1, t2, t3, t4 and t5) and the averages of the day before the day before (t0, t1, t2, t3 and t4), the calculations can be simplified into the following expressions:
Rt represents the effective reproduction rate of the virus calculated for each locale. It
In addition to that, we are also providing an indication of the trend of the last days. This is done by classifying the dynamics of the pandemic in phases:
You can see this graphically here:
If you are curious on how exactly this is done, we consider the current value (tc), the value 5 days ago (tm) and the value 10 days ago (tf) and we apply the following logic:
Is tc > (tf+tolerance)?; tolerance=(tc+tf)/2*0.06
This map browser has been created with the MiraMon Map Browser of the MiraMon Remote Sensing and GIS software now celebrating its 25th anniversary. MiraMon is a project of the Grumets Research Group. Small python scripts to transform open data into the data used in the map browser can be found here.
Joan Masó and Xavier Pons