@@ -26,25 +26,19 @@ if sys.version_info.major < 3 or sys.version_info.minor < 6:
  print("Please use Python 3.6 (or higher)!")
#+END_SRC
#+RESULTS:
#+BEGIN_SRC emacs-lisp :results output :exports both
(unless (featurep 'ob-python)
  (print "Please activate python in org-babel (org-babel-do-load-languages)!"))
#+END_SRC
#+RESULTS:
** R 3.4
We use only basic R functionality, so an earlier version might work, but we did not test this.
#+BEGIN_SRC emacs-lisp :results output :exports both
(unless (featurep 'ob-R)
  (print "Please activate R in org-babel (org-babel-do-load-languages)!"))
#+END_SRC
#+RESULTS:
* Data preprocessing
The data on the incidence of influenza-like illness are available from the Web site of the [[http://www.sentiweb.fr/][Réseau Sentinelles]]. We download them as a file in CSV format, in which each line corresponds to a week in the observation period. Only the complete dataset, starting in 1984 and ending with a recent week, is available for download. The URL is:
...
@@ -69,12 +63,10 @@ This is the documentation of the data from [[https://ns.sentiweb.fr/incidence/cs
The [[https://en.wikipedia.org/wiki/ISO_8601][ISO-8601]] format is popular in Europe, but less so in North America. This may explain why few software packages handle this format. Python has supported it since version 3.6. We therefore use Python for the pre-processing phase, which has the advantage of not requiring any additional library. (Note: we will explain in module 4 why it is desirable for reproducibility to use as few external libraries as possible.)
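As a minimal sketch of what this support looks like, the following block parses an ISO-8601 week using only the standard library. The function name =iso_week_to_monday= and the input string ="1984-W44"= are illustrative assumptions, not part of the analysis itself; the format codes =%G=, =%V= and =%u= are the ISO year, ISO week and ISO weekday directives that =strptime= accepts since Python 3.6.

#+BEGIN_SRC python :results output :exports both
import datetime

def iso_week_to_monday(iso_week):
    """Return the date of the Monday of an ISO week given as 'YYYY-Www'.

    Hypothetical helper for illustration only.
    """
    # Append weekday 1 (Monday) so that strptime has a complete date.
    parsed = datetime.datetime.strptime(iso_week + "-1", "%G-W%V-%u")
    return parsed.date()

print(iso_week_to_monday("1984-W44"))
#+END_SRC

The weekday must be supplied explicitly because =%G= and =%V= alone do not identify a single day.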
** Download
To protect ourselves in case the Réseau Sentinelles Web server disappears or is modified, we make a local copy of this dataset and store it together with our analysis. Downloading the data at each execution is unnecessary and even risky, because a malfunction could replace our file with a corrupted version. Therefore we download the data only if no local copy exists.