{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "## Importing the data\n", "The data are published as a CSV file that can be downloaded from the Réseau Sentinelles website. To guard against later changes to the structure of the data, we keep a local copy of this CSV file: it is downloaded only if no local copy exists yet, and the analysis always reads the local file." ] }, { "cell_type": "code", "execution_count": 1, "metadata": {}, "outputs": [], "source": [ "%matplotlib inline\n", "import os\n", "import urllib.request\n", "\n", "import isoweek\n", "import matplotlib.pyplot as plt\n", "import pandas as pd\n", "\n", "data_url = \"https://www.sentiweb.fr/datasets/all/inc-7-PAY.csv\"\n", "data_local = \"./inc-7-PAY.csv\"\n", "\n", "# Download the file only if there is no local copy yet\n", "if not os.path.exists(data_local):\n", "    urllib.request.urlretrieve(data_url, data_local)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Reading the data" ] }, { "cell_type": "code", "execution_count": 2, "metadata": { "scrolled": true }, "outputs": [ { "data": { "text/html": [ "
" ], "text/plain": [ " week indicator inc inc_low inc_up inc100 inc100_low \\\n", "0 202444 7 2354 489 4219 4 1 \n", "1 202443 7 2130 625 3635 3 1 \n", "2 202442 7 2621 1246 3996 4 2 \n", "3 202441 7 2035 381 3689 3 1 \n", "4 202440 7 2125 725 3525 3 1 \n", "5 202439 7 2898 1333 4463 4 2 \n", "6 202438 7 751 0 1513 1 0 \n", "7 202437 7 916 28 1804 1 0 \n", "8 202436 7 2235 870 3600 3 1 \n", "9 202435 7 1620 285 2955 2 0 \n", "10 202434 7 2560 622 4498 4 1 \n", "11 202433 7 1971 536 3406 3 1 \n", "12 202432 7 4399 1944 6854 7 3 \n", "13 202431 7 4500 2213 6787 7 4 \n", "14 202430 7 7004 4278 9730 11 7 \n", "15 202429 7 9270 6303 12237 14 10 \n", "16 202428 7 9364 6498 12230 14 10 \n", "17 202427 7 10247 7090 13404 15 10 \n", "18 202426 7 14368 10399 18337 22 16 \n", "19 202425 7 11174 8039 14309 17 12 \n", "20 202424 7 12621 9357 15885 19 14 \n", "21 202423 7 14657 11339 17975 22 17 \n", "22 202422 7 11628 8361 14895 17 12 \n", "23 202421 7 9701 6851 12551 15 11 \n", "24 202420 7 13661 10209 17113 20 15 \n", "25 202419 7 10083 6413 13753 15 9 \n", "26 202418 7 13438 9514 17362 20 14 \n", "27 202417 7 15303 11219 19387 23 17 \n", "28 202416 7 18138 13540 22736 27 20 \n", "29 202415 7 24929 17315 32543 37 26 \n", "... ... ... ... ... ... ... ... 
\n", "1740 199126 7 17608 11304 23912 31 20 \n", "1741 199125 7 16169 10700 21638 28 18 \n", "1742 199124 7 16171 10071 22271 28 17 \n", "1743 199123 7 11947 7671 16223 21 13 \n", "1744 199122 7 15452 9953 20951 27 17 \n", "1745 199121 7 14903 8975 20831 26 16 \n", "1746 199120 7 19053 12742 25364 34 23 \n", "1747 199119 7 16739 11246 22232 29 19 \n", "1748 199118 7 21385 13882 28888 38 25 \n", "1749 199117 7 13462 8877 18047 24 16 \n", "1750 199116 7 14857 10068 19646 26 18 \n", "1751 199115 7 13975 9781 18169 25 18 \n", "1752 199114 7 12265 7684 16846 22 14 \n", "1753 199113 7 9567 6041 13093 17 11 \n", "1754 199112 7 10864 7331 14397 19 13 \n", "1755 199111 7 15574 11184 19964 27 19 \n", "1756 199110 7 16643 11372 21914 29 20 \n", "1757 199109 7 13741 8780 18702 24 15 \n", "1758 199108 7 13289 8813 17765 23 15 \n", "1759 199107 7 12337 8077 16597 22 15 \n", "1760 199106 7 10877 7013 14741 19 12 \n", "1761 199105 7 10442 6544 14340 18 11 \n", "1762 199104 7 7913 4563 11263 14 8 \n", "1763 199103 7 15387 10484 20290 27 18 \n", "1764 199102 7 16277 11046 21508 29 20 \n", "1765 199101 7 15565 10271 20859 27 18 \n", "1766 199052 7 19375 13295 25455 34 23 \n", "1767 199051 7 19080 13807 24353 34 25 \n", "1768 199050 7 11079 6660 15498 20 12 \n", "1769 199049 7 1143 0 2610 2 0 \n", "\n", " inc100_up geo_insee geo_name \n", "0 7 FR France \n", "1 5 FR France \n", "2 6 FR France \n", "3 5 FR France \n", "4 5 FR France \n", "5 6 FR France \n", "6 2 FR France \n", "7 2 FR France \n", "8 5 FR France \n", "9 4 FR France \n", "10 7 FR France \n", "11 5 FR France \n", "12 11 FR France \n", "13 10 FR France \n", "14 15 FR France \n", "15 18 FR France \n", "16 18 FR France \n", "17 20 FR France \n", "18 28 FR France \n", "19 22 FR France \n", "20 24 FR France \n", "21 27 FR France \n", "22 22 FR France \n", "23 19 FR France \n", "24 25 FR France \n", "25 21 FR France \n", "26 26 FR France \n", "27 29 FR France \n", "28 34 FR France \n", "29 48 FR France \n", "... ... ... ... 
\n", "1740 42 FR France \n", "1741 38 FR France \n", "1742 39 FR France \n", "1743 29 FR France \n", "1744 37 FR France \n", "1745 36 FR France \n", "1746 45 FR France \n", "1747 39 FR France \n", "1748 51 FR France \n", "1749 32 FR France \n", "1750 34 FR France \n", "1751 32 FR France \n", "1752 30 FR France \n", "1753 23 FR France \n", "1754 25 FR France \n", "1755 35 FR France \n", "1756 38 FR France \n", "1757 33 FR France \n", "1758 31 FR France \n", "1759 29 FR France \n", "1760 26 FR France \n", "1761 25 FR France \n", "1762 20 FR France \n", "1763 36 FR France \n", "1764 38 FR France \n", "1765 36 FR France \n", "1766 45 FR France \n", "1767 43 FR France \n", "1768 28 FR France \n", "1769 5 FR France \n", "\n", "[1770 rows x 10 columns]" ] }, "execution_count": 2, "metadata": {}, "output_type": "execute_result" } ], "source": [ "# skiprows=1 skips the first line of the file, so the column names are read from the second line\n", "raw_data = pd.read_csv(data_local, encoding=\"utf-8\", skiprows=1)\n", "raw_data" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Checking for missing data" ] }, { "cell_type": "code", "execution_count": 3, "metadata": { "scrolled": true }, "outputs": [ { "data": { "text/html": [ "
" ], "text/plain": [ "Empty DataFrame\n", "Columns: [week, indicator, inc, inc_low, inc_up, inc100, inc100_low, inc100_up, geo_insee, geo_name]\n", "Index: []" ] }, "execution_count": 3, "metadata": {}, "output_type": "execute_result" } ], "source": [ "# Select every row that contains at least one missing value\n", "raw_data[raw_data.isnull().any(axis=1)]" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "No missing data detected: the selection above is empty." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Converting the dates" ] }, { "cell_type": "code", "execution_count": 4, "metadata": {}, "outputs": [], "source": [ "def convert_week(year_and_week_int):\n", "    # The week column encodes year and week number as one integer, e.g. 202444\n", "    year_and_week_str = str(year_and_week_int)\n", "    year = int(year_and_week_str[:4])\n", "    week = int(year_and_week_str[4:])\n", "    w = isoweek.Week(year, week)\n", "    # w.day(0) is the Monday of that week; build a weekly pandas Period from it\n", "    return pd.Period(w.day(0), 'W')\n", "\n", "raw_data['period'] = [convert_week(yw) for yw in raw_data['week']]" ] }, { "cell_type": "code", "execution_count": 5, "metadata": {}, "outputs": [], "source": [ "# Index by period and sort chronologically (the file lists the most recent week first)\n", "sorted_data = raw_data.set_index('period').sort_index()" ] }, { "cell_type": "code", "execution_count": 6, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "" ] }, "execution_count": 6, "metadata": {}, "output_type": "execute_result" }, { "data": { "image/png": "\n", "text/plain": [ "
" ] }, "metadata": { "needs_background": "light" }, "output_type": "display_data" } ], "source": [ "sorted_data['inc'].plot()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# Study of the incidence" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "We define the reference period as the interval\n", "between two minima of the incidence, from September 1 of year N to\n", "September 1 of year N+1." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "The data start in " ] } ], "metadata": { "kernelspec": { "display_name": "Python 3", "language": "python", "name": "python3" }, "language_info": { "codemirror_mode": { "name": "ipython", "version": 3 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython3", "version": "3.6.4" } }, "nbformat": 4, "nbformat_minor": 2 }