"3. Etablir une régression logistique en introduisant un variable Death valant 1 ou 0 si la personne est morte ou pas au cours des 20 années entre les 2 sondages. Conclure."
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"### Etape 1 :"
]
},
{
"cell_type": "markdown",
"metadata": {
...
...
@@ -49,7 +56,7 @@
"hidePrompt": true
},
"source": [
"Tout d'abord, il faut commencer par inclure les bibliothèques dont on aura besoin."
"Tout d'abord, il faut commencer par inclure les bibliothèques dont nous aurons besoin."
]
},
{
...
...
@@ -565,7 +572,7 @@
"cell_type": "markdown",
"metadata": {},
"source": [
" Création de 2 tableaux à partir du contenu du fichier csv :\n",
" Création de 2 \"tableaux\" à partir du contenu du fichier csv :\n",
" *nonFumeuses* contient les données des personnes qui ne fument pas (qui ont \"No\" dans la colonne \"Smoker\")\n",
" et *fumeuses* contient les données des personnes qui fument (qui ont \"Yes\" dans la colonne \"Smoker\")"
]
...
...
@@ -585,7 +592,7 @@
},
{
"cell_type": "code",
"execution_count": 5,
"execution_count": 21,
"metadata": {},
"outputs": [
{
...
...
@@ -616,184 +623,184 @@
" </thead>\n",
" <tbody>\n",
" <tr>\n",
" <th>915</th>\n",
" <th>0</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>47.4</td>\n",
" <td>21.0</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1095</th>\n",
" <th>1</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>30.2</td>\n",
" <td>19.3</td>\n",
" </tr>\n",
" <tr>\n",
" <th>941</th>\n",
" <th>4</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>30.4</td>\n",
" <td>81.4</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1311</th>\n",
" <th>7</th>\n",
" <td>Yes</td>\n",
" <td>Dead</td>\n",
" <td>62.1</td>\n",
" <td>57.5</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1091</th>\n",
" <th>8</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>60.0</td>\n",
" <td>24.8</td>\n",
" </tr>\n",
" <tr>\n",
" <th>945</th>\n",
" <th>9</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>60.2</td>\n",
" <td>49.5</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1148</th>\n",
" <th>10</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>50.6</td>\n",
" <td>30.0</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1307</th>\n",
" <th>12</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>43.0</td>\n",
" </tr>\n",
" <tr>\n",
" <th>913</th>\n",
" <td>Yes</td>\n",
" <td>Dead</td>\n",
" <td>84.4</td>\n",
" <td>49.2</td>\n",
" </tr>\n",
" <tr>\n",
" <th>946</th>\n",
" <th>19</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>25.0</td>\n",
" <td>65.7</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1276</th>\n",
" <th>21</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>58.5</td>\n",
" <td>38.3</td>\n",
" </tr>\n",
" <tr>\n",
" <th>950</th>\n",
" <th>23</th>\n",
" <td>Yes</td>\n",
" <td>Dead</td>\n",
" <td>43.3</td>\n",
" <td>62.3</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1309</th>\n",
" <th>26</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>35.9</td>\n",
" <td>59.2</td>\n",
" </tr>\n",
" <tr>\n",
" <th>947</th>\n",
" <th>30</th>\n",
" <td>Yes</td>\n",
" <td>Dead</td>\n",
" <td>37.1</td>\n",
" <td>Alive</td>\n",
" <td>34.6</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1125</th>\n",
" <th>31</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>57.2</td>\n",
" <td>51.9</td>\n",
" </tr>\n",
" <tr>\n",
" <th>948</th>\n",
" <th>32</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>47.7</td>\n",
" <td>49.9</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1093</th>\n",
" <th>35</th>\n",
" <td>Yes</td>\n",
" <td>Dead</td>\n",
" <td>84.3</td>\n",
" <td>Alive</td>\n",
" <td>46.7</td>\n",
" </tr>\n",
" <tr>\n",
" <th>911</th>\n",
" <th>36</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>38.6</td>\n",
" <td>44.4</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1273</th>\n",
" <th>37</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>55.7</td>\n",
" <td>29.5</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1149</th>\n",
" <th>38</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>21.5</td>\n",
" <td>Dead</td>\n",
" <td>33.0</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1111</th>\n",
" <th>39</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>41.9</td>\n",
" <td>35.6</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1127</th>\n",
" <th>40</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>32.5</td>\n",
" <td>39.1</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1115</th>\n",
" <th>42</th>\n",
" <td>Yes</td>\n",
" <td>Dead</td>\n",
" <td>63.3</td>\n",
" <td>Alive</td>\n",
" <td>35.7</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1114</th>\n",
" <th>46</th>\n",
" <td>Yes</td>\n",
" <td>Dead</td>\n",
" <td>31.3</td>\n",
" <td>44.3</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1143</th>\n",
" <th>48</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>26.6</td>\n",
" <td>37.5</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1102</th>\n",
" <th>49</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>29.7</td>\n",
" <td>22.1</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1142</th>\n",
" <th>53</th>\n",
" <td>Yes</td>\n",
" <td>Dead</td>\n",
" <td>71.0</td>\n",
" <td>Alive</td>\n",
" <td>39.0</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1140</th>\n",
" <th>56</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>42.3</td>\n",
" <td>40.1</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1288</th>\n",
" <th>60</th>\n",
" <td>Yes</td>\n",
" <td>Dead</td>\n",
" <td>39.3</td>\n",
" <td>Alive</td>\n",
" <td>58.1</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1132</th>\n",
" <th>61</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>18.0</td>\n",
" <td>37.3</td>\n",
" </tr>\n",
" <tr>\n",
" <th>63</th>\n",
" <td>Yes</td>\n",
" <td>Dead</td>\n",
" <td>36.3</td>\n",
" </tr>\n",
" <tr>\n",
" <th>...</th>\n",
...
...
@@ -802,184 +809,184 @@
" <td>...</td>\n",
" </tr>\n",
" <tr>\n",
" <th>649</th>\n",
" <th>1240</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>36.9</td>\n",
" <td>29.7</td>\n",
" </tr>\n",
" <tr>\n",
" <th>650</th>\n",
" <th>1243</th>\n",
" <td>Yes</td>\n",
" <td>Dead</td>\n",
" <td>81.8</td>\n",
" <td>Alive</td>\n",
" <td>40.1</td>\n",
" </tr>\n",
" <tr>\n",
" <th>611</th>\n",
" <th>1251</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>43.4</td>\n",
" <td>27.8</td>\n",
" </tr>\n",
" <tr>\n",
" <th>608</th>\n",
" <th>1252</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>23.5</td>\n",
" <td>52.4</td>\n",
" </tr>\n",
" <tr>\n",
" <th>605</th>\n",
" <th>1253</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>59.0</td>\n",
" <td>27.8</td>\n",
" </tr>\n",
" <tr>\n",
" <th>604</th>\n",
" <th>1254</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>43.8</td>\n",
" <td>41.0</td>\n",
" </tr>\n",
" <tr>\n",
" <th>554</th>\n",
" <th>1259</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>21.3</td>\n",
" <td>40.8</td>\n",
" </tr>\n",
" <tr>\n",
" <th>555</th>\n",
" <th>1260</th>\n",
" <td>Yes</td>\n",
" <td>Dead</td>\n",
" <td>76.9</td>\n",
" <td>Alive</td>\n",
" <td>20.4</td>\n",
" </tr>\n",
" <tr>\n",
" <th>558</th>\n",
" <th>1263</th>\n",
" <td>Yes</td>\n",
" <td>Dead</td>\n",
" <td>75.2</td>\n",
" <td>Alive</td>\n",
" <td>20.9</td>\n",
" </tr>\n",
" <tr>\n",
" <th>560</th>\n",
" <th>1264</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>53.0</td>\n",
" <td>45.5</td>\n",
" </tr>\n",
" <tr>\n",
" <th>562</th>\n",
" <th>1269</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>43.7</td>\n",
" <td>38.8</td>\n",
" </tr>\n",
" <tr>\n",
" <th>563</th>\n",
" <th>1270</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>50.9</td>\n",
" <td>55.5</td>\n",
" </tr>\n",
" <tr>\n",
" <th>565</th>\n",
" <th>1271</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>32.8</td>\n",
" <td>24.9</td>\n",
" </tr>\n",
" <tr>\n",
" <th>566</th>\n",
" <th>1273</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>50.7</td>\n",
" <td>55.7</td>\n",
" </tr>\n",
" <tr>\n",
" <th>567</th>\n",
" <th>1276</th>\n",
" <td>Yes</td>\n",
" <td>Dead</td>\n",
" <td>66.1</td>\n",
" <td>Alive</td>\n",
" <td>58.5</td>\n",
" </tr>\n",
" <tr>\n",
" <th>569</th>\n",
" <th>1278</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>27.2</td>\n",
" <td>43.7</td>\n",
" </tr>\n",
" <tr>\n",
" <th>548</th>\n",
" <th>1282</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>62.1</td>\n",
" <td>51.2</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1284</th>\n",
" <td>Yes</td>\n",
" <td>Dead</td>\n",
" <td>36.0</td>\n",
" </tr>\n",
" <tr>\n",
" <th>571</th>\n",
" <th>1285</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>38.1</td>\n",
" <td>48.3</td>\n",
" </tr>\n",
" <tr>\n",
" <th>573</th>\n",
" <th>1288</th>\n",
" <td>Yes</td>\n",
" <td>Dead</td>\n",
" <td>55.2</td>\n",
" <td>39.3</td>\n",
" </tr>\n",
" <tr>\n",
" <th>575</th>\n",
" <th>1295</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>50.9</td>\n",
" <td>Dead</td>\n",
" <td>82.4</td>\n",
" </tr>\n",
" <tr>\n",
" <th>580</th>\n",
" <th>1296</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>42.5</td>\n",
" <td>38.3</td>\n",
" </tr>\n",
" <tr>\n",
" <th>583</th>\n",
" <th>1297</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>26.6</td>\n",
" <td>32.7</td>\n",
" </tr>\n",
" <tr>\n",
" <th>584</th>\n",
" <th>1299</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>23.3</td>\n",
" <td>Dead</td>\n",
" <td>60.0</td>\n",
" </tr>\n",
" <tr>\n",
" <th>587</th>\n",
" <th>1303</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>34.8</td>\n",
" <td>31.2</td>\n",
" </tr>\n",
" <tr>\n",
" <th>589</th>\n",
" <th>1304</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>28.2</td>\n",
" <td>47.8</td>\n",
" </tr>\n",
" <tr>\n",
" <th>591</th>\n",
" <th>1305</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>38.5</td>\n",
" <td>60.9</td>\n",
" </tr>\n",
" <tr>\n",
" <th>592</th>\n",
" <th>1307</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>41.0</td>\n",
" <td>43.0</td>\n",
" </tr>\n",
" <tr>\n",
" <th>595</th>\n",
" <th>1309</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>25.7</td>\n",
" <td>35.9</td>\n",
" </tr>\n",
" <tr>\n",
" <th>572</th>\n",
" <th>1311</th>\n",
" <td>Yes</td>\n",
" <td>Dead</td>\n",
" <td>66.8</td>\n",
" </tr>\n",
" <tr>\n",
" <th>656</th>\n",
" <td>Yes</td>\n",
" <td>Alive</td>\n",
" <td>43.0</td>\n",
" <td>62.1</td>\n",
" </tr>\n",
" </tbody>\n",
"</table>\n",
...
...
@@ -988,83 +995,84 @@
],
"text/plain": [
" Smoker Status Age\n",
"915 Yes Alive 47.4\n",
"1095 Yes Alive 30.2\n",
"941 Yes Alive 30.4\n",
"1311 Yes Dead 62.1\n",
"1091 Yes Alive 60.0\n",
"945 Yes Alive 60.2\n",
"1148 Yes Alive 50.6\n",
"1307 Yes Alive 43.0\n",
"913 Yes Dead 84.4\n",
"946 Yes Alive 25.0\n",
"1276 Yes Alive 58.5\n",
"950 Yes Dead 43.3\n",
"1309 Yes Alive 35.9\n",
"947 Yes Dead 37.1\n",
"1125 Yes Alive 57.2\n",
"948 Yes Alive 47.7\n",
"1093 Yes Dead 84.3\n",
"911 Yes Alive 38.6\n",
"0 Yes Alive 21.0\n",
"1 Yes Alive 19.3\n",
"4 Yes Alive 81.4\n",
"7 Yes Dead 57.5\n",
"8 Yes Alive 24.8\n",
"9 Yes Alive 49.5\n",
"10 Yes Alive 30.0\n",
"12 Yes Alive 49.2\n",
"19 Yes Alive 65.7\n",
"21 Yes Alive 38.3\n",
"23 Yes Dead 62.3\n",
"26 Yes Alive 59.2\n",
"30 Yes Alive 34.6\n",
"31 Yes Alive 51.9\n",
"32 Yes Alive 49.9\n",
"35 Yes Alive 46.7\n",
"36 Yes Alive 44.4\n",
"37 Yes Alive 29.5\n",
"38 Yes Dead 33.0\n",
"39 Yes Alive 35.6\n",
"40 Yes Alive 39.1\n",
"42 Yes Alive 35.7\n",
"46 Yes Dead 44.3\n",
"48 Yes Alive 37.5\n",
"49 Yes Alive 22.1\n",
"53 Yes Alive 39.0\n",
"56 Yes Alive 40.1\n",
"60 Yes Alive 58.1\n",
"61 Yes Alive 37.3\n",
"63 Yes Dead 36.3\n",
"... ... ... ...\n",
"1240 Yes Alive 29.7\n",
"1243 Yes Alive 40.1\n",
"1251 Yes Alive 27.8\n",
"1252 Yes Alive 52.4\n",
"1253 Yes Alive 27.8\n",
"1254 Yes Alive 41.0\n",
"1259 Yes Alive 40.8\n",
"1260 Yes Alive 20.4\n",
"1263 Yes Alive 20.9\n",
"1264 Yes Alive 45.5\n",
"1269 Yes Alive 38.8\n",
"1270 Yes Alive 55.5\n",
"1271 Yes Alive 24.9\n",
"1273 Yes Alive 55.7\n",
"1149 Yes Alive 21.5\n",
"1111 Yes Alive 41.9\n",
"1127 Yes Alive 32.5\n",
"1115 Yes Dead 63.3\n",
"1114 Yes Dead 31.3\n",
"1143 Yes Alive 26.6\n",
"1102 Yes Alive 29.7\n",
"1142 Yes Dead 71.0\n",
"1140 Yes Alive 42.3\n",
"1276 Yes Alive 58.5\n",
"1278 Yes Alive 43.7\n",
"1282 Yes Alive 51.2\n",
"1284 Yes Dead 36.0\n",
"1285 Yes Alive 48.3\n",
"1288 Yes Dead 39.3\n",
"1132 Yes Alive 18.0\n",
"... ... ... ...\n",
"649 Yes Alive 36.9\n",
"650 Yes Dead 81.8\n",
"611 Yes Alive 43.4\n",
"608 Yes Alive 23.5\n",
"605 Yes Alive 59.0\n",
"604 Yes Alive 43.8\n",
"554 Yes Alive 21.3\n",
"555 Yes Dead 76.9\n",
"558 Yes Dead 75.2\n",
"560 Yes Alive 53.0\n",
"562 Yes Alive 43.7\n",
"563 Yes Alive 50.9\n",
"565 Yes Alive 32.8\n",
"566 Yes Alive 50.7\n",
"567 Yes Dead 66.1\n",
"569 Yes Alive 27.2\n",
"548 Yes Alive 62.1\n",
"571 Yes Alive 38.1\n",
"573 Yes Dead 55.2\n",
"575 Yes Alive 50.9\n",
"580 Yes Alive 42.5\n",
"583 Yes Alive 26.6\n",
"584 Yes Alive 23.3\n",
"587 Yes Alive 34.8\n",
"589 Yes Alive 28.2\n",
"591 Yes Alive 38.5\n",
"592 Yes Alive 41.0\n",
"595 Yes Alive 25.7\n",
"572 Yes Dead 66.8\n",
"656 Yes Alive 43.0\n",
"1295 Yes Dead 82.4\n",
"1296 Yes Alive 38.3\n",
"1297 Yes Alive 32.7\n",
"1299 Yes Dead 60.0\n",
"1303 Yes Alive 31.2\n",
"1304 Yes Alive 47.8\n",
"1305 Yes Alive 60.9\n",
"1307 Yes Alive 43.0\n",
"1309 Yes Alive 35.9\n",
"1311 Yes Dead 62.1\n",
"\n",
"[582 rows x 3 columns]"
]
},
"execution_count": 5,
"execution_count": 21,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"#Affichage\n",
"fumeuses"
]
},
{
"cell_type": "code",
"execution_count": 6,
"execution_count": 22,
"metadata": {},
"outputs": [
{
...
...
@@ -1532,12 +1540,13 @@
"[732 rows x 3 columns]"
]
},
"execution_count": 6,
"execution_count": 22,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"#Affichage\n",
"nonFumeuses"
]
},
...
...
@@ -1763,7 +1772,8 @@
"cell_type": "markdown",
"metadata": {},
"source": [
"On obtient des résultats assez surprenants dans le sens où étant donné que l'on nous a souvent répété que fumer est mauvais pour la santé, nous nous attendions à retrouver ce fait dans cette étude."
"On obtient des résultats assez surprenants dans le sens où, étant donné que l'on nous a souvent répété que fumer est mauvais pour la santé, nous nous attendions à retrouver ce fait dans cette étude.\n",
"Or, nous pouvons observer que le résultat des calculs effectués nous montre l'inverse de ce à quoi nous nous attendions : le groupe de femmes qui ne fumaient pas a un taux de mortalité supérieur à celui composé de femmes qui fumaient."