From be9ea82a5a719e018f6b275231c8779bca12bc31 Mon Sep 17 00:00:00 2001
From: 678362a9c2b5d46a6a161a762a6e18e0 <678362a9c2b5d46a6a161a762a6e18e0@app-learninglab.inria.fr>
Date: Fri, 5 Mar 2021 16:28:53 +0000
Subject: [PATCH] data clean
---
module3/exo3/exercice.ipynb | 258 +++++++++++++++++++++++++++++++++---
1 file changed, 240 insertions(+), 18 deletions(-)
diff --git a/module3/exo3/exercice.ipynb b/module3/exo3/exercice.ipynb
index 4c72dbb..148d8b9 100644
--- a/module3/exo3/exercice.ipynb
+++ b/module3/exo3/exercice.ipynb
@@ -12,7 +12,8 @@
"### Prérequis\n",
"\n",
"Techniques de présentation graphique. Cet exercice peut être réalisé indifféremment en R ou en Python.\n",
- "Sujet\n",
+ "\n",
+ "### Sujet\n",
"\n",
"Le but est ici de reproduire des graphes semblables à ceux du South China Morning Post (SCMP), sur la page The Coronavirus Pandemic et qui montrent pour différents pays le nombre cumulé (c'est-à-dire le nombre total de cas depuis le début de l'épidémie) de personnes atteintes de la maladie à coronavirus 2019.\n",
"\n",
@@ -58,27 +59,13 @@
"cell_type": "markdown",
"metadata": {},
"source": [
- "Nous pouvons maintenant télécharger les [données](https://raw.githubusercontent.com/CSSEGISandData/COVID-19/master/csse_covid_19_data/csse_covid_19_time_series/time_series_covid19_confirmed_global.csv)."
+ "We can now download the [data](https://raw.githubusercontent.com/CSSEGISandData/COVID-19/master/csse_covid_19_data/csse_covid_19_time_series/time_series_covid19_confirmed_global.csv) if it has not already been downloaded."
]
},
{
"cell_type": "code",
"execution_count": 2,
"metadata": {},
- "outputs": [],
- "source": [
- "data_url = \"https://raw.githubusercontent.com/CSSEGISandData/COVID-19/master/csse_covid_19_data/csse_covid_19_time_series/time_series_covid19_confirmed_global.csv\"\n",
- "\n",
- "data_file = \"data_covid.csv\"\n",
- "\n",
- "if not os.path.exists(data_file):\n",
- " urllib.request.urlretrieve(data_url, data_file)"
- ]
- },
- {
- "cell_type": "code",
- "execution_count": 4,
- "metadata": {},
"outputs": [
{
"data": {
@@ -1850,21 +1837,256 @@
"[274 rows x 412 columns]"
]
},
- "execution_count": 4,
+ "execution_count": 2,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
+ "# Data URL\n",
+ "data_url = \"https://raw.githubusercontent.com/CSSEGISandData/COVID-19/master/csse_covid_19_data/csse_covid_19_time_series/time_series_covid19_confirmed_global.csv\"\n",
+ "\n",
+ "# Name of the local CSV file\n",
+ "data_file = \"data_covid.csv\"\n",
+ "\n",
+ "# Download the data if it is not already present in the directory\n",
+ "if not os.path.exists(data_file):\n",
+ " urllib.request.urlretrieve(data_url, data_file)\n",
+ "\n",
+ "# Load and display the data\n",
"raw_data = pd.read_csv(data_file)\n",
"raw_data"
]
},
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "The data is organized in 274 rows (one per country or province) and, for now, 412 columns: province, country, latitude and longitude, followed by the cumulative number of cases recorded on each day from 22 January 2020 to 4 March 2021, the latest date available at the time of writing.\n",
+ "\n",
+ "Let us check the date columns for missing values. The regex `\\d{1,2}\\/\\d{1,2}\\/\\d{2}` selects the columns whose name contains:\n",
+ "one or two digits followed by a `/`, twice, and then two digits,\n",
+ "that is, the date columns. (Filtering with `[^\\d{1,2}\\/\\d{1,2}\\/\\d{2}]` instead returns rows containing NaN. Note that `[^...]` is a negated character class rather than a negation of the whole pattern: it matches any name containing a character other than a digit, `/`, `{`, `}` or `,`, which here means the four non-date columns.)"
+ ]
+ },
{
"cell_type": "code",
- "execution_count": null,
+ "execution_count": 3,
+ "metadata": {},
+ "outputs": [
+ {
+ "data": {
+ "text/html": [
+ "<div>\n",
+ "<table>\n",
+ " <thead>\n",
+ " <tr>\n",
+ " <th></th>\n",
+ " <th>Province/State</th>\n",
+ " <th>Country/Region</th>\n",
+ " <th>Lat</th>\n",
+ " <th>Long</th>\n",
+ " <th>1/22/20</th>\n",
+ " <th>1/23/20</th>\n",
+ " <th>1/24/20</th>\n",
+ " <th>1/25/20</th>\n",
+ " <th>1/26/20</th>\n",
+ " <th>1/27/20</th>\n",
+ " <th>...</th>\n",
+ " <th>2/23/21</th>\n",
+ " <th>2/24/21</th>\n",
+ " <th>2/25/21</th>\n",
+ " <th>2/26/21</th>\n",
+ " <th>2/27/21</th>\n",
+ " <th>2/28/21</th>\n",
+ " <th>3/1/21</th>\n",
+ " <th>3/2/21</th>\n",
+ " <th>3/3/21</th>\n",
+ " <th>3/4/21</th>\n",
+ " </tr>\n",
+ " </thead>\n",
+ " <tbody>\n",
+ " </tbody>\n",
+ "</table>\n",
+ "<p>0 rows × 412 columns</p>\n",
+ "</div>"
+ ],
+ "text/plain": [
+ "Empty DataFrame\n",
+ "Columns: [Province/State, Country/Region, Lat, Long, 1/22/20, 1/23/20, 1/24/20, 1/25/20, 1/26/20, 1/27/20, 1/28/20, 1/29/20, 1/30/20, 1/31/20, 2/1/20, 2/2/20, 2/3/20, 2/4/20, 2/5/20, 2/6/20, 2/7/20, 2/8/20, 2/9/20, 2/10/20, 2/11/20, 2/12/20, 2/13/20, 2/14/20, 2/15/20, 2/16/20, 2/17/20, 2/18/20, 2/19/20, 2/20/20, 2/21/20, 2/22/20, 2/23/20, 2/24/20, 2/25/20, 2/26/20, 2/27/20, 2/28/20, 2/29/20, 3/1/20, 3/2/20, 3/3/20, 3/4/20, 3/5/20, 3/6/20, 3/7/20, 3/8/20, 3/9/20, 3/10/20, 3/11/20, 3/12/20, 3/13/20, 3/14/20, 3/15/20, 3/16/20, 3/17/20, 3/18/20, 3/19/20, 3/20/20, 3/21/20, 3/22/20, 3/23/20, 3/24/20, 3/25/20, 3/26/20, 3/27/20, 3/28/20, 3/29/20, 3/30/20, 3/31/20, 4/1/20, 4/2/20, 4/3/20, 4/4/20, 4/5/20, 4/6/20, 4/7/20, 4/8/20, 4/9/20, 4/10/20, 4/11/20, 4/12/20, 4/13/20, 4/14/20, 4/15/20, 4/16/20, 4/17/20, 4/18/20, 4/19/20, 4/20/20, 4/21/20, 4/22/20, 4/23/20, 4/24/20, 4/25/20, 4/26/20, ...]\n",
+ "Index: []\n",
+ "\n",
+ "[0 rows x 412 columns]"
+ ]
+ },
+ "execution_count": 3,
+ "metadata": {},
+ "output_type": "execute_result"
+ }
+ ],
+ "source": [
+ "raw_data[raw_data.filter(regex=r\"\\d{1,2}\\/\\d{1,2}\\/\\d{2}\").isnull().any(axis=1)]"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "The output table is empty, so none of the date columns has a missing value.\n",
+ "\n",
+ "Let us list the countries we are going to analyze. Each one is represented in the data by its name in the `Country/Region` column and `NaN` in the `Province/State` column, except China, for which we will have to sum all the provinces on the one hand and extract Hong Kong on the other."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 12,
"metadata": {},
"outputs": [],
+ "source": [
+ "countries = [ \n",
+ " \"Belgium\",\n",
+ " \"France\",\n",
+ " \"Germany\",\n",
+ " \"Iran\",\n",
+ " \"Italy\",\n",
+ " \"Japan\",\n",
+ " \"Korea, South\",\n",
+ " \"Netherlands\",\n",
+ " \"Portugal\",\n",
+ " \"Spain\",\n",
+ " \"United Kingdom\",\n",
+ " \"US\",\n",
+ "# \"China\",\n",
+ "# \"China, Hong-Kong\",\n",
+ "]"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "Let us store the relevant rows in a new table:"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 28,
+ "metadata": {},
+ "outputs": [
+ {
+ "name": "stdout",
+ "output_type": "stream",
+ "text": [
+ "[[nan, 'France', 46.2276, 2.2137, 0, 0, 2, 3, 3, 3, 4, 5, 5, 5, 6, 6, 6, 6, 6, 6, 6, 11, 11, 11, 11, 11, 11, 11, 12, 12, 12, 12, 12, 12, 12, 12, 12, 12, 14, 18, 38, 57, 100, 130, 191, 212, 285, 423, 613, 938, 1115, 1401, 1773, 2281, 2281, 3661, 4469, 4499, 6633, 7652, 9043, 10871, 12612, 14282, 16533, 19856, 22054, 24967, 28856, 32609, 37162, 39761, 44029, 51579, 56362, 58404, 63588, 46483, 47299, 49934, 46400, 50242, 54003, 55538, 56972, 107712, 110836, 128272, 131476, 144035, 145896, 149683, 150752, 153197, 155821, 157603, 155881, 158653, 160295, 160907, 164592, 167650, 166193, 167326, 167846, 168637, 168887, 169405, 170467, 174596, 175218, 175743, 176132, 176338, 176651, 177330, 177790, 178353, 178837, 179140, 179250, 179650, 180143, 180499, 180752, 181444, 181969, 181410, 181684, 181880, 182018, 185300, 185842, 187608, 187817, 187986, 187265, 184015, 188286, 188808, 189330, 189621, 189782, 190266, 190743, 191040, 191679, 192043, 192317, 192369, 192915, 193256, 193576, 194121, 194535, 194599, 194693, 195256, 195256, 195256, 196267, 196521, 196111, 196390, 196554, 197238, 197591, 197994, 197994, 197994, 198850, 199170, 199699, 200027, 200569, 200569, 200569, 201755, 201755, 202383, 202819, 203516, 203516, 203516, 205308, 205773, 206641, 207650, 208553, 208553, 208553, 210821, 211337, 212676, 213927, 215135, 215135, 215135, 218351, 219253, 220874, 222398, 224602, 224602, 224602, 229203, 230540, 232966, 235540, 238240, 241371, 244335, 244761, 246821, 250539, 255214, 259717, 263129, 267589, 269452, 272643, 272643, 283698, 290929, 296265, 301554, 304349, 309247, 316047, 322935, 331578, 340013, 346921, 351017, 357473, 365025, 374586, 383854, 394270, 400533, 406609, 414196, 423924, 433871, 446961, 460328, 470864, 476162, 485929, 497914, 513732, 529501, 543688, 554304, 558374, 566196, 578769, 591783, 603920, 620330, 632834, 637918, 648116, 666849, 684164, 704478, 731155, 747092, 755597, 768380, 790959, 820685, 845740, 877840, 907677, 920916, 941085, 967717, 1008483, 
1050468, 1095708, 1102262, 1174279, 1207430, 1243855, 1290446, 1339640, 1374691, 1420870, 1525184, 1597139, 1549838, 1607147, 1667604, 1754202, 1792764, 1812919, 1834515, 1870380, 1902644, 1926314, 1958235, 1985427, 1994728, 2039938, 2068297, 2088784, 2111646, 2129287, 2142417, 2146869, 2155722, 2171967, 2184846, 2197283, 2209739, 2219502, 2223393, 2231344, 2245324, 2257894, 2269080, 2281889, 2292838, 2296249, 2309760, 2324300, 2337594, 2350923, 2364740, 2376228, 2379291, 2390419, 2407983, 2426063, 2441737, 2458839, 2471458, 2477255, 2488889, 2503693, 2525234, 2545358, 2548388, 2556621, 2559575, 2570767, 2597124, 2616902, 2636045, 2639404, 2651893, 2655915, 2676195, 2701215, 2722728, 2742349, 2762120, 2777921, 2781503, 2800770, 2824473, 2845701, 2866413, 2887532, 2903975, 2907711, 2931050, 2957547, 2979915, 3002910, 3026320, 3044358, 3048598, 3070458, 3097374, 3120634, 3143417, 3167274, 3186354, 3190701, 3213360, 3239256, 3238798, 3284085, 3303273, 3322988, 3327305, 3345558, 3370645, 3390952, 3410996, 3431805, 3447518, 3451894, 3471268, 3495775, 3517177, 3541282, 3562707, 3584326, 3588972, 3608271, 3639501, 3664050, 3689034, 3712474, 3732426, 3736390, 3759247, 3785326, 3810605]]\n",
+ "[[nan, 'Germany', 51.165690999999995, 10.451526, 0, 0, 0, 0, 0, 1, 4, 4, 4, 5, 8, 10, 12, 12, 12, 12, 13, 13, 14, 14, 16, 16, 16, 16, 16, 16, 16, 16, 16, 16, 16, 16, 16, 16, 17, 27, 46, 48, 79, 130, 159, 196, 262, 482, 670, 799, 1040, 1176, 1457, 1908, 2078, 3675, 4585, 5795, 7272, 9257, 12327, 15320, 19848, 22213, 24873, 29056, 32986, 37323, 43938, 50871, 57695, 62095, 66885, 71808, 77872, 84794, 91159, 96092, 100123, 103374, 107663, 113296, 118181, 122171, 124908, 127854, 130072, 131359, 134753, 137698, 141397, 143342, 145184, 147065, 148291, 150648, 153129, 154999, 156513, 157770, 158758, 159912, 161539, 163009, 164077, 164967, 165664, 166152, 167007, 168162, 169430, 170588, 171324, 171879, 172576, 173171, 174098, 174478, 175233, 175752, 176369, 176551, 177778, 178473, 179021, 179710, 179986, 180328, 180600, 181200, 181524, 182196, 182922, 183189, 183410, 183594, 183879, 184121, 184472, 184924, 185450, 185750, 186109, 186506, 186522, 186691, 187226, 187267, 187518, 187682, 188252, 188604, 189817, 190299, 190670, 191272, 191768, 192480, 192871, 193371, 194036, 194458, 194693, 195042, 195418, 195893, 196370, 196780, 197198, 197523, 198064, 198343, 198699, 199001, 199332, 199709, 199919, 200180, 200456, 200890, 201450, 202045, 202426, 202735, 203325, 203717, 204276, 204881, 205623, 206278, 206667, 207112, 207707, 208546, 209535, 210399, 211005, 211220, 212111, 212828, 214113, 215039, 216196, 216903, 217288, 218508, 219540, 220859, 222281, 223791, 224488, 225007, 226700, 228120, 229706, 231292, 233029, 233861, 234494, 236122, 237583, 239010, 240571, 242126, 242835, 243305, 244802, 246015, 247411, 248840, 250283, 251058, 251728, 253626, 254957, 256433, 258149, 259735, 260817, 261737, 263222, 265014, 266869, 269048, 271247, 272932, 273965, 275560, 277412, 279025, 281346, 283712, 285026, 286339, 288631, 290471, 292913, 295539, 298374, 300027, 301573, 304673, 307127, 311137, 315941, 320495, 323463, 326309, 332850, 337314, 344487, 352107, 359802, 364664, 368671, 
377068, 385591, 397922, 403874, 417350, 427808, 437698, 450258, 463419, 479621, 498354, 517736, 531790, 544346, 569598, 577131, 608611, 631172, 653992, 668114, 682624, 689146, 715693, 738094, 762832, 785093, 799733, 802946, 817526, 843757, 867484, 891525, 914118, 927990, 932367, 946822, 963192, 995879, 1017830, 1038649, 1052494, 1055691, 1069912, 1094678, 1117953, 1141544, 1157514, 1183640, 1194550, 1200006, 1229269, 1254358, 1287092, 1314309, 1336101, 1350810, 1357261, 1391086, 1423830, 1454009, 1486839, 1508518, 1514962, 1534218, 1570371, 1604129, 1630596, 1632736, 1646240, 1658639, 1672643, 1692109, 1741153, 1760520, 1762637, 1773540, 1783896, 1796216, 1814565, 1841228, 1886561, 1905638, 1928462, 1929410, 1941116, 1968326, 1993892, 2015235, 2023828, 2038645, 2050129, 2059382, 2071615, 2100618, 2108895, 2125261, 2137691, 2147769, 2154656, 2164043, 2179679, 2194562, 2207393, 2224911, 2225659, 2232327, 2240017, 2252504, 2265536, 2276371, 2285003, 2291673, 2296323, 2302051, 2311297, 2321225, 2330422, 2336906, 2341744, 2346876, 2352766, 2362364, 2372209, 2381259, 2388421, 2394515, 2399499, 2405263, 2416037, 2427069, 2436506, 2444177, 2450295, 2455569, 2462061, 2472913, 2484306]]\n",
+ "[[nan, 'Iran', 32.427908, 53.68804599999999, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 2, 5, 18, 28, 43, 61, 95, 139, 245, 388, 593, 978, 1501, 2336, 2922, 3513, 4747, 5823, 6566, 7161, 8042, 9000, 10075, 11364, 12729, 13938, 14991, 16169, 17361, 18407, 19644, 20610, 21638, 23049, 24811, 27017, 29406, 32332, 35408, 38309, 41495, 44605, 47593, 50468, 53183, 55743, 58226, 60500, 62589, 64586, 66220, 68192, 70029, 71686, 73303, 74877, 76389, 77995, 79494, 80868, 82211, 83505, 84802, 85996, 87026, 88194, 89328, 90481, 91472, 92584, 93657, 94640, 95646, 96448, 97424, 98647, 99970, 101650, 103135, 104691, 106220, 107603, 109286, 110767, 112725, 114533, 116635, 118392, 120198, 122492, 124603, 126949, 129341, 131652, 133521, 135701, 137724, 139511, 141591, 143849, 146668, 148950, 151466, 154445, 157562, 160696, 164270, 167156, 169425, 171789, 173832, 175927, 177938, 180156, 182525, 184955, 187427, 189876, 192439, 195051, 197647, 200262, 202584, 204952, 207525, 209970, 212501, 215096, 217724, 220180, 222669, 225205, 227662, 230211, 232863, 235429, 237878, 240438, 243051, 245688, 248379, 250458, 252720, 255117, 257303, 259652, 262173, 264561, 267061, 269440, 271606, 273788, 276202, 278827, 281413, 284034, 286523, 288839, 291172, 293606, 296273, 298909, 301530, 304204, 306752, 309437, 312035, 314786, 317483, 320117, 322567, 324692, 326712, 328844, 331189, 333699, 336324, 338825, 341070, 343203, 345450, 347835, 350279, 352558, 354764, 356792, 358905, 361150, 363363, 365606, 367796, 369911, 371816, 373570, 375212, 376894, 378752, 380746, 382772, 384666, 386658, 388810, 391112, 393425, 395488, 397801, 399940, 402029, 404648, 407353, 410334, 413149, 416198, 419043, 422140, 425481, 429193, 432798, 436319, 439882, 443086, 446448, 449960, 453637, 457219, 461044, 464596, 468119, 471772, 475674, 479825, 483844, 488236, 492378, 496253, 500075, 504281, 508389, 513219, 517835, 522387, 526490, 530380, 534631, 539670, 545286, 550757, 556891, 
562705, 568896, 574856, 581824, 588648, 596941, 604952, 612772, 620491, 628780, 637712, 646164, 654936, 663800, 673250, 682486, 692949, 703288, 715068, 726585, 738322, 749525, 762068, 775121, 788473, 801894, 815117, 828377, 841308, 854361, 866821, 880542, 894385, 908346, 922397, 935799, 948749, 962070, 975951, 989572, 1003494, 1016835, 1028986, 1040547, 1051374, 1062397, 1072620, 1083023, 1092407, 1100818, 1108269, 1115770, 1123474, 1131077, 1138530, 1145651, 1152072, 1158384, 1164535, 1170743, 1177004, 1183182, 1189203, 1194963, 1200465, 1206373, 1212481, 1218753, 1225142, 1231429, 1237474, 1243434, 1249507, 1255620, 1261903, 1268263, 1274514, 1280438, 1286406, 1292614, 1299022, 1305339, 1311810, 1318295, 1324395, 1330411, 1336217, 1342134, 1348316, 1354520, 1360852, 1367032, 1372977, 1379286, 1385706, 1392314, 1398841, 1405414, 1411731, 1417999, 1424596, 1431416, 1438286, 1445326, 1452380, 1459370, 1466435, 1473756, 1481396, 1488981, 1496455, 1503753, 1510873, 1518263, 1526023, 1534034, 1542076, 1550142, 1558159, 1566081, 1574012, 1582275, 1590605, 1598875, 1607081, 1615184, 1623159, 1631169, 1639679, 1648174, 1656699, 1665103]]\n",
+ "[[nan, 'Italy', 41.87194, 12.56738, 0, 0, 0, 0, 0, 0, 0, 0, 0, 2, 2, 2, 2, 2, 2, 2, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 20, 62, 155, 229, 322, 453, 655, 888, 1128, 1694, 2036, 2502, 3089, 3858, 4636, 5883, 7375, 9172, 10149, 12462, 15113, 17660, 21157, 24747, 27980, 31506, 35713, 41035, 47021, 53578, 59138, 63927, 69176, 74386, 80589, 86498, 92472, 97689, 101739, 105792, 110574, 115242, 119827, 124632, 128948, 132547, 135586, 139422, 143626, 147577, 152271, 156363, 159516, 162488, 165155, 168941, 172434, 175925, 178972, 181228, 183957, 187327, 189973, 192994, 195351, 197675, 199414, 201505, 203591, 205463, 207428, 209328, 210717, 211938, 213013, 214457, 215858, 217185, 218268, 219070, 219814, 221216, 222104, 223096, 223885, 224760, 225435, 225886, 226699, 227364, 228006, 228658, 229327, 229858, 230158, 230555, 231139, 231732, 232248, 232664, 232997, 233197, 233515, 233836, 234013, 234531, 234801, 234998, 235278, 235561, 235763, 236142, 236305, 236651, 236989, 237290, 237500, 237828, 238159, 238011, 238275, 238499, 238720, 238833, 239410, 239706, 239961, 240136, 240310, 240436, 240578, 240760, 240961, 241184, 241419, 241611, 241819, 241956, 242149, 242363, 242639, 242827, 243061, 243230, 243344, 243506, 243736, 243967, 244216, 244434, 244624, 244752, 245032, 245338, 245590, 245864, 246118, 246286, 246488, 246776, 247158, 247537, 247832, 248070, 248229, 248419, 248803, 249204, 249756, 250103, 250566, 250825, 251237, 251713, 252235, 252809, 253438, 253915, 254235, 254636, 255278, 256118, 257065, 258136, 259345, 260298, 261174, 262540, 263949, 265409, 266853, 268218, 269214, 270189, 271515, 272912, 274644, 276338, 277634, 278784, 280153, 281583, 283180, 284796, 286297, 287753, 288761, 289990, 291442, 293025, 294932, 296569, 298156, 299506, 300897, 302537, 304323, 306235, 308104, 309870, 311364, 313011, 314861, 317409, 319908, 322751, 325329, 327586, 330263, 333940, 338398, 343770, 349494, 354950, 359569, 365467, 372799, 381602, 391611, 402536, 414241, 423578, 
434449, 449648, 465726, 484869, 504509, 525782, 542789, 564778, 589766, 616595, 647674, 679430, 709335, 731588, 759829, 790377, 824879, 862681, 902490, 935104, 960373, 995463, 1028424, 1066401, 1107303, 1144552, 1178529, 1205881, 1238072, 1272352, 1308528, 1345767, 1380531, 1408868, 1431795, 1455022, 1480874, 1509875, 1538217, 1564532, 1585178, 1601554, 1620901, 1641610, 1664829, 1688939, 1709991, 1728878, 1742557, 1757394, 1770149, 1787147, 1805873, 1825775, 1843712, 1855737, 1870576, 1888144, 1906377, 1921778, 1938083, 1953185, 1964054, 1977370, 1991278, 2009317, 2028354, 2038759, 2047696, 2056277, 2067487, 2083689, 2107166, 2129376, 2141201, 2155446, 2166244, 2181619, 2201945, 2220361, 2237890, 2257866, 2276491, 2289021, 2303263, 2319036, 2336279, 2352423, 2368733, 2381277, 2390102, 2400598, 2414166, 2428221, 2441854, 2455185, 2466813, 2475372, 2485956, 2501147, 2515507, 2529070, 2541783, 2553032, 2560957, 2570608, 2583790, 2597446, 2611659, 2625098, 2636738, 2644707, 2655319, 2668266, 2683403, 2697296, 2710819, 2721879, 2729223, 2739591, 2751657, 2765412, 2780882, 2795796, 2809246, 2818863, 2832162, 2848564, 2868435, 2888923, 2907825, 2925265, 2938371, 2955434, 2976274, 2999119]]\n",
+ "[[nan, 'Japan', 36.204824, 138.252924, 2, 2, 2, 2, 4, 4, 7, 7, 11, 15, 20, 20, 20, 22, 23, 23, 23, 24, 24, 26, 27, 28, 33, 43, 54, 60, 67, 79, 85, 95, 112, 137, 149, 160, 173, 192, 218, 236, 245, 259, 278, 298, 333, 365, 420, 466, 499, 527, 585, 640, 696, 733, 795, 826, 843, 893, 928, 968, 1022, 1059, 1104, 1144, 1217, 1314, 1416, 1530, 1728, 1907, 2001, 2255, 2535, 2818, 3154, 3525, 3876, 4110, 4485, 5020, 5614, 6250, 6951, 7473, 7773, 8277, 8835, 9398, 9958, 10548, 10914, 11258, 11641, 12037, 12469, 12854, 13186, 13405, 13576, 13860, 14076, 14284, 14558, 14861, 15061, 15229, 15354, 15455, 15553, 15640, 15755, 15824, 15861, 15948, 15998, 16096, 16148, 16202, 16226, 16259, 16287, 16321, 16362, 16385, 16410, 16451, 16472, 16502, 16528, 16598, 16673, 16716, 16751, 16787, 16837, 16867, 16911, 16958, 17000, 17039, 17060, 17111, 17146, 17187, 17250, 17293, 17369, 17439, 17484, 17530, 17588, 17658, 17725, 17780, 17820, 17879, 17963, 18055, 18162, 18254, 18366, 18476, 18615, 18838, 19055, 19185, 19461, 19668, 19848, 20055, 20261, 20617, 21044, 21430, 21841, 22125, 22437, 23172, 23510, 24104, 24946, 25446, 25706, 26463, 27136, 28114, 28883, 29684, 30548, 31142, 32116, 33382, 35144, 36234, 37804, 39116, 40099, 41347, 42686, 44167, 45764, 47342, 48782, 49617, 50302, 51288, 52471, 53818, 55051, 56074, 56717, 57636, 58728, 59900, 60949, 61916, 62658, 63158, 63888, 64779, 65653, 66499, 67353, 67958, 68396, 69023, 69619, 70278, 70866, 71467, 71918, 72213, 72724, 73264, 73916, 74558, 75206, 75646, 75914, 76446, 76997, 77488, 78061, 78662, 79142, 79462, 79773, 80009, 80490, 81054, 81703, 82186, 82484, 83022, 83591, 84244, 84768, 85345, 85746, 86027, 86540, 87039, 87679, 88267, 88962, 89400, 89652, 90153, 90694, 91402, 92044, 92670, 93098, 93408, 93895, 94515, 95134, 95868, 96599, 97095, 97503, 98146, 98877, 99674, 100450, 101327, 101943, 102431, 103309, 103928, 104964, 106136, 107439, 108394, 109191, 110487, 112011, 113655, 115360, 117113, 118611, 119557, 121247, 123477, 
125859, 128285, 130871, 133034, 134554, 135786, 137735, 140225, 142778, 145457, 147515, 148962, 150976, 153432, 155938, 158387, 160870, 162917, 164434, 166592, 169429, 172406, 175310, 178272, 180639, 182311, 184752, 187751, 190950, 193753, 196815, 199270, 201074, 203732, 207007, 210769, 214575, 218467, 221412, 223786, 227415, 231271, 235811, 239068, 242097, 245293, 248625, 253571, 259521, 267084, 274947, 282737, 288818, 293746, 298321, 304140, 310734, 317871, 324942, 330715, 335605, 340989, 346508, 352129, 357174, 361928, 365899, 368684, 372545, 376482, 380644, 384180, 387499, 390166, 391956, 394269, 396900, 399492, 401858, 404128, 405765, 406992, 408550, 410434, 412125, 413441, 414803, 416154, 417127, 418435, 419896, 421421, 422718, 423952, 424988, 425725, 426828, 427732, 428816, 429873, 431093, 432090, 432778, 433700, 434944, 436093]]\n",
+ "[[nan, 'Korea, South', 35.907757000000004, 127.766922, 1, 1, 2, 2, 3, 4, 4, 4, 4, 11, 12, 15, 15, 16, 19, 23, 24, 24, 25, 27, 28, 28, 28, 28, 28, 29, 30, 31, 31, 104, 204, 433, 602, 833, 977, 1261, 1766, 2337, 3150, 3736, 4335, 5186, 5621, 6088, 6593, 7041, 7314, 7478, 7513, 7755, 7869, 7979, 8086, 8162, 8236, 8320, 8413, 8565, 8652, 8799, 8961, 8961, 9037, 9137, 9241, 9332, 9478, 9583, 9661, 9786, 9887, 9976, 10062, 10156, 10237, 10284, 10331, 10384, 10423, 10450, 10480, 10512, 10537, 10564, 10591, 10613, 10635, 10653, 10661, 10674, 10683, 10694, 10708, 10718, 10728, 10738, 10752, 10761, 10765, 10774, 10780, 10793, 10801, 10804, 10806, 10810, 10822, 10840, 10874, 10909, 10936, 10962, 10991, 11018, 11037, 11050, 11065, 11078, 11110, 11122, 11142, 11165, 11190, 11206, 11225, 11265, 11344, 11402, 11441, 11468, 11503, 11541, 11590, 11629, 11668, 11719, 11776, 11814, 11852, 11902, 11947, 12003, 12051, 12085, 12121, 12155, 12198, 12257, 12306, 12373, 12421, 12438, 12484, 12535, 12563, 12602, 12653, 12715, 12757, 12800, 12850, 12904, 12967, 13030, 13091, 13137, 13181, 13244, 13293, 13338, 13373, 13417, 13479, 13512, 13551, 13612, 13672, 13711, 13745, 13771, 13816, 13879, 13938, 13979, 14092, 14150, 14175, 14203, 14251, 14269, 14305, 14336, 14366, 14389, 14423, 14456, 14499, 14519, 14562, 14598, 14626, 14660, 14714, 14770, 14873, 15039, 15318, 15515, 15761, 16058, 16346, 16670, 17002, 17399, 17665, 17945, 18265, 18706, 19077, 19400, 19699, 19947, 20182, 20449, 20644, 20842, 21010, 21177, 21296, 21432, 21588, 21743, 21919, 22055, 22176, 22285, 22391, 22504, 22657, 22783, 22893, 22975, 23045, 23106, 23216, 23341, 23455, 23516, 23611, 23661, 23699, 23812, 23889, 23952, 24027, 24091, 24164, 24239, 24353, 24422, 24476, 24548, 24606, 24703, 24805, 24889, 24988, 25035, 25108, 25199, 25275, 25333, 25424, 25543, 25698, 25775, 25836, 25955, 26043, 26146, 26271, 26385, 26511, 26635, 26732, 26807, 26925, 27050, 27195, 27284, 27427, 27553, 27653, 27799, 27942, 28133, 28338, 28546, 
28769, 28998, 29311, 29654, 30017, 30403, 30733, 31004, 31353, 31735, 32318, 32887, 33375, 33824, 34201, 34652, 35163, 35703, 36332, 36915, 37546, 38161, 38755, 39432, 40098, 40786, 41736, 42766, 43484, 44364, 45442, 46453, 47515, 48570, 49665, 50591, 51460, 52550, 53533, 54770, 55902, 56872, 57680, 58725, 59773, 60740, 61769, 62593, 63244, 64264, 64979, 65818, 66686, 67358, 67999, 68664, 69114, 69651, 70212, 70728, 71241, 71820, 72340, 72729, 73115, 73518, 73918, 74262, 74692, 75084, 75521, 75875, 76429, 76926, 77395, 77850, 78205, 78508, 78844, 79311, 79762, 80131, 80524, 80896, 81185, 81487, 81930, 82434, 82837, 83199, 83525, 83869, 84325, 84946, 85567, 86128, 86574, 86992, 87324, 87681, 88120, 88516, 88922, 89321, 89676, 90031, 90372, 90816, 91240, 91638]]\n",
+ "[[nan, 'Netherlands', 52.1326, 5.2913, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 6, 10, 18, 24, 38, 82, 128, 188, 265, 321, 382, 503, 503, 804, 959, 1135, 1413, 1705, 2051, 2460, 2994, 3631, 4204, 4749, 5560, 6412, 7431, 8603, 9762, 10866, 11750, 12595, 13614, 14697, 15723, 16627, 17851, 18803, 19580, 20549, 21762, 23097, 24413, 25587, 26551, 27419, 28153, 29214, 30449, 31589, 32655, 33405, 34134, 34842, 35729, 36535, 37190, 37845, 38245, 38416, 38802, 39316, 39791, 40236, 40571, 40770, 41087, 41319, 41774, 42093, 42382, 42627, 42788, 42984, 43211, 43481, 43681, 43870, 43995, 44141, 44249, 44447, 44700, 44888, 45064, 45236, 45445, 45578, 45768, 45950, 46126, 46257, 46442, 46545, 46647, 46733, 46942, 47152, 47335, 47574, 47739, 47903, 48087, 48251, 48461, 48640, 48783, 48948, 49087, 49204, 49319, 49426, 49502, 49593, 49658, 49722, 49804, 49914, 50005, 50074, 50147, 50223, 50273, 50335, 50412, 50487, 50548, 50621, 50657, 50694, 50746, 50798, 50840, 50921, 51022, 51093, 51146, 51252, 51351, 51454, 51581, 51725, 51910, 52073, 52241, 52404, 52595, 52732, 53005, 53151, 53374, 53621, 53963, 54301, 54732, 55098, 55470, 55955, 56381, 56982, 57501, 57987, 58564, 59194, 59424, 60627, 61204, 61840, 62495, 63002, 63484, 63973, 64525, 65054, 65589, 66097, 66554, 67128, 67543, 68114, 68624, 69131, 69632, 70140, 70667, 71129, 71863, 72464, 73208, 73862, 74787, 75584, 76548, 77688, 78511, 79781, 81012, 82099, 83399, 84778, 86320, 88073, 90047, 91934, 93778, 95995, 98240, 100597, 103141, 105918, 108631, 111626, 114540, 117551, 120845, 124097, 127922, 131889, 135892, 140471, 144999, 149988, 155810, 161781, 168280, 174653, 181498, 188876, 196163, 203954, 211938, 220052, 228234, 236226, 244391, 253134, 262405, 272401, 281052, 291254, 301597, 311889, 319991, 330255, 341374, 351178, 359861, 368147, 375890, 383523, 390488, 397730, 404401, 410065, 414745, 419412, 424819, 430453, 436544, 442458, 447871, 452701, 
457003, 461612, 467257, 473190, 479260, 484648, 489818, 493744, 498653, 503123, 508866, 513325, 518910, 523478, 527523, 532444, 538050, 543924, 550445, 557224, 564291, 570437, 576965, 585685, 594523, 603603, 613487, 621944, 628577, 639746, 652525, 664456, 676673, 689705, 700873, 710683, 721071, 732570, 744057, 753905, 762985, 770400, 777902, 787300, 796981, 805164, 813765, 821163, 827726, 834064, 841163, 850790, 858914, 866235, 872847, 878263, 883135, 889221, 895687, 901696, 906956, 912554, 917308, 921580, 927110, 932884, 938628, 944042, 948933, 952950, 956867, 961593, 966252, 970602, 974775, 978475, 981663, 985224, 989237, 993430, 997751, 1001826, 1005760, 1007981, 1009725, 1012908, 1017302, 1021643, 1025837, 1029284, 1032094, 1034795, 1038156, 1042674, 1047400, 1051965, 1056639, 1060801, 1064598, 1068960, 1073971, 1079084, 1084021, 1088690, 1092452, 1096433, 1101430, 1105544]]\n",
+ "[[nan, 'Portugal', 39.3999, -8.2245, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 2, 2, 5, 8, 13, 20, 30, 30, 41, 59, 59, 112, 169, 245, 331, 448, 448, 785, 1020, 1280, 1600, 2060, 2362, 2995, 3544, 4268, 5170, 5962, 6408, 7443, 8251, 9034, 9886, 10524, 11278, 11730, 12442, 13141, 13956, 15472, 15987, 16585, 16934, 17448, 18091, 18841, 19022, 19685, 20206, 20863, 21379, 21982, 22353, 22797, 23392, 23864, 24027, 24322, 24505, 25045, 25351, 25190, 25282, 25524, 25702, 26182, 26715, 27268, 27406, 27581, 27679, 27913, 28132, 28319, 28583, 28810, 29036, 29209, 29432, 29660, 29912, 30200, 30471, 30623, 30788, 31007, 31292, 31596, 31946, 32203, 32500, 32700, 32895, 33261, 33592, 33969, 34351, 34693, 34885, 35306, 35600, 35910, 36180, 36463, 36690, 37036, 37336, 37672, 38089, 38464, 38841, 39133, 39392, 39737, 40104, 40415, 40866, 41189, 41646, 41912, 42141, 42454, 42782, 43156, 43569, 43897, 44129, 44416, 44859, 45277, 45679, 46221, 46512, 46818, 47051, 47426, 47765, 48077, 48390, 48636, 48771, 48898, 49150, 49379, 49692, 49955, 50164, 50299, 50410, 50613, 50868, 51072, 51310, 51463, 51569, 51681, 51848, 52061, 52351, 52537, 52668, 52825, 52945, 53223, 53548, 53783, 53981, 54102, 54234, 54448, 54701, 54992, 55211, 55452, 55597, 55720, 55912, 56274, 56673, 57074, 57448, 57768, 58012, 58243, 58633, 59051, 59457, 59943, 60258, 60507, 60895, 61541, 62126, 62813, 63310, 63983, 64596, 65021, 65626, 66396, 67176, 68025, 68577, 69200, 69663, 70465, 71156, 72055, 72939, 73604, 74029, 74717, 75542, 76396, 77284, 78247, 79151, 79885, 80312, 81256, 82534, 83928, 85574, 86664, 87913, 89121, 91193, 93294, 95902, 98055, 99911, 101860, 103736, 106271, 109541, 112440, 116109, 118686, 121133, 124432, 128392, 132616, 137272, 141279, 144341, 146847, 149443, 156940, 161350, 166900, 173540, 179324, 183420, 187237, 192172, 198011, 204664, 211266, 217301, 225672, 230124, 236015, 243009, 249498, 255970, 260758, 264802, 
268721, 274011, 280394, 285838, 290706, 294799, 298061, 300462, 303846, 307618, 312553, 318640, 322474, 325071, 327976, 332073, 335207, 340287, 344700, 348744, 350938, 353576, 358296, 362616, 366952, 370787, 374121, 376220, 378656, 383258, 387636, 391782, 392996, 394573, 396666, 400002, 406051, 413678, 420629, 423870, 427254, 431623, 436579, 446606, 456533, 466709, 476187, 483689, 489293, 496552, 507108, 517806, 528469, 539416, 549801, 556503, 566958, 581605, 595149, 609136, 624469, 636190, 643113, 653878, 668951, 685383, 698583, 711018, 720516, 726321, 731861, 740944, 748858, 755774, 761906, 765414, 767919, 770502, 774889, 778369, 781223, 784079, 785756, 787059, 788561, 790885, 792829, 794769, 796339, 797525, 798074, 799106, 800586, 801746, 802773, 803844, 804562, 804956, 805647, 806626, 807456]]\n",
+ "[[nan, 'Spain', 40.463667, -3.7492199999999998, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 1, 1, 1, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 6, 13, 15, 32, 45, 84, 120, 165, 222, 259, 400, 500, 673, 1073, 1695, 2277, 2277, 5232, 6391, 7798, 9942, 11748, 13910, 17963, 20410, 25374, 28768, 35136, 39885, 49515, 57786, 65719, 73235, 80110, 87956, 95923, 104118, 112065, 119199, 126168, 131646, 136675, 141942, 148220, 153222, 158273, 163027, 166831, 170099, 172541, 177644, 184948, 190839, 191726, 198674, 200210, 204178, 208389, 213024, 202990, 205905, 207634, 209465, 210773, 212917, 213435, 215216, 216582, 217466, 218011, 219329, 220325, 221447, 222857, 223578, 224350, 227436, 228030, 228691, 229540, 230183, 230698, 230698, 231606, 232037, 232555, 233037, 234824, 235290, 235772, 235400, 236259, 236259, 237906, 238564, 239228, 239479, 239638, 239932, 240326, 240660, 240978, 241310, 241550, 241717, 241966, 242280, 242707, 243209, 243605, 243928, 244109, 244328, 244683, 245268, 245575, 245938, 246272, 246504, 246752, 247086, 247486, 247905, 248469, 248770, 248970, 249271, 249659, 250103, 250545, 250545, 250545, 251789, 252130, 252513, 253056, 253908, 253908, 253908, 255953, 256619, 257494, 258855, 260255, 260255, 260255, 264836, 266194, 267551, 270166, 272421, 272421, 272421, 278782, 280610, 282641, 285430, 288522, 288522, 288522, 297054, 302814, 305767, 309855, 314362, 314362, 314362, 322980, 326612, 329784, 337334, 342813, 342813, 342813, 359082, 364196, 370867, 377906, 386054, 386054, 386054, 405436, 412553, 419849, 429507, 439286, 439286, 439286, 462858, 470973, 479554, 488513, 498989, 498989, 498989, 525549, 534513, 543379, 554143, 566326, 566326, 566326, 593730, 603167, 614360, 625651, 640040, 640040, 640040, 671468, 682267, 693556, 704209, 716481, 716481, 716481, 748266, 748266, 769188, 778607, 789932, 789932, 789932, 813412, 825410, 835901, 848324, 861112, 861112, 861112, 888968, 896086, 908056, 921374, 936560, 936560, 936560, 974449, 988322, 1005295, 
1026281, 1046132, 1046132, 1046132, 1098320, 1116738, 1136503, 1160083, 1185678, 1185678, 1185678, 1240697, 1259366, 1284408, 1306316, 1328832, 1328832, 1328832, 1381218, 1381218, 1417709, 1437220, 1458591, 1458591, 1458591, 1496864, 1510023, 1525341, 1541574, 1556730, 1556730, 1556730, 1582616, 1594844, 1605066, 1617355, 1628208, 1628208, 1628208, 1648187, 1656444, 1665775, 1675902, 1684647, 1684647, 1684647, 1702328, 1702328, 1712101, 1720056, 1730575, 1730575, 1730575, 1751884, 1762212, 1773290, 1785421, 1797236, 1797236, 1797236, 1819249, 1829903, 1842289, 1854951, 1854951, 1854951, 1854951, 1879413, 1893502, 1910218, 1928265, 1928265, 1928265, 1928265, 1958844, 1982544, 1982544, 2024904, 2050360, 2050360, 2050360, 2111782, 2137220, 2176089, 2211967, 2252164, 2252164, 2252164, 2336451, 2370742, 2412318, 2456675, 2499560, 2499560, 2499560, 2593382, 2629817, 2670102, 2705001, 2743119, 2743119, 2743119, 2822805, 2851869, 2883465, 2913425, 2941990, 2941990, 2941990, 2989085, 3005487, 3023601, 3041454, 3056035, 3056035, 3056035, 3086286, 3096343, 3107172, 3121687, 3133122, 3133122, 3133122, 3153971, 3161432, 3170644, 3180212, 3188553, 3188553, 3188553, 3204531, 3130184, 3136321, 3142358]]\n",
+ "[[nan, 'United Kingdom', 55.3781, -3.4360000000000004, 0, 0, 0, 0, 0, 0, 0, 0, 0, 2, 2, 2, 8, 8, 9, 9, 9, 13, 14, 14, 15, 16, 17, 18, 18, 18, 19, 19, 20, 22, 23, 23, 28, 30, 34, 37, 44, 56, 61, 94, 134, 189, 245, 294, 373, 428, 482, 629, 887, 1298, 1787, 2266, 2630, 3072, 3684, 4452, 5451, 6506, 7760, 8957, 10333, 12668, 15039, 17732, 20816, 24017, 26839, 29696, 33969, 38484, 43398, 48263, 53178, 57198, 60792, 66067, 71517, 76646, 81498, 85813, 89390, 92885, 97068, 101393, 106458, 111756, 116721, 121437, 125289, 130147, 134907, 140397, 145540, 150513, 154261, 157729, 162431, 167152, 172587, 177543, 182270, 185491, 188465, 191843, 195527, 199358, 203125, 206174, 208324, 210645, 214228, 217617, 220915, 223524, 226041, 228106, 229932, 232506, 235547, 238253, 240795, 242825, 244332, 245680, 247287, 248937, 250739, 252473, 253977, 255076, 256145, 257579, 259046, 260388, 261622, 262727, 263518, 264235, 265321, 266474, 267656, 268657, 269710, 270597, 271404, 272430, 273507, 274504, 275524, 276504, 277170, 277792, 278684, 279566, 280340, 281037, 281675, 282308, 282703, 283307, 283710, 283770, 283774, 284276, 284900, 285416, 285768, 286349, 286979, 287621, 288133, 288953, 289603, 290133, 291373, 291911, 292552, 293239, 294066, 294792, 295372, 295817, 296377, 297146, 297914, 298681, 299426, 300111, 300658, 301455, 302301, 303181, 303942, 304685, 305623, 306293, 307184, 308134, 309005, 309763, 310825, 311641, 312789, 313798, 314927, 316367, 317379, 318484, 319197, 320286, 321098, 322280, 323313, 324601, 325642, 326614, 327798, 328846, 330368, 331644, 332752, 334467, 335873, 337168, 338676, 340411, 342351, 344164, 347152, 350100, 352560, 355219, 358138, 361677, 365174, 368504, 371125, 374228, 378219, 381614, 385936, 390358, 394257, 398625, 403551, 409729, 416363, 423236, 429277, 434969, 439013, 446156, 453264, 460178, 467146, 480017, 502978, 515571, 530113, 544275, 561815, 575679, 590844, 603716, 617688, 634920, 654644, 673622, 689257, 705428, 722409, 741212, 762542, 
789229, 810467, 830998, 854010, 873800, 894690, 917575, 942275, 965340, 989745, 1011660, 1034914, 1053864, 1073882, 1099059, 1123197, 1146484, 1171441, 1192013, 1213363, 1233775, 1256725, 1290195, 1317496, 1344356, 1369318, 1390681, 1410732, 1430341, 1453256, 1473508, 1493383, 1512045, 1527495, 1538794, 1557007, 1574562, 1589301, 1605172, 1617327, 1629657, 1643086, 1659256, 1674134, 1690432, 1705971, 1723242, 1737960, 1750241, 1766819, 1787783, 1809455, 1830956, 1849403, 1869666, 1888116, 1913277, 1948660, 1977167, 2004219, 2040147, 2073511, 2110314, 2149551, 2188587, 2221312, 2256005, 2288345, 2329730, 2382865, 2432888, 2488780, 2542065, 2599789, 2654779, 2713563, 2774479, 2836801, 2889419, 2957472, 3017409, 3072349, 3118518, 3164051, 3211576, 3260258, 3316019, 3357361, 3395959, 3433494, 3466849, 3505754, 3543646, 3583907, 3617459, 3647463, 3669658, 3689746, 3715054, 3743734, 3772813, 3796088, 3817176, 3835783, 3852623, 3871825, 3892459, 3911573, 3929835, 3945680, 3959784, 3972148, 3985161, 3998655, 4013799, 4027106, 4038078, 4047843, 4058468, 4071185, 4083242, 4095269, 4105675, 4115509, 4126150, 4134639, 4144577, 4154562, 4163085, 4170519, 4176554, 4182009, 4188400, 4194785, 4201358]]\n",
+ "[[nan, 'US', 40.0, -100.0, 1, 1, 2, 2, 5, 5, 5, 6, 6, 8, 8, 8, 11, 11, 11, 12, 12, 12, 12, 12, 13, 13, 14, 14, 14, 14, 14, 14, 14, 14, 16, 16, 16, 16, 16, 16, 17, 17, 25, 32, 55, 74, 107, 184, 237, 403, 519, 594, 782, 1147, 1586, 2219, 2978, 3212, 4679, 6512, 9169, 13663, 20030, 26025, 34898, 46136, 56755, 68837, 86693, 105383, 125013, 143912, 165987, 192301, 224544, 256779, 289066, 321482, 351359, 382740, 413507, 444699, 480640, 515055, 544185, 571440, 598370, 627151, 652591, 682626, 715656, 743588, 769684, 799512, 825429, 854288, 887858, 920185, 950580, 977081, 1000784, 1025361, 1051799, 1081019, 1115945, 1143295, 1167592, 1191677, 1216208, 1240767, 1268179, 1295017, 1320153, 1339020, 1358291, 1381238, 1401646, 1428464, 1453211, 1477370, 1495733, 1518123, 1539130, 1561827, 1587593, 1611249, 1632361, 1652428, 1671100, 1690750, 1709299, 1731621, 1756093, 1779726, 1798713, 1816148, 1837651, 1857495, 1879144, 1904544, 1925704, 1943620, 1961257, 1979639, 2000749, 2023881, 2048746, 2073954, 2092902, 2112721, 2136391, 2163449, 2191971, 2223531, 2255800, 2280946, 2313093, 2350156, 2386014, 2426329, 2472311, 2513651, 2554376, 2595655, 2642081, 2693890, 2750517, 2801874, 2847553, 2898322, 2941394, 3002036, 3062146, 3124633, 3192672, 3252687, 3311137, 3370026, 3438038, 3506145, 3581954, 3654203, 3716740, 3777211, 3839293, 3903780, 3974341, 4042769, 4116070, 4180969, 4235816, 4292554, 4359003, 4430816, 4498234, 4566931, 4623105, 4668652, 4714173, 4772945, 4827393, 4886727, 4945996, 5000107, 5045855, 5093475, 5141430, 5197413, 5248690, 5314012, 5360914, 5400103, 5436757, 5481755, 5529102, 5572890, 5621596, 5664641, 5698902, 5735502, 5775618, 5820908, 5866093, 5913046, 5955722, 5990118, 6025593, 6067260, 6108200, 6152285, 6202700, 6245682, 6276981, 6300449, 6327884, 6361638, 6397758, 6445407, 6486484, 6520750, 6555127, 6594554, 6633389, 6678579, 6727725, 6769847, 6808147, 6860061, 6899723, 6938731, 6985760, 7033944, 7078562, 7116078, 7149281, 7192498, 7231832, 7277418, 
7332238, 7380797, 7416539, 7455962, 7501009, 7551949, 7610451, 7666716, 7721622, 7767500, 7809362, 7861469, 7921089, 7985864, 8054907, 8111563, 8160870, 8228585, 8290408, 8353581, 8429792, 8511514, 8594218, 8656238, 8723553, 8800136, 8879326, 8970125, 9069103, 9158771, 9263540, 9348641, 9475561, 9579953, 9709053, 9836805, 9964007, 10079085, 10199542, 10339769, 10486134, 10650509, 10830672, 10998189, 11134318, 11296818, 11460242, 11633088, 11824228, 12022158, 12201266, 12347905, 12521898, 12697001, 12879677, 12991818, 13199752, 13355018, 13495104, 13655220, 13842652, 14044625, 14267421, 14499637, 14714853, 14895691, 15089952, 15313819, 15535463, 15766130, 16004659, 16221125, 16408428, 16601499, 16810792, 17056440, 17295460, 17546621, 17738237, 17925587, 18123968, 18321157, 18550002, 18743916, 18841414, 19067574, 19222891, 19396787, 19595354, 19827770, 20061903, 20215297, 20515083, 20723157, 20906563, 21140217, 21394210, 21671278, 21963383, 22225720, 22438881, 22653070, 22879069, 23108826, 23343727, 23584671, 23785679, 23962788, 24105873, 24281531, 24464045, 24656940, 24846933, 25017235, 25148046, 25299108, 25445778, 25598384, 25767014, 25933090, 26075178, 26187035, 26321351, 26436155, 26557346, 26680589, 26814096, 26917986, 27007586, 27097493, 27192663, 27287431, 27392829, 27492273, 27579257, 27644213, 27698190, 27760660, 27830489, 27899756, 27979051, 28050458, 28107350, 28163046, 28234656, 28309085, 28386492, 28463190, 28527344, 28578548, 28637313, 28694071, 28759980, 28827144]]\n"
+ ]
+ }
+ ],
+ "source": [
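+ "data = []\n",
+ "for country in countries:\n",
+ "    # Keep only the country-level row (no Province/State); .values.tolist() returns a list of rows.\n",
+ "    new_row = raw_data[(raw_data['Country/Region']==country) & (raw_data['Province/State'].isnull())].values.tolist()\n",
+ "    print(new_row)\n",
+ "    # Collect the rows; without this step the DataFrame built below stays empty.\n",
+ "    data.extend(new_row)\n",
+ "dataf = pd.DataFrame(data, columns=raw_data.columns)"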
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 29,
+ "metadata": {},
+ "outputs": [
+ {
+ "data": {
+ "text/html": [
+ "<div>\n",
+ "<table border=\"1\" class=\"dataframe\">\n",
+ "  <thead>\n",
+ "    <tr style=\"text-align: right;\">\n",
+ "      <th></th>\n",
+ "      <th>Province/State</th>\n",
+ "      <th>Country/Region</th>\n",
+ "      <th>Lat</th>\n",
+ "      <th>Long</th>\n",
+ "      <th>1/22/20</th>\n",
+ "      <th>1/23/20</th>\n",
+ "      <th>1/24/20</th>\n",
+ "      <th>1/25/20</th>\n",
+ "      <th>1/26/20</th>\n",
+ "      <th>1/27/20</th>\n",
+ "      <th>...</th>\n",
+ "      <th>2/23/21</th>\n",
+ "      <th>2/24/21</th>\n",
+ "      <th>2/25/21</th>\n",
+ "      <th>2/26/21</th>\n",
+ "      <th>2/27/21</th>\n",
+ "      <th>2/28/21</th>\n",
+ "      <th>3/1/21</th>\n",
+ "      <th>3/2/21</th>\n",
+ "      <th>3/3/21</th>\n",
+ "      <th>3/4/21</th>\n",
+ "    </tr>\n",
+ "  </thead>\n",
+ "  <tbody>\n",
+ "  </tbody>\n",
+ "</table>\n",
+ "<p>0 rows × 412 columns</p>\n",
+ "</div>"
+ ],
+ "text/plain": [
+ "Empty DataFrame\n",
+ "Columns: [Province/State, Country/Region, Lat, Long, 1/22/20, 1/23/20, 1/24/20, 1/25/20, 1/26/20, 1/27/20, 1/28/20, 1/29/20, 1/30/20, 1/31/20, 2/1/20, 2/2/20, 2/3/20, 2/4/20, 2/5/20, 2/6/20, 2/7/20, 2/8/20, 2/9/20, 2/10/20, 2/11/20, 2/12/20, 2/13/20, 2/14/20, 2/15/20, 2/16/20, 2/17/20, 2/18/20, 2/19/20, 2/20/20, 2/21/20, 2/22/20, 2/23/20, 2/24/20, 2/25/20, 2/26/20, 2/27/20, 2/28/20, 2/29/20, 3/1/20, 3/2/20, 3/3/20, 3/4/20, 3/5/20, 3/6/20, 3/7/20, 3/8/20, 3/9/20, 3/10/20, 3/11/20, 3/12/20, 3/13/20, 3/14/20, 3/15/20, 3/16/20, 3/17/20, 3/18/20, 3/19/20, 3/20/20, 3/21/20, 3/22/20, 3/23/20, 3/24/20, 3/25/20, 3/26/20, 3/27/20, 3/28/20, 3/29/20, 3/30/20, 3/31/20, 4/1/20, 4/2/20, 4/3/20, 4/4/20, 4/5/20, 4/6/20, 4/7/20, 4/8/20, 4/9/20, 4/10/20, 4/11/20, 4/12/20, 4/13/20, 4/14/20, 4/15/20, 4/16/20, 4/17/20, 4/18/20, 4/19/20, 4/20/20, 4/21/20, 4/22/20, 4/23/20, 4/24/20, 4/25/20, 4/26/20, ...]\n",
+ "Index: []\n",
+ "\n",
+ "[0 rows x 412 columns]"
+ ]
+ },
+ "execution_count": 29,
+ "metadata": {},
+ "output_type": "execute_result"
+ }
+ ],
+ "source": [
+ "dataf"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
"source": []
}
],
--
2.18.1