You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
"name": "Troy Bradley Copy of Statistics - Coding Challenge #1.ipynb",
7
+
"version": "0.3.2",
8
+
"provenance": []
9
+
},
10
+
"kernelspec": {
11
+
"name": "python3",
12
+
"display_name": "Python 3"
13
+
}
14
+
},
15
+
"cells": [
16
+
{
17
+
"cell_type": "markdown",
18
+
"metadata": {
19
+
"id": "view-in-github",
20
+
"colab_type": "text"
21
+
},
22
+
"source": [
23
+
"[View in Colaboratory](https://colab.research.google.com/github/bitcointroy/MLcodechallenges/blob/master/Troy_Bradley_Copy_of_Statistics_Coding_Challenge_1.ipynb)"
24
+
]
25
+
},
26
+
{
27
+
"metadata": {
28
+
"id": "Vim6ATB8ADUI",
29
+
"colab_type": "text"
30
+
},
31
+
"cell_type": "markdown",
32
+
"source": [
33
+
"# Statistics Coding Challenge #1\n",
34
+
"\n",
35
+
"In this coding challenge, we are going to use the \"Accidental Drug Related Deaths 2012-2017 (State of Connecticut)\" data set available from the Data.Gov website (https://catalog.data.gov/dataset?groups=local&organization_type=State+Government#topic=local_navigation). \n",
36
+
"\n",
37
+
"There are 2 main objectives you need to accomplish:\n",
38
+
"\n",
39
+
"1) First treat missing values for the \"Death City\" attribute - replace any missing values in the \"Death City\" with the city that has experienced the most number of deaths\n",
40
+
"\n",
41
+
"For each city, do the following:\n",
42
+
"\n",
43
+
"2) Compute summary statistics for the *age* attribute:\n",
44
+
"\n",
45
+
"\n",
46
+
"a) Mean\n",
47
+
"\n",
48
+
"b) Median\n",
49
+
"\n",
50
+
"c) 25%, 50% and 75% percentiles using [np.percentile](https://docs.scipy.org/doc/numpy-1.14.0/reference/generated/numpy.percentile.html)\n",
51
+
"\n",
52
+
"d) Examine any outliers in data using [Boxplots](https://matplotlib.org/2.1.1/gallery/statistics/boxplot_demo.html)\n",
53
+
"\n",
54
+
"e) Construct a Bee Swarm plot to highlight the distibution of age by each city using [seaborn.swarmplot](https://seaborn.pydata.org/generated/seaborn.swarmplot.html)\n",
0 commit comments