Examine datatype and comment on appropriateness of each


Assignment: Jupyter

Create a Jupyter notebook containing a title of "homework 2", your name, and the course CPSMA 4313. Load any libraries you will use in a code block at the beginning.

Gather the Fitbit data provided on Kaggle and in the GitHub repo for this course. I have used ‘quotes’ when referring to a column in the dataset.

1. Store the data as a pandas dataframe. Examine each datatype and comment on the appropriateness of each.

2. Remove the ‘unnamed’ column, which only repeats the index.

3. Clean the column names to remove the unit declaration, (%), using regular expressions. The column name should not have any trailing spaces after cleaning. You will only receive partial credit for simply renaming columns without using regular expressions.

4. Convert the ‘activity day’ column into a datetime format.

5. Impute ‘total steps’ by replacing the ‘NaN’ values with an appropriate number of steps. Convert the column to an appropriate datatype.

6. Convert the non-empty ‘avg pace’ values into floats that are still representative of the information contained in the column. Recall that there are 60 seconds in one minute, so 3:30 is equivalent to 3.5 minutes.

7. Group the data by ‘workout type’ and find the mean, median, count, and standard deviation of ‘calories’.

8. Create an indicator column that identifies whether the activity achieved 30% or more ‘aerobic’ activity.

9. Which day of the week (Monday, Tuesday, etc.) and ‘workout type’ has the maximum ‘max cadence’?
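
The steps above can be sketched on a tiny made-up table. This is only an illustration, not the expected solution: the sample rows, the exact column spellings (for example ‘Unnamed: 0’ and ‘aerobic (%)’), the helper `pace_to_minutes`, the new column names ‘aerobic 30 plus’ and ‘weekday’, and the choice of the median for imputation are all assumptions. The real file should be read with `pd.read_csv` and its actual column names checked first.

```python
import re

import numpy as np
import pandas as pd

# Made-up sample rows; the real Kaggle file's columns and values may differ.
raw = pd.DataFrame({
    "Unnamed: 0": [0, 1, 2, 3],
    "activity day": ["2022-03-07", "2022-03-08", "2022-03-09", "2022-03-10"],
    "workout type": ["Run", "Walk", "Run", "Walk"],
    "total steps": [8000.0, np.nan, 12000.0, 6000.0],
    "avg pace": ["5:30", None, "6:15", "10:00"],
    "calories": [400, 150, 520, 180],
    "aerobic (%)": [45.0, 10.0, 30.0, 25.0],
    "max cadence": [170, 110, 182, 115],
})

# Step 1: inspect the datatypes before changing anything.
print(raw.dtypes)

# Step 2: drop any column whose name starts with "unnamed" (case-insensitive).
df = raw.drop(columns=[c for c in raw.columns if re.match(r"(?i)unnamed", c)])

# Step 3: strip a trailing "(%)" unit plus surrounding spaces with a regex.
df.columns = [re.sub(r"\s*\(%\)\s*$", "", c).strip() for c in df.columns]

# Step 4: parse the date strings into datetime64 values.
df["activity day"] = pd.to_datetime(df["activity day"])

# Step 5: fill missing steps with the column median (one possible choice),
# then store the result as whole numbers.
median_steps = df["total steps"].median()
df["total steps"] = df["total steps"].fillna(median_steps).round().astype("int64")


def pace_to_minutes(value):
    """Turn 'M:SS' into decimal minutes; missing values stay NaN."""
    if pd.isna(value):
        return np.nan
    minutes, seconds = str(value).split(":")
    return int(minutes) + int(seconds) / 60


# Step 6: '5:30' becomes 5.5, '6:15' becomes 6.25, and so on.
df["avg pace"] = df["avg pace"].map(pace_to_minutes)

# Step 7: summary statistics of calories per workout type.
summary = df.groupby("workout type")["calories"].agg(
    ["mean", "median", "count", "std"]
)

# Step 8: 1 if at least 30% aerobic, otherwise 0.
df["aerobic 30 plus"] = (df["aerobic"] >= 30).astype(int)

# Step 9: weekday name and workout type of the row with the largest max cadence.
df["weekday"] = df["activity day"].dt.day_name()
top = df.loc[df["max cadence"].idxmax(), ["weekday", "workout type", "max cadence"]]

print(summary)
print(top)
```

The median is used for imputation here because it is less sensitive to unusually long or short days than the mean; a per-‘workout type’ median would be another defensible choice and should be justified in the notebook either way.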

Format your assignment according to the given formatting requirements:

1. The answer must be double spaced, typed, using Times New Roman font (size 12), with one-inch margins on all sides.

2. The response must also include a cover page containing the title of the assignment, the course title, the student's name, and the date. The cover page is not included in the required page length.

3. Also include a reference page. The references and citations should follow APA format. The reference page is not included in the required page length.
