Provide to me the structure of the loaded data set


Homework: Solving With R Code

Instructions: Using R and the data set provided, provide the code and output that completes the following. I have intentionally left a few minor details for you to either reason through or research. I am aware that there are lots of ways to accomplish many of these tasks in R; I do not really care which way you do them so long your output answers the questions asked.

Expectation: This homework is constructed such that you can easily validate your answers in Excel with minimum effort. Therefore, I expect you to do so.

Note: You might find this tidbit useful. The simple use of the aggregate function in R produces two variables with the labels: "Group.1" and "x". If you want to change the labels you can use the command "names" (e.g. names(x) = c("NewLabel_Group.1", "NewLabel_x")

a. Load the data file.

b. Provide to me the structure of the loaded data set.

c. Provide me a "summary" of the loaded data structure.

d. Count of releases per year.

e. Count of releases for each group of two years (i.e. 1992 and 1993, 1994 and 1995, etc).

f. Average number of the Lines of Code (LOC) per releases per year.

g. Average size of the file size per year.

h. Create a single data frame through code which contains the year along with the avg, median, and standard deviation for LOC and tar file variables.

Format your homework according to the give formatting requirements:

a. The answer must be double spaced, typed, using Times New Roman font (size 12), with one-inch margins on all sides.

b. The response also includes a cover page containing the title of the homework, the course title, the student's name, and the date. The cover page is not included in the required page length.

c. Also include a reference page. The references and Citations should follow APA format. The reference page is not included in the required page length.

Attachment:- CSV-Data-file.rar

Request for Solution File

Ask an Expert for Answer!!
Database Management System: Provide to me the structure of the loaded data set
Reference No:- TGS03140707

Expected delivery within 24 Hours