Create a histogram for totalsales and create a histogram


Assignment -

For this assignment use the "Red Bluff Golf Course Data".csv file from the Topic 6 Assignment as a starting point. The model is specifically focused on the "TotalSales" and "QuantitySold" fields (variables) in the file.

Using IBM SPSS Modeler, format the model shown in the "Topic 7 Assignment Model Requirements" resource.

As depicted in the resource, the model requires the nodes described below.

1. Excel Source Node: Use the same "Red Bluff Golf Course Data" .csv file from the Topic 6 assignment. Resave this file as an Excel (.xlsx) file.

2. Type Node: Ensure that all the data is "read" into this node. Provide a screenshot of the field headers and the first few rows of the data.

3. Data Audit Node: Perform a data audit on the "TotalSales" and "QuantitySold" fields. Are there any missing data for these fields? Do any extreme outliers exist for these fields? Provide a screenshots of the "Audit" and "Quality" results.

4. Table Node: Use this node to ensure that the data successfully loaded into the Type Node. Ensure that the data here matches what is in the original Excel source file. Provide a screenshot of the "Table" output that includes the field headers and first few rows of the dataset.

5. Statistics Node: Run all available descriptive statistics on the "TotalSales" and "QuantitySold" fields. Which field exhibits the more variability and why? Perform a correlation analysis on the "TotalSales" and "QuantitySold" fields. What is the general correlation between these fields: high, medium, or low? Justify your selection. Provide a screenshot of the "Statistics Report."

6. Histogram Nodes: Create a histogram for "TotalSales" and create a histogram for "QuantitySold." Do these fields appear to have normal distributions? Why or why not? Provide a screenshot of the histograms.

7. Plot Node: Create a scatterplot using "QuantitySold" as the x variable and "TotalSales" as the y variable. What is the general relationship between these variables? Provide a screenshot of the scatter plot.

8. Regression Node: Run a regression analysis using "QuantitySold" as the x variable and "TotalSales" as the y variable. In the "Model" tab, ensure that the "Include constant in equation" box is checked. Leave all other settings unchanged.

9. Gold Nugget Node: This node should appear after executing the regression analysis. Export the "Output" report (from the Advanced tab) to a Word file.

10.  Hint: The derived regression equation and r-squared results obtained here should be the same as you obtained in the Topic 6 assignment for the simple regression analysis for "QuantitySold" and "TotalSales."

Attachment:- Assignment & Data File.rar

Request for Solution File

Ask an Expert for Answer!!
Applied Statistics: Create a histogram for totalsales and create a histogram
Reference No:- TGS02643996

Expected delivery within 24 Hours