Which attributes seem to matter the most, and how do you know?


1. As a first step, build at least 3 models using different classification techniques that can classify customers into 2 categories (churn/no churn). At least one of the 3 models should be either a decision tree or a neural network.

Make sure that the process includes the cross-validation operator (3 folds per validation are enough and save some process runtime). Where possible, experiment with the parameter settings of the model operators to try to improve the model's performance.

Make readable screenshots of the following for all 3 models:

- Processes

- Parameter settings for the type of model (Decision Trees, Neural Networks, etc.)

- Appropriate model results (e.g. coefficients, tree, network)

2. To measure the performance of the 3 models, look at the following performance measures:

- Accuracy

- Kappa

- Precision

- Recall

- Lift

- AUC (NOT the optimistic or pessimistic variant)

(Hint: use the binomial classification performance operator to obtain all of these measures.)
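As a cross-check on the operator's output, most of these measures can be recomputed by hand from the four confusion-matrix counts. The following is a minimal Python sketch, not the tool's own implementation. The function name `binary_metrics` and the example counts are hypothetical, churn is assumed to be the positive class, and lift is assumed to mean precision divided by the a priori churn rate (the operator's definition may differ). AUC is left out because it needs the per-customer confidence scores, not just the counts.

```python
def binary_metrics(tp, fp, fn, tn):
    """Recompute basic measures from confusion-matrix counts.

    Assumption: the positive class is "churn".
    """
    n = tp + fp + fn + tn
    accuracy = (tp + tn) / n
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0

    # A priori probability of churn (share of actual churners).
    prior = (tp + fn) / n

    # Assumed definition of lift: precision relative to the base rate.
    lift = precision / prior if prior else 0.0

    # Cohen's kappa: agreement beyond what chance alone would give.
    p_chance = ((tp + fp) * (tp + fn) + (tn + fn) * (tn + fp)) / (n * n)
    kappa = (accuracy - p_chance) / (1 - p_chance) if p_chance < 1 else 0.0

    return {
        "accuracy": accuracy,
        "kappa": kappa,
        "precision": precision,
        "recall": recall,
        "lift": lift,
        "prior": prior,
    }


# Hypothetical counts for illustration only, not results from the data set.
metrics = binary_metrics(tp=30, fp=20, fn=20, tn=130)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Comparing accuracy with `1 - prior` (the accuracy of always predicting "no churn") is one simple way to relate a model's performance to the a priori probabilities.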

a. Make a screenshot of the confusion matrix output of each of the 3 methods.

b. Prepare a table to report the above values for the 3 models.

c. Discuss the performance for each of the 3 models based on the above values. Relate the performances to the a priori probabilities of the outcome as well.

d. Suppose that when you correctly predict that a customer will churn (a true positive), you can make a special promotional offer that results in the customer staying with you.

You estimate that the net profit from such an outcome is $200 per customer. However, if you predict that a customer will churn when they actually would not have (a false positive), you incur a net cost of $50 per customer by unnecessarily making the offer.

Using this information, compute and show a profit/cost matrix for each model and report the per-customer expected value of each model.
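The expected-value calculation can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the $200 and $50 figures come from the task, while the zero values for false negatives and true negatives are an assumption (the task gives no figure for them), and the names `PROFIT` and `expected_value_per_customer` as well as the example counts are hypothetical.

```python
# Profit/cost matrix per customer.
# tp and fp come from the task; fn and tn are ASSUMED to be 0
# because no offer is made and the task states no value for them.
PROFIT = {"tp": 200.0, "fp": -50.0, "fn": 0.0, "tn": 0.0}


def expected_value_per_customer(counts, profit=PROFIT):
    """Weight each confusion-matrix cell by its profit and average over all customers."""
    n = sum(counts.values())
    total = sum(counts[cell] * profit[cell] for cell in counts)
    return total / n


# Hypothetical counts for illustration only, not results from the data set.
example_counts = {"tp": 30, "fp": 20, "fn": 20, "tn": 130}
print(expected_value_per_customer(example_counts))
```

Running the same function on each model's confusion matrix gives one comparable figure per model. If lost revenue from missed churners should count as a cost, the `fn` entry can be set to a negative value instead of 0.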

e. Prepare a visual evaluation of the 3 models by including a screenshot of the ROC comparison chart.

(Hint: This may require the building of a separate process from the previous ones.)

f. Using the above information, compare the performance of the 3 models. What are the similarities/differences in their performance?

3. Answer the following questions:

a. Which attributes seem to matter the most? How do you know? Discuss their importance and/or effect sizes. How can you interpret the results of the models?

b. Do the 3 models give you more or less the same suggestions? Do the models agree in most aspects? If there are differences, what are they?

4. What business recommendations can be given based on the results? How could the results of the model(s) be useful for the company?

Attachment: Analytics Portfolio Exercise.rar
