————————————————————————————————–
0.5000
————————————————————————————————–
0.5417
————————————————————————————————–
8.4296
————————————————————————————————–
9.8768
Question 1
Let’s once again consider the customer reward program dataset. For your convenience, here the original data set.
In this exercise, we use machine learning methods including trees and neural networks. Set all random seeds to 12345 in XLMiner; this should be the default value if you have not changed them. Also use default options in XLMiner unless directed otherwise.
1. Build a full classification tree model with Reward column as the target variable and the following set of predictor variables: Salerank, X2013USSales, X2013WorldSales, ProfitMargin, NumStores, and IndustryType. Which feature has the highest feature importance?
[Hint: Use Classify in the Data Mining ribbon since Reward is a 0-1 variable.]
Question 2
Split the dataset into training and validation sets using a 60:40 split. Use the same columns as in the previous question.
What is the precision for on the validation data for bagged tree (round the answer to 4 decimal places, i.e. )?
Question 3
What is the precision on the validation data for boosted tree (round the answer to 4 decimal places, i.e. )?
Question 4
Build a neural network model with RewardSize column as the target variable and the following set of predictor variables: Salerank, X2013USSales, X2013WorldSales, NumStores, and ProfitMargin. Note that RewardSize variable is only relevant for retailers that use reward programs. For your convenience, here is the data file with values for RewardSize, which should be used for model building.
Split the dataset into training and validation set using a 60:40 split. Also remember to scale the data. Report RMSE on the validation data for the following models: (i) bagged neural net, (ii) boosted neural net. [Hint: Use Predict in the Data Mining ribbon since RewardSize is a continuous variable.]
Report RMSE on the validation data for the bagged neural net (round the answer to 4 decimal places, i.e. )?
Question 5
Report RMSE on the validation data for the boosted neural net (round the answer to 4 decimal places, i.e. ):
Week 2 Quiz of Predictive Modeling and Analytics 1. Question 1 Which type of target variable are we dealing with in linear regression? 1 point Binary Categorical Continuous Imaginary…
Week 1 Quiz of Predictive Modeling and Analytics 1. Question 1 At what stage(s) of Data Exploration would you address missing values in a data set? 1 point Data…
Week 3 Quiz of Predictive Modeling and Analytics 1. Question 1 A soccer team is believed to have a 8 to 2 odds of winning the election. What is…
Week 4 Quiz of Predictive Modeling and Analytics 1. Question 1 Consider the following split in the appointment data. What is the Gini index for the branch Age<65.5? 1…
Week 3 Application Assignment of Predictive Modeling and Analytics 1. Question 1 Let’s reconsider the customer reward program dataset. In this exercise, you will complete a predictive modeling task…
Week 2 Application Assignment of Predictive Modeling and Analytics 1. Question 1 In this assignment we are continuing to work with customer reward programs (review assignments from Week 1…