Campaign_id, Date, {some features}, Number_of_new_customers
The test set consists of multiple campaigns. For each of them, the first several days are given in the train data. For example, if a campaign runs for two weeks, the train set could contain its first three days, and all following days would be in the test set. For another campaign, running for weeks, the train set could contain the first 6 days, and the remaining days would be in the test set.
Identify the train/test split in this competition.
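The description above implies that every campaign appears in both sets and that, within each campaign, the earliest days go to train and the later days go to test. A minimal sketch of how such a split could be reproduced for local validation follows, assuming rows are plain dictionaries with the `Campaign_id` and `Date` columns from the format above. The helper name `split_by_first_days`, the `n_train_days` mapping, and the toy data are hypothetical and used only for illustration.

```python
from collections import defaultdict
from datetime import date, timedelta


def split_by_first_days(rows, n_train_days):
    """Per campaign, send the first N distinct dates to train and the rest to validation.

    rows: list of dicts with keys "Campaign_id" and "Date" (datetime.date).
    n_train_days: hypothetical mapping Campaign_id -> number of leading days
                  kept for training (it may differ from campaign to campaign).
    """
    # Collect the distinct dates on which each campaign has data.
    dates_by_campaign = defaultdict(set)
    for row in rows:
        dates_by_campaign[row["Campaign_id"]].add(row["Date"])

    # For each campaign, find the last date that still belongs to the train part.
    cutoff = {}
    for campaign, dates in dates_by_campaign.items():
        ordered = sorted(dates)
        k = min(n_train_days[campaign], len(ordered))
        cutoff[campaign] = ordered[k - 1] if k > 0 else None

    train, valid = [], []
    for row in rows:
        last_train_day = cutoff[row["Campaign_id"]]
        if last_train_day is not None and row["Date"] <= last_train_day:
            train.append(row)
        else:
            valid.append(row)
    return train, valid


# Toy data (made up): campaign "A" runs for 14 days, campaign "B" for 10 days.
start = date(2024, 1, 1)
rows = [
    {"Campaign_id": "A", "Date": start + timedelta(days=d), "Number_of_new_customers": d}
    for d in range(14)
]
rows += [
    {"Campaign_id": "B", "Date": start + timedelta(days=d), "Number_of_new_customers": 2 * d}
    for d in range(10)
]

# Keep the first 3 days of "A" and the first 6 days of "B" for training.
train, valid = split_by_first_days(rows, {"A": 3, "B": 6})
print(len(train), len(valid))  # 9 15
```

Both campaigns end up in the train part and in the validation part, and the cut inside each campaign is made by date, which mirrors the setup described in the question.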