Exploratory data analysis
Week – 2 Exploratory data analysis >>> How to Win a Data Science Competition Learn from Top Kagglers
2
On the top figure you see a scatter plot for variables XX and YY. Variable ZZ is a function of XX and YY and on the bottom figure a scatter plot between XX and ZZ is shown. Can you recover ZZ as a function of XX and YY?
2
(note that it is not the same variable XX as in previous questions).
Which hypotheses about variable XX do NOT contradict with the plots? In other words: what hypotheses we can’t reject (not in statistical sense) based on the plots and our intuition?
We use target variable yy to colorcode the points.
The other three plots were produced by jittering XX and YY values
That is, we add Gaussian noise to the features before drawing scatter plot.
Select the correct statements.
Week – 1 Graded Soft/Hard Quiz of How to Win a Data Science Competition Learn from Top Kagglers 1. Which library provides the most convenient way to perform matrix multiplication?…
Week – 1 Recap of How to Win a Data Science Competition 1. What back propagation is usually used for in neural networks? 1 point To propagate signal through network…
Week – 1 Feature extraction from text and images >>> How to Win a Data Science Competition: Learn from Top Kagglers 1. Select true statements about n-grams 2 points N-grams…
Week – 2 Validation >>> How to Win a Data Science Competition: Learn from Top Kagglers 1. Select true statements 1 point Performance increase on a fixed cross-validation split guaranties…
Week – 3 Mean encodings >>> How to Win a Data Science Competition: Learn from Top Kagglers 1. What can be an indicator of usefulness of mean encodings? 1 point…