Explaining P-value to a non technical audience

Wikipedia defines p-value as "the probability of obtaining test results at least as extreme as the results actually observed, under the assumption that the null hypothesis is correct". Well if we give this definition, say in a presentation to a product or a business team, you're most probably gonna receive piercing puzzled looks. One of the major … Continue reading Explaining P-value to a non technical audience

Understanding Multicollinearity and Confounding Variables in Regression

Multicollinearity When two or more of the predictors are correlated, this phenomenon is called multicollinearity. This affects the resulting coefficients by masking the underlying individual weights of the correlated variables. This is why model weights are not equal to feature importance. Ways to deal with multicollinearity Looking at Variance Inflation Factor (VIf), which measures the … Continue reading Understanding Multicollinearity and Confounding Variables in Regression