The post I understand what you think you are predicting, I just can’t work out why you think you predicted it… appeared first on Web Science MOOC.
]]>Going to any of the web related conferences it is impossible to not notice how much of these end up being about predictive modelling. Anyone familiar with this modelling will know the literature is dominated by various forms of linear statistics. Here, you take two or more variables and map them onto each other using some probability distribution and a function representative of a straight line. There are then a range of hypothesis testing things you can do like calculating p-values and the rest.
This has led to many people you talk to seemingly believing that a statistical relationship between two variables exists if and only if there is a p-value of less than some arbitrarily chosen level – say 0.1. This is a very odd conclusion to make for the following reasons;
I think misunderstandings or not discussing the above points causes a lot of bother. Either, people find it easy to dismiss a result based on the fact that they don’t believe such a model can capture the features of the system or, people feel like it is enough to run and off the shelf statistics package and report findings as if the statistic is explanation enough for their conclusions.
The post I understand what you think you are predicting, I just can’t work out why you think you predicted it… appeared first on Web Science MOOC.
]]>