Statistics: Statistics *
Independence
Basic independence
Going back to our initial example with a customer rating our movie on a scale of 1 to 5, denoted as , and them being happy, denoted as . Suppose we now introduce yet another variable, say , which denotes whether the user has blue eyes, again taking on values 1 and 0 denoting whether or not the user has blue eyes respectively. In this case, there is no reason to assume that my belief that someone is happy changes after observing whether or not someone has blue eyes, i.e.
When and are not independent, i.e. when tells me something about , we call them dependent.
Exercise Suppose we want to parametrize a joint distribution over 10 binary random variables, . If we do not assume any independence, we would need parameters to describe this joint distribution. Suppose now that all are independent, i.e. . Explain why now we only need parameters to describe our joint distribution.
Conditional Independence
In a real-life situation, simple independence is almost something you never encounter. A much more common scenario is that of two random variables being independent given some third variable.
Example of conditional independence Suppose we consider two students that both travel by the same train to the university, and we denote as the binary variable describing whether student 1 is on time for the lecture, and the binary variable describing whether student 2 is on time. Intuitively, these events are related, since when you tell me that is on time, that increased my belief that also is on time, for the reason that the trains were not delayed. In other words,
However, let us now introduce yet a third binary variable denoting whether the train was on time. How does knowing
change the above situation? In this case, observing that is or is not on time if I already know will not tell me any more information than I could already derive from . The reason is that the only thing it could tell me is something about whether or not the train is delayed, and since we already knew that the train was not delayed we obtain no information.
In this case, we have
In the above example, we had a situation where and were dependent, but and were independent when informed about some . That is., even if there is an ‘information flow’ between and , there can be no ‘information flow’ from to if we know that . The other case is also possible, the situation where and are independent variables, but they become dependent given some variable . To illustrate this, let us consider a new - quite inspired on our breakfast - scenario.
Example of conditional dependence Suppose that is the random variable that describes whether or not the first strawberry I eat on a day is sweet. Moreover, let denote whether or not the first coffee I prepare is an espresso. Needless to say, the outcomes of these events do not affect each other, and hence and are independent. The reason we picked such a weird example is that even such two (very) independent events can be made conditionally dependent if we consider a crazy enough third variable...
Summary In this theory page, you have learned about the independence of random variables.