It might seem counterintuitive, like, how could a sample size be infinite?
But let us remember the classic application of the Poisson distribution — modeling the probability of a given number of events occurring in a fixed time interval.
To better understand it with an example, consider the probability of a given number of buses arriving at a station in an hour.
Now, to look at it from a binomial distribution perspective, we could divide this one hour into sixty Bernoulli trials. Each trial models the probability that a single bus will arrive in that particular minute.
The problem with this approach is that it simply restricts the number of buses that could arrive at any given minute to one. In the real world, it is not a completely unimaginable scenario that multiple buses could arrive back-to-back within a minute.
We could increase the granularity by opting for 3600 Bernoulli trials instead of 60. It would mean we moved away from looking at minute intervals to seconds, and our problem might have an acceptable solution now.
But consider the cases where more than one event could happen within a second, and it becomes clear that we would have to go even more granular, which causes the number of trails (n) to increase again.
We keep increasing this granularity to reach a subinterval where events could occur only once. Then, the n value would increase to such an extent that it approaches infinity.
Hence, we could apply a limit that the n tends to infinity on the probability distribution function (PDF) of Binomial distribution, and we would have the PDF of Poisson distribution. It proves that the Poisson is an extended case of Binomial when the number of trials is large (or large sample size).
Another key distinction of the Poisson distribution is that it assumes the events to be independent with a constant rate of occurrence.
Let us equate this constant rate (famously denoted by a ‘λ’) to the expected value of the random variable if it were a Binomial distribution.
The expected value of a random variable in Binomial distribution = number of trials (n) * probability of success in each trial (p).
Therefore, λ = n * p
Since λ is constant and n approaches infinity, the p becomes very low and approaches zero.
Hence, we could use the Poisson distribution to model the rare events with a low success rate, like people reading my articles and loving them!
Comments
Post a Comment