G Narenthiran FEBNS FRCS(SN)
18 March 2024
g_narenthiran@hotmail.com
The ‘P’ in the P-value stands for probability. We will come to this shortly.
Generally, when we analyse data, we analyse the data for a sample and use statistics to infer the results for the population.
The type of statistics we commonly use in medicine is ‘frequentist statistics’. In frequentist statistics, we study the sample and infer for the population.
Example 1:
We measured the weight of our male and female patients. There were 20 male patients and 20 female patients. The mean weight for the male patients was 72 kg. The mean weight for the female patients was 55 kg. Here, we note a difference of 17kg between the mean weights of our male and female patients. This difference of 17kg is indeed an actual difference in the mean weights of our male and female patients.
Can we make an inference for all the male and female patients in the world based on our findings, i.e., from the findings from a sample? How can we generalise our findings for the population?
When we now use the t-test (unpaired student t-test) to analyse the difference in mean weight between our male and female patients, we are attempting to infer whether where this difference of 17kg would be actual if we measured the weights of all male and female patients in the universe i.e. we are trying to use the data we have gathered (from our sample) to infer for the population.
Let’s assume that when we undertook the t-test, the P-value was 0.04. What does this mean?
In statistics, we define the null hypothesis before we undertake an experiment.
The ‘null hypothesis’ (Ho) for our study is that:
“There is no difference in the mean weight of male and female patients.”
In statistics, we assume that the members of our study group (male and female patients in our ward) are samples from the whole population (all male and female patients in the universe) and the patients in our sample (those in our ward) were randomly chosen from the entire population of male and female patients.
The t-test assesses the probability that the difference that we noted between our groups, i.e., a difference of 17kg in the mean weight between our male and female patients, is by chance because of our random sampling from the population.
The p-value of 0.04 indicates that if we assume that the null hypothesis is true, i.e., there is no difference in the mean weight of all the male and female patients, then the 17kg difference in the mean weight that we observed might have been through chance, because of the random sampling, is 4%.
In medical statistics, if the probability of the difference between two groups occurring is equal or less than 5% (0.05), then we state the difference is statistically significant. However, we can only be sure that there is an actual difference if we study the whole population of male and female patients, which is impractical.
Conversely, if the p-value is greater than 0.05, then we infer that our null hypothesis is true and there is no statistical difference between the mean weights of our groups; the difference we noted (17kg) was through chance because of the random sampling of the population.
