Show
Understanding Sampling Methods (Visuals and Code)Image from AuthorSampling is the process of selecting a subset(a predetermined number of observations) from a larger population. It’s a pretty common technique wherein, we run experiments and draw conclusions about the population, without the need of having to study the entire population. In this blog, we will go through two types of sampling methods:
Random SamplingUnder Random sampling, every element of the population has an equal probability of getting selected. Below fig. shows the pictorial view of the same — All the points collectively represent the entire population wherein every point has an equal chance of getting selected. You can implement it using python as shown below — import randompopulation = 100 Stratified SamplingUnder stratified sampling, we group the entire population into subpopulations by some common property. For example — Class labels in a typical ML classification task. We then randomly sample from those groups individually, such that the groups are still maintained in the same ratio as they were in the entire population. Below fig. shows a pictorial view of the same — We have two groups with a count ratio of x and 4x based on the colour, we randomly sample from yellow and green sets separately and represent the final set in the same ratio of these groups. Stratified SamplingYou can implement it very easily using python sklearn lib. as shown below — from sklearn.model_selection import train_test_split You can also implement it without the lib., read this. Cluster SamplingIn Cluster sampling, we divide the entire population into subgroups, wherein, each of those subgroups has similar characteristics to that of the population when considered in totality. Also, instead of sampling individuals, we randomly select the entire subgroups. As can be seen in the below fig. that we had 4 clusters with similar properties (size and shape), we randomly select two clusters and treat them as samples. Cluster SamplingReal-Life example — Class of 120 students divided into groups of 12 for a common class project. Clustering parameters like (Designation, Class, Topic) are all similar over here as well. You can implement it using python as shown below — import numpy as npclusters=5 Systematic SamplingSystematic sampling is about sampling items from the population at regular predefined intervals(basically fixed and periodic intervals). For example — Every 5th element, 21st element and so on. This sampling method tends to be more effective than the vanilla random sampling method in general. Below fig. shows a pictorial view of the same — We sample every 9th and 7th element in order and then repeat this pattern. Systematic SamplingYou can implement it using python as shown below — population = 100 Multistage samplingUnder Multistage sampling, we stack multiple sampling methods one after the other. For example, at the first stage, cluster sampling can be used to choose clusters from the population and then we can perform random sampling to choose elements from each cluster to form the final set. Below fig. shows a pictorial view of the same — You can implement it using python as shown below — import numpy as npclusters=5
Convenience SamplingUnder convenience sampling, the researcher includes only those individuals who are most accessible and available to participate in the study. Below fig. shows the pictorial view of the same — Blue dot is the researcher and orange dots are the most accessible set of people in orange’s vicinity. Convenience SamplingVoluntary SamplingUnder Voluntary sampling, interested people usually take part by themselves by filling in some sort of survey forms. A good example of this is the youtube survey about “Have you seen any of these ads”, which has been recently shown a lot. Here, the researcher who is conducting the survey has no right to choose anyone. Below fig. shows the pictorial view of the same — Blue dot is the researcher, orange one’s are those who voluntarily agreed to take part in the study. Snowball SamplingUnder Snowball sampling, the final set is chosen via other participants, i.e. The researcher asks other known contacts to find people who would like to participate in the study. Below fig. shows the pictorial view of the same — Blue dot is the researcher, orange ones are known contacts(of the researcher), and yellow ones (orange’s contacts) are other people that got ready to participate in the study. Snowball Sampling
I hope you enjoyed reading this. If you’d like to support me as a writer, consider signing up to become a Medium member. It’s just $5 a month and you get unlimited access to Medium So, that’s it for this blog. Thank you for your time! In which type of sampling all the items have equal chance to be selected?1. Simple random sampling. In a simple random sample, every member of the population has an equal chance of being selected.
Which sampling method gives equal chance to the population?Simple random sampling. In simple random sampling (SRS), each sampling unit of a population has an equal chance of being included in the sample. Consequently, each possible sample also has an equal chance of being selected. To select a simple random sample, you need to list all of the units in the survey population.
In which type of sampling does each member of a population have an equal chance of being selected and each selection is entirely independent of the next?Simple random sampling is the basic sampling technique where we select a group of subjects (a sample) for study from a larger group (a population). Each individual is chosen entirely by chance and each member of the population has an equal chance of being included in the sample.
Which sampling is used to select every item of a list?(A) Random sampling
Random sampling method refers to a method in which every item in the universe has an equal chance of being selected. It is also known as probability sampling or representative sampling.
|