Question
Which data cleaning technique is most appropriate for
handling missing data when missing values are randomly distributed across a dataset?Solution
When missing data points are randomly distributed, imputing values using the mean (for continuous data) or median (for skewed distributions) can be an effective technique. This approach maintains the datasetās overall structure and helps reduce potential bias introduced by missing values. By substituting missing values with central tendencies, analysts can preserve statistical relationships without significantly distorting the data, ensuring a more accurate analysis. Option A is incorrect as removing rows may lead to a significant data loss, especially if many rows contain missing values. Option C is incorrect because dropping columns with missing values reduces feature dimensions, potentially discarding useful information. Option D is incorrect as placeholder values can introduce bias or mislead analysis, especially if the placeholder value skews the distribution. Option E is incorrect because ignoring missing values leaves gaps, making it difficult to perform accurate analysis.
Find the sum of all two-digit numbers that are exactly divisible by 6.
What must be added to (7/20) to make it (3/4)?
What is the remainder when 7³ⵠis divided by 5?
- The total of two numbers is 130. If their LCM is 432 and HCF is 12, determine the smaller number.
There are three natural numbers which are pairwise co-prime. The product of the first two numbers is 105 and the product of the last two numbers is 255....
Dev has joined Snapchat and has 20 friends and each of these friends has 40 friends. Later, it is found that at least two of his friends know each other...
Find remainder when 7103 is divided by 240.
Two numbers P and Q are in the respective ratio 7: 9 and their sum is 80. Then, P is equal to:
- If the product of three consecutive natural numbers is 93024, then find the sum of the three given natural numbers.
When a number is increased by 60% then the number obtained is 56 less than thrice the original number. Find the original number.