Start learning 50% faster. Sign in now
Explanation: Replacing missing data with statistical measures like mean (for continuous data), median (for skewed distributions), or mode (for categorical data) is a robust imputation technique. This approach minimizes the loss of data while maintaining the dataset's integrity. It is particularly effective when missing values are random (MCAR) and do not introduce significant bias. However, this method may not work well for datasets with a high proportion of missing values or when patterns in the missing data need to be preserved. Advanced imputation methods like k-Nearest Neighbors (KNN) or predictive models can be used in such cases. Option A: Deleting rows with missing values can result in significant data loss, reducing the dataset's representativeness. Option C: Ignoring missing data leads to inaccuracies and potential errors in analysis. Option D: Filling with arbitrary constants like zero can distort the dataset, introducing bias. Option E: Duplicating rows compromises the dataset's integrity and can lead to overfitting in predictive models.
Who was appointed as the festival ambassador for Arunachal Rang Mahotsav 2024?
On which date is Constitution Day commemorated?
What is the repo rate as of February 2025, according to the Reserve Bank of India (RBI)?
Consider the following statements about ‘National Policy of Rare Diseases (NPRD)’:
1. Those who are suffering from rare diseases listed under...
India’s foreign reserves do NOT consist of which of the following?
Who won the Rashtraparv Website & Mobile App Development Award for enhancing citizen-centric governance?
What was the GDP growth projection for FY25 by ADB, as updated in December 2024?
In 2024, which new material was introduced in the manufacturing of solar panels to significantly increase their efficiency beyond the traditional silico...