Question
A data analyst is assessing a dataset with inconsistent categorical entries, such as "USA," "U.S.A," "United States," and "US" for the country field. Which of the following is the best approach for handling this inconsistency?
Solution
Standardizing categorical entries to a single representation ensures consistency by consolidating multiple formats of the same entity into one standardized label. For example, consolidating "USA," "U.S.A," "United States," and "US" into one uniform label, like "United States," ensures that all data entries are interpreted consistently. This process is essential in data cleaning, as inconsistencies in categorical data can lead to inaccurate analysis, skewed results, and duplications in reporting. A uniform categorical format enables reliable grouping, sorting, and filtering for analysis.
The other options are incorrect because:
• Option 1 (Filtering duplicates) removes identical rows but doesn’t address inconsistency in a single field.
• Option 2 (Using normalization) only applies to numeric scaling, not categorical consistency.
• Option 3 (Applying data transformation) would encode inconsistencies rather than correct them.
• Option 5 (Converting to uppercase) helps with case sensitivity but does not fully standardize variations.
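The standardization described in the solution can be sketched in a few lines of Python. This is a minimal illustration, not a prescribed method: the lookup table `CANONICAL_COUNTRY` and the helpers `normalize_key` and `standardize_country` are hypothetical names introduced here, and the choice of "United States" as the canonical label simply follows the example in the solution.

```python
# Hypothetical lookup table (an assumption for this sketch):
# normalized variant -> canonical label.
CANONICAL_COUNTRY = {
    "usa": "United States",
    "us": "United States",
    "united states": "United States",
}


def normalize_key(value: str) -> str:
    """Trim, lowercase, and drop periods so "U.S.A" and "USA" share one key."""
    return value.strip().lower().replace(".", "")


def standardize_country(value: str) -> str:
    """Return the canonical label, or the trimmed input if the variant is unknown."""
    return CANONICAL_COUNTRY.get(normalize_key(value), value.strip())


raw = ["USA", "U.S.A", "United States", "US", "Canada"]
cleaned = [standardize_country(v) for v in raw]
print(cleaned)
```

Unknown values such as "Canada" pass through unchanged rather than being guessed at, so the mapping only consolidates variants that have been explicitly reviewed. Case folding alone (as in Option 5) would still leave "USA", "U.S.A", "US", and "UNITED STATES" as four distinct categories, which is why an explicit mapping is used.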