Chapter-4: Scrutiny of Data

 


 
Scrutinizing data is a vital process in ensuring the accuracy and reliability of research findings. This chapter will discuss methods for checking internal consistency and detecting errors in data collection and recording, with a focus on practical applications and examples relevant to India.
 
 
 
 1. Checking Internal Consistency
 
Definition of Internal Consistency
 Internal consistency refers to the extent to which all parts of the data are consistent with each other. It ensures that the data collected is logical and free from contradictions.
 
Methods to Check Internal Consistency:
1. Reliability Testing: This involves statistical techniques to assess the consistency of the data.
    Cronbach's Alpha: A common measure used to assess the internal consistency of a set of items or questions.
    Example: A survey on customer satisfaction with services in Indian hotels may use Cronbach's Alpha to check the reliability of responses to various servicerelated questions.
 
2. Correlation Analysis: Checking the correlation between different variables to ensure they are logically related.
    Example: In a study on education and income levels in India, one would expect a positive correlation between higher education and higher income. A lack of correlation may indicate inconsistencies in the data.
 
3. CrossVerification: Comparing data from different sources to verify consistency.
    Example: Crosschecking agricultural production data reported by different government agencies in India.
 
Examples:
 In a health survey, if data shows that individuals who report regular exercise also have better health indicators, this consistency supports the validity of the data.
 In a financial audit, if the total assets do not equal the sum of liabilities and owner’s equity, this inconsistency signals potential errors.
 
 
 
 2. Detection of Errors in Data Collection
 
Common Errors in Data Collection:
1. Sampling Errors: These occur when the sample is not representative of the population.
    Example: Conducting a survey on urban internet usage in India but only sampling individuals from metropolitan cities, thereby excluding smaller cities and towns.
 
2. NonSampling Errors: These include all other errors not related to the sampling process.
    Types:
      Measurement Errors: Incorrectly measuring or recording data.
      Example: Misreporting the income of households in a survey due to misunderstanding the question.
      Interviewer Bias: When the interviewer’s actions influence responses.
      Example: An interviewer leading respondents to certain answers in a political opinion survey in India.
      Response Bias: When respondents provide inaccurate answers.
      Example: Individuals underreporting their alcohol consumption in a health survey due to social desirability.
 
Methods to Detect Errors in Data Collection:
1. Pilot Testing: Conducting a pilot survey to identify potential issues in data collection methods.
    Example: Before a national health survey, a pilot test is conducted in a small region of India to identify and rectify any data collection issues.
 
2. Training and Supervision: Ensuring data collectors are welltrained and supervised.
    Example: Training enumerators for the Indian Census on how to accurately record data and handle respondent queries.
 
3. Validation Checks: Implementing validation checks during data entry to catch errors.
    Example: Using software that flags unrealistic entries, such as a respondent’s age being recorded as 150 years.
 
 
 
 3. Detection of Errors in Data Recording
 
Common Errors in Data Recording:
1. Transcription Errors: Mistakes made while transcribing data from one source to another.
    Example: Recording a household’s income as ₹50,000 instead of ₹500,000.
 
2. Data Entry Errors: Errors made during the manual entry of data into a database.
    Example: Typing errors or incorrect coding of responses in a survey database.
 
3. Logical Errors: Entries that do not make sense logically.
    Example: A survey response indicating that a person has completed a PhD but is only 15 years old.
 
Methods to Detect Errors in Data Recording:
1. Double Entry System: Entering the same data twice independently and comparing the entries.
    Example: Entering survey responses into a database by two different data entry operators and then comparing the entries for discrepancies.
 
2. Automated Error Checking: Using software to automatically check for and flag errors.
    Example: Data entry software that flags entries that fall outside expected ranges, such as an income value that is too high or too low.
 
3. Regular Audits: Conducting regular audits of the data to identify and correct errors.
    Example: Periodically reviewing financial records of a company to ensure all transactions are recorded accurately.
 
Examples:
 In a demographic survey, if the total number of males and females does not add up to the total population, this discrepancy signals a recording error.
 In an academic study, if the recorded test scores exceed the maximum possible score, this error must be corrected.
 
 
 
 Illustrations with Examples
 
1. Internal Consistency Example:
    A survey measuring job satisfaction among employees in an Indian IT company might include multiple questions on various aspects of job satisfaction. If the responses to these questions are consistent, it indicates good internal consistency.
 
2. Error Detection in Data Collection Example:
    During a national nutrition survey in India, discrepancies are found in the reported dietary intake due to misunderstanding of portion sizes. A pilot survey helps in refining the questions for better accuracy.
 
3. Error Detection in Data Recording Example:
    In a financial survey, if an individual's monthly expenditure is recorded as higher than their monthly income, it indicates a potential recording error that needs to be investigated.
 
 
 
 References
 
1. "Research Methodology: Methods and Techniques" by C.R. Kothari.
2. "Survey Methods and Practices" by Statistics Canada.
3. Government of India Census Reports.
4. "Data Analysis with SPSS" by Stephen Sweet and Karen GraceMartin.
5. "Statistics for Business and Economics" by Paul Newbold, William L. Carlson, and Betty Thorne.
6. Journal articles and publications from the Indian Council of Social Science Research (ICSSR).

Comments

Popular posts from this blog

Chapter 3: Special Areas of Audit in India

Chapter 1: Introduction to Income Tax in India

NBU CBCS SEC (H) : E-Commerce Revised Syllabus