Skip to content

Baseline Census dataset final version updated #172

@Archi11Gupta

Description

@Archi11Gupta

Baseline Census Cleaning Overview
Initial Baseline Census Cleaning
The primary do file used to create a cleaned version of the baseline census dataset is located at:

Path: GitHub\i-h2o-india\Code\1_profile_ILC\1_1_A_Census_cleaning.do
This script generates the following datasets:

Final Data Path: ${DataFinal}1_1_Census_cleaned.dta
2nd Data Path: ${DataPre}1_1_Census_cleaned.dta
We had to store it in two different folders because during endline census there was a discussion regarding creating final cleaned versions of all the datasets but the dataset stored in pre paths were being used in all the do files for follow up so to avoid time waste and unnecessary errors in path recognition we let the pre version of the dataset as it is.

Issue with Date Formatting and Missing Submission Dates
Subsequently, issues were identified in ${DataPre}1_1_Census_cleaned.dta, particularly with date formatting and missing submission dates in the date variable. To address these issues, a revised version of the census cleaning script was created.

Revised Do File Path: GitHub\i-h2o-india\Code\1_profile_ILC\Z_Archive\Revised_Baseline_census_cleaning_code.do
This revised script produced multiple versions of ${DataPre}1_1_Census_cleaned.dta. To prevent redundancy, the dataset created by the original file (1_1_A_Census_cleaning.do) was temporarily commented out.This do file was majorly being used to calculate estimates of the endline census productivity

Reverting to the Original Version
However, differences were later found between the datasets created by the revised and original do files. After reviewing these discrepancies, we decided to continue using the dataset generated by the original file:

File Path: GitHub\i-h2o-india\Code\1_profile_ILC\1_1_A_Census_cleaning.do
This version aligns with all files in use, from the baseline census to follow-up surveys.

Final Decision
As a result, we have opted to comment out the entire revised do file:

Commented Out File: GitHub\i-h2o-india\Code\1_profile_ILC\Z_Archive\Revised_Baseline_census_cleaning_code.do

Metadata

Metadata

Assignees

No one assigned

    Labels

    documentationImprovements or additions to documentation

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions