-
Notifications
You must be signed in to change notification settings - Fork 0
Description
Baseline Census Cleaning Overview
Initial Baseline Census Cleaning
The primary do file used to create a cleaned version of the baseline census dataset is located at:
Path: GitHub\i-h2o-india\Code\1_profile_ILC\1_1_A_Census_cleaning.do
This script generates the following datasets:
Final Data Path: ${DataFinal}1_1_Census_cleaned.dta
2nd Data Path: ${DataPre}1_1_Census_cleaned.dta
We had to store it in two different folders because during endline census there was a discussion regarding creating final cleaned versions of all the datasets but the dataset stored in pre paths were being used in all the do files for follow up so to avoid time waste and unnecessary errors in path recognition we let the pre version of the dataset as it is.
Issue with Date Formatting and Missing Submission Dates
Subsequently, issues were identified in ${DataPre}1_1_Census_cleaned.dta, particularly with date formatting and missing submission dates in the date variable. To address these issues, a revised version of the census cleaning script was created.
Revised Do File Path: GitHub\i-h2o-india\Code\1_profile_ILC\Z_Archive\Revised_Baseline_census_cleaning_code.do
This revised script produced multiple versions of ${DataPre}1_1_Census_cleaned.dta. To prevent redundancy, the dataset created by the original file (1_1_A_Census_cleaning.do) was temporarily commented out.This do file was majorly being used to calculate estimates of the endline census productivity
Reverting to the Original Version
However, differences were later found between the datasets created by the revised and original do files. After reviewing these discrepancies, we decided to continue using the dataset generated by the original file:
File Path: GitHub\i-h2o-india\Code\1_profile_ILC\1_1_A_Census_cleaning.do
This version aligns with all files in use, from the baseline census to follow-up surveys.
Final Decision
As a result, we have opted to comment out the entire revised do file:
Commented Out File: GitHub\i-h2o-india\Code\1_profile_ILC\Z_Archive\Revised_Baseline_census_cleaning_code.do