Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

adding chi environmental dataset #35

Open
wants to merge 2 commits into
base: gh-pages
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
26 changes: 26 additions & 0 deletions _data/chienv.csv
Original file line number Diff line number Diff line change
@@ -0,0 +1,26 @@
Variable,Description
cid,Dataset identification code
geoid,Full FIPS code
tracte,6-digit tract FIPS code
treesN,Total number of trees (represented as unique polygons) by tract. Derived from polygon data digitized as trees from Lidar raw sonar data, originally processed by the Spatial Analysis Laboratory, made available through the Regional Trees Initiative and U.S. Forest Service, and finally cleaned, processed, and aggregated to census tract scale by the Health Regions & Policies Lab.
treeCrDn,Percentage of total census tract area covered by tree polygons (with each polygon representing the tree crown, or maximum tree crown width). Derived from polygon data digitized as trees from Lidar raw sonar data, originally processed by the Spatial Analysis Laboratory, made available through the Regional Trees Initiative and U.S. Forest Service, and finally cleaned, processed, and aggregated to census tract scale by the Health Regions & Policies Lab.
svi,Social Vulnerability Index via CDC using 2018 ACS 5-year data.
hardship,Economic Hardship Index, calculated using 2018 ACS 5-year data using standard approach using six components (unemployment, dependency, education, income, crowded housing, poverty) by Center for Spatial Data Science.
floodSusc,Urban flood susceptibility index, via FEMA.
heatIsl,Measure of land surface temperature to approximate the heat island effect. 1km grid raw data aggregated to census tract scale using average of grid values. Values close to 1 represent grid cells that have high summer temperatures relative to all other cells in the Chicago area; values closer to 0 represent lower values. Derived from mean Land Surface Temperature (LST) collected by NASA’s MODIS satellite (MOD11 dataset).
logTraf,Logged traffic volume, using street segment volume from IDOT.
pm25NN,Average PM 2.5 estimates during summer (Jun-Aug) from 2014-2018, calculated using a neural network model via Center for Spatial Data Science.
asthmaAv,Number of Emergency Dept. asthma visits (age 0-18) between 2013-2018, calculated at the zip code and shared via the Chicago Health Atlas.
asthmaAdj,Age adjusted rate of Emergency Dept. asthma visits (age 0-18) between 2013-2018, calculated at the zip code and shared via the Chicago Health Atlas.
totPopE,total population, ACS 2018 5-year estimate via OEPS
whiteP,Percentage of population with race identified as white alone
blackP,Percentage of population with race identified as Black or African American alone
amIndP,Percentage of population with race identified as Native American or Alaska Native alone
asianP,Percentage of population with race identified as Asian alone
hispP,Percentage of population with ethnicity identified as of Hispanic or Latinx origin
otherP,Percentage of population with race not mentioned in any of the options above (includes two race or more races)
childrnP,Percentage of population under the age of 18.
over65P,Percentage of population over the age of 65.
zip_code,intersecting Zip Code area
commarea_n,Community Area Number
community,Community Area Name
2 changes: 1 addition & 1 deletion _data/columbus-template.csv
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@ Variable,Description
AREA,neighborhood area (computed by ArcView)
PERIMETER,neighborhood perimeter (computed by ArcView)
COLUMBUS_,internal polygon ID (generated by ArcView)
COLUMBUS_I,internal polygon ID (geneated by ArcView)
COLUMBUS_I,internal polygon ID (generated by ArcView)
POLYID,"neighborhood ID, used in GeoDa User’s Guide and tutorials"
NEIG,"neighborhood ID, used in Spatial Econometrics examples"
HOVAL,"housing value (in $1,000)"
Expand Down
77 changes: 77 additions & 0 deletions _posts/2021-11-17-chicago-environment.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,77 @@
---
layout: dataset # don't change this
name: "chienv" # one-word reference name for dataset, all lowercase and underscores, e.g. "columbus"
title: "Chicago Environmental Data" # title of dataset, e.g "Columbus Crime 1980"
introduction: "Environmental data for census tracts in Chicago, Illinois." # one-sentence description of dataset, e.g. "Crime and neighborhood data for 49 neighborhoods in Columbus, OH, 1980."
description: "Standardized environmental data at the census tract scale including tree canopy characteristics, surface temperature, logged traffic volume, urban flood susceptibility, social vulnerability, hardship, modeled fine particulate matter estimates, and more in Chicago, IL around 2018 (data ranges from 2010-2018). See details of data methodology at https://chicago-env-explorer.netlify.app." # multiple-sentence description of dataset, e.g. "Crime, housing and income data for 49 neighborhoods in Columbus, OH, 1980. Textbook example."
download_link: "../data/chienv.zip" # URL of link to download dataset, e.g. "../data/columbus.zip"

observations: 801 # a number, e.g. 49; a boxed, comma-separated list if multiple datasets, e.g. [49, 100]
variables: 24 # a number, e.g. 20; a boxed, comma-separated list if multiple datasets, e.g. [20, 25]
start_date: 2018-01-01 # date in yyyy/mm/dd of start of data, e.g. 1980-01-01
end_date: 2018-01-01 # date in yyyy/mm/dd of end of data, e.g. 1980-01-01, same as above if single period
spatial_structure: "Polygon" # most often, "Point" or "Polygon"
spatial_unit: "census tract" # short description of the smallest spatial unit, e.g. "City neighborhood", "Census block", "House sale"

source: ["Healthy Regions & Policies Lab, 2021"] # boxed, comma-separated list of citation(s) of publication(s) from which this dataset originates, e.g. ["Anselin, Luc (1988). Spatial Econometrics. Boston, Kluwer Academic, Table 12.1, p. 189.", ""]
reference: ["Open Chicago Environment Explorer, https://chicago-env-explorer.netlify.app/"] # boxed, comma-separated list of citations(s) of publication(s) that reference this dataset

csv_name: "FIXME" # the name of the csv with the data dictionary (without the .csv extension), e.g. "columbus"; this csv is stored in the _data folder
prepared_by: "CSDS, Marynia Kolak" # the name of the person or organization who prepared the data, e.g. "CSDS", "Luc Anselin"
updated: 2021-11-18 # date in yyyy/mm/dd of last updated date, e.g. 2003-06-16
license: "Data provided “as is,” no warranties" # license/disclaimers for the data, e.g. "Data provided “as is,” no warranties."

used_in: ["City of Chicago's Tree Equity Tool, Open Chicago Environment Explorer, https://chicago-env-explorer.netlify.app/"] # boxed, comma-separated list of CSDS resources using this dataset, e.g. ["GeoDa Sample Data", "Introduction to Spatial Data Science labs"]
use_case: "FIXME" # short description of use case for dataset, e.g. "Spatial hedonic regression"
main-class: "environment" # Main tag for dataset, e.g. "crime". Will show up on website under this category
tags: ["trees", "air quality"] # Secondary tags for data, including the main tag, e.g. ["crime", "polygons", "<500", "smaller areas", "ESDA", "textbook"].
---

{% comment %}
Please do not edit any of the following code, which creates a templated dataset
documentation webpage from the above information block. You should change only
the parts that are labeled FIXME.
{% endcomment %}

<span class="download">[DOWNLOAD DATA]({{ page.download_link }})</span>

<p class="description">{{ page.description }}</p>

* Observations = <span class="observations">{{ page.observations }}</span>

* Variables = <span class="variables">{{ page.variables }}</span>

{% if page.start_date == page.end_date %}
* Date = <span class="start_date">{{ page.start_date }} </span>
{% elsif page.start_date != page.end_date %}
* Date Range = <span class="start_date">{{ page.start_date }}</span> to <span class="end_date">{{ page.end_date }}</span>
{% endif %}

{% if page.source %}
**Source**
<p class="source">{{ page.source }}</p>
{% endif %}

{% if page.reference %}
**Reference**
<p class="reference">{{ page.reference }}</p>
{% endif %}

<table class="datadict">
<thead>
<tr>
<th style="text-align: left">Variable</th>
<th style="text-align: left">Description</th>
</tr>
</thead>
<tbody>
{% for var in site.data[page.csv_name] %}
<tr>
<td class="var" style="text-align: left">{{ var.Variable }}</td>
<td class ="desc" style="text-align: left">{{ var.Description }}</td>
</tr>
{% endfor %}
</tbody>
</table>

Prepared by <span class="prepared">{{ page.prepared_by }}</span>. Last updated <span class="updated">{{ page.updated }}</span>. <span class="license">{{ page.license }}</span>
Binary file added data/chienv.zip
Binary file not shown.