Housing Pipeline Analysis

This project is a part of the Data Science Working Group at Code for San Francisco. Other DSWG projects can be found at the main GitHub repo.

Project Status: Active

Project Intro/Objective

The purpose of this project is to analyze the impact of zoning laws on the development of housing in San Francisco. We use datasets provided by the city of San Francisco to analyze the initiation, progress, and completion of entitlements for the construction of housing. We want to empower policymakers and citizens with the facts and data that help explain the potential impact of housing policy decisions.

Methods Used

Inferential Statistics
Data Visualization
Predictive Modeling

Technologies

Python
Pandas, jupyter

Project Description

We focus on utilizing our distinct skillset of data science and the SF Planning Department’s pipeline data to a) develop quantitative facts about the state of the pipeline of planned housing in SF, and b) use predictive methods to model potential impact of policies. The potential impact could take multiple forms, such as in units built, free market price, and affordable unit availability.

Guiding Principles

Accessible San Francisco -- We are invested in providing facts that allow other parties, some political, to make data-informed decisions that enable San Francisco to become more inclusive and therefore healthier.
Science is A-political -- While we do believe in an accessible San Francisco, we are not a policy advocate. Our role is to provide the most rigorous data analysis to define facts, and use sound statistical methods and scientific investigation to predict how those facts may shift given a policy change.
Correlation is not causation -- Hand-in-hand with being a-political is the responsibility to clearly articulate the findings and limitations of analysis. Most analysis and prediction will only be able to leverage correlative relationships, and will likely be unable to demonstrate causative relationships. Natural experiments that would give us this kind of increased conviction in causative factors are rare.
Equal Access to Facts -- Our work will be shared through blog format to be accessible to all stakeholders at the same time.
Open to Everyone -- We are an all-volunteer organization, and our team welcomes all people of all skill bases and backgrounds to join our team. There is always a way for you to contribute!

Needs of this project

The best place to get started is our list of Issues in Github. We have workstreams spread across these skillsets:

Data Scientists

Data modeling and analysis towards a better understanding of housing pipeline changes over time.

Python Programmers

Creating data cleanup tools and methods for parsing or scraping new datasets.

Econometrists, Economic Modeling

Model market incentives and assumptions, enabling prediction of impact based on changes.

Operations

Manage the volunteer network, communicate with stakeholders, and network with new people to learn new needs.

Visual Designer

Help create visualizations of facts and predictions with Data Scientists

Policy Researcher

Help us understand the policy strategies being discussed and work with the PMs on whether we should investigate them.

Marketing

Help us publicize our work and build a stronger community overall.

Getting Started

Clone this repo (for help see this tutorial).
Raw Data is being kept here within this repo.
Data processing/transformation scripts are being kept here
Install dependencies using Pipenv
Open the Jupyter notebook

Setting Up Your Development Environment

Step 1: Install Pipenv

On macOS:

brew install pipenv

Step 2: Install Dependencies

pipenv install

Step 3: Enter The Development Shell

pipenv run python -m ipykernel install --user --name=`pipenv run basename '$VIRTUAL_ENV'`

Step 4: Open the Notebook

Launch Jupyter and select datasci-housing-pipeline kernel in Jupyter.

jupyter notebook

Contributing DSWG Members

Team Leads (Contacts)

Shantanu Bala - @Shantanu Bala
Anders Engnell - @Anders Engnell

Other Members:

Name	Slack Handle
Andrew Roberts	@Andrew Roberts

Contact

If you haven't joined the SF Brigade Slack, you can do that here.
Our slack channel is #datasci-projectname
Feel free to contact team leads with any questions or if you are interested in contributing!

Name		Name	Last commit message	Last commit date
Latest commit History 119 Commits
analysis		analysis
archive		archive
data		data
gis		gis
.gitignore		.gitignore
Pipfile		Pipfile
Pipfile.lock		Pipfile.lock
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Housing Pipeline Analysis

Project Status: Active

Project Intro/Objective

Methods Used

Technologies

Project Description

Guiding Principles

Needs of this project

Data Scientists

Python Programmers

Econometrists, Economic Modeling

Operations

Visual Designer

Policy Researcher

Marketing

Getting Started

Setting Up Your Development Environment

Step 1: Install Pipenv

Step 2: Install Dependencies

Step 3: Enter The Development Shell

Step 4: Open the Notebook

Contributing DSWG Members

Team Leads (Contacts)

Other Members:

Contact

About

Releases

Packages

Languages

lamaalrajih/datasci-housing-pipeline

Folders and files

Latest commit

History

Repository files navigation

Housing Pipeline Analysis

Project Status: Active

Project Intro/Objective

Methods Used

Technologies

Project Description

Guiding Principles

Needs of this project

Data Scientists

Python Programmers

Econometrists, Economic Modeling

Operations

Visual Designer

Policy Researcher

Marketing

Getting Started

Setting Up Your Development Environment

Step 1: Install Pipenv

Step 2: Install Dependencies

Step 3: Enter The Development Shell

Step 4: Open the Notebook

Contributing DSWG Members

Team Leads (Contacts)

Other Members:

Contact

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages