Phase 2 project hellen #136

Closed
36 commits
02f5aa4
Importing modules
Sandrakiptumm Apr 27, 2024
9939288
Update student.ipynb
Cdasilver29 Apr 27, 2024
dc953c2
Business-understanding
Cdasilver29 Apr 27, 2024
94c04b3
Update student.ipynb
Cdasilver29 Apr 27, 2024
a0708a0
Data-understanding
Cdasilver29 Apr 27, 2024
27b7de9
Update student.ipynb
Cdasilver29 Apr 27, 2024
5dca9f8
Update student.ipynb
Cdasilver29 Apr 27, 2024
39a544a
Update student.ipynb
Cdasilver29 Apr 27, 2024
516f422
Update student.ipynb
Cdasilver29 Apr 27, 2024
602cb32
import module and read data
Apr 28, 2024
a5f1a60
deleting importations in main
Sandrakiptumm Apr 28, 2024
02bc896
Merge pull request #1 from Sandrakiptumm/feature/data-preparation
Sandrakiptumm Apr 28, 2024
77c916b
Update student.ipynb
Sandrakiptumm Apr 28, 2024
8ca69ac
Revert "added additional information"
Sandrakiptumm Apr 28, 2024
9a13fdc
Update student.ipynb
Sandrakiptumm Apr 28, 2024
32ee66d
Update student.ipynb
Sandrakiptumm Apr 28, 2024
8b8b1c9
check columns and missing values
Apr 28, 2024
5cb2038
check data info and statistical summary
Apr 28, 2024
7b42ab7
added additional information on the statistical summary
Apr 28, 2024
ca3a1af
Reverting student.ipynb
Sandrakiptumm Apr 28, 2024
b776935
Update student.ipynb
Cdasilver29 Apr 28, 2024
b000ae3
Merge branch 'main' into Business-understanding
Sandrakiptumm Apr 28, 2024
6049c33
Merge pull request #2 from Sandrakiptumm/Business-understanding
Sandrakiptumm Apr 28, 2024
eb06736
added additional information
Apr 28, 2024
a1aaf43
Merge branch 'main' into feature/data-preparation
Sandrakiptumm Apr 28, 2024
37a1792
Update README.md
samuelhellen May 1, 2024
792d9bd
title
samuelhellen May 1, 2024
4292b68
Update README.md
samuelhellen May 1, 2024
5fb6b8b
Update README.md
samuelhellen May 1, 2024
061d03d
updated readme
samuelhellen May 1, 2024
c4c6e8f
updated readme
samuelhellen May 1, 2024
094e891
Update README.md
samuelhellen May 1, 2024
a6916f8
images
samuelhellen May 1, 2024
adf6c9d
image
samuelhellen May 1, 2024
8f9d199
Pdf slides
samuelhellen May 1, 2024
76bff26
slides in pdf
samuelhellen May 1, 2024
Binary file not shown.
314 changes: 108 additions & 206 deletions README.md

Large diffs are not rendered by default.

Binary file added images/image-1.png
Binary file added images/image-10.png
Binary file added images/image-11.png
Binary file added images/image-12.png
Binary file added images/image-13.png
Binary file added images/image-14.png
Binary file added images/image-2.png
Binary file added images/image-3.png
Binary file added images/image-4.png
Binary file added images/image-5.png
Binary file added images/image-6.png
Binary file added images/image-7.png
Binary file added images/image-8.png
Binary file added images/image-9.png
Binary file added images/image.png
109 changes: 103 additions & 6 deletions student.ipynb
@@ -7,13 +7,109 @@
"## Final Project Submission\n",
"\n",
"Please fill out:\n",
"* Student name: \n",
"* Student pace: self paced / part time / full time\n",
"* Student name: Calvine Dasilver\n",
"* Student pace: full time\n",
"* Scheduled project review date/time: \n",
"* Instructor name: \n",
"* Instructor name: Nikita\n",
"* Blog post URL:\n"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
" # Demystifying House Sales Analysis with Regression Modeling in a Northwestern County\n",
"\n",
" ## Project Overview"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
" ### <li>**Business Understanding**\n",
"\n",
"The real estate market plays a crucial role in the economic health and stability of a region. Understanding the factors that influence house prices is essential for both buyers and sellers to navigate the market effectively. This project focuses on a specific northwestern county in the United States, aiming to shed light on the key determinants of property valuation in this area.\n",
" ##### Problem Statements:\n",
"What are the most significant factors influencing house prices in this northwestern county?How can we quantify the relationship between these factors and property value?Can we develop a reliable model to predict house prices based on relevant characteristics?\n",
" ##### Challenges:\n",
"* Real estate data can be complex and multifaceted, encompassing various property features and local market trends.\n",
"* Accurately identifying and quantifying the relative impact of each factor on house prices can be challenging.\n",
"* External factors like economic conditions and interest rates might also influence prices, requiring careful consideration.\n",
"\n",
"##### Proposed Solutions:\n",
"We propose utilizing multiple linear regression, a powerful machine learning technique. This method allows us to analyze a large dataset of house sales and identify the statistical relationships between various property features (e.g., square footage, number of bedrooms, location) and the corresponding sale prices.\n",
" ##### Objectives:\n",
"1. Develop a robust multiple linear regression model that accurately predicts house prices in the chosen northwestern county.\n",
"2. Identify the most significant factors influencing property value within this specific market.\n",
"3. Provide valuable insights into the housing market dynamics of the region, benefiting potential buyers, real estate agents, and other stakeholders.\n",
" \n",
" \n",
"**Research questions that would help to achieve the objectives**:\n",
"\n",
"1. How does the number of bedrooms, bathrooms, grade and square footage of a house correlate with its sale price in King County?\n",
"2. How much can a homeowner expect the value of their home to increase after a specific renovation project?\n",
"3. Which renovation projects have the most significant impact on a home's market value in the northwestern county?\n",
"4. Are there specific combinations of renovation projects that provide an interdependent effect on a home's market value?"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
" ### <li> **Data Understanding**\n",
"\n",
"Our analysis leverages the King County House Sales dataset - a rich resource containing over 21,500 records and 20 distinct features(columns). Spanning house sales from May 2014 to May 2015, this dataset provides a comprehensive snapshot of the King County housing market during that period.\n",
"\n",
"**The King County House Sales dataset contains the following columns;**\n",
"\n",
"id - unique identified for a house\n",
"\n",
"date - Date house was sold \n",
"\n",
"Price - Sale price (prediction target)\n",
"\n",
"bedrooms - Number of bedrooms,\n",
"\n",
"bathrooms - Number of bathrooms,\n",
"\n",
"sqft_living - Square footage of living space in the home,\n",
"\n",
"sqft_lot - Square footage of the lot,\n",
"\n",
"floors - Number of floors (levels) in house,\n",
"\n",
"view - Quality of view from house,\n",
"\n",
"condition - How good the overall condition of the house is. Related to maintenance of house,\n",
"\n",
"grade - Overall grade of the house. Related to the construction and design of the house,\n",
"\n",
"sqft_above - Square footage of house apart from basement,\n",
"\n",
"sqft_basement - Square footage of the basement,\n",
"\n",
"yr_built - Year when house was built,\n",
"\n",
"yr_renovated - Year when house was renovated,\n",
"\n",
"zipcode - ZIP Code used by the United States Postal Service,\n",
"\n",
"sqft_living15 - The square footage of interior housing living space for the nearest 15 neighbors,\n",
"\n",
"sqft_lot15 - The square footage of the land lots of the nearest 15 neighbors, and\n",
"\n",
"sell_yr - Date house was sold.\n",
"\n",
"\n",
"We need to be aware of certain constraints within the data, as these might influence our analysis and interpretation of the results. From the sources;\n",
"\n",
"1. The data may contain anomalies or inconsistencies that require careful examination during analysis. For instance, a record lists a house with 33 bedrooms, which appears to be an outlier\n",
"\n",
"2. It's important to consider the time frame of the data (May 2014 - May 2015) as it may not fully capture the current market dynamics in King County.\n",
"3. It's important to acknowledge the scope of the data. While it provides details on house features, it may not capture external factors such as interest rates or the overall economic climate, which can also play a role in determining property values."
]
},
{
"cell_type": "code",
"execution_count": null,
@@ -26,21 +122,22 @@
],
"metadata": {
"kernelspec": {
"display_name": "Python 3",
"display_name": "Python (learn-env)",
"language": "python",
"name": "python3"
"name": "learn-env"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"feature/data-preparation": "main",
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.6.4"
"version": "3.8.5"
}
},
"nbformat": 4,
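The Proposed Solutions cell in this diff names multiple linear regression as the modeling approach. As a rough illustration of what fitting such a model involves, the sketch below solves the ordinary least squares normal equations in plain Python. It is not the notebook's implementation: the toy rows, the generating formula (price = 50000 + 200 * sqft_living + 10000 * bedrooms), and the helper names `solve_linear_system`, `fit_ols`, and `predict` are all assumptions made for this example.

```python
def solve_linear_system(a, b):
    """Solve a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Augment the matrix so row operations update both sides together.
    aug = [row[:] + [b_i] for row, b_i in zip(a, b)]
    for col in range(n):
        # Pick the row with the largest pivot to limit rounding error.
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("features are collinear; system is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    # Back-substitution on the upper-triangular system.
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        known = sum(aug[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (aug[i][n] - known) / aug[i][i]
    return x


def fit_ols(features, targets):
    """Return [intercept, coef_1, ..., coef_k] minimizing squared error."""
    # Prepend a constant 1.0 so the first coefficient is the intercept.
    design = [[1.0] + [float(v) for v in row] for row in features]
    k = len(design[0])
    xtx = [[sum(r[i] * r[j] for r in design) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * y for r, y in zip(design, targets)) for i in range(k)]
    return solve_linear_system(xtx, xty)


def predict(coefs, row):
    """Apply fitted coefficients to one feature row."""
    return coefs[0] + sum(c * v for c, v in zip(coefs[1:], row))


# Hypothetical rows of (sqft_living, bedrooms); not King County data.
features = [(1000, 2), (1500, 3), (2000, 3), (2500, 4), (3000, 5), (1800, 2)]
# Prices generated exactly from the assumed formula, with no noise.
prices = [50000 + 200 * s + 10000 * b for s, b in features]

coefs = fit_ols(features, prices)
print("intercept, sqft_living, bedrooms:", [round(c, 2) for c in coefs])
print("predicted price:", round(predict(coefs, (2200, 3)), 2))
```

Because the toy prices are generated without noise, the fit recovers the generating coefficients up to floating-point rounding. On real sales data the coefficients would be estimates, and a library such as statsmodels or scikit-learn would normally be used instead of hand-written elimination.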