Add big misses from all years #323
reports/performance/_model.qmd
Outdated
  `Est. FMV` = pred_pin_final_fmv_round
) %>%
  mutate(
    Difference = (`Est. FMV` - `Sale 1 Price`),
Do we want the difference to use the max of any sale price?
@Damonamajor I simplified this a little bit in 3ba3702 to show just the prediction error/diff from the training data, since we already do a full prediction on the training data earlier in this doc. Avoids the complexity of the assessment/training data join.
Erroneously priced sales have led to issues such as #311, where we tried to identify incorrect data. This PR creates a new table that looks at all sales and identifies the largest positive and negative differences between estimated FMV and sale price, organized by township.
In essence, if we find a sale where the difference is $20,000,000, even if the sale was in 2018, that PIN is one we might want to investigate: either the sale should be treated as an outlier or something was input incorrectly.
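For anyone skimming, here is a rough sketch of the idea behind the table, not the actual code in `_model.qmd`. The data frame `training_pred`, its column names, the values, and `n_per_side` are all made up for illustration; the only thing taken from the diff is that the difference is estimated FMV minus sale price.

```r
library(dplyr)

# Hypothetical training data with predictions attached.
# All names and values here are illustrative assumptions.
training_pred <- tibble(
  meta_pin   = c("01", "02", "03", "04", "05", "06"),
  township   = c("A", "A", "A", "B", "B", "B"),
  sale_price = c(200000, 350000, 150000, 500000, 20500000, 300000),
  pred_fmv   = c(210000, 340000, 400000, 480000, 500000, 310000)
)

# How many rows to keep at each extreme per township (assumed parameter)
n_per_side <- 1

big_misses <- training_pred %>%
  # Same sign convention as the diff: estimate minus sale price
  mutate(difference = pred_fmv - sale_price) %>%
  group_by(township) %>%
  # Keep the most negative and most positive differences in each township
  filter(
    min_rank(difference) <= n_per_side |
      min_rank(desc(difference)) <= n_per_side
  ) %>%
  ungroup() %>%
  arrange(township, difference)

print(big_misses)
```

With this toy data, the sale of $20,500,000 against a $500,000 estimate surfaces at the top of township B, which is the kind of row we would want to check for a data entry error or an outlier flag.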