Minor typo in "Bayesian Additive Regression Trees: Introduction" notebook #584 (#585)

mellorjc · jmellor · web-flow · commit e6eecdbbe812 · 2023-10-18T13:47:59.000+02:00
* Fix typo in BART_introduction.myst.md

Change "season" to "reason"

* Fix typo in BART_introduction.ipynb

Changes "season" to "reason" in BART_introduction.ipynb

---------

Co-authored-by: jmellor &lt;jmellor2@exseed.ed.ac.uk&gt;
diff --git a/examples/case_studies/BART_introduction.ipynb b/examples/case_studies/BART_introduction.ipynb
@@ -1037,7 +1037,7 @@
    "id": "2b680d91",
    "metadata": {},
    "source": [
-    "This plot helps us understand the season behind the bad performance on the test set: Recall that in the variable importance ranking from the initial model we saw that `hour` was the most important predictor. On the other hand, our training data just sees `hour` values until $19$ (since is our train-test threshold). As BART learns how to partition the (training) data, it can not differentiate between `hour` values between $20$ and $22$ for example. It just cares that both values are greater that $19$. This is very important to understand when using BART! This explains why one should not use BART for time series forecasting if there is a trend component. In this case it is better to detrend the data first, model the remainder with BART and model the trend with a different model."
+    "This plot helps us understand the reason behind the bad performance on the test set: Recall that in the variable importance ranking from the initial model we saw that `hour` was the most important predictor. On the other hand, our training data just sees `hour` values until $19$ (since is our train-test threshold). As BART learns how to partition the (training) data, it can not differentiate between `hour` values between $20$ and $22$ for example. It just cares that both values are greater that $19$. This is very important to understand when using BART! This explains why one should not use BART for time series forecasting if there is a trend component. In this case it is better to detrend the data first, model the remainder with BART and model the trend with a different model."
    ]
   },
   {
diff --git a/examples/case_studies/BART_introduction.myst.md b/examples/case_studies/BART_introduction.myst.md
@@ -389,7 +389,7 @@ ax.set(
 );
 ```
 
-This plot helps us understand the season behind the bad performance on the test set: Recall that in the variable importance ranking from the initial model we saw that `hour` was the most important predictor. On the other hand, our training data just sees `hour` values until $19$ (since is our train-test threshold). As BART learns how to partition the (training) data, it can not differentiate between `hour` values between $20$ and $22$ for example. It just cares that both values are greater that $19$. This is very important to understand when using BART! This explains why one should not use BART for time series forecasting if there is a trend component. In this case it is better to detrend the data first, model the remainder with BART and model the trend with a different model.
+This plot helps us understand the reason behind the bad performance on the test set: Recall that in the variable importance ranking from the initial model we saw that `hour` was the most important predictor. On the other hand, our training data just sees `hour` values until $19$ (since is our train-test threshold). As BART learns how to partition the (training) data, it can not differentiate between `hour` values between $20$ and $22$ for example. It just cares that both values are greater that $19$. This is very important to understand when using BART! This explains why one should not use BART for time series forecasting if there is a trend component. In this case it is better to detrend the data first, model the remainder with BART and model the trend with a different model.
 
 +++
 

Original file line number	Diff line number	Diff line change
`@@ -1037,7 +1037,7 @@`
`1037`	`1037`	`"id": "2b680d91",`
`1038`	`1038`	`"metadata": {},`
`1039`	`1039`	`"source": [`
`1040`		- "This plot helps us understand the season behind the bad performance on the test set: Recall that in the variable importance ranking from the initial model we saw that `hour` was the most important predictor. On the other hand, our training data just sees `hour` values until $19$ (since is our train-test threshold). As BART learns how to partition the (training) data, it can not differentiate between `hour` values between $20$ and $22$ for example. It just cares that both values are greater that $19$. This is very important to understand when using BART! This explains why one should not use BART for time series forecasting if there is a trend component. In this case it is better to detrend the data first, model the remainder with BART and model the trend with a different model."
	`1040`	+ "This plot helps us understand the reason behind the bad performance on the test set: Recall that in the variable importance ranking from the initial model we saw that `hour` was the most important predictor. On the other hand, our training data just sees `hour` values until $19$ (since is our train-test threshold). As BART learns how to partition the (training) data, it can not differentiate between `hour` values between $20$ and $22$ for example. It just cares that both values are greater that $19$. This is very important to understand when using BART! This explains why one should not use BART for time series forecasting if there is a trend component. In this case it is better to detrend the data first, model the remainder with BART and model the trend with a different model."
`1041`	`1041`	`]`
`1042`	`1042`	`},`
`1043`	`1043`	`{`