diff --git a/notebooks/en/issues_in_text_dataset.ipynb b/notebooks/en/issues_in_text_dataset.ipynb index e568de2..9e41d5e 100644 --- a/notebooks/en/issues_in_text_dataset.ipynb +++ b/notebooks/en/issues_in_text_dataset.ipynb @@ -3254,7 +3254,7 @@ "id": "UG8xfTa5cw4S" }, "source": [ - "We see that these two sets of request are indeed very similar to one another! Including near duplicates in a dataset may have unintended effects on models, and be wary about splitting them across training/test sets. Learn more about handling near duplicates in a dataset from [the FAQ](../faq.html#How-to-handle-near-duplicate-data-identified-by-cleanlab?)." + "We see that these two sets of request are indeed very similar to one another! Including near duplicates in a dataset may have unintended effects on models, and be wary about splitting them across training/test sets. Learn more about handling near duplicates in a dataset from [the FAQ](https://docs.cleanlab.ai/stable/tutorials/faq.html#How-to-handle-near-duplicate-data-identified-by-Datalab?)." ] }, {