The data set I am using is from the UCI Machine Learning Repository. It is well known in machine learning studies and has been available for a long time. You can easily find many research papers about data sets from this repository.
Here is a more detailed analysis:
https://towardsdatascience.com/model-selection-yacht-hydrodynamics-data-set-ec0f8591e8e8?source=friends_link&sk=ca2e1cab39f257df267b64e067857686