-
Notifications
You must be signed in to change notification settings - Fork 10
Remove dependencies: StatsBase and Distributions #27
Comments
Well we need |
Revisiting this again. We were able to drop Distributions 🎉 thanks to
|
for completeness, this is what we import and reexport from StatsBase import StatsBase: nobs, fit, fit!, predict, params, params! Well the unfortunate thing at the state that we are now (disregarding potential future reasons) is that I guess it would be possible to say "extend I do see your point though. |
We could drop |
I am reopenning this for the modern world. I just want to overload |
It's difficult because StatsBase has claimed some key functions like |
I agree, Infact, the argument could be made that it is better practice to only overload things when importing them directly from their original namespace, |
That's fair. I'm on board with removing StatsBase here |
What's the difference between StatsBase, Distributions, StatsFuns and StatsModels? |
StatsBase is core functions for working with statistics, like calculating the weighted std-dev. |
Reviving this thread again... (third time I guess?) I am planning to revamp the LearnBase.jl package and upgrade it into a more general interface for statistical learning. In this plan the names I would like to ask you where else the LearnBase.jl package is being used inside the JuliaML organization. To what extent do we need to be backwards compatible? I see that the project doesn't have a Project.toml yet, and that is a good sign that it is not being actively used elsewhere. I will open a separate issue to share a proposal regarding an updated LearnBase.jl API. |
LearnBase hasn't required much updating since really all it does is claim names (although I'm pretty surprised there's no Project.toml!). I use it in several packages (OnlineStats, SparseRegression), but I started removing LearnBase from both within the last month. |
@joshday why is it that you are removing it from your packages? Do you feel that it is not suiting your needs? |
For OnlineStats, I'd like to move modeling bits into a different package. I don't think too many people use them and it allows me to drop 3 deps (LearnBase, LossFunctions, PenaltyFunctions). I misspoke about SparseRegression. I did start toying around with a new implementation of PenaltyFunctions though. I think both LossFunctions and PenaltyFunctions could be built simpler. |
You mean an equivalent to LearnBase.jl? Aren't the packages LossFunctions.jl and PenaltyFunctions.jl in good shape?
In what sense? Are you planning to improve the existing repositories or create new ones? Should we align these efforts? |
No, I mean making something like OnlineStatsModels.jl that has LearnBase/LossFunctions/PenaltyFunctions as dependencies.
Yes, they're great. I'm just toying around with making things simpler. They were written in the days when you couldn't dispatch on function types, so I'm trying a slightly different interface now. It it shows promise, I'll make new branches in the respective repos. |
Just so that I understand, this *Models.jl suffix means you will be defining some lmodel = learn(task, data, model)
predic = perform(task, data, lmodel) There is also this notion of full pipelines as models that I am finding interesting in AutoMLPipeline.jl. I am having a video call with the author tomorrow to brainstorm collaborations. But the main idea there is to have a Please let me know how we should plan these improvements. Ideally we could build a common API in LearnBase.jl to be reused everywhere else. |
This issue has been addressed in #35. |
My PR to move params from Distributions to StatsBase now has 8 commits and 22 comments...
I think this is as good a time as any to visit the idea of going back to 0 dependencies, which was our original thought when we created LearnBase. We essentially only have StatsBase and Distributions in our require file for
nobs
andparams
/params!
. Does anyone have a strong opinion on adding these ourselves and just not exporting them? Or other solutions?The text was updated successfully, but these errors were encountered: