-
Notifications
You must be signed in to change notification settings - Fork 190
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
RFC: Define cut() in StatsBase? #228
Comments
I'd be happy to see |
Ah, so that's a third possibility. The implicit assumption in my description was that it would return a |
Any more comments? |
I like the idea of a type argument to allow returning a |
I'd vote for having it in |
That's consistent with what we do with the modeling stuff (though that should be moved to StatsModels). |
It would be weird to provide a function in StatsBase without any implementation, though. This is less surprising for statistical models since you cannot fit them without additional packages anyway. Also I'm not sure what other packages would need to define methods for I'm fine with implementing |
I completely agree with that but when we originally discussed |
Perhaps we could repurpose Stats.jl for that. 🙂 |
Yes, the name StatsBase is quite explicit, we could put extra features into (or load an reexport extra packages from) Stats instead (incidentally, that's how it's called in R). |
cut
used to be defined in DataArrays, since it returns aPooledDataArray
. With the move to CategoricalArrays, where should it live? I could add it to that package, but I figure people are more likely to look for it in StatsBase instead. So should we add it here? Of course that would add a dependency on CategoricalArrays.This would also allow providing an efficient
countmap
/addcounts!
method forCategoricalArray
. Else, CategoricalArrays will have to depend on StatsBase to provide it.The text was updated successfully, but these errors were encountered: