-
Notifications
You must be signed in to change notification settings - Fork 21
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
kfold time series #58
Comments
I am not the maintainer of MLUtils.jl but I believe that anything related to specific domains such as time series, spatial data should be developed in separate projects. For example, in geospatial ML there are a couple of methods available in GeoStats.jl: https://juliaearth.github.io/GeoStats.jl/stable/validation.html You could even use these with time series data. Alternatively, you can propose specific validation methods in TimeSeries.jl or any other package that is devoted to the analysis of time series objects. |
MLDataPattern.jl had time series functions, but they were tricky to work with and didn't compose as well with the rest of the package. In general, time dimensions are hard to get right. So, a quick port of the MLDataPattern.jl functions is probably not what we want here. Indeed, what we finally land on might be more appropriate in a separate package as Julio suggested. So, the answer is yes, but whatever is proposed will have to be carefully considered for how we want to work with temporal data in general. |
What about a simple kfolds like this: https://stats.stackexchange.com/a/14109 |
Yes, we should implement that. See also https://scikit-learn.org/stable/modules/generated/sklearn.model_selection.TimeSeriesSplit.html |
Hi,
have you thought about porting some Time Series utility functions? Such as
kfold
for time series?https://alan-turing-institute.github.io/MLJ.jl/stable/evaluating_model_performance/#MLJBase.TimeSeriesCV
Thanks.
The text was updated successfully, but these errors were encountered: