Build a meta-feature (evaluation) engine in Python #2
Comments
@joaquinvanschoren you are assigned to this and it is listed as "in progress". Could you write down what progress there is, if any? Then please unassign yourself (assuming you are not working on this).
@NathanFCarvalho worked on this from March to June. He wrote a script that computes meta-features with PyMFE and works on almost all datasets (tested on about 5000 datasets, but slow on the very large ones). It is only a script because PyMFE does most of the work. All code and documentation is here: The remaining task would be to store the computed meta-features in OpenML, and to rework the code so it can run as a cronjob. I unassigned myself since I have a lot on my plate already, but this should be a very doable and well-contained task.
The evaluation engine is a server component that handles multiple tasks. It is currently implemented in Java, and we want to rebuild it in Python, split into one component per function, so that it is easier to maintain and more accessible to new contributors. One of its tasks is calculating meta-features over tabular datasets.
The engine should take tabular datasets and calculate a set of meta-features for them. Meta-features with an existing name should produce identical results, and as many of the currently available meta-features as possible should remain available. We probably want to work with PyMFE.
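To make the scope concrete, here is a minimal sketch of what computing a few simple meta-features over a tabular dataset could look like. It uses only the standard library and is not the existing script or the PyMFE API. The function name `simple_meta_features`, the row-list input format, and the choice to count the target column as a feature are all assumptions made for illustration. The output keys are meant to mirror existing meta-feature names and should be checked against the current engine before relying on them.

```python
import math
from collections import Counter


def simple_meta_features(rows, target_index=-1):
    """Compute a few simple meta-features for a tabular dataset.

    Hypothetical helper for illustration only.
    rows: list of equal-length sequences; None marks a missing value.
    target_index: position of the class label in each row.
    """
    if not rows:
        raise ValueError("dataset has no rows")

    # Count how often each (non-missing) class label occurs.
    labels = [row[target_index] for row in rows]
    class_counts = Counter(label for label in labels if label is not None)
    n_labelled = sum(class_counts.values())

    # Shannon entropy (base 2) of the class distribution.
    entropy = -sum(
        (count / n_labelled) * math.log2(count / n_labelled)
        for count in class_counts.values()
    )

    return {
        "NumberOfInstances": len(rows),
        # Assumption: the target column is counted as a feature.
        "NumberOfFeatures": len(rows[0]),
        "NumberOfClasses": len(class_counts),
        "NumberOfMissingValues": sum(
            1 for row in rows for value in row if value is None
        ),
        "MajorityClassSize": max(class_counts.values(), default=0),
        "ClassEntropy": entropy,
    }


# Tiny example dataset: two numeric columns and a class label.
example = [
    [1.0, None, "a"],
    [2.0, 3.0, "a"],
    [3.0, 1.0, "b"],
    [4.0, 2.0, "b"],
]
features = simple_meta_features(example)
```

A dictionary keyed by meta-feature name makes it straightforward to compare values against those produced by the Java engine, which is what the "identical results" requirement would need. In practice PyMFE would compute most of these values.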