Build a meta-feature (evaluation) engine in Python #2

PGijsbers · 2024-09-02T09:13:46Z

The evaluation engine is a component on the server which handles multiple tasks. This is currently implemented in Java and we want to rebuild it in Python, and compartmentalised per each function, for easier maintenance/more accessible to new contributors. One of its tasks is calculating meta-features over tabular datasets.

The engine should take tabular datasets and calculate a set of meta-features of them. Meta-features with an existing name should produce identical results, as much as possible currently available meta-features should remain available. Probably want to work with PyMFE.

PGijsbers · 2024-09-02T09:14:26Z

@joaquinvanschoren you were assigned and there is a listed "in progress". Could you write down what progress there is, if any? Then unassign yourself (assuming you are not working on this).

joaquinvanschoren · 2024-09-06T13:12:42Z

@NathanFCarvalho worked on this from March-June. He has written a script to compute meta-features with PyMFE which works on almost all datasets (tested on about 5000 datasets, but slow on the very large ones). It's a script because PyMFE does most of the work.

All code and documentation is here:
https://github.com/NathanFCarvalho/OpenML_Metafeature_Extraction

The remaining task would be to store the computed meta-features in OpenML, and rework the code so it can run as a cronjob.
Sidenote: PyMFE uses different names for the metafeatures, and they can be quite cryptic. Nathan made a mapping to more understandable names. However, these are not 100% the same as the existing meta-features. We need to decide whether we want to keep the old meta-features, or exclusively use the new ones for consistency.

I unassigned myself since I have a lot on my plate already, but this should be a very doable and well-contained task.

PGijsbers assigned joaquinvanschoren Sep 2, 2024

joaquinvanschoren removed their assignment Sep 6, 2024

PGijsbers assigned SubhadityaMukherjee Sep 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Build a meta-feature (evaluation) engine in Python #2

Build a meta-feature (evaluation) engine in Python #2

PGijsbers commented Sep 2, 2024

PGijsbers commented Sep 2, 2024

joaquinvanschoren commented Sep 6, 2024

Build a meta-feature (evaluation) engine in Python #2

Build a meta-feature (evaluation) engine in Python #2

Comments

PGijsbers commented Sep 2, 2024

PGijsbers commented Sep 2, 2024

joaquinvanschoren commented Sep 6, 2024