How can I use this toolkit to do multi-output regression? #747

offchan42 · 2018-08-25T06:29:26Z

For example, I have a bunch of real numbers (camera image from headset) as input and I want to predict where my left hand is relative to the camera (4 numbers, x, y, z, length).
My left hand will be visible on the camera.

x,y,z is a unit vector representing direction from camera to the left hand, and length is the distance from the camera to the left hand.

So can this tool support predicting multiple outputs? If yes, how could I do it?
If not please suggest me a tool that can do it or another way of solving my problem.

The text was updated successfully, but these errors were encountered:

rhiever · 2018-08-25T15:42:52Z

You would have to create a custom TPOT configuration that used operations that support multi-output regression, e.g., the sklearn MultiOutputRegressor. As MultiOutputRegressor takes another estimator as a parameter, see our SelectFromModel example in another configuration dictionary.

I'm not 100% familiar with multi-output support in sklearn, but any operations that work with the MultiOutputRegressor and cross_val_score should also work with TPOT.

You can read more about custom configuration dictionaries here.

robertritz · 2018-11-28T02:02:00Z

Could you provide a bit more help with the custom configuration dictionary? I'm attempting to set up a simple custom configuration using the SelectFromModel example you gave. Here is my current config:

tpot_config = {
    'sklearn.multioutput.MultiOutputRegressor': {
        'estimator': {
            'sklearn.ensemble.ExtraTreesRegressor': {
                'n_estimators': [100],
                'max_features': np.arange(0.05, 1.01, 0.05)
            }
        }
    }
}

And here is my code to run TPOT:

pipeline_optimizer = TPOTRegressor(generations=5, population_size=20, max_time_mins=480, n_jobs=-1, verbosity=2, random_state=12345, config_dict=tpot_config)
pipeline_optimizer.fit(X_train, y_train)
print(pipeline_optimizer.score(X_test, y_test))
pipeline_optimizer.export('tpot_exported_pipeline.py')

I receive an error:
ValueError: Error: Input data is not in a valid format. Please confirm that the input data is scikit-learn compatible. For example, the features must be a 2-D array and target labels must be a 1-D array.

Is it necessary to specify the parameters to search for each algorithm? Before reading the documentation and your example I naively just passed through a list of algorithms like so:

tpot_config = {
    'sklearn.multioutput.MultiOutputRegressor': {
        'estimator': ['ExtraTreesRegressor']
      }
}

There are sklearn algorithms that are inherently multioutput, but with MultiOutputRegressor I get many more options. Thanks!

rhiever added the question label Aug 25, 2018

offchan42 closed this as completed Nov 17, 2018

mpdunne mentioned this issue Aug 14, 2019

Flag to allow multioutput. #903

Open

kradant mentioned this issue Dec 3, 2019

multioutput problem #971

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How can I use this toolkit to do multi-output regression? #747

How can I use this toolkit to do multi-output regression? #747

offchan42 commented Aug 25, 2018 •

edited

Loading

rhiever commented Aug 25, 2018

robertritz commented Nov 28, 2018

How can I use this toolkit to do multi-output regression? #747

How can I use this toolkit to do multi-output regression? #747

Comments

offchan42 commented Aug 25, 2018 • edited Loading

rhiever commented Aug 25, 2018

robertritz commented Nov 28, 2018

offchan42 commented Aug 25, 2018 •

edited

Loading