Scoring parameter in TPOTRegressor unable to take customized loss functions #648

miteshyadav · 2018-01-04T22:16:40Z

I am trying to fit my TPOTRegressor using a customized scoring function. I have followed the instructions as per given on the website but it throws an error.

def rmsl_error(y, h): 
    """
    Compute the Root Mean Squared Log Error for hypothesis h and targets y

    Args:
        h - numpy array containing predictions with shape (n_samples, n_targets)
        y - numpy array containing targets with shape (n_samples, n_targets)
    """
    return np.sqrt(np.square(np.log(h + 1) - np.log(y + 1)).mean())

from sklearn.metrics.scorer import make_scorer
my_custom_scorer = make_scorer(rmsl_error, greater_is_better=False)

from tpot import TPOTRegressor
#del final_df['day']

tpot = TPOTRegressor(generations=10, population_size=50, verbosity=2,n_jobs=-1,cv=iter_cv,scoring=my_custom_scorer)
print ('aaaaaaaaaaaaaaaaa')
tpot.fit(final_df.iloc[:,final_df.columns!='count'].values,final_df.iloc[:,6].values)
print ('bbbbbbbbbbbbbbbb')

The following is the screenshot of the error message:

Same function works fine for any other regressor

weixuanfu · 2018-01-04T23:37:15Z

Please check the issue #645 . I think it is a notebook-related issue. You may not use n_jobs > 1 with customized scoring functions using current version (0.9.1) of TPOT in Jupyter notebook. Also I fixed another bug in the new scoring API and merged into dev branch. I will work on this issue and release a patch soon.

GinoWoz1 · 2018-09-16T14:37:44Z

Use my loss function @miteshyadav - it works fine for me. I actually used the same one you had but ran into issues.

def rmsle_loss(y_true, y_pred):
    assert len(y_true) == len(y_pred)
    try:
        terms_to_sum = [(math.log(y_pred[i] + 1) - math.log(y_true[i] + 1)) ** 2.0 for i,pred in enumerate(y_pred)]
    except:
        return float('inf')
    if not (y_true >= 0).all() and not (y_pred >= 0).all():
            return float('inf')
    return (sum(terms_to_sum) * (1.0/len(y_true))) ** 0.5

rmsle_loss = make_scorer(rmsle_loss,greater_is_better=False)```

weixuanfu added the question label Jan 4, 2018

weixuanfu closed this as completed Oct 1, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Scoring parameter in TPOTRegressor unable to take customized loss functions #648

Scoring parameter in TPOTRegressor unable to take customized loss functions #648

miteshyadav commented Jan 4, 2018 •

edited by rhiever

Loading

weixuanfu commented Jan 4, 2018 •

edited

Loading

GinoWoz1 commented Sep 16, 2018 •

edited

Loading

Scoring parameter in TPOTRegressor unable to take customized loss functions #648

Scoring parameter in TPOTRegressor unable to take customized loss functions #648

Comments

miteshyadav commented Jan 4, 2018 • edited by rhiever Loading

weixuanfu commented Jan 4, 2018 • edited Loading

GinoWoz1 commented Sep 16, 2018 • edited Loading

miteshyadav commented Jan 4, 2018 •

edited by rhiever

Loading

weixuanfu commented Jan 4, 2018 •

edited

Loading

GinoWoz1 commented Sep 16, 2018 •

edited

Loading