
Fix NegativeBinomial scaling #814

Merged: 1 commit, May 10, 2020
Conversation

@lostella (Contributor) commented May 8, 2020

Issue #, if available: #718, #719; may be related to #636

Description of changes: The scaling of `alpha` introduced in #719 only makes sense if `scale >= 1.0`; otherwise the scaled `alpha` can become negative. Intuitively this makes sense for count data, which can really only be upscaled, not downscaled.
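
To see why (a hedged sketch; this is my reading of the moment-matching behind #719, not a verbatim quote of its formulas): matching the mean and variance of `scale * NegativeBinomial(mu, alpha)` with a `NegativeBinomial(mu', alpha')` gives `mu' = scale * mu` and `alpha' = alpha + (scale - 1) / (scale * mu)`, and the second expression can go negative when `scale < 1.0`:

```python
# Sketch only: reconstructed moment-matching scaling (assumed, not quoted from #719).
# Variance of NegativeBinomial(mu, alpha) is mu + alpha * mu**2.
mu, alpha = 2.0, 0.1

for scale in (4.0, 1.0, 0.25):
    scaled_mu = scale * mu
    scaled_alpha = alpha + (scale - 1.0) / (scale * mu)
    print(scale, scaled_mu, scaled_alpha)
# scale=0.25 gives scaled_alpha = -1.4, an invalid (negative) dispersion
```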

This PR makes sure that that's the case: essentially, if `scale < 1.0` then it is pushed up to 1.0 (via a soft threshold rather than a hard clip; see the sketch below).
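
For concreteness, a minimal sketch of such a soft lower bound (a hypothetical illustration using MXNet's `softrelu` activation, i.e. `softplus(x) = log(1 + exp(x))`; not necessarily the exact expression in the diff):

```python
import mxnet.ndarray as F

def soft_lower_bound_one(scale):
    # softplus(scale - 1) + 1 acts like max(1, scale), but smoothly:
    # it approaches 1.0 as scale -> 0 and approaches scale for scale >> 1.
    return F.Activation(scale - 1.0, act_type="softrelu") + 1.0

print(soft_lower_bound_one(F.array([0.01, 1.0, 10.0])))
# roughly [1.32, 1.69, 10.0]
```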

As a minimal working example, consider the following snippet:

```python
import numpy as np
import pandas as pd

import matplotlib.pyplot as plt

from gluonts.model.simple_feedforward import SimpleFeedForwardEstimator
from gluonts.distribution import NegativeBinomialOutput
from gluonts.dataset.common import ListDataset
from gluonts.trainer import Trainer

# synthetic weekly count data
data = np.random.negative_binomial(n=3, p=0.9, size=(200,))

data_series = pd.Series(
    data=data,
    index=pd.date_range(
        start="2014-01-05 00:00:00",
        periods=len(data),
        freq="W",
    ),
)

dataset = ListDataset(
    data_iter=[
        {
            "start": "2014-01-05 00:00:00",
            "target": list(data),
        }
    ],
    freq="W",
)

# train a simple feedforward model with a negative binomial output
estimator = SimpleFeedForwardEstimator(
    freq="W",
    prediction_length=20,
    distr_output=NegativeBinomialOutput(),
    trainer=Trainer(epochs=10, hybridize=False),
)

predictor = estimator.train(dataset)

# plot the observed series together with the forecast
forecast = next(iter(predictor.predict(dataset)))
data_series.plot()
forecast.plot()
plt.show()
```

Before the fix, training fails with a NaN loss (the negative scaled `alpha` makes the likelihood undefined):

```
Traceback (most recent call last):
  File "/Users/stellalo/gluon-ts/issues/issue_negbin.py", line 39, in <module>
    predictor = estimator.train(dataset)
  File "/Users/stellalo/gluon-ts/src/gluonts/model/estimator.py", line 252, in train
    training_data, validation_data, num_workers, num_prefetch, **kwargs
  File "/Users/stellalo/gluon-ts/src/gluonts/model/estimator.py", line 231, in train_model
    validation_iter=validation_data_loader,
  File "/Users/stellalo/gluon-ts/src/gluonts/trainer/_base.py", line 328, in __call__
    "Got NaN in first epoch. Try reducing initial learning rate."
gluonts.core.exception.GluonTSUserError: Got NaN in first epoch. Try reducing initial learning rate.
```

After the fix, training completes:

[image: issue_negbin — plot of the observed series and the forecast]

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

@lostella changed the title from *bound scale above 1* to *Fix NegativeBinomial scaling* on May 8, 2020
@lostella requested a review from canerturkmen on May 8, 2020, 11:49
@lostella added this to the v0.5 milestone on May 8, 2020
@canerturkmen (Contributor) left a comment

Two questions here:

  1. It's whoever's consuming this class that decides the scale. So if that consumer expects `scale = 0.0001`, will the behavior not be affected by that?
  2. Why a soft thresholding instead of `F.maximum(1, scale)`?

@lostella (Contributor, Author) commented May 8, 2020

> 1. It's whoever's consuming this class that decides the scale. So if that consumer expects `scale = 0.0001`, will the behavior not be affected by that?

I think the contract can reasonably be "you'll have the distribution scaled by the scale that you pass, as long as that's greater than or equal to 1". I don't see problems with that.

> 2. Why a soft thresholding instead of `F.maximum(1, scale)`?

I just went for the differentiable option, in case the scale is output by some model (with parameters that are optimized). In all of our models the scale is always a function of the data only, but who knows. Not that there seems to be a problem with `maximum` and SGD, but we use softplus everywhere for this purpose.
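
To illustrate the gradient argument with a hypothetical example (not code from this PR): with a hard `maximum`, the gradient with respect to the scale is exactly zero below the bound, while the soft threshold still passes a signal:

```python
from mxnet import autograd, nd

scale = nd.array([0.5])  # below the bound
scale.attach_grad()

with autograd.record():
    hard = nd.maximum(scale, 1.0)
hard.backward()
print(scale.grad)  # [0.] -- no gradient signal below the bound

with autograd.record():
    soft = nd.Activation(scale - 1.0, act_type="softrelu") + 1.0
soft.backward()
print(scale.grad)  # ~[0.378] == sigmoid(scale - 1.0), still informative
```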

@lostella (Contributor, Author) commented May 8, 2020

cc @kashif

@kashif (Contributor) commented May 8, 2020

@lostella looks good... I forgot about the `scale < 1.0` case because I always assumed it to be > 1, but yes, your solution is elegant. I remember seeing something similar done in another context, but I can't remember where right now... (perhaps in some RL setting...). 👍

@canerturkmen (Contributor) left a comment

lgtm, and seems to be producing consistent results in practice. Thanks a lot for not letting this go 🎩
