Elastic Net Regularizer #49
Conversation
dask_glm/algorithms.py
Outdated
@@ -319,7 +319,7 @@ def bfgs(X, y, max_iter=500, tol=1e-14, family=Logistic):
     return beta


-def proximal_grad(X, y, regularizer=L1, lamduh=0.1, family=Logistic,
+def proximal_grad(X, y, regularizer=L1(), lamduh=0.1, family=Logistic,
Thoughts about changing this to 'l1' and looking it up in _regularizers?
I agree, it should be changed. We could also move the string-lookup logic to a function in regularizers.py.
@TomAugspurger What do you think of my latest commit? I use the base class to retrieve child classes via string.
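For illustration, a minimal sketch of what a string lookup on the base class could look like (a standalone example, not the exact code from the commit; the public name attribute and the get classmethod shown here are assumptions):

class Regularizer(object):
    """Sketch of a base class that resolves regularizers from strings."""
    name = '_base'

    @classmethod
    def get(cls, obj):
        # already a Regularizer instance: pass it through unchanged
        if isinstance(obj, cls):
            return obj
        # otherwise treat it as a name and search the registered subclasses
        for child in cls.__subclasses__():
            if child.name == obj:
                return child()
        raise KeyError('Unknown regularizer: {!r}'.format(obj))


class L2(Regularizer):
    name = 'l2'


assert isinstance(Regularizer.get('l2'), L2)
assert isinstance(Regularizer.get(L2()), L2)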
dask_glm/regularizers.py
Outdated
@@ -9,6 +9,7 @@ class Regularizer(object):
     Defines the set of methods required to create a new regularization object. This includes
     the regularization functions itself and it's gradient, hessian, and proximal operator.
     """
+    _name = '_base'
I think maybe this should be just name. I think @moody-marlin mentioned that users might define their own regularizers, so they shouldn't need to override private attributes.
I agree, made the change.
              max_iter=250, abstol=1e-4, reltol=1e-2, family=Logistic):

     pointwise_loss = family.pointwise_loss
     pointwise_gradient = family.pointwise_gradient
-    regularizer = _regularizers.get(regularizer, regularizer)  # string
+    regularizer = Regularizer.get(regularizer)
I'll be sad to see this line go, but 👍
dask_glm/regularizers.py
Outdated
    def proximal_operator(beta, t):
        return 1 / (1 + t) * beta
    Defines the set of methods required to create a new regularization object. This includes
    the regularization functions itself and it's gradient, hessian, and proximal operator.
it's -> its
dask_glm/regularizers.py
Outdated
    def f(beta):
        return (beta**2).sum()
    def proximal_operator(self, beta, t):
        """Proximal operator function for non-differentiable regularization function."""
Proximal operator for regularization function; the regularizer doesn't need to be non-differentiable, it just can be.
dask_glm/regularizers.py
Outdated
            return grad(beta, *args) + lam * L1.gradient(beta)
        return wrapped
    def hessian(self, beta):
        raise ValueError('l1 norm is not twice differentiable!')
We should probably just fix this with this PR; this should be similar to the gradient:
if np.any(np.isclose(beta, 0)):
    raise ValueError('l1 norm is not twice differentiable at 0!')
else:
    return np.zeros((beta.shape[0], beta.shape[0]))
The l1 regularizer is a straight line everywhere except at 0, where there's a kink.
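To make the suggestion concrete, here is a self-contained sketch of an l1 regularizer with the fixed hessian (illustrative only, not the actual dask_glm class; the class name and method signatures are assumptions based on the diff above):

import numpy as np

class L1Sketch(object):
    """Illustrative l1 regularizer with the hessian fix suggested above."""

    def f(self, beta):
        # l1 penalty: sum of absolute values
        return np.abs(beta).sum()

    def gradient(self, beta):
        # sign(beta) away from 0; not differentiable at 0
        if np.any(np.isclose(beta, 0)):
            raise ValueError('l1 norm is not differentiable at 0!')
        return np.sign(beta)

    def hessian(self, beta):
        # linear away from 0, so the hessian is the zero matrix there;
        # at 0 there is a kink and the hessian is undefined
        if np.any(np.isclose(beta, 0)):
            raise ValueError('l1 norm is not twice differentiable at 0!')
        return np.zeros((beta.shape[0], beta.shape[0]))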
Then do we want to switch the elastic net hessian to include the l1 side of the weighted sum? It won't have a numerical effect, except that it will raise when there are errors.
Ah yes, definitely - I missed that.
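For reference, folding the l1 side into the elastic net hessian might look roughly like this (a sketch reusing the L1Sketch class above and assuming the l2 penalty is (beta**2).sum() / 2 so its hessian is the identity; the class names and the weight attribute are illustrative, not necessarily the exact code in the PR):

import numpy as np

class L2Sketch(object):
    """Illustrative l2 regularizer, f(beta) = (beta**2).sum() / 2."""
    def hessian(self, beta):
        # hessian of (1/2) * ||beta||^2 is the identity matrix
        return np.eye(beta.shape[0])

class ElasticNetSketch(object):
    """Illustrative elastic net hessian including the l1 side."""
    def __init__(self, weight=0.5):
        self.weight = weight      # fraction of l1 in the penalty (assumed)
        self.l1 = L1Sketch()      # l1 sketch class from above
        self.l2 = L2Sketch()

    def hessian(self, beta):
        # weighted sum of the component hessians; the l1 term is zero away
        # from 0 so it adds nothing numerically, but it raises when any
        # coefficient sits at 0, matching the l1 fix above
        return (self.weight * self.l1.hessian(beta)
                + (1 - self.weight) * self.l2.hessian(beta))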
@@ -89,7 +89,7 @@ def test_basic_unreg_descent(func, kwargs, N, nchunks, family):
 @pytest.mark.parametrize('nchunks', [1, 10])
 @pytest.mark.parametrize('family', [Logistic, Normal, Poisson])
 @pytest.mark.parametrize('lam', [0.01, 1.2, 4.05])
-@pytest.mark.parametrize('reg', [L1, L2])
+@pytest.mark.parametrize('reg', [r() for r in Regularizer.__subclasses__()])
Nice.
@moody-marlin ok, implemented your changes, looks good to go.
@TomAugspurger do you have any other comments / suggestions? LGTM.
👍 on a quick skim.
This PR adds elastic net regularization, the weighted sum of l1 and l2, as an option. Couldn't find a proximal operator for it, so derived one with much help from @moody-marlin in an included notebook. It also adds an abstract base class for Regularizer and fixes the formulas for l2.
Resolves #48
Resolves #47
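For readers without the notebook handy, here is a hedged sketch of the closed form the elastic net proximal operator reduces to, assuming the penalty is f(x) = weight * ||x||_1 + (1 - weight)/2 * ||x||_2^2 (the weight parameter and this exact form are assumptions; the derivation in the PR's notebook is authoritative):

import numpy as np

def elastic_net_prox(beta, t, weight=0.5):
    """Sketch: proximal operator of weight*||.||_1 + (1-weight)/2*||.||_2^2."""
    # l1 part: elementwise soft-thresholding by weight * t
    shrunk = np.sign(beta) * np.maximum(np.abs(beta) - weight * t, 0)
    # l2 part: uniform shrinkage by 1 / (1 + (1 - weight) * t)
    return shrunk / (1 + (1 - weight) * t)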