Skip to content

Latest commit

 

History

History
111 lines (75 loc) · 3.41 KB

README.md

File metadata and controls

111 lines (75 loc) · 3.41 KB

Overview

This package implements Distance Weighted Discrimination (DWD). DWD For details see (Marron et al 2007, Wang and Zou 2018). Originally implemented in Python by Iain Carmichael. Currently maintained by Kitware, Inc.

The package currently implements:

  • Original DWD formulation solved with Second Order Cone Programming (SOCP) and solved using cvxpy.

  • Genralized DWD (gDWD) and kernel gDWD solved with the Majorization-Minimization algorithm presented in Wang and Zou, 2018.

Marron, James Stephen, Michael J. Todd, and Jeongyoun Ahn. "Distance-weighted discrimination." Journal of the American Statistical Association 102, no. 480 (2007): 1267-1271.

Wang, Boxiang, and Hui Zou. "Another look at distance‐weighted discrimination." Journal of the Royal Statistical Society: Series B (Statistical Methodology) 80, no. 1 (2018): 177-198.

Installation

The DWD package can be installed via pip or github. This package is currently only tested in python 3.6.

$ pip install dwd

The conic solver socp_dwd.DWD depends on cvxpy, which is not available on all platforms. See the cvxpy installation instructions. If cvxpy dependencies are met, then use pip install dwd[socp].

Flit is used for packaging, and all package metadata is stored in pyproject.toml. To install this project locally or for development, use flit install or build a pip-installable wheel with flit build.

Example

from sklearn.datasets import make_blobs
from dwd.socp_dwd import DWD

# sample sythetic training data
X, y = make_blobs(
    n_samples=200,
    n_features=2,
    centers=[[0, 0],
             [2, 2]],
)

# fit DWD classifier
dwd = DWD(C='auto').fit(X, y)

# compute training accuracy
dwd.score(X, y)  # 0.94

dwd_sep_hyperplane

from sklearn.datasets import make_circles
from dwd.gen_kern_dwd import KernGDWD

# sample some non-linear, toy data
X, y = make_circles(n_samples=200, noise=0.2, factor=0.5, random_state=1)

# fit kernel DWD wit gaussian kernel
kdwd = KernGDWD(
    lambd=.1, kernel='rbf',
    kernel_kws={'gamma': 1},
).fit(X, y)

# compute training accuracy
kdwd.score(X, y)  # 0.915

kern_dwd

For more example code see these example notebooks (including the code to generate the above figures). If the notebooks aren't loading on github you can copy/paste the notebook url into https://nbviewer.jupyter.org/.

Help and Support

Additional documentation, examples and code revisions are coming soon.

Documentation

The source code is located on github: https://github.com/slicersalt/dwd

Testing

Testing is done using nose.

Contributing

We welcome contributions to make this a stronger package: data examples, bug fixes, spelling errors, new features, etc.