
Create logistic_regression.md #83

Merged 6 commits into main on Oct 6, 2023

Conversation

benjaminsavage
Collaborator

Share WALR algorithm as a readme file

Fixed a few minor bugs.
$X = (X^{(1)}, ..., X^{(N)}) \in [0, 1]^{N*k}$ and $y = (y^{(1)}, ..., y^{(N)})^T \in \\{0, 1\\}^N$ \
denote the *(N \* k)*-dimensional feature *matrix* and *N*-dimensional label vector respectively, then we have

$\text{dot-product} = \frac{1}{N} \sum\limits_{i=1}^{N} y^{(i)} X^{(i)} = \frac{1}{N} \cdot Xy$ ,\
Collaborator

Suggested change
$\text{dot-product} = \frac{1}{N} \sum\limits_{i=1}^{N} y^{(i)} X^{(i)} = \frac{1}{N} \cdot Xy$ ,\
$\text{dot-product} = \frac{1}{N} \sum\limits_{i=1}^{N} y^{(i)} X^{(i)} = \frac{C\cdot y}{N}$ ,\
Suggested change
$\text{dot-product} = \frac{1}{N} \sum\limits_{i=1}^{N} y^{(i)} X^{(i)} = \frac{1}{N} \cdot Xy$ ,\
$\text{dot-product} = \frac{1}{N} \sum\limits_{i=1}^{N} y^{(i)} X^{(i)} = \frac{1}{N}\cdot X \cdot y$ ,\

I found the positioning of the dot a little annoying, because you are saying dot-product but then putting the dot elsewhere.
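
As an aside (not part of the PR), the equivalence being discussed is easy to check numerically. In the sketch below, `features` and `labels` are illustrative stand-ins for $X$ and $y$, with each row of `features` holding one feature vector $X^{(i)}$:

```python
import numpy as np

# Illustrative data: N examples with k features in [0, 1] and binary labels.
rng = np.random.default_rng(0)
N, k = 1000, 5
features = rng.uniform(0.0, 1.0, size=(N, k))    # row i is X^(i)
labels = rng.integers(0, 2, size=N)              # y^(i) in {0, 1}

# dot-product = (1/N) * sum_i y^(i) X^(i), written as an explicit sum ...
dot_product_sum = sum(labels[i] * features[i] for i in range(N)) / N

# ... and as a single matrix-vector product (X^T y when rows hold X^(i)).
dot_product_matvec = features.T @ labels / N

assert np.allclose(dot_product_sum, dot_product_matvec)
```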

bmcase and others added 3 commits October 5, 2023 20:53
removed a stray sigma and made Martin's suggested changes including adding authors
@bmcase (Collaborator) left a comment

Looks good. Made a few formatting suggestions.

Any additional computations used to update $\text{noisy-} \nabla L(\theta)$ (as in a gradient descent procedure) will still be label-DP (with the same privacy parameters) due to the Post Processing Theorem of DP (see Dwork+11 text).

- **Question:** is $\text{noisy-dot-product}$ efficiently computable?\
**Answer:** Yes, computing this vector requires just one pass through the set of feature vectors, and $k$ random draws from a Gaussian distribution.
Collaborator

Probably should at least mention here that, when implementing this in MPC, we will compute the division by N outside the MPC.
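
For illustration only (not the actual MPC implementation), the sketch below mirrors both points: the aggregate needs just one pass over the feature vectors plus $k$ Gaussian draws, and the division by $N$ can be applied afterwards as post-processing. The variable `noise_scale` is a placeholder for whatever the label-DP budget dictates:

```python
import numpy as np

rng = np.random.default_rng(0)
N, k = 1000, 5
features = rng.uniform(0.0, 1.0, size=(N, k))    # row i is X^(i)
labels = rng.integers(0, 2, size=N)              # y^(i) in {0, 1}
noise_scale = 1.0  # placeholder; set by the (epsilon, delta) label-DP budget

# One pass over the feature vectors: accumulate sum_i y^(i) X^(i).
aggregate = np.zeros(k)
for x_i, y_i in zip(features, labels):
    aggregate += y_i * x_i

# k Gaussian draws, added before the aggregate is revealed.
noisy_aggregate = aggregate + rng.normal(0.0, noise_scale, size=k)

# The division by N is plain post-processing and can happen afterwards
# (e.g. outside the MPC, as the comment above suggests).
noisy_dot_product = noisy_aggregate / N
```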


First, we define the terms $\text{LHS}$ and $\text{RHS}$ as\
$\sum\text{LHS} = \sum\limits_{i=1}^{N} p_i X^{(i)}$\
$\sum\text{RHS} = \sum\limits_{i=1}^{N} y^{(i)} X^{(i)}$
Collaborator

Consider using $:=$ for these two lines defining new notation.

$\text{noisy-} \nabla L(\theta) = (\frac{1}{N}) \cdot (\sum \text{LHS}) - (\frac{1}{N}) \cdot (\sum \text{RHS}) - \text{gaussian-noise}$.

To avoid computing $\text{LHS}$ at every optimization step, we approximate this term using a minibatch of size $m$. Specifically, at every gradient descent step, we sample a minibatch $M$ of size $m$, and we compute\
$\text{mini-}\sum\text{LHS} = \sum\limits_{j=1}^{m} p_j X^{(j)}$,\
Collaborator

Here also use $:=$ when defining new terminology.
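
A hedged NumPy sketch of these quantities (illustrative data, variable names, and noise scale; not code from the PR): it forms $\sum\text{LHS}$ and $\sum\text{RHS}$ over all $N$ examples, the noisy full gradient, and the minibatch term $\text{mini-}\sum\text{LHS}$:

```python
import numpy as np

rng = np.random.default_rng(0)
N, k, m = 1000, 5, 64                            # m = minibatch size (illustrative)
features = rng.uniform(0.0, 1.0, size=(N, k))    # row i is X^(i)
labels = rng.integers(0, 2, size=N)              # y^(i) in {0, 1}
theta = np.zeros(k)                              # current model vector
noise_scale = 1.0                                # placeholder noise scale

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

p = sigmoid(features @ theta)                    # p_i = sigma(theta^T X^(i))

# Full-data terms.
sum_lhs = features.T @ p                         # sum_i p_i X^(i)
sum_rhs = features.T @ labels                    # sum_i y^(i) X^(i)
noise = rng.normal(0.0, noise_scale, size=k)     # zero-mean, so its sign is immaterial
noisy_grad = sum_lhs / N - sum_rhs / N - noise

# Minibatch approximation of the LHS term over a sample of size m.
batch = rng.choice(N, size=m, replace=False)
mini_sum_lhs = features[batch].T @ sigmoid(features[batch] @ theta)
```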

1. initialize model vector $\theta$
2. while not converged:\
sample minibatch of size $m$, and\
$\text{set } \theta = \theta - lr \cdot ( \text{noisy-hybrid-} \nabla L(\theta))$
Collaborator

$\theta := ...$
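
Putting the pieces together, here is a rough sketch of the hybrid training loop. The exact definition of $\text{noisy-hybrid-} \nabla L(\theta)$ is not quoted in this conversation, so the sketch assumes it is the minibatch average $\frac{1}{m}\cdot\text{mini-}\sum\text{LHS}$ minus the precomputed $\text{noisy-dot-product}$; data and hyperparameters are illustrative:

```python
import numpy as np

rng = np.random.default_rng(0)
N, k, m = 1000, 5, 64
lr, steps, noise_scale = 0.1, 200, 1.0           # illustrative hyperparameters
features = rng.uniform(0.0, 1.0, size=(N, k))    # row i is X^(i)
labels = rng.integers(0, 2, size=N)              # y^(i) in {0, 1}

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# noisy-dot-product is computed once, up front, and reused at every step.
noisy_dot_product = (features.T @ labels
                     + rng.normal(0.0, noise_scale, size=k)) / N

theta = np.zeros(k)
for _ in range(steps):                           # stand-in for "while not converged"
    batch = rng.choice(N, size=m, replace=False)
    mini_sum_lhs = features[batch].T @ sigmoid(features[batch] @ theta)
    # Assumed form of the hybrid gradient: minibatch LHS average minus the
    # fixed noisy-dot-product (the README's exact definition is not quoted here).
    noisy_hybrid_grad = mini_sum_lhs / m - noisy_dot_product
    theta = theta - lr * noisy_hybrid_grad       # i.e. theta := theta - lr * grad
```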

- In the absence of any computational or privacy constraints, the model can be trained via full-batch gradient descent of the following form, where $\text{lr}$ is the learning rate:
1. initialize model vector $\theta$
2. while not converged: \
$\text{set } \theta = \theta - \text{lr} \cdot ((\frac{1}{N} \cdot \sum\limits_{i=1}^{N} \sigma(\theta^T X^{(i)}) X^{(i)}) - (\frac{1}{N} \cdot \sum\limits_{i=1}^{N} y^{(i)} X^{(i)}))$
Collaborator

$\theta :=$


1. initialize model vector $\theta$
2. while not converged:\
$\text{set } \theta = \theta - lr \cdot ((\frac{1}{N} \cdot \sum\limits_{i=1}^{N} p_i X^{(i)} ) - \text{noisy-dot-product})$
Collaborator

Suggested change
$\text{set } \theta = \theta - lr \cdot ((\frac{1}{N} \cdot \sum\limits_{i=1}^{N} p_i X^{(i)} ) - \text{noisy-dot-product})$
$\text{set } \theta := \theta - lr \cdot ((\frac{1}{N} \cdot \sum\limits_{i=1}^{N} p_i X^{(i)} ) - \text{noisy-dot-product})$
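
For comparison, a short sketch of the two full-batch updates quoted above: the non-private update with the exact label term, and the label-DP variant with $\text{noisy-dot-product}$ swapped in (setup and hyperparameters are illustrative, not from the PR):

```python
import numpy as np

rng = np.random.default_rng(0)
N, k = 1000, 5
lr, steps, noise_scale = 0.1, 200, 1.0           # illustrative hyperparameters
features = rng.uniform(0.0, 1.0, size=(N, k))    # row i is X^(i)
labels = rng.integers(0, 2, size=N)              # y^(i) in {0, 1}

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Computed once and reused; stands in for noisy-dot-product.
noisy_dot_product = (features.T @ labels
                     + rng.normal(0.0, noise_scale, size=k)) / N

theta_exact = np.zeros(k)   # non-private full-batch gradient descent
theta_dp = np.zeros(k)      # label-DP variant with the noisy label term
for _ in range(steps):      # stand-in for "while not converged"
    # Non-private update: both averages use the raw labels.
    lhs_exact = features.T @ sigmoid(features @ theta_exact) / N
    theta_exact = theta_exact - lr * (lhs_exact - features.T @ labels / N)
    # Label-DP update: the exact label term is replaced by noisy-dot-product.
    lhs_dp = features.T @ sigmoid(features @ theta_dp) / N
    theta_dp = theta_dp - lr * (lhs_dp - noisy_dot_product)
```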

1. initialize model vector $\theta$
2. while not converged:\
sample minibatch of size $m$, and\
$\text{set } \theta = \theta - lr \cdot ( \text{noisy-hybrid-} \nabla L(\theta))$
Collaborator

Suggested change
$\text{set } \theta = \theta - lr \cdot ( \text{noisy-hybrid-} \nabla L(\theta))$
$\text{set } \theta := \theta - lr \cdot ( \text{noisy-hybrid-} \nabla L(\theta))$

$\text{noisy-} \nabla L(\theta) = (\frac{1}{N}) \cdot (\sum \text{LHS}) - (\frac{1}{N}) \cdot (\sum \text{RHS}) - \text{gaussian-noise}$.

To avoid computing $\text{LHS}$ at every optimization step, we approximate this term using a minibatch of size $m$. Specifically, at every gradient descent step, we sample a minibatch $M$ of size $m$, and we compute\
$\text{mini-}\sum\text{LHS} = \sum\limits_{j=1}^{m} p_j X^{(j)}$,\
Collaborator

Suggested change
$\text{mini-}\sum\text{LHS} = \sum\limits_{j=1}^{m} p_j X^{(j)}$,\
$\text{mini-}\sum\text{LHS} := \sum\limits_{j=1}^{m} p_j X^{(j)}$,\

To circumvent this computational issue, we use a simple **"hybrid"-minibatch gradient** computation that we describe here.

First, we define the terms $\text{LHS}$ and $\text{RHS}$ as\
$\sum\text{LHS} = \sum\limits_{i=1}^{N} p_i X^{(i)}$\
Collaborator

Suggested change
$\sum\text{LHS} = \sum\limits_{i=1}^{N} p_i X^{(i)}$\
$\sum\text{LHS} := \sum\limits_{i=1}^{N} p_i X^{(i)}$\


First, we define the terms $\text{LHS}$ and $\text{RHS}$ as\
$\sum\text{LHS} = \sum\limits_{i=1}^{N} p_i X^{(i)}$\
$\sum\text{RHS} = \sum\limits_{i=1}^{N} y^{(i)} X^{(i)}$
Collaborator

Suggested change
$\sum\text{RHS} = \sum\limits_{i=1}^{N} y^{(i)} X^{(i)}$
$\sum\text{RHS} := \sum\limits_{i=1}^{N} y^{(i)} X^{(i)}$

@benjaminsavage merged commit ef8fcc2 into main on Oct 6, 2023
1 check passed
3 participants