`LogisticRegression` #100

ordabayevy · 2023-11-01T17:49:21Z

No description provided.

mbabadi · 2023-11-21T01:30:45Z

cellarium/ml/module/base_module.py

@@ -24,17 +30,38 @@ def _get_fn_args_from_batch(tensor_dict: dict[str, np.ndarray | torch.Tensor]) -
        Get forward method arguments from batch.
        """

+    def __getattr__(self, name: str) -> Any:


It seems like you have brought in some parts of the PyroModule code into BaseModule and got rid of BasePyroModule. What's the logic?

I wanted to have a single base model both for regular and pyro models. The only feature needed from PyroModule is the ability to have constrained PyroParams. PyroModule also has a code that syncs parameters with ParamStoreDict. This is the stripped down version of PyroModule that only handles constrained params.

mbabadi

Looks great! You have to rebase this and reorganize it according to the new codebase structure once you merge that PR.

Also, two remarks:

Could you include the callback for monitoring the distribution of W_gc? (here or in a separate PR)
(definitely for a separate PR) Could you comment on what would it take to run LogisticRegression on on-the-fly PCA-transformed data? Can we make a generic Transform that loads a model from a checkpoint and runs predict on a batch as the forward call, followed by another generic Transform that rectifies the predict output to look like a proper batch? One needs to cook up a dummy feature_g, like EMBEDDING_DIM_0, EMBEDDING_DIM_1, ... and call the embeddings x_ng. Everything else in the batch (e.g. y_n) should also pass through...

ordabayevy · 2023-11-21T15:47:47Z

cellarium/ml/models/logistic_regression.py

+    def guide(self, x_ng: torch.Tensor, y_n: torch.Tensor) -> None:
+        pyro.sample("W", dist.Delta(self.W_gc).to_event(2))
+
+    def on_batch_end(self, trainer: pl.Trainer) -> None:


This is a callback to log the W_gc histogram. Here I thought that it might be better to have a callback bundled together with the model unlike VarianceMonitor which was separate (change that too in the future?). The reason is that these callbacks are specific to particular models and having them as a separate callback add more code and config.yaml boilerplate.

ordabayevy · 2023-11-21T18:25:03Z

Could you comment on what would it take to run LogisticRegression on on-the-fly PCA-transformed data?

#99 should take care of it once it is ready

mbabadi

Looks great! You have to rebase this and reorganize it according to the new codebase structure once you merge that PR.

Also, two remarks:

Could you include the callback for monitoring the distribution of W_gc? (here or in a separate PR)
(definitely for a separate PR) Could you comment on what would it take to run LogisticRegression on on-the-fly PCA-transformed data? Can we make a generic Transform that loads a model from a checkpoint and runs predict on a batch as the forward call, followed by another generic Transform that rectifies the predict output to look like a proper batch? One needs to cook up a dummy feature_g, like EMBEDDING_DIM_0, EMBEDDING_DIM_1, ... and call the embeddings x_ng. Everything else in the batch (e.g. y_n) should also pass through...

ordabayevy added 11 commits September 20, 2023 16:54

logistic regression classifier

07da318

wip

5f00385

Merge branch 'main' into svm

6ec90d3

add test

678fd96

AnnDataField

2c76c2e

add test & docstring

3c9223a

mypy

5079884

test

b14c5d1

np.asarray

60cffa1

feature_schema

73ba10f

Merge branch 'anndata-field' into svm

a678337

ordabayevy changed the base branch from main to anndata-field November 2, 2023 18:05

ordabayevy added 5 commits November 2, 2023 23:14

fixes

ce41974

rm classifier

d1f56ca

fixes

963d7ad

log histogram

3358eb2

fix docstring

dab3fd8

ordabayevy requested a review from mbabadi November 3, 2023 18:21

ordabayevy added enhancement New feature or request awaiting review labels Nov 3, 2023

ordabayevy linked an issue Nov 3, 2023 that may be closed by this pull request

Add LogisticRegression model #85

Closed

W_gc

494194c

mbabadi reviewed Nov 21, 2023

View reviewed changes

Base automatically changed from anndata-field to main November 21, 2023 02:12

mbabadi requested changes Nov 21, 2023

View reviewed changes

ordabayevy added 4 commits November 21, 2023 07:47

Merge branch 'main' into svm

c42c926

fixes

f651317

more fixes

b44fc76

log_metrics

c7242d7

ordabayevy commented Nov 21, 2023

View reviewed changes

ordabayevy requested a review from mbabadi November 21, 2023 15:48

mbabadi approved these changes Nov 21, 2023

View reviewed changes

ordabayevy merged commit 069c4a7 into main Nov 21, 2023

ordabayevy deleted the svm branch November 21, 2023 21:44

ordabayevy mentioned this pull request Jan 10, 2024

Top level pyro object should sublcass nn.Module and not PyroModule #39

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`LogisticRegression` #100

`LogisticRegression` #100

ordabayevy commented Nov 1, 2023

mbabadi Nov 21, 2023

ordabayevy Nov 21, 2023

mbabadi left a comment

ordabayevy Nov 21, 2023

ordabayevy commented Nov 21, 2023

mbabadi left a comment

LogisticRegression #100

LogisticRegression #100

Conversation

ordabayevy commented Nov 1, 2023

mbabadi Nov 21, 2023

Choose a reason for hiding this comment

ordabayevy Nov 21, 2023

Choose a reason for hiding this comment

mbabadi left a comment

Choose a reason for hiding this comment

ordabayevy Nov 21, 2023

Choose a reason for hiding this comment

ordabayevy commented Nov 21, 2023

mbabadi left a comment

Choose a reason for hiding this comment

`LogisticRegression` #100

`LogisticRegression` #100