Added outlier detection with iterativetrimming #227

swagataroy123 · 2023-06-28T09:01:38Z

Hi @jduerholt ,
I have added the robust GP outlier as in this paper (https://arxiv.org/abs/2011.11057

jduerholt

Thx, I let some comments.

jduerholt · 2023-06-28T12:06:32Z

bofire/data_models/outlier_detection/api.py

+from typing import Union
+try:
+    from bofire.data_models.outlier_detection.outlier_detection import OutlierDetection,IterativeTrimming
+    AnyOutlierDetection = Union[OutlierDetection,IterativeTrimming]


Why do you have this try except structure?

I just copied it from api of data_models/surrogates

It is not needed here, please remove it.

jduerholt · 2023-06-28T13:08:34Z

bofire/data_models/outlier_detection/outlier_detection.py

+    type: str
+
+
+class IterativeTrimming(OutlierDetection):


Please add a docstring in which you mention the method and the paper and explain the parameters.

Please use google type docstrings

bofire/outlier_detection/mapper.py

bofire/outlier_detection/api.py

jduerholt · 2023-06-28T13:11:59Z

bofire/outlier_detection/outlier_detection.py

+        self.base_gp = data_model.base_gp
+        super().__init__()
+
+    def detect(self, experiments: pd.DataFrame):


Suggested change

def detect(self, experiments: pd.DataFrame):

def detect(self, experiments: pd.DataFrame) -> Tuple[pd.DataFrame, pd.DataFrame]:

jduerholt · 2023-06-28T13:13:21Z

tests/bofire/outlier_validation/test_outlier_detection.py

call it in the tests exactly as in the the main structure, outlier_validation --> outlier_detection

tests/bofire/outlier_validation/test_outlier_detection.py

jduerholt · 2023-06-28T13:14:59Z

tests/bofire/outlier_validation/test_outlier_detection.py

+    Trimmed, outliers = ITGP.detect(experiments=experiments)
+    assert isinstance(Trimmed, pd.DataFrame)
+    assert isinstance(outliers, pd.DataFrame)
+    assert len(experiments) == len(Trimmed) + len(outliers)


can you also check it it was able to finde most of the outliers?

jduerholt · 2023-06-28T13:16:07Z

tests/bofire/outlier_validation/test_outlier_detection.py

+experiments["valid_y"] = 1
+
+
+@pytest.mark.parametrize(


we do not need the parameterization here. just write everything starting in line 20 into the test_IterativeTrimming method, and remove the argument experiments from it.

swagataroy123 · 2023-06-28T13:43:46Z

Thx, I let some comments.

I have answered them

jduerholt · 2023-06-28T13:56:14Z

Please also add a specs module as shown here: https://github.com/experimental-design/bofire/blob/main/tests/bofire/data_models/specs/surrogates.py and add the serialization and deserialization tests.

jduerholt · 2023-06-28T13:57:10Z

Furthermore, a notebook under tutorials would be nice in which the ITGP outlier detection stuff is demonstrated based on the ITGP paper.

Call it outlier_detection.ipynb

jduerholt

Looks good to me.

swagataroy123 added 2 commits June 28, 2023 10:55

Added outlier detection

4e3b87f

checked ruff and black

9f92ac1

jduerholt requested changes Jun 28, 2023

View reviewed changes

swagataroy123 added 2 commits June 28, 2023 15:36

corrected some comments

c0e95fe

Added docstring and paper url

711b20e

Got rid of try except structure

9fd4ab0

swagataroy123 closed this Jun 28, 2023

swagataroy123 reopened this Jun 28, 2023

swagataroy123 added 2 commits June 29, 2023 10:44

Added specs

fb8b8d5

Checked lint

bf531bd

jduerholt approved these changes Jun 29, 2023

View reviewed changes

jduerholt merged commit aa39854 into experimental-design:main Jun 29, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added outlier detection with iterativetrimming #227

Added outlier detection with iterativetrimming #227

swagataroy123 commented Jun 28, 2023

jduerholt left a comment

jduerholt Jun 28, 2023

swagataroy123 Jun 28, 2023

jduerholt Jun 28, 2023

jduerholt Jun 28, 2023

jduerholt Jun 28, 2023

jduerholt Jun 28, 2023

jduerholt Jun 28, 2023

jduerholt Jun 28, 2023

jduerholt Jun 28, 2023

swagataroy123 commented Jun 28, 2023

jduerholt commented Jun 28, 2023

jduerholt commented Jun 28, 2023 •

edited

Loading

jduerholt left a comment

	def detect(self, experiments: pd.DataFrame):
	def detect(self, experiments: pd.DataFrame) -> Tuple[pd.DataFrame, pd.DataFrame]:

		experiments["valid_y"] = 1


		@pytest.mark.parametrize(

Added outlier detection with iterativetrimming #227

Added outlier detection with iterativetrimming #227

Conversation

swagataroy123 commented Jun 28, 2023

jduerholt left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

swagataroy123 commented Jun 28, 2023

jduerholt commented Jun 28, 2023

jduerholt commented Jun 28, 2023 • edited Loading

jduerholt left a comment

Choose a reason for hiding this comment

jduerholt commented Jun 28, 2023 •

edited

Loading