scikit-learn__scikit-learn-25638.gpt-4-0125-preview.eval.log
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Task Metadata:
- Instance ID: scikit-learn__scikit-learn-25638
- Testbed: scikit-learn__scikit-learn__1.3
- Evaluation Model: gpt-4-0125-preview
- Python version: Python 3.9.19
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Command: git -c advice.detachedHead=false checkout 6adb209acd63825affc884abcd85381f148fb1b0
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Subprocess args: {"cwd": "/opt/scikit-learn__scikit-learn", "check": true, "shell": false, "universal_newlines": true, "stdout": -1, "stderr": -2}
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Std. Output:
HEAD is now at 6adb209ac FIX renormalization of y_pred inside log_loss (#25299)
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Return Code: 0
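Note on the `Subprocess args` records: `stdout` and `stderr` are logged as integers. In Python's `subprocess` module, -1 is `subprocess.PIPE` and -2 is `subprocess.STDOUT`, so each command's stderr appears to be merged into the captured stdout. The following is a minimal sketch, assuming the logged dict is passed as keyword arguments to `subprocess.run`; the helper name `run_logged` is hypothetical and not taken from the harness, and `cwd` is omitted so the sketch runs anywhere.

```python
import json
import subprocess
import sys


def run_logged(command, logged_args_json):
    """Hypothetical helper: replay a logged "Subprocess args" record."""
    kwargs = json.loads(logged_args_json)  # JSON true/false become True/False
    return subprocess.run(command, **kwargs)


# Same keys as in the log, minus "cwd" (assumption: that path may not exist locally).
logged = (
    '{"check": true, "shell": false, "universal_newlines": true, '
    '"stdout": -1, "stderr": -2}'
)

# A harmless stand-in command instead of the git call from the log.
result = run_logged([sys.executable, "-c", "print('ok')"], logged)
print(result.returncode, result.stdout.strip())
```

With `"check": true`, a non-zero exit raises `subprocess.CalledProcessError`; the records logged with `"check": false` rely on the `Return Code` line instead.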
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Command: git apply -v /opt/temp_scikit-learn__scikit-learn-25638_pred_try.patch
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Subprocess args: {"cwd": "/opt/scikit-learn__scikit-learn", "check": false, "shell": false, "universal_newlines": true, "stdout": -1, "stderr": -2}
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Std. Output:
Checking patch sklearn/utils/multiclass.py...
Applied patch sklearn/utils/multiclass.py cleanly.
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Return Code: 0
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Apply patch successful (pred_try)
>>>>> Applied Patch (pred_try)
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Command: git apply -v -R /opt/temp_scikit-learn__scikit-learn-25638_pred_try.patch
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Subprocess args: {"cwd": "/opt/scikit-learn__scikit-learn", "check": false, "shell": false, "universal_newlines": true, "stdout": -1, "stderr": -2}
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Std. Output:
Checking patch sklearn/utils/multiclass.py...
Applied patch sklearn/utils/multiclass.py cleanly.
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Return Code: 0
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Revert patch successful (pred_try)
>>>>> Applied Patch (pred_try)
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Command: git apply -v /opt/temp_scikit-learn__scikit-learn-25638_pred.patch
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Subprocess args: {"cwd": "/opt/scikit-learn__scikit-learn", "check": false, "shell": false, "universal_newlines": true, "stdout": -1, "stderr": -2}
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Std. Output:
Checking patch sklearn/utils/multiclass.py...
Applied patch sklearn/utils/multiclass.py cleanly.
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Return Code: 0
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Apply patch successful (pred)
>>>>> Applied Patch (pred)
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Command: git restore sklearn/metrics/tests/test_classification.py
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Subprocess args: {"cwd": "/opt/scikit-learn__scikit-learn", "check": true, "shell": false, "universal_newlines": true, "stdout": -1, "stderr": -2}
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Std. Output:
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Return Code: 0
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Command: git restore sklearn/preprocessing/tests/test_label.py
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Subprocess args: {"cwd": "/opt/scikit-learn__scikit-learn", "check": true, "shell": false, "universal_newlines": true, "stdout": -1, "stderr": -2}
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Std. Output:
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Return Code: 0
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Command: git restore sklearn/utils/tests/test_multiclass.py
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Subprocess args: {"cwd": "/opt/scikit-learn__scikit-learn", "check": true, "shell": false, "universal_newlines": true, "stdout": -1, "stderr": -2}
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Std. Output:
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Return Code: 0
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Command: git apply -v /opt/temp_scikit-learn__scikit-learn-25638_test.patch
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Subprocess args: {"cwd": "/opt/scikit-learn__scikit-learn", "check": false, "shell": false, "universal_newlines": true, "stdout": -1, "stderr": -2}
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Std. Output:
Checking patch sklearn/metrics/tests/test_classification.py...
Checking patch sklearn/preprocessing/tests/test_label.py...
Checking patch sklearn/utils/tests/test_multiclass.py...
Applied patch sklearn/metrics/tests/test_classification.py cleanly.
Applied patch sklearn/preprocessing/tests/test_label.py cleanly.
Applied patch sklearn/utils/tests/test_multiclass.py cleanly.
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Return Code: 0
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Apply patch successful (test)
>>>>> Applied Patch (test)
Test Script: pytest -rA --tb=no -p no:cacheprovider sklearn/metrics/tests/test_classification.py sklearn/preprocessing/tests/test_label.py sklearn/utils/tests/test_multiclass.py;
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Command: pytest -rA --tb=no -p no:cacheprovider sklearn/metrics/tests/test_classification.py sklearn/preprocessing/tests/test_label.py sklearn/utils/tests/test_multiclass.py
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Subprocess args: {"cwd": "/opt/scikit-learn__scikit-learn", "check": false, "shell": false, "universal_newlines": true, "stdout": -1, "stderr": -2, "timeout": 1800}
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Std. Output:
============================= test session starts ==============================
platform linux -- Python 3.9.19, pytest-8.2.0, pluggy-1.5.0
rootdir: /opt/scikit-learn__scikit-learn
configfile: setup.cfg
collected 205 items
sklearn/metrics/tests/test_classification.py ........................... [ 13%]
..............................F......................................... [ 48%]
.....................s.................................... [ 76%]
sklearn/preprocessing/tests/test_label.py .....F........................ [ 91%]
.... [ 93%]
sklearn/utils/tests/test_multiclass.py .......F..F... [100%]
=========================== short test summary info ============================
PASSED sklearn/metrics/tests/test_classification.py::test_classification_report_dictionary_output
PASSED sklearn/metrics/tests/test_classification.py::test_classification_report_output_dict_empty_input
PASSED sklearn/metrics/tests/test_classification.py::test_classification_report_zero_division_warning[warn]
PASSED sklearn/metrics/tests/test_classification.py::test_classification_report_zero_division_warning[0]
PASSED sklearn/metrics/tests/test_classification.py::test_classification_report_zero_division_warning[1]
PASSED sklearn/metrics/tests/test_classification.py::test_multilabel_accuracy_score_subset_accuracy
PASSED sklearn/metrics/tests/test_classification.py::test_precision_recall_f1_score_binary
PASSED sklearn/metrics/tests/test_classification.py::test_precision_recall_f_binary_single_class
PASSED sklearn/metrics/tests/test_classification.py::test_precision_recall_f_extra_labels
PASSED sklearn/metrics/tests/test_classification.py::test_precision_recall_f_ignored_labels
PASSED sklearn/metrics/tests/test_classification.py::test_average_precision_score_score_non_binary_class
PASSED sklearn/metrics/tests/test_classification.py::test_average_precision_score_duplicate_values
PASSED sklearn/metrics/tests/test_classification.py::test_average_precision_score_tied_values
PASSED sklearn/metrics/tests/test_classification.py::test_precision_recall_fscore_support_errors
PASSED sklearn/metrics/tests/test_classification.py::test_precision_recall_f_unused_pos_label
PASSED sklearn/metrics/tests/test_classification.py::test_confusion_matrix_binary
PASSED sklearn/metrics/tests/test_classification.py::test_multilabel_confusion_matrix_binary
PASSED sklearn/metrics/tests/test_classification.py::test_multilabel_confusion_matrix_multiclass
PASSED sklearn/metrics/tests/test_classification.py::test_multilabel_confusion_matrix_multilabel
PASSED sklearn/metrics/tests/test_classification.py::test_multilabel_confusion_matrix_errors
PASSED sklearn/metrics/tests/test_classification.py::test_confusion_matrix_normalize[true-f-0.333333333]
PASSED sklearn/metrics/tests/test_classification.py::test_confusion_matrix_normalize[pred-f-0.333333333]
PASSED sklearn/metrics/tests/test_classification.py::test_confusion_matrix_normalize[all-f-0.1111111111]
PASSED sklearn/metrics/tests/test_classification.py::test_confusion_matrix_normalize[None-i-2]
PASSED sklearn/metrics/tests/test_classification.py::test_confusion_matrix_normalize_single_class
PASSED sklearn/metrics/tests/test_classification.py::test_likelihood_ratios_warnings[params0-samples of only one class were seen during testing]
PASSED sklearn/metrics/tests/test_classification.py::test_likelihood_ratios_warnings[params1-positive_likelihood_ratio ill-defined and being set to nan]
PASSED sklearn/metrics/tests/test_classification.py::test_likelihood_ratios_warnings[params2-no samples predicted for the positive class]
PASSED sklearn/metrics/tests/test_classification.py::test_likelihood_ratios_warnings[params3-negative_likelihood_ratio ill-defined and being set to nan]
PASSED sklearn/metrics/tests/test_classification.py::test_likelihood_ratios_warnings[params4-no samples of the positive class were present in the testing set]
PASSED sklearn/metrics/tests/test_classification.py::test_likelihood_ratios_errors[params0-class_likelihood_ratios only supports binary classification problems, got targets of type: multiclass]
PASSED sklearn/metrics/tests/test_classification.py::test_likelihood_ratios
PASSED sklearn/metrics/tests/test_classification.py::test_cohen_kappa
PASSED sklearn/metrics/tests/test_classification.py::test_matthews_corrcoef_nan
PASSED sklearn/metrics/tests/test_classification.py::test_matthews_corrcoef_against_numpy_corrcoef
PASSED sklearn/metrics/tests/test_classification.py::test_matthews_corrcoef_against_jurman
PASSED sklearn/metrics/tests/test_classification.py::test_matthews_corrcoef
PASSED sklearn/metrics/tests/test_classification.py::test_matthews_corrcoef_multiclass
PASSED sklearn/metrics/tests/test_classification.py::test_matthews_corrcoef_overflow[100]
PASSED sklearn/metrics/tests/test_classification.py::test_matthews_corrcoef_overflow[10000]
PASSED sklearn/metrics/tests/test_classification.py::test_precision_recall_f1_score_multiclass
PASSED sklearn/metrics/tests/test_classification.py::test_precision_refcall_f1_score_multilabel_unordered_labels[samples]
PASSED sklearn/metrics/tests/test_classification.py::test_precision_refcall_f1_score_multilabel_unordered_labels[micro]
PASSED sklearn/metrics/tests/test_classification.py::test_precision_refcall_f1_score_multilabel_unordered_labels[macro]
PASSED sklearn/metrics/tests/test_classification.py::test_precision_refcall_f1_score_multilabel_unordered_labels[weighted]
PASSED sklearn/metrics/tests/test_classification.py::test_precision_refcall_f1_score_multilabel_unordered_labels[None]
PASSED sklearn/metrics/tests/test_classification.py::test_precision_recall_f1_score_binary_averaged
PASSED sklearn/metrics/tests/test_classification.py::test_zero_precision_recall
PASSED sklearn/metrics/tests/test_classification.py::test_confusion_matrix_multiclass_subset_labels
PASSED sklearn/metrics/tests/test_classification.py::test_confusion_matrix_error[empty list]
PASSED sklearn/metrics/tests/test_classification.py::test_confusion_matrix_error[unknown labels]
PASSED sklearn/metrics/tests/test_classification.py::test_confusion_matrix_on_zero_length_input[None]
PASSED sklearn/metrics/tests/test_classification.py::test_confusion_matrix_on_zero_length_input[binary]
PASSED sklearn/metrics/tests/test_classification.py::test_confusion_matrix_on_zero_length_input[multiclass]
PASSED sklearn/metrics/tests/test_classification.py::test_confusion_matrix_dtype
PASSED sklearn/metrics/tests/test_classification.py::test_confusion_matrix_pandas_nullable[Int64]
PASSED sklearn/metrics/tests/test_classification.py::test_confusion_matrix_pandas_nullable[Float64]
PASSED sklearn/metrics/tests/test_classification.py::test_classification_report_multiclass
PASSED sklearn/metrics/tests/test_classification.py::test_classification_report_multiclass_balanced
PASSED sklearn/metrics/tests/test_classification.py::test_classification_report_multiclass_with_label_detection
PASSED sklearn/metrics/tests/test_classification.py::test_classification_report_multiclass_with_digits
PASSED sklearn/metrics/tests/test_classification.py::test_classification_report_multiclass_with_string_label
PASSED sklearn/metrics/tests/test_classification.py::test_classification_report_multiclass_with_unicode_label
PASSED sklearn/metrics/tests/test_classification.py::test_classification_report_multiclass_with_long_string_label
PASSED sklearn/metrics/tests/test_classification.py::test_classification_report_labels_target_names_unequal_length
PASSED sklearn/metrics/tests/test_classification.py::test_classification_report_no_labels_target_names_unequal_length
PASSED sklearn/metrics/tests/test_classification.py::test_multilabel_classification_report
PASSED sklearn/metrics/tests/test_classification.py::test_multilabel_zero_one_loss_subset
PASSED sklearn/metrics/tests/test_classification.py::test_multilabel_hamming_loss
PASSED sklearn/metrics/tests/test_classification.py::test_jaccard_score_validation
PASSED sklearn/metrics/tests/test_classification.py::test_multilabel_jaccard_score
PASSED sklearn/metrics/tests/test_classification.py::test_multiclass_jaccard_score
PASSED sklearn/metrics/tests/test_classification.py::test_average_binary_jaccard_score
PASSED sklearn/metrics/tests/test_classification.py::test_jaccard_score_zero_division_warning
PASSED sklearn/metrics/tests/test_classification.py::test_jaccard_score_zero_division_set_value[0-0]
PASSED sklearn/metrics/tests/test_classification.py::test_jaccard_score_zero_division_set_value[1-0.5]
PASSED sklearn/metrics/tests/test_classification.py::test_precision_recall_f1_score_multilabel_1
PASSED sklearn/metrics/tests/test_classification.py::test_precision_recall_f1_score_multilabel_2
PASSED sklearn/metrics/tests/test_classification.py::test_precision_recall_f1_score_with_an_empty_prediction[warn]
PASSED sklearn/metrics/tests/test_classification.py::test_precision_recall_f1_score_with_an_empty_prediction[0]
PASSED sklearn/metrics/tests/test_classification.py::test_precision_recall_f1_score_with_an_empty_prediction[1]
PASSED sklearn/metrics/tests/test_classification.py::test_precision_recall_f1_no_labels[0-macro-1]
PASSED sklearn/metrics/tests/test_classification.py::test_precision_recall_f1_no_labels[0-micro-1]
PASSED sklearn/metrics/tests/test_classification.py::test_precision_recall_f1_no_labels[0-weighted-1]
PASSED sklearn/metrics/tests/test_classification.py::test_precision_recall_f1_no_labels[0-samples-1]
PASSED sklearn/metrics/tests/test_classification.py::test_precision_recall_f1_no_labels[1-macro-1]
PASSED sklearn/metrics/tests/test_classification.py::test_precision_recall_f1_no_labels[1-micro-1]
PASSED sklearn/metrics/tests/test_classification.py::test_precision_recall_f1_no_labels[1-weighted-1]
PASSED sklearn/metrics/tests/test_classification.py::test_precision_recall_f1_no_labels[1-samples-1]
PASSED sklearn/metrics/tests/test_classification.py::test_precision_recall_f1_no_labels_check_warnings[macro]
PASSED sklearn/metrics/tests/test_classification.py::test_precision_recall_f1_no_labels_check_warnings[micro]
PASSED sklearn/metrics/tests/test_classification.py::test_precision_recall_f1_no_labels_check_warnings[weighted]
PASSED sklearn/metrics/tests/test_classification.py::test_precision_recall_f1_no_labels_check_warnings[samples]
PASSED sklearn/metrics/tests/test_classification.py::test_precision_recall_f1_no_labels_average_none[0]
PASSED sklearn/metrics/tests/test_classification.py::test_precision_recall_f1_no_labels_average_none[1]
PASSED sklearn/metrics/tests/test_classification.py::test_precision_recall_f1_no_labels_average_none_warn
PASSED sklearn/metrics/tests/test_classification.py::test_prf_warnings
PASSED sklearn/metrics/tests/test_classification.py::test_prf_no_warnings_if_zero_division_set[0]
PASSED sklearn/metrics/tests/test_classification.py::test_prf_no_warnings_if_zero_division_set[1]
PASSED sklearn/metrics/tests/test_classification.py::test_recall_warnings[warn]
PASSED sklearn/metrics/tests/test_classification.py::test_recall_warnings[0]
PASSED sklearn/metrics/tests/test_classification.py::test_recall_warnings[1]
PASSED sklearn/metrics/tests/test_classification.py::test_precision_warnings[warn]
PASSED sklearn/metrics/tests/test_classification.py::test_precision_warnings[0]
PASSED sklearn/metrics/tests/test_classification.py::test_precision_warnings[1]
PASSED sklearn/metrics/tests/test_classification.py::test_fscore_warnings[warn]
PASSED sklearn/metrics/tests/test_classification.py::test_fscore_warnings[0]
PASSED sklearn/metrics/tests/test_classification.py::test_fscore_warnings[1]
PASSED sklearn/metrics/tests/test_classification.py::test_prf_average_binary_data_non_binary
PASSED sklearn/metrics/tests/test_classification.py::test__check_targets
PASSED sklearn/metrics/tests/test_classification.py::test__check_targets_multiclass_with_both_y_true_and_y_pred_binary
PASSED sklearn/metrics/tests/test_classification.py::test_hinge_loss_binary
PASSED sklearn/metrics/tests/test_classification.py::test_hinge_loss_multiclass
PASSED sklearn/metrics/tests/test_classification.py::test_hinge_loss_multiclass_missing_labels_with_labels_none
PASSED sklearn/metrics/tests/test_classification.py::test_hinge_loss_multiclass_no_consistent_pred_decision_shape
PASSED sklearn/metrics/tests/test_classification.py::test_hinge_loss_multiclass_with_missing_labels
PASSED sklearn/metrics/tests/test_classification.py::test_hinge_loss_multiclass_missing_labels_only_two_unq_in_y_true
PASSED sklearn/metrics/tests/test_classification.py::test_hinge_loss_multiclass_invariance_lists
PASSED sklearn/metrics/tests/test_classification.py::test_log_loss
PASSED sklearn/metrics/tests/test_classification.py::test_log_loss_eps_auto[float64]
PASSED sklearn/metrics/tests/test_classification.py::test_log_loss_eps_auto_float16
PASSED sklearn/metrics/tests/test_classification.py::test_log_loss_pandas_input
PASSED sklearn/metrics/tests/test_classification.py::test_brier_score_loss
PASSED sklearn/metrics/tests/test_classification.py::test_balanced_accuracy_score_unseen
PASSED sklearn/metrics/tests/test_classification.py::test_balanced_accuracy_score[y_true0-y_pred0]
PASSED sklearn/metrics/tests/test_classification.py::test_balanced_accuracy_score[y_true1-y_pred1]
PASSED sklearn/metrics/tests/test_classification.py::test_balanced_accuracy_score[y_true2-y_pred2]
PASSED sklearn/metrics/tests/test_classification.py::test_classification_metric_pos_label_types[classes0-jaccard_score]
PASSED sklearn/metrics/tests/test_classification.py::test_classification_metric_pos_label_types[classes0-f1_score]
PASSED sklearn/metrics/tests/test_classification.py::test_classification_metric_pos_label_types[classes0-metric2]
PASSED sklearn/metrics/tests/test_classification.py::test_classification_metric_pos_label_types[classes0-precision_recall_fscore_support]
PASSED sklearn/metrics/tests/test_classification.py::test_classification_metric_pos_label_types[classes0-precision_score]
PASSED sklearn/metrics/tests/test_classification.py::test_classification_metric_pos_label_types[classes0-recall_score]
PASSED sklearn/metrics/tests/test_classification.py::test_classification_metric_pos_label_types[classes0-brier_score_loss]
PASSED sklearn/metrics/tests/test_classification.py::test_classification_metric_pos_label_types[classes1-jaccard_score]
PASSED sklearn/metrics/tests/test_classification.py::test_classification_metric_pos_label_types[classes1-f1_score]
PASSED sklearn/metrics/tests/test_classification.py::test_classification_metric_pos_label_types[classes1-metric2]
PASSED sklearn/metrics/tests/test_classification.py::test_classification_metric_pos_label_types[classes1-precision_recall_fscore_support]
PASSED sklearn/metrics/tests/test_classification.py::test_classification_metric_pos_label_types[classes1-precision_score]
PASSED sklearn/metrics/tests/test_classification.py::test_classification_metric_pos_label_types[classes1-recall_score]
PASSED sklearn/metrics/tests/test_classification.py::test_classification_metric_pos_label_types[classes1-brier_score_loss]
PASSED sklearn/metrics/tests/test_classification.py::test_classification_metric_pos_label_types[classes2-jaccard_score]
PASSED sklearn/metrics/tests/test_classification.py::test_classification_metric_pos_label_types[classes2-f1_score]
PASSED sklearn/metrics/tests/test_classification.py::test_classification_metric_pos_label_types[classes2-metric2]
PASSED sklearn/metrics/tests/test_classification.py::test_classification_metric_pos_label_types[classes2-precision_recall_fscore_support]
PASSED sklearn/metrics/tests/test_classification.py::test_classification_metric_pos_label_types[classes2-precision_score]
PASSED sklearn/metrics/tests/test_classification.py::test_classification_metric_pos_label_types[classes2-recall_score]
PASSED sklearn/metrics/tests/test_classification.py::test_classification_metric_pos_label_types[classes2-brier_score_loss]
PASSED sklearn/metrics/tests/test_classification.py::test_classification_metric_pos_label_types[classes3-jaccard_score]
PASSED sklearn/metrics/tests/test_classification.py::test_classification_metric_pos_label_types[classes3-f1_score]
PASSED sklearn/metrics/tests/test_classification.py::test_classification_metric_pos_label_types[classes3-metric2]
PASSED sklearn/metrics/tests/test_classification.py::test_classification_metric_pos_label_types[classes3-precision_recall_fscore_support]
PASSED sklearn/metrics/tests/test_classification.py::test_classification_metric_pos_label_types[classes3-precision_score]
PASSED sklearn/metrics/tests/test_classification.py::test_classification_metric_pos_label_types[classes3-recall_score]
PASSED sklearn/metrics/tests/test_classification.py::test_classification_metric_pos_label_types[classes3-brier_score_loss]
PASSED sklearn/preprocessing/tests/test_label.py::test_label_binarizer
PASSED sklearn/preprocessing/tests/test_label.py::test_label_binarizer_unseen_labels
PASSED sklearn/preprocessing/tests/test_label.py::test_label_binarizer_set_label_encoding
PASSED sklearn/preprocessing/tests/test_label.py::test_label_binarizer_pandas_nullable[Int64]
PASSED sklearn/preprocessing/tests/test_label.py::test_label_binarizer_pandas_nullable[Float64]
PASSED sklearn/preprocessing/tests/test_label.py::test_label_binarizer_errors
PASSED sklearn/preprocessing/tests/test_label.py::test_label_encoder[int64]
PASSED sklearn/preprocessing/tests/test_label.py::test_label_encoder[object]
PASSED sklearn/preprocessing/tests/test_label.py::test_label_encoder[str]
PASSED sklearn/preprocessing/tests/test_label.py::test_label_encoder_negative_ints
PASSED sklearn/preprocessing/tests/test_label.py::test_label_encoder_str_bad_shape[str]
PASSED sklearn/preprocessing/tests/test_label.py::test_label_encoder_str_bad_shape[object]
PASSED sklearn/preprocessing/tests/test_label.py::test_label_encoder_errors
PASSED sklearn/preprocessing/tests/test_label.py::test_label_encoder_empty_array[int64]
PASSED sklearn/preprocessing/tests/test_label.py::test_label_encoder_empty_array[object]
PASSED sklearn/preprocessing/tests/test_label.py::test_label_encoder_empty_array[str]
PASSED sklearn/preprocessing/tests/test_label.py::test_sparse_output_multilabel_binarizer
PASSED sklearn/preprocessing/tests/test_label.py::test_multilabel_binarizer
PASSED sklearn/preprocessing/tests/test_label.py::test_multilabel_binarizer_empty_sample
PASSED sklearn/preprocessing/tests/test_label.py::test_multilabel_binarizer_unknown_class
PASSED sklearn/preprocessing/tests/test_label.py::test_multilabel_binarizer_given_classes
PASSED sklearn/preprocessing/tests/test_label.py::test_multilabel_binarizer_multiple_calls
PASSED sklearn/preprocessing/tests/test_label.py::test_multilabel_binarizer_same_length_sequence
PASSED sklearn/preprocessing/tests/test_label.py::test_multilabel_binarizer_non_integer_labels
PASSED sklearn/preprocessing/tests/test_label.py::test_multilabel_binarizer_non_unique
PASSED sklearn/preprocessing/tests/test_label.py::test_multilabel_binarizer_inverse_validation
PASSED sklearn/preprocessing/tests/test_label.py::test_label_binarize_with_class_order
PASSED sklearn/preprocessing/tests/test_label.py::test_label_binarize_binary
PASSED sklearn/preprocessing/tests/test_label.py::test_label_binarize_multiclass
PASSED sklearn/preprocessing/tests/test_label.py::test_label_binarize_multilabel
PASSED sklearn/preprocessing/tests/test_label.py::test_invalid_input_label_binarize
PASSED sklearn/preprocessing/tests/test_label.py::test_inverse_binarize_multiclass
PASSED sklearn/preprocessing/tests/test_label.py::test_nan_label_encoder
PASSED sklearn/utils/tests/test_multiclass.py::test_unique_labels
PASSED sklearn/utils/tests/test_multiclass.py::test_unique_labels_non_specific
PASSED sklearn/utils/tests/test_multiclass.py::test_unique_labels_mixed_types
PASSED sklearn/utils/tests/test_multiclass.py::test_is_multilabel
PASSED sklearn/utils/tests/test_multiclass.py::test_check_classification_targets
PASSED sklearn/utils/tests/test_multiclass.py::test_type_of_target
PASSED sklearn/utils/tests/test_multiclass.py::test_type_of_target_pandas_sparse
PASSED sklearn/utils/tests/test_multiclass.py::test_unique_labels_pandas_nullable[Int64]
PASSED sklearn/utils/tests/test_multiclass.py::test_unique_labels_pandas_nullable[Float64]
PASSED sklearn/utils/tests/test_multiclass.py::test_class_distribution
PASSED sklearn/utils/tests/test_multiclass.py::test_safe_split_with_precomputed_kernel
PASSED sklearn/utils/tests/test_multiclass.py::test_ovr_decision_function
SKIPPED [1] sklearn/metrics/tests/test_classification.py:2565: Set SKLEARN_RUN_FLOAT32_TESTS=1 to run float32 dtype tests
FAILED sklearn/metrics/tests/test_classification.py::test_confusion_matrix_pandas_nullable[boolean] - TypeError: data type 'boolean' not understood
FAILED sklearn/preprocessing/tests/test_label.py::test_label_binarizer_pandas_nullable[boolean] - TypeError: data type 'boolean' not understood
FAILED sklearn/utils/tests/test_multiclass.py::test_type_of_target_pandas_nullable - AssertionError: assert 'unknown' == 'continuous-multioutput'
FAILED sklearn/utils/tests/test_multiclass.py::test_unique_labels_pandas_nullable[boolean] - TypeError: data type 'boolean' not understood
============ 4 failed, 200 passed, 1 skipped, 30 warnings in 6.25s =============
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Return Code: 1
>>>>> Some Tests Failed
[scikit-learn__scikit-learn__1.3] [scikit-learn__scikit-learn-25638] Test script run successful
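
Note on reading the short test summary: the per-test status lines can be tallied and checked against the final counts line (4 failed, 200 passed, 1 skipped). The following is a minimal sketch, assuming each status line starts with `PASSED`, `FAILED`, or `SKIPPED`, possibly preceded by leftover color codes such as `[32m` whose escape byte may be missing. The names `tally`, `ANSI_RE`, and `STATUS_RE` are hypothetical and not part of the harness.

```python
import re
from collections import Counter

# Color codes look like ESC[32m; the ESC byte is optional here because
# copied logs sometimes lose it. At least one digit is required so that
# test ids such as "[micro]" are left alone.
ANSI_RE = re.compile(r"\x1b?\[[0-9;]+m")
STATUS_RE = re.compile(r"^(PASSED|FAILED|SKIPPED)\b")


def tally(lines):
    """Count status lines per outcome after stripping color codes."""
    counts = Counter()
    for line in lines:
        clean = ANSI_RE.sub("", line).strip()
        match = STATUS_RE.match(clean)
        if match:
            counts[match.group(1)] += 1
    return counts


# Three lines taken from this log, with color residue left in place.
sample = [
    "[32mPASSED[0m sklearn/utils/tests/test_multiclass.py::[1mtest_unique_labels[0m",
    "[31mFAILED[0m sklearn/utils/tests/test_multiclass.py::[1mtest_type_of_target_pandas_nullable[0m"
    " - AssertionError: assert 'unknown' == 'continuous-multioutput'",
    "[33mSKIPPED[0m [1] sklearn/metrics/tests/test_classification.py:2565: "
    "Set SKLEARN_RUN_FLOAT32_TESTS=1 to run float32 dtype tests",
]
counts = tally(sample)
print(dict(counts))
```

Run over the full summary section, the same function should reproduce the totals on the final counts line if the log is complete.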