Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question about scaling and feature selection #6

Open
jessicaliu09 opened this issue May 22, 2024 · 0 comments
Open

Question about scaling and feature selection #6

jessicaliu09 opened this issue May 22, 2024 · 0 comments

Comments

@jessicaliu09
Copy link

Hi,

Because our data is zero-inflated, we want to use MaxAbsScaler. So when we turn off the scaling, it has error during the feature selection step. See errors below.

So we want to turn off scaling and keep feature selection, how should we set up the parameters? Thanks.

INFO: Applying collective feature selection...
Traceback (most recent call last):
File "/N/slate/shiwliu/STREAMLINE/0.1.STREAMLINE-Notebook.py", line 492, in
f_sel.run(run_parallel=False)
File "/N/slate/shiwliu/STREAMLINE/streamline/runners/feature_runner.py", line 324, in run
job_obj.run()
File "/N/slate/shiwliu/STREAMLINE/streamline/featurefns/selection.py", line 79, in run
self.select_features(selected_feature_lists,
File "/N/slate/shiwliu/STREAMLINE/streamline/featurefns/selection.py", line 254, in select_features
if len(union_list) > max_features_to_keep: # Apply further filtering if more than max features remains
TypeError: '>' not supported between instances of 'int' and 'NoneType'

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant