You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Please indicate the following details about the environment in which you found the bug:
SDV version: 1.0.0
Python version: Any
Operating System: Any
Error Description
If certain sdtypes like state are provided for a column, then SDV passing the incorrect provider name to RDDT, causing it to crash with this error:
TransformerProcessingError: The 'en_US' module does not contain a function named 'state'.
Refer to the Faker docs to find the correct function: https://faker.readthedocs.io/en/master/providers.html
Steps to reproduce
import pandas as pd
from sdv.metadata import SingleTableMetadata
from sdv.single_table import GaussianCopulaSynthesizer
data = pd.DataFrame({
'id': [1, 2, 3],
'state': ['California', 'New York', 'Texas'],
'salary': [10000, 50000, 120000]
})
metadata = SingleTableMetadata()
metadata.detect_from_dataframe(data)
metadata.update_column(column_name='state', sdtype='state')
synth = GaussianCopulaSynthesizer(metadata)
synth.fit(data)
The text was updated successfully, but these errors were encountered:
Environment Details
Please indicate the following details about the environment in which you found the bug:
Error Description
If certain sdtypes like
state
are provided for a column, then SDV passing the incorrect provider name to RDDT, causing it to crash with this error:Steps to reproduce
The text was updated successfully, but these errors were encountered: