Problem Description
When working with large datasets, fitting the transformer to the data takes a long time (for example, a sample of 500,000 rows and 1,500 columns takes around 10 hours on an NVIDIA Quadro RTX 6000). Currently we use our own addition to the .fit() method of the CTGANSynthesizer class that saves a fitted transformer object and loads it again on later runs, so the transformer does not have to be refitted each time.
Maybe these adjustments could be useful for other users working with large-scale data?
I could prepare a PR if this would be useful.
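To make the idea concrete, here is a minimal sketch of the save/load pattern described above. It is not the actual change we made to CTGANSynthesizer: `load_or_fit`, `fit_column_means`, and the cache path are hypothetical names used only for illustration, and the toy fit function stands in for the expensive transformer fit. It assumes the fitted object can be pickled.

```python
import pickle
from pathlib import Path


def fit_column_means(rows):
    """Toy stand-in for an expensive fit step (hypothetical, for illustration)."""
    n = len(rows)
    return {"column_means": [sum(col) / n for col in zip(*rows)]}


def load_or_fit(cache_path, fit_fn, data):
    """Return a fitted object, reusing a pickled copy if one already exists.

    cache_path: where the fitted object is stored between runs.
    fit_fn:     callable that takes the data and returns the fitted object.
    """
    cache_path = Path(cache_path)
    if cache_path.exists():
        # A previous run already paid the fitting cost: just load the result.
        with cache_path.open("rb") as fh:
            return pickle.load(fh)

    # No cached copy yet: fit once and save the result for later runs.
    fitted = fit_fn(data)
    with cache_path.open("wb") as fh:
        pickle.dump(fitted, fh)
    return fitted
```

A call such as `load_or_fit("transformer.pkl", fit_column_means, rows)` fits on the first run and only reads the file on later runs. Two caveats: the cache is keyed on the path alone, so it must be deleted when the data or the settings change, and pickle files should only be loaded from trusted sources.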