TODO in a future PR: refactor dataset object to use the dance abstract dataset class, see [`ClusteringDataset`](https://github.com/OmicsML/dance/blob/f3e0e83ef3dbf25339e1a6ffa10ffb18ac81d72d/dance/datasets/singlemodality.py#L235) for example
Originally posted by @RemyLau in #228 (comment)