Skip to content

Faster recognition of experiments that have already run (or skipped) #28

@jjc2718

Description

@jjc2718

Right now, it's straightforward to recognize experiments that have already been run: we just check for the output file, and if it exists we skip it.

However, recognizing experiments that were skipped before (e.g. because of too few positive labels, etc) is slower, as we have to preprocess the data again to arrive at that conclusion. It would be faster to create some sort of file for each gene showing which experiments were skipped, then on rerun we can load the file (if it exists) and know which experiments to skip again, much faster.

I don't think this will improve runtime that much, but it should help a bit.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions