-
-
Notifications
You must be signed in to change notification settings - Fork 212
Open
Description
While it's understandable and useful in many situations to want the latest dataset, it can cause issues in some situations:
- ephemeral environments that will not be able to cache the network calls to disk. I'm thinking things like k8s tasks or other distributed systems. They'll be refetching the list at every invocation.
- firewalled or no-connection environments. I believe the library works in this case but only after the delay of making a failed http connection
Not sure what a solution would look like but here are some ideas:
- automate the publishing of the python package on a schedule with an updated tld_set
- make the default non-autoupdating but allow the self-updating version to be easily used via function argument. Something like
use_latest
oruse_autoupdating
- add a TTL to the cached version. For example we could set it at 7 days and it would automatically refetch the list if the cached version was older than that.
hauntsaninja, lubo and imamatory
Metadata
Metadata
Assignees
Labels
No labels