Skip to content

Feature request: Retry opening datasets #144

@charalamm

Description

@charalamm

Hello,

We are planning to use odc stac for some analysis. We have the data on azure and we accessing them with the az:// prefix. In every analysis, when trying to read the files there are always some errors with the internet which result on the data missing from the final data structure.

So far I have catched the following errors:

                dns_problem_condition = "Could not resolve host" in str(ex)
                dns_timeout_condition = "Resolving timed out after" in str(ex)
                read_problem_condition = "not recognized as a supported" in str(ex)
                write_problem = "Failure writing output to destination" in str(ex)
                read_write_problem = "Read or write failed" in str(ex)
                broken_pipe = "Broken pipe" in str(ex)

Do you think it is useful to add a mechanism to retry reading on some errors? I think I can work on a PR if you are interested in this feature. Feel free to close it if you are not interested

A possible approach?

Since some of these errors can be valid ones it should be on the user to decide I they want to retry or not and on what errors to retry. One option would be to allow the user define a list of regexes or strings and odc-stac can check if it should retry based on that. One problem is that GDAL is caching these errors so it might be needed to use CPL_VSIL_CURL_NON_CACHED

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions