Skip to content

Fix timeout issue during the sim1_postprocess_s1_e1_filter_input phase #434

@marekhorst

Description

@marekhorst

Originally reported in: openaire/iis#1326

Documents similarity algorithm fails after running it on a non-deduplicated OpenAIRE Graph counting 300M of publications (deduped graph included 200M).

After in depth inspection covered by the openaire/iis#1326 (comment) it turned out we need to modify documents similarity sources by increasing allowed timeout value which should be defined in sim1-postprocess-s1-e1-filter-sims.pig PIG script.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions