Skip to content

Conversation

sannegovaert
Copy link
Member

This script prepares EML data extracted from IMIS for upload to a GBIF IPT.

@peterdesmet
Copy link
Member

I also think it's best to remove the external link to IMIS (like https://www.vliz.be/en/imis?dasid=5867) as part of this script, since we have the DOI pointing to the same thing.

@peterdesmet
Copy link
Member

And I would remove all emails of all creators/contacts, since there are still some odd emails there. ORCID is sufficient.

Copy link
Member

@peterdesmet peterdesmet left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good. The last paragraph in the description (that is added with our write_eml() function) has escaped HTML characters. I think we should wrap that in <![CDATA[.

@sannegovaert
Copy link
Member Author

sannegovaert commented Mar 31, 2025

I updated the script so to wrap more paragraphs with <![CDATA[.
It seems that more metadata is added from IMIS. I do not understand why pubdate is going back in time however.

(I forgot about this issue)

Copy link
Member

@peterdesmet peterdesmet left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Last paragraph &lt;![CDATA[ has escaped characters, rather than <![CDATA[. I don't think this will work in IPT import. 😬

- ![CDATA[ ]]> wrappers are not needed anymore
- lists of paragraphs ara replaced by <p></p> tags
@sannegovaert
Copy link
Member Author

sannegovaert commented Apr 2, 2025

As we discussed and tested yesterday on the IPT:

  • <![CDATA[ ]]> wrappers are not needed anymore
  • lists of paragraphs can be replaced by <p></p> tags

Sometimes the <p></p> tags are missing in the source eml. This solves that problem.
@sannegovaert sannegovaert requested a review from peterdesmet April 2, 2025 09:49
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants