Skip to content

Conversation

lfoppiano
Copy link

@lfoppiano lfoppiano commented Sep 14, 2024

This PR implements a parallel processor that parses TEI that might come from either Grobid or Pub2TEI.

I've recovered from the dust of the master :-)

@lfoppiano
Copy link
Author

Need to patch the way references are managed, as TEI produced by Pub2TEI might have strings instead of near-integer (b1, b2).

…aintain the compatibility with the rest of the processing
# Conflicts:
#	Readme.md
#	src/main/java/org/grobid/core/engines/DataseerClassifier.java
#	src/main/java/org/grobid/core/engines/DatasetParser.java
…al generated by pub2tei)

(cherry picked from commit 39c0e43)
@lfoppiano lfoppiano marked this pull request as ready for review April 13, 2025 12:54
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant