Skip to content

Labeling Function still got access to calculated attribute after it has been deleted #3

Open
@FelixKirsch

Description

@FelixKirsch

From #1:

When I add a attribute that is basically a copy of a given text (let's call that attribute text_copy), and i then write a lf using text_copy, e.g.:

def my_lf(record):
    if "x" in record["text_copy"].text:
        return "label_a"

that works. But now, when I delete text_copy, and run that function again - or even modify it to label_b, and then run it again - it still works. How do we store the records exactly? might be that there is some place left that is not touched by the deletion of an attribute?

From the user's perspective, this behavior seems unexpected. For the moment, I decided to keep calculated attributes in the docbin and record_tokenized, until the tokenization is restarted. Otherwise, the tokenization would have to be restarted and the user would have to wait until it's finished.

At the point in time, when attributes can be tokenized by themselves without tokenizing the whole project, it will most probably also be possible to delete only one attribute. Then this issue can be fixed.

Metadata

Metadata

Assignees

Labels

bugSomething isn't workingenhancementNew feature or request

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions