-
Notifications
You must be signed in to change notification settings - Fork 5
Project motivation from original 2 theses
Juan Miguel Cejuela edited this page Dec 9, 2015
·
2 revisions
- Introductory project presentation at Rostlab
- Theses Documentation:
- Study significance of NL mentions in mutation mention recognition
- ratio of standard vs NL in abstracts & full text
- % of novel mutations not present in SwissProt (would require manual annotation of protein
- % of mutation mentions in natural language that don't appear as standard mention
- Define/extend corpus of NLs
- size depends on significance of NLs
- Method for mutation mention extraction grounded to their genes/proteins
- Mutation mention recognizer better than tmVar for standard mentions
- If NLs are relevant, prove good F1 performance (> 70-80)
- Simple or optionally advanced normalization method
- Easy to use program:
-
Good documentation:
- code
- end-user (biology researcher level, how to call from the command line, ...)
- Accept inputs: programmatical call (string), text file, corpora' formats**
- Accept outputs: ann.json (tagtog suitable)
-
Good documentation:
- Paper
- Full draft (1 or 2 papers?) by end of August submittable to Burkhard Rost
- Submit by September-October