Skip to content

Commit a8b0d23

Browse files
batch7
1 parent 51aff12 commit a8b0d23

File tree

111 files changed

+25311
-5842
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

111 files changed

+25311
-5842
lines changed

.claude/agents/annotation-reviewer.md

Lines changed: 2 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -31,14 +31,13 @@ For each annotation you will create or update the `review` section of the `exist
3131

3232
Only edit the `review` section. For any statement, back it up with a citation used in the overall document. You should quote exact passages of text in `supporting_text`.
3333

34-
Note that there should be an entry under `existing_annotations` for every line in the GOA tsv. You don't need to add new ones, just comment on existing ones
34+
Note that there should be an entry under `existing_annotations` for every line in the GOA tsv.
3535

3636
The exception is if you think there are key annotations missing. In this case you should add entries, completing the `term` portion yourself, with `action: NEW`. Only do this for annotations not covered or with `proposed_replacement_terms` in existing annotations.
3737

3838
2. **Critical Evaluation**: You must not accept existing annotations as gospel, regardless of whether they are marked as experimental (EXP, IDA, IPI, etc.) or computational (IEA, ISS, etc.). Many GO terms represent over-annotations that need correction.
3939

40-
However, in general IBA annotations have undergone extensive review as well as making phylogenetic sense, they often frequently represent the
41-
term at the right level of specificity. However, they can be conservative and missing functions.
40+
However, in general IBA annotations have undergone extensive review as well as making phylogenetic sense, they often frequently represent the term at the right level of specificity. However, they can be conservative and missing functions.
4241

4342
Always make use of the `original_reference_id`. If this refers to a PMID, then read the publication (in publications/ directory) and make use of the information there.
4443

@@ -64,8 +63,6 @@ Note that duplicates (i.e exact same GO ID) are perfectly fine, there is no need
6463
It may also be OK for IEAs to be broader than what is determined by IBA or literature, you can just mark these as accept,
6564
unless you think the mapping is too general.
6665

67-
68-
6966
5. **Detailed Justification**: For each annotation, provide:
7067
- Clear rationale for the assigned action
7168
- Specific evidence supporting your decision

CLAUDE.md

Lines changed: 19 additions & 15 deletions
Original file line numberDiff line numberDiff line change
@@ -25,22 +25,24 @@ genes/
2525
CFAP300-bioinformatics/ <-- the folder for any bioinformatics analyses (create this if necessary)
2626
<other genes>/
2727
...
28-
worms/
28+
worm/
2929
lrx-1/
30+
SCHPO/ <-- other orgs use uniprot species codes
3031
...
3132
publications/
32-
PMID_123456.md <-- cached publication text (this should be available for you)
33-
33+
PMID_123456.md <-- cached publication text (this should be available for you, DO NOT EDIT)
34+
reactome/
35+
R-HSA-NNNNNN.md <- cached reactome entry
3436

3537
You can regenerate the derived files by running commands like:
3638

3739
```
3840
just fetch-gene human CFAP300
3941
```
4042

41-
This makes the uniprot, goa, and citations files. It will also create a stub for the gene review if it doesn't exist.
43+
This makes the uniprot, goa, and publications files. It will also create a stub for the ai-review.yaml if it doesn't exist.
4244

43-
After making edits to the yaml file, you can validate the file by running:
45+
After making edits to the ai-review.yaml file, you can validate the file by running:
4446

4547
```
4648
just validate human CFAP300
@@ -75,27 +77,29 @@ existing_annotations:
7577
<review of existing annotations, using the GO guidelines; should include all annotations from goa.csv>
7678
core_functions:
7779
<your own summary of the core functions of the gene>
80+
proposed_new_terms:
81+
<suggestions for new GO terms>
82+
suggested_questions:
83+
<suggested questions for experts>
84+
suggested_experiments:
85+
<suggested experiments to elucidate gene function>
7886
```
7987
80-
See the schema and existing files for more details (there are only a few files to look at for now, so all count as exemplars, we will
81-
annotate exemplars more later on)
88+
See the schema and existing files for more details.
8289
8390
When creating a new review, always make sure files are in place:
8491
8592
`just fetch-gene <organism> <gene>`
8693

8794
This creates the uniprot, goa files, and a stub for the gene review.
8895

89-
The next thing is for you to do deep research on that gene and make a GENE-notes.md file, with citations inline (e.g [PMID:12345 title, "<supporting text>"]).
96+
The next thing is for you to do deep research on that gene and make a GENE-notes.md file, with citations inline (e.g [PMID:12345 title, "<supporting text>"]). In some cases this will have been done ahead of time for you.
9097

9198
## Reviewing existing annotations
9299

93-
You should start with the ai-review.yaml file, where stubs are created for you (from `just fetch-gene <organism> <gene>`). Any PMID cited should
94-
be in the publications folder, e.g. `publications/PMID_123456.md`. Review these.
100+
You should start with the ai-review.yaml file, where stubs are created for you (from `just fetch-gene <organism> <gene>`). Any PMID cited should be in the publications folder, e.g. `publications/PMID_123456.md`. Review these.
95101

96-
A deep research file should already be present in the gene folder, e.g. `genes/SPECIES/GENE/GENE-deep-research.md`. This should contain a summary of the
97-
literature, and any other information you found useful for the review. There may be irrelevant information in the citations, that is OK. If this is
98-
not present, make your own and keep notes in `genes/SPECIES/GENE/GENE-notes.md`.
102+
A deep research file should already be present in the gene folder, e.g. `genes/SPECIES/GENE/GENE-deep-research.md`. This should contain a summary of the literature, and any other information you found useful for the review. There may be irrelevant information in the citations, that is OK. If this is not present, make your own and keep notes in `genes/SPECIES/GENE/GENE-notes.md`.
99103

100104
Be aware that many GO terms are over-annotations. You should also not take existing annotations as gospel, whether experimental or bioinformatic.
101105
always use a holistic synthesized picture of the gene function that you have obtained from your research.
@@ -119,6 +123,8 @@ ActionEnum:
119123
UNDECIDED:
120124
description: The annotation is not clear, and the reviewer is not sure what to do with it. ALWAYS USE THIS IF YOU ARE UNABLE TO ACCESS
121125
RELEVANT PUBLICATIONS
126+
NEW:
127+
122128
```
123129

124130
## Tools
@@ -152,8 +158,6 @@ as e.g `file:<SPECIES>/<GENE>/bioinformatics/RESULTS.md`.
152158
* NEVER guess identifiers for terms, genes, publications. Always use the relevant tools or MCPS, or look them up in derived files.
153159
* For files `<GENE>-notes.md`, use literature deep search, and always record provenance for assertions, e.g `[PMID:12345 "<supporting text>"]`
154160

155-
156-
157161
## Support code
158162

159163
There is also support code in `src/ai_gene_review`, see the CLAUDE.md file in that directory for more details

README.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -73,6 +73,7 @@ just stats-open # Generate and open in browser
7373
- **Documentation Website**: [https://monarch-initiative.github.io/ai-gene-review](https://monarch-initiative.github.io/ai-gene-review)
7474
- **Interactive Web App**: [https://ai4curation.github.io/ai-gene-review/app/index.html](https://ai4curation.github.io/ai-gene-review/app/index.html) - Browse and explore gene annotation reviews
7575
- **Statistics Dashboard**: [https://ai4curation.github.io/ai-gene-review/docs/stats_report.html](https://ai4curation.github.io/ai-gene-review/docs/stats_report.html) - Summary Stats
76+
- **Slide Overview**: [https://docs.google.com/presentation/d/1xBFIQE0jt7K6kFg4zFzUwLDHtnDWat2ZVDarhcpA3_4/edit?slide=id.p#slide=id.p](slides)
7677

7778
## Gene Review Structure
7879

0 commit comments

Comments
 (0)