[go] Requirement for all 'unknown' annotations to use ND code
Karen Christie
kchris at genome.Stanford.EDU
Mon Sep 10 16:17:03 PDT 2007
Requirement for all 'unknown' annotations to use ND code
-----------------------------------------------------------
Hi all,
A question was brought up about the requirement that ND be the only
evidence code allowed for (unknown) annotations to the root nodes
within the Evidence Code Committee, and was not resolved
there. Discussion so far on the list is also mixed.
To me, the issue is that is at the Jan GO meeting we agreed that
evidence codes are ONLY about the type of evidence used to make the
annotation, and not about anything else. However, by saying that
people can use the ND evidence code as a way to find all the unknown
annotations, we are encoding an extra meaning into it.
The email discussion of this issue is below.
-Karen
Requirement that ND be the only allowable evidence code for unknown
annotations
proposed new rule for ND:
Even if an author states in a paper that there is no data available or
nothing is known about the gene product in a particular GO aspect,
annotation to the corresponding root node should be made with ND
evidence code citing either the annotating group's internal reference
or the GOC's reference on use of the ND evidence code, not a specific
paper.
comment in red in draft document:
I realize that we agreed to the above statement at the last GOC
meeting, but...
The more I think about it, the more I'm uncomfortable with the
decision that we made that unknown annotations can only be made with
ND, especially since the reason stated to do so has nothing to do with
evidence, but is to help people better identify the unknown
annotations.
I think this is encoding information into the evidence code that is
about something other than the evidence itself. I think this is poor
practice, especially when we spent so much time at the Jan GO meeting
discussing that evidence codes would be JUST a statement of the method
by which the annotation was made.
Jane Lomax (15 Jun 2007)
I was under the impression that we'd agreed 2. at the Jan meeting
i.e. ND is now the only allowable evidence code for unknown
annotations?
Midori Harris (15 Jun 2007)
I understand, and would add that it also loses the information that at
the time of writing, the authors -- who are presumably pretty well
informed about the genes/gene products they study --are aware of no
relevant data. (Tho this concern is not as grave as that of
overloading an evidence code.)
Valerie Wood (22 Jun 2007)
I'm not so sure because:
1. If authors have specifically asserted that there is no information,
this is usually a statement which is made based on looking at the
database (for example if the author is dealing with a gene set).
2. Papers are frequently published concurrently and it is clear that
the authors have no knowledge of the parallel papers, so an author
statement is not always necessarily a good indication that there is no
functional data without a curator check.
3. I'm pretty sure that when the unknowns disappeared, we advised
software developers that they could retrieve the unknown annotations
using the ND evidence code.....
Although I agree it seems bad practice to put info in the evidence
code other than the evidence itself, I think its more important that
there is a very clear way to identify 'unknown' annotations.
It seems like not many of the softwares have caught up with the
previous change to unknowns (for example I havn't yet managed to find
a way to look at GO term enrichment which recognises the unknown
annotations.... does anybody know of one?)
More information about the Go
mailing list