[Annotation] evidence code advice

Judith Blake jblake at informatics.jax.org
Wed Mar 26 21:51:52 PDT 2008


Hi Kara.

I like having a new view on this.  something to think about

judy

Kara Dolinski wrote:
> Hi,
>
> The root of the problem, as I see it, is that we are mixing apples and 
> oranges with evidence codes.  All but one of the evidence codes 
> indicate the type of experimental evidence for a GO annotation, but we 
> have one oddball, IEA, that indicates not what the experiment is, but 
> rather how the annotation was done.  We keep running into variations 
> of the same problem:  we have some evidence (whether experimental or 
> computational) for a GO annotation, but also want to indicate whether 
> a curator looked at it or not.
>
> My proposed (albeit radical) solution:
>
> Remove IEA as an evidence code.
>
> Create a new property for GO annotations (or add a new type of 
> qualifier) that captures how the annotation was done:  manual or 
> automated.
>
> Everything that is currently IEA would be given the 'automated' 
> property/qualifier, and then would be given a new evidence code as 
> appropriate (mostly a flavor of ISS I would assume).
> There can be a rule that all 'automated' annotations that are a flavor 
> of ISS must have a 'with' value.
>
> This would allow us to use 'RCA' as appropriate, in some cases they'd 
> be 'manual', in others, they'd be 'automated'.  In Rama's case, the 
> annotations would be 'RCA' with an 'automated' qualifier.
>
> I realize the issues involved in making such a drastic change, so I 
> understand if we don't go there, but I do think that some approach 
> such as the one above is the best representation of the information 
> that we are trying to capture.
>
> Cheers,
> Kara
>
> On Mar 26, 2008, at 4:30 PM, Rama Balakrishnan wrote:
>
>>
>> Hi All,
>>
>> SGD has come across couple of computationally predicted GO annotation 
>> data sets for S. cerevisiae that we would like to add to our 
>> database. The GO annotations from these data sets are predictions 
>> based on multiple high-throughput data sets. RCA evidence code came 
>> to our minds but according to the documentation, the annotations all 
>> have to be manually reviewed by a curator to use this evidence. There 
>> are several 100 annotations of this kind and it is not feasible for 
>> us to manually review these annotations.
>>
>> Hence, we thought these annotations can be bulk loaded with IEA 
>> evidence code. However, in the Jan 2007 (Cambridge) GO meeting, it 
>> was decided that the 'with' column information has to be filled in 
>> for all IEAs (else Mike's filtering script strips them out). But 
>> these GO annotations being predictions based on multiple 
>> high-throughput data sets, don't have any information for the with 
>> column.  So, we are left with no choice.
>>
>> Which evidence code do people think should be used for these kinds of 
>> computational datasets when there is not an obvious "with"?
>>
>> Thanks for your input.
>>
>>
>> Rama
>>
>>
>> +-----o--o 
>> ---------------------------------------------------------------
>>         o-o       Rama Balakrishnan Ph.D
>>         O          Senior Scientific Curator
>>       o-o         Saccharomyces Genome Database 
>>      o---o       Stanford University
>>    o----o        Stanford, CA 94305-5120
>>  O-----O       Ph: 650.725.8956 Fax: 650.723.7016
>>    0--o          email: rama at genome.stanford.edu 
>> <mailto:rama at genome.stanford.edu>
>>      O            Website: http://www.yeastgenome.org 
>>     o-o           SGD Wiki- http://wiki.yeastgenome.org
>> +- o---o 
>> -----------------------------------------------------------------
>>
>>
>>
>>
>>
>>
>>
>
> ------------------------------------------------------------------------
>
> _______________________________________________
> Annotation mailing list
> Annotation at geneontology.org
> http://fafner.stanford.edu/mailman/listinfo/annotation
>   




More information about the Annotation mailing list