[Annotation] evidence code advice

Rama Balakrishnan rama at genome.stanford.edu
Wed Apr 2 14:40:11 PDT 2008


Added to the agenda.

Rama

On Apr 1, 2008, at 5:55 PM, Judith Blake wrote:

> Rama,
> If this hasn't been done, would you please add to the wiki agenda  
> list with a pointer to a page with Kara's (and others?) emails...
>
> Thanks very much
> judy
>
> Rama Balakrishnan wrote:
>>> Anyway, in light of that history, I think it would make most  
>>> sense  if the
>>> absolute requirement for the with column to be filled for IEA was   
>>> dropped
>>> in the short term, so that we can use the IEA code for unreviewed
>>> annotations from RCA methods.
>>>
>>
>> I think it is important to require the 'with' column for IEAs to   
>> prevent circular annotations.
>> The other option is to revert the RCA code to its original version   
>> which required only the computational method to be reviewed and  
>> not  every annotation.
>>
>> I also really like Kara's proposal and hopefully this will be   
>> discussed at the upcoming GO meeting.
>>
>> Rama
>>
>>
>>
>>> In the long term, I think Kara's proposal is a better way to go.
>>>
>>> -Karen
>>>
>>>
>>> On Sun, 30 Mar 2008, Suzanna Lewis wrote:
>>>
>>>
>>>> This is very much along the lines that I've been trying to foster
>>>> (remember the meeting in Cambridge at Jesus College). The bit- 
>>>> code  (or
>>>> bar-code) for evidence codes, with each bit indicating one of these
>>>> flags for a different piece of information. Not only automated/  
>>>> manual,
>>>> but also large-scale/small-scale, and other characteristics of the
>>>> evidence.
>>>>
>>>> As Kara (and many others) have said, there is quite a bit of over-
>>>> loading of multiple pieces of information in the current evidence
>>>> codes. It would be nice one day to see these distinguished into
>>>> different constituent bits of information.
>>>>
>>>> -S
>>>>
>>>> p.s. I thought that IEA did not -require- the with column.
>>>> p.p.s Was the decision tree a step in this direction?
>>>>
>>>> On Mar 26, 2008, at 1:59 PM, Kara Dolinski wrote:
>>>>
>>>>
>>>>> Hi,
>>>>>
>>>>> The root of the problem, as I see it, is that we are mixing apples
>>>>> and oranges with evidence codes.  All but one of the evidence  
>>>>> codes
>>>>> indicate the type of experimental evidence for a GO annotation,  
>>>>> but
>>>>> we have one oddball, IEA, that indicates not what the experiment  
>>>>> is,
>>>>> but rather how the annotation was done.  We keep running into
>>>>> variations of the same problem:  we have some evidence (whether
>>>>> experimental or computational) for a GO annotation, but also  
>>>>> want to
>>>>> indicate whether a curator looked at it or not.
>>>>>
>>>>> My proposed (albeit radical) solution:
>>>>>
>>>>> Remove IEA as an evidence code.
>>>>>
>>>>> Create a new property for GO annotations (or add a new type of
>>>>> qualifier) that captures how the annotation was done:  manual or
>>>>> automated.
>>>>>
>>>>> Everything that is currently IEA would be given the 'automated'
>>>>> property/qualifier, and then would be given a new evidence code as
>>>>> appropriate (mostly a flavor of ISS I would assume).
>>>>> There can be a rule that all 'automated' annotations that are a
>>>>> flavor of ISS must have a 'with' value.
>>>>>
>>>>> This would allow us to use 'RCA' as appropriate, in some cases
>>>>> they'd be 'manual', in others, they'd be 'automated'.  In Rama's
>>>>> case, the annotations would be 'RCA' with an 'automated'  
>>>>> qualifier.
>>>>>
>>>>> I realize the issues involved in making such a drastic change,  
>>>>> so I
>>>>> understand if we don't go there, but I do think that some approach
>>>>> such as the one above is the best representation of the  
>>>>> information
>>>>> that we are trying to capture.
>>>>>
>>>>> Cheers,
>>>>> Kara
>>>>>
>>>>> On Mar 26, 2008, at 4:30 PM, Rama Balakrishnan wrote:
>>>>>
>>>>>
>>>>>> Hi All,
>>>>>>
>>>>>> SGD has come across couple of computationally predicted GO
>>>>>> annotation data sets for S. cerevisiae that we would like to  
>>>>>> add to
>>>>>> our database. The GO annotations from these data sets are
>>>>>> predictions based on multiple high-throughput data sets. RCA
>>>>>> evidence code came to our minds but according to the  
>>>>>> documentation,
>>>>>> the annotations all have to be manually reviewed by a curator to
>>>>>> use this evidence. There are several 100 annotations of this kind
>>>>>> and it is not feasible for us to manually review these  
>>>>>> annotations.
>>>>>>
>>>>>> Hence, we thought these annotations can be bulk loaded with IEA
>>>>>> evidence code. However, in the Jan 2007 (Cambridge) GO meeting,  
>>>>>> it
>>>>>> was decided that the 'with' column information has to be filled  
>>>>>> in
>>>>>> for all IEAs (else Mike's filtering script strips them out). But
>>>>>> these GO annotations being predictions based on multiple high-
>>>>>> throughput data sets, don't have any information for the with
>>>>>> column.  So, we are left with no choice.
>>>>>>
>>>>>> Which evidence code do people think should be used for these  
>>>>>> kinds
>>>>>> of computational datasets when there is not an obvious "with"?
>>>>>>
>>>>>> Thanks for your input.
>>>>>>
>>>>>>
>>>>>> Rama
>>>>>>
>>>>>>
>>>>>> +-----o--o
>>>>>> ---------------------------------------------------------------
>>>>>>       o-o       Rama Balakrishnan Ph.D
>>>>>>       O          Senior Scientific Curator
>>>>>>     o-o         Saccharomyces Genome Database
>>>>>>    o---o       Stanford University
>>>>>>  o----o        Stanford, CA 94305-5120
>>>>>> O-----O       Ph: 650.725.8956 Fax: 650.723.7016
>>>>>>  0--o          email: rama at genome.stanford.edu
>>>>>>    O            Website: http://www.yeastgenome.org
>>>>>>   o-o           SGD Wiki- http://wiki.yeastgenome.org
>>>>>> +- o---o
>>>>>> -----------------------------------------------------------------
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>> _______________________________________________
>>>>> Annotation mailing list
>>>>> Annotation at geneontology.org
>>>>> http://fafner.stanford.edu/mailman/listinfo/annotation
>>>>>
>>>> _______________________________________________
>>>> Annotation mailing list
>>>> Annotation at geneontology.org
>>>> http://fafner.stanford.edu/mailman/listinfo/annotation
>>>>
>>>>
>>> _______________________________________________
>>> Annotation mailing list
>>> Annotation at geneontology.org
>>> http://fafner.stanford.edu/mailman/listinfo/annotation
>>>
>>
>> _______________________________________________
>> Annotation mailing list
>> Annotation at geneontology.org
>> http://fafner.stanford.edu/mailman/listinfo/annotation
>>
>



More information about the Annotation mailing list