[go] AmiGO and annotation qualifiers

Judith Blake jblake at informatics.jax.org
Wed Jan 30 16:02:45 PST 2008


could this be a case where the duplicate annotations between GOA and RGD 
were not filtered out by RGD because of the change from UniProt to 
UniProtKB?

judy

Chris Mungall wrote:
>
> I'm a bit confused here.
>
> this appears to show a reference contradicting itself w.r.t Adrm1 in 
> RGD. We saw this in the last demo we looked at for this, and it turned 
> out to be a data error in the FlyBase exports.
>
> There doesn't appear to be any NOT annotation in the RGD files. 
> Perhaps this is just fake annotation for the demo?
>
> As an aside, I see RGD have changed:
> RGD     RGD:69248       Adrm1           GO:0008538      
> RGD:1600115     ISS     UniProt:Q16186  F       adhesion regulating 
> molecule 1          gene    taxon:10116     20070710        UniProt
>
> to:
> UniProtKB       Q9JMB5  ADRM1_RAT               GO:0008538      
> UniProt:Q9JMB5  ISS     UniProtKB:Q16186        F       Adrm1, Gp110: 
> Protein ADRM1     IPI00204510     protein taxon:10116     
> 20070420        UniProtKB
>
> Shouldn't gene IDs be used for consistency, except perhaps where there 
> is alternate splicing?
>
>
>
> On Jan 30, 2008, at 2:58 PM, Judith Blake wrote:
>
>> I like the first one and I like including the qualifiers in AmiGO.
>> http://www.ebi.ac.uk/~aji/demo.html The first one is the clearest.  I 
>> actually had to look around the others to find the qualifiers.
>> I think the default /should/ include the NOTs.  If a newbie is 
>> concerned, they can ask go-help or look around for documentation.  
>> NOTs are important.
>>
>> by nngene products do you mean the '3 associations' thing?  I have no 
>> idea what that means.
>>
>> judy
>>
>> Amelia Ireland wrote:
>>> Hi GO Consortium,
>>>
>>> There has been some discussion in the web presence working group 
>>> about the display of NOT annotations and annotations with 
>>> qualifiers. There are two issues; how to display the annotations 
>>> with qualifiers, and how gene product counts for terms with 
>>> qualifiers should be displayed. We'd like to get some feedback from 
>>> the consortium on these issues before releasing the new AmiGO.
>>>
>>> Annotations With Qualifiers
>>> ===========================
>>>
>>> I've made some mock ups of different possible arrangements here:
>>>
>>> http://www.ebi.ac.uk/~aji/demo.html
>>> http://www.ebi.ac.uk/~aji/demo2.html
>>> http://www.ebi.ac.uk/~aji/demo3.html
>>>
>>> These pages are viewing the GPs annotated to a term (and its 
>>> children), but a similar arrangement will be used to view the terms 
>>> with which a GP is associated.
>>>
>>> Which arrangement is the clearest? Things to consider:
>>>
>>> - how easy is it to see inconsistent annotations, e.g. the same GP 
>>> has a normal and a NOT annotation?
>>>
>>> - is it clear what the qualifier applies to?
>>>
>>> - some views (e.g. having the associations with operators/qualifiers 
>>> separated out) require extra calculations to generate and will hence 
>>> be slower
>>>
>>> - should the default view contain NOT annotations? Is a GO newbie 
>>> going to understand them?
>>>
>>> - is the 'nn gene products' link unambiguous in meaning? Is it 
>>> helpful to know how many GPs there are annotated to a term?
>>>
>>>
>>> Gene Product Counts
>>> ===================
>>>
>>> The WPWG discussed generating separate totals for numbers of GPs 
>>> annotated to a term without any qualifier and the no. of GPs 
>>> annotated to a term with each of the other qualifiers (currently the 
>>> GP count for a term *includes* any GPs annotated with 
>>> 'contributes_to' and 'colocalizes_with', but *excludes* NOT 
>>> annotations). How useful would this be, and is it intuitive to split 
>>> the counts up thus, or is it confusing (especially for GO newbies)? 
>>> There is also this to bear in mind:
>>>
>>> On 24 Jan 2008, at 20:11, Chris Mungall wrote:
>>>> On Jan 24, 2008, at 11:08 AM, Seth Carbon wrote:
>>>>> *) ask chris about splitting the gp count by qualifier (seth)
>>>>
>>>> you mean by contributes_to, the NOT operator etc?
>>>>
>>>> unfortunately it is expensive to pre-compute the gp counts in any 
>>>> way that a single gp can end up in two partitions.
>>>>
>>>> we currently partition by gp db name - no gp can be in > 1. this 
>>>> has the advantage that we can get the correct total by summing the 
>>>> numbers at query time.
>>>>
>>>> however, if we have a partition that does not split gps in this way 
>>>> we can't sum across the partitions, which means we have to 
>>>> pre-compute for all combinations.
>>>>
>>>> This may not be so bad for the qualifiers however, since the 
>>>> qualifiers are relatively rare.
>>>>
>>>> would you really want this for the NOT operator though? remember 
>>>> the semantics are easy to get confused here. Would you want:
>>>>
>>>> all gps that are NOT GPCRs (ie include those that are asserted 
>>>> not-TM receptor in the count) - negation propagates up the graph
>>>>
>>>> all gps that have a NOT annotation to some kind of GPCR - negation 
>>>> propagates as normal
>>>
>>>
>>>
>>> Any feedback would be gratefully received. Thanks!
>>>
>>> Amelia / the Web Presence Working Group.
>>>
>>> -- 
>>> Amelia Ireland
>>> GO Editorial Office,
>>> European Bioinformatics Institute, UK.
>>> Carbon neutral driving: http://www.targetneutral.com/TONIC/index.jsp
>>>
>>>
>>>
>>>
>>>
>>
>>
>>
>





More information about the Go mailing list