[Ontology-editors] another question about biogenesis/organization

Chris Mungall cjm at berkeleybop.org
Fri Jun 26 14:44:43 PDT 2009


tangent to one of my favourite i.e. repeatedly stated topics:

we really need to be collecting datasets with which we can quantify  
the effects of changes in the ontology. This can be quite simple - a  
bag of gene IDs associated with a publication ID (maybe this could be  
integrated with the 'database' underpinning http://www.geneontology.org/cgi-bin/biblio.cgi)

This is sufficient to able to measure the effects of historic or  
proposed changes on the ontology. A tool would take two versions of  
the ontology plus the above dataset and spit out a possibly very  
interesting answer.

Of course, hand curating this dataset would be tedious and time  
consuming but I bet there is someway to get the majority of the  
information out of expression databases etc

But in the absence of such a tool and even with such a tool we should  
of course rely on biologists intuition about what makes a good  
abstraction.

On Jun 26, 2009, at 5:35 AM, Jane Lomax wrote:

> Well, many changes to the ontology result in changes to enrichment  
> analysis results, but if you feel there are valid biological reasons  
> for a parent term for 'organization' and 'biogenesis' then I'm fine  
> with that.
>
> Is it just the cell wall org and biogen terms that you want to  
> unify, or is it others too?
>
> Jane
>
> Valerie Wood wrote:
>> mm physiology isn't right.
>>
>> I still think we need a grouping term in the ontology, not just the  
>> option to create these union terms in slims.
>> By partitioning my cell wall terms into biogenesis and organization  
>> it is  quite possible that gene sets which previously showed  
>> enrichment to the "organization or biogenesis" term will no longer  
>> do so......
>>
>> val
>>
>>
>> Jane Lomax wrote:
>>
>>> Yeah, I know what you mean Val. I'm sure we often use union terms  
>>> because there is no neat name to encompass the meaning of the two  
>>> terms. They do cause problems for building the ontology though.
>>>
>>> Don't think 'cell physiology' would work - we got rid of all the  
>>> physiology terms because we couldn't define them as distinct it  
>>> from the metabolism terms...
>>>
>>> Jane
>>>
>>> Valerie Wood wrote:
>>>
>>>> This sounds quite complicated (for users), but it would be useful  
>>>> if there is no way to create the "compound terms" . It seems  
>>>> though, that these are often reasonable biological concepts but  
>>>> we don't have a good way to name them with a grouping term (there  
>>>> must be something which covers both organization and  
>>>> biogenesis....)
>>>>
>>>> It is similar to the situation with "small molecule metabolism"
>>>> http://sourceforge.net/tracker/?func=detail&aid=2106200&group_id=36855&atid=440764
>>>>
>>>> val
>>>>
>>>> Chris Mungall wrote:
>>>>
>>>>>
>>>>> Currently, no.
>>>>>
>>>>> But we are reengineering a lot of the software. The new  
>>>>> framework  allows for arbitrary boolean intersection/union  
>>>>> combinations. We  should have this in a standalone script soon,  
>>>>> but maybe a bit longer  for amigo
>>>>>
>>>>> (we're already using taxon unions for the GAF validation checks)
>>>>>
>>>>> On Jun 17, 2009, at 5:03 AM, Jane Lomax wrote:
>>>>>
>>>>>> Being able to have union terms in GO slims would solve this.  
>>>>>> I'm not  sure it's something we would want to code into the  
>>>>>> ontology.
>>>>>>
>>>>>> Can map2slim handle term unions, Chris?
>>>>>>
>>>>>> Jane
>>>>>>
>>>>>> Midori Harris wrote:
>>>>>>
>>>>>>> Seriously, the problem with "and" is that it could make a  
>>>>>>> gene  product annotated to a descendant term look like it's  
>>>>>>> involved in  everything covered by a parent (e.g. annotate to  
>>>>>>> biogenesis and  imply annotation also to organization, which  
>>>>>>> includes disassembly).
>>>>>>>
>>>>>>> so that doesn't rule out "organization or biogenesis" ...
>>>>>>>
>>>>>>> m
>>>>>>>
>>>>>>> On Wed, 17 Jun 2009, Valerie Wood wrote:
>>>>>>>
>>>>>>>> "organization and biogenesis" ? ;)
>>>>>>>>
>>>>>>>>
>>>>>>>> Midori Harris wrote:
>>>>>>>>
>>>>>>>>> Ow, my head.
>>>>>>>>>
>>>>>>>>> I think we've worked out reasonable definitions for   
>>>>>>>>> "organization" and "biogenesis" and we're just about there  
>>>>>>>>> for  applying them consistently. What we don't have -- and  
>>>>>>>>> apparently  you could really use -- is some term that covers  
>>>>>>>>> everything we're  putting under both organization and  
>>>>>>>>> biogenesis ... "anything that  happens to a cellular  
>>>>>>>>> component".
>>>>>>>>>
>>>>>>>>> What to call it, though?
>>>>>>>>>
>>>>>>>>> m
>>>>>>>>>
>>>>>>>>> On Tue, 16 Jun 2009, Valerie Wood wrote:
>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>> Sorry, I still haven't got my head around this...see my  
>>>>>>>>>> recent  SF item.....problem with my slimming
>>>>>>>>>> I am still trying to get a slim for a tutorial which gives  
>>>>>>>>>> good  coverage of all of my annotated gene products.
>>>>>>>>>>
>>>>>>>>>> Previously I had the term "cell wall organization and   
>>>>>>>>>> biogenesis" which covered all of cell wall organization  
>>>>>>>>>> and  biogenesis.
>>>>>>>>>>
>>>>>>>>>> Now organization and biogenesis are "disjoint" (am I using  
>>>>>>>>>> the  correct terminology?) I need to include *Both* the  
>>>>>>>>>> organization  and biogenesis term, when it seems as though  
>>>>>>>>>> I really want one  term to cover both of these cell wall  
>>>>>>>>>> processes at this level,  and that they are intimately  
>>>>>>>>>> related. I'm not sure how my  original annotations got  
>>>>>>>>>> split between the the 2 new terms but  there isn't much  
>>>>>>>>>> overlap now......
>>>>>>>>>>
>>>>>>>>>> 73 to biogenesis
>>>>>>>>>> and 63 to organization
>>>>>>>>>> 21 to both
>>>>>>>>>> However when I look through the list I am pretty sure all  
>>>>>>>>>> of  those annotated to biogenesis to equally be annotated  
>>>>>>>>>> to  organization.
>>>>>>>>>>
>>>>>>>>>> Now, if i don't include both of these terms in the slim, I  
>>>>>>>>>> get  gene products which do not map to a high level term.
>>>>>>>>>>
>>>>>>>>>> Any thoughts?
>>>>>>>>>>
>>>>>>>>>> Val
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>> _______________________________________________
>>>>>>> Ontology-editors mailing list
>>>>>>> Ontology-editors at geneontology.org
>>>>>>> http://fafner.stanford.edu/mailman/listinfo/ontology-editors
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>> -- 
>>>>>> Dr Jane Lomax
>>>>>> GO Editorial Office
>>>>>> EMBL-EBI
>>>>>> Wellcome Trust Genome Campus
>>>>>> Hinxton
>>>>>> Cambridgeshire, UK
>>>>>> CB10 1SD
>>>>>>
>>>>>> p: +44 1223 492516
>>>>>> f: +44 1223 494468
>>>>>>
>>>>>> _______________________________________________
>>>>>> Ontology-editors mailing list
>>>>>> Ontology-editors at geneontology.org
>>>>>> http://fafner.stanford.edu/mailman/listinfo/ontology-editors
>>>>>>
>>>>>
>>>>>
>>>>
>>>>
>>>>
>>>
>>>
>>
>>
>>
>
>
> -- 
> Dr Jane Lomax
> GO Editorial Office
> EMBL-EBI
> Wellcome Trust Genome Campus
> Hinxton
> Cambridgeshire, UK
> CB10 1SD
>
> p: +44 1223 492516
> f: +44 1223 494468
>
>



More information about the Ontology-editors mailing list