[Ontology-editors] small molecule metabolism
Valerie Wood
val at sanger.ac.uk
Tue Apr 14 04:04:49 PDT 2009
Do we need to include these di-polymers in the small molecule pathways?
if not, it could be defined as non-polymeric molecules as Harold
suggested?
There do not appear to be many metabolic pathways which involve the
disaccharides which do not involve their catabolism into their component
mono-saccharides, or their attachment to a macromolecule?
in which case I don't think we would lose anything be defining
"small molecule metabolism, or metabolite metabolism" as
The chemical reactions and pathways, including anabolism and catabolism,
by which living organisms transform non-polymeric organic molecules.
and groupling all of small molecule metabolism under here ( excluding
disaccharide metabolic process)
a di-polymer is therefore neither a small molecule of a macromolecule by
current def (the macromolecule def is actually vaguer because it refers
to "high" relative molecular mass....)
Val
Harold Drabkin wrote:
> arrgh! Alex reminded me of sucrose and lactose.
> maybe we want to cut it off above 2?
>
> There goes my epiphany!
>
> h
>
>
> Harold Drabkin wrote:
>
>> a thought..
>>
>> How about saying that these are the non-polymeric molecules found
>> within a cell? (assuming we do not want to count di-peptides,
>> di-nucleotides, or di-saccharides )?
>>
>> hd
>>
>> Chris Mungall wrote:
>>
>>>
>>> On Apr 9, 2009, at 9:35 AM, Valerie Wood wrote:
>>>
>>>> Tanya Berardini wrote:
>>>>
>>>>>
>>>>>
>>>>> On Thu, Apr 9, 2009 at 8:48 AM, Chris Mungall <cjm at berkeleybop.org
>>>>> <mailto:cjm at berkeleybop.org>> wrote:
>>>>>
>>>>>
>>>>> On Apr 9, 2009, at 3:35 AM, Valerie Wood wrote:
>>>>>
>>>>> It seems like there is a gap in the terminology of biology to
>>>>> decribe "everything that is not a macromolecule molecule".
>>>>> Maybe we should make one up....
>>>>> Perhaps "small molecule metabolism" would be acceptable if it
>>>>> is defined as "everything that is not a macromolucule" but
>>>>> that is not an acceptable way of defining something is it?
>>>>>
>>>>>
>>>>> Do we really need a term for it? Why not just ask for non-X
>>>>> metabolism any time you're interested in metabolism of Ys where Ys
>>>>> are not Xs
>>>>>
>>>>> Granted tools can't do this yet but it's not hard given the
>>>>> correct structures in the ontology, and we should perhaps be
>>>>> working towards a situation where tools do support this
>>>>>
>>>>>
>>>>> I am partial to this approach. Defining 'small molecule
>>>>> metabolism' as everything that is not 'macromolecule metabolism'
>>>>> violates the ontology design principle of positivity. Why not
>>>>> just combine the annotations from the terms that do cover what is
>>>>> desired and then analyze those results?
>>>>>
>>>>> Tanya
>>>>>
>>>>
>>>> Its quite difficult to do this during enrichment analysis.
>>>> I have seen a number of times that terms which would be
>>>> classically termed "biochemical pathways" are enriched, bacause I
>>>> see the annotations in my data individually.
>>>> The enrichment tools don't show this because the number of
>>>> annotations to the individual terms are not large enough. tThe
>>>> parent term "cellular metabolic process" is not enriched because
>>>> the effect is masked by all of the other 3000 annotations to this
>>>> term generated mainly by the variouse types of macromolecule
>>>> metabolic process i.e.DNA metabolic process, protein metabolic
>>>> process etc.
>>>>
>>>> When you are analysisng whole genome datasets it isn't really
>>>> practical to add and subtract processes and repeat enrichment (it
>>>> gets way too complicated to process and report the results, as you
>>>> would have to fiddle with the P-values for everything you did
>>>> manually and then reintegrate it into your whole genome analysis)
>>>>
>>>> This isn't really a problem for me now because I worked around it,
>>>> but I could only do this because I know what the problem was.
>>>>
>>>> I thought that other users may appreciate some sort of grouping
>>>> term here for similar analyses- it just seems that there should be
>>>> a term to group these processes in the same way that macromolecular
>>>> metabolic processes are grouped. The fact is that if you have a
>>>> bunch of genes enriched for low numbers of various small
>>>> metabolism/canonical biochemical pathway terms, this enrichment
>>>> would most likely be overlooked.
>>>
>>>
>>> I agree with Tanya but understand the practical need for enrichment
>>> analysis.
>>>
>>> I am envisioning a partial solution along the following lines:
>>>
>>> Just as we have goslims, we can have gofats. a gofat would live
>>> outside the ontology and contain statements like
>>>
>>> GOFAT:1 small molecule metabolism = metabolism and not
>>> has_participant chebi:macromolecule
>>>
>>> The reasoner would compute is_a parentage to the fat terms and
>>> create a derived obo file. If the tool accepts OBOFs+GAFs then just
>>> give the tool this obo file instead of the regular one.
>>>
>>> (note this only works for tools that allow you to input an obo file.
>>> Some web-based tools may not give you the flexibility to substitute
>>> anything other than the regular GO)
>>>
>>> This should work in your particular case, no need to keep
>>> re-analyzing once you've defined your fat. We could even make the
>>> fat-derived obo files available on the website, as we do for slims
>>>
>>> This solution isn't perfect as it requires the analyzer to know a
>>> priori which may be useful grouping categories. Really the tool
>>> should be able to do this. For example, for a particular dataset,
>>> "metabolism with a molecule with an X side chain, missing a Y" may
>>> be enriched. There are strategies for dealing with this - rule
>>> mining, or pre-computing every possible cross-product. This will
>>> require a little more know-how from tools developers.
>>>
>>>
>>>>
>>>> val
>>>>
>>>>
>>>>
>>>>
>>>>
>>>>
>>>>
>>>>
>>>>
>>>> --
>>>> The Wellcome Trust Sanger Institute is operated by Genome Research
>>>> Limited, a charity registered in England with number 1021457 and a
>>>> company registered in England with number 2742969, whose registered
>>>> office is 215 Euston Road, London, NW1 2BE.
>>>
>>>
>>> _______________________________________________
>>> Ontology-editors mailing list
>>> Ontology-editors at geneontology.org
>>> http://fafner.stanford.edu/mailman/listinfo/ontology-editors
>>
>>
>> _______________________________________________
>> Ontology-editors mailing list
>> Ontology-editors at geneontology.org
>> http://fafner.stanford.edu/mailman/listinfo/ontology-editors
>
>
>
--
The Wellcome Trust Sanger Institute is operated by Genome Research
Limited, a charity registered in England with number 1021457 and a
company registered in England with number 2742969, whose registered
office is 215 Euston Road, London, NW1 2BE.
More information about the Ontology-editors
mailing list