Standardized Term Names--cell ontology and GO
Jennifer I Clark
jclark at ebi.ac.uk
Tue Feb 22 01:53:25 PST 2005
Hi,
I think someone must have got there first since that synonym is spelt
correctly.
Jen
Jennifer I Clark wrote:
> Hi,
>
> In the meantime I'll fix that spelling error and leave the 'cell
> differentiation' synonyms in then shall I?
>
> Jen
>
> Sue Rhee wrote:
>
>> I agree with both Chris and Alexander's views. While the cell
>> ontology is
>> much smaller effort than GO, Michael, Jonathan, and I are interested and
>> somewhat committed to improving the cell ontology to use
>> community-accepted terminology. May I suggest that you include Michael
>> Ashburner, Jonathan Bard, and myself in the recipients of the obol
>> run and
>> we will try to resolve the name conflicts? Also, if anyone has any other
>> term-related issues relevant to cell ontology such as those brought
>> up by
>> Alex in the thread below, please send them to Michael Ashburner, who is
>> the official curator of cell ontology.
>>
>> Many thanks,
>> Sue
>>
>> On Mon, 21 Feb 2005, Chris Mungall wrote:
>>
>>
>>
>>> Hi Alex - I agree completely. I will create a specific cell ontology
>>> alignment report next obol run
>>>
>>> On Mon, 21 Feb 2005, Alexander Diehl wrote:
>>>
>>>
>>>
>>>> Chris,
>>>>
>>>> While I would like to see the cell ontology continued to be improved,
>>>> I think that its use in constructing GO terms cannot be forced at
>>>> this time for the reasons I mentioned previously. In particular, I
>>>> don't feel the cell ontology can be considered authoritative at this
>>>> time and its naming conventions differ significantly from the GO.
>>>> Nevertheless, it would be useful for interested parties to compare
>>>> the cell ontology with the implicit GO cell ontology either
>>>> computationally or by hand to look for discrepancies between the two
>>>> and add appropriate synonyms where needed. I would also suggest the
>>>> GO editors at EBI could make a point of comparing the two ontologies
>>>> whenever a GO term containing a cell type name is being coined or
>>>> modified in order to ensure the two ontologies are more in harmony by
>>>> suggesting changes to one or the other ontology.
>>>>
>>>> -- Alex
>>>>
>>>>
>>>> At 11:46 AM -0800 2/21/05, Chris Mungall wrote:
>>>>
>>>>
>>>>> Alexander,
>>>>>
>>>>> The cell ontology is still young and not yet seeing much active
>>>>> use, so
>>>>> it's not surprising it's incomplete and has problems. I had
>>>>> envisioned one
>>>>> of the main users of the cell ontology as being the GO curators.
>>>>> It seems
>>>>> that instead GO curators are focusing their energies on getting the
>>>>> implicit cell ontology within GO correct, and ignoring the cell
>>>>> ontology
>>>>> (I may be mischaracterizing here, I don't know to what extent
>>>>> annotators
>>>>> are doing curation).
>>>>>
>>>>> I guess this is perfectly understandable, given that this is the GO
>>>>> consortium and not the cell ontology consortium. However, I think
>>>>> it is
>>>>> extremely counterproductive in the long run.
>>>>>
>>>>> If we can somehow divert more energy to getting the cell ontology
>>>>> correct,
>>>>> then we can use this for automatic correction in GO.
>>>>>
>>>>> I realise this may seem quite daunting. There are indeed glaring
>>>>> inconsistencies. I'm not suggesting anyone sits down and manually
>>>>> sorts
>>>>> these out. The technology is there to automate this task. Obol
>>>>> currently
>>>>> reports inconsistencies, and obol is also capable of exporting the
>>>>> implicit cell (or anatomy or chemical or protein) ontology from GO.
>>>>>
>>>>> Soon we will have the ability to make genuine cross products
>>>>> between GO
>>>>> and CellOnt, with the cross-product recorded in the obo file. This
>>>>> will
>>>>> allow OBO-Edit to automatically flag inconsistencies (rather than
>>>>> having
>>>>> the curator examine a seperate text report).
>>>>>
>>>>> Of course, if this approach is to succeed, then it would require a
>>>>> slight
>>>>> shift in working practice. There would have to be more cross-talk
>>>>> between
>>>>> the cell ontology and GO. GO curators would still of course be
>>>>> free to
>>>>> define their 'own' cell ontology within GO that need not be
>>>>> consistent
>>>>> with the cell ontology (in either DAG structure, synonyms or
>>>>> spelling),
>>>>> but this would be deeply counterproductive. The correct approach
>>>>> would be
>>>>> to correct the cell ontology and have this automatically propagate
>>>>> into
>>>>> GO.
>>>>>
>>>>> Does this sound reasonable?
>>>>>
>>>>> Cheers
>>>>> Chris
>>>>>
>>>>> On Mon, 21 Feb 2005, Alexander Diehl wrote:
>>>>>
>>>>>
>>>>>
>>>>>> The correct spelling is "eosinophil," which gets about 225000
>>>>>> hits in
>>>>>> Google. "Oesinophil" seems to be a mispelling, and gets about 30
>>>>>> hits in Google.
>>>>>>
>>>>>> As currently constituted, I would have a lot of problems with using
>>>>>> the cell ontology as a source for correct nomenclature for cell
>>>>>> types
>>>>>> in the GO. The cell ontology differs from standard usage for many
>>>>>> terms and is far from complete. It also lacks definitions for its
>>>>>> terms as well as many commonly used synonyms. There are glaring
>>>>>> discrepancies between cell types in the GO and cell ontology terms,
>>>>>> such as the use of T-cell in the GO and T_lymphocyte in the cell
>>>>>> ontology, with no synonyms in either place to provide a mapping
>>>>>> between the two terms. All in all, one could probably construct an
>>>>>>
>>>>>
>>>>> > equally valid cell type ontology by deconstructing GO terms.
>>>>>
>>>>>
>>>>>> Nevertheless, I do use the cell ontology in annotation notes to
>>>>>> provide clarifying information when the GO term is inadequate, and I
>>>>>> welcome its continued development.
>>>>>>
>>>>>> GO term names should reflect actual usage and proper English, which
>>>>>> was the source of my initial complaint.
>>>>>>
>>>>>> -- Alex
>>>>>>
>>>>>> At 10:05 AM -0800 2/21/05, Chris Mungall wrote:
>>>>>> >[cc-ing Sue, even though I suspect she is already on the
>>>>>> development
>>>>>> >mailing list]
>>>>>> >
>>>>>> >Let me see if I understand
>>>>>> >
>>>>>> >You want to just have name='oesinophil differentiation', and not
>>>>>> bother
>>>>>> >having exact_synonym='oesinophil cell differentiation'?
>>>>>> >
>>>>>> >That seems very sensible, since it's not a good use of curator
>>>>>> time to
>>>>>>
>>>>>
>>>>> > >insert these automatable synonyms. You certainly shouldn't be
>>>>> spending any
>>>>>
>>>>>
>>>>>> >time on this to help an automatable parser like Obol!
>>>>>> >
>>>>>> >Are you also saying you'd like Obol to automatically add these
>>>>>> synonyms?
>>>>>> >We're not really set up for this kind of dataflow at the moment.
>>>>>> There
>>>>>> >would be some lag, during which time users search on Amigo for
>>>>>> "oesinophil
>>>>>> >cell differentiation" would come back empty handed (I think this
>>>>>> should
>>>>>> >actually be handled at the search level rather than at the
>>>>>> synonym level,
>>>>>> >though again we aren't set up for this yet).
>>>>>> >
>>>>>> >What Obol really cares about for these terms (and what I think
>>>>>> we should
>>>>>> >care about) is consistency with the cell ontology.
>>>>>> >
>>>>>> >Note that the cell ontology has "eosinophil" (NOT "oesinophil").
>>>>>> If this
>>>>>> >is a genuine synonym then there should be a dialog with the cell
>>>>>> ontology
>>>>>> >curators to introduce that synonym in the cell ontology.
>>>>>> Synonyms for GO
>>>>>> >can then be computed automatically for this.
>>>>>> >
>>>>>> >Note also that the cell ontology has "chromophil cell" and
>>>>>> "neutrophil".
>>>>>> >Why not "chromophil" and "neutrophil cell"? If this reflects
>>>>>> actual usage
>>>>>> >then the term names should reflect this. If there are reasons to
>>>>>> believe
>>>>>> >that the cell ontology naming conventions are arbitrary, then
>>>>>> the cell
>>>>>> >ontology should be modified to be consistent with actual usage.
>>>>>> The GO
>>>>>> >should then take the cell ontology as the authorative source,
>>>>>> and use the
>>>>>> >corresponding naming convention.
>>>>>> >
>>>>>> >Cheers
>>>>>> >Chris
>>>>>> >
>>>>>> >On Mon, 21 Feb 2005, Jennifer I Clark wrote:
>>>>>> >
>>>>>> >> Hi Chris,
>>>>>> >>
>>>>>> >> I was thinking more about the cell differentiation things. I
>>>>>> reckon your
>>>>>> >> obol parser would be okay without the synonyms as long as I
>>>>>> tell you a
>>>>>> >> few more naming rules.
>>>>>> >>
>>>>>> >> We have the following kinds of differentiation terms:
>>>>>> >>
>>>>>> >> x cell differentation
>>>>>> >> xblast differentation
>>>>>> >> xphil differentation
>>>>>> >>
>>>>>> >> where the word endings 'blast' and 'phil' are synonyms for
>>>>>> the word
>>>>>> >> 'cell' in the first example. If I just tell you about these
>>>>>> extra rules
>>>>>> >> then could you add them to the pareser? That way I wouldn't
>>>>>> need to to
>>>>>> >> bother with the synonyms. I could add documentation to the
>>>>>> website to
>>>>>> >> say that where these endings are in a term we don't add the
>>>>>> word 'cell'
>>>>>> >> since it is redundant.
>>>>>> >>
>>>>>> >> Thanks,
>>>>>> >>
>>>>>> >> Jen
>>>>>> >>
>>>>>> >> Chris Mungall wrote:
>>>>>> > >
>>>>>> >> >Whichever way we do it, I don't think we should mark up
>>>>>> things in the
>>>>>> >> >ontology just to suit a particular piece of software.
>>>>>> >> >
>>>>>> >> >We should only do this if it doesn't impact curation time
>>>>>> and if the
>>>>>> >> >notion of a 'formal' synonym for a term is useful to humans.
>>>>>> >> >
>>>>>> >> >Here are the criteria for a formal synonym:
>>>>>> >> >
>>>>>> >> >- it must be an exact synonym
>>>>>> >> > [ and would still be designated as such in the obo file, it
>>>>>> >> > would just have an additional qualifier ]
>>>>>> >> >
>>>>>> >> >- the existing term name is some kind of colloquial shorthand
>>>>>> >> > that is readily recognised by most biologists, yet the naming
>>>>>> >> > style of the shorthand is different from the consistent
>>>>>> naming
>>>>>> >> > style employed for similar terms
>>>>>> >> >
>>>>>> >> >- the existence of the synonym is a genuine rephrasing of the
>>>>>> >> > term name, and does not exist by virtue of alternate
>>>>>> spellings
>>>>>> >> > of words.
>>>>>>
>>>>>
>>>>> > >> >
>>>>>
>>>>>
>>>>>> >> >- the formal synonym must employ canonical spellings
>>>>>> >> >
>>>>>> >> >Why would this be useful to humans too? Well, sometimes it
>>>>>> may be useful
>>>>>> >> >to organise a set of terms such that either the formal
>>>>>> synonym (or just
>>>>>> >> >the name if there is no designated formal synonym) is used as
>>>>>> the primary
>>>>>> >> >label. Thus the term with name "methanogenesis" will be
>>>>>> listed as "methane
>>>>>> >> >biosynthesis" and will have consistent labeling style with
>>>>>> other "X
>>>>>> >> >biosynthesis" terms.
>>>>>> >> >
>>>>>> >> >If this isn't useful then we shouldn't bother.
>>>>>> >> >
>>>>>> >> >If we don't, then what I do think will be useful (and what
>>>>>> will be quite
>>>>>> >> >sufficient for obol) is to separate the useful
>>>>>> exact_synonyms from the
>>>>>> >> >exact_synonyms that are derivable from the exact_synonyms of
>>>>>> sub-terms.
>>>>>>
>>>>>
>>>>> > >> >
>>>>>
>>>>>
>>>>>> >> >An example is "sulfite reduction/sulphite reduction". This is
>>>>>> only useful
>>>>>> >> >for the purposes of searching, and could be handled
>>>>>> automatically.
>>>>>> >> >Displaying this synonym is of no use to the user.
>>>>>> >> >
>>>>>> >> >Cheers
>>>>>> >> >Chris
>>>>>> >> >
>>>>>> >> >On Fri, 18 Feb 2005, John Day-Richter wrote:
>>>>>> >> >
>>>>>> >> >
>>>>>> >> >
>>>>>> >> >>J Clark wrote:
>>>>>> >> >>
>>>>>> >> >>
>>>>>> >> >>
>>>>>> >> >>>If there was a way to mark the 'obol' synonym then I could
>>>>>> easily
>>>>>> >> >>>assign the synonym type to that as I went along. Might
>>>>>> this be a
>>>>>> >> >>>possibility with DAG-Edit John?
>>>>>> >> >>>
>>>>>> >> >>>
>>>>>> >> >>We have synonym types in OBO-Edit... just hang on! A
>>>>>> beta-version will
>>>>>> >> >>be out soon (of course, you won't be able to save your
>>>>>> changes to the
>>>>>> >> >>repository for a while after that, because everyone else
>>>>>> will still be
>>>>>> >> >>using DAG-Edit).
>>>>>> >> >>
>>>>>> >> >>For now, I suggest you add a synonym dbxref to the OBO
>>>>>> synonyms (like
>>>>>> >> >>"syntype:obol"). Once we move to full-fledged synonym types,
>>>>>> we can do a
>>>>>> >> >>quick search-and-replace to change those dbxrefs into
>>>>>> synonym category
>>>>>> >> >>identifiers.
>>>>>> >> >>
>>>>>>
>>>>>
>>>>> > >> >> -John
>>>>>
>>>>
>>>>
>>>
>>
>> -----------------------------------------------------------------------------
>>
>> Sue Rhee rhee at acoma.stanford.edu
>> The Arabidopsis Information Resource URL: www.arabidopsis.org
>> Carnegie Institution of Washington FAX: +1-650-325-6857
>> Department of Plant Biology Tel: +1-650-325-1521 ext. 251
>> 260 Panama St.
>> Stanford, CA 94305
>> U.S.A.
>> -----------------------------------------------------------------------------
>>
>>
>>
>>
>>
>>
>
More information about the Development
mailing list