Standardized Term Names
Chris Mungall
cjm at fruitfly.org
Mon Feb 21 16:39:08 PST 2005
Hi Alex - I agree completely. I will create a specific cell ontology
alignment report next obol run
On Mon, 21 Feb 2005, Alexander Diehl wrote:
> Chris,
>
> While I would like to see the cell ontology continued to be improved,
> I think that its use in constructing GO terms cannot be forced at
> this time for the reasons I mentioned previously. In particular, I
> don't feel the cell ontology can be considered authoritative at this
> time and its naming conventions differ significantly from the GO.
> Nevertheless, it would be useful for interested parties to compare
> the cell ontology with the implicit GO cell ontology either
> computationally or by hand to look for discrepancies between the two
> and add appropriate synonyms where needed. I would also suggest the
> GO editors at EBI could make a point of comparing the two ontologies
> whenever a GO term containing a cell type name is being coined or
> modified in order to ensure the two ontologies are more in harmony by
> suggesting changes to one or the other ontology.
>
> -- Alex
>
>
> At 11:46 AM -0800 2/21/05, Chris Mungall wrote:
> >Alexander,
> >
> >The cell ontology is still young and not yet seeing much active use, so
> >it's not surprising it's incomplete and has problems. I had envisioned one
> >of the main users of the cell ontology as being the GO curators. It seems
> >that instead GO curators are focusing their energies on getting the
> >implicit cell ontology within GO correct, and ignoring the cell ontology
> >(I may be mischaracterizing here, I don't know to what extent annotators
> >are doing curation).
> >
> >I guess this is perfectly understandable, given that this is the GO
> >consortium and not the cell ontology consortium. However, I think it is
> >extremely counterproductive in the long run.
> >
> >If we can somehow divert more energy to getting the cell ontology correct,
> >then we can use this for automatic correction in GO.
> >
> >I realise this may seem quite daunting. There are indeed glaring
> >inconsistencies. I'm not suggesting anyone sits down and manually sorts
> >these out. The technology is there to automate this task. Obol currently
> >reports inconsistencies, and obol is also capable of exporting the
> >implicit cell (or anatomy or chemical or protein) ontology from GO.
> >
> >Soon we will have the ability to make genuine cross products between GO
> >and CellOnt, with the cross-product recorded in the obo file. This will
> >allow OBO-Edit to automatically flag inconsistencies (rather than having
> >the curator examine a seperate text report).
> >
> >Of course, if this approach is to succeed, then it would require a slight
> >shift in working practice. There would have to be more cross-talk between
> >the cell ontology and GO. GO curators would still of course be free to
> >define their 'own' cell ontology within GO that need not be consistent
> >with the cell ontology (in either DAG structure, synonyms or spelling),
> >but this would be deeply counterproductive. The correct approach would be
> >to correct the cell ontology and have this automatically propagate into
> >GO.
> >
> >Does this sound reasonable?
> >
> >Cheers
> >Chris
> >
> >On Mon, 21 Feb 2005, Alexander Diehl wrote:
> >
> >> The correct spelling is "eosinophil," which gets about 225000 hits in
> >> Google. "Oesinophil" seems to be a mispelling, and gets about 30
> >> hits in Google.
> >>
> >> As currently constituted, I would have a lot of problems with using
> >> the cell ontology as a source for correct nomenclature for cell types
> >> in the GO. The cell ontology differs from standard usage for many
> >> terms and is far from complete. It also lacks definitions for its
> >> terms as well as many commonly used synonyms. There are glaring
> >> discrepancies between cell types in the GO and cell ontology terms,
> >> such as the use of T-cell in the GO and T_lymphocyte in the cell
> >> ontology, with no synonyms in either place to provide a mapping
> >> between the two terms. All in all, one could probably construct an
> > > equally valid cell type ontology by deconstructing GO terms.
> >> Nevertheless, I do use the cell ontology in annotation notes to
> >> provide clarifying information when the GO term is inadequate, and I
> >> welcome its continued development.
> >>
> >> GO term names should reflect actual usage and proper English, which
> >> was the source of my initial complaint.
> >>
> >> -- Alex
> >>
> >> At 10:05 AM -0800 2/21/05, Chris Mungall wrote:
> >> >[cc-ing Sue, even though I suspect she is already on the development
> >> >mailing list]
> >> >
> >> >Let me see if I understand
> >> >
> >> >You want to just have name='oesinophil differentiation', and not bother
> >> >having exact_synonym='oesinophil cell differentiation'?
> >> >
> >> >That seems very sensible, since it's not a good use of curator time to
> > > >insert these automatable synonyms. You certainly shouldn't be spending any
> >> >time on this to help an automatable parser like Obol!
> >> >
> >> >Are you also saying you'd like Obol to automatically add these synonyms?
> >> >We're not really set up for this kind of dataflow at the moment. There
> >> >would be some lag, during which time users search on Amigo for "oesinophil
> >> >cell differentiation" would come back empty handed (I think this should
> >> >actually be handled at the search level rather than at the synonym level,
> >> >though again we aren't set up for this yet).
> >> >
> >> >What Obol really cares about for these terms (and what I think we should
> >> >care about) is consistency with the cell ontology.
> >> >
> >> >Note that the cell ontology has "eosinophil" (NOT "oesinophil"). If this
> >> >is a genuine synonym then there should be a dialog with the cell ontology
> >> >curators to introduce that synonym in the cell ontology. Synonyms for GO
> >> >can then be computed automatically for this.
> >> >
> >> >Note also that the cell ontology has "chromophil cell" and "neutrophil".
> >> >Why not "chromophil" and "neutrophil cell"? If this reflects actual usage
> >> >then the term names should reflect this. If there are reasons to believe
> >> >that the cell ontology naming conventions are arbitrary, then the cell
> >> >ontology should be modified to be consistent with actual usage. The GO
> >> >should then take the cell ontology as the authorative source, and use the
> >> >corresponding naming convention.
> >> >
> >> >Cheers
> >> >Chris
> >> >
> >> >On Mon, 21 Feb 2005, Jennifer I Clark wrote:
> >> >
> >> >> Hi Chris,
> >> >>
> >> >> I was thinking more about the cell differentiation things. I reckon your
> >> >> obol parser would be okay without the synonyms as long as I tell you a
> >> >> few more naming rules.
> >> >>
> >> >> We have the following kinds of differentiation terms:
> >> >>
> >> >> x cell differentation
> >> >> xblast differentation
> >> >> xphil differentation
> >> >>
> >> >> where the word endings 'blast' and 'phil' are synonyms for the word
> >> >> 'cell' in the first example. If I just tell you about these extra rules
> >> >> then could you add them to the pareser? That way I wouldn't need to to
> >> >> bother with the synonyms. I could add documentation to the website to
> >> >> say that where these endings are in a term we don't add the word 'cell'
> >> >> since it is redundant.
> >> >>
> >> >> Thanks,
> >> >>
> >> >> Jen
> >> >>
> >> >> Chris Mungall wrote:
> >> > >
> >> >> >Whichever way we do it, I don't think we should mark up things in the
> >> >> >ontology just to suit a particular piece of software.
> >> >> >
> >> >> >We should only do this if it doesn't impact curation time and if the
> >> >> >notion of a 'formal' synonym for a term is useful to humans.
> >> >> >
> >> >> >Here are the criteria for a formal synonym:
> >> >> >
> >> >> >- it must be an exact synonym
> >> >> > [ and would still be designated as such in the obo file, it
> >> >> > would just have an additional qualifier ]
> >> >> >
> >> >> >- the existing term name is some kind of colloquial shorthand
> >> >> > that is readily recognised by most biologists, yet the naming
> >> >> > style of the shorthand is different from the consistent naming
> >> >> > style employed for similar terms
> >> >> >
> >> >> >- the existence of the synonym is a genuine rephrasing of the
> >> >> > term name, and does not exist by virtue of alternate spellings
> >> >> > of words.
> > > >> >
> >> >> >- the formal synonym must employ canonical spellings
> >> >> >
> >> >> >Why would this be useful to humans too? Well, sometimes it
> >>may be useful
> >> >> >to organise a set of terms such that either the formal synonym (or just
> >> >> >the name if there is no designated formal synonym) is used as
> >>the primary
> >> >> >label. Thus the term with name "methanogenesis" will be
> >>listed as "methane
> >> >> >biosynthesis" and will have consistent labeling style with other "X
> >> >> >biosynthesis" terms.
> >> >> >
> >> >> >If this isn't useful then we shouldn't bother.
> >> >> >
> >> >> >If we don't, then what I do think will be useful (and what
> >>will be quite
> >> >> >sufficient for obol) is to separate the useful exact_synonyms from the
> >> >> >exact_synonyms that are derivable from the exact_synonyms of sub-terms.
> > > >> >
> >> >> >An example is "sulfite reduction/sulphite reduction". This is
> >>only useful
> >> >> >for the purposes of searching, and could be handled automatically.
> >> >> >Displaying this synonym is of no use to the user.
> >> >> >
> >> >> >Cheers
> >> >> >Chris
> >> >> >
> >> >> >On Fri, 18 Feb 2005, John Day-Richter wrote:
> >> >> >
> >> >> >
> >> >> >
> >> >> >>J Clark wrote:
> >> >> >>
> >> >> >>
> >> >> >>
> >> >> >>>If there was a way to mark the 'obol' synonym then I could easily
> >> >> >>>assign the synonym type to that as I went along. Might this be a
> >> >> >>>possibility with DAG-Edit John?
> >> >> >>>
> >> >> >>>
> >> >> >>We have synonym types in OBO-Edit... just hang on! A beta-version will
> >> >> >>be out soon (of course, you won't be able to save your changes to the
> >> >> >>repository for a while after that, because everyone else will still be
> >> >> >>using DAG-Edit).
> >> >> >>
> >> >> >>For now, I suggest you add a synonym dbxref to the OBO synonyms (like
> >> >> >>"syntype:obol"). Once we move to full-fledged synonym types,
> >>we can do a
> >> >> >>quick search-and-replace to change those dbxrefs into synonym category
> >> >> >>identifiers.
> >> >> >>
> > > >> >> -John
>
>
More information about the Development
mailing list