[Ontology-editors] cross products

David Hill dph at informatics.jax.org
Mon Apr 6 14:50:18 PDT 2009


Hi Chris,

Just for clarity. Tanya and I are currently trying to change all the 
'during' terms to 'involved in' terms if there really is a part_of 
relationship. There are some cases in the GO where 'during' really means 
during. In those cases, the term means that the 2 processes just occur 
at the same time, but otherwise are unrelated. So the 'during' rule 
might not be such a good one.

We need to clean up these 'during' terms as well as 'by' terms and 'via' 
terms.

David
**
Chris Mungall wrote:
>
> The answer is it's a mix, depending on the xp set. *Ultimately* the 
> goal is to never have to parse a term again and to create xp defs 
> prospectively rather than retrospectively (we can still use obol for 
> *generation* of names, defs and synonyms, based on the newly created 
> xp def). We're nearer this goal with some xp sets rather than others.
>
> For example, the inter-organism regulation ones are quite hard, and we 
> probably have a few iterations to go. These iterations are quite 
> useful besides improving the grammar, as they clarify many parts of 
> the ontology, and also consistency of naming.
>
> The bp_xp_self which you are working on is a special case. There is a 
> long tail of terms that can be trivially parsed; for example "X during 
> Y" is translated to "an X that part_of Y". These have a high accuracy. 
> But there are other terms in your set for which I liberally applied 
> grammatical rules I knew to be less accurate. One such rule involved 
> using a relational adjective to guess an xp def, resulting in
>
> GO:0048007 ! antigen processing and presentation, exogenous lipid 
> antigen via MHC class Ib
> synonym: "exogenous lipid antigen processing and presentation via MHC 
> class Ib" ==>
> intersection_of: GO:0048003 ! antigen processing and presentation of 
> lipid antigen via MHC class Ib
> intersection_of: part_of GO:0042638 ! exogen
>
> (currently in the vetted file)
>
> This is quite obviously wrong!! I knew there would be many incorrect 
> ones in the bp_xp_self set from the outset, I did a first pass filter 
> myself, but probably not a good enough job. So in some ways your set 
> is the hardest as it has the highest ratio of nonsense. We should 
> perhaps invert the procedure for you, such that everything starts off 
> unvetted and must be explicitly vetted.
>
> The goal in this case is not to improve the obol rule: it's simply not 
> possible to get a 100% accurate xp def from terms such as this. Obol 
> is really for the low hanging fruit (LHF), the trivially composable 
> terms.
>
> We'll probably revisit the example above when we return to do 
> bp_xp_pro. For now, just delete erroneous xp defs such as the one above
>
> On Apr 6, 2009, at 5:27 AM, Jennifer Deegan (nee Clark) wrote:
>
>> Hi,
>>
>> David and I were editing some of my cross products together on Friday 
>> and we realised that we had different ideas of what we were meant to 
>> do.  I just wanted to check that we are on the right lines now, and 
>> Chris asked me to do it on list so we are all up to date on the 
>> correct scheme.
>>
>> I had assumed that the intersection files that Chris generated with 
>> obol  were a one-off, and that we were to hand edit the stanzas and 
>> put the resulting corrected intersection tags into the vetted file.
>>
>> David thought that the plan was for us to correct the live GO and 
>> feed back on obol's rules so that autogeneration would work correctly 
>> for all future runs.
>>
>> Which one is right?
>>
>> Thanks,
>>
>> Jen
>>
>>
>> _______________________________________________
>> Ontology-editors mailing list
>> Ontology-editors at geneontology.org
>> http://fafner.stanford.edu/mailman/listinfo/ontology-editors
>>
>
> _______________________________________________
> Ontology-editors mailing list
> Ontology-editors at geneontology.org
> http://fafner.stanford.edu/mailman/listinfo/ontology-editors

-- 
David P. Hill, Ph.D.
Bioinformatics Scientist: Ontology Development
Gene Ontology Consortium
The Jackson Laboratory
www.geneontology.org
www.informatics.jax.org
tel:207-288-6430



More information about the Ontology-editors mailing list