[Ontology-editors] Automating regulation terms (and combinatorial terms in general)

Jennifer Deegan (nee Clark) jdeegan at ebi.ac.uk
Mon Aug 18 01:57:05 PDT 2008


Hi,

Is there a way this kind of system could be set up to work on branch 
files in scratch as well? I mainly work on side projects on files in the 
scratch directory now and these projects would also benefit from 
automatic term generation.

Thanks,

Jen

Tanya Berardini wrote:

> I'd be happy to take the terms from the SF requests and put them in a 
> file in cvs and have you work your magic on them.  Reviewing the 
> suggested relationships sounds like it would be much more efficient.  If 
> we can have the standard defs be a combination of the standard 
> regulation definition followed by the definition of the linked process 
> term (if that exists), that would be great.  So the definition of 
> 'regulation of X' would be 'Any process that modulates the rate, extent 
> or frequency of X. X is the ......'
> 
> We do still need to review the links as suggested because frequently 
> (today, in fact!) we run across errors in the ontology that we would 
> have completely missed had we not manually inspected the graph 
> structure.  The errors we saw today occurred higher up in the graph 
> (closer to the root terms) that the immediate terms we were adding.  So, 
> I envision us taking a file with the suggested terms merged into the 
> main live file, reviewing it by eye and then committing the new terms.  
> This route still saves us time spent in creating the terms, and finding 
> the logical parents.
> 
> Tanya
> 
> On Fri, Aug 15, 2008 at 5:40 AM, David Hill <dph at informatics.jax.org 
> <mailto:dph at informatics.jax.org>> wrote:
> 
>     This would certainly free Tanya and I up to do a lot of other stuff.
>     We edit the regulation graph by hand a lot! In fact, we will be
>     doing it this afternoon.
> 
>     David
> 
> 
>     Chris Mungall wrote:
> 
>         I'm kind of embarassed that poor curators like Ruth have to type
>         out sourceforge requests like this:
> 
>         *
>         https://sourceforge.net/tracker/index.php?func=detail&aid=2045154&group_id=36855&atid=440764
>         <https://sourceforge.net/tracker/index.php?func=detail&aid=2045154&group_id=36855&atid=440764>
> 
> 
>         And then someone has to manually add this into the graph being
>         careful to get the is_a links correct
> 
>         Regulation term creation (and much xp creation in general)
>         should be far more automatable. This will be easier with OE2
>         when we can have the intersection_of lines live in the main
>         ontology, but I think there is a lot we can do now.
> 
>         The current methodology is to parse terms into xp defs, run the
>         reasoner via script, and have people look through reports. I
>         don't think this is scalable. The first time we did this, we
>         ended up having to fix 1000 links. Looking at the current
>         reports[*] there are nearly 1000 more links needing fixed. This
>         translates to an alarmingly high rate of false negatives in
>         search results and analyses.
> 
>         David and Tanya like to look through the reports as examining
>         the reasoner results individually allows them to find errors in
>         the source graph. This happens if someone puts the biologically
>         correct link in the regulation graph but neglects to do the
>         equivalent in the source graph. The fact this is happening at
>         all is worrying - no one should be editing the regulation graph
>         manually!
> 
>         It would be really easy to automate things more, even whilst
>         waiting for OE2.
> 
>         One idea would be to have a file in CVS that just contains a
>         list of terms or term requests. E.g.
> 
>            regulation of blood vessel remodeling
>            negative regulation of blood vessel remodeling
>            positive regulation of blood vessel remodeling
> 
>         Or it could just be the source term, with the request for 3
>         regulation terms being implicit
> 
>         A nightly script would generate obo including synonyms, standard
>         defs and correct placement in the obo file. It would write the
>         unparseables to a report file. It could even be automatically
>         merged but I'm guessing folks would prefer some kind of
>         oversight. Either way it should just be a simple script to merge
>         the results in. We could probably use obomerge.
> 
>         Folks should also be aware that as far as "standard" regulation
>         terms go OE2 has a lot of useful functionality. For example, you
>         can just create a bunch of regulation terms under biological
>         process, run the semantic parser (under the reasoner menu) to
>         generate the xp defs, run the reasoner, then select "assert
>         implied links". The main fiddly step here is that the
>         intersection_of lines have to be stripped before going into the
>         editors version. Should be possible with save filters.
> 
>         I'm leaning towards doing this out of oboedit. The file-in-cvs
>         approach fits well with the annotator submission model. It's
>         easily extensible beyond standard regulation terms. It does
>         introduce a "fork" in the term submission process, but I think
>         it creates less work and less errors all round. I could also
>         implement the whole thing in < 5 mins.
> 
>         A variation is a web form that allows you to build xps and then
>         submit them.
> 
>         Thoughts? Call?
> 
>         Cheers
>         Chris
> 
>         [*]
>         http://wiki.geneontology.org/index.php/XP:biological_process_xp_regulation#Availability
> 
> 
>         _______________________________________________
>         Ontology-editors mailing list
>         Ontology-editors at geneontology.org
>         <mailto:Ontology-editors at geneontology.org>
>         http://fafner.stanford.edu/mailman/listinfo/ontology-editors
> 
> 
> 
> 
> -- 
> Tanya Berardini
> TAIR Curator
> www.arabidopsis.org <http://www.arabidopsis.org>
> 
> 
> ------------------------------------------------------------------------
> 
> _______________________________________________
> Ontology-editors mailing list
> Ontology-editors at geneontology.org
> http://fafner.stanford.edu/mailman/listinfo/ontology-editors

-- 
Jennifer Deegan (nee Clark)
EMBL-European Bioinformatics Institute
Gene Ontology Consortium


More information about the Ontology-editors mailing list