[Ontology-editors] Automating regulation terms (and combinatorial terms in general)

David Hill dph at informatics.jax.org
Fri Aug 15 05:40:50 PDT 2008


This would certainly free Tanya and I up to do a lot of other stuff. We 
edit the regulation graph by hand a lot! In fact, we will be doing it 
this afternoon.

David

Chris Mungall wrote:
> I'm kind of embarassed that poor curators like Ruth have to type out 
> sourceforge requests like this:
>
> * 
> https://sourceforge.net/tracker/index.php?func=detail&aid=2045154&group_id=36855&atid=440764 
>
>
> And then someone has to manually add this into the graph being careful 
> to get the is_a links correct
>
> Regulation term creation (and much xp creation in general) should be 
> far more automatable. This will be easier with OE2 when we can have 
> the intersection_of lines live in the main ontology, but I think there 
> is a lot we can do now.
>
> The current methodology is to parse terms into xp defs, run the 
> reasoner via script, and have people look through reports. I don't 
> think this is scalable. The first time we did this, we ended up having 
> to fix 1000 links. Looking at the current reports[*] there are nearly 
> 1000 more links needing fixed. This translates to an alarmingly high 
> rate of false negatives in search results and analyses.
>
> David and Tanya like to look through the reports as examining the 
> reasoner results individually allows them to find errors in the source 
> graph. This happens if someone puts the biologically correct link in 
> the regulation graph but neglects to do the equivalent in the source 
> graph. The fact this is happening at all is worrying - no one should 
> be editing the regulation graph manually!
>
> It would be really easy to automate things more, even whilst waiting 
> for OE2.
>
> One idea would be to have a file in CVS that just contains a list of 
> terms or term requests. E.g.
>
>     regulation of blood vessel remodeling
>     negative regulation of blood vessel remodeling
>     positive regulation of blood vessel remodeling
>
> Or it could just be the source term, with the request for 3 regulation 
> terms being implicit
>
> A nightly script would generate obo including synonyms, standard defs 
> and correct placement in the obo file. It would write the unparseables 
> to a report file. It could even be automatically merged but I'm 
> guessing folks would prefer some kind of oversight. Either way it 
> should just be a simple script to merge the results in. We could 
> probably use obomerge.
>
> Folks should also be aware that as far as "standard" regulation terms 
> go OE2 has a lot of useful functionality. For example, you can just 
> create a bunch of regulation terms under biological process, run the 
> semantic parser (under the reasoner menu) to generate the xp defs, run 
> the reasoner, then select "assert implied links". The main fiddly step 
> here is that the intersection_of lines have to be stripped before 
> going into the editors version. Should be possible with save filters.
>
> I'm leaning towards doing this out of oboedit. The file-in-cvs 
> approach fits well with the annotator submission model. It's easily 
> extensible beyond standard regulation terms. It does introduce a 
> "fork" in the term submission process, but I think it creates less 
> work and less errors all round. I could also implement the whole thing 
> in < 5 mins.
>
> A variation is a web form that allows you to build xps and then submit 
> them.
>
> Thoughts? Call?
>
> Cheers
> Chris
>
> [*] 
> http://wiki.geneontology.org/index.php/XP:biological_process_xp_regulation#Availability 
>
>
> _______________________________________________
> Ontology-editors mailing list
> Ontology-editors at geneontology.org
> http://fafner.stanford.edu/mailman/listinfo/ontology-editors


More information about the Ontology-editors mailing list