[Ontology-editors] filtering of .obo files for the main release

Chris Mungall cjm at berkeleybop.org
Tue Feb 17 16:08:13 PST 2009


done

On Feb 5, 2009, at 4:19 PM, Mike Cherry wrote:

> I'll put it back on you to get someone to update the README to
> clearly say that gene_ontology_edit.obo,
> editors/gene_ontology_write.obo and
> obo_format_1_2/gene_ontology_ext.obo are the same.
>
> Here is what the README says now.
>
> !version: $Revision: 1.1 $
> !date: $Date: 2006/10/12 04:15:01 $
> go/ontology contents documentation
>
> Gene Ontology - The Ontologies
> ---------------------------------------------
>
> The current format used by the Gene Ontology Consortium (GOC) is
> called OBO v1.2.  The file corresponding to this format is
> gene_ontology_edit.obo.  The specification for OBO v1.2 is provided at
> the following location:
>
>  http://www.geneontology.org/GO.format.obo-1_2.shtml
>
> The other files in this directory contain older formats that were
> previously used by the GOC.  The specifications of the older formats,
> and others used by the GOC, can be found at:
>
>  http://www.geneontology.org/GO.format.shtml
>
> The gene_ontology.obo file is OBO v1.0 format.  It was used by GOC
> from February 14, 2004 to October 11, 2006.  This file is recreated
> daily from the OBO v1.2 file.
>
> The other files represent the original so-called "flat file" format
> used by the GOC between June 16, 1999 and February 14, 2004.  The
> ontologies were split into separate files: process.ontology,
> function.ontology, and component.ontology.  The term definitions were
> also in a separate file named GO.defs.  These files are recreated once
> a week from the OBO v1.2 file.
>
> For help please contact the GOC at gohelp at geneontology.org
>
>
> On Feb 5, 2009, at 3:19 PM, Chris Mungall wrote:
>
>> here's the rationale for the 3 files
>>
>>> obo_format_1_2/gene_ontology_ext.obo
>>
>> cutting edge users.
>>
>>>
>>> obo_format_1_2/gene_ontology.1_2.obo
>>
>> the default (safe) file, where most go file consumers should go
>>
>>> obo_format_1_0/gene_ontology.1_0.obo
>>
>>
>> legacy. I wouldn't be sorry to see it go but we should give notice.
>> Unfortunately this was necessary at the time as 1.2 introduced
>> non-backwards-compatible changes
>>
>> On Feb 5, 2009, at 2:59 PM, Mike Cherry wrote:
>>
>>> Chris,
>>>
>>> The script now runs every day at 7:30 pm as part of the other
>>> conversions.  All these commands are in a wrapper script, not
>>> separated into individual cron jobs.  All the errors go to one log
>>> file.  The new file (obo_format_1_2/gene_ontology_ext.obo) is now
>>> in CVS; of course, for now it is the same as
>>> obo_format_1_2/gene_ontology.1_2.obo.
>>>
>>> We have three OBO1.2 files that are the same:
>>> gene_ontology_edit.obo
>>> editors/gene_ontology_write.obo
>>> obo_format_1_2/gene_ontology_ext.obo
>>>
>>> One of the reduced OBO 1.2:
>>> obo_format_1_2/gene_ontology.1_2.obo
>>>
>>> And two of the OBO1.0:
>>> gene_ontology.obo
>>> obo_format_1_0/gene_ontology.1_0.obo
>>>
>>> Do we really need all these copies?  Seems we should delete some
>>> of these.  We can report the deletion to GOFRIENDS and state the
>>> change in the README file in each directory.  For me the big
>>> concern is that all the files have different CVS versions.  Someone
>>> looking for a file will be very confused.
>>>
>>> -Mike
>>>
>>>
>>>
>>> On Feb 4, 2009, at 6:05 PM, Chris Mungall wrote:
>>>
>>>>
>>>> I have moved this script to the main GO cvs and renamed it:
>>>> 	./software/utilities/filter-obo-for-standard-release.pl
>>>>
>>>> I have tested it and it seems fine.
>>>>
>>>> Updated the docs:
>>>> 	http://wiki.geneontology.org/index.php/Ontology_publishing_pipeline_2009
>>>>
>>>> So we can make the switch any time prior to the links going live.
>>>>
>>>> I can't see the crontab but I imagine it's just a change from  
>>>> something like
>>>>
>>>> 	cp ontology/editors/gene_ontology_write.obo ontology/obo_format_1_2/gene_ontology.1_2.obo
>>>> 	...
>>>>
>>>> to
>>>>
>>>> 	cp ontology/editors/gene_ontology_write.obo ontology/obo_format_1_2/gene_ontology_ext.obo
>>>> 	filter-obo-for-standard-release.pl ontology/obo_format_1_2/gene_ontology_ext.obo > ontology/obo_format_1_2/gene_ontology.1_2.obo
>>>>
>>>> it's a little chatty on STDERR, we can pipe it to a file or
>>>> /dev/null, doesn't really matter
>>>>
>>>> On Jan 27, 2009, at 1:48 PM, Mike Cherry wrote:
>>>>
>>>>> Looks fine.  The current cron job runs at 7:30 pm.
>>>>>
>>>>> -Mike
>>>>>
>>>>> On Jan 27, 2009, at 11:40 AM, Chris Mungall wrote:
>>>>>
>>>>>> Hi Mike
>>>>>>
>>>>>> The other week Ben, David and Tanya discussed a change to the  
>>>>>> obo publishing pipeline, introducing a filtering step to shield  
>>>>>> the majority of users from certain extensions to the GO, whilst  
>>>>>> at the same time allowing power-users a means of accessing the  
>>>>>> extended GO outside the editors space.
>>>>>>
>>>>>> We will probably discuss this tomorrow, but we will also need  
>>>>>> to coordinate on the cron changes required. I have written up  
>>>>>> the proposed flow here:
>>>>>> http://wiki.geneontology.org/index.php/Ontology_publishing_pipeline_2009
>>>>>
>>>>>
>>>
>>>
>
>
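
For reference, the two-step cron change discussed in the thread above (copy the editors' file to gene_ontology_ext.obo, then filter it into gene_ontology.1_2.obo with STDERR sent to a log) might be wrapped roughly as follows. This is only a sketch, not the actual wrapper script: the function name publish_obo, the FILTER and LOG variables, and the temporary-file step are assumptions added for illustration. Only the file paths and the filter script name come from the thread.

```shell
#!/bin/sh
# Sketch of the nightly publishing step described in the thread.
# File paths follow the thread; FILTER, LOG and publish_obo are
# hypothetical names introduced for this example.

FILTER="${FILTER:-filter-obo-for-standard-release.pl}"
LOG="${LOG:-/dev/null}"

publish_obo() {
    root="$1"   # directory containing editors/ and obo_format_1_2/
    src="$root/editors/gene_ontology_write.obo"
    ext="$root/obo_format_1_2/gene_ontology_ext.obo"
    std="$root/obo_format_1_2/gene_ontology.1_2.obo"

    # Unfiltered copy for users who want the extended ontology.
    cp "$src" "$ext" || return 1

    # Write the filtered output to a temporary file first, so a failed
    # run does not truncate the published standard file (an assumption;
    # the thread simply redirects straight into the target).
    tmp="$std.tmp.$$"
    if $FILTER "$ext" > "$tmp" 2>> "$LOG"; then
        mv "$tmp" "$std"
    else
        rm -f "$tmp"
        return 1
    fi
}
```

With the script sourced, a call such as `publish_obo ontology` would correspond to the two commands quoted above; setting LOG to a file keeps the filter's chatty STDERR output alongside the other conversion errors.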


