[Go] Fwd: GAF changes, addition of new columns
Mike Cherry
cherry at stanford.edu
Thu May 22 15:37:03 PDT 2008
I'm okay with most of this. However, if the changes are not announced to
gofriends and in the newsletter until the first of August, then October
1 is too soon for the switch. We know the GAF files are used by many,
and two months is too short to be fair to our users and allow them a
reasonable time to make the needed changes. Not everyone will be able to
jump on changing their scripts immediately. The end of October or the
first of November would be better.
That would also mean that all files would change at that time. The
filtering script could add the extra columns if they are not already
present in a GAF. This of course would not cover changes in the type
column.
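
As a rough illustration of the padding idea above, here is a minimal
Python sketch. It is not the actual filtering script: the function name
pad_gaf_line and the line-by-line approach are assumptions made for the
example. It treats lines starting with "!" as comments and pads
tab-delimited annotation lines with empty fields up to 17 columns.

```python
def pad_gaf_line(line, total_columns=17):
    """Pad one tab-delimited GAF annotation line with empty trailing columns.

    Hypothetical helper for illustration only; the real filtering script
    may work differently.
    """
    stripped = line.rstrip("\n")
    # Comment/header lines start with "!"; pass them (and blank lines) through.
    if not stripped or stripped.startswith("!"):
        return stripped
    fields = stripped.split("\t")
    # Append empty fields only when the line is short; never truncate.
    if len(fields) < total_columns:
        fields.extend([""] * (total_columns - len(fields)))
    return "\t".join(fields)


if __name__ == "__main__":
    old_style = "\t".join(f"col{i}" for i in range(1, 16))  # 15 columns
    padded = pad_gaf_line(old_style + "\n")
    print(len(padded.split("\t")))
```

Padding like this only fills in the new columns with empty values; it
cannot fix the meaning of the type column.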
One other thing that was discussed was the switch to using SO IDs in
the type column. I like that idea.
-Mike
On May 22, 2008, at 2:23 PM, Chris Mungall wrote:
>
> Sorry for the late response.
>
> My feeling is this should be dropped from the current newsletter; it
> should wait until we can provide a URL that goes to a detailed
> specification.
>
> Currently all we have is the wiki page I made prior to the meeting:
>
> http://gocwiki.geneontology.org/index.php/Annotation_of_Alternate_Spliceforms
>
> This page is intended to give background so that the group could
> make an informed decision; it's not intended as the final spec.
>
> Also: although the proposal was accepted at the RefG meeting (see
> [1]), there were a number of issues that cropped up that have not
> yet been reflected in the above wiki page.
>
> In particular, there was the issue of the meaning of the
> DB_Object_Type column. In my original proposal this would continue
> to retain the current meaning, i.e. the type of the entity
> referenced in col2 (i.e. the generic entity: a gene or generic
> protein). However, the majority of the participants at the meeting
> felt that this was not a good idea and that the type column should
> instead be the type of the entity referenced in the new col17 (i.e.
> the specific entity: e.g. a spliceform).
>
> My original proposal was a standardization of best practice plus
> additional optional information with no change in meaning in
> existing columns; the revised form we decided at the meeting (not
> yet documented) involved a change in meaning. As such, it requires
> an especially long lead time.
>
> The action items stated:
>
> 1. update documentation
> 2. write notice of changes to users
> 3. individual data providers make sure that their input matches
> 4. software changes as necessary
>
> But they didn't give any dates.
>
> I suggest:
>
> June 6: The software group will create updated documentation on the
>   wiki, including a detailed specification that GAF providers and
>   consumers can follow
> June 13: GOC provides feedback & refinement, signoff
> June 20: email goes out to gofriends
> August ?: announcement in newsletter
> October 1: GAF files conforming to new spec go live
>
> Note that the people who need to know most in advance probably read
> gofriends more than the newsletter (a hunch, I have no evidence for
> this)
>
> I'm sympathetic to the view we should be as open as possible, but I
> worry about announcing too early and causing consternation and
> confusion
>
> [1] http://wiki.geneontology.org/index.php/Reference_Genome_Meeting_Minutes_April_2008#Annotation_Pipeline.2C_Part_1:_Generation_of_protein_sets_.28Suzi_Lewis.29
>
> ACTION ITEMS
>
> 1. update documentation
> 2. write notice of changes to users
> 3. individual data providers make sure that their input matches
> 4. software changes as necessary
> 5. add header to gene association file
> 6. syntax of gp2protein file will be provided by Mike and Chris
>
> On May 20, 2008, at 11:01 AM, Mike Cherry wrote:
>
>> Decision needed.
>>
>> There have been no comments from the software group. I feel we
>> could include the date(s) for the GAF format change in the newsletter
>> that is in final edits now.
>>
>> I'll be traveling until Saturday so I hope someone else can make
>> sure this is decided and included in the newsletter.
>>
>> -Mike
>>
>>
>> Begin forwarded message:
>>
>>> From: Mike Cherry <cherry at stanford.edu>
>>> Date: May 19, 2008 8:52:38 AM PDT
>>> To: software-group Group <software-group at genome.stanford.edu>
>>> Subject: GAF changes, addition of new columns
>>>
>>> Hi,
>>>
>>> Everyone asks me when the new columns will become required. What
>>> do you all think?
>>>
>>> I'd suggest September or October. We need to define a date. A GO
>>> newsletter will go to press, so to speak, this week. The next
>>> newsletter will be in August. That will give two newsletters
>>> announcing the change, and having the switch over date be a month
>>> or two after the August newsletter.
>>>
>>> Part of this is when groups can start submitting a GAF with the
>>> additional columns, and when it would become mandatory. There are
>>> two additional columns involved: the long-reserved column for
>>> properties (column 16) and the newly agreed one for canonical gene
>>> name (column 17).
>>>
>>> Any counter proposals? Concerns? I've heard from Harold that
>>> this change will take a few months for the MGI software staff to
>>> put into production.
>>>
>>> -Mike
>>
>> _______________________________________________
>> Go mailing list
>> Go at geneontology.org
>> http://fafner.stanford.edu/mailman/listinfo/go