[Go] Spliceform column in GAF

Chris Mungall cjm at berkeleybop.org
Wed Sep 10 13:41:15 PDT 2008


But this would be the type for the locus/gene/whatever that encodes  
the gene product, not the gene product itself.

On Sep 10, 2008, at 12:01 PM, Mike Cherry wrote:

> I think column 12 should be mandatory.  I'd suggest locus when you  
> don't know.  However locus is not in SO.  The root node of SO is  
> region, it follows our usage with GO, use the root node.
>
> -Mike
>
> On Sep 10, 2008, at 11:29 AM, Chris Mungall wrote:
>
>>
>> On Sep 10, 2008, at 11:13 AM, Rama Balakrishnan wrote:
>>
>>> Chris,
>>>
>>> I am trying to understand this proposal. So please bear with me.
>>>
>>> Column 17 is optional and column 12 is mandatory. If you don't  
>>> know the spliceform of the product in col 2 and hence leave col 17  
>>> as blank, then what would you put in column 12?
>>
>> protein, if it's a protein-coding gene. RNA if RNA-coding etc.
>>
>> I didn't document the case where the gene has not been molecularly  
>> characterized and we have IGI annotations. I'm open to suggestions  
>> here. Perhaps the best thing is to allow this column to be blank if  
>> we truly do not know if the gene is protein coding or not.
>>
>> However, if the gene is known to be protein coding yet the  
>> particular spliceform is not known then the type column should be  
>> protein
>>
>>> Because as I understand the proposal, what is in Col12 should  
>>> reflect the type of the spliceform in col 17?
>>
>> Yes, but we're making the open-world assumption here: absence of  
>> data does not mean an absence of the entity in reality.
>>
>> Thanks for the questions, looks like the document needs work to  
>> make it more readable.
>>
>> We only had a very cursory discussion of the type column in SLC,  
>> and we certainly didn't give people time to absorb the ramifications.
>>
>>>
>>> Thanks,
>>>
>>> Rama
>>>
>>> On Sep 10, 2008, at 10:51 AM, Chris Mungall wrote:
>>>
>>>>
>>>> If a group submitted annotations for two records corresponding to  
>>>> the same gene this would be in violation. The most likely way for  
>>>> this to happen would be when a MOD submits annotations to both  
>>>> UniProtKB IDs and MOD IDs.
>>>>
>>>> On Sep 10, 2008, at 10:34 AM, Stoddard, Alexander wrote:
>>>>
>>>>> I do not clearly understand the following part of the spec  
>>>>> regarding
>>>>> non-redundant canonical entities:
>>>>>
>>>>> "In addition the GAF must be non-redundant with respect to  
>>>>> canonical
>>>>> entities in a genome"
>>>>>
>>>>> Chris, would you please give an example of how a GAF file could be
>>>>> redundant with respect to canonical entities and how to correct  
>>>>> the
>>>>> example?
>>>>>
>>>>> Thank you,
>>>>> Alex Stoddard
>>>>>
>>>>>
>>>>> -----Original Message-----
>>>>> From: go-bounces at genome.stanford.edu
>>>>> [mailto:go-bounces at genome.stanford.edu] On Behalf Of Chris Mungall
>>>>> Sent: Tuesday, September 09, 2008 5:30 PM
>>>>> To: go list; Paul D Thomas
>>>>> Subject: [Go] Spliceform column in GAF [was Re: [Gofriends]  
>>>>> Redundancy
>>>>> ingo_XXXXXX-assocdb-tables/dbxref.txt]
>>>>>
>>>>> [redirected to GO]
>>>>>
>>>>> The change Mike speaks of is for the new spliceform column in  
>>>>> the GAF.
>>>>>
>>>>> I have specced this out here:
>>>>>
>>>>> 	
>>>>> http://wiki.geneontology.org/index.php/GAF_Spliceform_Column_Proposal
>>>>>
>>>>> Note that most of you will have read the previous document  
>>>>> describing
>>>>> current practices for annotating alternate spliceforms:
>>>>>
>>>>> 	
>>>>> http://wiki.geneontology.org/index.php/Annotation_of_Alternate_Splicefor
>>>>> ms
>>>>>
>>>>> But you won't have read the fully formulated proposal, as I only  
>>>>> put
>>>>> it on the wiki today.
>>>>>
>>>>> Note that this proposal was ratified at the SLC GOC meeting, but  
>>>>> the
>>>>> majority of the discussion was at the RefG portion of the meeting.
>>>>> It's particularly important that folks who weren't at this part  
>>>>> read
>>>>> and understand the proposal. Ratification at the GOC meeting may  
>>>>> have
>>>>> been premature as I only intended to sketch out a solution
>>>>> collaboratively at that meeting.
>>>>>
>>>>> Once the above wiki page is in shape, we should send an  
>>>>> announcement
>>>>> to gofriends (promptly, as it is of relevance to the current
>>>>> discussion below), all data providers and consumers, and then  
>>>>> after
>>>>> that in the newsletter and on the main GO docs.
>>>>>
>>>>> As Mike says we are aiming for a introduction some time in 2009.  
>>>>> It's
>>>>> important that anyone involved with producing GAFs is aware of the
>>>>> changes and is OK with this timetable.
>>>>>
>>>>> Cheers
>>>>> Chris
>>>>>
>>>>> On Sep 9, 2008, at 1:22 PM, Mike Cherry wrote:
>>>>>
>>>>>> There is a change coming to the format of the gene association  
>>>>>> file
>>>>>> which will solve this problem.  Annotations to proteins, gene,
>>>>>> transcripts, etc for a particular locus will be identified as  
>>>>>> such.
>>>>>> The change should occur in 2009.
>>>>>>
>>>>>> -Mike
>>>>>>
>>>>>>
>>>>>>> From: "Quaid Morris" <quaid.morris at gmail.com>
>>>>>>> To: "Gabriel Berriz" <gberriz at hms.harvard.edu>
>>>>>>> Subject: Re: [Gofriends] Redundancy in go_XXXXXX-assocdb-tables/
>>>>>>> dbxref.txt
>>>>>>> Cc: gofriends at genome.stanford.edu
>>>>>>>
>>>>>>> Hi Gabriel,
>>>>>>>
>>>>>>> It looks like in the example that you gave RGD ID 1302948 is a  
>>>>>>> gene
>>>>>>> ID and
>>>>>>> ENSRNOP00000034933 is a protein ID.  Are all your examples like
>>>>>>> this?  Maybe
>>>>>>> there are circumstances when it's possible to annotate a  
>>>>>>> specific
>>>>>>> isoform
>>>>>>> and others when only the gene can be annotated.
>>>>>>>
>>>>>>> Q
>>>>>>>
>>>>>> _______________________________________________
>>>>>> Gofriends mailing list
>>>>>> Gofriends at geneontology.org
>>>>>> http://fafner.stanford.edu/mailman/listinfo/gofriends
>>>>>>
>>>>>
>>>>> _______________________________________________
>>>>> Go mailing list
>>>>> Go at geneontology.org
>>>>> http://fafner.stanford.edu/mailman/listinfo/go
>>>>>
>>>>
>>>> _______________________________________________
>>>> Go mailing list
>>>> Go at geneontology.org
>>>> http://fafner.stanford.edu/mailman/listinfo/go
>>>
>>>
>>
>> _______________________________________________
>> Go mailing list
>> Go at geneontology.org
>> http://fafner.stanford.edu/mailman/listinfo/go
>
>



More information about the Go mailing list