[Go] [go] annotations for refgenomes
David Hill
dph at informatics.jax.org
Thu Feb 28 04:47:50 PST 2008
Hi Sue and Mike,
I think including/separating out the annotations to the root nodes would
be informative as well.
David
Sue Rhee wrote:
> Great. Actually, it would be very interesting to include the unknowns
> in the graph. Since you color-code by the evidence codes, the NDs
> would be a separate color, no? I think knowing the extent of the
> unknown is quite useful..
>
> I would love to see last year's graph. I think these graphs would be
> very cool to have on the GO website at some point, given that the
> participating groups are OK with that.
>
> Cheers,
> Sue
>
> Mike Cherry wrote:
>> Sue,
>>
>> Thanks I'm glad you like the graph. I have the graph from last year
>> if anyone wants it.
>>
>> Good question about the filtering. The filtering different between
>> the two graphs simply refers to the nightly filtering of gene
>> association files. This includes a large list of potential "errors"
>> that are removed including badly formated information, syntax
>> errors, use of obsolete GOIDs. The normal filtering. Thats good
>> that there isn't much different between the two graphs.
>>
>> For both graphs I remove all the annotations to the roots:
>> GO:0008150, GO:0003674, GO:0005575. Then count the number of IDs in
>> column 2. For TAIR there are 20,500 unique IDs for Function
>> annotations with the root annotations excluded.
>>
>> -Mike
>>
>>
>> On Feb 27, 2008, at 6:25 PM, Sue Rhee wrote:
>>
>>
>>> Hi Mike,
>>>
>>> It's a beautiful graph. Thanks for sharing. I have a couple of
>>> questions though. What is being filtered out? It seems to me the
>>> two graphs are very similar. Also I'm a little confused about the
>>> total number of genes. In TAIR, we have 24465 (for function), 23534
>>> (for process), and 22038 (for component) protein-encoding genes
>>> annotated with GO, but the graphs seem to be showing different
>>> numbers.
>>>
>>> Sue
>>>
>>> Mike Cherry wrote:
>>>
>>>> Graphs I made list week. One is from the submitted GA file and
>>>> one from the filtered GA file.
>>>>
>>>> -Mike
>>>>
>>>>
>>>>
>>>>
>>>>
>>>>
>>> --
>>> Sue Rhee
>>> Staff Scientist
>>> Carnegie Institution, Department of Plant Biology
>>> 260 Panama Street, Stanford, CA 94305
>>> Email: (650) 325-1521 x251
>>> Fax: (650) 325-6857
>>>
>>
>> _______________________________________________
>> Go mailing list
>> Go at geneontology.org
>> http://fafner.stanford.edu/mailman/listinfo/go
>>
>
> ------------------------------------------------------------------------
>
> _______________________________________________
> Go mailing list
> Go at geneontology.org
> http://fafner.stanford.edu/mailman/listinfo/go
>
More information about the Go
mailing list