[Go] [go] annotations for refgenomes
Mike Cherry
cherry at stanford.edu
Wed Feb 27 20:17:30 PST 2008
Sue,
Thanks I'm glad you like the graph. I have the graph from last year
if anyone wants it.
Good question about the filtering. The filtering different between
the two graphs simply refers to the nightly filtering of gene
association files. This includes a large list of potential "errors"
that are removed including badly formated information, syntax errors,
use of obsolete GOIDs. The normal filtering. Thats good that there
isn't much different between the two graphs.
For both graphs I remove all the annotations to the roots: GO:0008150,
GO:0003674, GO:0005575. Then count the number of IDs in column 2.
For TAIR there are 20,500 unique IDs for Function annotations with the
root annotations excluded.
-Mike
On Feb 27, 2008, at 6:25 PM, Sue Rhee wrote:
> Hi Mike,
>
> It's a beautiful graph. Thanks for sharing. I have a couple of
> questions though. What is being filtered out? It seems to me the two
> graphs are very similar. Also I'm a little confused about the total
> number of genes. In TAIR, we have 24465 (for function), 23534 (for
> process), and 22038 (for component) protein-encoding genes annotated
> with GO, but the graphs seem to be showing different numbers.
>
> Sue
>
> Mike Cherry wrote:
>> Graphs I made list week. One is from the submitted GA file and one
>> from the filtered GA file.
>>
>> -Mike
>>
>>
>>
>>
>>
>
> --
> Sue Rhee
> Staff Scientist
> Carnegie Institution, Department of Plant Biology
> 260 Panama Street, Stanford, CA 94305
> Email: (650) 325-1521 x251
> Fax: (650) 325-6857
More information about the Go
mailing list