[go] Requirement for all 'unknown' annotations to use ND code
Jim Hu
jimhu at tamu.edu
Mon Sep 17 10:02:25 PDT 2007
On Sep 17, 2007, at 11:07 AM, Valerie Wood wrote:
>
>
>
> I don't see how you can make an annotation to the root node using
> RCA/IC/IMP/ISS or IDA?
We haven't done these yet, but
ISS - similarity to proteins annotated to the root node with ND in
another organism?
IMP - What does one do for large scale knockout screens when a KO
shows no phenotype. Someone did look, so it's not really ND, is it?
IDA seems pretty hard to rationalize. I can imagine negative
results, as in "previous analysis suggested that gene X has activity
Y, but we can't detect it" but wouldn't that get a NOT modifier for
the assayed activity Y, if it was annotated at all? I'm actually
thinking of a case where paper A says that an E. coli protein is a
nuclease, and paper B shows that the nuclease activity is a
contaminant. I'm thinking there are the following choices if that
results in not knowing the function of the gene X product:
* delete the annotation from paper A
** no annotation to the root node
** annotate to the root node
* add the annotation from paper B with a not Y
** no annotation to the root node
** annotate to the root node
I recall discussing this kind of situation with Karen, but I'm not
sure that we covered how to handle the root node. Does this change
if the information that the putative activity was a contaminant is
not published but the curator knows about it from a meeting or a
personal communication?
Similarly, can one annotate to the root node with RCA if a
computational analysis shows that protein X does not have previously
suggested activity Y based on improved sophistication of motif
analysis? Again, if this removes the only putative activity from an
earlier analysis, does the protein get a root node annotation, or
does it get nothing? Example, a protein is annotated by some project
as a thioredoxin based on good sequence similarity to the fold family
members. Later, someone notices that the active site residues are
missing.
Jim
>
> The ND means the curator has looked at all the papers for this gene
> (and for some databases checked the annotations to orthologs to see
> if any sensible inferences can be made), and as of the data the
> annotation was mane there is "no data".
>
> We wouldn't be able to do this with any of the other evidence codes.
>
> Val
>
>
> Suzanna Lewis wrote:
>
>> After reading through this thread I see no strong reason for
>> requiring ND as the evidence code for annotations to the root.
>>
>> In fact, I'm now wondering why we have ND at all. Seems to me
>> that "no data" is a result. It is not the type of experiment that
>> was done. Maybe the only accurate use of ND is when we don't even
>> know what kind of experiment was carried out.
>>
>> -S
>>
>> On Sep 17, 2007, at 8:42 AM, Valerie Wood wrote:
>>
>>> So we don't all need to run the query......
>>>
>>>
>>> biological_process Dictybase ISS 1
>>> biological_process Dictybase ND 1313
>>> biological_process FB ND 1022
>>> biological_process GeneDB_Pfalciparum ND 702
>>> biological_process GeneDB_Spombe ND 1021
>>> biological_process GeneDB_Tbrucei ND 1087
>>> biological_process GeneDB_Tbrucei TAS 1
>>> biological_process GR_protein IC 11
>>> biological_process MGI IDA 1
>>> biological_process MGI IMP 2
>>> biological_process MGI ND 1382
>>> biological_process PseudoCAP IDA 13
>>> biological_process PseudoCAP ISS 2
>>> biological_process PseudoCAP RCA 26
>>> biological_process RGD IEA 1
>>> biological_process RGD ND 607
>>> biological_process SGD IMP 1
>>> biological_process SGD NAS 1
>>> biological_process SGD ND 1429
>>> biological_process SGD TAS 1
>>> biological_process TAIR ND 11086
>>> biological_process TAIR RCA 12
>>> biological_process TAIR TAS 3
>>> biological_process TIGR_CMR ND 19190
>>> biological_process TIGR_Tba1 ND 194
>>> biological_process UniProt IEA 6
>>> biological_process UniProt ND 966
>>> biological_process WB IMP 1326
>>> biological_process WB ND 2
>>> biological_process ZFIN ND 5269
>>> cellular_component Dictybase ISS 3
>>> cellular_component Dictybase ND 1551
>>> cellular_component FB ISS 1
>>> cellular_component FB ND 2058
>>> cellular_component GeneDB_Pfalciparum ND 288
>>> cellular_component GeneDB_Spombe ND 190
>>> cellular_component GeneDB_Tbrucei NAS 2
>>> cellular_component GeneDB_Tbrucei ND 1623
>>> cellular_component GeneDB_Tbrucei TAS 1
>>> cellular_component GR_protein TAS 8
>>> cellular_component MGI ND 1362
>>> cellular_component MGI TAS 1
>>> cellular_component PseudoCAP IDA 13
>>> cellular_component PseudoCAP ISS 2
>>> cellular_component RGD ND 718
>>> cellular_component SGD ND 972
>>> cellular_component SGD TAS 1
>>> cellular_component TAIR ND 9877
>>> cellular_component TAIR TAS 12
>>> cellular_component TIGR_CMR ND 14318
>>> cellular_component TIGR_Tba1 NAS 2
>>> cellular_component TIGR_Tba1 ND 184
>>> cellular_component UniProt ND 1278
>>> cellular_component WB ND 55
>>> cellular_component ZFIN ND 6283
>>> molecular_function Dictybase ND 1064
>>> molecular_function FB ND 1935
>>> molecular_function FB TAS 1
>>> molecular_function GeneDB_Lmajor IEA 57
>>> molecular_function GeneDB_Pfalciparum IEA 38
>>> molecular_function GeneDB_Pfalciparum ND 789
>>> molecular_function GeneDB_Spombe ND 1452
>>> molecular_function GeneDB_Tbrucei IEA 44
>>> molecular_function GeneDB_Tbrucei ND 977
>>> molecular_function GeneDB_Tbrucei TAS 7
>>> molecular_function GR_protein IEA 255
>>> molecular_function GR_protein RCA 15
>>> molecular_function MGI ND 1381
>>> molecular_function PseudoCAP IDA 13
>>> molecular_function PseudoCAP ISS 2
>>> molecular_function PseudoCAP RCA 46
>>> molecular_function RGD ND 701
>>> molecular_function SGD ISS 4
>>> molecular_function SGD NAS 1
>>> molecular_function SGD ND 2166
>>> molecular_function SGD TAS 19
>>> molecular_function TAIR NAS 3
>>> molecular_function TAIR ND 10095
>>> molecular_function TAIR RCA 403
>>> molecular_function TAIR TAS 72
>>> molecular_function TIGR_CMR ND 19337
>>> molecular_function TIGR_Tba1 ND 181
>>> molecular_function TIGR_Tba1 TAS 7
>>> molecular_function UniProt ND 1124
>>> molecular_function WB NAS 1
>>> molecular_function WB ND 51
>>> molecular_function WB TAS 2
>>> molecular_function ZFIN ND 4950
>>>
>>>
>>> --
>>> The Wellcome Trust Sanger Institute is operated by Genome
>>> Research Limited, a charity registered in England with number
>>> 1021457 and a company registered in England with number 2742969,
>>> whose registered office is 215 Euston Road, London, NW1 2BE.
>>
>>
>>
>
>
>
> --
> The Wellcome Trust Sanger Institute is operated by Genome Research
> Limited, a charity registered in England with number 1021457 and a
> company registered in England with number 2742969, whose registered
> office is 215 Euston Road, London, NW1 2BE.
=====================================
Jim Hu
Associate Professor
Dept. of Biochemistry and Biophysics
2128 TAMU
Texas A&M Univ.
College Station, TX 77843-2128
979-862-4054
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://fafner.stanford.edu/pipermail/go/attachments/20070917/3d0970ac/attachment.html
More information about the Go
mailing list