[Go] addition of localization specific process terms ?

Harold Drabkin hjd at informatics.jax.org
Tue Mar 24 11:01:39 PDT 2009


However, as I was just told, "interleukin" does not describe a family in 
a related sequence sense, but in a functional  sense, so we can't have 
an "interleukin binding" term, so that idea is not good

h

Harold Drabkin wrote:
> Just a quick note: a lot of the children of the protein binding term 
> are used when a family is involved: eg. actin binding, and is useful 
> where the authors did not specify which of several possible actins was 
> involved. In this case, we use IDA instead of IPI, so at least the 
> information that something specifically binds (one of the ) actin.
> I was concerned some time ago when there was a move to add 
> exceedingly  specific terms;  ie, interleukin 6 binding, interleukin 7 
> binding, where typically there is one; these could be handled with 
> protein binding IPI id_of_interleukin 6, or whatever, or perhaps even 
> better,  interleukin binding IPI id_of interleukin 6, so that one 
> could search on what might bind any interleukin, and get back 
> annotations where the specific species can be found n the with field.
>
> h
>
>
> Alexander Diehl wrote:
>> Jim,
>>
>> I don't see the need to prepare ChEBI x binding.  The use of the term 
>> "protein binding" has always been done post-compositionally by MGI 
>> and others, and similarly the use of "binding" in general can be done 
>> post-compositionally.  There are some more specific children of 
>> protein binding, some of which may be more useful than others, but 
>> certainly I would not encourage more without a sound argument.  The 
>> GO is not perfect, but the existing mechanisms for term requests and 
>> generation are pretty good about weeding out unnecessary additions.  
>> If necessary we can propose annotation standards for GO that specify 
>> post-composition for "binding" related terms, as we do already for 
>> "protein binding."
>>
>> While I agree with Jen a new fast term browser might help folks who 
>> have trouble finding terms, in general finding terms does become 
>> faster with experience.  And although the MGI term browser is 
>> primitive in many respects, by default it always shows children of a 
>> given term, which many other browsers do not, thus saving a step when 
>> searching for more specific terms.
>>
>> -- Alex
>>
>>
>> Jim Hu wrote:
>>> In general, I like precomposition too.  But for binding, and to a 
>>> lesser extent location, I don't like the idea of having parent terms 
>>> with thousands of children.  The terms like regulation of 
>>> translation of gene X mRNA are terrifying to me.  I noticed that 
>>> somewhere on wiki.geneontology.org, there's a statement that GO will 
>>> never do those kinds of terms by precomposition, but a few terms 
>>> like that are already in GO, and there was recently a sourceforge 
>>> item about protein chaperones for specific gene products. I usually 
>>> find terms by searching for a keyword combination, navigating to a 
>>> particular term, and then browsing up and down the ontology.  Do 
>>> others not do the browsing part?  I think that's where the massive 
>>> expansion is most problematic.
>>>
>>> I see what you mean about time, but requesting a new term is also a 
>>> time barrier to annotation.
>>>
>>> Perhaps a test version of the ontology could be automatically 
>>> generated with ChEBI x binding, and people could see if my 
>>> intuitions or everyone else's are correct.  In general, I suspect 
>>> that people want precomposition for their own annotations and are 
>>> annoyed at the excess terms that they don't see themselves ever 
>>> using.  E. coli being the most distant from everyone else in the 
>>> phylogeny may be why I'm where I am on this! ;)
>>>
>>> Jim
>>>
>>> On Mar 24, 2009, at 8:38 AM, Alexander Diehl wrote:
>>>
>>>> I want to add my agreement to the words of Val and David.  It is 
>>>> much simpler to use a pre-composed existing term in annotation.  
>>>> One aspect of the annotation process I feel is over looked as we 
>>>> add more complexity to the annotation process is that 
>>>> post-composition adds a significant bit of time to the annotation 
>>>> process, resulting in fewer annotations overall and lower metrics 
>>>> for the database and grant.  While it is important to do detailed 
>>>> and correct annotations whenever possible, anything we can to do to 
>>>> increase throughput, such as precomposing likely terms, is 
>>>> beneficial.  I'm not saying we should add all possible combinations 
>>>> of X and Y, just the appropriate ones.  This is one of the main 
>>>> reasons for having annotators lead ontology development and holding 
>>>> ontology content meetings where expert biologists can discuss 
>>>> processes actually seen in nature, so that the appropriate 
>>>> combinations of X and Y are added.
>>>>
>>>> And knowing which pre-composed terms to use is a matter of training 
>>>> and experience, both in general biology, and in annotation.  
>>>> There's no way around it.
>>>>
>>>> -- Alex
>>>>
>>>>
>>>> val at sanger.ac.uk <mailto:val at sanger.ac.uk> wrote:
>>>>> I agree, it is far better to have pre-composed terms if possible,
>>>>> especially for new curators.
>>>>> As we encourage annotation to the most specific term possible it 
>>>>> is hard
>>>>> to overlook the precomposed terms, because we (I hope) always 
>>>>> check the
>>>>> child terms).
>>>>>
>>>>> Val
>>>>>
>>>>>  
>>>>>> From someone who has been annotating using a lot of 
>>>>>> pre-composition as
>>>>>> well as post-composition for a reasonably long time;  although 
>>>>>> there is
>>>>>> an initial activation energy to get a pre-composed term into the
>>>>>> ontology, once they are there, they are much easier to use than 
>>>>>> to look
>>>>>> up things in multiple ontologies for post-composition.
>>>>>>
>>>>>> The key to finding pre-composed terms easily is to have a good 
>>>>>> way of
>>>>>> viewing the ontology.
>>>>>>
>>>>>> my 2c
>>>>>>
>>>>>> D
>>>>>>  
>>>>>>> I would like to hear the opinion of some of the annotators here. Is
>>>>>>> excessive pre-coordination a concern for curation?
>>>>
>
> _______________________________________________
> Go mailing list
> Go at geneontology.org
> http://fafner.stanford.edu/mailman/listinfo/go



More information about the Go mailing list