[go] contributes_to question
Valerie Wood
val at sanger.ac.uk
Mon Aug 13 03:15:38 PDT 2007
Recently I'm wondering recently why we have 2 meanings for contributes_to:
When the qualifier was initially implemented, it was so function terms
could be added to complexes like DNA polymerase and the F1 Fo ATPase
where the function cannot be attributed to a single subunit. This seems
fine.
Increasingly I see annotations to complexes which are described as (for
example) a histone acetyltransferase complex, and all of the subunits
are given histone de/acetlytransferase or methyltransferase activity
with contributes_to, even thought the other subunits clearly have other
functions (I see ATPases, ubiquitin ligases actin-like proteins etc,
which are commonly associated with histone acetyltransferases and
methyltransferases).
This seems odd, for a number of reasons.
Often these subunits are not required for the activity, but their
deletion (sometimes, but not always) affects the rate the activity
Primarily I don't understand what this type of 'contributes_to'
annotation provides to GO users above a process annotation to the
histone acetylation (if this has been shown), a complex annotation, and
a function term to unknown/root node. Isn't it more useful to know that
there is some information about the process, but the molecular function
is not known?
1) Another problem is that these particular chromatin associated
complexes often have shared
subunits so the function annotations aren't so clear-cut (i.e some of
these subunits may be members of other complexes which do not have this
activity)
2) Also computational analysis using RCA which infer these 'functions'
to similar proteins which, from their domain composition it is unlikely
possess this activity.
3) It makes cross species comparisons difficult because you get
different numbers of functions to what you would expect when comparing
annotations between species. For example it is known how many histone
acetyltransferases /methytrasferases etc. pombe has, compered to S.
cerevisiae, but when I compare the 2 the numbers are skewed.
The documentation clearly allows this (although there is not an example
of this type of annotation in the documentation, so I wonder if this is
what we meant?):
> From the documentation:
Annotating individual gene products according to attributes of a complex
is especially useful for molecular function annotations in cases where a
complex has an activity, but not all of the individual subunits do. (For
example, there may be a known catalytic subunit and one or more
additional subunits, or the activity may only be present when the
complex is assembled.) Molecular function annotations of complex
subunits that are not known to possess the activity of the complex must
include the entry contributes_to in the Qualifier column.
Note that contributes_to is not needed to annotate a catalytic subunit.
Furthermore, contributes_to may be used for any non-catalytic subunit,
whether the subunit is essential for the activity of the complex or not.
--
The Wellcome Trust Sanger Institute is operated by Genome Research
Limited, a charity registered in England with number 1021457 and a
company registered in England with number 2742969, whose registered
office is 215 Euston Road, London, NW1 2BE.
More information about the Go
mailing list