[go] Protein domain GO annotation

E Dimmer edimmer at ebi.ac.uk
Thu Nov 1 09:17:33 PDT 2007


Hi,

Could I please ask people's opinion on the functional annotation of 
protein domains/regions to the GO?

I have been contacted by a group who would like to annotate GO functions 
to identified disordered regions in proteins.

The thought so far is that they would annotate to a 'disordered_region' 
SO term, along with sequence co-ordinates, and then also attach a GO 
term with a reference and evidence code.
(I have spoken with Gabby Reeves from BioSapiens, who would be happy to 
add 'disordered_region' terms to the BioSapiens protein feature ontology 
section of SO).

For an annotation example: protein LEF-1 (Q9QXN1) has a disordered 
region corresponding to residues 296 - 397. This domain has been found 
to act to bend DNA, as reported in a experiment in PMID: 7651541.
In the normal course of GO annotation I would of course happily to 
annotate the whole protein (Q9QXN1) to the DNA bending term (DNA bending 
activity, GO:0008301), and while I might read about the discrete region 
in the protein that is responsible for this function I would not capture 
this data.
However the IUP(Intrinsically Unstructured Protein) curators would 
include the aa residue information in their annotations and want to 
describe the individual functions that a protein's multiple domains 
might have.

So I assume that for these kinds of annotations, where an equivalent GO 
term exists, a GOC annotation group could integrate this group's 
annotations and relate it up to the whole protein/gene product (and 
possibly being able to keep the SO term in the new cross-reference 
column 16? but not the aa residue location?).

While the majority of the function terms that the IUP community are 
interested in applying to their domains do map quite straight-forwardly 
to GO terms, there are some new ones which would need to be requested. 
And some of these new terms seem to describe more domain-specific, 
intra-protein function. For example, for some of the function terms used 
in the DisProt database:

flexible linker/spacer
Provides separation and permits movement between adjacent domains

entropic brisle
A disordered region that creates a zone of exclusion by its entropic 
movement

entropic spring
Provides a restoring force resulting from randomization of bond torsion 
angles that become restricted upon stretching.

(see: http://www.disprot.org/view_function_subclass.php)

So, would GO be willing to add these types of terms? And how much of the 
IUP communities annotation data would GOC groups be happy to incorporate 
into their own annotation sets?

Thanks,
Emily


-- 
 ************************************
    Emily Dimmer
    GOA Coordinator
    EMBL-EBI
    Wellcome Trust Genome Campus
    Hinxton
    Cambridge CB10 1SD, U.K.
    Tel:     +44 1223 494654
    Fax:    +44 1223 494468
    email:  edimmer at ebi.ac.uk
 ************************************




More information about the Go mailing list