[Go] generic GO slim question

Valerie Wood val at sanger.ac.uk
Tue Jun 16 12:00:31 PDT 2009


As an example of (1.) fission yeast uses 3361 different terms in total, 
3127 of these are used for manual annotation (I was looking at this 
today) so the 'slim' would be quite 'fat' in this case.
Val


Judith Blake wrote:

> Hi Jim,
>
> I think you bring forward the two different approaches to slims....
>
>    1. High level terms, typically fewer than 20, that can be used to
>       look at overall distribution of gene attributes of a genome set.
>
>
> 2. Set of terms that have been used in a particular context... Used to 
> annotate a prokaryotic protein...as in your example.
>
> Something to keep in mind.
>
> For me, the first case, with a few high level terms, is a ‘slimming’ 
> more apparently than the 2nd.
>
> Judy
>
>
> On 6/16/09 12:59 PM, "Jim Hu" <jimhu at tamu.edu> wrote:
>
>     From what I can tell about the discussions of slims I've heard at
>     GOC meetings, part of the problem is that maintaining them is an
>     extra task that no one really has time to do. Which makes me
>     wonder if slimming can be automated in some way. For example,
>     anything that is used for a manual annotation of a prokaryote
>     would go in the prokaryotic slim.
>
>     Jim
>
>
>     On Jun 14, 2009, at 4:54 AM, Valerie Wood wrote:
>
>
>         How was it decided which terms to include in the generic GO slim?
>
>         There have been discussions previously about what makes a
>         useful and relevent generic GO slim (but no agreement).
>         However, it seems that at the very least the terms should be
>         i) general, and ii) high level terms which constitute major
>         cellular processes (and therefore areas of research) should be
>         included.
>
>         So, I was wondering why the following terms are in the slim (I
>         have included the TOTAL number of annotations for all
>         organisms in parenthases)
>
>         i) plastid translation [1]
>         ii) lead ion binding [2]
>         iii) cytoplasmic chromosome [28]
>         iv) neurotransmitter transporter [55]
>
>         Conversely the following biologically important "general"
>         terms (at least from a single celled organism perprective) ,
>         are absent from the generic GO slim
>
>         i) DNA replication [1685]
>         ii) DNA repair [1934]
>         iii) transmembrane transport [814]
>         iv) ribosome biogenesis [1849]
>         v) cytokinesis [1049]
>         vi) cytoskeletal organization [2311]
>         and others.
>
>         In addition, there is an obsolete molecular function term in
>         the slim (chaperone regulator activity)
>
>         I wondered whether the contents of the slim need to be to make
>         it more useful. I realise it isn't easy to make a slim which
>         is good for all organisms. If this is the case perhaps we
>         should consider abandoning the "generic generic" slim and
>         define more useful individual generic slims for prokaryotes,
>         eukaryotic unicellular, and multicellular orgs?
>
>         We might not agree about the utility of a "generic slim" but
>         these are used a lot as they are the default slims used by
>         AmiGO, and the Princeton generic GO term mapper.......They
>         should provide a good overview of the known biology of any
>         organism. They should probably provide a starting point for
>         people who wish to refine to make their own slim and include
>         more specific terms for their area of interest, and remove
>         terms which are not useful. I am trying to write a tutorial
>         which includes how to select terms for a slim to give complete
>         coverage for their organism, and refine to make a more
>         specific slim, but the the generic slim doesn't seem to
>         provide very good example for a starting point.
>
>         Val
>
>
>
>
>
>



-- 
 The Wellcome Trust Sanger Institute is operated by Genome Research 
 Limited, a charity registered in England with number 1021457 and a 
 company registered in England with number 2742969, whose registered 
 office is 215 Euston Road, London, NW1 2BE. 


More information about the Go mailing list