[Go] generic GO slim question
Valerie Wood
val at sanger.ac.uk
Sun Jun 14 02:54:51 PDT 2009
How was it decided which terms to include in the generic GO slim?
There have been discussions previously about what makes a useful and
relevent generic GO slim (but no agreement). However, it seems that at
the very least the terms should be i) general, and ii) high level terms
which constitute major cellular processes (and therefore areas of
research) should be included.
So, I was wondering why the following terms are in the slim (I have
included the TOTAL number of annotations for all organisms in parenthases)
i) plastid translation [1]
ii) lead ion binding [2]
iii) cytoplasmic chromosome [28]
iv) neurotransmitter transporter [55]
Conversely the following biologically important "general" terms (at
least from a single celled organism perprective) , are absent from the
generic GO slim
i) DNA replication [1685]
ii) DNA repair [1934]
iii) transmembrane transport [814]
iv) ribosome biogenesis [1849]
v) cytokinesis [1049]
vi) cytoskeletal organization [2311]
and others.
In addition, there is an obsolete molecular function term in the slim
(chaperone regulator activity)
I wondered whether the contents of the slim need to be to make it more
useful. I realise it isn't easy to make a slim which is good for all
organisms. If this is the case perhaps we should consider abandoning the
"generic generic" slim and define more useful individual generic slims
for prokaryotes, eukaryotic unicellular, and multicellular orgs?
We might not agree about the utility of a "generic slim" but these are
used a lot as they are the default slims used by AmiGO, and the
Princeton generic GO term mapper.......They should provide a good
overview of the known biology of any organism. They should probably
provide a starting point for people who wish to refine to make their
own slim and include more specific terms for their area of interest, and
remove terms which are not useful. I am trying to write a tutorial
which includes how to select terms for a slim to give complete coverage
for their organism, and refine to make a more specific slim, but the the
generic slim doesn't seem to provide very good example for a starting
point.
Val
--
The Wellcome Trust Sanger Institute is operated by Genome Research
Limited, a charity registered in England with number 1021457 and a
company registered in England with number 2742969, whose registered
office is 215 Euston Road, London, NW1 2BE.
More information about the Go
mailing list