[go] Paper of potential interest to you
Valerie Wood
val at sanger.ac.uk
Wed Aug 8 05:47:43 PDT 2007
Mike Cherry wrote:
> Manual curation is not sufficient for annotation of genomic databases
> William A. Baumgartner, Jr, K. Bretonnel Cohen, Lynne M. Fox, George
> Acquaah-Mensah, and Lawrence Hunter
> Bioinformatics 2007 23: i41-i48.
>
> http://bioinformatics.oxfordjournals.org/cgi/content/abstract/23/13/
> i41?etoc
>
>
>
This was interesting. Before we all decide its a losing battle, it's not
quite so doom and gloom as this analysis suggests.
By using mouse and fly they chose the 2 models with the single greatest
volume of data. It would have been nice to see the combined progress of
the GO curated organisms vs. non GO curated organisms (rather than
mouse, fly and then the entire Uniprot knowledge base)
Using this criteria (at least one GO annotation) they would have
identified a 'best case scenario' (left graph of figure one') for both
budding and fission yeasts.
However, using these methods, they would never show a 'best case
scenario' of GO annotation for ANY organism because they extracted the
GO data from the Uniprot records (at least this is what they say in the
methods), and Uniprot don't include ISS/IC/NAS/TAS/ or most importantly
for this analysis ND (I think that is correct isn't it Emily?)
And as they mention one reviewer pointed out, it is impossible here to
differentiate between a rate limiting factor of the rate of annotation
and the rate of discovery, or the relative contributions of either.
As an evaluation of GO coverage it would have been more informative if
they had used all the GO data. But its difficult to provide an analysis
of curation completion unless you know what is known.....
--
The Wellcome Trust Sanger Institute is operated by Genome Research
Limited, a charity registered in England with number 1021457 and a
company registered in England with number 2742969, whose registered
office is 215 Euston Road, London, NW1 2BE.
More information about the Go
mailing list