[Annotation] Association errors

Valerie Wood val at sanger.ac.uk
Thu Jun 12 01:36:23 PDT 2008


pombe fixed

Doesn't Mike's script check the format of the dbxrefs and check they have 
valid prefixes? I thought it did. If not, can it be added so we capture 
most of these at submission?
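
For illustration, a submission-time check of this kind might look roughly like
the sketch below. It is not Mike's script, and what that script actually checks
is not confirmed here. The function name check_dbxref and the short
VALID_PREFIXES set are hypothetical. A real check would load the accepted
prefixes from the GO database abbreviations list rather than hard-coding them.

```python
import re

# Hypothetical, deliberately incomplete set of accepted prefixes (assumption:
# a real check would read the full list from the GO xref abbreviations file).
VALID_PREFIXES = {
    "MGI", "SGD", "FB", "WB", "TAIR", "dictyBase", "GeneDB_Spombe",
    "UniProt", "InterPro", "PMID", "RefSeq",
}

# A dbxref is expected to be PREFIX:LOCAL_ID with no whitespace; the local
# part may itself contain colons (e.g. MGI:MGI:1915651).
DBXREF_RE = re.compile(r"^([^:\s]+):(\S+)$")


def check_dbxref(xref, valid_prefixes=VALID_PREFIXES):
    """Return None if the xref passes, else a short description of the problem."""
    match = DBXREF_RE.match(xref)
    if match is None:
        return "not in DB:ID form"
    prefix = match.group(1)
    # The comparison is case-sensitive, so REFSEQ and Flybase are rejected.
    if prefix not in valid_prefixes:
        return f"unknown prefix '{prefix}'"
    return None


if __name__ == "__main__":
    # With-column values taken from the attached list of problem refs.
    for xref in ["MGD:MGI:1914510", "Flybase:FBgn0027835",
                 "REFSEQ:NM_138387.2", "SMD:S000005654"]:
        print(xref, "->", check_dbxref(xref))
```

A prefix check like this would flag cases such as MGD, SMD, Flybase and REFSEQ
in the list below. It would not catch a well-formed ID filed under the wrong
(but valid) database, or a gene symbol used where an accession is expected.
Those would need a per-database pattern for the local ID.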

Val


Amelia Ireland wrote:
> Hi all,
>
> I have been looking at database refs in the GO database and have found a
> few dodgy xrefs which need fixing. The errors mean that the links from the
> associations don't work correctly, so it'd be good to get them fixed.
>
> I've attached a file with the dodgy refs in it. It's tab delimited so you
> can open it in excel for your viewing pleasure (or displeasure).
>
> Cheers!
> Amelia
>
>   
> ------------------------------------------------------------------------
>
> gpxref	evcode	reference	with column	assigned_by
>
> REFERENCE				
> Not sure what TIGR:autoGO is supposed to mean!				
> GeneDB_Tbrucei:Tb10.6k15.3030	ISS	TIGR:autoGO	UniProt:Q9Y4W6	GeneDB_Tbrucei
>
> WITH COLUMN				
> "Should be Interpro, not IUPHAR"				
> dictyBase:DDB0266650	ISS	dictyBase_REF:10155	IUPHAR:IPR000271	dictyBase
>
> Should these be MGI?				
> GeneDB_Spombe:SPBC28E12.06c	ISS	PMID:17072883	MGD:1096875	GeneDB_Spombe
> SGD:S000005096	ISS	SGD_REF:S000125856	MGD:99665	SGD
> SGD:S000003563	ISS	SGD_REF:S000045069	MGD:98181	SGD
> SGD:S000003241	ISS	SGD_REF:S000043550	MGD:74623	SGD
>
> Remove the surplus MGD				
> dictyBase:DDB0230136	ISS	dictyBase_REF:10155	MGD:MGI:1914510	dictyBase
>
> The MGI IDs for these need to be found				
> SGD:S000005853	ISS	SGD_REF:S000043587	MGD:Myo5a	SGD
> SGD:S000000027	ISS	SGD_REF:S000043587	MGD:Myo5a	SGD
> SGD:S000003664	ISS	SGD_REF:S000046190	MGD:Map2k1	SGD
>
> Should be lowercase				
> MGI:MGI:1915651	ISS	MGI:MGI:3576961	REFSEQ:NM_138387.2	MGI
>
> "I think these are supposed to be SGD, not SMD"				
> dictyBase:DDB0230088	ISS	dictyBase_REF:10155	SMD:S000005654	dictyBase
> dictyBase:DDB0231094	ISS	dictyBase_REF:10155	SMD:S000003080	dictyBase
>
> TIGR: should these be TIGRFAMS refs instead?				
> TAIR:gene:2050224	RCA	TAIR:Communication:501714663	TIGR:TIGR00797	TIGR
> TAIR:gene:2050224	RCA	TAIR:Communication:501714663	TIGR:TIGR00797	TIGR
> TIGR_CMR:BA_0175	ISS	PMID:12721629	TIGR:TIGR00363	TIGR
> TIGR_CMR:BA_1378	ISS	PMID:12721629	TIGR:TIGR01439	TIGR
> TIGR_CMR:BA_1378	ISS	PMID:12721629	TIGR:TIGR01439	TIGR
> TIGR_CMR:BA_2818	ISS	PMID:12721629	TIGR:TIGR01389	TIGR
> TIGR_CMR:BA_2818	ISS	PMID:12721629	TIGR:TIGR01389	TIGR
> TIGR_CMR:BA_2818	ISS	PMID:12721629	TIGR:TIGR01389	TIGR
> TIGR_CMR:BA_2818	ISS	PMID:12721629	TIGR:TIGR01389	TIGR
> TIGR_CMR:BA_2818	ISS	PMID:12721629	TIGR:TIGR01389	TIGR
> TIGR_CMR:BA_3635	ISS	PMID:12721629	TIGR:annotation	TIGR
> TIGR_CMR:BA_3769	ISS	PMID:12721629	TIGR:TIGR01439	TIGR
> TIGR_CMR:BA_3769	ISS	PMID:12721629	TIGR:TIGR01439	TIGR
> TIGR_CMR:BA_4076	ISS	PMID:12721629	TIGR:TIGR01439	TIGR
> TIGR_CMR:BA_4076	ISS	PMID:12721629	TIGR:TIGR01439	TIGR
> TIGR_CMR:BA_4140	ISS	PMID:12721629	TIGR:TIGR00095	TIGR
> TIGR_CMR:BA_4140	ISS	PMID:12721629	TIGR:TIGR00095	TIGR
> TIGR_CMR:BA_4713	ISS	PMID:12721629	TIGR:TIGR00040	TIGR
> TIGR_CMR:BA_4713	ISS	PMID:12721629	TIGR:TIGR00040	TIGR
> TIGR_CMR:LMOf2365_1146	ISS	PMID:15115801	TIGR:TIGR00493	TIGR
> TIGR_CMR:LMOf2365_1146	ISS	PMID:15115801	TIGR:TIGR00493	TIGR
> TIGR_CMR:LMOf2365_1146	ISS	PMID:15115801	TIGR:TIGR00493	TIGR
> TIGR_CMR:LMOf2365_1733	ISS	PMID:15115801	TIGR:TIGR00500	TIGR
> TIGR_CMR:LMOf2365_1733	ISS	PMID:15115801	TIGR:TIGR00500	TIGR
> TIGR_CMR:LMOf2365_1888	ISS	PMID:15115801	TIGR:TIGR00401	TIGR
> TIGR_CMR:LMOf2365_1888	ISS	PMID:15115801	TIGR:TIGR00401	TIGR
> TIGR_CMR:LMOf2365_2483	ISS	PMID:15115801	TIGR:TIGR00963	TIGR
> TIGR_CMR:LMOf2365_2483	ISS	PMID:15115801	TIGR:TIGR00963	TIGR
>
> Change Flybase to FB:...				
> GeneDB_Spombe:SPCC550.14	ISS	PMID:17072883	Flybase:FBgn0027835	GeneDB_Spombe
> GeneDB_Spombe:SPAC26F1.02	ISS	PMID:17072883	Flybase:FBgn0037737	GeneDB_Spombe
>
> These IDs are wrong - should be FB:FBgn0015477				
> WB:WB:WP:CE30344	ISS	WB:WBPaper00004810	FLYBASE:CG8014-PA	WB
> WB:WB:WP:CE30345	ISS	WB:WBPaper00004810	FLYBASE:CG8014-PA	WB
> ------------------------------------------------------------------------
>
> _______________________________________________
> Annotation mailing list
> Annotation at geneontology.org
> http://fafner.stanford.edu/mailman/listinfo/annotation
>   


-- 
---------------------------------------------------------------------------
Valerie Wood			 Tel: 01223 496909
S. pombe Genome Project		 Fax: 01223 494919 		       
Wellcome Trust Sanger Institute	 email: val at sanger.ac.uk
Wellcome Trust Genome Campus	 http://www.genedb.org/genedb/pombe 
Hinxton, Cambridge, CB10 1HH	 http://www.sanger.ac.uk/Projects/S_pombe



-- 
 The Wellcome Trust Sanger Institute is operated by Genome Research 
 Limited, a charity registered in England with number 1021457 and a 
 company registered in England with number 2742969, whose registered 
 office is 215 Euston Road, London, NW1 2BE. 

