[Annotation] interpro scan question

Doug howe dhowe at cs.uoregon.edu
Fri Sep 19 12:52:22 PDT 2008


http://www.uniprot.org/uniprot/O93578

I had a typo in this one, sorry! The correct link is:
http://www.uniprot.org/uniprot/Q6P3L7



Harold Drabkin wrote:
> Doug howe wrote:
>> I've got two protein sequences (found below) which are 99%
>> identical.  Both belong to zebrafish synaptosomal protein snap25a,
>> but one of them is a fragment.  The fragment turns up positive for
>> the InterPro domain IPR002197 (helix-turn-helix, Fis-type), while the
>> full-length protein does not.  The fragment is the only protein for
>> the gene that is positive for this InterPro domain.  It turns out
>> that IPR002197 maps through interpro2go to 'transcription factor
>> activity' and 'regulation of transcription'.  Is it possible that the
>> protein fragment is getting a false positive hit for the domain
>> IPR002197?  If so, why doesn't the full-length protein also get a
>> false positive hit?
>>
>> Any help welcomed...
>>
>> O93578
>> LGKFCGLCSCPCNKMKSGASKAWGNNQDGVVASQPARVVDEREQMAISGGFIRRVTDDAR
>> ENEMDENLEQVGGIIGNLRHMALDMGNEIDTQNRQIDRIMEKADSNKTRIDEANQRATKM
>> LGSG
>>
>> Q693L7
>> MAEDSDMRNELADMQQRADQLADESLESTRRMLQLVEESKDAGIRTLVMLDEQGEQLERI
>> EEGMDQINKDMKDAEKNLNDLGKFCGLCSCPCNKMKSGASKAWGNNQDGVVASQPARVVD
>> EREQMAISGGFIRRVTDDARENEMDENLEQVGGIIGNLRHMALDMGNEIDTQNRQIDRIM
>> EKADSNKTRFDEANQRATKMLGSG
>>
>>
> Why can I not find either of these IDs in UniProtKB?
>
> Could the fragment be a TrEMBL record and thus not have been fully
> annotated for domains? (This is why we block display of domains
> associated with TrEMBL records in our db and only display and use the
> ones from a fully curated UniProt record.)
>
>
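
For what it's worth, here is a minimal sketch of how the two quoted sequences can be compared position by position. It only slides the fragment along the full-length sequence without gaps and reports where they differ; it is not what InterProScan does and says nothing about why the domain hit appears. The function name best_ungapped_offset is made up for this illustration, and the full-length ID is taken from the corrected link above.

```python
# Sketch: ungapped comparison of the fragment against the full-length protein.
# Sequences are copied from the quoted message; the helper name is hypothetical.

FRAGMENT = (  # O93578
    "LGKFCGLCSCPCNKMKSGASKAWGNNQDGVVASQPARVVDEREQMAISGGFIRRVTDDAR"
    "ENEMDENLEQVGGIIGNLRHMALDMGNEIDTQNRQIDRIMEKADSNKTRIDEANQRATKM"
    "LGSG"
)

FULL_LENGTH = (  # Q6P3L7 (ID as given in the corrected link)
    "MAEDSDMRNELADMQQRADQLADESLESTRRMLQLVEESKDAGIRTLVMLDEQGEQLERI"
    "EEGMDQINKDMKDAEKNLNDLGKFCGLCSCPCNKMKSGASKAWGNNQDGVVASQPARVVD"
    "EREQMAISGGFIRRVTDDARENEMDENLEQVGGIIGNLRHMALDMGNEIDTQNRQIDRIM"
    "EKADSNKTRFDEANQRATKMLGSG"
)


def best_ungapped_offset(fragment, full):
    """Return (offset, mismatches) for the placement with the fewest mismatches.

    offset is the 0-based start of the fragment within the full sequence.
    mismatches is a list of (fragment_index, fragment_residue, full_residue),
    with fragment_index 0-based.
    """
    best_offset, best_mismatches = None, None
    for offset in range(len(full) - len(fragment) + 1):
        window = full[offset:offset + len(fragment)]
        mismatches = [
            (i, a, b) for i, (a, b) in enumerate(zip(fragment, window)) if a != b
        ]
        if best_mismatches is None or len(mismatches) < len(best_mismatches):
            best_offset, best_mismatches = offset, mismatches
    return best_offset, best_mismatches


offset, mismatches = best_ungapped_offset(FRAGMENT, FULL_LENGTH)
identity = 100.0 * (len(FRAGMENT) - len(mismatches)) / len(FRAGMENT)

print(f"fragment aligns at full-length position {offset + 1}")
print(f"identity over the fragment: {identity:.1f}%")
for i, frag_res, full_res in mismatches:
    print(f"fragment pos {i + 1}: {frag_res} vs full-length pos {offset + i + 1}: {full_res}")
```

Running this on the two sequences shows how the fragment sits inside the full-length protein and lists every residue where they disagree, which may help narrow down what differs between the record that gets the IPR002197 hit and the one that does not.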

-- 
Doug Howe, Ph.D.
ZFIN Scientific Curator
Zebrafish Nomenclature Coordinator


