From midori at ebi.ac.uk Tue Sep 1 02:28:36 2009 From: midori at ebi.ac.uk (Midori Harris) Date: Tue, 1 Sep 2009 10:28:36 +0100 (BST) Subject: [Ontology-editors] XP docs In-Reply-To: References: Message-ID: Chiming in late due to bank holiday yesterday ... I think the doc now looks very good indeed. (Although "classes" would be more ontologically correct, I heartily approve of sticking with "terms" in this documentation, because so much of its target audience is biologists; formal ontologists will mostly look elsewhere anyway.) m On Mon, 31 Aug 2009, Chris Mungall wrote: > > Looks very good! > > Couple of comments: >> If we use ontology terms in the genus and the differentia, we can see >> that these logical definitions take the general form >> >> term relation term >> > The problem is we already use a triple of the form to denote > > all X R some Y > > I suggest that instead of > > term relation term > > You write > > term and relation term > > or > > term that relation term > > e.g. instead of > >> lysosomal membrane is membrane surrounding lysosome >> > > we should have: > > lysosomal membrane is (a) membrane that surrounds (a) lysosome > > This is consistent with what you have above, and with the formal semantics. > Also the actual relation used is 'surrounds' not 'surrounding'. Of course > it's not hard to have a simple algorithm that translates a logical definition > of the form > > lysosomal membrane = membrane that surrounds lysosome > > To a more user-friendly > > a lysosome membrane is a membrane surrounding a lysosome > > But this documentation should not be dependent on such an algorithm. The docs > should probably not shy away from showing the relevant piece of the obo > format stanza and include a translational table for going between the two. > E.g. > > id: X > > intersection_of: G > > intersection_of: R1 Y1 > > intersection_of: Rn Yn > > <=> > > X = G that and ... and > > > > The docs use the terms 'term', 'concept' and 'category'. I'm not sure what > the difference between these are. I think it would be simplest if we use > 'class' throughout. Unfortunately the usage of 'term' for what are actually > classes is very prevalent in GO. We could at least reduce it to term and > class and eliminate concept and category. > > It may be better to use a species-neutral ontology like cell or chebi instead > of the species-specific fly_anatomy. > > For the internal xp examples it would be good to have examples drawn from the > xps that will go live first: the subset of bp_xp_cc that are simple occurs_in Y> ; regulation xps ; the subset of cc_xp_self that is component that part_of X>. We could also have an example of how this is > being used to maintain the GO. > > David and Amina are writing docs for OE users so we should be sure to > coordinate. > > Thanks! > > > > On Aug 28, 2009, at 3:10 PM, Amelia Ireland wrote: > >> Hello people, >> >> I've drafted some XP docs for the GO website if you want to have a look at >> them: >> >> http://geneontology.org/test-html.shtml >> >> If anyone has suggestions for what should go in that last paragraph, they'd >> be gratefully received! >> >> Thanks, >> Amelia. >> >> -- >> Amelia Ireland >> GO Editorial Office >> http://www.berkeleybop.org || http://www.ebi.ac.uk >> Boycott Trader Joe's Red List seafood: http://traitorjoe.com/ >> >> >> >> >> >> >> >> >> _______________________________________________ >> Ontology-editors mailing list >> Ontology-editors at geneontology.org >> http://fafner.stanford.edu/mailman/listinfo/ontology-editors >> > > _______________________________________________ > Ontology-editors mailing list > Ontology-editors at geneontology.org > http://fafner.stanford.edu/mailman/listinfo/ontology-editors From midori at ebi.ac.uk Tue Sep 1 05:40:49 2009 From: midori at ebi.ac.uk (Midori Harris) Date: Tue, 1 Sep 2009 13:40:49 +0100 (BST) Subject: [Ontology-editors] forward and reverse transcription In-Reply-To: References: <64CE8C76-9D79-45F8-9357-4663A9E89D5D@berkeleybop.org> <493411F5.7070109@informatics.jax.org> <49341373.7010000@informatics.jax.org> <49359501.4050000@informatics.jax.org> <4935B884.3010108@informatics.jax.org> <4935C498.5020903@informatics.jax.org> <4935EBE3.90107@informatics.jax.org> <49369C89.70608@informatics.jax.org> <4936A2E0.7000400@informatics.jax.org> <4936A560.10903@informatics.jax.org> <4936AD2B.7030005@informatics.jax.org> <49371535.50401@informatics.jax.org> Message-ID: Hi all, After a few months of dormancy, there have been a few more comments on this SF item. Rather than copy things here, here's the link: https://sourceforge.net/tracker/?func=detail&atid=440764&aid=2354289&group_id=36855# Additional comments welcome. m From kchris at genome.stanford.edu Tue Sep 1 10:24:18 2009 From: kchris at genome.stanford.edu (Karen Christie) Date: Tue, 1 Sep 2009 10:24:18 -0700 (PDT) Subject: [Ontology-editors] [OBO-Edit Working Group] displaying has_part In-Reply-To: References: Message-ID: HI David, I would think of the major spliceosome (and also the minor and trans spliceosomes) as an ordered series of complexes, each composed of some or all of the 5 snRNPs + some other stuff. I don't know that relates to a "non-contiguous structure". Perhaps you could provide an example. -Karen On Tue, 1 Sep 2009, djs93 at gen.cam.ac.uk wrote: > Hi Karen, > > I think there might be another way to deal with this issue, that is to get a > more precise definition of the 'major spliceosome' - one that more closely > accords with what biologists are actually referring too. It seems to me, that > the way you use the term is most consistent with it referring to a > non-contiguous structure - by which I mean one whose parts are not all > connected to each other. As these can be confusing, I think it's important to > use such terms sparingly, but they are unavoidable in some cases. For > example, we need terms for functionally defined systems in anatomy. Instances > of many such systems - e.g.- the immune system or the endocrine system, are > non-contiguous. Is this the case for what you refer to here as the 'major > spliceosome'? If so, then, as far as I can tell, the original use of part_of > in this case was not incorrect. > > Hope this helps, rather than just muddying the waters further, > > David > > On Aug 26 2009, Karen Christie wrote: > >> OK, I understand your explanation about "Why should "Queries for U2-type >> spliceosomal complex should ***not*** return gene products localized >> to U1 snRNP complex." But I think that is separate from the display >> issue I have with has_part. >> >> However, it also brings up a different issue. A question that a >> biologist might want to ask is "give me all the proteins in the major >> spliceosome". They would expect to get the proteins present in the 5 >> snRNPs (U1, U2, U4, U5, and U6). > > >> By replacing part_of relations with >> has_part, it seems that they might have difficulty getting a full >> answer to that question. In your example where g1 is annotated only to >> "U1 snRNP", you give the hypothetical result: >> >>> There are no genes with products known to be localized to 'U2-type >>> spliceosomal complex'. >>> However, every 'U2-type spliceosomal complex' has the following >>> parts: >>> a U1 snRNP -- genes: g1 >>> a U2 snRNP -- genes: none >> >> However, if I wanted to get all proteins in the major spliceosome, I >> don't want just gene products that are in EVERY 'U2-type spliceosomal >> complex', I want gene products that are in ANY 'U2-type spliceosomal >> complex'. What do we need to do to make this a question that biologists >> get a reasonable answer to? >> >> I understand that GO needs to implement more rigorous logic, but if we >> do it in a way that prevents biologists from getting reasonable >> answers, we are not doing what we set out to do. >> >> -Karen >> >> >> >> >> On Mon, 24 Aug 2009, Chris Mungall wrote: >> >>> >>> On Aug 24, 2009, at 10:36 AM, Karen Christie wrote: >>> >>> [snipped part of dialog for now to focus on one issue] >>> >>>>> I would strongly advocate that even given sufficient developer hours it >>>>> is better *not* to display the ontology in this way at all. It perhaps >>>>> looks more comforting, but people will make the same comforting >>>>> assumptions that no longer hold. For example, it looks like there is >>>>> some kind of transitive relationship between U1 snRNP and U2-type >>>>> spliceosomal complex ***which there is not***. It looks like the true >>>>> path rule might hold. ***it does not*** . Queries for U2-type >>>>> spliceosomal complex should ***not*** return gene products localized to >>>>> U1 snRNP complex. >>>> >>>> Why should "Queries for U2-type spliceosomal complex should ***not*** >>>> return gene products localized to U1 snRNP complex."? >>> >>> OK, good, I think we are circling on on the crux of the issue here. >>> >>> According to GO, it is not necessarily the case that a gp localized to a >>> U1 snRNP is necessarily localized to a U2-type spliceosomal complex. For >>> example, the gp may be localized to a particular U1 snRNP that is part of >>> a penta-snRNP complex (I may not have chosen the best example as we're >>> lacking annotations to this new term, but I can pull out an analogous >>> example if you're not convinced) >>> >>> This can be seen in the sub-graph which I reproduce at the end of this >>> email (let me know if this is not visible in some email programs, I can >>> make a wiki page with this all on it) - there is no path following the >>> arrows from U1 snRNP to U2-type spliceosomal complex. Note that attempting >>> to show the graph in an "intuitive" way as I have attempted to do below, >>> with the smaller entities at the bottom, is actually *misleading* because >>> it leads one to assume that there is some inferred all-some relationship >>> between these two terms when in fact there is not. >>> >>> Biology is complex and logic is hard. There's no escaping this. I don't >>> believe we should simplify either to the point where we get false >>> positives. I do think we need better ways of displaying this complex >>> information, but I think we should focus resources on doing this in end >>> user-facing tools rather than oboedit, as we would hope everyone using >>> oboedit to edit the ontology would have an understanding of the logic or a >>> willingness to learn. >>> >>> Here is a roughly sketched out example of how this could work >>> >>> Let's say PMID:123 describes an observation of a product of g1 being >>> localized to a 'U1 snRNP' via an IDA. Let's say that's all we know, either >>> due to the resolution of the assay or that's all that the annotator >>> specified. The user queries for 'U2-type spliceosomal complex' >>> >>> The query result screen could show something like: >>> >>> There are no genes with products known to be localized to 'U2-type >>> spliceosomal complex'. >>> However, every 'U2-type spliceosomal complex' has the following >>> parts: >>> a U1 snRNP -- genes: g1 >>> a U2 snRNP -- genes: none >>> >>> A more advanced tool would be able to show even more: >>> >>> PMID:123 shows g1 in U1 snRNP. >>> prob('U2-type spliceosomal complex') = 0.83 >>> prob('penta-snRNP complex') = 0.05 >>> ... >>> >>> This leads the user to terms of relevance, shows what is known, shows what >>> might be the case, and does not show anything that is false. >>> >>>> I would have >>>> thought that they should. If I wanted to know the parts of "U2-type >>>> spliceosomal complex" I would want to know all the things that compose >>>> the series of complexes that are all considered to be a "U2-type >>>> spliceosomal complex". >>> >>> I'm not sure I really understand the statement, it sounds tautological, I >>> don't understand what adding "series of" adds. >>> >>> If the question is "what parts can be found in every U2-type spliceosomal >>> complex" then the answer is found via the has_part relation and it's >>> closure. >>> >>> However, this is a different question from "what gene products have been >>> observed to be present in a U2-type spliceosomal complex >>> >>>> [Note that we need to revise the defs of "U2-type spliceosomal complex >>>> ; GO:0005684" (and its sibling "U12-type spliceosomal complex ; >>>> GO:0005689") to be consistent with the def of the parent term >>>> "spliceosomal complex ; GO:0005681" and specify that these terms >>>> represent series of complexes. I'll submit a SF item for this.] >>> >>> https://sourceforge.net/tracker/index.php?func=detail&aid=2843718&group_id=36855&atid=440764 >>> >>> I don't understand the motivation here. I think the definitions should >>> employ a consistent style, but I would change the parent from: >>> >>> GO:0005681 ! spliceosomal complex [DEF: "Any of a series of >>> ribonucleoprotein complexes that... >>> >>> to >>> >>> GO:0005681 ! spliceosomal complex [DEF: "A ribonucleoprotein complex >>> that... >>> >>> I'm not sure how adding "any of a series of..." changes the meaning, it >>> seems to just add extra verbiage that obfuscates the definition. >>> >>> >>> >> >> >> ------------------------------------------------------------------------------ >> Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day >> trial. Simplify your report design, integration and deployment - and focus >> on what you do best, core application coding. Discover what's new with >> Crystal Reports now. http://p.sf.net/sfu/bobj-july >> _______________________________________________ >> Geneontology-oboedit-working-group mailing list >> Geneontology-oboedit-working-group at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/geneontology-oboedit-working-group > > From cjm at berkeleybop.org Tue Sep 1 10:28:53 2009 From: cjm at berkeleybop.org (Chris Mungall) Date: Tue, 1 Sep 2009 10:28:53 -0700 Subject: [Ontology-editors] [OBO-Edit Working Group] displaying has_part In-Reply-To: References: Message-ID: <12E6EEF1-2EBF-4A5A-A83E-346111DCB7FE@berkeleybop.org> Can you explain what you mean by 'ordered series'? On Sep 1, 2009, at 10:24 AM, Karen Christie wrote: > HI David, > > I would think of the major spliceosome (and also the minor and trans > spliceosomes) as an ordered series of complexes, each composed of > some or all of the 5 snRNPs + some other stuff. I don't know that > relates to a "non-contiguous structure". Perhaps you could provide > an example. > > -Karen > > > On Tue, 1 Sep 2009, djs93 at gen.cam.ac.uk wrote: > >> Hi Karen, >> >> I think there might be another way to deal with this issue, that is >> to get a more precise definition of the 'major spliceosome' - one >> that more closely accords with what biologists are actually >> referring too. It seems to me, that the way you use the term is >> most consistent with it referring to a non-contiguous structure - >> by which I mean one whose parts are not all connected to each >> other. As these can be confusing, I think it's important to use >> such terms sparingly, but they are unavoidable in some cases. For >> example, we need terms for functionally defined systems in anatomy. >> Instances of many such systems - e.g.- the immune system or the >> endocrine system, are non-contiguous. Is this the case for what you >> refer to here as the 'major spliceosome'? If so, then, as far as I >> can tell, the original use of part_of in this case was not incorrect. >> >> Hope this helps, rather than just muddying the waters further, >> >> David >> >> On Aug 26 2009, Karen Christie wrote: >> >>> OK, I understand your explanation about "Why should "Queries for >>> U2-type >>> spliceosomal complex should ***not*** return gene products localized >>> to U1 snRNP complex." But I think that is separate from the display >>> issue I have with has_part. >>> However, it also brings up a different issue. A question that a >>> biologist might want to ask is "give me all the proteins in the >>> major >>> spliceosome". They would expect to get the proteins present in the 5 >>> snRNPs (U1, U2, U4, U5, and U6). >> >> >>> By replacing part_of relations with >>> has_part, it seems that they might have difficulty getting a full >>> answer to that question. In your example where g1 is annotated >>> only to >>> "U1 snRNP", you give the hypothetical result: >>> >>>> There are no genes with products known to be localized to >>>> 'U2-type >>>> spliceosomal complex'. >>>> However, every 'U2-type spliceosomal complex' has the >>>> following >>>> parts: >>>> a U1 snRNP -- genes: g1 >>>> a U2 snRNP -- genes: none >>> However, if I wanted to get all proteins in the major spliceosome, I >>> don't want just gene products that are in EVERY 'U2-type >>> spliceosomal >>> complex', I want gene products that are in ANY 'U2-type spliceosomal >>> complex'. What do we need to do to make this a question that >>> biologists >>> get a reasonable answer to? >>> I understand that GO needs to implement more rigorous logic, but >>> if we >>> do it in a way that prevents biologists from getting reasonable >>> answers, we are not doing what we set out to do. >>> -Karen >>> On Mon, 24 Aug 2009, Chris Mungall wrote: >>>> On Aug 24, 2009, at 10:36 AM, Karen Christie wrote: >>>> [snipped part of dialog for now to focus on one issue] >>>>>> I would strongly advocate that even given sufficient developer >>>>>> hours it is better *not* to display the ontology in this way at >>>>>> all. It perhaps looks more comforting, but people will make the >>>>>> same comforting assumptions that no longer hold. For example, >>>>>> it looks like there is some kind of transitive relationship >>>>>> between U1 snRNP and U2-type spliceosomal complex ***which >>>>>> there is not***. It looks like the true path rule might hold. >>>>>> ***it does not*** . Queries for U2-type spliceosomal complex >>>>>> should ***not*** return gene products localized to U1 snRNP >>>>>> complex. >>>>> Why should "Queries for U2-type spliceosomal complex should >>>>> ***not*** >>>>> return gene products localized to U1 snRNP complex."? >>>> OK, good, I think we are circling on on the crux of the issue here. >>>> According to GO, it is not necessarily the case that a gp >>>> localized to a U1 snRNP is necessarily localized to a U2-type >>>> spliceosomal complex. For example, the gp may be localized to a >>>> particular U1 snRNP that is part of a penta-snRNP complex (I may >>>> not have chosen the best example as we're lacking annotations to >>>> this new term, but I can pull out an analogous example if you're >>>> not convinced) >>>> This can be seen in the sub-graph which I reproduce at the end of >>>> this email (let me know if this is not visible in some email >>>> programs, I can make a wiki page with this all on it) - there is >>>> no path following the arrows from U1 snRNP to U2-type >>>> spliceosomal complex. Note that attempting to show the graph in >>>> an "intuitive" way as I have attempted to do below, with the >>>> smaller entities at the bottom, is actually *misleading* because >>>> it leads one to assume that there is some inferred all-some >>>> relationship between these two terms when in fact there is not. >>>> Biology is complex and logic is hard. There's no escaping this. I >>>> don't believe we should simplify either to the point where we get >>>> false positives. I do think we need better ways of displaying >>>> this complex information, but I think we should focus resources >>>> on doing this in end user-facing tools rather than oboedit, as we >>>> would hope everyone using oboedit to edit the ontology would have >>>> an understanding of the logic or a willingness to learn. >>>> Here is a roughly sketched out example of how this could work >>>> Let's say PMID:123 describes an observation of a product of g1 >>>> being localized to a 'U1 snRNP' via an IDA. Let's say that's all >>>> we know, either due to the resolution of the assay or that's all >>>> that the annotator specified. The user queries for 'U2-type >>>> spliceosomal complex' >>>> The query result screen could show something like: >>>> >>>> There are no genes with products known to be localized to 'U2- >>>> type spliceosomal complex'. >>>> However, every 'U2-type spliceosomal complex' has the following >>>> parts: >>>> a U1 snRNP -- genes: g1 >>>> a U2 snRNP -- genes: none >>>> A more advanced tool would be able to show even more: >>>> >>>> PMID:123 shows g1 in U1 snRNP. >>>> prob('U2-type spliceosomal complex') = 0.83 >>>> prob('penta-snRNP complex') = 0.05 >>>> ... >>>> This leads the user to terms of relevance, shows what is known, >>>> shows what might be the case, and does not show anything that is >>>> false. >>>>> I would have >>>>> thought that they should. If I wanted to know the parts of "U2- >>>>> type >>>>> spliceosomal complex" I would want to know all the things that >>>>> compose >>>>> the series of complexes that are all considered to be a "U2-type >>>>> spliceosomal complex". >>>> I'm not sure I really understand the statement, it sounds >>>> tautological, I don't understand what adding "series of" adds. >>>> If the question is "what parts can be found in every U2-type >>>> spliceosomal complex" then the answer is found via the has_part >>>> relation and it's closure. >>>> However, this is a different question from "what gene products >>>> have been observed to be present in a U2-type spliceosomal complex >>>>> [Note that we need to revise the defs of "U2-type spliceosomal >>>>> complex >>>>> ; GO:0005684" (and its sibling "U12-type spliceosomal complex ; >>>>> GO:0005689") to be consistent with the def of the parent term >>>>> "spliceosomal complex ; GO:0005681" and specify that these terms >>>>> represent series of complexes. I'll submit a SF item for this.] >>>> >>>> https://sourceforge.net/tracker/index.php?func=detail&aid=2843718&group_id=36855&atid=440764 >>>> I don't understand the motivation here. I think the definitions >>>> should employ a consistent style, but I would change the parent >>>> from: >>>> GO:0005681 ! spliceosomal complex [DEF: "Any of a series of >>>> ribonucleoprotein complexes that... >>>> to >>>> GO:0005681 ! spliceosomal complex [DEF: "A ribonucleoprotein >>>> complex that... >>>> I'm not sure how adding "any of a series of..." changes the >>>> meaning, it seems to just add extra verbiage that obfuscates the >>>> definition. >>> >>> ------------------------------------------------------------------------------ Let >>> Crystal Reports handle the reporting - Free Crystal Reports 2008 >>> 30-Day trial. Simplify your report design, integration and >>> deployment - and focus on what you do best, core application >>> coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july >>> _______________________________________________ Geneontology- >>> oboedit-working-group mailing list Geneontology-oboedit-working-group at lists.sourceforge.net >>> https://lists.sourceforge.net/lists/listinfo/geneontology-oboedit-working-group >> >> > From cjm at berkeleybop.org Tue Sep 1 10:33:18 2009 From: cjm at berkeleybop.org (Chris Mungall) Date: Tue, 1 Sep 2009 10:33:18 -0700 Subject: [Ontology-editors] [OBO-Edit Working Group] displaying has_part In-Reply-To: References: Message-ID: An endocrine system is a non-contiguous entity. For any one instance on an endocrine system (e.g. the endocrine system that is part of me), the individual parts (my pituitary gland, my thymoid gland) are spatially disjoint (non-overlapping) On Sep 1, 2009, at 10:24 AM, Karen Christie wrote: > HI David, > > I would think of the major spliceosome (and also the minor and trans > spliceosomes) as an ordered series of complexes, each composed of > some or all of the 5 snRNPs + some other stuff. I don't know that > relates to a "non-contiguous structure". Perhaps you could provide > an example. > > -Karen > > > On Tue, 1 Sep 2009, djs93 at gen.cam.ac.uk wrote: > >> Hi Karen, >> >> I think there might be another way to deal with this issue, that is >> to get a more precise definition of the 'major spliceosome' - one >> that more closely accords with what biologists are actually >> referring too. It seems to me, that the way you use the term is >> most consistent with it referring to a non-contiguous structure - >> by which I mean one whose parts are not all connected to each >> other. As these can be confusing, I think it's important to use >> such terms sparingly, but they are unavoidable in some cases. For >> example, we need terms for functionally defined systems in anatomy. >> Instances of many such systems - e.g.- the immune system or the >> endocrine system, are non-contiguous. Is this the case for what you >> refer to here as the 'major spliceosome'? If so, then, as far as I >> can tell, the original use of part_of in this case was not incorrect. >> >> Hope this helps, rather than just muddying the waters further, >> >> David >> >> On Aug 26 2009, Karen Christie wrote: >> >>> OK, I understand your explanation about "Why should "Queries for >>> U2-type >>> spliceosomal complex should ***not*** return gene products localized >>> to U1 snRNP complex." But I think that is separate from the display >>> issue I have with has_part. >>> However, it also brings up a different issue. A question that a >>> biologist might want to ask is "give me all the proteins in the >>> major >>> spliceosome". They would expect to get the proteins present in the 5 >>> snRNPs (U1, U2, U4, U5, and U6). >> >> >>> By replacing part_of relations with >>> has_part, it seems that they might have difficulty getting a full >>> answer to that question. In your example where g1 is annotated >>> only to >>> "U1 snRNP", you give the hypothetical result: >>> >>>> There are no genes with products known to be localized to >>>> 'U2-type >>>> spliceosomal complex'. >>>> However, every 'U2-type spliceosomal complex' has the >>>> following >>>> parts: >>>> a U1 snRNP -- genes: g1 >>>> a U2 snRNP -- genes: none >>> However, if I wanted to get all proteins in the major spliceosome, I >>> don't want just gene products that are in EVERY 'U2-type >>> spliceosomal >>> complex', I want gene products that are in ANY 'U2-type spliceosomal >>> complex'. What do we need to do to make this a question that >>> biologists >>> get a reasonable answer to? >>> I understand that GO needs to implement more rigorous logic, but >>> if we >>> do it in a way that prevents biologists from getting reasonable >>> answers, we are not doing what we set out to do. >>> -Karen >>> On Mon, 24 Aug 2009, Chris Mungall wrote: >>>> On Aug 24, 2009, at 10:36 AM, Karen Christie wrote: >>>> [snipped part of dialog for now to focus on one issue] >>>>>> I would strongly advocate that even given sufficient developer >>>>>> hours it is better *not* to display the ontology in this way at >>>>>> all. It perhaps looks more comforting, but people will make the >>>>>> same comforting assumptions that no longer hold. For example, >>>>>> it looks like there is some kind of transitive relationship >>>>>> between U1 snRNP and U2-type spliceosomal complex ***which >>>>>> there is not***. It looks like the true path rule might hold. >>>>>> ***it does not*** . Queries for U2-type spliceosomal complex >>>>>> should ***not*** return gene products localized to U1 snRNP >>>>>> complex. >>>>> Why should "Queries for U2-type spliceosomal complex should >>>>> ***not*** >>>>> return gene products localized to U1 snRNP complex."? >>>> OK, good, I think we are circling on on the crux of the issue here. >>>> According to GO, it is not necessarily the case that a gp >>>> localized to a U1 snRNP is necessarily localized to a U2-type >>>> spliceosomal complex. For example, the gp may be localized to a >>>> particular U1 snRNP that is part of a penta-snRNP complex (I may >>>> not have chosen the best example as we're lacking annotations to >>>> this new term, but I can pull out an analogous example if you're >>>> not convinced) >>>> This can be seen in the sub-graph which I reproduce at the end of >>>> this email (let me know if this is not visible in some email >>>> programs, I can make a wiki page with this all on it) - there is >>>> no path following the arrows from U1 snRNP to U2-type >>>> spliceosomal complex. Note that attempting to show the graph in >>>> an "intuitive" way as I have attempted to do below, with the >>>> smaller entities at the bottom, is actually *misleading* because >>>> it leads one to assume that there is some inferred all-some >>>> relationship between these two terms when in fact there is not. >>>> Biology is complex and logic is hard. There's no escaping this. I >>>> don't believe we should simplify either to the point where we get >>>> false positives. I do think we need better ways of displaying >>>> this complex information, but I think we should focus resources >>>> on doing this in end user-facing tools rather than oboedit, as we >>>> would hope everyone using oboedit to edit the ontology would have >>>> an understanding of the logic or a willingness to learn. >>>> Here is a roughly sketched out example of how this could work >>>> Let's say PMID:123 describes an observation of a product of g1 >>>> being localized to a 'U1 snRNP' via an IDA. Let's say that's all >>>> we know, either due to the resolution of the assay or that's all >>>> that the annotator specified. The user queries for 'U2-type >>>> spliceosomal complex' >>>> The query result screen could show something like: >>>> >>>> There are no genes with products known to be localized to 'U2- >>>> type spliceosomal complex'. >>>> However, every 'U2-type spliceosomal complex' has the following >>>> parts: >>>> a U1 snRNP -- genes: g1 >>>> a U2 snRNP -- genes: none >>>> A more advanced tool would be able to show even more: >>>> >>>> PMID:123 shows g1 in U1 snRNP. >>>> prob('U2-type spliceosomal complex') = 0.83 >>>> prob('penta-snRNP complex') = 0.05 >>>> ... >>>> This leads the user to terms of relevance, shows what is known, >>>> shows what might be the case, and does not show anything that is >>>> false. >>>>> I would have >>>>> thought that they should. If I wanted to know the parts of "U2- >>>>> type >>>>> spliceosomal complex" I would want to know all the things that >>>>> compose >>>>> the series of complexes that are all considered to be a "U2-type >>>>> spliceosomal complex". >>>> I'm not sure I really understand the statement, it sounds >>>> tautological, I don't understand what adding "series of" adds. >>>> If the question is "what parts can be found in every U2-type >>>> spliceosomal complex" then the answer is found via the has_part >>>> relation and it's closure. >>>> However, this is a different question from "what gene products >>>> have been observed to be present in a U2-type spliceosomal complex >>>>> [Note that we need to revise the defs of "U2-type spliceosomal >>>>> complex >>>>> ; GO:0005684" (and its sibling "U12-type spliceosomal complex ; >>>>> GO:0005689") to be consistent with the def of the parent term >>>>> "spliceosomal complex ; GO:0005681" and specify that these terms >>>>> represent series of complexes. I'll submit a SF item for this.] >>>> >>>> https://sourceforge.net/tracker/index.php?func=detail&aid=2843718&group_id=36855&atid=440764 >>>> I don't understand the motivation here. I think the definitions >>>> should employ a consistent style, but I would change the parent >>>> from: >>>> GO:0005681 ! spliceosomal complex [DEF: "Any of a series of >>>> ribonucleoprotein complexes that... >>>> to >>>> GO:0005681 ! spliceosomal complex [DEF: "A ribonucleoprotein >>>> complex that... >>>> I'm not sure how adding "any of a series of..." changes the >>>> meaning, it seems to just add extra verbiage that obfuscates the >>>> definition. >>> >>> ------------------------------------------------------------------------------ Let >>> Crystal Reports handle the reporting - Free Crystal Reports 2008 >>> 30-Day trial. Simplify your report design, integration and >>> deployment - and focus on what you do best, core application >>> coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july >>> _______________________________________________ Geneontology- >>> oboedit-working-group mailing list Geneontology-oboedit-working-group at lists.sourceforge.net >>> https://lists.sourceforge.net/lists/listinfo/geneontology-oboedit-working-group >> >> > From rama at genome.stanford.edu Tue Sep 1 11:06:42 2009 From: rama at genome.stanford.edu (Rama Balakrishnan) Date: Tue, 1 Sep 2009 11:06:42 -0700 Subject: [Ontology-editors] [OBO-Edit Working Group] displaying has_part In-Reply-To: References: Message-ID: I haven't been following the thread completely...so this may sound tangential/ out of place/ irrelevant. Apologies and feel free to ignore this comment if that be the case. Is it possible that the fundamental issue is that we decided to capture protein complexes as cellular component terms but we did not take time to figure out where we should stop or what constitutes a complex (for example, do 3 interacting proteins constitute a complex?). Rama On Sep 1, 2009, at 10:33 AM, Chris Mungall wrote: > > An endocrine system is a non-contiguous entity. For any one instance > on an endocrine system (e.g. the endocrine system that is part of me), > the individual parts (my pituitary gland, my thymoid gland) are > spatially disjoint (non-overlapping) > > On Sep 1, 2009, at 10:24 AM, Karen Christie wrote: > >> HI David, >> >> I would think of the major spliceosome (and also the minor and trans >> spliceosomes) as an ordered series of complexes, each composed of >> some or all of the 5 snRNPs + some other stuff. I don't know that >> relates to a "non-contiguous structure". Perhaps you could provide >> an example. >> >> -Karen >> >> >> On Tue, 1 Sep 2009, djs93 at gen.cam.ac.uk wrote: >> >>> Hi Karen, >>> >>> I think there might be another way to deal with this issue, that is >>> to get a more precise definition of the 'major spliceosome' - one >>> that more closely accords with what biologists are actually >>> referring too. It seems to me, that the way you use the term is >>> most consistent with it referring to a non-contiguous structure - >>> by which I mean one whose parts are not all connected to each >>> other. As these can be confusing, I think it's important to use >>> such terms sparingly, but they are unavoidable in some cases. For >>> example, we need terms for functionally defined systems in anatomy. >>> Instances of many such systems - e.g.- the immune system or the >>> endocrine system, are non-contiguous. Is this the case for what you >>> refer to here as the 'major spliceosome'? If so, then, as far as I >>> can tell, the original use of part_of in this case was not >>> incorrect. >>> >>> Hope this helps, rather than just muddying the waters further, >>> >>> David >>> >>> On Aug 26 2009, Karen Christie wrote: >>> >>>> OK, I understand your explanation about "Why should "Queries for >>>> U2-type >>>> spliceosomal complex should ***not*** return gene products >>>> localized >>>> to U1 snRNP complex." But I think that is separate from the display >>>> issue I have with has_part. >>>> However, it also brings up a different issue. A question that a >>>> biologist might want to ask is "give me all the proteins in the >>>> major >>>> spliceosome". They would expect to get the proteins present in >>>> the 5 >>>> snRNPs (U1, U2, U4, U5, and U6). >>> >>> >>>> By replacing part_of relations with >>>> has_part, it seems that they might have difficulty getting a full >>>> answer to that question. In your example where g1 is annotated >>>> only to >>>> "U1 snRNP", you give the hypothetical result: >>>> >>>>> There are no genes with products known to be localized to >>>>> 'U2-type >>>>> spliceosomal complex'. >>>>> However, every 'U2-type spliceosomal complex' has the >>>>> following >>>>> parts: >>>>> a U1 snRNP -- genes: g1 >>>>> a U2 snRNP -- genes: none >>>> However, if I wanted to get all proteins in the major >>>> spliceosome, I >>>> don't want just gene products that are in EVERY 'U2-type >>>> spliceosomal >>>> complex', I want gene products that are in ANY 'U2-type >>>> spliceosomal >>>> complex'. What do we need to do to make this a question that >>>> biologists >>>> get a reasonable answer to? >>>> I understand that GO needs to implement more rigorous logic, but >>>> if we >>>> do it in a way that prevents biologists from getting reasonable >>>> answers, we are not doing what we set out to do. >>>> -Karen >>>> On Mon, 24 Aug 2009, Chris Mungall wrote: >>>>> On Aug 24, 2009, at 10:36 AM, Karen Christie wrote: >>>>> [snipped part of dialog for now to focus on one issue] >>>>>>> I would strongly advocate that even given sufficient developer >>>>>>> hours it is better *not* to display the ontology in this way at >>>>>>> all. It perhaps looks more comforting, but people will make the >>>>>>> same comforting assumptions that no longer hold. For example, >>>>>>> it looks like there is some kind of transitive relationship >>>>>>> between U1 snRNP and U2-type spliceosomal complex ***which >>>>>>> there is not***. It looks like the true path rule might hold. >>>>>>> ***it does not*** . Queries for U2-type spliceosomal complex >>>>>>> should ***not*** return gene products localized to U1 snRNP >>>>>>> complex. >>>>>> Why should "Queries for U2-type spliceosomal complex should >>>>>> ***not*** >>>>>> return gene products localized to U1 snRNP complex."? >>>>> OK, good, I think we are circling on on the crux of the issue >>>>> here. >>>>> According to GO, it is not necessarily the case that a gp >>>>> localized to a U1 snRNP is necessarily localized to a U2-type >>>>> spliceosomal complex. For example, the gp may be localized to a >>>>> particular U1 snRNP that is part of a penta-snRNP complex (I may >>>>> not have chosen the best example as we're lacking annotations to >>>>> this new term, but I can pull out an analogous example if you're >>>>> not convinced) >>>>> This can be seen in the sub-graph which I reproduce at the end of >>>>> this email (let me know if this is not visible in some email >>>>> programs, I can make a wiki page with this all on it) - there is >>>>> no path following the arrows from U1 snRNP to U2-type >>>>> spliceosomal complex. Note that attempting to show the graph in >>>>> an "intuitive" way as I have attempted to do below, with the >>>>> smaller entities at the bottom, is actually *misleading* because >>>>> it leads one to assume that there is some inferred all-some >>>>> relationship between these two terms when in fact there is not. >>>>> Biology is complex and logic is hard. There's no escaping this. I >>>>> don't believe we should simplify either to the point where we get >>>>> false positives. I do think we need better ways of displaying >>>>> this complex information, but I think we should focus resources >>>>> on doing this in end user-facing tools rather than oboedit, as we >>>>> would hope everyone using oboedit to edit the ontology would have >>>>> an understanding of the logic or a willingness to learn. >>>>> Here is a roughly sketched out example of how this could work >>>>> Let's say PMID:123 describes an observation of a product of g1 >>>>> being localized to a 'U1 snRNP' via an IDA. Let's say that's all >>>>> we know, either due to the resolution of the assay or that's all >>>>> that the annotator specified. The user queries for 'U2-type >>>>> spliceosomal complex' >>>>> The query result screen could show something like: >>>>> >>>>> There are no genes with products known to be localized to 'U2- >>>>> type spliceosomal complex'. >>>>> However, every 'U2-type spliceosomal complex' has the following >>>>> parts: >>>>> a U1 snRNP -- genes: g1 >>>>> a U2 snRNP -- genes: none >>>>> A more advanced tool would be able to show even more: >>>>> >>>>> PMID:123 shows g1 in U1 snRNP. >>>>> prob('U2-type spliceosomal complex') = 0.83 >>>>> prob('penta-snRNP complex') = 0.05 >>>>> ... >>>>> This leads the user to terms of relevance, shows what is known, >>>>> shows what might be the case, and does not show anything that is >>>>> false. >>>>>> I would have >>>>>> thought that they should. If I wanted to know the parts of "U2- >>>>>> type >>>>>> spliceosomal complex" I would want to know all the things that >>>>>> compose >>>>>> the series of complexes that are all considered to be a "U2-type >>>>>> spliceosomal complex". >>>>> I'm not sure I really understand the statement, it sounds >>>>> tautological, I don't understand what adding "series of" adds. >>>>> If the question is "what parts can be found in every U2-type >>>>> spliceosomal complex" then the answer is found via the has_part >>>>> relation and it's closure. >>>>> However, this is a different question from "what gene products >>>>> have been observed to be present in a U2-type spliceosomal complex >>>>>> [Note that we need to revise the defs of "U2-type spliceosomal >>>>>> complex >>>>>> ; GO:0005684" (and its sibling "U12-type spliceosomal complex ; >>>>>> GO:0005689") to be consistent with the def of the parent term >>>>>> "spliceosomal complex ; GO:0005681" and specify that these terms >>>>>> represent series of complexes. I'll submit a SF item for this.] >>>>> >>>>> https://sourceforge.net/tracker/index.php?func=detail&aid=2843718&group_id=36855&atid=440764 >>>>> I don't understand the motivation here. I think the definitions >>>>> should employ a consistent style, but I would change the parent >>>>> from: >>>>> GO:0005681 ! spliceosomal complex [DEF: "Any of a series of >>>>> ribonucleoprotein complexes that... >>>>> to >>>>> GO:0005681 ! spliceosomal complex [DEF: "A ribonucleoprotein >>>>> complex that... >>>>> I'm not sure how adding "any of a series of..." changes the >>>>> meaning, it seems to just add extra verbiage that obfuscates the >>>>> definition. >>>> >>>> ------------------------------------------------------------------------------ Let >>>> Crystal Reports handle the reporting - Free Crystal Reports 2008 >>>> 30-Day trial. Simplify your report design, integration and >>>> deployment - and focus on what you do best, core application >>>> coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july >>>> _______________________________________________ Geneontology- >>>> oboedit-working-group mailing list Geneontology-oboedit-working-group at lists.sourceforge.net >>>> https://lists.sourceforge.net/lists/listinfo/geneontology-oboedit-working-group >>> >>> >> > > > ------------------------------------------------------------------------------ > Let Crystal Reports handle the reporting - Free Crystal Reports 2008 > 30-Day > trial. Simplify your report design, integration and deployment - and > focus on > what you do best, core application coding. Discover what's new with > Crystal Reports now. http://p.sf.net/sfu/bobj-july > _______________________________________________ > Geneontology-oboedit-working-group mailing list > Geneontology-oboedit-working-group at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/geneontology-oboedit-working-group From kchris at genome.stanford.edu Tue Sep 1 15:40:42 2009 From: kchris at genome.stanford.edu (Karen Christie) Date: Tue, 1 Sep 2009 15:40:42 -0700 (PDT) Subject: [Ontology-editors] [OBO-Edit Working Group] displaying has_part In-Reply-To: References: Message-ID: Responding to both of Chris's last two emails together. By comparison with the endocrine system, I don't think the spliceosome can be considered a non-contiguous entity. The situation doesn't seem comparable. Perhaps you'll agree with that once I explain what I mean by 'ordered series', though let me say before I start that I only care about the order as a biologist. GO doesn't represent the order, just all the complexes that are observed. People talk about "the" spliceosome, but there isn't really a single complex that could be referred to as "the spliceosome". Originally, when people first isolated a big ribonucleoprotein complex involved in mRNA splicing, they probably assumed it had a static composition, as seems to be the case for the ribosome. Be that as it may, 5 snRNPs, each containing 1 snRNA (U1, U2, U4, U5, or U6) and a bunch of proteins (a mostly common ring of 7, plus proteins unique to each snRNP), are considered to make up "the spliceosome". For simplicity, I'm just going to use the RNA name, e.g. U1, but when I talk about something like U1, I actually mean the entire snRNP, so the RNA + 9 or more proteins, depending on which one it is. When people study assembly of the spliceosome, it turns out that most people see an ordered assembly pathway as the snRNPs assemble onto the pre-mRNA, rather than an existing "complete spliceosome". Basically, U1 and U2 seem to assemble onto the pre-mRNA, while U4, U5, and U6 form a tri-snRNP called U4/U6.U5. The tri-snRNP U4/U6.U5 joins the other two snRNPs already on the mRNA to form something called the prespliceosome (or A2-1). The prespliceosome is the only complex in the series that contains all 5 snRNPs, but it is NOT catalytic. In order to become catalytic, it undergoes a major conformational change and spits out U1 and U4. So both of the two catalytic conformations (A1 and A2-2) contain U2, U5, and U6, but not U1 or U4. However, a biologist will still want to see the gene products that are in U1 and U4 as spliceosomal proteins because they both play key roles in the process of splicing in the recognition of the splice sites. The GO term used to be named "spliceosome". In the recent component changes that introduced the has_part relationship, we changed the name to "spliceosomal complex" because it is basically a grouping term collecting the various different complexes that are seen in the splicesosome cycle. The pathway is normally drawn as a circular diagram representing the cyclic assembly and disassambly, but for email, here it is as a list: Complex Label (yeast) Constituents ------------- -------------------------------------------- CC U1, pre-mRNA B U1, U2, pre-mRNA U4/U6.U5 U4, U5, U6 A2-1 U1, U2, U4/U6, U5, pre-mRNA (non-catalytic pre-spliceosome) A1 U2, U6, U5, pre-mRNA (catalytic for 1st step) A2-2 U2, U6, U5, 5'exon and lariat RNA (catalytic for 2nd step) A2-3 U2, U6, U5, spliced mRNA and excised lariat RNA I U2, U6, U5, excised lariat RNA Or this version might be more illustrative: Complex some Label (yeast) U1 U2 U4 U5 U6 RNA ------------------------------------------------------------ CC + + B + + + U4/U6.U5 + + + A2-1 + + + + + + A1 + + + + A2-2 + + + + A2-3 + + + + I + + + + -Karen On Tue, 1 Sep 2009, Chris Mungall wrote: > Can you explain what you mean by 'ordered series'? On Tue, 1 Sep 2009, Chris Mungall wrote: > > An endocrine system is a non-contiguous entity. For any one instance on an > endocrine system (e.g. the endocrine system that is part of me), the > individual parts (my pituitary gland, my thymoid gland) are spatially > disjoint (non-overlapping) > > On Sep 1, 2009, at 10:24 AM, Karen Christie wrote: > >> HI David, >> >> I would think of the major spliceosome (and also the minor and trans >> spliceosomes) as an ordered series of complexes, each composed of some or >> all of the 5 snRNPs + some other stuff. I don't know that relates to a >> "non-contiguous structure". Perhaps you could provide an example. >> >> -Karen >> >> >> On Tue, 1 Sep 2009, djs93 at gen.cam.ac.uk wrote: >> >>> Hi Karen, >>> >>> I think there might be another way to deal with this issue, that is to get >>> a more precise definition of the 'major spliceosome' - one that more >>> closely accords with what biologists are actually referring too. It seems >>> to me, that the way you use the term is most consistent with it referring >>> to a non-contiguous structure - by which I mean one whose parts are not >>> all connected to each other. As these can be confusing, I think it's >>> important to use such terms sparingly, but they are unavoidable in some >>> cases. For example, we need terms for functionally defined systems in >>> anatomy. Instances of many such systems - e.g.- the immune system or the >>> endocrine system, are non-contiguous. Is this the case for what you refer >>> to here as the 'major spliceosome'? If so, then, as far as I can tell, the >>> original use of part_of in this case was not incorrect. >>> >>> Hope this helps, rather than just muddying the waters further, >>> >>> David >>> >>> On Aug 26 2009, Karen Christie wrote: >>> >>>> OK, I understand your explanation about "Why should "Queries for U2-type >>>> spliceosomal complex should ***not*** return gene products localized >>>> to U1 snRNP complex." But I think that is separate from the display >>>> issue I have with has_part. >>>> However, it also brings up a different issue. A question that a >>>> biologist might want to ask is "give me all the proteins in the major >>>> spliceosome". They would expect to get the proteins present in the 5 >>>> snRNPs (U1, U2, U4, U5, and U6). >>> >>> >>>> By replacing part_of relations with >>>> has_part, it seems that they might have difficulty getting a full >>>> answer to that question. In your example where g1 is annotated only to >>>> "U1 snRNP", you give the hypothetical result: >>>> >>>>> There are no genes with products known to be localized to 'U2-type >>>>> spliceosomal complex'. >>>>> However, every 'U2-type spliceosomal complex' has the following >>>>> parts: >>>>> a U1 snRNP -- genes: g1 >>>>> a U2 snRNP -- genes: none >>>> However, if I wanted to get all proteins in the major spliceosome, I >>>> don't want just gene products that are in EVERY 'U2-type spliceosomal >>>> complex', I want gene products that are in ANY 'U2-type spliceosomal >>>> complex'. What do we need to do to make this a question that biologists >>>> get a reasonable answer to? >>>> I understand that GO needs to implement more rigorous logic, but if we >>>> do it in a way that prevents biologists from getting reasonable >>>> answers, we are not doing what we set out to do. >>>> -Karen >>>> On Mon, 24 Aug 2009, Chris Mungall wrote: >>>>> On Aug 24, 2009, at 10:36 AM, Karen Christie wrote: >>>>> [snipped part of dialog for now to focus on one issue] >>>>>>> I would strongly advocate that even given sufficient developer hours >>>>>>> it is better *not* to display the ontology in this way at all. It >>>>>>> perhaps looks more comforting, but people will make the same >>>>>>> comforting assumptions that no longer hold. For example, it looks like >>>>>>> there is some kind of transitive relationship between U1 snRNP and >>>>>>> U2-type spliceosomal complex ***which there is not***. It looks like >>>>>>> the true path rule might hold. ***it does not*** . Queries for U2-type >>>>>>> spliceosomal complex should ***not*** return gene products localized >>>>>>> to U1 snRNP complex. >>>>>> Why should "Queries for U2-type spliceosomal complex should ***not*** >>>>>> return gene products localized to U1 snRNP complex."? >>>>> OK, good, I think we are circling on on the crux of the issue here. >>>>> According to GO, it is not necessarily the case that a gp localized to a >>>>> U1 snRNP is necessarily localized to a U2-type spliceosomal complex. For >>>>> example, the gp may be localized to a particular U1 snRNP that is part >>>>> of a penta-snRNP complex (I may not have chosen the best example as >>>>> we're lacking annotations to this new term, but I can pull out an >>>>> analogous example if you're not convinced) >>>>> This can be seen in the sub-graph which I reproduce at the end of this >>>>> email (let me know if this is not visible in some email programs, I can >>>>> make a wiki page with this all on it) - there is no path following the >>>>> arrows from U1 snRNP to U2-type spliceosomal complex. Note that >>>>> attempting to show the graph in an "intuitive" way as I have attempted >>>>> to do below, with the smaller entities at the bottom, is actually >>>>> *misleading* because it leads one to assume that there is some inferred >>>>> all-some relationship between these two terms when in fact there is not. >>>>> Biology is complex and logic is hard. There's no escaping this. I don't >>>>> believe we should simplify either to the point where we get false >>>>> positives. I do think we need better ways of displaying this complex >>>>> information, but I think we should focus resources on doing this in end >>>>> user-facing tools rather than oboedit, as we would hope everyone using >>>>> oboedit to edit the ontology would have an understanding of the logic or >>>>> a willingness to learn. >>>>> Here is a roughly sketched out example of how this could work >>>>> Let's say PMID:123 describes an observation of a product of g1 being >>>>> localized to a 'U1 snRNP' via an IDA. Let's say that's all we know, >>>>> either due to the resolution of the assay or that's all that the >>>>> annotator specified. The user queries for 'U2-type spliceosomal complex' >>>>> The query result screen could show something like: >>>>> >>>>> There are no genes with products known to be localized to 'U2-type >>>>> spliceosomal complex'. >>>>> However, every 'U2-type spliceosomal complex' has the following >>>>> parts: >>>>> a U1 snRNP -- genes: g1 >>>>> a U2 snRNP -- genes: none >>>>> A more advanced tool would be able to show even more: >>>>> >>>>> PMID:123 shows g1 in U1 snRNP. >>>>> prob('U2-type spliceosomal complex') = 0.83 >>>>> prob('penta-snRNP complex') = 0.05 >>>>> ... >>>>> This leads the user to terms of relevance, shows what is known, shows >>>>> what might be the case, and does not show anything that is false. >>>>>> I would have >>>>>> thought that they should. If I wanted to know the parts of "U2-type >>>>>> spliceosomal complex" I would want to know all the things that compose >>>>>> the series of complexes that are all considered to be a "U2-type >>>>>> spliceosomal complex". >>>>> I'm not sure I really understand the statement, it sounds tautological, >>>>> I don't understand what adding "series of" adds. >>>>> If the question is "what parts can be found in every U2-type >>>>> spliceosomal complex" then the answer is found via the has_part relation >>>>> and it's closure. >>>>> However, this is a different question from "what gene products have been >>>>> observed to be present in a U2-type spliceosomal complex >>>>>> [Note that we need to revise the defs of "U2-type spliceosomal complex >>>>>> ; GO:0005684" (and its sibling "U12-type spliceosomal complex ; >>>>>> GO:0005689") to be consistent with the def of the parent term >>>>>> "spliceosomal complex ; GO:0005681" and specify that these terms >>>>>> represent series of complexes. I'll submit a SF item for this.] >>>>> >>>>> https://sourceforge.net/tracker/index.php?func=detail&aid=2843718&group_id=36855&atid=440764 >>>>> I don't understand the motivation here. I think the definitions should >>>>> employ a consistent style, but I would change the parent from: >>>>> GO:0005681 ! spliceosomal complex [DEF: "Any of a series of >>>>> ribonucleoprotein complexes that... >>>>> to >>>>> GO:0005681 ! spliceosomal complex [DEF: "A ribonucleoprotein complex >>>>> that... >>>>> I'm not sure how adding "any of a series of..." changes the meaning, it >>>>> seems to just add extra verbiage that obfuscates the definition. >>>> >>>> ------------------------------------------------------------------------------ >>>> Let Crystal Reports handle the reporting - Free Crystal Reports 2008 >>>> 30-Day trial. Simplify your report design, integration and deployment - >>>> and focus on what you do best, core application coding. Discover what's >>>> new with Crystal Reports now. http://p.sf.net/sfu/bobj-july >>>> _______________________________________________ >>>> Geneontology-oboedit-working-group mailing list >>>> Geneontology-oboedit-working-group at lists.sourceforge.net >>>> https://lists.sourceforge.net/lists/listinfo/geneontology-oboedit-working-group >>> >>> > From cjm at berkeleybop.org Tue Sep 1 18:25:24 2009 From: cjm at berkeleybop.org (Chris Mungall) Date: Tue, 1 Sep 2009 18:25:24 -0700 Subject: [Ontology-editors] [OBO-Edit Working Group] displaying has_part In-Reply-To: References: Message-ID: <9EC14AF7-DE08-41DA-AE09-BC4E5FB3C124@berkeleybop.org> Thanks for your clear explanation. Yes, the whole contiguous/non-contiguous thread was a red herring (but perhaps a useful one). I actually think has_part vs part_of may also be something of a red herring, although it was clearly revisions related to the introduction of has_part that caused this issue to surface. Based on my (superficial) understanding of the spliceosome I believe that it is the temporal/change issue that is of most relevance here, rather than the directionality of the all-some relationship. A lot of the time we get away with treating cell components as static structures - but this gets us into trouble with highly dynamic changing structure that gains or loses parts - such as a spliceosome. DavidOS has thought a lot about this in the context of gross anatomy and development (where the static partonomy view also breaks down). I will focus on all-some part_of relations here because I think that this is where both the problem and the solution lies. In GO we take the relatively strict definition of part_of from the OBO Relations Ontology: P part_of W if and only if for all p, t : if p is an instance of P at time t, then there exists some w such that w is an instance of W at t, and p is part_of w at t. or stated another way - there exists no time at which an instance of P is not part of an instance of W This means that strictly speaking we cannot say U4 part_of spliceosome complex because there exists many instances of U4s which *for a period of time* are not part of a spliceosome - specifically for the duration of their existence post-dissociation from the spliceosome. However, if we assert in the ontology it will solve one of your problems, specifically the one you mention below, in which a user currently querying for spliceosome does not get back gene products localized to U4. Given that this is a desirable result for at least some gene products, let's explore going ahead and adding this to the ontology whilst retaining the current definition of part_of. First of all, will this give us any false positives? Potentially. Imagine there is a gene product p that is localized to U4, and -- crucially -- p is *only* ever localized to U4 *after* dissociation from the prespliceosome. Now this may be stretching things a bit, but imagine U4 has some post-dissociation function for which p is required (I can probably think of more convincing analogous examples at the gross anatomical level, but I'm sure there are examples in GO-CC). In this case it would be wrong to return p in a query for spliceosome (although it may be desirable to guide the user to make further queries for things that are *potentially* part of the spliceosome). Hopefully this at least convinces that the problem is non-trivial and that a logical analysis is helpful. What are our options from here? Option 1 - weaken part_of : we simply use the weaker time-neutral definition of part_of. This would increase the sensitivity of queries but decrease the specificity (more false positives). I have a feeling we could provide part_of relationships for many protein complexes that currently have no part_of relationships at the moment. It could even be argued that some of the part_ofs in GO-CC right now already follow this weaker interpretation. One consequence of weakening part_of in this way is that it is no longer guaranteed to be transitive. I think it would be a mistake to weaken the relation in this way, but there are valid arguments to be made on the other side. Option 2 - introduce an additional part_of relation. Here the current part_of relation would retain its current strict meaning, but we would have a new relation which I'll call part_of_unless_dissociated for want of a better label. This could be used for the relationship between U4 and spliceosome, such that every U4 is part_of a spliceosome at some point in its existence, but that dissociation is allowed. I think this would confuse people horribly. Option 3 - introduce two new subtypes of U4: U4_in_splicesome and U4_dissociated_from_spliceosome. The (normal) part_of relation would be used for the former. This is awkward, and the dissociated subtype may never be used if U4 has no post-dissociation function. Option 4 - (a variant of the previous one) - define U4 to mean U4_in_spliceosome, and ignore dissociated U4s. We could then make a valid link without weakening part_of. Option 5 - keep the current structure of GO and current definition of part_of. Formalize the annotation protocol such that if I am annotating a protein p that is located in some complex A which is associated with some larger complex B, and there is no part_of relationship between A and B in GO, then co-annotate to both. The problem here is that co-annotation to A and B is weaker than annotating to . Another issue is that this is more work for annotators. This could partly be mitigated by using probabilistic methods to suggest co-annotations where they are likely (e.g. if p is in a U4 then it's likely it's in a splicesome rather than a dissociated U4) Option 6 - use the annotation cross-product column. The problem here is that most MODs do not plan to allow for this level of expressivity in their databases any time in the near future. Option 7 - add derives_from links in CC (just like almost all other anatomical ontologies) - these could be indirect via the corresponding assembly/disassembly processes. This would allow greater precision in querying. For example, I could ask "what products are localized to the spliceosome" vs "what products are localized to the spliceosome or any of its derivatives" (this would be done in a user interface via some simple menu or popup selection). Only the latter would return products localized to U4. Orthogonal to all of the above options is the option of using has_part (or any other relation) to guide users to other queries (see my email from last week). But I would like to distinguish between this kind of navigational use of the relations vs the use of relations to give guaranteed true answers to well-posed questions. I would advocate strengthening 5 whilst working towards 7, and at the same time improving interfaces to allow for better navigation using gene_ontology_ext I think we are only scratching the tip of the iceberg here with has_part. Consider some of the other relationships from some of the various XP files that will eventually be integrated into gene_ontology_ext. HDAC complex has_function histone deacetylase activity mitochondrial translation occurs_in mitochondrion spindle pole body {localization,binding,..} ? spindle pole body protein import into mitochondrial matrix results_in_transport_to mitochondrial matrix etc It's important everyone is prepared for these On Sep 1, 2009, at 3:40 PM, Karen Christie wrote: > Responding to both of Chris's last two emails together. > > By comparison with the endocrine system, I don't think the spliceosome > can be considered a non-contiguous entity. The situation doesn't seem > comparable. Perhaps you'll agree with that once I explain what I mean > by 'ordered series', though let me say before I start that I only care > about the order as a biologist. GO doesn't represent the order, just > all the complexes that are observed. > > People talk about "the" spliceosome, but there isn't really a single > complex that could be referred to as "the spliceosome". Originally, > when people first isolated a big ribonucleoprotein complex involved in > mRNA splicing, they probably assumed it had a static composition, as > seems to be the case for the ribosome. Be that as it may, 5 snRNPs, > each containing 1 snRNA (U1, U2, U4, U5, or U6) and a bunch of > proteins (a mostly common ring of 7, plus proteins unique to each > snRNP), are considered to make up "the spliceosome". For simplicity, > I'm just going to use the RNA name, e.g. U1, but when I talk about > something like U1, I actually mean the entire snRNP, so the RNA + 9 or > more proteins, depending on which one it is. > > When people study assembly of the spliceosome, it turns out that most > people see an ordered assembly pathway as the snRNPs assemble onto the > pre-mRNA, rather than an existing "complete spliceosome". Basically, > U1 and U2 seem to assemble onto the pre-mRNA, while U4, U5, and U6 > form a tri-snRNP called U4/U6.U5. The tri-snRNP U4/U6.U5 joins the > other two snRNPs already on the mRNA to form something called the > prespliceosome (or A2-1). The prespliceosome is the only complex in > the series that contains all 5 snRNPs, but it is NOT catalytic. In > order to become catalytic, it undergoes a major conformational change > and spits out U1 and U4. So both of the two catalytic conformations > (A1 and A2-2) contain U2, U5, and U6, but not U1 or U4. However, a > biologist will still want to see the gene products that are in U1 and > U4 as spliceosomal proteins because they both play key roles in the > process of splicing in the recognition of the splice sites. > > The GO term used to be named "spliceosome". In the recent component > changes that introduced the has_part relationship, we changed the name > to "spliceosomal complex" because it is basically a grouping term > collecting the various different complexes that are seen in the > splicesosome cycle. > > The pathway is normally drawn as a circular diagram representing the > cyclic assembly and disassambly, but for email, here it is as a list: > > Complex Label (yeast) Constituents > ------------- -------------------------------------------- > CC U1, pre-mRNA > B U1, U2, pre-mRNA > U4/U6.U5 U4, U5, U6 > A2-1 U1, U2, U4/U6, U5, pre-mRNA (non-catalytic pre-spliceosome) > A1 U2, U6, U5, pre-mRNA (catalytic for 1st step) > A2-2 U2, U6, U5, 5'exon and lariat RNA (catalytic for 2nd step) > A2-3 U2, U6, U5, spliced mRNA and excised lariat RNA > I U2, U6, U5, excised lariat RNA > > > Or this version might be more illustrative: > > Complex some > Label (yeast) U1 U2 U4 U5 U6 RNA > ------------------------------------------------------------ > CC + + > B + + + > U4/U6.U5 + + + A2-1 + + + + + + > A1 + + + + > A2-2 + + + + > A2-3 + + + + > I + + + + > > > -Karen > > > > > > > On Tue, 1 Sep 2009, Chris Mungall wrote: > >> Can you explain what you mean by 'ordered series'? > > > > On Tue, 1 Sep 2009, Chris Mungall wrote: >> >> An endocrine system is a non-contiguous entity. For any one >> instance on an endocrine system (e.g. the endocrine system that is >> part of me), the individual parts (my pituitary gland, my thymoid >> gland) are spatially disjoint (non-overlapping) >> >> On Sep 1, 2009, at 10:24 AM, Karen Christie wrote: >> >>> HI David, >>> I would think of the major spliceosome (and also the minor and >>> trans spliceosomes) as an ordered series of complexes, each >>> composed of some or all of the 5 snRNPs + some other stuff. I >>> don't know that relates to a "non-contiguous structure". Perhaps >>> you could provide an example. >>> -Karen >>> On Tue, 1 Sep 2009, djs93 at gen.cam.ac.uk wrote: >>>> Hi Karen, >>>> I think there might be another way to deal with this issue, that >>>> is to get a more precise definition of the 'major spliceosome' - >>>> one that more closely accords with what biologists are actually >>>> referring too. It seems to me, that the way you use the term is >>>> most consistent with it referring to a non-contiguous structure - >>>> by which I mean one whose parts are not all connected to each >>>> other. As these can be confusing, I think it's important to use >>>> such terms sparingly, but they are unavoidable in some cases. For >>>> example, we need terms for functionally defined systems in >>>> anatomy. Instances of many such systems - e.g.- the immune system >>>> or the endocrine system, are non-contiguous. Is this the case for >>>> what you refer to here as the 'major spliceosome'? If so, then, >>>> as far as I can tell, the original use of part_of in this case >>>> was not incorrect. >>>> Hope this helps, rather than just muddying the waters further, >>>> David >>>> On Aug 26 2009, Karen Christie wrote: >>>>> OK, I understand your explanation about "Why should "Queries for >>>>> U2-type >>>>> spliceosomal complex should ***not*** return gene products >>>>> localized >>>>> to U1 snRNP complex." But I think that is separate from the >>>>> display >>>>> issue I have with has_part. >>>>> However, it also brings up a different issue. A question that a >>>>> biologist might want to ask is "give me all the proteins in the >>>>> major >>>>> spliceosome". They would expect to get the proteins present in >>>>> the 5 >>>>> snRNPs (U1, U2, U4, U5, and U6). >>>>> By replacing part_of relations with >>>>> has_part, it seems that they might have difficulty getting a full >>>>> answer to that question. In your example where g1 is annotated >>>>> only to >>>>> "U1 snRNP", you give the hypothetical result: >>>>> >>>>>> There are no genes with products known to be localized to >>>>>> 'U2-type >>>>>> spliceosomal complex'. >>>>>> However, every 'U2-type spliceosomal complex' has the >>>>>> following >>>>>> parts: >>>>>> a U1 snRNP -- genes: g1 >>>>>> a U2 snRNP -- genes: none >>>>> However, if I wanted to get all proteins in the major >>>>> spliceosome, I >>>>> don't want just gene products that are in EVERY 'U2-type >>>>> spliceosomal >>>>> complex', I want gene products that are in ANY 'U2-type >>>>> spliceosomal >>>>> complex'. What do we need to do to make this a question that >>>>> biologists >>>>> get a reasonable answer to? >>>>> I understand that GO needs to implement more rigorous logic, but >>>>> if we >>>>> do it in a way that prevents biologists from getting reasonable >>>>> answers, we are not doing what we set out to do. >>>>> -Karen >>>>> On Mon, 24 Aug 2009, Chris Mungall wrote: >>>>>> On Aug 24, 2009, at 10:36 AM, Karen Christie wrote: >>>>>> [snipped part of dialog for now to focus on one issue] >>>>>>>> I would strongly advocate that even given sufficient >>>>>>>> developer hours it is better *not* to display the ontology in >>>>>>>> this way at all. It perhaps looks more comforting, but people >>>>>>>> will make the same comforting assumptions that no longer >>>>>>>> hold. For example, it looks like there is some kind of >>>>>>>> transitive relationship between U1 snRNP and U2-type >>>>>>>> spliceosomal complex ***which there is not***. It looks like >>>>>>>> the true path rule might hold. ***it does not*** . Queries >>>>>>>> for U2-type spliceosomal complex should ***not*** return gene >>>>>>>> products localized to U1 snRNP complex. >>>>>>> Why should "Queries for U2-type spliceosomal complex should >>>>>>> ***not*** >>>>>>> return gene products localized to U1 snRNP complex."? >>>>>> OK, good, I think we are circling on on the crux of the issue >>>>>> here. >>>>>> According to GO, it is not necessarily the case that a gp >>>>>> localized to a U1 snRNP is necessarily localized to a U2-type >>>>>> spliceosomal complex. For example, the gp may be localized to a >>>>>> particular U1 snRNP that is part of a penta-snRNP complex (I >>>>>> may not have chosen the best example as we're lacking >>>>>> annotations to this new term, but I can pull out an analogous >>>>>> example if you're not convinced) >>>>>> This can be seen in the sub-graph which I reproduce at the end >>>>>> of this email (let me know if this is not visible in some email >>>>>> programs, I can make a wiki page with this all on it) - there >>>>>> is no path following the arrows from U1 snRNP to U2-type >>>>>> spliceosomal complex. Note that attempting to show the graph in >>>>>> an "intuitive" way as I have attempted to do below, with the >>>>>> smaller entities at the bottom, is actually *misleading* >>>>>> because it leads one to assume that there is some inferred all- >>>>>> some relationship between these two terms when in fact there is >>>>>> not. >>>>>> Biology is complex and logic is hard. There's no escaping this. >>>>>> I don't believe we should simplify either to the point where we >>>>>> get false positives. I do think we need better ways of >>>>>> displaying this complex information, but I think we should >>>>>> focus resources on doing this in end user-facing tools rather >>>>>> than oboedit, as we would hope everyone using oboedit to edit >>>>>> the ontology would have an understanding of the logic or a >>>>>> willingness to learn. >>>>>> Here is a roughly sketched out example of how this could work >>>>>> Let's say PMID:123 describes an observation of a product of g1 >>>>>> being localized to a 'U1 snRNP' via an IDA. Let's say that's >>>>>> all we know, either due to the resolution of the assay or >>>>>> that's all that the annotator specified. The user queries for >>>>>> 'U2-type spliceosomal complex' >>>>>> The query result screen could show something like: >>>>>> >>>>>> There are no genes with products known to be localized to 'U2- >>>>>> type spliceosomal complex'. >>>>>> However, every 'U2-type spliceosomal complex' has the >>>>>> following parts: >>>>>> a U1 snRNP -- genes: g1 >>>>>> a U2 snRNP -- genes: none >>>>>> A more advanced tool would be able to show even more: >>>>>> >>>>>> PMID:123 shows g1 in U1 snRNP. >>>>>> prob('U2-type spliceosomal complex') = 0.83 >>>>>> prob('penta-snRNP complex') = 0.05 >>>>>> ... >>>>>> This leads the user to terms of relevance, shows what is known, >>>>>> shows what might be the case, and does not show anything that >>>>>> is false. >>>>>>> I would have >>>>>>> thought that they should. If I wanted to know the parts of "U2- >>>>>>> type >>>>>>> spliceosomal complex" I would want to know all the things that >>>>>>> compose >>>>>>> the series of complexes that are all considered to be a "U2-type >>>>>>> spliceosomal complex". >>>>>> I'm not sure I really understand the statement, it sounds >>>>>> tautological, I don't understand what adding "series of" adds. >>>>>> If the question is "what parts can be found in every U2-type >>>>>> spliceosomal complex" then the answer is found via the has_part >>>>>> relation and it's closure. >>>>>> However, this is a different question from "what gene products >>>>>> have been observed to be present in a U2-type spliceosomal >>>>>> complex >>>>>>> [Note that we need to revise the defs of "U2-type spliceosomal >>>>>>> complex >>>>>>> ; GO:0005684" (and its sibling "U12-type spliceosomal complex ; >>>>>>> GO:0005689") to be consistent with the def of the parent term >>>>>>> "spliceosomal complex ; GO:0005681" and specify that these terms >>>>>>> represent series of complexes. I'll submit a SF item for this.] >>>>>> https://sourceforge.net/tracker/index.php?func=detail&aid=2843718&group_id=36855&atid=440764 >>>>>> I don't understand the motivation here. I think the definitions >>>>>> should employ a consistent style, but I would change the parent >>>>>> from: >>>>>> GO:0005681 ! spliceosomal complex [DEF: "Any of a series of >>>>>> ribonucleoprotein complexes that... >>>>>> to >>>>>> GO:0005681 ! spliceosomal complex [DEF: "A ribonucleoprotein >>>>>> complex that... >>>>>> I'm not sure how adding "any of a series of..." changes the >>>>>> meaning, it seems to just add extra verbiage that obfuscates >>>>>> the definition. >>>>> ------------------------------------------------------------------------------ Let >>>>> Crystal Reports handle the reporting - Free Crystal Reports >>>>> 2008 30-Day trial. Simplify your report design, integration and >>>>> deployment - and focus on what you do best, core application >>>>> coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july >>>>> _______________________________________________ Geneontology- >>>>> oboedit-working-group mailing list Geneontology-oboedit-working-group at lists.sourceforge.net >>>>> https://lists.sourceforge.net/lists/listinfo/geneontology-oboedit-working-group >> > From cjm at berkeleybop.org Tue Sep 1 20:47:00 2009 From: cjm at berkeleybop.org (Chris Mungall) Date: Tue, 1 Sep 2009 20:47:00 -0700 Subject: [Ontology-editors] Inter-ontology links and inference of annotations In-Reply-To: <9EC14AF7-DE08-41DA-AE09-BC4E5FB3C124@berkeleybop.org> References: <9EC14AF7-DE08-41DA-AE09-BC4E5FB3C124@berkeleybop.org> Message-ID: <9AE0D4F8-079E-45CC-BBC3-79EE14798BD0@berkeleybop.org> This is quite annotation-centric but I thought I would discuss this here first. I will talk about this at the meeting. At the last meeting there was some confusion about the F->P part_of links. Annotators were not sure whether they needed to make the redundant annotation to the P from an F annotation for which an F->P direct or indirect link exists. There is no need for annotators to make the redundant annotation. This annotation step can be 100% automated. We do this ahead of time rather than at query time, because some query tools may not be aware of the links (i.e they do not load go_ext because this would break their summary statistics). Here are the results of this automated step: http://www.geneontology.org/scratch/gaf-inference/ Each of these are supplemental GAFs that can be either fed back to the contributors or loaded separately into the GO database. The inferred annotations are treated as ICs, and the F goes in the WITH column. Hopefully this is quite straightforward. More complex are the other non-part_of intra-GO inter-ontology links from the various XP files that will eventually make their way into gene_ontology_write and gene_ontology_ext; for example: > HDAC complex has_function histone deacetylase activity > mitochondrial translation occurs_in mitochondrion > spindle pole body {localization,binding,..} ? spindle pole body > protein import into mitochondrial matrix results_in_transport_to > mitochondrial matrix > etc What are the rules for propagating gene products? We need clear rules here. For example, propagation across occurs_in seems natural - if a gene product participates in P and all the participants of P are located_in C, then P must be localized to C. We can use existing annotations as a guide; but we should do so warily, as there are probably many annotation gaps, just as there are for the F->P links. For any XP relationship that spans 2 GO ontologies we can ask how correlated the resulting annotations are; specifically, if the XP set has a relationship we can ask what is p(target|source). For example, what is the probability of a gene being annotated to 'mitochondrion' if it is annotated to 'mitochondrial translation'? We expect there to be some correlation if the terms are related. In some cases (such as the above) we may expect p ~ 1. The results are in the xps directory - http://www.geneontology.org/scratch/xps/ Look for files *.correl and *correl.avgs here is a summary: > has_participant 0.07145 > results_in_increased_length_of 0.09485 > results_in_change_to 0.1053 > results_in_complete_development_of 0.2023 > results_in_binding_of 0.225558 > results_in_transport_of 0.244143 > results_in_remodeling_of 0.260986 > results_in_transport_along 0.263867 > results_in_breakdown_of 0.30087 > results_in_formation_of 0.305972 > results_in_fusion_of 0.362487 > results_in_organization_of 0.385999 > results_in_connection_of 0.3986 > results_in_transport_to 0.401464 > results_in_morphogenesis_of 0.4231 > results_in_localization_of 0.439052 > results_in_transport_from 0.453718 > results_in_localization_to 0.4632 > unfolds_in 0.486606 > mediated_by 0.487917 > results_in_distribution_of 0.56538 > dependent_on 0.6052 > results_in_transport_involving 0.625 > has_level 0.68916 > realizes 0.695027 > results_in_division_of 0.715433 > has_function 0.718474 > results_in_maturation_of 0.7287 > occurs_in 0.78525 > results_in_transport_towards 0.8027 > results_in_transport_across 0.843125 > part_of-part_of 1.0 results_in_transport_across is high - for example: > GO:0034219-carbohydrate transmembrane transport GO:0016020- > membrane 0.8333 > GO:0034220-ion transmembrane transport GO:0016020-membrane 0.9021 > GO:0055085-transmembrane transport GO:0016020-membrane 0.6857 > GO:0034486-vacuolar transmembrane transport GO:0005774-vacuolar > membrane 1.0000 > GO:0034487-vacuolar amino acid transport GO:0005774-vacuolar > membrane 1.0000 > GO:0034755-iron ion transmembrane transport GO:0016020- > membrane 0.6471 > GO:0034775-glutathione transmembrane transport GO:0016020- > membrane 1.0000 > GO:0065002-intracellular protein transmembrane transport GO: > 0016020-membrane 0.6768 this seems reasonable. if a gene product is involved in X transmembrane transport then there's a good chance it's in the X membrane. I don't think we can say it should *always* be in the X membrane, but there is a good chance it is. This is an example of how the xp sets can be used - to suggest IC annotations to annotators and to make the whole IC system more consistent - to provide users navigational clues (e.g. queries for X transmembrane transport may return fewer results than expected, so the user can be prompted to try a query for X membrane) From djs93 at gen.cam.ac.uk Tue Sep 1 10:15:47 2009 From: djs93 at gen.cam.ac.uk (djs93 at gen.cam.ac.uk) Date: 01 Sep 2009 18:15:47 +0100 Subject: [Ontology-editors] [OBO-Edit Working Group] displaying has_part In-Reply-To: References: Message-ID: Hi Karen, I think there might be another way to deal with this issue, that is to get a more precise definition of the 'major spliceosome' - one that more closely accords with what biologists are actually referring too. It seems to me, that the way you use the term is most consistent with it referring to a non-contiguous structure - by which I mean one whose parts are not all connected to each other. As these can be confusing, I think it's important to use such terms sparingly, but they are unavoidable in some cases. For example, we need terms for functionally defined systems in anatomy. Instances of many such systems - e.g.- the immune system or the endocrine system, are non-contiguous. Is this the case for what you refer to here as the 'major spliceosome'? If so, then, as far as I can tell, the original use of part_of in this case was not incorrect. Hope this helps, rather than just muddying the waters further, David On Aug 26 2009, Karen Christie wrote: >OK, I understand your explanation about "Why should "Queries for U2-type >spliceosomal complex should ***not*** return gene products localized >to U1 snRNP complex." But I think that is separate from the display >issue I have with has_part. > >However, it also brings up a different issue. A question that a >biologist might want to ask is "give me all the proteins in the major >spliceosome". They would expect to get the proteins present in the 5 >snRNPs (U1, U2, U4, U5, and U6). > By replacing part_of relations with >has_part, it seems that they might have difficulty getting a full >answer to that question. In your example where g1 is annotated only to >"U1 snRNP", you give the hypothetical result: > >> There are no genes with products known to be localized to 'U2-type >> spliceosomal complex'. >> However, every 'U2-type spliceosomal complex' has the following >> parts: >> a U1 snRNP -- genes: g1 >> a U2 snRNP -- genes: none > >However, if I wanted to get all proteins in the major spliceosome, I >don't want just gene products that are in EVERY 'U2-type spliceosomal >complex', I want gene products that are in ANY 'U2-type spliceosomal >complex'. What do we need to do to make this a question that biologists >get a reasonable answer to? > >I understand that GO needs to implement more rigorous logic, but if we >do it in a way that prevents biologists from getting reasonable >answers, we are not doing what we set out to do. > >-Karen > > > > >On Mon, 24 Aug 2009, Chris Mungall wrote: > >> >> On Aug 24, 2009, at 10:36 AM, Karen Christie wrote: >> >> [snipped part of dialog for now to focus on one issue] >> >>>> I would strongly advocate that even given sufficient developer hours >>>> it is better *not* to display the ontology in this way at all. It >>>> perhaps looks more comforting, but people will make the same >>>> comforting assumptions that no longer hold. For example, it looks like >>>> there is some kind of transitive relationship between U1 snRNP and >>>> U2-type spliceosomal complex ***which there is not***. It looks like >>>> the true path rule might hold. ***it does not*** . Queries for U2-type >>>> spliceosomal complex should ***not*** return gene products localized >>>> to U1 snRNP complex. >>> >>> Why should "Queries for U2-type spliceosomal complex should ***not*** >>> return gene products localized to U1 snRNP complex."? >> >> OK, good, I think we are circling on on the crux of the issue here. >> >> According to GO, it is not necessarily the case that a gp localized to >> a U1 snRNP is necessarily localized to a U2-type spliceosomal complex. >> For example, the gp may be localized to a particular U1 snRNP that is >> part of a penta-snRNP complex (I may not have chosen the best example as >> we're lacking annotations to this new term, but I can pull out an >> analogous example if you're not convinced) >> >> This can be seen in the sub-graph which I reproduce at the end of this >> email (let me know if this is not visible in some email programs, I can >> make a wiki page with this all on it) - there is no path following the >> arrows from U1 snRNP to U2-type spliceosomal complex. Note that >> attempting to show the graph in an "intuitive" way as I have attempted >> to do below, with the smaller entities at the bottom, is actually >> *misleading* because it leads one to assume that there is some inferred >> all-some relationship between these two terms when in fact there is not. >> >> Biology is complex and logic is hard. There's no escaping this. I don't >> believe we should simplify either to the point where we get false >> positives. I do think we need better ways of displaying this complex >> information, but I think we should focus resources on doing this in end >> user-facing tools rather than oboedit, as we would hope everyone using >> oboedit to edit the ontology would have an understanding of the logic or >> a willingness to learn. >> >> Here is a roughly sketched out example of how this could work >> >> Let's say PMID:123 describes an observation of a product of g1 being >> localized to a 'U1 snRNP' via an IDA. Let's say that's all we know, >> either due to the resolution of the assay or that's all that the >> annotator specified. The user queries for 'U2-type spliceosomal complex' >> >> The query result screen could show something like: >> >> There are no genes with products known to be localized to 'U2-type >> spliceosomal complex'. >> However, every 'U2-type spliceosomal complex' has the following >> parts: >> a U1 snRNP -- genes: g1 >> a U2 snRNP -- genes: none >> >> A more advanced tool would be able to show even more: >> >> PMID:123 shows g1 in U1 snRNP. >> prob('U2-type spliceosomal complex') = 0.83 >> prob('penta-snRNP complex') = 0.05 >> ... >> >> This leads the user to terms of relevance, shows what is known, shows >> what might be the case, and does not show anything that is false. >> >>> I would have >>> thought that they should. If I wanted to know the parts of "U2-type >>> spliceosomal complex" I would want to know all the things that compose >>> the series of complexes that are all considered to be a "U2-type >>> spliceosomal complex". >> >> I'm not sure I really understand the statement, it sounds tautological, >> I don't understand what adding "series of" adds. >> >> If the question is "what parts can be found in every U2-type >> spliceosomal complex" then the answer is found via the has_part relation >> and it's closure. >> >> However, this is a different question from "what gene products have been >> observed to be present in a U2-type spliceosomal complex >> >>> [Note that we need to revise the defs of "U2-type spliceosomal complex >>> ; GO:0005684" (and its sibling "U12-type spliceosomal complex ; >>> GO:0005689") to be consistent with the def of the parent term >>> "spliceosomal complex ; GO:0005681" and specify that these terms >>> represent series of complexes. I'll submit a SF item for this.] >> >> >> >> https://sourceforge.net/tracker/index.php?func=detail&aid=2843718&group_id=36855&atid=440764 >> >> I don't understand the motivation here. I think the definitions should >> employ a consistent style, but I would change the parent from: >> >> GO:0005681 ! spliceosomal complex [DEF: "Any of a series of >> ribonucleoprotein complexes that... >> >> to >> >> GO:0005681 ! spliceosomal complex [DEF: "A ribonucleoprotein complex >> that... >> >> I'm not sure how adding "any of a series of..." changes the meaning, it >> seems to just add extra verbiage that obfuscates the definition. >> >> >> > > > ------------------------------------------------------------------------------ > Let Crystal Reports handle the reporting - Free Crystal Reports 2008 > 30-Day trial. Simplify your report design, integration and deployment - > and focus on what you do best, core application coding. Discover what's > new with Crystal Reports now. http://p.sf.net/sfu/bobj-july > _______________________________________________ > Geneontology-oboedit-working-group mailing list > Geneontology-oboedit-working-group at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/geneontology-oboedit-working-group From midori at ebi.ac.uk Thu Sep 3 07:21:42 2009 From: midori at ebi.ac.uk (Midori Harris) Date: Thu, 3 Sep 2009 15:21:42 +0100 (BST) Subject: [Ontology-editors] filter obsoletes from ec2go? Message-ID: Hi Dan, We've had a SF item with a query about obsolete GO terms/IDs appearing in the ec2go file. I found an email from last year in which Chris said he prefers to keep the EC dbxrefs for obsolete terms (even though the set isn't complete because we'd removed some in the past). But I don't think obsoletes should get into the ec2go file, so can they be skipped/filtered out when ec2go is parsed from the GO .obo file? Thanks, m From jane at ebi.ac.uk Thu Sep 3 07:23:02 2009 From: jane at ebi.ac.uk (Jane Lomax) Date: Thu, 03 Sep 2009 15:23:02 +0100 Subject: [Ontology-editors] XP docs In-Reply-To: References: Message-ID: <4A9FD146.9030505@ebi.ac.uk> They look great to me! Jane Midori Harris wrote: > Chiming in late due to bank holiday yesterday ... I think the doc now > looks very good indeed. (Although "classes" would be more > ontologically correct, I heartily approve of sticking with "terms" in > this documentation, because so much of its target audience is > biologists; formal ontologists will mostly look elsewhere anyway.) > > m > > On Mon, 31 Aug 2009, Chris Mungall wrote: > >> >> Looks very good! >> >> Couple of comments: >>> If we use ontology terms in the genus and the differentia, we >>> can see that these logical definitions take the general form >>> >>> term relation term >>> >> The problem is we already use a triple of the form to denote >> >> all X R some Y >> >> I suggest that instead of >> >> term relation term >> >> You write >> >> term and relation term >> >> or >> >> term that relation term >> >> e.g. instead of >> >>> lysosomal membrane is membrane surrounding lysosome >>> >> >> we should have: >> >> lysosomal membrane is (a) membrane that surrounds (a) lysosome >> >> This is consistent with what you have above, and with the formal >> semantics. Also the actual relation used is 'surrounds' not >> 'surrounding'. Of course it's not hard to have a simple algorithm >> that translates a logical definition of the form >> >> lysosomal membrane = membrane that surrounds lysosome >> >> To a more user-friendly >> >> a lysosome membrane is a membrane surrounding a lysosome >> >> But this documentation should not be dependent on such an algorithm. >> The docs should probably not shy away from showing the relevant piece >> of the obo format stanza and include a translational table for going >> between the two. E.g. >> >> id: X >> >> intersection_of: G >> >> intersection_of: R1 Y1 >> >> intersection_of: Rn Yn >> >> <=> >> >> X = G that and ... and >> >> >> >> The docs use the terms 'term', 'concept' and 'category'. I'm not sure >> what the difference between these are. I think it would be simplest >> if we use 'class' throughout. Unfortunately the usage of 'term' for >> what are actually classes is very prevalent in GO. We could at least >> reduce it to term and class and eliminate concept and category. >> >> It may be better to use a species-neutral ontology like cell or chebi >> instead of the species-specific fly_anatomy. >> >> For the internal xp examples it would be good to have examples drawn >> from the xps that will go live first: the subset of bp_xp_cc that are >> simple ; regulation xps ; the subset of >> cc_xp_self that is . We could >> also have an example of how this is being used to maintain the GO. >> >> David and Amina are writing docs for OE users so we should be sure to >> coordinate. >> >> Thanks! >> >> >> >> On Aug 28, 2009, at 3:10 PM, Amelia Ireland wrote: >> >>> Hello people, >>> >>> I've drafted some XP docs for the GO website if you want to have a >>> look at them: >>> >>> http://geneontology.org/test-html.shtml >>> >>> If anyone has suggestions for what should go in that last paragraph, >>> they'd be gratefully received! >>> >>> Thanks, >>> Amelia. >>> >>> -- >>> Amelia Ireland >>> GO Editorial Office >>> http://www.berkeleybop.org || http://www.ebi.ac.uk >>> Boycott Trader Joe's Red List seafood: http://traitorjoe.com/ >>> >>> >>> >>> >>> >>> >>> >>> >>> _______________________________________________ >>> Ontology-editors mailing list >>> Ontology-editors at geneontology.org >>> http://fafner.stanford.edu/mailman/listinfo/ontology-editors >>> >> >> _______________________________________________ >> Ontology-editors mailing list >> Ontology-editors at geneontology.org >> http://fafner.stanford.edu/mailman/listinfo/ontology-editors > _______________________________________________ > Ontology-editors mailing list > Ontology-editors at geneontology.org > http://fafner.stanford.edu/mailman/listinfo/ontology-editors -- Dr Jane Lomax GO Editorial Office EMBL-EBI Wellcome Trust Genome Campus Hinxton Cambridgeshire, UK CB10 1SD p: +44 1223 492516 f: +44 1223 494468 From midori at ebi.ac.uk Thu Sep 3 07:53:16 2009 From: midori at ebi.ac.uk (Midori Harris) Date: Thu, 3 Sep 2009 15:53:16 +0100 (BST) Subject: [Ontology-editors] filter obsoletes from ec2go? (fwd) Message-ID: On a related note ... are obsoletes included or excluded in the *2go files generated by daily_from_obo.pl? (I'd vote for excluding, obviously.) m ---------- Forwarded message ---------- Date: Thu, 3 Sep 2009 15:21:42 +0100 (BST) From: Midori Harris To: Daniel Barrell Cc: Tony Sawford , Ontology Editors Subject: filter obsoletes from ec2go? Hi Dan, We've had a SF item with a query about obsolete GO terms/IDs appearing in the ec2go file. I found an email from last year in which Chris said he prefers to keep the EC dbxrefs for obsolete terms (even though the set isn't complete because we'd removed some in the past). But I don't think obsoletes should get into the ec2go file, so can they be skipped/filtered out when ec2go is parsed from the GO .obo file? Thanks, m From dbarrell at ebi.ac.uk Fri Sep 4 02:18:23 2009 From: dbarrell at ebi.ac.uk (Daniel Barrell) Date: Fri, 04 Sep 2009 10:18:23 +0100 Subject: [Ontology-editors] filter obsoletes from ec2go? In-Reply-To: References: Message-ID: <4AA0DB5F.5070500@ebi.ac.uk> Hi Midori, I've added this as a task for our next sprint. Cheers Dan Midori Harris wrote: > Hi Dan, > > We've had a SF item with a query about obsolete GO terms/IDs appearing > in the ec2go file. > > I found an email from last year in which Chris said he prefers to keep > the EC dbxrefs for obsolete terms (even though the set isn't complete > because we'd removed some in the past). But I don't think obsoletes > should get into the ec2go file, so can they be skipped/filtered out when > ec2go is parsed from the GO .obo file? > > Thanks, > m -- Daniel Barrell EMBL - The EBI Wellcome Trust Genome Campus Hinxton, Cambridge CB10 1SD Phone: +44 (0)1223 492551 Email: dbarrell at ebi.ac.uk From midori at ebi.ac.uk Fri Sep 4 02:20:19 2009 From: midori at ebi.ac.uk (Midori Harris) Date: Fri, 4 Sep 2009 10:20:19 +0100 (BST) Subject: [Ontology-editors] filter obsoletes from ec2go? In-Reply-To: <4AA0DB5F.5070500@ebi.ac.uk> References: <4AA0DB5F.5070500@ebi.ac.uk> Message-ID: Great; thanks! m On Fri, 4 Sep 2009, Daniel Barrell wrote: > Hi Midori, > > I've added this as a task for our next sprint. > > Cheers > > Dan > > Midori Harris wrote: >> Hi Dan, >> >> We've had a SF item with a query about obsolete GO terms/IDs appearing in >> the ec2go file. >> >> I found an email from last year in which Chris said he prefers to keep the >> EC dbxrefs for obsolete terms (even though the set isn't complete because >> we'd removed some in the past). But I don't think obsoletes should get into >> the ec2go file, so can they be skipped/filtered out when ec2go is parsed >> from the GO .obo file? >> >> Thanks, >> m > > From tberardi at acoma.stanford.edu Fri Sep 4 09:26:10 2009 From: tberardi at acoma.stanford.edu (Tanya Berardini) Date: Fri, 4 Sep 2009 09:26:10 -0700 Subject: [Ontology-editors] CC organization just won't go away (q about catabolism) In-Reply-To: References: Message-ID: <8e22ab960909040926v6be293eem30c92ba69f7f7c8b@mail.gmail.com> replied in SF item On Fri, Aug 21, 2009 at 8:36 AM, Midori Harris wrote: > Hi fellow ontology developers, > > This isn't urgent, but we should think about it ... in the CC organization > work, we briefly considered whether to link catabolism and disassembly. > >> From the notes we seemed to be leaning away from linking, but I'm not sure >> >> > it was quite completely resolved. > > > https://sourceforge.net/tracker/?func=detail&aid=2841836&group_id=36855&atid=440764 > http://wiki.geneontology.org/index.php/Cellular_component_processes > > m > _______________________________________________ > Ontology-editors mailing list > Ontology-editors at geneontology.org > http://fafner.stanford.edu/mailman/listinfo/ontology-editors > -- Tanya Berardini TAIR Curator www.arabidopsis.org -------------- next part -------------- An HTML attachment was scrubbed... URL: From midori at ebi.ac.uk Mon Sep 7 05:25:41 2009 From: midori at ebi.ac.uk (Midori Harris) Date: Mon, 7 Sep 2009 13:25:41 +0100 (BST) Subject: [Ontology-editors] CC organization just won't go away (q about catabolism) In-Reply-To: <8e22ab960909040926v6be293eem30c92ba69f7f7c8b@mail.gmail.com> References: <8e22ab960909040926v6be293eem30c92ba69f7f7c8b@mail.gmail.com> Message-ID: There's now a related one, SF 2853470: https://sourceforge.net/tracker/?func=detail&aid=2853470&group_id=36855&atid=440764 m On Fri, 4 Sep 2009, Tanya Berardini wrote: > replied in SF item > > On Fri, Aug 21, 2009 at 8:36 AM, Midori Harris wrote: > >> Hi fellow ontology developers, >> >> This isn't urgent, but we should think about it ... in the CC organization >> work, we briefly considered whether to link catabolism and disassembly. >> >>> From the notes we seemed to be leaning away from linking, but I'm not sure >>> >>> >> it was quite completely resolved. >> >> >> https://sourceforge.net/tracker/?func=detail&aid=2841836&group_id=36855&atid=440764 >> http://wiki.geneontology.org/index.php/Cellular_component_processes >> >> m >> _______________________________________________ >> Ontology-editors mailing list >> Ontology-editors at geneontology.org >> http://fafner.stanford.edu/mailman/listinfo/ontology-editors >> > > > > From jdeegan at ebi.ac.uk Thu Sep 10 01:54:00 2009 From: jdeegan at ebi.ac.uk (Jennifer Deegan (nee Clark)) Date: Thu, 10 Sep 2009 09:54:00 +0100 Subject: [Ontology-editors] Neurobiology Message-ID: <4AA8BEA8.4010308@ebi.ac.uk> Hi, Attached is my first draft neurobiology poster to take the meeting: Cambridge Clinical Neuroscience and Mental Health Symposium, 29th - 30th September. I have checked with the author of the paper that I use as an example of GO analysis and he is happy for his work to be presented like this. Does anybody have any comments on it before I get it printed? I will check the image resolution with the print shop people here. Thanks, Jennifer -------------- next part -------------- A non-text attachment was scrubbed... Name: Poster.pdf Type: application/pdf Size: 623070 bytes Desc: not available URL: From midori at ebi.ac.uk Thu Sep 10 02:53:43 2009 From: midori at ebi.ac.uk (Midori Harris) Date: Thu, 10 Sep 2009 10:53:43 +0100 (BST) Subject: [Ontology-editors] Neurobiology In-Reply-To: <4AA8BEA8.4010308@ebi.ac.uk> References: <4AA8BEA8.4010308@ebi.ac.uk> Message-ID: A bunch of comments ... One thing I'd suggest is a title change, perhaps to "Using the Gene Ontology for large-scale analysis of nervous system function". That would help avoid giving the impression that GO *only* deals with genetic interactions, and get the audience's pet topic right there up front. In the 'specific problem' paragraph, change 'upregulated' to 'differentially regulated'; downregulation can be just as interesting as up. In 'ontologies' - missing '.' after 'carry out' 'Ontology slimming' - I'm trying to think of an alternative to 'slim up the annotations', because 'GO slim' is our jargon, and probably won't be at all familiar to an audience that's new to GO (or ontologies generally) anyway. Maybe 'transfer annotations to more general terms' or 'collect annotations ...'? My point is that 'slimming' has to be explained. 'GO slims and microarrays' - Delete the last sentence; read on for suggestions that would make it superfluous. Optional: could change 'which processes are upregulated or downregulated' to 'which processes are affected'. 'Use of GO in neurobiology' and 'enrichment analysis' - - Might be good to merge these into one panel (perhaps with sub-panels like the ontology diagram A and B) to make it more clear that they deal with the same paper & experiments. - Maybe change the heading to 'GO analysis of a large-scale neurobiology experiment'? - Change the summary to 'This paper illustrates how GO can be used to analyse the effect of a chemical treatment on the central nervous system:'. The fact that the group is in Japan (a) will probably be obvious from the authors' names and (b) is totally irrelevant to the point that GO is really useful for studying neurobiology. - Delete the affiliations and email addresses; the title, authors, and journal citation are enough for anyone who's interested to find the paper. - The GO terms that came up in the analysis get lost in all the text. The figure is good, and you can probably use it more effectively if you have less text. I suggest (a) delete the text under 'enrichment analysis'; (b) instead, make the figure bigger and add small snippets of text and perhaps arrows or other graphical cues to draw attention to the GO terms. 'How well is neurobiology represented ...' - Graphical views might be more intuitive than OTE tree views, even if it means you have room for fewer examples. (I'm guessing, tho ... hard to read people's minds!) Maybe also add a link to AmiGO and a hint on how to search for neuro terms? cheers, m On Thu, 10 Sep 2009, Jennifer Deegan (nee Clark) wrote: > Hi, > > Attached is my first draft neurobiology poster to take the meeting: Cambridge > Clinical Neuroscience and Mental Health Symposium, 29th - 30th September. > > I have checked with the author of the paper that I use as an example of GO > analysis and he is happy for his work to be presented like this. > > Does anybody have any comments on it before I get it printed? I will check > the image resolution with the print shop people here. > > Thanks, > > Jennifer > From jane at ebi.ac.uk Thu Sep 10 06:48:22 2009 From: jane at ebi.ac.uk (Jane Lomax) Date: Thu, 10 Sep 2009 14:48:22 +0100 Subject: [Ontology-editors] cross-products demo Message-ID: <4AA903A6.8010706@ebi.ac.uk> Hi all ontology editors - as you've probably heard, we're hoping to put the first set of cross-products into the extended GO file very soon. This means that soon anyone editing the ontologies will also have to edit cross-products. The initial set will be very straightforward - they'll be for the regulation terms - but as time goes on, the cross-products we add will become more complicated. So we thought it would be a good idea for everyone who edits the ontologies to come to a training session which Chris has kindly 'agreed' to run about cross-products and how to edit them. I've made a Doodle poll so we can choose a date the maximum number of people can make: http://www.doodle.com/bfnug7imd5s7qg2h thanks, Jane -- Dr Jane Lomax GO Editorial Office EMBL-EBI Wellcome Trust Genome Campus Hinxton Cambridgeshire, UK CB10 1SD p: +44 1223 492516 f: +44 1223 494468 From tberardi at acoma.stanford.edu Thu Sep 10 09:34:15 2009 From: tberardi at acoma.stanford.edu (Tanya Berardini) Date: Thu, 10 Sep 2009 09:34:15 -0700 Subject: [Ontology-editors] cross-products demo In-Reply-To: <4AA903A6.8010706@ebi.ac.uk> References: <4AA903A6.8010706@ebi.ac.uk> Message-ID: <8e22ab960909100934g6a660355j6999830f890d3501@mail.gmail.com> How about doing the training session during the GO meeting in Cambridge? Shouldn't take too long and it will minimize Ye Olde Time Zone Problem quite significantly. Tanya On Thu, Sep 10, 2009 at 6:48 AM, Jane Lomax wrote: > Hi all ontology editors - as you've probably heard, we're hoping to put the > first set of cross-products into the extended GO file very soon. This means > that soon anyone editing the ontologies will also have to edit > cross-products. > > The initial set will be very straightforward - they'll be for the > regulation terms - but as time goes on, the cross-products we add will > become more complicated. > > So we thought it would be a good idea for everyone who edits the ontologies > to come to a training session which Chris has kindly 'agreed' to run about > cross-products and how to edit them. > > I've made a Doodle poll so we can choose a date the maximum number of > people can make: > > http://www.doodle.com/bfnug7imd5s7qg2h > > thanks, > > Jane > > -- > Dr Jane Lomax > GO Editorial Office > EMBL-EBI > Wellcome Trust Genome Campus > Hinxton > Cambridgeshire, UK > CB10 1SD > > p: +44 1223 492516 > f: +44 1223 494468 > > _______________________________________________ > Ontology-editors mailing list > Ontology-editors at geneontology.org > http://fafner.stanford.edu/mailman/listinfo/ontology-editors > -- Tanya Berardini TAIR Curator www.arabidopsis.org -------------- next part -------------- An HTML attachment was scrubbed... URL: From aabdulla at berkeleybop.org Thu Sep 10 14:25:34 2009 From: aabdulla at berkeleybop.org (Amina Abdulla) Date: Thu, 10 Sep 2009 14:25:34 -0700 Subject: [Ontology-editors] cross-products demo In-Reply-To: <4AA903A6.8010706@ebi.ac.uk> References: <4AA903A6.8010706@ebi.ac.uk> Message-ID: <4AA96ECE.8020102@berkeleybop.org> Hi All, I'd like to suggest holding the xps demo during the OBO-Edit Working Group Meeting - maybe next week - Tuesday September 15th 8:30/9:30 am PDT whichever works best for everyone. Thanks, Amina On 9/10/09 6:48 AM, Jane Lomax wrote: > Hi all ontology editors - as you've probably heard, we're hoping to > put the first set of cross-products into the extended GO file very > soon. This means that soon anyone editing the ontologies will also > have to edit cross-products. > > The initial set will be very straightforward - they'll be for the > regulation terms - but as time goes on, the cross-products we add will > become more complicated. > > So we thought it would be a good idea for everyone who edits the > ontologies to come to a training session which Chris has kindly > 'agreed' to run about cross-products and how to edit them. > > I've made a Doodle poll so we can choose a date the maximum number of > people can make: > > http://www.doodle.com/bfnug7imd5s7qg2h > > thanks, > > Jane > From jdeegan at ebi.ac.uk Fri Sep 11 01:48:14 2009 From: jdeegan at ebi.ac.uk (Jennifer Deegan (nee Clark)) Date: Fri, 11 Sep 2009 09:48:14 +0100 Subject: [Ontology-editors] cross-products demo In-Reply-To: <4AA96ECE.8020102@berkeleybop.org> References: <4AA903A6.8010706@ebi.ac.uk> <4AA96ECE.8020102@berkeleybop.org> Message-ID: <4AAA0ECE.2080400@ebi.ac.uk> Hi Amina, I won't be able to attend the meeting next week as I am on a course, but I am pretty clear on editing cross products if you would like to go ahead without me. Jen Amina Abdulla wrote: > Hi All, > I'd like to suggest holding the xps demo during the OBO-Edit Working > Group Meeting - maybe next week - Tuesday September 15th 8:30/9:30 am > PDT whichever works best for everyone. > > Thanks, > Amina > > > On 9/10/09 6:48 AM, Jane Lomax wrote: >> Hi all ontology editors - as you've probably heard, we're hoping to >> put the first set of cross-products into the extended GO file very >> soon. This means that soon anyone editing the ontologies will also >> have to edit cross-products. >> >> The initial set will be very straightforward - they'll be for the >> regulation terms - but as time goes on, the cross-products we add will >> become more complicated. >> >> So we thought it would be a good idea for everyone who edits the >> ontologies to come to a training session which Chris has kindly >> 'agreed' to run about cross-products and how to edit them. >> >> I've made a Doodle poll so we can choose a date the maximum number of >> people can make: >> >> http://www.doodle.com/bfnug7imd5s7qg2h >> >> thanks, >> >> Jane >> > > _______________________________________________ > Ontology-editors mailing list > Ontology-editors at geneontology.org > http://fafner.stanford.edu/mailman/listinfo/ontology-editors From dph at informatics.jax.org Fri Sep 11 04:14:35 2009 From: dph at informatics.jax.org (David Hill) Date: Fri, 11 Sep 2009 07:14:35 -0400 Subject: [Ontology-editors] cross-products demo In-Reply-To: <4AA96ECE.8020102@berkeleybop.org> References: <4AA903A6.8010706@ebi.ac.uk> <4AA96ECE.8020102@berkeleybop.org> Message-ID: <4AAA311B.8020001@informatics.jax.org> I think that is a good idea. We thought it wouldn't really be appropriate to do it at the consortium meeting because there are so many people there who don't edit the ontology. David Amina Abdulla wrote: > Hi All, > I'd like to suggest holding the xps demo during the OBO-Edit Working > Group Meeting - maybe next week - Tuesday September 15th 8:30/9:30 am > PDT whichever works best for everyone. > > Thanks, > Amina > > > On 9/10/09 6:48 AM, Jane Lomax wrote: >> Hi all ontology editors - as you've probably heard, we're hoping to >> put the first set of cross-products into the extended GO file very >> soon. This means that soon anyone editing the ontologies will also >> have to edit cross-products. >> >> The initial set will be very straightforward - they'll be for the >> regulation terms - but as time goes on, the cross-products we add >> will become more complicated. >> >> So we thought it would be a good idea for everyone who edits the >> ontologies to come to a training session which Chris has kindly >> 'agreed' to run about cross-products and how to edit them. >> >> I've made a Doodle poll so we can choose a date the maximum number of >> people can make: >> >> http://www.doodle.com/bfnug7imd5s7qg2h >> >> thanks, >> >> Jane >> > > _______________________________________________ > Ontology-editors mailing list > Ontology-editors at geneontology.org > http://fafner.stanford.edu/mailman/listinfo/ontology-editors -- David P. Hill, Ph.D. Bioinformatics Scientist: Ontology Development Gene Ontology Consortium The Jackson Laboratory www.geneontology.org www.informatics.jax.org tel:207-288-6430 From cjm at berkeleybop.org Mon Sep 14 17:42:52 2009 From: cjm at berkeleybop.org (Chris Mungall) Date: Mon, 14 Sep 2009 17:42:52 -0700 Subject: [Ontology-editors] EGF stimulus References: <9BA97BC21AF8BB4098EC0685A900614F013B966D@npg-ny-exc1.npg.root-domain.org> Message-ID: <5DFCE03A-DE88-49FD-8567-CA9E91C33CC5@berkeleybop.org> should there not be some relationship between EGFR signaling and response to EGF? > From: "Anthony, Kira" > Date: September 14, 2009 3:40:05 PM PDT > To: > Subject: [Obo-discuss] FW: the "Foundry ontologies" > Reply-To: obo-discuss at lists.sourceforge.net > > I just had a quick look through some of the discussions below on the > Pathway Ontology. Having worked with this ontology a couple of > things come to mind: > 1. Re Alan's query regarding an aspect that distinguishes between > pathways and processes: in my mind something that ought to be > included in the definitions for signaling and regulatory pathways is > mention of the initiating signal itself. GO provides terms for the > downstream processes associated with the signal (e.g. "response to > epidermal growth factor stimulus"), but not the signal itself. And > since a response to a signal is not necessarily equal to the signal > (i.e. responses to signals can vary by cell type, cell stage, and so > on) it would be a rather sensible approach when defining a pathway > to state what the signal is (or simply that one exists), stop there > with signal discussion and go on to further define the pathway via > its molecular events, downstream processes, cell types and so on. By > extension, here you could have PO terms and GO terms working > together: a simple example is that PO term "Tumor necrosis factor.. > pathway" leads to GO term "apoptosis". > > 2. I haven't looked through the discussion below in enough detail to > see whether this point is directly relevant, but I thought I'd > mention that what we like about the Pathway Ontology is that it > contains a comprehensive set of signaling and regulatory pathway > terms, some of which are not covered in broader ontologies such as > GO. In addition, the Pathway Ontology has contained all of the term > relationships that we've searched on. In contrast, with larger > ontologies such as, again, GO, possibly because of the vast amount > of biology that GO is covering, there are examples of similar terms > such as "response to epidermal growth factor stimulus" and > "epidermal growth factor receptor signaling pathway" that seem to be > unconnected (or less connected than they ought to be) in the GO > graph. Of course, these comments don't discount the value that GO > provides to the community. Instead I wonder if they point to areas > where ontologies that are more concentrated in particular subject > areas can add value and/or find synergies with GO..? > > Kind regards, > > Kira > > Kira Anthony > NCI-Nature Pathway Interaction Database > Nature Publishing Group > URLs: http://pid.nci.nih.gov & http://www.nature.com/databases/ > > -----Original Message----- > From: Alan Ruttenberg [mailto:alanruttenberg at gmail.com] > Sent: Monday, September 14, 2009 2:45 PM > To: Shimoyama, Mary > Cc: obo-discuss at lists.sourceforge.net > Subject: Re: [Obo-discuss] the "Foundry ontologies" > > On Mon, Sep 14, 2009 at 1:22 PM, Shimoyama, Mary > wrote: > > I believe there are a number of issues - GO has stated in the past > that they do not represent pathways yet have traditionally had > pathway terms and have extended some processes to be pathways. To > me there are differences - a pathway often includes multiple > processes and we have discussed at RGD, how we can link to the > multiple GO processes presented in a pathway. For us, there is a > finite set of members within a pathway with defined processes. > > How do you define: "member of a pathway"? > > >We are not representing processes which can occur among many types > of molecules under many circumstances. > > I'm not sure what you mean. Do you mean that the names in the pathway > ontology are intended to denote processes that occur only in, say, > mammals? If that was the case then that should be explicit. > > > The names of processes and pathways is one issue, but also at > issue is the extent of what is considered an individual process and > what is a pathway. This debate has gone on within GO itself. > > Perhaps. One of the goals of ontologies being in the OBO Foundry is > that they have at least plausible answers to questions such as this. > One of the things that we've had some better articulation of is what > it means to be a process. For example, a collection of processes, > summed together is also a process. And parts of processes are also > processes. So the idea counting "how many processes" isn't > particularly well defined. Instead, I think the goal is to describe > what the participants are, and what the boundaries of the process are. > > I'd really like to get your view of the example that Suzi mentioned. > Is it or is it not the case that the instances of PW:0000012 are > exactly the sum of instances of GO:0009166 and GO:0009166 . If it > isn't then what is the difference? If it is, how do you think this > should be handled wrt the stated Foundry principle of avoiding > redundancy (sometimes called achieving orthogonality)? Is the question > (framed in terms of instances) even a question that fits with how the > pathway ontology is conceived? > > I should perhaps emphasize that there ought to be different ways to > gain recognition of quality work - the Foundry isn't attempting to be > the clearing house for that. Rather, it is a more specific effort to > create a set of ontologies that work together in a certain way. So the > questions I ask here should not be construed in any way to evaluating > quality. Rather they are intended to gain an understanding of how the > pathway ontology is intended to work, to see whether that is > consistent with the current view of how the Foundry ontologies can > work together, and to see, if it is not, whether there is interest in > proceeding down a road towards a state in which they can. > > Regards, > Alan > > > > > Mary Shimoyama > > Program Manager > > Rat Genome Database > > Human and Molecular Genetics Center > > Medical College of Wisconsin > > shimoyama at mcw.edu > > Tel: 414-456-7505 > > Fax: 414-456-6516 > > http://rgd.mcw.edu > > > > > > -----Original Message----- > > From: Alan Ruttenberg [mailto:alanruttenberg at gmail.com] > > Sent: Monday, September 14, 2009 12:11 PM > > To: Shimoyama, Mary > > Subject: Re: [Obo-discuss] the "Foundry ontologies" > > > > On Mon, Sep 14, 2009 at 12:04 PM, Shimoyama, Mary > wrote: > >> Hi Alan - I am not sure what face-to-face discussions Suzi is > referring to. When you were here, we only talked about the > phenotype work I am doing and not the pathway ontology - as far as I > can remember. > > > > I may have been too informal - I've also discussed it with Victoria. > > And this was before I had any specific role in the foundry, I > suspect. > > But let's start fresh and ignore that bit. The question on the > table, > > it would seem to me, would be whether you understand and agree with > > the issue that is raised. > > > > I'll respond with something to this effect to the main list. > > > > -Alan > > > >> > >> Mary Shimoyama > >> Program Manager > >> Rat Genome Database > >> Human and Molecular Genetics Center > >> Medical College of Wisconsin > >> shimoyama at mcw.edu > >> Tel: 414-456-7505 > >> Fax: 414-456-6516 > >> http://rgd.mcw.edu > >> > >> > >> -----Original Message----- > >> From: Alan Ruttenberg [mailto:alanruttenberg at gmail.com] > >> Sent: Monday, September 14, 2009 11:00 AM > >> To: Shimoyama, Mary > >> Subject: Re: [Obo-discuss] the "Foundry ontologies" > >> > >> Hi Mary, > >> > >> Suzi followed up a bit about the pathway ontology, and there's > >> obviously more to say. But I saw this bit and wanted to follow up > with > >> you for a little fact-finding. To what discussions are you > referring > >> to below? > >> Happy to discuss anything at all further with you too, though I'm > >> about halfway through a month of hellish scheduling, which is why > >> response (on my side) has been a bit slow, > >> > >> Regards, > >> Alan > >> > >> > >> On Wed, Sep 9, 2009 at 9:12 AM, Shimoyama, Mary > wrote: > >>> For those who heard the sneers at an ICBO at such successful > projects as GO > >>> or SO as not being "true" ontologies there seems to be little > hope for the > >>> rest of us to have our efforts recognized. > >> > > > > ------------------------------------------------------------------------------ > Let Crystal Reports handle the reporting - Free Crystal Reports 2008 > 30-Day > trial. Simplify your report design, integration and deployment - and > focus on > what you do best, core application coding. Discover what's new with > Crystal Reports now. http://p.sf.net/sfu/bobj-july > _______________________________________________ > Obo-discuss mailing list > Obo-discuss at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/obo-discuss > > ------------------------------------------------------------------------------ > Come build with us! The BlackBerry® Developer Conference in SF, CA > is the only developer event you need to attend this year. Jumpstart > your > developing skills, take BlackBerry mobile applications to market and > stay > ahead of the curve. Join us from November 9-12, 2009. Register > now! > http://p.sf.net/sfu/devconf_______________________________________________ > Obo-discuss mailing list > Obo-discuss at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/obo-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: pastedGraphic.tiff Type: image/tiff Size: 5098554 bytes Desc: not available URL: From cjm at berkeleybop.org Mon Sep 14 18:03:25 2009 From: cjm at berkeleybop.org (Chris Mungall) Date: Mon, 14 Sep 2009 18:03:25 -0700 Subject: [Ontology-editors] cross-products demo In-Reply-To: <4AAA311B.8020001@informatics.jax.org> References: <4AA903A6.8010706@ebi.ac.uk> <4AA96ECE.8020102@berkeleybop.org> <4AAA311B.8020001@informatics.jax.org> Message-ID: I think it's more appropriate to do it at the meeting. Most people who attend OEWGs understand the cross-products. The goal is to disseminate this wider, to members of this group who will be editing the ontology, and to the wider group who use oboedit for browsing as part of annotation etc On Sep 11, 2009, at 4:14 AM, David Hill wrote: > I think that is a good idea. We thought it wouldn't really be > appropriate to do it at the consortium meeting because there are so > many people there who don't edit the ontology. > > David > > Amina Abdulla wrote: >> Hi All, >> I'd like to suggest holding the xps demo during the OBO-Edit >> Working Group Meeting - maybe next week - Tuesday September 15th >> 8:30/9:30 am PDT whichever works best for everyone. >> >> Thanks, >> Amina >> >> >> On 9/10/09 6:48 AM, Jane Lomax wrote: >>> Hi all ontology editors - as you've probably heard, we're hoping >>> to put the first set of cross-products into the extended GO file >>> very soon. This means that soon anyone editing the ontologies will >>> also have to edit cross-products. >>> >>> The initial set will be very straightforward - they'll be for the >>> regulation terms - but as time goes on, the cross-products we add >>> will become more complicated. >>> >>> So we thought it would be a good idea for everyone who edits the >>> ontologies to come to a training session which Chris has kindly >>> 'agreed' to run about cross-products and how to edit them. >>> >>> I've made a Doodle poll so we can choose a date the maximum number >>> of people can make: >>> >>> http://www.doodle.com/bfnug7imd5s7qg2h >>> >>> thanks, >>> >>> Jane >>> >> >> _______________________________________________ >> Ontology-editors mailing list >> Ontology-editors at geneontology.org >> http://fafner.stanford.edu/mailman/listinfo/ontology-editors > > -- > David P. Hill, Ph.D. > Bioinformatics Scientist: Ontology Development > Gene Ontology Consortium > The Jackson Laboratory > www.geneontology.org > www.informatics.jax.org > tel:207-288-6430 > > _______________________________________________ > Ontology-editors mailing list > Ontology-editors at geneontology.org > http://fafner.stanford.edu/mailman/listinfo/ontology-editors > From adiehl at informatics.jax.org Mon Sep 14 18:11:36 2009 From: adiehl at informatics.jax.org (Alexander Diehl) Date: Mon, 14 Sep 2009 21:11:36 -0400 Subject: [Ontology-editors] cross-products demo In-Reply-To: References: <4AA903A6.8010706@ebi.ac.uk> <4AA96ECE.8020102@berkeleybop.org> <4AAA311B.8020001@informatics.jax.org> Message-ID: <4AAEE9C8.4050009@informatics.jax.org> If the demo is at the meeting, it should preferably be held at a time when folks on both coasts of the US can attend, which effectively mean late afternoon in Cambridge. Otherwise it can be held at a more convenient time prior to the meeting. -- Alex Chris Mungall wrote: > > I think it's more appropriate to do it at the meeting. Most people who > attend OEWGs understand the cross-products. The goal is to disseminate > this wider, to members of this group who will be editing the ontology, > and to the wider group who use oboedit for browsing as part of > annotation etc > > On Sep 11, 2009, at 4:14 AM, David Hill wrote: > >> I think that is a good idea. We thought it wouldn't really be >> appropriate to do it at the consortium meeting because there are so >> many people there who don't edit the ontology. >> >> David >> >> Amina Abdulla wrote: >>> Hi All, >>> I'd like to suggest holding the xps demo during the OBO-Edit Working >>> Group Meeting - maybe next week - Tuesday September 15th 8:30/9:30 >>> am PDT whichever works best for everyone. >>> >>> Thanks, >>> Amina >>> >>> >>> On 9/10/09 6:48 AM, Jane Lomax wrote: >>>> Hi all ontology editors - as you've probably heard, we're hoping to >>>> put the first set of cross-products into the extended GO file very >>>> soon. This means that soon anyone editing the ontologies will also >>>> have to edit cross-products. >>>> >>>> The initial set will be very straightforward - they'll be for the >>>> regulation terms - but as time goes on, the cross-products we add >>>> will become more complicated. >>>> >>>> So we thought it would be a good idea for everyone who edits the >>>> ontologies to come to a training session which Chris has kindly >>>> 'agreed' to run about cross-products and how to edit them. >>>> >>>> I've made a Doodle poll so we can choose a date the maximum number >>>> of people can make: >>>> >>>> http://www.doodle.com/bfnug7imd5s7qg2h >>>> >>>> thanks, >>>> >>>> Jane >>>> >>> >>> _______________________________________________ >>> Ontology-editors mailing list >>> Ontology-editors at geneontology.org >>> http://fafner.stanford.edu/mailman/listinfo/ontology-editors >> >> -- >> David P. Hill, Ph.D. >> Bioinformatics Scientist: Ontology Development >> Gene Ontology Consortium >> The Jackson Laboratory >> www.geneontology.org >> www.informatics.jax.org >> tel:207-288-6430 >> >> _______________________________________________ >> Ontology-editors mailing list >> Ontology-editors at geneontology.org >> http://fafner.stanford.edu/mailman/listinfo/ontology-editors >> > > _______________________________________________ > Ontology-editors mailing list > Ontology-editors at geneontology.org > http://fafner.stanford.edu/mailman/listinfo/ontology-editors -- Alexander D. Diehl, Ph.D. Senior Scientific Curator Mouse Genome Informatics The Jackson Laboratory 600 Main Street Bar Harbor, ME 04609 email: adiehl at informatics.jax.org work: +1 (207) 288-6427 fax: +1 (207) 288-6131 From jane at ebi.ac.uk Tue Sep 15 05:28:45 2009 From: jane at ebi.ac.uk (Jane Lomax) Date: Tue, 15 Sep 2009 13:28:45 +0100 (BST) Subject: [Ontology-editors] cross-products demo In-Reply-To: References: <4AA903A6.8010706@ebi.ac.uk> <4AA96ECE.8020102@berkeleybop.org> <4AAA311B.8020001@informatics.jax.org> Message-ID: I'm confused now - do you mean the OEWG meeting or the GOC meeting, Chris? Jane On Mon, 14 Sep 2009, Chris Mungall wrote: > > I think it's more appropriate to do it at the meeting. Most people who attend > OEWGs understand the cross-products. The goal is to disseminate this wider, > to members of this group who will be editing the ontology, and to the wider > group who use oboedit for browsing as part of annotation etc > > On Sep 11, 2009, at 4:14 AM, David Hill wrote: > >> I think that is a good idea. We thought it wouldn't really be appropriate >> to do it at the consortium meeting because there are so many people there >> who don't edit the ontology. >> >> David >> >> Amina Abdulla wrote: >>> Hi All, >>> I'd like to suggest holding the xps demo during the OBO-Edit Working Group >>> Meeting - maybe next week - Tuesday September 15th 8:30/9:30 am PDT >>> whichever works best for everyone. >>> >>> Thanks, >>> Amina >>> >>> >>> On 9/10/09 6:48 AM, Jane Lomax wrote: >>>> Hi all ontology editors - as you've probably heard, we're hoping to put >>>> the first set of cross-products into the extended GO file very soon. This >>>> means that soon anyone editing the ontologies will also have to edit >>>> cross-products. >>>> >>>> The initial set will be very straightforward - they'll be for the >>>> regulation terms - but as time goes on, the cross-products we add will >>>> become more complicated. >>>> >>>> So we thought it would be a good idea for everyone who edits the >>>> ontologies to come to a training session which Chris has kindly 'agreed' >>>> to run about cross-products and how to edit them. >>>> >>>> I've made a Doodle poll so we can choose a date the maximum number of >>>> people can make: >>>> >>>> http://www.doodle.com/bfnug7imd5s7qg2h >>>> >>>> thanks, >>>> >>>> Jane >>>> >>> >>> _______________________________________________ >>> Ontology-editors mailing list >>> Ontology-editors at geneontology.org >>> http://fafner.stanford.edu/mailman/listinfo/ontology-editors >> >> -- >> David P. Hill, Ph.D. >> Bioinformatics Scientist: Ontology Development >> Gene Ontology Consortium >> The Jackson Laboratory >> www.geneontology.org >> www.informatics.jax.org >> tel:207-288-6430 >> >> _______________________________________________ >> Ontology-editors mailing list >> Ontology-editors at geneontology.org >> http://fafner.stanford.edu/mailman/listinfo/ontology-editors >> > > _______________________________________________ > Ontology-editors mailing list > Ontology-editors at geneontology.org > http://fafner.stanford.edu/mailman/listinfo/ontology-editors -- Dr Jane Lomax GO Editorial Office EMBL-EBI Wellcome Trust Genome Campus Hinxton Cambridgeshire, UK CB10 1SD p: +44 1223 492516 f: +44 1223 494468 From midori at ebi.ac.uk Wed Sep 16 02:13:58 2009 From: midori at ebi.ac.uk (Midori Harris) Date: Wed, 16 Sep 2009 10:13:58 +0100 (BST) Subject: [Ontology-editors] EGF stimulus In-Reply-To: <5DFCE03A-DE88-49FD-8567-CA9E91C33CC5@berkeleybop.org> References: <9BA97BC21AF8BB4098EC0685A900614F013B966D@npg-ny-exc1.npg.root-domain.org> <5DFCE03A-DE88-49FD-8567-CA9E91C33CC5@berkeleybop.org> Message-ID: Probably yes; we've added analogous links for other stimuli. The general question is open and under consideration by the signaling wg. https://sourceforge.net/tracker/?func=detail&aid=2679671&group_id=36855&atid=440764 https://sourceforge.net/tracker/?func=detail&aid=2665559&group_id=36855&atid=440764 On Mon, 14 Sep 2009, Chris Mungall wrote: > > should there not be some relationship between EGFR signaling and response to > EGF? > > > > > >> From: "Anthony, Kira" >> Date: September 14, 2009 3:40:05 PM PDT >> To: >> Subject: [Obo-discuss] FW: the "Foundry ontologies" >> Reply-To: obo-discuss at lists.sourceforge.net >> >> I just had a quick look through some of the discussions below on the >> Pathway Ontology. Having worked with this ontology a couple of things come >> to mind: >> 1. Re Alan's query regarding an aspect that distinguishes between pathways >> and processes: in my mind something that ought to be included in the >> definitions for signaling and regulatory pathways is mention of the >> initiating signal itself. GO provides terms for the downstream processes >> associated with the signal (e.g. "response to epidermal growth factor >> stimulus"), but not the signal itself. And since a response to a signal is >> not necessarily equal to the signal (i.e. responses to signals can vary by >> cell type, cell stage, and so on) it would be a rather sensible approach >> when defining a pathway to state what the signal is (or simply that one >> exists), stop there with signal discussion and go on to further define the >> pathway via its molecular events, downstream processes, cell types and so >> on. By extension, here you could have PO terms and GO terms working >> together: a simple example is that PO term "Tumor necrosis factor.. >> pathway" leads to GO term "apoptosis". >> >> 2. I haven't looked through the discussion below in enough detail to see >> whether this point is directly relevant, but I thought I'd mention that >> what we like about the Pathway Ontology is that it contains a comprehensive >> set of signaling and regulatory pathway terms, some of which are not >> covered in broader ontologies such as GO. In addition, the Pathway Ontology >> has contained all of the term relationships that we've searched on. In >> contrast, with larger ontologies such as, again, GO, possibly because of >> the vast amount of biology that GO is covering, there are examples of >> similar terms such as "response to epidermal growth factor stimulus" and >> "epidermal growth factor receptor signaling pathway" that seem to be >> unconnected (or less connected than they ought to be) in the GO graph. Of >> course, these comments don't discount the value that GO provides to the >> community. Instead I wonder if they point to areas where ontologies that >> are more concentrated in particular subject areas can add value and/or find >> synergies with GO..? >> >> Kind regards, >> >> Kira >> >> Kira Anthony >> NCI-Nature Pathway Interaction Database >> Nature Publishing Group >> URLs: http://pid.nci.nih.gov & http://www.nature.com/databases/ >> >> -----Original Message----- >> From: Alan Ruttenberg [mailto:alanruttenberg at gmail.com] >> Sent: Monday, September 14, 2009 2:45 PM >> To: Shimoyama, Mary >> Cc: obo-discuss at lists.sourceforge.net >> Subject: Re: [Obo-discuss] the "Foundry ontologies" >> >> On Mon, Sep 14, 2009 at 1:22 PM, Shimoyama, Mary wrote: >>> I believe there are a number of issues - GO has stated in the past that >>> they do not represent pathways yet have traditionally had pathway terms >>> and have extended some processes to be pathways. To me there are >>> differences - a pathway often includes multiple processes and we have >>> discussed at RGD, how we can link to the multiple GO processes presented >>> in a pathway. For us, there is a finite set of members within a pathway >>> with defined processes. >> >> How do you define: "member of a pathway"? >> >>> We are not representing processes which can occur among many types of >>> molecules under many circumstances. >> >> I'm not sure what you mean. Do you mean that the names in the pathway >> ontology are intended to denote processes that occur only in, say, >> mammals? If that was the case then that should be explicit. >> >>> The names of processes and pathways is one issue, but also at issue is the >>> extent of what is considered an individual process and what is a pathway. >>> This debate has gone on within GO itself. >> >> Perhaps. One of the goals of ontologies being in the OBO Foundry is >> that they have at least plausible answers to questions such as this. >> One of the things that we've had some better articulation of is what >> it means to be a process. For example, a collection of processes, >> summed together is also a process. And parts of processes are also >> processes. So the idea counting "how many processes" isn't >> particularly well defined. Instead, I think the goal is to describe >> what the participants are, and what the boundaries of the process are. >> >> I'd really like to get your view of the example that Suzi mentioned. >> Is it or is it not the case that the instances of PW:0000012 are >> exactly the sum of instances of GO:0009166 and GO:0009166 . If it >> isn't then what is the difference? If it is, how do you think this >> should be handled wrt the stated Foundry principle of avoiding >> redundancy (sometimes called achieving orthogonality)? Is the question >> (framed in terms of instances) even a question that fits with how the >> pathway ontology is conceived? >> >> I should perhaps emphasize that there ought to be different ways to >> gain recognition of quality work - the Foundry isn't attempting to be >> the clearing house for that. Rather, it is a more specific effort to >> create a set of ontologies that work together in a certain way. So the >> questions I ask here should not be construed in any way to evaluating >> quality. Rather they are intended to gain an understanding of how the >> pathway ontology is intended to work, to see whether that is >> consistent with the current view of how the Foundry ontologies can >> work together, and to see, if it is not, whether there is interest in >> proceeding down a road towards a state in which they can. >> >> Regards, >> Alan >> >>> >>> Mary Shimoyama >>> Program Manager >>> Rat Genome Database >>> Human and Molecular Genetics Center >>> Medical College of Wisconsin >>> shimoyama at mcw.edu >>> Tel: 414-456-7505 >>> Fax: 414-456-6516 >>> http://rgd.mcw.edu >>> >>> >>> -----Original Message----- >>> From: Alan Ruttenberg [mailto:alanruttenberg at gmail.com] >>> Sent: Monday, September 14, 2009 12:11 PM >>> To: Shimoyama, Mary >>> Subject: Re: [Obo-discuss] the "Foundry ontologies" >>> >>> On Mon, Sep 14, 2009 at 12:04 PM, Shimoyama, Mary >>> wrote: >>>> Hi Alan - I am not sure what face-to-face discussions Suzi is referring >>>> to. When you were here, we only talked about the phenotype work I am >>>> doing and not the pathway ontology - as far as I can remember. >>> >>> I may have been too informal - I've also discussed it with Victoria. >>> And this was before I had any specific role in the foundry, I suspect. >>> But let's start fresh and ignore that bit. The question on the table, >>> it would seem to me, would be whether you understand and agree with >>> the issue that is raised. >>> >>> I'll respond with something to this effect to the main list. >>> >>> -Alan >>> >>>> >>>> Mary Shimoyama >>>> Program Manager >>>> Rat Genome Database >>>> Human and Molecular Genetics Center >>>> Medical College of Wisconsin >>>> shimoyama at mcw.edu >>>> Tel: 414-456-7505 >>>> Fax: 414-456-6516 >>>> http://rgd.mcw.edu >>>> >>>> >>>> -----Original Message----- >>>> From: Alan Ruttenberg [mailto:alanruttenberg at gmail.com] >>>> Sent: Monday, September 14, 2009 11:00 AM >>>> To: Shimoyama, Mary >>>> Subject: Re: [Obo-discuss] the "Foundry ontologies" >>>> >>>> Hi Mary, >>>> >>>> Suzi followed up a bit about the pathway ontology, and there's >>>> obviously more to say. But I saw this bit and wanted to follow up with >>>> you for a little fact-finding. To what discussions are you referring >>>> to below? >>>> Happy to discuss anything at all further with you too, though I'm >>>> about halfway through a month of hellish scheduling, which is why >>>> response (on my side) has been a bit slow, >>>> >>>> Regards, >>>> Alan >>>> >>>> >>>> On Wed, Sep 9, 2009 at 9:12 AM, Shimoyama, Mary >>>> wrote: >>>>> For those who heard the sneers at an ICBO at such successful projects as >>>>> GO >>>>> or SO as not being "true" ontologies there seems to be little hope for >>>>> the >>>>> rest of us to have our efforts recognized. >>>> >>> >> >> ------------------------------------------------------------------------------ >> Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day >> trial. Simplify your report design, integration and deployment - and focus >> on >> what you do best, core application coding. Discover what's new with >> Crystal Reports now. http://p.sf.net/sfu/bobj-july >> _______________________________________________ >> Obo-discuss mailing list >> Obo-discuss at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/obo-discuss >> >> ------------------------------------------------------------------------------ >> Come build with us! The BlackBerry® Developer Conference in SF, CA >> is the only developer event you need to attend this year. Jumpstart your >> developing skills, take BlackBerry mobile applications to market and stay >> ahead of the curve. Join us from November 9-12, 2009. Register now! >> http://p.sf.net/sfu/devconf_______________________________________________ >> Obo-discuss mailing list >> Obo-discuss at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/obo-discuss > From midori at ebi.ac.uk Thu Sep 17 10:30:10 2009 From: midori at ebi.ac.uk (Midori Harris) Date: Thu, 17 Sep 2009 18:30:10 +0100 (BST) Subject: [Ontology-editors] xp load error Message-ID: Hi Chris, I'm getting load errors like this when I try to load the cc_xp_go imports: Load??Error,??line??-1 Assigned??non-existant??property??in??chain??id??*??in??term??OBO_REL:results_in_developmental_progression_of org.obo.dataadapter.DefaultOBOParser.endParse(DefaultOBOParser.java:1407) org.obo.dataadapter.AbstractParseEngine.parse(AbstractParseEngine.java:74) org.obo.dataadapter.OBOFileAdapter.doOperation(OBOFileAdapter.java:274) org.bbop.dataadapter.DataAdapterOperationTask.execute(DataAdapterOperationTask.java:43) org.bbop.util.AbstractTaskDelegate.run(AbstractTaskDelegate.java:60) org.bbop.swing.BackgroundEventQueue$BackgroundEventThread.executeTask(BackgroundEventQueue.java:137) org.bbop.swing.BackgroundEventQueue$BackgroundEventThread.run(BackgroundEventQueue.java:79) I can get things loaded by commenting out the 'holds_over_chain' and 'rule' lines (haven't tried separately), but I guess that isn't desirable long-term?? m From midori at ebi.ac.uk Thu Sep 17 10:38:38 2009 From: midori at ebi.ac.uk (Midori Harris) Date: Thu, 17 Sep 2009 18:38:38 +0100 (BST) Subject: [Ontology-editors] xp load error In-Reply-To: References: Message-ID: Also, bp_xp_cc seems to be using some relations that aren't in relations_process_xp.obo ... who's keeping them in sync? Should I? m On Thu, 17 Sep 2009, Midori Harris wrote: > Hi Chris, > > I'm getting load errors like this when I try to load the cc_xp_go imports: > > Load??Error,??line??-1 > Assigned??non-existant??property??in??chain??id??*??in??term??OBO_REL:results_in_developmental_progression_of > org.obo.dataadapter.DefaultOBOParser.endParse(DefaultOBOParser.java:1407) > org.obo.dataadapter.AbstractParseEngine.parse(AbstractParseEngine.java:74) > org.obo.dataadapter.OBOFileAdapter.doOperation(OBOFileAdapter.java:274) > org.bbop.dataadapter.DataAdapterOperationTask.execute(DataAdapterOperationTask.java:43) > org.bbop.util.AbstractTaskDelegate.run(AbstractTaskDelegate.java:60) > org.bbop.swing.BackgroundEventQueue$BackgroundEventThread.executeTask(BackgroundEventQueue.java:137) > org.bbop.swing.BackgroundEventQueue$BackgroundEventThread.run(BackgroundEventQueue.java:79) > > I can get things loaded by commenting out the 'holds_over_chain' and 'rule' > lines (haven't tried separately), but I guess that isn't desirable > long-term?? > > m From cjm at berkeleybop.org Thu Sep 17 11:00:51 2009 From: cjm at berkeleybop.org (Chris Mungall) Date: Thu, 17 Sep 2009 11:00:51 -0700 Subject: [Ontology-editors] xp load error In-Reply-To: References: Message-ID: just committed some quick fixes. here's the cvs log msg moved elongation terms to unvetted - can move back later; TODO - make them more complete added 2 relations - results_in_distribution_of (changed XP file to use OBO_REL prefix) - results_in_localization_of TODO - definitions for these On Sep 17, 2009, at 10:38 AM, Midori Harris wrote: > Also, bp_xp_cc seems to be using some relations that aren't in > relations_process_xp.obo ... who's keeping them in sync? Should I? > > m > > On Thu, 17 Sep 2009, Midori Harris wrote: > >> Hi Chris, >> >> I'm getting load errors like this when I try to load the cc_xp_go >> imports: >> >> Load? Error,? line? -1 Assigned? non-existant? property? in? chain? >> id? *? in? term? OBO_REL:results_in_developmental_progression_of >> org.obo.dataadapter.DefaultOBOParser.endParse(DefaultOBOParser.java: >> 1407) >> org >> .obo.dataadapter.AbstractParseEngine.parse(AbstractParseEngine.java: >> 74) >> org.obo.dataadapter.OBOFileAdapter.doOperation(OBOFileAdapter.java: >> 274) >> org >> .bbop >> .dataadapter >> .DataAdapterOperationTask.execute(DataAdapterOperationTask.java:43) >> org.bbop.util.AbstractTaskDelegate.run(AbstractTaskDelegate.java:60) >> org.bbop.swing.BackgroundEventQueue >> $BackgroundEventThread.executeTask(BackgroundEventQueue.java:137) >> org.bbop.swing.BackgroundEventQueue >> $BackgroundEventThread.run(BackgroundEventQueue.java:79) >> >> I can get things loaded by commenting out the 'holds_over_chain' >> and 'rule' lines (haven't tried separately), but I guess that isn't >> desirable long-term?? >> >> m From midori at ebi.ac.uk Thu Sep 17 11:18:02 2009 From: midori at ebi.ac.uk (Midori Harris) Date: Thu, 17 Sep 2009 19:18:02 +0100 (BST) Subject: [Ontology-editors] xp load error In-Reply-To: References: Message-ID: Thanks! I'll mail off-list if I spot any more (won't be tonight; it's dinnertime here). cheers, m On Thu, 17 Sep 2009, Chris Mungall wrote: > just committed some quick fixes. here's the cvs log msg > > moved elongation terms to unvetted - can move back later; TODO - make them > more complete > > added 2 relations > - results_in_distribution_of (changed XP file to use OBO_REL prefix) > - results_in_localization_of > > TODO - definitions for these > > > > On Sep 17, 2009, at 10:38 AM, Midori Harris wrote: > >> Also, bp_xp_cc seems to be using some relations that aren't in >> relations_process_xp.obo ... who's keeping them in sync? Should I? >> >> m >> >> On Thu, 17 Sep 2009, Midori Harris wrote: >> >>> Hi Chris, >>> >>> I'm getting load errors like this when I try to load the cc_xp_go imports: >>> >>> Load? Error,? line? -1 Assigned? non-existant? property? in? chain? id? *? >>> in? term? OBO_REL:results_in_developmental_progression_of >>> org.obo.dataadapter.DefaultOBOParser.endParse(DefaultOBOParser.java:1407) >>> org.obo.dataadapter.AbstractParseEngine.parse(AbstractParseEngine.java:74) >>> org.obo.dataadapter.OBOFileAdapter.doOperation(OBOFileAdapter.java:274) >>> org.bbop.dataadapter.DataAdapterOperationTask.execute(DataAdapterOperationTask.java:43) >>> org.bbop.util.AbstractTaskDelegate.run(AbstractTaskDelegate.java:60) >>> org.bbop.swing.BackgroundEventQueue$BackgroundEventThread.executeTask(BackgroundEventQueue.java:137) >>> org.bbop.swing.BackgroundEventQueue$BackgroundEventThread.run(BackgroundEventQueue.java:79) >>> >>> I can get things loaded by commenting out the 'holds_over_chain' and >>> 'rule' lines (haven't tried separately), but I guess that isn't desirable >>> long-term?? >>> >>> m > From cjm at berkeleybop.org Thu Sep 17 18:35:41 2009 From: cjm at berkeleybop.org (Chris Mungall) Date: Thu, 17 Sep 2009 18:35:41 -0700 Subject: [Ontology-editors] cell envelope Message-ID: <4E978DD7-6615-41E5-BEED-EB34E45233DA@berkeleybop.org> I see this is on the agenda http://wiki.geneontology.org/index.php/Cambridge_GO_Consortium_Meeting#Function-process_links I'm not familiar with the change & what the ontology was like before this. Shall I team up with someone and make slides on this? I'm curious as to why there is no relationship between cell envelope and periplasmic space From midori at ebi.ac.uk Fri Sep 18 01:15:53 2009 From: midori at ebi.ac.uk (Midori Harris) Date: Fri, 18 Sep 2009 09:15:53 +0100 (BST) Subject: [Ontology-editors] cell envelope In-Reply-To: <4E978DD7-6615-41E5-BEED-EB34E45233DA@berkeleybop.org> References: <4E978DD7-6615-41E5-BEED-EB34E45233DA@berkeleybop.org> Message-ID: I think cell envelope is on the agenda solely as an example of the has_part relation in use (much simpler than spliceosomes). Previously, the cell envelope was defined as NOT including the plasma membrane, and was is_a external encapsulating structure (itself also defined as excluding the PM). With a shift in prevailing community usage, it was changed to include the PM and its parentage adjusted accordingly; we couldn't say PM part_of cell env because the cell env is prokaryote-specific. I must have just overlooked a link to periplasmic space. That will also have to be has_part, because yeasties (fungi generally, I think) also have a periplasmic space. Don't know if we'd have a use for 'periplasmic space of cell envelope'. m On Thu, 17 Sep 2009, Chris Mungall wrote: > I see this is on the agenda > > http://wiki.geneontology.org/index.php/Cambridge_GO_Consortium_Meeting#Function-process_links > > I'm not familiar with the change & what the ontology was like before this. > Shall I team up with someone and make slides on this? > > I'm curious as to why there is no relationship between cell envelope and > periplasmic space > _______________________________________________ > Ontology-editors mailing list > Ontology-editors at geneontology.org > http://fafner.stanford.edu/mailman/listinfo/ontology-editors From hjd at informatics.jax.org Fri Sep 18 06:10:15 2009 From: hjd at informatics.jax.org (Harold Drabkin) Date: Fri, 18 Sep 2009 09:10:15 -0400 Subject: [Ontology-editors] cell envelope In-Reply-To: References: <4E978DD7-6615-41E5-BEED-EB34E45233DA@berkeleybop.org> Message-ID: <4AB386B7.5080804@informatics.jax.org> Yes, that was the example Mike gave me to put in. Harold Midori Harris wrote: > I think cell envelope is on the agenda solely as an example of the > has_part relation in use (much simpler than spliceosomes). > > Previously, the cell envelope was defined as NOT including the plasma > membrane, and was is_a external encapsulating structure (itself also > defined as excluding the PM). With a shift in prevailing community > usage, it was changed to include the PM and its parentage adjusted > accordingly; we couldn't say PM part_of cell env because the cell env > is prokaryote-specific. > > I must have just overlooked a link to periplasmic space. That will > also have to be has_part, because yeasties (fungi generally, I think) > also have a periplasmic space. Don't know if we'd have a use for > 'periplasmic space of cell envelope'. > > m > > On Thu, 17 Sep 2009, Chris Mungall wrote: > >> I see this is on the agenda >> >> http://wiki.geneontology.org/index.php/Cambridge_GO_Consortium_Meeting#Function-process_links >> >> >> I'm not familiar with the change & what the ontology was like before >> this. Shall I team up with someone and make slides on this? >> >> I'm curious as to why there is no relationship between cell envelope >> and periplasmic space >> _______________________________________________ >> Ontology-editors mailing list >> Ontology-editors at geneontology.org >> http://fafner.stanford.edu/mailman/listinfo/ontology-editors > _______________________________________________ > Ontology-editors mailing list > Ontology-editors at geneontology.org > http://fafner.stanford.edu/mailman/listinfo/ontology-editors From midori at ebi.ac.uk Tue Sep 22 07:37:42 2009 From: midori at ebi.ac.uk (Midori Harris) Date: Tue, 22 Sep 2009 15:37:42 +0100 (BST) Subject: [Ontology-editors] cell envelope In-Reply-To: References: <4E978DD7-6615-41E5-BEED-EB34E45233DA@berkeleybop.org> Message-ID: update - I noticed we have 'outer membrane-bounded periplasmic space' (GO:0030288), and it's part_of cell envelope, so no changes needed m On Fri, 18 Sep 2009, Midori Harris wrote: > I think cell envelope is on the agenda solely as an example of the has_part > relation in use (much simpler than spliceosomes). > > Previously, the cell envelope was defined as NOT including the plasma > membrane, and was is_a external encapsulating structure (itself also defined > as excluding the PM). With a shift in prevailing community usage, it was > changed to include the PM and its parentage adjusted accordingly; we couldn't > say PM part_of cell env because the cell env is prokaryote-specific. > > I must have just overlooked a link to periplasmic space. That will also have > to be has_part, because yeasties (fungi generally, I think) also have a > periplasmic space. Don't know if we'd have a use for 'periplasmic space of > cell envelope'. > > m > > On Thu, 17 Sep 2009, Chris Mungall wrote: > >> I see this is on the agenda >> >> http://wiki.geneontology.org/index.php/Cambridge_GO_Consortium_Meeting#Function-process_links >> >> I'm not familiar with the change & what the ontology was like before this. >> Shall I team up with someone and make slides on this? >> >> I'm curious as to why there is no relationship between cell envelope and >> periplasmic space >> _______________________________________________ >> Ontology-editors mailing list >> Ontology-editors at geneontology.org >> http://fafner.stanford.edu/mailman/listinfo/ontology-editors > _______________________________________________ > Ontology-editors mailing list > Ontology-editors at geneontology.org > http://fafner.stanford.edu/mailman/listinfo/ontology-editors > From cjm at berkeleybop.org Thu Sep 24 06:47:57 2009 From: cjm at berkeleybop.org (Chris Mungall) Date: Thu, 24 Sep 2009 14:47:57 +0100 Subject: [Ontology-editors] rhea <-> MF example Message-ID: cc-ing ontology group in advance of setting up a specific working group Rhea has: http://www.ebi.ac.uk/rhea/reaction.xhtml?id=10980 (adirectional?) http://www.ebi.ac.uk/rhea/reaction.xhtml?id=10981 (directional) GO has: [Term] id: GO:0018365 name: protein-serine epimerase activity namespace: molecular_function def: "Catalysis of the reaction: (protein)-L-serine = (protein)-D- serine." [EC:5.1.1.16] synonym: "protein-serine racemase activity" EXACT [EC:5.1.1.16] xref: EC:5.1.1.16 xref: MetaCyc:5.1.1.16-RXN is_a: GO:0016855 ! racemase and epimerase activity, acting on amino acids and derivatives Rhea could simply use the GO ID here rather than minting a new ID. I believe the GO reactions are typically non-directional In http://wiki.geneontology.org/index.php/XP:molecular_function_xp_chebi We have [Term] id: GO:0018365 name: protein-serine epimerase activity intersection_of: OBO_REL:has_input CHEBI:17115 ! L-serine intersection_of: OBO_REL:has_output CHEBI:16523 ! D-serine def: "Catalysis of the reaction: (protein)-L-serine = (protein)-D- serine." [EC:5.1.1.16] We're explicitly stating a direction here but maybe a neutral left/ right would be better. From hitz at genome.stanford.edu Thu Sep 24 07:24:27 2009 From: hitz at genome.stanford.edu (Ben Hitz) Date: Thu, 24 Sep 2009 15:24:27 +0100 Subject: [Ontology-editors] rhea <-> MF example In-Reply-To: References: Message-ID: <9F0C0108-3581-4400-BC89-5A06DB13EE60@genome.stanford.edu> If rhea has directional reactions, they are not physically correct. All chemical reactions are bi-directional - although many are essentially irreversible under physiological conditions. Ben On Sep 24, 2009, at 2:47 PM, Chris Mungall wrote: > i.ac.uk/rhea/reaction.xhtml?id=10980 (adirectional?) > > http://www.ebi.ac.uk/rhea/reaction.xhtml?id=10981 (directional) -- Ben Hitz Senior Scientific Programmer ** Saccharomyces Genome Database ** GO Consortium Stanford University ** hitz at genome.stanford.edu From hitz at genome.stanford.edu Thu Sep 24 07:26:22 2009 From: hitz at genome.stanford.edu (Ben Hitz) Date: Thu, 24 Sep 2009 15:26:22 +0100 Subject: [Ontology-editors] rhea <-> MF example In-Reply-To: References: Message-ID: Actually id=10983 is the bidirectional version. Ben On Sep 24, 2009, at 2:47 PM, Chris Mungall wrote: > cc-ing ontology group in advance of setting up a specific working > group > > Rhea has: > http://www.ebi.ac.uk/rhea/reaction.xhtml?id=10980 (adirectional?) > > http://www.ebi.ac.uk/rhea/reaction.xhtml?id=10981 (directional) > > GO has: > > [Term] > id: GO:0018365 > name: protein-serine epimerase activity > namespace: molecular_function > def: "Catalysis of the reaction: (protein)-L-serine = (protein)-D- > serine." [EC:5.1.1.16] > synonym: "protein-serine racemase activity" EXACT [EC:5.1.1.16] > xref: EC:5.1.1.16 > xref: MetaCyc:5.1.1.16-RXN > is_a: GO:0016855 ! racemase and epimerase activity, acting on amino > acids and derivatives > > Rhea could simply use the GO ID here rather than minting a new ID. I > believe the GO reactions are typically non-directional > > In http://wiki.geneontology.org/index.php/XP:molecular_function_xp_chebi > > We have > > [Term] > id: GO:0018365 > name: protein-serine epimerase activity > intersection_of: OBO_REL:has_input CHEBI:17115 ! L-serine > intersection_of: OBO_REL:has_output CHEBI:16523 ! D-serine > def: "Catalysis of the reaction: (protein)-L-serine = (protein)-D- > serine." [EC:5.1.1.16] > > We're explicitly stating a direction here but maybe a neutral left/ > right would be better. > -- Ben Hitz Senior Scientific Programmer ** Saccharomyces Genome Database ** GO Consortium Stanford University ** hitz at genome.stanford.edu From cjm at berkeleybop.org Thu Sep 24 10:06:30 2009 From: cjm at berkeleybop.org (Chris Mungall) Date: Thu, 24 Sep 2009 18:06:30 +0100 Subject: [Ontology-editors] rhea <-> MF example In-Reply-To: <9F0C0108-3581-4400-BC89-5A06DB13EE60@genome.stanford.edu> References: <9F0C0108-3581-4400-BC89-5A06DB13EE60@genome.stanford.edu> Message-ID: <9C06FD1A-1CA7-4CCC-AED8-F372D7F643A7@berkeleybop.org> But any instance of a reaction necessarily happens in one direction On Sep 24, 2009, at 3:24 PM, Ben Hitz wrote: > If rhea has directional reactions, they are not physically correct. > All chemical reactions are bi-directional - although many are > essentially irreversible under physiological conditions. > > Ben > On Sep 24, 2009, at 2:47 PM, Chris Mungall wrote: > >> i.ac.uk/rhea/reaction.xhtml?id=10980 (adirectional?) >> >> http://www.ebi.ac.uk/rhea/reaction.xhtml?id=10981 (directional) > > -- > Ben Hitz > Senior Scientific Programmer ** Saccharomyces Genome Database ** GO > Consortium > Stanford University ** hitz at genome.stanford.edu > > > > From midori at ebi.ac.uk Tue Sep 29 07:30:37 2009 From: midori at ebi.ac.uk (Midori Harris) Date: Tue, 29 Sep 2009 15:30:37 +0100 (BST) Subject: [Ontology-editors] merging MF and BP terms Message-ID: Hi, We have two SF items in which merging function and process terms looks like a viable solution to a problem. One is an overly-specific, single-step "process" that is essentially equivalent to a "function" (in the GO sense); the other is a "function" term that is defined such that it doesn't describe a single activity, and is essentially the same as a process term. SF 2864212 - merge MF term RNA splicing factor activity, transesterification mechanism GO:0031202 into BP term RNA splicing, via transesterification reactions GO:0000375 For this one, we would merge the MF term into the BP term, and ensure that the merged term has is_a ancestry in the BP graph. I think that's all we would change. https://sourceforge.net/tracker/?func=detail&aid=2864212&group_id=36855&atid=440764 SF 2864271 - merge BP term conversion of met-tRNAf to fmet-tRNA GO:0001718 into MF term methionyl-tRNA formyltransferase activity GO:0004479 https://sourceforge.net/tracker/?func=detail&aid=2864271&group_id=36855&atid=440764 In this case, the proposal is to merge the process term into the function term, ensuring that the merged term has is_a ancestry only in the MF ontology,and has a part_of link to a process term. Specifically, the merged term would retain the name and ID from GO:0004479, and would be part_of translational initiation GO:0006413. Does anyone have any objections or other comments on these? thanks, m From dph at informatics.jax.org Tue Sep 29 07:40:22 2009 From: dph at informatics.jax.org (David Hill) Date: Tue, 29 Sep 2009 10:40:22 -0400 Subject: [Ontology-editors] [Annotation] merging MF and BP terms In-Reply-To: References: Message-ID: <4AC21C56.8090208@informatics.jax.org> I just checked these out. This seems fine. In the first case, what would be the functions that are executed in the process 'RNA splicing, via transesterification reactions'? David Midori Harris wrote: > Hi, > > We have two SF items in which merging function and process terms looks > like a viable solution to a problem. One is an overly-specific, > single-step "process" that is essentially equivalent to a "function" > (in the GO sense); the other is a "function" term that is defined such > that it doesn't describe a single activity, and is essentially the > same as a process term. > > SF 2864212 - merge MF term > RNA splicing factor activity, transesterification mechanism GO:0031202 > > into BP term > RNA splicing, via transesterification reactions GO:0000375 > > For this one, we would merge the MF term into the BP term, and ensure > that the merged term has is_a ancestry in the BP graph. I think that's > all we would change. > > https://sourceforge.net/tracker/?func=detail&aid=2864212&group_id=36855&atid=440764 > > > > SF 2864271 - merge BP term > > conversion of met-tRNAf to fmet-tRNA GO:0001718 > > into MF term > methionyl-tRNA formyltransferase activity GO:0004479 > > https://sourceforge.net/tracker/?func=detail&aid=2864271&group_id=36855&atid=440764 > > > In this case, the proposal is to merge the process term into the > function term, ensuring that the merged term has is_a ancestry only in > the MF ontology,and has a part_of link to a process term. > Specifically, the merged term would retain the name and ID from > GO:0004479, and would be part_of translational initiation GO:0006413. > > Does anyone have any objections or other comments on these? > > thanks, > m > _______________________________________________ > Annotation mailing list > Annotation at geneontology.org > http://fafner.stanford.edu/mailman/listinfo/annotation -- David P. Hill, Ph.D. Bioinformatics Scientist: Ontology Development Gene Ontology Consortium The Jackson Laboratory www.geneontology.org www.informatics.jax.org tel:207-288-6430 From midori at ebi.ac.uk Tue Sep 29 07:46:19 2009 From: midori at ebi.ac.uk (Midori Harris) Date: Tue, 29 Sep 2009 15:46:19 +0100 (BST) Subject: [Ontology-editors] [Annotation] merging MF and BP terms In-Reply-To: <4AC21C56.8090208@informatics.jax.org> References: <4AC21C56.8090208@informatics.jax.org> Message-ID: I think they would be the reactions described by the two is_a children of GO:0031202, 'first spliceosomal transesterification activity' (GO:0000384) and 'second spliceosomal transesterification activity' (GO:0000386). Karen C will know for sure. An alternative for the splicing case might be to alter the definition of GO:0031202, to something more activity-ish. Maybe "catalysis of either of the transesterification reactions that take place as part of spliceosomal mRNA splicing"? m On Tue, 29 Sep 2009, David Hill wrote: > I just checked these out. This seems fine. In the first case, what would be > the functions that are executed in the process 'RNA splicing, via > transesterification reactions'? > > David > > Midori Harris wrote: >> Hi, >> >> We have two SF items in which merging function and process terms looks like >> a viable solution to a problem. One is an overly-specific, single-step >> "process" that is essentially equivalent to a "function" (in the GO sense); >> the other is a "function" term that is defined such that it doesn't >> describe a single activity, and is essentially the same as a process term. >> >> SF 2864212 - merge MF term >> RNA splicing factor activity, transesterification mechanism GO:0031202 >> >> into BP term >> RNA splicing, via transesterification reactions GO:0000375 >> >> For this one, we would merge the MF term into the BP term, and ensure that >> the merged term has is_a ancestry in the BP graph. I think that's all we >> would change. >> >> https://sourceforge.net/tracker/?func=detail&aid=2864212&group_id=36855&atid=440764 >> >> >> SF 2864271 - merge BP term >> >> conversion of met-tRNAf to fmet-tRNA GO:0001718 >> >> into MF term >> methionyl-tRNA formyltransferase activity GO:0004479 >> >> https://sourceforge.net/tracker/?func=detail&aid=2864271&group_id=36855&atid=440764 >> >> In this case, the proposal is to merge the process term into the function >> term, ensuring that the merged term has is_a ancestry only in the MF >> ontology,and has a part_of link to a process term. Specifically, the merged >> term would retain the name and ID from GO:0004479, and would be part_of >> translational initiation GO:0006413. >> >> Does anyone have any objections or other comments on these? >> >> thanks, >> m >> _______________________________________________ >> Annotation mailing list >> Annotation at geneontology.org >> http://fafner.stanford.edu/mailman/listinfo/annotation > > From cjm at yuri.lbl.gov Tue Sep 29 07:52:08 2009 From: cjm at yuri.lbl.gov (Chris Mungall) Date: Tue, 29 Sep 2009 07:52:08 -0700 Subject: [Ontology-editors] problem with obo file Message-ID: ERROR: namespace-different-from-is_a-parent: GO:0015148 in molecular_function, GO:0015753 in biological_process From dph at informatics.jax.org Tue Sep 29 07:53:28 2009 From: dph at informatics.jax.org (David Hill) Date: Tue, 29 Sep 2009 10:53:28 -0400 Subject: [Ontology-editors] [Annotation] merging MF and BP terms In-Reply-To: References: <4AC21C56.8090208@informatics.jax.org> Message-ID: <4AC21F68.5010803@informatics.jax.org> Great, this makes sense. Those 2 is_a children should become part_of children of the process as long as they are only executed in this process. this would follow from the merge. David Midori Harris wrote: > I think they would be the reactions described by the two is_a children > of GO:0031202, 'first spliceosomal transesterification activity' > (GO:0000384) and 'second spliceosomal transesterification activity' > (GO:0000386). Karen C will know for sure. > > An alternative for the splicing case might be to alter the definition > of GO:0031202, to something more activity-ish. Maybe "catalysis of > either of the transesterification reactions that take place as part of > spliceosomal mRNA splicing"? > > m > > On Tue, 29 Sep 2009, David Hill wrote: > >> I just checked these out. This seems fine. In the first case, what >> would be the functions that are executed in the process 'RNA >> splicing, via transesterification reactions'? >> >> David >> >> Midori Harris wrote: >>> Hi, >>> >>> We have two SF items in which merging function and process terms >>> looks like a viable solution to a problem. One is an >>> overly-specific, single-step "process" that is essentially >>> equivalent to a "function" (in the GO sense); the other is a >>> "function" term that is defined such that it doesn't describe a >>> single activity, and is essentially the same as a process term. >>> >>> SF 2864212 - merge MF term >>> RNA splicing factor activity, transesterification mechanism >>> GO:0031202 >>> >>> into BP term >>> RNA splicing, via transesterification reactions GO:0000375 >>> >>> For this one, we would merge the MF term into the BP term, and >>> ensure that the merged term has is_a ancestry in the BP graph. I >>> think that's all we would change. >>> >>> https://sourceforge.net/tracker/?func=detail&aid=2864212&group_id=36855&atid=440764 >>> >>> >>> SF 2864271 - merge BP term >>> >>> conversion of met-tRNAf to fmet-tRNA GO:0001718 >>> >>> into MF term >>> methionyl-tRNA formyltransferase activity GO:0004479 >>> >>> https://sourceforge.net/tracker/?func=detail&aid=2864271&group_id=36855&atid=440764 >>> >>> In this case, the proposal is to merge the process term into the >>> function term, ensuring that the merged term has is_a ancestry only >>> in the MF ontology,and has a part_of link to a process term. >>> Specifically, the merged term would retain the name and ID from >>> GO:0004479, and would be part_of translational initiation GO:0006413. >>> >>> Does anyone have any objections or other comments on these? >>> >>> thanks, >>> m >>> _______________________________________________ >>> Annotation mailing list >>> Annotation at geneontology.org >>> http://fafner.stanford.edu/mailman/listinfo/annotation >> >> -- David P. Hill, Ph.D. Bioinformatics Scientist: Ontology Development Gene Ontology Consortium The Jackson Laboratory www.geneontology.org www.informatics.jax.org tel:207-288-6430 From midori at ebi.ac.uk Tue Sep 29 07:56:42 2009 From: midori at ebi.ac.uk (Midori Harris) Date: Tue, 29 Sep 2009 15:56:42 +0100 (BST) Subject: [Ontology-editors] [Annotation] merging MF and BP terms In-Reply-To: <4AC21F68.5010803@informatics.jax.org> References: <4AC21C56.8090208@informatics.jax.org> <4AC21F68.5010803@informatics.jax.org> Message-ID: Yep, the way the 2 children are named and defined is completely consistent with making them part_of the appropriate splicing BP term. m On Tue, 29 Sep 2009, David Hill wrote: > Great, this makes sense. Those 2 is_a children should become part_of children > of the process as long as they are only executed in this process. this would > follow from the merge. > > David > > Midori Harris wrote: >> I think they would be the reactions described by the two is_a children of >> GO:0031202, 'first spliceosomal transesterification activity' (GO:0000384) >> and 'second spliceosomal transesterification activity' (GO:0000386). Karen >> C will know for sure. >> >> An alternative for the splicing case might be to alter the definition of >> GO:0031202, to something more activity-ish. Maybe "catalysis of either of >> the transesterification reactions that take place as part of spliceosomal >> mRNA splicing"? >> >> m >> >> On Tue, 29 Sep 2009, David Hill wrote: >> >>> I just checked these out. This seems fine. In the first case, what would >>> be the functions that are executed in the process 'RNA splicing, via >>> transesterification reactions'? >>> >>> David >>> >>> Midori Harris wrote: >>>> Hi, >>>> >>>> We have two SF items in which merging function and process terms looks >>>> like a viable solution to a problem. One is an overly-specific, >>>> single-step "process" that is essentially equivalent to a "function" (in >>>> the GO sense); the other is a "function" term that is defined such that >>>> it doesn't describe a single activity, and is essentially the same as a >>>> process term. >>>> >>>> SF 2864212 - merge MF term >>>> RNA splicing factor activity, transesterification mechanism GO:0031202 >>>> >>>> into BP term >>>> RNA splicing, via transesterification reactions GO:0000375 >>>> >>>> For this one, we would merge the MF term into the BP term, and ensure >>>> that the merged term has is_a ancestry in the BP graph. I think that's >>>> all we would change. >>>> >>>> https://sourceforge.net/tracker/?func=detail&aid=2864212&group_id=36855&atid=440764 >>>> >>>> SF 2864271 - merge BP term >>>> >>>> conversion of met-tRNAf to fmet-tRNA GO:0001718 >>>> >>>> into MF term >>>> methionyl-tRNA formyltransferase activity GO:0004479 >>>> >>>> https://sourceforge.net/tracker/?func=detail&aid=2864271&group_id=36855&atid=440764 >>>> In this case, the proposal is to merge the process term into the function >>>> term, ensuring that the merged term has is_a ancestry only in the MF >>>> ontology,and has a part_of link to a process term. Specifically, the >>>> merged term would retain the name and ID from GO:0004479, and would be >>>> part_of translational initiation GO:0006413. >>>> >>>> Does anyone have any objections or other comments on these? >>>> >>>> thanks, >>>> m >>>> _______________________________________________ >>>> Annotation mailing list >>>> Annotation at geneontology.org >>>> http://fafner.stanford.edu/mailman/listinfo/annotation >>> >>> > > From midori at ebi.ac.uk Tue Sep 29 08:12:46 2009 From: midori at ebi.ac.uk (Midori Harris) Date: Tue, 29 Sep 2009 16:12:46 +0100 (BST) Subject: [Ontology-editors] problem with obo file In-Reply-To: References: Message-ID: fix will be committed in an hour or so > ERROR: namespace-different-from-is_a-parent: GO:0015148 in molecular_function, GO:0015753 in biological_process From kchris at genome.stanford.edu Tue Sep 29 09:00:31 2009 From: kchris at genome.stanford.edu (Karen Christie) Date: Tue, 29 Sep 2009 09:00:31 -0700 (PDT) Subject: [Ontology-editors] [Annotation] merging MF and BP terms In-Reply-To: References: <4AC21C56.8090208@informatics.jax.org> <4AC21F68.5010803@informatics.jax.org> Message-ID: I'm looking at the graph of the function ontology: - molecular_function -- catalytic activity --- RNA splicing factor activity, transesterification mechanism ---- first spliceosomal transesterification activity ---- second spliceosomal transesterification activity and I'm wondering whether or not we need a grouping term for the two different catalytic activities of the spliceosome. Perhaps, it is better if there is not, so that if you only know that a gene product contributes to splicing, but not which reaction(s) it is specifically involved in/required for, then you only make a process annotation and give it "molecular_function" [unknown] for its function annotation. -Karen On Tue, 29 Sep 2009, Midori Harris wrote: > Yep, the way the 2 children are named and defined is completely consistent > with making them part_of the appropriate splicing BP term. > > m > > On Tue, 29 Sep 2009, David Hill wrote: > >> Great, this makes sense. Those 2 is_a children should become part_of >> children of the process as long as they are only executed in this process. >> this would follow from the merge. >> >> David >> >> Midori Harris wrote: >>> I think they would be the reactions described by the two is_a children of >>> GO:0031202, 'first spliceosomal transesterification activity' (GO:0000384) >>> and 'second spliceosomal transesterification activity' (GO:0000386). Karen >>> C will know for sure. >>> >>> An alternative for the splicing case might be to alter the definition of >>> GO:0031202, to something more activity-ish. Maybe "catalysis of either of >>> the transesterification reactions that take place as part of spliceosomal >>> mRNA splicing"? >>> >>> m >>> >>> On Tue, 29 Sep 2009, David Hill wrote: >>> >>>> I just checked these out. This seems fine. In the first case, what would >>>> be the functions that are executed in the process 'RNA splicing, via >>>> transesterification reactions'? >>>> >>>> David >>>> >>>> Midori Harris wrote: >>>>> Hi, >>>>> >>>>> We have two SF items in which merging function and process terms looks >>>>> like a viable solution to a problem. One is an overly-specific, >>>>> single-step "process" that is essentially equivalent to a "function" (in >>>>> the GO sense); the other is a "function" term that is defined such that >>>>> it doesn't describe a single activity, and is essentially the same as a >>>>> process term. >>>>> >>>>> SF 2864212 - merge MF term >>>>> RNA splicing factor activity, transesterification mechanism GO:0031202 >>>>> >>>>> into BP term >>>>> RNA splicing, via transesterification reactions GO:0000375 >>>>> >>>>> For this one, we would merge the MF term into the BP term, and ensure >>>>> that the merged term has is_a ancestry in the BP graph. I think that's >>>>> all we would change. >>>>> >>>>> https://sourceforge.net/tracker/?func=detail&aid=2864212&group_id=36855&atid=440764 >>>>> SF 2864271 - merge BP term >>>>> >>>>> conversion of met-tRNAf to fmet-tRNA GO:0001718 >>>>> >>>>> into MF term >>>>> methionyl-tRNA formyltransferase activity GO:0004479 >>>>> >>>>> https://sourceforge.net/tracker/?func=detail&aid=2864271&group_id=36855&atid=440764 >>>>> In this case, the proposal is to merge the process term into the >>>>> function term, ensuring that the merged term has is_a ancestry only in >>>>> the MF ontology,and has a part_of link to a process term. Specifically, >>>>> the merged term would retain the name and ID from GO:0004479, and would >>>>> be part_of translational initiation GO:0006413. >>>>> >>>>> Does anyone have any objections or other comments on these? >>>>> >>>>> thanks, >>>>> m >>>>> _______________________________________________ >>>>> Annotation mailing list >>>>> Annotation at geneontology.org >>>>> http://fafner.stanford.edu/mailman/listinfo/annotation >>>> >>>> >> >> > _______________________________________________ > Ontology-editors mailing list > Ontology-editors at geneontology.org > http://fafner.stanford.edu/mailman/listinfo/ontology-editors > From tberardi at acoma.stanford.edu Tue Sep 29 10:30:16 2009 From: tberardi at acoma.stanford.edu (Tanya Berardini) Date: Tue, 29 Sep 2009 10:30:16 -0700 Subject: [Ontology-editors] [Annotation] merging MF and BP terms In-Reply-To: References: <4AC21C56.8090208@informatics.jax.org> <4AC21F68.5010803@informatics.jax.org> Message-ID: <8e22ab960909291030v68447629td1fe0cb91d83b280@mail.gmail.com> On Tue, Sep 29, 2009 at 9:00 AM, Karen Christie wrote: > I'm looking at the graph of the function ontology: > > - molecular_function > -- catalytic activity > --- RNA splicing factor activity, transesterification mechanism > ---- first spliceosomal transesterification activity > ---- second spliceosomal transesterification activity > > and I'm wondering whether or not we need a grouping term for the two > different catalytic activities of the spliceosome. > Perhaps, it is better if there is not, so that if you only know that a gene > product contributes to splicing, but not which reaction(s) it is > specifically involved in/required for, then you only make a process > annotation and give it "molecular_function" [unknown] for its function > annotation. > Yes, I think this is appropriate especially since the two transesterification reactions are different. Tanya > > -Karen > > > > > On Tue, 29 Sep 2009, Midori Harris wrote: > > Yep, the way the 2 children are named and defined is completely consistent >> with making them part_of the appropriate splicing BP term. >> >> m >> >> On Tue, 29 Sep 2009, David Hill wrote: >> >> Great, this makes sense. Those 2 is_a children should become part_of >>> children of the process as long as they are only executed in this process. >>> this would follow from the merge. >>> >>> David >>> >>> Midori Harris wrote: >>> >>>> I think they would be the reactions described by the two is_a children >>>> of GO:0031202, 'first spliceosomal transesterification activity' >>>> (GO:0000384) and 'second spliceosomal transesterification activity' >>>> (GO:0000386). Karen C will know for sure. >>>> >>>> An alternative for the splicing case might be to alter the definition of >>>> GO:0031202, to something more activity-ish. Maybe "catalysis of either of >>>> the transesterification reactions that take place as part of spliceosomal >>>> mRNA splicing"? >>>> >>>> m >>>> >>>> On Tue, 29 Sep 2009, David Hill wrote: >>>> >>>> I just checked these out. This seems fine. In the first case, what >>>>> would be the functions that are executed in the process 'RNA splicing, via >>>>> transesterification reactions'? >>>>> >>>>> David >>>>> >>>>> Midori Harris wrote: >>>>> >>>>>> Hi, >>>>>> >>>>>> We have two SF items in which merging function and process terms looks >>>>>> like a viable solution to a problem. One is an overly-specific, single-step >>>>>> "process" that is essentially equivalent to a "function" (in the GO sense); >>>>>> the other is a "function" term that is defined such that it doesn't describe >>>>>> a single activity, and is essentially the same as a process term. >>>>>> >>>>>> SF 2864212 - merge MF term >>>>>> RNA splicing factor activity, transesterification mechanism >>>>>> GO:0031202 >>>>>> >>>>>> into BP term >>>>>> RNA splicing, via transesterification reactions GO:0000375 >>>>>> >>>>>> For this one, we would merge the MF term into the BP term, and ensure >>>>>> that the merged term has is_a ancestry in the BP graph. I think that's all >>>>>> we would change. >>>>>> >>>>>> >>>>>> https://sourceforge.net/tracker/?func=detail&aid=2864212&group_id=36855&atid=440764SF 2864271 - merge BP term >>>>>> >>>>>> conversion of met-tRNAf to fmet-tRNA GO:0001718 >>>>>> >>>>>> into MF term >>>>>> methionyl-tRNA formyltransferase activity GO:0004479 >>>>>> >>>>>> >>>>>> https://sourceforge.net/tracker/?func=detail&aid=2864271&group_id=36855&atid=440764In this case, the proposal is to merge the process term into the function >>>>>> term, ensuring that the merged term has is_a ancestry only in the MF >>>>>> ontology,and has a part_of link to a process term. Specifically, the merged >>>>>> term would retain the name and ID from GO:0004479, and would be part_of >>>>>> translational initiation GO:0006413. >>>>>> >>>>>> Does anyone have any objections or other comments on these? >>>>>> >>>>>> thanks, >>>>>> m >>>>>> _______________________________________________ >>>>>> Annotation mailing list >>>>>> Annotation at geneontology.org >>>>>> http://fafner.stanford.edu/mailman/listinfo/annotation >>>>>> >>>>> >>>>> >>>>> >>> >>> _______________________________________________ >> Ontology-editors mailing list >> Ontology-editors at geneontology.org >> http://fafner.stanford.edu/mailman/listinfo/ontology-editors >> >> _______________________________________________ > Ontology-editors mailing list > Ontology-editors at geneontology.org > http://fafner.stanford.edu/mailman/listinfo/ontology-editors > -- Tanya Berardini TAIR Curator www.arabidopsis.org -------------- next part -------------- An HTML attachment was scrubbed... URL: From alanruttenberg at gmail.com Tue Sep 29 10:52:53 2009 From: alanruttenberg at gmail.com (Alan Ruttenberg) Date: Tue, 29 Sep 2009 13:52:53 -0400 Subject: [Ontology-editors] [Annotation] merging MF and BP terms In-Reply-To: References: Message-ID: <29af5e2d0909291052v3b9821ele28f6229ac2b7232@mail.gmail.com> The problem is that this still confuses function with process. Functions are the things that the enzymes (using the term broadly) have that make the processes happen. If one goes into more detail, there are a variety of participants in the process (substrates, products, "cofactors"), but only one bearer of the function in a particular process. Every function has a corresponding process, its "realization". Functions are not parts of processes. The realizations of functions are. The functions have a physical basis in the structure of their bearers. The functions exist before any process happens. A molecule may have a function but never actually realize it. These distinctions have important consequences as people such as myself work on adding more detail to the GO. The processes get more detail by adding *participants*. Within those processes, when there is a molecule like an enzyme that functions in a particular way, it needs to be distinguished from the other participants in some way. The way that it is distinguished is by saying it bears a function. The heuristic that a molecular function is a "single step" process is misused, IMO. That it is a "single step" derives from the fact that in such descriptions there is a single molecule that has a function. The step is the process that surrounds execution of that function. As I said, getting this distinction clear is essential for future detailing of GO processes such as I and others do. Can we start to get this right here and now? -Alan On Tue, Sep 29, 2009 at 10:30 AM, Midori Harris wrote: > Hi, > > We have two SF items in which merging function and process terms looks like > a viable solution to a problem. One is an overly-specific, single-step > "process" that is essentially equivalent to a "function" (in the GO sense); > the other is a "function" term that is defined such that it doesn't describe > a single activity, and is essentially the same as a process term. > > SF 2864212 - merge MF term > RNA splicing factor activity, transesterification mechanism GO:0031202 > > into BP term > RNA splicing, via transesterification reactions GO:0000375 > > For this one, we would merge the MF term into the BP term, and ensure that > the merged term has is_a ancestry in the BP graph. I think that's all we > would change. > > > https://sourceforge.net/tracker/?func=detail&aid=2864212&group_id=36855&atid=440764 > > > SF 2864271 - merge BP term > > conversion of met-tRNAf to fmet-tRNA GO:0001718 > > into MF term > methionyl-tRNA formyltransferase activity GO:0004479 > > > https://sourceforge.net/tracker/?func=detail&aid=2864271&group_id=36855&atid=440764 > > In this case, the proposal is to merge the process term into the function > term, ensuring that the merged term has is_a ancestry only in the MF > ontology,and has a part_of link to a process term. Specifically, the merged > term would retain the name and ID from GO:0004479, and would be part_of > translational initiation GO:0006413. > > Does anyone have any objections or other comments on these? > > thanks, > m > _______________________________________________ > Annotation mailing list > Annotation at geneontology.org > http://fafner.stanford.edu/mailman/listinfo/annotation > -------------- next part -------------- An HTML attachment was scrubbed... URL: From kchris at genome.stanford.edu Tue Sep 29 11:18:05 2009 From: kchris at genome.stanford.edu (Karen Christie) Date: Tue, 29 Sep 2009 11:18:05 -0700 (PDT) Subject: [Ontology-editors] [Annotation] merging MF and BP terms In-Reply-To: <29af5e2d0909291052v3b9821ele28f6229ac2b7232@mail.gmail.com> References: <29af5e2d0909291052v3b9821ele28f6229ac2b7232@mail.gmail.com> Message-ID: I'd like to know you deal with multisubunit enzymes, where no one particular gene product has the catalytic activity. -Karen On Tue, 29 Sep 2009, Alan Ruttenberg wrote: > The problem is that this still confuses function with process. Functions are > the things that the enzymes (using the term broadly) have that make the > processes happen. If one goes into more detail, there are a variety of > participants in the process (substrates, products, "cofactors"), but only > one bearer of the function in a particular process. > Every function has a corresponding process, its "realization". > > Functions are not parts of processes. The realizations of functions are. > > The functions have a physical basis in the structure of their bearers. > > The functions exist before any process happens. A molecule may have a > function but never actually realize it. > > These distinctions have important consequences as people such as myself work > on adding more detail to the GO. The processes get more detail by adding > *participants*. Within those processes, when there is a molecule like an > enzyme that functions in a particular way, it needs to be distinguished from > the other participants in some way. The way that it is distinguished is by > saying it bears a function. > > The heuristic that a molecular function is a "single step" process is > misused, IMO. That it is a "single step" derives from the fact that in such > descriptions there is a single molecule that has a function. The step is the > process that surrounds execution of that function. > > As I said, getting this distinction clear is essential for future detailing > of GO processes such as I and others do. Can we start to get this right here > and now? > > -Alan > > > > On Tue, Sep 29, 2009 at 10:30 AM, Midori Harris wrote: > >> Hi, >> >> We have two SF items in which merging function and process terms looks like >> a viable solution to a problem. One is an overly-specific, single-step >> "process" that is essentially equivalent to a "function" (in the GO sense); >> the other is a "function" term that is defined such that it doesn't describe >> a single activity, and is essentially the same as a process term. >> >> SF 2864212 - merge MF term >> RNA splicing factor activity, transesterification mechanism GO:0031202 >> >> into BP term >> RNA splicing, via transesterification reactions GO:0000375 >> >> For this one, we would merge the MF term into the BP term, and ensure that >> the merged term has is_a ancestry in the BP graph. I think that's all we >> would change. >> >> >> https://sourceforge.net/tracker/?func=detail&aid=2864212&group_id=36855&atid=440764 >> >> >> SF 2864271 - merge BP term >> >> conversion of met-tRNAf to fmet-tRNA GO:0001718 >> >> into MF term >> methionyl-tRNA formyltransferase activity GO:0004479 >> >> >> https://sourceforge.net/tracker/?func=detail&aid=2864271&group_id=36855&atid=440764 >> >> In this case, the proposal is to merge the process term into the function >> term, ensuring that the merged term has is_a ancestry only in the MF >> ontology,and has a part_of link to a process term. Specifically, the merged >> term would retain the name and ID from GO:0004479, and would be part_of >> translational initiation GO:0006413. >> >> Does anyone have any objections or other comments on these? >> >> thanks, >> m >> _______________________________________________ >> Annotation mailing list >> Annotation at geneontology.org >> http://fafner.stanford.edu/mailman/listinfo/annotation >> > From Judith.Blake at jax.org Tue Sep 29 11:38:09 2009 From: Judith.Blake at jax.org (Judith Blake) Date: Tue, 29 Sep 2009 14:38:09 -0400 Subject: [Ontology-editors] [Annotation] merging MF and BP terms In-Reply-To: <29af5e2d0909291052v3b9821ele28f6229ac2b7232@mail.gmail.com> Message-ID: Functions are occurrants. A function describes an action, biochemical, structural, whatever. Functions are part of processes. A variety of structures, encoded in a molecule, have the potential to realize a function. That is a property of the molecule, not of the function. A molecule can have encoded the potential of engaging in several functions. The confusion I think is that of the relationship between the gene product and a molecular function, and the molecular function and the biological process. Judy On 9/29/09 1:52 PM, "Alan Ruttenberg" wrote: The problem is that this still confuses function with process. Functions are the things that the enzymes (using the term broadly) have that make the processes happen. If one goes into more detail, there are a variety of participants in the process (substrates, products, "cofactors"), but only one bearer of the function in a particular process. Every function has a corresponding process, its "realization". Functions are not parts of processes. The realizations of functions are. The functions have a physical basis in the structure of their bearers. The functions exist before any process happens. A molecule may have a function but never actually realize it. These distinctions have important consequences as people such as myself work on adding more detail to the GO. The processes get more detail by adding *participants*. Within those processes, when there is a molecule like an enzyme that functions in a particular way, it needs to be distinguished from the other participants in some way. The way that it is distinguished is by saying it bears a function. The heuristic that a molecular function is a "single step" process is misused, IMO. That it is a "single step" derives from the fact that in such descriptions there is a single molecule that has a function. The step is the process that surrounds execution of that function. As I said, getting this distinction clear is essential for future detailing of GO processes such as I and others do. Can we start to get this right here and now? -Alan On Tue, Sep 29, 2009 at 10:30 AM, Midori Harris wrote: Hi, We have two SF items in which merging function and process terms looks like a viable solution to a problem. One is an overly-specific, single-step "process" that is essentially equivalent to a "function" (in the GO sense); the other is a "function" term that is defined such that it doesn't describe a single activity, and is essentially the same as a process term. SF 2864212 - merge MF term RNA splicing factor activity, transesterification mechanism GO:0031202 into BP term RNA splicing, via transesterification reactions GO:0000375 For this one, we would merge the MF term into the BP term, and ensure that the merged term has is_a ancestry in the BP graph. I think that's all we would change. https://sourceforge.net/tracker/?func=detail&aid=2864212&group_id=36855&atid=440764 SF 2864271 - merge BP term conversion of met-tRNAf to fmet-tRNA GO:0001718 into MF term methionyl-tRNA formyltransferase activity GO:0004479 https://sourceforge.net/tracker/?func=detail&aid=2864271&group_id=36855&atid=440764 In this case, the proposal is to merge the process term into the function term, ensuring that the merged term has is_a ancestry only in the MF ontology,and has a part_of link to a process term. Specifically, the merged term would retain the name and ID from GO:0004479, and would be part_of translational initiation GO:0006413. Does anyone have any objections or other comments on these? thanks, m _______________________________________________ Annotation mailing list Annotation at geneontology.org http://fafner.stanford.edu/mailman/listinfo/annotation -------------- next part -------------- An HTML attachment was scrubbed... URL: From alanruttenberg at gmail.com Tue Sep 29 11:39:30 2009 From: alanruttenberg at gmail.com (Alan Ruttenberg) Date: Tue, 29 Sep 2009 14:39:30 -0400 Subject: [Ontology-editors] [Annotation] merging MF and BP terms In-Reply-To: References: <29af5e2d0909291052v3b9821ele28f6229ac2b7232@mail.gmail.com> Message-ID: <29af5e2d0909291139n125fe58bpfdc59b449affa481@mail.gmail.com> On Tue, Sep 29, 2009 at 2:18 PM, Karen Christie wrote: > I'd like to know you deal with multisubunit enzymes, where no one > particular gene product has the catalytic activity. > I think that the function is then ascribed to either the complex as a whole, or to the part of the complex that is responsible for the activity. The schema is: Complex has_part protein 1 has_part protein 2 ... Complex bears function (reverse function inheres in complex) function realized in process which implies: Complex participates in process. In addition, typically there will be other participants in the process. For example the mRNA that is being spliced. If we are at the point where we need to argue about which part of the complex bears the function then we are in good shape. -Alan > -Karen > > > > On Tue, 29 Sep 2009, Alan Ruttenberg wrote: > > The problem is that this still confuses function with process. Functions >> are >> the things that the enzymes (using the term broadly) have that make the >> processes happen. If one goes into more detail, there are a variety of >> participants in the process (substrates, products, "cofactors"), but only >> one bearer of the function in a particular process. >> Every function has a corresponding process, its "realization". >> >> Functions are not parts of processes. The realizations of functions are. >> >> The functions have a physical basis in the structure of their bearers. >> >> The functions exist before any process happens. A molecule may have a >> function but never actually realize it. >> >> These distinctions have important consequences as people such as myself >> work >> on adding more detail to the GO. The processes get more detail by adding >> *participants*. Within those processes, when there is a molecule like an >> enzyme that functions in a particular way, it needs to be distinguished >> from >> the other participants in some way. The way that it is distinguished is by >> saying it bears a function. >> >> The heuristic that a molecular function is a "single step" process is >> misused, IMO. That it is a "single step" derives from the fact that in >> such >> descriptions there is a single molecule that has a function. The step is >> the >> process that surrounds execution of that function. >> >> As I said, getting this distinction clear is essential for future >> detailing >> of GO processes such as I and others do. Can we start to get this right >> here >> and now? >> >> -Alan >> >> >> >> On Tue, Sep 29, 2009 at 10:30 AM, Midori Harris wrote: >> >> Hi, >>> >>> We have two SF items in which merging function and process terms looks >>> like >>> a viable solution to a problem. One is an overly-specific, single-step >>> "process" that is essentially equivalent to a "function" (in the GO >>> sense); >>> the other is a "function" term that is defined such that it doesn't >>> describe >>> a single activity, and is essentially the same as a process term. >>> >>> SF 2864212 - merge MF term >>> RNA splicing factor activity, transesterification mechanism GO:0031202 >>> >>> into BP term >>> RNA splicing, via transesterification reactions GO:0000375 >>> >>> For this one, we would merge the MF term into the BP term, and ensure >>> that >>> the merged term has is_a ancestry in the BP graph. I think that's all we >>> would change. >>> >>> >>> >>> https://sourceforge.net/tracker/?func=detail&aid=2864212&group_id=36855&atid=440764 >>> >>> >>> SF 2864271 - merge BP term >>> >>> conversion of met-tRNAf to fmet-tRNA GO:0001718 >>> >>> into MF term >>> methionyl-tRNA formyltransferase activity GO:0004479 >>> >>> >>> >>> https://sourceforge.net/tracker/?func=detail&aid=2864271&group_id=36855&atid=440764 >>> >>> In this case, the proposal is to merge the process term into the function >>> term, ensuring that the merged term has is_a ancestry only in the MF >>> ontology,and has a part_of link to a process term. Specifically, the >>> merged >>> term would retain the name and ID from GO:0004479, and would be part_of >>> translational initiation GO:0006413. >>> >>> Does anyone have any objections or other comments on these? >>> >>> thanks, >>> m >>> _______________________________________________ >>> Annotation mailing list >>> Annotation at geneontology.org >>> http://fafner.stanford.edu/mailman/listinfo/annotation >>> >>> >> -------------- next part -------------- An HTML attachment was scrubbed... URL: From alanruttenberg at gmail.com Tue Sep 29 11:45:59 2009 From: alanruttenberg at gmail.com (Alan Ruttenberg) Date: Tue, 29 Sep 2009 14:45:59 -0400 Subject: [Ontology-editors] [Annotation] merging MF and BP terms In-Reply-To: References: <29af5e2d0909291052v3b9821ele28f6229ac2b7232@mail.gmail.com> Message-ID: <29af5e2d0909291145p1e2af1d4q2e94a2a49103a130@mail.gmail.com> On Tue, Sep 29, 2009 at 2:38 PM, Judith Blake wrote: > > Functions are occurrants. A function describes an action, biochemical, > structural, whatever. > A function doesn't describe anything. Information artifact describe things. Sorry to be picky about words. It's my job. If you look at the papers on BFO you will see that functions are continuants and occurrents are disjoint from continuants. So functions are not occurrents. If the GO is using terminology at odds with BFO that needs to be fixed - we agreed at the last foundry meeting that we were all going to use a common upper level ontology. > > Functions are part of processes. > Function*ings* (i.e. realizations of functions) are part of processes. > > A variety of structures, encoded in a molecule, have the potential to > realize a function. > Type error: processes don't have potential. (concluded by substitution "potential to realize a function" + function is_a process => "potential to realize a process". The error is that it is the functions that are the potentials. Potentials are realized. The realizations are processes. We discussed this very recently with Michael and he agreed. > That is a property of the molecule, not of the function. > What is the reference of "That"? > A molecule can have encoded the potential of engaging in several > functions. > Absolutely. (as long as you rewrite "functions" -> "functionings" (i.e. processes).) A molecule can bear multiple functions is the way we would say this. The confusion I think is that of the relationship between the gene product > and a molecular function, and the molecular function and the biological > process. > And what do you say these are. I (and BFO) are clear: The relation between a molecular function and a gene product is "inheres". The relation between molecular function and a biological process is "realized_by". -Alan > > Judy > > > > > On 9/29/09 1:52 PM, "Alan Ruttenberg" wrote: > > The problem is that this still confuses function with process. Functions > are the things that the enzymes (using the term broadly) have that make the > processes happen. If one goes into more detail, there are a variety of > participants in the process (substrates, products, "cofactors"), but only > one bearer of the function in a particular process. > > Every function has a corresponding process, its "realization". > > Functions are not parts of processes. The realizations of functions are. > > The functions have a physical basis in the structure of their bearers. > > The functions exist before any process happens. A molecule may have a > function but never actually realize it. > > These distinctions have important consequences as people such as myself > work on adding more detail to the GO. The processes get more detail by > adding *participants*. Within those processes, when there is a molecule like > an enzyme that functions in a particular way, it needs to be distinguished > from the other participants in some way. The way that it is distinguished is > by saying it bears a function. > > The heuristic that a molecular function is a "single step" process is > misused, IMO. That it is a "single step" derives from the fact that in such > descriptions there is a single molecule that has a function. The step is the > process that surrounds execution of that function. > > As I said, getting this distinction clear is essential for future detailing > of GO processes such as I and others do. Can we start to get this right here > and now? > > -Alan > > > > On Tue, Sep 29, 2009 at 10:30 AM, Midori Harris wrote: > > Hi, > > We have two SF items in which merging function and process terms looks like > a viable solution to a problem. One is an overly-specific, single-step > "process" that is essentially equivalent to a "function" (in the GO sense); > the other is a "function" term that is defined such that it doesn't describe > a single activity, and is essentially the same as a process term. > > SF 2864212 - merge MF term > RNA splicing factor activity, transesterification mechanism GO:0031202 > > into BP term > RNA splicing, via transesterification reactions GO:0000375 > > For this one, we would merge the MF term into the BP term, and ensure that > the merged term has is_a ancestry in the BP graph. I think that's all we > would change. > > > https://sourceforge.net/tracker/?func=detail&aid=2864212&group_id=36855&atid=440764 > > > SF 2864271 - merge BP term > > conversion of met-tRNAf to fmet-tRNA GO:0001718 > > into MF term > methionyl-tRNA formyltransferase activity GO:0004479 > > > https://sourceforge.net/tracker/?func=detail&aid=2864271&group_id=36855&atid=440764 > > In this case, the proposal is to merge the process term into the function > term, ensuring that the merged term has is_a ancestry only in the MF > ontology,and has a part_of link to a process term. Specifically, the merged > term would retain the name and ID from GO:0004479, and would be part_of > translational initiation GO:0006413. > > Does anyone have any objections or other comments on these? > > thanks, > m > _______________________________________________ > Annotation mailing list > Annotation at geneontology.org > http://fafner.stanford.edu/mailman/listinfo/annotation > > > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From cherry at stanford.edu Tue Sep 29 12:55:59 2009 From: cherry at stanford.edu (cherry at stanford.edu) Date: Tue, 29 Sep 2009 12:55:59 -0700 (PDT) Subject: [Ontology-editors] Gene Ontology Email lists unavailable Oct 2nd Message-ID: <200909291955.n8TJtxN6025500@fafner.Stanford.EDU> The mailserver that provides the Gene Ontology mailing lists will be down this weekend. All lists and the web site for the mailing lists will be unavailable from noon (Pacific) October 2nd and back before noon (Pacific) October 4rd. You are subscribed to the one of the GOC lists. I apologize for the trouble if you receive this message more than once. All subscriptions to all lists are being sent this message. If you happen to send an email during the down time you do not need to send the message again. It is normal for mailservers to keep trying for five days. Your message will simply be delayed. The main GO web sites and tools are not affected and will be available. Cheers, Mike Cherry From kchris at genome.stanford.edu Tue Sep 29 15:11:20 2009 From: kchris at genome.stanford.edu (Karen Christie) Date: Tue, 29 Sep 2009 15:11:20 -0700 (PDT) Subject: [Ontology-editors] [Annotation] merging MF and BP terms In-Reply-To: <29af5e2d0909291139n125fe58bpfdc59b449affa481@mail.gmail.com> References: <29af5e2d0909291052v3b9821ele28f6229ac2b7232@mail.gmail.com> <29af5e2d0909291139n125fe58bpfdc59b449affa481@mail.gmail.com> Message-ID: These two function terms were created with the intent to represent the two catalytic activites that occur in large spliceosomal complexes. Based on the current understanding, I would ascribe these functions to a complex as a whole. There is not yet evidence that would allow ascribing either catalytic activity to a smaller portion of either relevant complex and they're certainly not catalyzed by a single gene product. We have recently revised the component ontology and now also represent the two specific complexes that carry out each of these two catalytic activities, these two terms and their child terms: - catalytic step 1 spliceosome - catalytic step 2 spliceosome In process, we have these terms: - generation of catalytic spliceosome for first transesterification step - generation of catalytic spliceosome for second transesterification step So, I think we are doing this (splicing) right (finally). In process, we represent the big picture, including some specific stages like generation of particular complexes in the cycle. In complex, we represent the various identifiable complexes that researchers recognize. In function, we represent only the two catalytic activities. As named/phrased, the existing function grouping term directly above the two specific function terms (first/second spliceosomal transesterification activity) basically just means "be involved with splicing", which I agree is equivalent to the existing process term. So, I agree that the current function term (GO:0031202) should be merged into the corresponding process term, as proposed. If we do need a grouping term in the function ontology for these two function terms, it should be a generalized function, something like "transesterification activity", not something process specific as the current one is. -Karen P.S. In terms of proceeding with this, we'll need to make sure we give annotation groups an appropriate heads up since there will no function term that has an equivalent meaning where you could do a direct replace. The two child terms of GO:0031202 are more specific. Thus reannotation will require reading for each gene to determine whether to annotate to either of these two specific terms or to the root node for function. I think it would be appropriate to have a longer lead time than two weeks to allow groups to work on this before essentially removing the term from function. SGD has about 60 genes annotated to this term, excluding computational annotations. On Tue, 29 Sep 2009, Alan Ruttenberg wrote: > On Tue, Sep 29, 2009 at 2:18 PM, Karen Christie > wrote: > >> I'd like to know you deal with multisubunit enzymes, where no one >> particular gene product has the catalytic activity. >> > > I think that the function is then ascribed to either the complex as a whole, > or to the part of the complex that is responsible for the activity. > > The schema is: > > Complex > has_part protein 1 > has_part protein 2 > ... > > Complex bears function (reverse function inheres in complex) > > function realized in process > > which implies: > > Complex participates in process. > > In addition, typically there will be other participants in the process. For > example the mRNA that is being spliced. > > If we are at the point where we need to argue about which part of the > complex bears the function then we are in good shape. > > -Alan > > > >> -Karen >> >> >> >> On Tue, 29 Sep 2009, Alan Ruttenberg wrote: >> >> The problem is that this still confuses function with process. Functions >>> are >>> the things that the enzymes (using the term broadly) have that make the >>> processes happen. If one goes into more detail, there are a variety of >>> participants in the process (substrates, products, "cofactors"), but only >>> one bearer of the function in a particular process. >>> Every function has a corresponding process, its "realization". >>> >>> Functions are not parts of processes. The realizations of functions are. >>> >>> The functions have a physical basis in the structure of their bearers. >>> >>> The functions exist before any process happens. A molecule may have a >>> function but never actually realize it. >>> >>> These distinctions have important consequences as people such as myself >>> work >>> on adding more detail to the GO. The processes get more detail by adding >>> *participants*. Within those processes, when there is a molecule like an >>> enzyme that functions in a particular way, it needs to be distinguished >>> from >>> the other participants in some way. The way that it is distinguished is by >>> saying it bears a function. >>> >>> The heuristic that a molecular function is a "single step" process is >>> misused, IMO. That it is a "single step" derives from the fact that in >>> such >>> descriptions there is a single molecule that has a function. The step is >>> the >>> process that surrounds execution of that function. >>> >>> As I said, getting this distinction clear is essential for future >>> detailing >>> of GO processes such as I and others do. Can we start to get this right >>> here >>> and now? >>> >>> -Alan >>> >>> >>> >>> On Tue, Sep 29, 2009 at 10:30 AM, Midori Harris wrote: >>> >>> Hi, >>>> >>>> We have two SF items in which merging function and process terms looks >>>> like >>>> a viable solution to a problem. One is an overly-specific, single-step >>>> "process" that is essentially equivalent to a "function" (in the GO >>>> sense); >>>> the other is a "function" term that is defined such that it doesn't >>>> describe >>>> a single activity, and is essentially the same as a process term. >>>> >>>> SF 2864212 - merge MF term >>>> RNA splicing factor activity, transesterification mechanism GO:0031202 >>>> >>>> into BP term >>>> RNA splicing, via transesterification reactions GO:0000375 >>>> >>>> For this one, we would merge the MF term into the BP term, and ensure >>>> that >>>> the merged term has is_a ancestry in the BP graph. I think that's all we >>>> would change. >>>> >>>> >>>> >>>> https://sourceforge.net/tracker/?func=detail&aid=2864212&group_id=36855&atid=440764 >>>> >>>> >>>> SF 2864271 - merge BP term >>>> >>>> conversion of met-tRNAf to fmet-tRNA GO:0001718 >>>> >>>> into MF term >>>> methionyl-tRNA formyltransferase activity GO:0004479 >>>> >>>> >>>> >>>> https://sourceforge.net/tracker/?func=detail&aid=2864271&group_id=36855&atid=440764 >>>> >>>> In this case, the proposal is to merge the process term into the function >>>> term, ensuring that the merged term has is_a ancestry only in the MF >>>> ontology,and has a part_of link to a process term. Specifically, the >>>> merged >>>> term would retain the name and ID from GO:0004479, and would be part_of >>>> translational initiation GO:0006413. >>>> >>>> Does anyone have any objections or other comments on these? >>>> >>>> thanks, >>>> m >>>> _______________________________________________ >>>> Annotation mailing list >>>> Annotation at geneontology.org >>>> http://fafner.stanford.edu/mailman/listinfo/annotation >>>> >>>> >>> > From midori at ebi.ac.uk Wed Sep 30 04:04:50 2009 From: midori at ebi.ac.uk (Midori Harris) Date: Wed, 30 Sep 2009 12:04:50 +0100 (BST) Subject: [Ontology-editors] [Annotation] merging MF and BP terms In-Reply-To: References: <29af5e2d0909291052v3b9821ele28f6229ac2b7232@mail.gmail.com> <29af5e2d0909291139n125fe58bpfdc59b449affa481@mail.gmail.com> Message-ID: How long will you need? Especially if it's more than the usual two weeks, I'd like to get an alert out soon. m On Tue, 29 Sep 2009, Karen Christie wrote: > These two function terms were created with the intent to represent the two > catalytic activites that occur in large spliceosomal complexes. Based on the > current understanding, I would ascribe these functions to a complex as a > whole. There is not yet evidence that would allow ascribing either catalytic > activity to a smaller portion of either relevant complex and they're > certainly not catalyzed by a single gene product. > > We have recently revised the component ontology and now also represent the > two specific complexes that carry out each of these two catalytic activities, > these two terms and their child terms: > - catalytic step 1 spliceosome > - catalytic step 2 spliceosome > > In process, we have these terms: > - generation of catalytic spliceosome for first transesterification step > - generation of catalytic spliceosome for second transesterification step > > So, I think we are doing this (splicing) right (finally). In process, we > represent the big picture, including some specific stages like generation of > particular complexes in the cycle. In complex, we represent the various > identifiable complexes that researchers recognize. In function, we represent > only the two catalytic activities. > > As named/phrased, the existing function grouping term directly above the two > specific function terms (first/second spliceosomal transesterification > activity) basically just means "be involved with splicing", which I agree is > equivalent to the existing process term. So, I agree that the current > function term (GO:0031202) should be merged into the corresponding process > term, as proposed. > > If we do need a grouping term in the function ontology for these two function > terms, it should be a generalized function, something like > "transesterification activity", not something process specific as the current > one is. > > -Karen > > P.S. In terms of proceeding with this, we'll need to make sure we give > annotation groups an appropriate heads up since there will no function term > that has an equivalent meaning where you could do a direct replace. The two > child terms of GO:0031202 are more specific. Thus reannotation will require > reading for each gene to determine whether to annotate to either of these two > specific terms or to the root node for function. I think it would be > appropriate to have a longer lead time than two weeks to allow groups to work > on this before essentially removing the term from function. SGD has about 60 > genes annotated to this term, excluding computational annotations. > > > > On Tue, 29 Sep 2009, Alan Ruttenberg wrote: > >> On Tue, Sep 29, 2009 at 2:18 PM, Karen Christie >> wrote: >> >>> I'd like to know you deal with multisubunit enzymes, where no one >>> particular gene product has the catalytic activity. >>> >> >> I think that the function is then ascribed to either the complex as a >> whole, >> or to the part of the complex that is responsible for the activity. >> >> The schema is: >> >> Complex >> has_part protein 1 >> has_part protein 2 >> ... >> >> Complex bears function (reverse function inheres in complex) >> >> function realized in process >> >> which implies: >> >> Complex participates in process. >> >> In addition, typically there will be other participants in the process. For >> example the mRNA that is being spliced. >> >> If we are at the point where we need to argue about which part of the >> complex bears the function then we are in good shape. >> >> -Alan >> >> >> >>> -Karen >>> >>> >>> >>> On Tue, 29 Sep 2009, Alan Ruttenberg wrote: >>> >>> The problem is that this still confuses function with process. Functions >>>> are >>>> the things that the enzymes (using the term broadly) have that make the >>>> processes happen. If one goes into more detail, there are a variety of >>>> participants in the process (substrates, products, "cofactors"), but only >>>> one bearer of the function in a particular process. >>>> Every function has a corresponding process, its "realization". >>>> >>>> Functions are not parts of processes. The realizations of functions are. >>>> >>>> The functions have a physical basis in the structure of their bearers. >>>> >>>> The functions exist before any process happens. A molecule may have a >>>> function but never actually realize it. >>>> >>>> These distinctions have important consequences as people such as myself >>>> work >>>> on adding more detail to the GO. The processes get more detail by adding >>>> *participants*. Within those processes, when there is a molecule like an >>>> enzyme that functions in a particular way, it needs to be distinguished >>>> from >>>> the other participants in some way. The way that it is distinguished is >>>> by >>>> saying it bears a function. >>>> >>>> The heuristic that a molecular function is a "single step" process is >>>> misused, IMO. That it is a "single step" derives from the fact that in >>>> such >>>> descriptions there is a single molecule that has a function. The step is >>>> the >>>> process that surrounds execution of that function. >>>> >>>> As I said, getting this distinction clear is essential for future >>>> detailing >>>> of GO processes such as I and others do. Can we start to get this right >>>> here >>>> and now? >>>> >>>> -Alan >>>> >>>> >>>> >>>> On Tue, Sep 29, 2009 at 10:30 AM, Midori Harris wrote: >>>> >>>> Hi, >>>>> >>>>> We have two SF items in which merging function and process terms looks >>>>> like >>>>> a viable solution to a problem. One is an overly-specific, single-step >>>>> "process" that is essentially equivalent to a "function" (in the GO >>>>> sense); >>>>> the other is a "function" term that is defined such that it doesn't >>>>> describe >>>>> a single activity, and is essentially the same as a process term. >>>>> >>>>> SF 2864212 - merge MF term >>>>> RNA splicing factor activity, transesterification mechanism GO:0031202 >>>>> >>>>> into BP term >>>>> RNA splicing, via transesterification reactions GO:0000375 >>>>> >>>>> For this one, we would merge the MF term into the BP term, and ensure >>>>> that >>>>> the merged term has is_a ancestry in the BP graph. I think that's all we >>>>> would change. >>>>> >>>>> >>>>> >>>>> https://sourceforge.net/tracker/?func=detail&aid=2864212&group_id=36855&atid=440764 >>>>> >>>>> >>>>> SF 2864271 - merge BP term >>>>> >>>>> conversion of met-tRNAf to fmet-tRNA GO:0001718 >>>>> >>>>> into MF term >>>>> methionyl-tRNA formyltransferase activity GO:0004479 >>>>> >>>>> >>>>> >>>>> https://sourceforge.net/tracker/?func=detail&aid=2864271&group_id=36855&atid=440764 >>>>> >>>>> In this case, the proposal is to merge the process term into the >>>>> function >>>>> term, ensuring that the merged term has is_a ancestry only in the MF >>>>> ontology,and has a part_of link to a process term. Specifically, the >>>>> merged >>>>> term would retain the name and ID from GO:0004479, and would be part_of >>>>> translational initiation GO:0006413. >>>>> >>>>> Does anyone have any objections or other comments on these? >>>>> >>>>> thanks, >>>>> m >>>>> _______________________________________________ >>>>> Annotation mailing list >>>>> Annotation at geneontology.org >>>>> http://fafner.stanford.edu/mailman/listinfo/annotation >>>>> >>>>> >>>> >> > From Judith.Blake at jax.org Wed Sep 30 04:10:59 2009 From: Judith.Blake at jax.org (Judith Blake) Date: Wed, 30 Sep 2009 07:10:59 -0400 Subject: [Ontology-editors] [Annotation] merging MF and BP terms In-Reply-To: Message-ID: Hi Karen, all, I want to remind/inform all that the Protein Ontology (PRO) is now funded to represent explicit complexes that detail exact proteins that make up the complex. These could be annotated to a specific function term. The objective too is to have a relationship between the specific complex represented in PRO and the complex class represented in GO. Ultimately, the site of specific complex activity will need to be included in the terminology term I think (xref to cell type or tissue?). We would want to curate the specific complex. The example below is from Serneels, et.al., gamma-Secretase heterogeneity in the Aph1 subunit: relevance for Alzheimer's disease. Science. 2009 May 1;324(5927):639-42 The gamma-secretase complex plays a role in Alzheimer's disease and cancer progression. The development of clinically useful inhibitors, however, is complicated by the role of the gamma-secretase complex in regulated intramembrane proteolysis of Notch and other essential proteins. Different gamma-secretase complexes containing different Presenilin or Aph1 protein subunits are present in various tissues. Here we show that these complexes have heterogeneous biochemical and physiological properties. Specific inactivation of the Aph1B gamma-secretase in a mouse Alzheimer's disease model led to improvements of Alzheimer's disease-relevant phenotypic features without any Notch-related side effects. The Aph1B complex contributes to total gamma-secretase activity in the human brain, and thus specific targeting of Aph1B-containing gamma-secretase complexes may help generate less toxic therapies for Alzheimer's disease. Judy On 9/29/09 6:11 PM, "Karen Christie" wrote: These two function terms were created with the intent to represent the two catalytic activites that occur in large spliceosomal complexes. Based on the current understanding, I would ascribe these functions to a complex as a whole. There is not yet evidence that would allow ascribing either catalytic activity to a smaller portion of either relevant complex and they're certainly not catalyzed by a single gene product. We have recently revised the component ontology and now also represent the two specific complexes that carry out each of these two catalytic activities, these two terms and their child terms: - catalytic step 1 spliceosome - catalytic step 2 spliceosome In process, we have these terms: - generation of catalytic spliceosome for first transesterification step - generation of catalytic spliceosome for second transesterification step So, I think we are doing this (splicing) right (finally). In process, we represent the big picture, including some specific stages like generation of particular complexes in the cycle. In complex, we represent the various identifiable complexes that researchers recognize. In function, we represent only the two catalytic activities. As named/phrased, the existing function grouping term directly above the two specific function terms (first/second spliceosomal transesterification activity) basically just means "be involved with splicing", which I agree is equivalent to the existing process term. So, I agree that the current function term (GO:0031202) should be merged into the corresponding process term, as proposed. If we do need a grouping term in the function ontology for these two function terms, it should be a generalized function, something like "transesterification activity", not something process specific as the current one is. -Karen P.S. In terms of proceeding with this, we'll need to make sure we give annotation groups an appropriate heads up since there will no function term that has an equivalent meaning where you could do a direct replace. The two child terms of GO:0031202 are more specific. Thus reannotation will require reading for each gene to determine whether to annotate to either of these two specific terms or to the root node for function. I think it would be appropriate to have a longer lead time than two weeks to allow groups to work on this before essentially removing the term from function. SGD has about 60 genes annotated to this term, excluding computational annotations. On Tue, 29 Sep 2009, Alan Ruttenberg wrote: > On Tue, Sep 29, 2009 at 2:18 PM, Karen Christie > wrote: > >> I'd like to know you deal with multisubunit enzymes, where no one >> particular gene product has the catalytic activity. >> > > I think that the function is then ascribed to either the complex as a whole, > or to the part of the complex that is responsible for the activity. > > The schema is: > > Complex > has_part protein 1 > has_part protein 2 > ... > > Complex bears function (reverse function inheres in complex) > > function realized in process > > which implies: > > Complex participates in process. > > In addition, typically there will be other participants in the process. For > example the mRNA that is being spliced. > > If we are at the point where we need to argue about which part of the > complex bears the function then we are in good shape. > > -Alan > > > >> -Karen >> >> >> >> On Tue, 29 Sep 2009, Alan Ruttenberg wrote: >> >> The problem is that this still confuses function with process. Functions >>> are >>> the things that the enzymes (using the term broadly) have that make the >>> processes happen. If one goes into more detail, there are a variety of >>> participants in the process (substrates, products, "cofactors"), but only >>> one bearer of the function in a particular process. >>> Every function has a corresponding process, its "realization". >>> >>> Functions are not parts of processes. The realizations of functions are. >>> >>> The functions have a physical basis in the structure of their bearers. >>> >>> The functions exist before any process happens. A molecule may have a >>> function but never actually realize it. >>> >>> These distinctions have important consequences as people such as myself >>> work >>> on adding more detail to the GO. The processes get more detail by adding >>> *participants*. Within those processes, when there is a molecule like an >>> enzyme that functions in a particular way, it needs to be distinguished >>> from >>> the other participants in some way. The way that it is distinguished is by >>> saying it bears a function. >>> >>> The heuristic that a molecular function is a "single step" process is >>> misused, IMO. That it is a "single step" derives from the fact that in >>> such >>> descriptions there is a single molecule that has a function. The step is >>> the >>> process that surrounds execution of that function. >>> >>> As I said, getting this distinction clear is essential for future >>> detailing >>> of GO processes such as I and others do. Can we start to get this right >>> here >>> and now? >>> >>> -Alan >>> >>> >>> >>> On Tue, Sep 29, 2009 at 10:30 AM, Midori Harris wrote: >>> >>> Hi, >>>> >>>> We have two SF items in which merging function and process terms looks >>>> like >>>> a viable solution to a problem. One is an overly-specific, single-step >>>> "process" that is essentially equivalent to a "function" (in the GO >>>> sense); >>>> the other is a "function" term that is defined such that it doesn't >>>> describe >>>> a single activity, and is essentially the same as a process term. >>>> >>>> SF 2864212 - merge MF term >>>> RNA splicing factor activity, transesterification mechanism GO:0031202 >>>> >>>> into BP term >>>> RNA splicing, via transesterification reactions GO:0000375 >>>> >>>> For this one, we would merge the MF term into the BP term, and ensure >>>> that >>>> the merged term has is_a ancestry in the BP graph. I think that's all we >>>> would change. >>>> >>>> >>>> >>>> https://sourceforge.net/tracker/?func=detail=2864212_id=36855=440764 >>>> >>>> >>>> SF 2864271 - merge BP term >>>> >>>> conversion of met-tRNAf to fmet-tRNA GO:0001718 >>>> >>>> into MF term >>>> methionyl-tRNA formyltransferase activity GO:0004479 >>>> >>>> >>>> >>>> https://sourceforge.net/tracker/?func=detail=2864271_id=36855=440764 >>>> >>>> In this case, the proposal is to merge the process term into the function >>>> term, ensuring that the merged term has is_a ancestry only in the MF >>>> ontology,and has a part_of link to a process term. Specifically, the >>>> merged >>>> term would retain the name and ID from GO:0004479, and would be part_of >>>> translational initiation GO:0006413. >>>> >>>> Does anyone have any objections or other comments on these? >>>> >>>> thanks, >>>> m >>>> _______________________________________________ >>>> Annotation mailing list >>>> Annotation at geneontology.org >>>> http://fafner.stanford.edu/mailman/listinfo/annotation >>>> >>>> >>> > _______________________________________________ Annotation mailing list Annotation at geneontology.org http://fafner.stanford.edu/mailman/listinfo/annotation -------------- next part -------------- An HTML attachment was scrubbed... URL: From kchris at genome.stanford.edu Wed Sep 30 13:36:10 2009 From: kchris at genome.stanford.edu (Karen Christie) Date: Wed, 30 Sep 2009 13:36:10 -0700 (PDT) Subject: [Ontology-editors] [Annotation] merging MF and BP terms In-Reply-To: References: <29af5e2d0909291052v3b9821ele28f6229ac2b7232@mail.gmail.com> <29af5e2d0909291139n125fe58bpfdc59b449affa481@mail.gmail.com> Message-ID: Hi Midori, I checked with our GO annotation team and they think two weeks is enough, but are starting now rather than waiting till it goes through to avoid having no function annotations for those genes once this term is gone from Function. I know the standard email states when there are annotations. Perhaps it could also explicitly state when there is no comparable replacement term for annotation. thanks for considering annotation needs in this ontology change! -Karen On Wed, 30 Sep 2009, Midori Harris wrote: > How long will you need? Especially if it's more than the usual two weeks, > I'd like to get an alert out soon. > > m > > On Tue, 29 Sep 2009, Karen Christie wrote: > >> These two function terms were created with the intent to represent the two >> catalytic activites that occur in large spliceosomal complexes. Based on >> the current understanding, I would ascribe these functions to a complex as >> a whole. There is not yet evidence that would allow ascribing either >> catalytic activity to a smaller portion of either relevant complex and >> they're certainly not catalyzed by a single gene product. >> >> We have recently revised the component ontology and now also represent the >> two specific complexes that carry out each of these two catalytic >> activities, these two terms and their child terms: >> - catalytic step 1 spliceosome >> - catalytic step 2 spliceosome >> >> In process, we have these terms: >> - generation of catalytic spliceosome for first transesterification step >> - generation of catalytic spliceosome for second transesterification step >> >> So, I think we are doing this (splicing) right (finally). In process, we >> represent the big picture, including some specific stages like generation >> of particular complexes in the cycle. In complex, we represent the various >> identifiable complexes that researchers recognize. In function, we >> represent only the two catalytic activities. >> >> As named/phrased, the existing function grouping term directly above the >> two specific function terms (first/second spliceosomal transesterification >> activity) basically just means "be involved with splicing", which I agree >> is equivalent to the existing process term. So, I agree that the current >> function term (GO:0031202) should be merged into the corresponding process >> term, as proposed. >> >> If we do need a grouping term in the function ontology for these two >> function terms, it should be a generalized function, something like >> "transesterification activity", not something process specific as the >> current one is. >> >> -Karen >> >> P.S. In terms of proceeding with this, we'll need to make sure we give >> annotation groups an appropriate heads up since there will no function term >> that has an equivalent meaning where you could do a direct replace. The two >> child terms of GO:0031202 are more specific. Thus reannotation will require >> reading for each gene to determine whether to annotate to either of these >> two specific terms or to the root node for function. I think it would be >> appropriate to have a longer lead time than two weeks to allow groups to >> work on this before essentially removing the term from function. SGD has >> about 60 genes annotated to this term, excluding computational annotations. >> >> >> >> On Tue, 29 Sep 2009, Alan Ruttenberg wrote: >> >>> On Tue, Sep 29, 2009 at 2:18 PM, Karen Christie >>> wrote: >>> >>>> I'd like to know you deal with multisubunit enzymes, where no one >>>> particular gene product has the catalytic activity. >>>> >>> >>> I think that the function is then ascribed to either the complex as a >>> whole, >>> or to the part of the complex that is responsible for the activity. >>> >>> The schema is: >>> >>> Complex >>> has_part protein 1 >>> has_part protein 2 >>> ... >>> >>> Complex bears function (reverse function inheres in complex) >>> >>> function realized in process >>> >>> which implies: >>> >>> Complex participates in process. >>> >>> In addition, typically there will be other participants in the process. >>> For >>> example the mRNA that is being spliced. >>> >>> If we are at the point where we need to argue about which part of the >>> complex bears the function then we are in good shape. >>> >>> -Alan >>> >>> >>> >>>> -Karen >>>> >>>> >>>> >>>> On Tue, 29 Sep 2009, Alan Ruttenberg wrote: >>>> >>>> The problem is that this still confuses function with process. Functions >>>>> are >>>>> the things that the enzymes (using the term broadly) have that make the >>>>> processes happen. If one goes into more detail, there are a variety of >>>>> participants in the process (substrates, products, "cofactors"), but >>>>> only >>>>> one bearer of the function in a particular process. >>>>> Every function has a corresponding process, its "realization". >>>>> >>>>> Functions are not parts of processes. The realizations of functions are. >>>>> >>>>> The functions have a physical basis in the structure of their bearers. >>>>> >>>>> The functions exist before any process happens. A molecule may have a >>>>> function but never actually realize it. >>>>> >>>>> These distinctions have important consequences as people such as myself >>>>> work >>>>> on adding more detail to the GO. The processes get more detail by adding >>>>> *participants*. Within those processes, when there is a molecule like an >>>>> enzyme that functions in a particular way, it needs to be distinguished >>>>> from >>>>> the other participants in some way. The way that it is distinguished is >>>>> by >>>>> saying it bears a function. >>>>> >>>>> The heuristic that a molecular function is a "single step" process is >>>>> misused, IMO. That it is a "single step" derives from the fact that in >>>>> such >>>>> descriptions there is a single molecule that has a function. The step is >>>>> the >>>>> process that surrounds execution of that function. >>>>> >>>>> As I said, getting this distinction clear is essential for future >>>>> detailing >>>>> of GO processes such as I and others do. Can we start to get this right >>>>> here >>>>> and now? >>>>> >>>>> -Alan >>>>> >>>>> >>>>> >>>>> On Tue, Sep 29, 2009 at 10:30 AM, Midori Harris >>>>> wrote: >>>>> >>>>> Hi, >>>>>> >>>>>> We have two SF items in which merging function and process terms looks >>>>>> like >>>>>> a viable solution to a problem. One is an overly-specific, single-step >>>>>> "process" that is essentially equivalent to a "function" (in the GO >>>>>> sense); >>>>>> the other is a "function" term that is defined such that it doesn't >>>>>> describe >>>>>> a single activity, and is essentially the same as a process term. >>>>>> >>>>>> SF 2864212 - merge MF term >>>>>> RNA splicing factor activity, transesterification mechanism GO:0031202 >>>>>> >>>>>> into BP term >>>>>> RNA splicing, via transesterification reactions GO:0000375 >>>>>> >>>>>> For this one, we would merge the MF term into the BP term, and ensure >>>>>> that >>>>>> the merged term has is_a ancestry in the BP graph. I think that's all >>>>>> we >>>>>> would change. >>>>>> >>>>>> >>>>>> >>>>>> https://sourceforge.net/tracker/?func=detail&aid=2864212&group_id=36855&atid=440764 >>>>>> >>>>>> >>>>>> SF 2864271 - merge BP term >>>>>> >>>>>> conversion of met-tRNAf to fmet-tRNA GO:0001718 >>>>>> >>>>>> into MF term >>>>>> methionyl-tRNA formyltransferase activity GO:0004479 >>>>>> >>>>>> >>>>>> >>>>>> https://sourceforge.net/tracker/?func=detail&aid=2864271&group_id=36855&atid=440764 >>>>>> >>>>>> In this case, the proposal is to merge the process term into the >>>>>> function >>>>>> term, ensuring that the merged term has is_a ancestry only in the MF >>>>>> ontology,and has a part_of link to a process term. Specifically, the >>>>>> merged >>>>>> term would retain the name and ID from GO:0004479, and would be part_of >>>>>> translational initiation GO:0006413. >>>>>> >>>>>> Does anyone have any objections or other comments on these? >>>>>> >>>>>> thanks, >>>>>> m >>>>>> _______________________________________________ >>>>>> Annotation mailing list >>>>>> Annotation at geneontology.org >>>>>> http://fafner.stanford.edu/mailman/listinfo/annotation >>>>>> >>>>>> >>>>> >>> >> > From hjd at informatics.jax.org Wed Sep 30 14:55:54 2009 From: hjd at informatics.jax.org (Harold Drabkin) Date: Wed, 30 Sep 2009 17:55:54 -0400 Subject: [Ontology-editors] [Annotation] merging MF and BP terms In-Reply-To: References: <29af5e2d0909291052v3b9821ele28f6229ac2b7232@mail.gmail.com> <29af5e2d0909291139n125fe58bpfdc59b449affa481@mail.gmail.com> Message-ID: <4AC3D3EA.9020202@informatics.jax.org> Karen Christie wrote: > Hi Midori, > > I checked with our GO annotation team and they think two weeks is > enough, but are starting now rather than waiting till it goes through > to avoid having no function annotations for those genes once this term > is gone from Function. > > I know the standard email states when there are annotations. Perhaps > it could also explicitly state when there is no comparable replacement > term for annotation. This ^ is a good idea to save time! > > thanks for considering annotation needs in this ontology change! > > -Karen > > > > On Wed, 30 Sep 2009, Midori Harris wrote: > >> How long will you need? Especially if it's more than the usual two >> weeks, >> I'd like to get an alert out soon. >> >> m >> >> On Tue, 29 Sep 2009, Karen Christie wrote: >> >>> These two function terms were created with the intent to represent >>> the two catalytic activites that occur in large spliceosomal >>> complexes. Based on the current understanding, I would ascribe these >>> functions to a complex as a whole. There is not yet evidence that >>> would allow ascribing either catalytic activity to a smaller portion >>> of either relevant complex and they're certainly not catalyzed by a >>> single gene product. >>> >>> We have recently revised the component ontology and now also >>> represent the two specific complexes that carry out each of these >>> two catalytic activities, these two terms and their child terms: >>> - catalytic step 1 spliceosome >>> - catalytic step 2 spliceosome >>> >>> In process, we have these terms: >>> - generation of catalytic spliceosome for first transesterification >>> step >>> - generation of catalytic spliceosome for second transesterification >>> step >>> >>> So, I think we are doing this (splicing) right (finally). In >>> process, we represent the big picture, including some specific >>> stages like generation of particular complexes in the cycle. In >>> complex, we represent the various identifiable complexes that >>> researchers recognize. In function, we represent only the two >>> catalytic activities. >>> >>> As named/phrased, the existing function grouping term directly above >>> the two specific function terms (first/second spliceosomal >>> transesterification activity) basically just means "be involved with >>> splicing", which I agree is equivalent to the existing process term. >>> So, I agree that the current function term (GO:0031202) should be >>> merged into the corresponding process term, as proposed. >>> >>> If we do need a grouping term in the function ontology for these two >>> function terms, it should be a generalized function, something like >>> "transesterification activity", not something process specific as >>> the current one is. >>> >>> -Karen >>> >>> P.S. In terms of proceeding with this, we'll need to make sure we >>> give annotation groups an appropriate heads up since there will no >>> function term that has an equivalent meaning where you could do a >>> direct replace. The two child terms of GO:0031202 are more specific. >>> Thus reannotation will require reading for each gene to determine >>> whether to annotate to either of these two specific terms or to the >>> root node for function. I think it would be appropriate to have a >>> longer lead time than two weeks to allow groups to work on this >>> before essentially removing the term from function. SGD has about 60 >>> genes annotated to this term, excluding computational annotations. >>> >>> >>> >>> On Tue, 29 Sep 2009, Alan Ruttenberg wrote: >>> >>>> On Tue, Sep 29, 2009 at 2:18 PM, Karen Christie >>>> wrote: >>>> >>>>> I'd like to know you deal with multisubunit enzymes, where no one >>>>> particular gene product has the catalytic activity. >>>>> >>>> >>>> I think that the function is then ascribed to either the complex as >>>> a whole, >>>> or to the part of the complex that is responsible for the activity. >>>> >>>> The schema is: >>>> >>>> Complex >>>> has_part protein 1 >>>> has_part protein 2 >>>> ... >>>> >>>> Complex bears function (reverse function inheres in complex) >>>> >>>> function realized in process >>>> >>>> which implies: >>>> >>>> Complex participates in process. >>>> >>>> In addition, typically there will be other participants in the >>>> process. For >>>> example the mRNA that is being spliced. >>>> >>>> If we are at the point where we need to argue about which part of the >>>> complex bears the function then we are in good shape. >>>> >>>> -Alan >>>> >>>> >>>> >>>>> -Karen >>>>> >>>>> >>>>> >>>>> On Tue, 29 Sep 2009, Alan Ruttenberg wrote: >>>>> >>>>> The problem is that this still confuses function with process. >>>>> Functions >>>>>> are >>>>>> the things that the enzymes (using the term broadly) have that >>>>>> make the >>>>>> processes happen. If one goes into more detail, there are a >>>>>> variety of >>>>>> participants in the process (substrates, products, "cofactors"), >>>>>> but only >>>>>> one bearer of the function in a particular process. >>>>>> Every function has a corresponding process, its "realization". >>>>>> >>>>>> Functions are not parts of processes. The realizations of >>>>>> functions are. >>>>>> >>>>>> The functions have a physical basis in the structure of their >>>>>> bearers. >>>>>> >>>>>> The functions exist before any process happens. A molecule may >>>>>> have a >>>>>> function but never actually realize it. >>>>>> >>>>>> These distinctions have important consequences as people such as >>>>>> myself >>>>>> work >>>>>> on adding more detail to the GO. The processes get more detail by >>>>>> adding >>>>>> *participants*. Within those processes, when there is a molecule >>>>>> like an >>>>>> enzyme that functions in a particular way, it needs to be >>>>>> distinguished >>>>>> from >>>>>> the other participants in some way. The way that it is >>>>>> distinguished is by >>>>>> saying it bears a function. >>>>>> >>>>>> The heuristic that a molecular function is a "single step" >>>>>> process is >>>>>> misused, IMO. That it is a "single step" derives from the fact >>>>>> that in >>>>>> such >>>>>> descriptions there is a single molecule that has a function. The >>>>>> step is >>>>>> the >>>>>> process that surrounds execution of that function. >>>>>> >>>>>> As I said, getting this distinction clear is essential for future >>>>>> detailing >>>>>> of GO processes such as I and others do. Can we start to get this >>>>>> right >>>>>> here >>>>>> and now? >>>>>> >>>>>> -Alan >>>>>> >>>>>> >>>>>> >>>>>> On Tue, Sep 29, 2009 at 10:30 AM, Midori Harris >>>>>> wrote: >>>>>> >>>>>> Hi, >>>>>>> >>>>>>> We have two SF items in which merging function and process terms >>>>>>> looks >>>>>>> like >>>>>>> a viable solution to a problem. One is an overly-specific, >>>>>>> single-step >>>>>>> "process" that is essentially equivalent to a "function" (in the GO >>>>>>> sense); >>>>>>> the other is a "function" term that is defined such that it doesn't >>>>>>> describe >>>>>>> a single activity, and is essentially the same as a process term. >>>>>>> >>>>>>> SF 2864212 - merge MF term >>>>>>> RNA splicing factor activity, transesterification mechanism >>>>>>> GO:0031202 >>>>>>> >>>>>>> into BP term >>>>>>> RNA splicing, via transesterification reactions GO:0000375 >>>>>>> >>>>>>> For this one, we would merge the MF term into the BP term, and >>>>>>> ensure >>>>>>> that >>>>>>> the merged term has is_a ancestry in the BP graph. I think >>>>>>> that's all we >>>>>>> would change. >>>>>>> >>>>>>> >>>>>>> >>>>>>> https://sourceforge.net/tracker/?func=detail&aid=2864212&group_id=36855&atid=440764 >>>>>>> >>>>>>> >>>>>>> >>>>>>> SF 2864271 - merge BP term >>>>>>> >>>>>>> conversion of met-tRNAf to fmet-tRNA GO:0001718 >>>>>>> >>>>>>> into MF term >>>>>>> methionyl-tRNA formyltransferase activity GO:0004479 >>>>>>> >>>>>>> >>>>>>> >>>>>>> https://sourceforge.net/tracker/?func=detail&aid=2864271&group_id=36855&atid=440764 >>>>>>> >>>>>>> >>>>>>> In this case, the proposal is to merge the process term into the >>>>>>> function >>>>>>> term, ensuring that the merged term has is_a ancestry only in >>>>>>> the MF >>>>>>> ontology,and has a part_of link to a process term. Specifically, >>>>>>> the >>>>>>> merged >>>>>>> term would retain the name and ID from GO:0004479, and would be >>>>>>> part_of >>>>>>> translational initiation GO:0006413. >>>>>>> >>>>>>> Does anyone have any objections or other comments on these? >>>>>>> >>>>>>> thanks, >>>>>>> m >>>>>>> _______________________________________________ >>>>>>> Annotation mailing list >>>>>>> Annotation at geneontology.org >>>>>>> http://fafner.stanford.edu/mailman/listinfo/annotation >>>>>>> >>>>>>> >>>>>> >>>> >>> >> > _______________________________________________ > Annotation mailing list > Annotation at geneontology.org > http://fafner.stanford.edu/mailman/listinfo/annotation From suzi at fruitfly.org Wed Sep 30 17:30:19 2009 From: suzi at fruitfly.org (Suzanna Lewis) Date: Wed, 30 Sep 2009 17:30:19 -0700 Subject: [Ontology-editors] [Annotation] merging MF and BP terms In-Reply-To: <29af5e2d0909291145p1e2af1d4q2e94a2a49103a130@mail.gmail.com> References: <29af5e2d0909291052v3b9821ele28f6229ac2b7232@mail.gmail.com> <29af5e2d0909291145p1e2af1d4q2e94a2a49103a130@mail.gmail.com> Message-ID: <4AE49577-109D-4104-B40C-5E10FA5CE1B2@fruitfly.org> Alan has it right. Functions are not occurents, Function*ings* are. But I think that the solution that David, Karen, and Midori have proposed is fine. We've consciously collapsed (to avoid repeating things) the "Function*ings* are part of processes" into a single artifact, a relationship (i.e. their is no information artifact for Function*ings* itself, it is implicit) that is for now called "part_of" (if I remember correctly). The name of this relationship may not be ideal, but we are trying to be clear about its intended usage. Chris & David first proposed this at the St. Croix meeting. And I seem to remember that David and I talked about this (and many other things, such as the nature of the implied relationship between an instance and a type when anyone makes an annotation) again at the relation ontology meeting in Denver. -S On Sep 29, 2009, at 11:45 AM, Alan Ruttenberg wrote: > > > On Tue, Sep 29, 2009 at 2:38 PM, Judith Blake > wrote: > > Functions are occurrants. A function describes an action, > biochemical, structural, whatever. > > A function doesn't describe anything. Information artifact describe > things. > Sorry to be picky about words. It's my job. > If you look at the papers on BFO you will see that functions are > continuants and occurrents are disjoint from continuants. So > functions are not occurrents. If the GO is using terminology at odds > with BFO that needs to be fixed - we agreed at the last foundry > meeting that we were all going to use a common upper level ontology. > > > Functions are part of processes. > > Function*ings* (i.e. realizations of functions) are part of processes. > > A variety of structures, encoded in a molecule, have the potential > to realize a function. > > Type error: processes don't have potential. (concluded by > substitution "potential to realize a function" + function is_a > process => "potential to realize a process". > > The error is that it is the functions that are the potentials. > Potentials are realized. The realizations are processes. > > We discussed this very recently with Michael and he agreed. > > That is a property of the molecule, not of the function. > > What is the reference of "That"? > > A molecule can have encoded the potential of engaging in several > functions. > > Absolutely. (as long as you rewrite "functions" -> > "functionings" (i.e. processes).) > > A molecule can bear multiple functions is the way we would say this. > > The confusion I think is that of the relationship between the gene > product and a molecular function, and the molecular function and the > biological process. > > And what do you say these are. I (and BFO) are clear: The relation > between a molecular function and a gene product is "inheres". The > relation between molecular function and a biological process is > "realized_by". > > -Alan > > > > Judy > > > > > On 9/29/09 1:52 PM, "Alan Ruttenberg" > wrote: > > The problem is that this still confuses function with process. > Functions are the things that the enzymes (using the term broadly) > have that make the processes happen. If one goes into more detail, > there are a variety of participants in the process (substrates, > products, "cofactors"), but only one bearer of the function in a > particular process. > > Every function has a corresponding process, its "realization". > > Functions are not parts of processes. The realizations of functions > are. > > The functions have a physical basis in the structure of their bearers. > > The functions exist before any process happens. A molecule may have > a function but never actually realize it. > > These distinctions have important consequences as people such as > myself work on adding more detail to the GO. The processes get more > detail by adding *participants*. Within those processes, when there > is a molecule like an enzyme that functions in a particular way, it > needs to be distinguished from the other participants in some way. > The way that it is distinguished is by saying it bears a function. > > The heuristic that a molecular function is a "single step" process > is misused, IMO. That it is a "single step" derives from the fact > that in such descriptions there is a single molecule that has a > function. The step is the process that surrounds execution of that > function. > > As I said, getting this distinction clear is essential for future > detailing of GO processes such as I and others do. Can we start to > get this right here and now? > > -Alan > > > > On Tue, Sep 29, 2009 at 10:30 AM, Midori Harris > wrote: > Hi, > > We have two SF items in which merging function and process terms > looks like a viable solution to a problem. One is an overly- > specific, single-step "process" that is essentially equivalent to a > "function" (in the GO sense); the other is a "function" term that is > defined such that it doesn't describe a single activity, and is > essentially the same as a process term. > > SF 2864212 - merge MF term > RNA splicing factor activity, transesterification mechanism GO: > 0031202 > > into BP term > RNA splicing, via transesterification reactions GO:0000375 > > For this one, we would merge the MF term into the BP term, and > ensure that the merged term has is_a ancestry in the BP graph. I > think that's all we would change. > > https://sourceforge.net/tracker/?func=detail&aid=2864212&group_id=36855&atid=440764 > > > SF 2864271 - merge BP term > > conversion of met-tRNAf to fmet-tRNA GO:0001718 > > into MF term > methionyl-tRNA formyltransferase activity GO:0004479 > > https://sourceforge.net/tracker/?func=detail&aid=2864271&group_id=36855&atid=440764 > > In this case, the proposal is to merge the process term into the > function term, ensuring that the merged term has is_a ancestry only > in the MF ontology,and has a part_of link to a process term. > Specifically, the merged term would retain the name and ID from GO: > 0004479, and would be part_of translational initiation GO:0006413. > > Does anyone have any objections or other comments on these? > > thanks, > m > _______________________________________________ > Annotation mailing list > Annotation at geneontology.org > http://fafner.stanford.edu/mailman/listinfo/annotation > > > > _______________________________________________ > Annotation mailing list > Annotation at geneontology.org > http://fafner.stanford.edu/mailman/listinfo/annotation -------------- next part -------------- An HTML attachment was scrubbed... URL: