[Ontology-editors] [OBO-Edit Working Group] non-critical warnings on GO file
Alexander Diehl
adiehl at informatics.jax.org
Fri May 1 05:37:34 PDT 2009
Just as another data point:
With a newly downloaded gene_ontology_write.obo file, if I open the file
and immediately save with no changes, I get 19 non-critical warnings,
using a fairly fresh install of OBO-Edit 2.0 on Mac OS X 10.4.11 with
whatever up-to-date Java that OS supports.
--Alex
Jane Lomax wrote:
> I started using OBO-Edit 2.0 with clean preferences - I didn't copy my
> old preferences file across.
>
> Jane
>
> Alexander Diehl wrote:
>> Have you tried running OE2 with a newly generated set of preference
>> files, rather than copying over your preferences?
>>
>> -- Alex
>>
>>
>> Jennifer Deegan (nee Clark) wrote:
>>> Hi,
>>>
>>> I'm seeing this bug too. Amina, did you want to look at it since you
>>> worked on this last, or shall I? It's strange that Karen isn't
>>> seeing the bug when Jane is seeing it on the Mac and I have it on
>>> Windows.
>>>
>>> Jen
>>>
>>> Jane Lomax wrote:
>>>
>>>> I'd love to turn my verification checker back on but it hasn't been
>>>> working properly for a while now. I guess others might be having
>>>> the same problem which is why the file is accumulating errors.
>>>>
>>>> The problem is that it doesn't see all those words that Jen added
>>>> to the dictionary so I get zillions of warnings when I run it. I
>>>> have submitted a SF item:
>>>> https://sourceforge.net/tracker/?func=detail&aid=2784937&group_id=36855&atid=418257
>>>>
>>>>
>>>> Jane
>>>>
>>>> Karen Christie wrote:
>>>>
>>>>> Hi GO ontology editors,
>>>>>
>>>>> I did an edit today and noticed that there were 26 non-critical
>>>>> warnings. I went through all of them to see what they were. There
>>>>> are a couple types of warnings where we should probably change
>>>>> what the verification check looks for (which is why I cc'd the
>>>>> OEWG on this), but there were a bunch of user errors which people
>>>>> should have fixed before they committed.
>>>>>
>>>>> In the process of releasing OE2, Jen did a lot of work to clean up
>>>>> the hundreds of these that we used to have, so that people could
>>>>> actually use the verification checks to catch problems they
>>>>> introduced. But if we start collecting a whole bunch of these
>>>>> again, then everyone will ignore the verification checks again and
>>>>> we'll be back to where we were, and eventually someone will have
>>>>> to go through and clean them up again.
>>>>>
>>>>> I think it would be best if we can keep GO "clean" of these types
>>>>> of problems so that the verification checks are useful to each
>>>>> person as they save, so they can use it to fix their own problems
>>>>> BEFORE they commit them.
>>>>>
>>>>> Below is what I found in going through the warnings. Maybe we can
>>>>> talk about appropriate procedure to avoid accumulating these
>>>>> warnings, and perhaps the OEWG can talk about whether two of the
>>>>> checks are picking up things they shouldn't be.
>>>>>
>>>>> thanks,
>>>>>
>>>>> -Karen
>>>>>
>>>>>
>>>>> 1. User Errors: Almost half were simple typos, e.g. "anaphasep"
>>>>> instead of "anaphase.", internal newlines within definitions, or
>>>>> missing final periods from definitions, the latter often occurring
>>>>> in defs from EC or MetaCyc.
>>>>>
>>>>> It seems that people should NOT be committing the ontology with
>>>>> these types of errors; they should fix them before they commit so
>>>>> that we don't accumulate scads of them.
>>>>>
>>>>> There was also one URL in a definition. By comparison with the
>>>>> other URLs that the verification check flagged, it seems that
>>>>> perhaps this should be in the comment, not the definition?
>>>>>
>>>>>
>>>>> 2. Verification Check issues:
>>>>>
>>>>> Then, there were two other types of warnings, where it looks like
>>>>> maybe the checks are picking up things that should be allowed.
>>>>>
>>>>> Repeated word - There were four "repeated words" reported where it
>>>>> ignored the fact that there was punctuation in between the two
>>>>> instances of the repeated word. While in two of these it might be
>>>>> less than grammatically ideal to use the same word twice in close
>>>>> succession, none of these are illegal, and for two of them there
>>>>> is probably no other way to phrase it. Perhaps it should not
>>>>> report repeated words when there is punctuation in between.
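
A minimal sketch of the behaviour suggested above, written in Python
rather than taken from OBO-Edit's own code: a word is reported only when
its two instances are separated by nothing but whitespace, so pairs with
punctuation in between are skipped. The function name and the sample
strings are hypothetical, chosen only for illustration.

```python
import re

# Hypothetical helper, not part of OBO-Edit.
# A word (\w+) followed by whitespace only (\s+) and then the same word
# again (\1). Punctuation between the two instances prevents a match.
REPEATED_WORD = re.compile(r"\b(\w+)\s+\1\b", re.IGNORECASE)


def repeated_words(text):
    """Return each word that is immediately repeated with only whitespace between."""
    return [match.group(1) for match in REPEATED_WORD.finditer(text)]


if __name__ == "__main__":
    # Plain typo: should be reported.
    print(repeated_words("Binding to the the receptor."))
    # Punctuation between the instances: should not be reported.
    print(repeated_words("In yeast, yeast cells divide by budding."))
```

The word boundaries (\b) keep the pattern from matching inside longer
words, so text such as "this is island" is not reported either.
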
>>>>>
>>>>> Issue with sentence boundaries - Most of the rest of the warnings
>>>>> were about periods with no whitespace after them, resulting in two
>>>>> warnings:
>>>>> - sentences that do not start with a capital
>>>>> - sentences that are not separated by whitespace.
>>>>>
>>>>> However, none of the flagged issues were supposed to be sentences.
>>>>> Most were URLs in comment fields. A couple others were names or
>>>>> formulas that contained periods where there was no whitespace
>>>>> after the period. Perhaps we should not look for periods followed
>>>>> by a non-whitespace character.
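
One way to read the suggestion above, sketched in Python under the
assumption that a sentence boundary is a period followed by whitespace:
periods inside URLs, names or formulas are then never treated as
boundaries, so they cannot trigger either warning. This is not how
OBO-Edit implements its check; the function name and the sample strings
(including the example.org URL) are hypothetical.

```python
import re

# Hypothetical helper, not part of OBO-Edit.
# Split only where a period is followed by whitespace. A period followed
# by a non-whitespace character (as in a URL or "3.1.1.1") is left alone.
SENTENCE_BREAK = re.compile(r"(?<=\.)\s+")


def uncapitalized_sentences(text):
    """Return sentences that begin with a lowercase letter."""
    sentences = SENTENCE_BREAK.split(text.strip())
    return [s for s in sentences
            if s and s[0].isalpha() and not s[0].isupper()]


if __name__ == "__main__":
    # Periods inside the URL are not boundaries: nothing is reported.
    print(uncapitalized_sentences(
        "See http://www.example.org/go.html for details."))
    # A real boundary followed by a lowercase start is still reported.
    print(uncapitalized_sentences(
        "Occurs in the nucleus. also found in the cytoplasm."))
```

With this rule the "not separated by whitespace" warning goes away by
construction, which means a genuinely missing space between two
sentences would no longer be caught; that trade-off would need to be
weighed.
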
>>>>>
>>>>>
>>>>> _______________________________________________
>>>>> Geneontology-oboedit-working-group mailing list
>>>>> Geneontology-oboedit-working-group at lists.sourceforge.net
>>>>> https://lists.sourceforge.net/lists/listinfo/geneontology-oboedit-working-group
>>>>>
>>>>>
>>>>
>>>
>>>
>>
>>
>
>
--
Alexander D. Diehl, Ph.D.
Senior Scientific Curator
Mouse Genome Informatics
The Jackson Laboratory
600 Main Street
Bar Harbor, ME 04609
email: adiehl at informatics.jax.org
work: +1 (207) 288-6427
fax: +1 (207) 288-6131