[Ontology-editors] [OBO-Edit Working Group] non-critical warnings on GO file
Alexander Diehl
adiehl at informatics.jax.org
Fri May 1 05:14:36 PDT 2009
Have you tried running OE2 with a newly generated set of preference
files, rather than copying over your preferences?
-- Alex
Jennifer Deegan (nee Clark) wrote:
> Hi,
>
> I'm seeing this bug too. Amina did you want to look at it since you
> worked on this last, or shall I? It's strange that Karen isn't seeing
> the bug when Jane is seeing it on the Mac and I have it on windows.
>
> Jen
>
> Jane Lomax wrote:
>
>> I'd love to turn my verification checker back on but it hasn't been
>> working properly for a while now. I guess others might be having the
>> same problem which is why the file is accumulating errors.
>>
>> The problem is that it doesn't see all those words that Jen added to the
>> dictionary so I get zillions of warnings when I run it. I have submitted
>> a SF item:
>> https://sourceforge.net/tracker/?func=detail&aid=2784937&group_id=36855&atid=418257
>>
>> Jane
>>
>> Karen Christie wrote:
>>
>>> Hi GO ontology editors,
>>>
>>> I did an edit today and noticed that there were 26 non-critical warnings.
>>> I went through all of them to see what they were. There are a couple types
>>> of warnings where we should probably change what the verification check
>>> looks for (which is why I cc'd the OEWG on this), but there were a bunch
>>> of user errors which people should have fixed before they committed.
>>>
>>> In the process of releasing OE2, Jen did a lot of work to clean up the
>>> hundreds of these that we used to have, so that people could actually use
>>> the verification checks to catch problems they introduced. But if we start
>>> collecting a whole bunch of these again, then everyone will ignore the
>>> verification checks again and we'll be back to where we were, and
>>> eventually someone will have to go through and clean them up again.
>>>
>>> I think it would be best if we can keep GO "clean" of these types of
>>> problems so that the verification checks are useful to each person as they
>>> save, so they can use it to fix their own problems BEFORE they commit
>>> them.
>>>
>>> Below is what I found in going through the warnings. Maybe we can talk
>>> about appropriate procedure to avoid accumulating these warnings, and
>>> perhaps the OEWG can talk about whether two of the checks are picking up
>>> things they shouldn't be.
>>>
>>> thanks,
>>>
>>> -Karen
>>>
>>>
>>> 1. User Errors: Almost half were simple typos, e.g. "anaphasep" instead of
>>> "anaphase.", internal newlines within definitions, or missing final
>>> periods from definitions, the latter often occurring in defs from EC or
>>> MetaCyc.
>>>
>>> It seems that people should NOT be committing the ontology with these
>>> types of errors, they should fix them before they commit so that we con't
>>> accumulate scads of them.
>>>
>>> There was also one url in a definition. By comparison with the other urls
>>> that the verificatino check flagged, it seems that perhaps this should be
>>> in the comment, not the definition?
>>>
>>>
>>> 2. Verification Check issues:
>>>
>>> Then, there were two other types of warnings, where it looks like maybe
>>> the checks are picking up things that should be allowed.
>>>
>>> Repeated word - There were four "repeated words" reported where it ignored
>>> the fact that there was punctuation in between the two instances of the
>>> repeated word. While two of these might be less than gramatically ideal to
>>> use the same word twice in close succession, none of these are illegal,
>>> and two of them there is probably no other way to phrase it. Perhaps it
>>> should not report repeated words when there is punctuation in between.
>>>
>>> Issue with sentence boundaries - Most of the rest of the warnings were
>>> about periods with no whitespace after them, resulting in two warnings:
>>> - sentences that do not start with a capital
>>> - sentences that are not separated by whitespace.
>>>
>>> However, none of the flagged issues were supposed to be sentences. Most
>>> were urls in comment fields. A couple others were names or formulas that
>>> contained periods where there was no whitespace after the period. Perhaps
>>> we should not look for periods followed by a non-whitespace character.
>>>
>>>
>>> ------------------------------------------------------------------------------
>>> Register Now & Save for Velocity, the Web Performance & Operations
>>> Conference from O'Reilly Media. Velocity features a full day of
>>> expert-led, hands-on workshops and two days of sessions from industry
>>> leaders in dedicated Performance & Operations tracks. Use code vel09scf
>>> and Save an extra 15% before 5/3. http://p.sf.net/sfu/velocityconf
>>> _______________________________________________
>>> Geneontology-oboedit-working-group mailing list
>>> Geneontology-oboedit-working-group at lists.sourceforge.net
>>> https://lists.sourceforge.net/lists/listinfo/geneontology-oboedit-working-group
>>>
>>>
>>
>
> ------------------------------------------------------------------------------
> Register Now & Save for Velocity, the Web Performance & Operations
> Conference from O'Reilly Media. Velocity features a full day of
> expert-led, hands-on workshops and two days of sessions from industry
> leaders in dedicated Performance & Operations tracks. Use code vel09scf
> and Save an extra 15% before 5/3. http://p.sf.net/sfu/velocityconf
> _______________________________________________
> Geneontology-oboedit-working-group mailing list
> Geneontology-oboedit-working-group at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/geneontology-oboedit-working-group
>
--
Alexander D. Diehl, Ph.D.
Senior Scientific Curator
Mouse Genome Informatics
The Jackson Laboratory
600 Main Street
Bar Harbor, ME 04609
email: adiehl at informatics.jax.org
work: +1 (207) 288-6427
fax: +1 (207) 288-6131
More information about the Ontology-editors
mailing list