non-normalized concepts exist
Bug #445125 reported by
Ken Arnold
This bug affects 1 person
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
ConceptNet |
Fix Committed
|
Medium
|
Rob Speer |
Bug Description
I noticed that some concepts seem to be not normalized:
>>> Concept.
<Concept: <en: balls>>
>>> Concept.get('ball', 'en')
<Concept: <en: ball>>
>>> Concept.
<SurfaceForm: balls>
>>> Concept.
45
Where'd that come from?
Changed in conceptnet: | |
status: | In Progress → Fix Committed |
To post a comment you must log in.
_Lots_ of non-normalized concepts exist:
>>> from csc.conceptnet. models import * objects. filter( language= 'en').order_ by().values_ list('text' , 'concept_ _text') .iterator( ): (text) != normalized:
bad_ surfaces. append( text)
>>> from csc.nl import get_nl
>>> en_nl = get_nl('en')
>>> bad_surfaces = []
>>> for text, normalized in SurfaceForm.
if en_nl.normalize
>>> len(bad_surfaces)
29955