Activity log for bug #695616

Date Who What changed Old value New value Message
2010-12-30 09:10:00 Nash bug added bug
2010-12-30 09:17:55 Nash tags and exact integers-as-text match matching recognition string text text+integer
2010-12-30 20:16:20 Nash description I have noticed inconsistency in the Sikuli X text recognition and matching, especially when text on the screen/region are integers. Below are the steps to reproduce the problem scenario: Step1: In your web browser open 'TextOnTheScreen.png' from the following web-link (http://qoydkw.bay.livefilestore.com/y1pSr0oDju9ndvMzJJfo_qWD2qEvMdDDwoKquMNWWkUYAxgrsVcXmETY8yW_LqIieOKXgwlTt8yxe8eQDY-SCySCFljo59mSSIC/TextOnTheScreen.png?psid=1) Step2: To the below sikuli script, provide region coordinates of the 'TextOnTheScreen.png' image which is on your web browser. Note! region coordinates must be significantly bigger because you will zoom-in & zoom-out of the visible content on your browser in-order to carry-out multiple text recognition tests. -------------------------------------------------------- r = Region(“TextOnTheScreen”) t1 = "534438" t2 = "534177" t3 = "4438" t = t2 m = r.exists(t,0) if m: if m.text() == t: popup("Exact match") else: popup(m.nearby().text()) else: popup("Nothing found at all!") -------------------------------------------------------- Step3(TestCase1): Now, run the above sikuli script with t = t2. You might notice one or more of the following: - Integers such as 7 & 8 are incorrectly recognised - All the integers-as-text are recognised correctly - Only few integers are recognised correctly Questions: - Does the text recognition depend on how much OCR gets trained? - Does the font-type & font-size matter? - Should there be a mechanism for validating an exact-match for text recognition? Step4(TestCase2): Now, back to your browser where the 'TextOnTheScreen.png' image is visible. Try to zoom-out (ctrl&-) the image and run the above sikuli script with t = t2. You should now notice variation in the text recognition results. Step5(TestCase3): Repeat 'TestCase2' with different zoom-levels in your browser, you might get varied text recognisation results. Step6(TestCase4): Now, run the above sikuli script with t = t3. You might notice that integers-as-text are not recognised at all or with luck it might recognise some integers-as-text? Hope this provide some test-cases for troubleshooting inconsistency in text/integer recognition. Questions: - Should text and integer recognition be independent operations? Should there be a choice for the application developer if she/he wants to combine and/or isolate such operations where-seen-fit? Would such separation lead to reduced-OCR-complexity/accuracy/better-control over the OCR? - How would one reliability obtain an exact string match for e.g. with click("String") operation? I have noticed inconsistency in the Sikuli X text recognition and matching, especially when text on the screen/region are integers. Below are the steps to reproduce the problem scenario: Step1: In your web browser open 'TextOnTheScreen.png' from the following web-link (http://qoydkw.bay.livefilestore.com/y1pSr0oDju9ndvMzJJfo_qWD2qEvMdDDwoKquMNWWkUYAxgrsVcXmETY8yW_LqIieOKXgwlTt8yxe8eQDY-SCySCFljo59mSSIC/TextOnTheScreen.png?psid=1) Step2: To the below sikuli script, provide region coordinates of the 'TextOnTheScreen.png' image which is on your web browser. Note! region coordinates must be significantly bigger because you will zoom-in & zoom-out of the visible content on your browser in-order to carry-out multiple text recognition tests. -------------------------------------------------------- r = Region(“TextOnTheScreen”) t1 = "534438" t2 = "534177" t3 = "4438" t = t2 m = r.exists(t,0) if m: if m.text() == t: popup("Exact match") else: popup(m.nearby().text()) else: popup("Nothing found at all!") -------------------------------------------------------- Step3(TestCase1): Now, run the above sikuli script with t = t2. You might notice one or more of the following: - Integers such as 7 & 8 are incorrectly recognised - All the integers-as-text are recognised correctly - Only few integers are recognised correctly Questions: - Does the text recognition depend on how much OCR gets trained? - Does the font-type & font-size matter? - Should there be a mechanism for validating an exact-match for text recognition? Step4(TestCase2): Now, back to your browser where the 'TextOnTheScreen.png' image is visible. Try to zoom-out (ctrl&-) the image and run the above sikuli script with t = t2. You should now notice variation in the text recognition results. Step5(TestCase3): Repeat 'TestCase2' with different zoom-levels in your browser, you might get varied text recognisation results. Step6(TestCase4): Now, run the above sikuli script with t = t3. You might notice that integers-as-text are not recognised at all or with luck it might recognise some integers-as-text? Hope this provide some test-cases for troubleshooting inconsistency in text/integer recognition. Questions: - Should text and integer recognition be independent operations? Should there be a choice for the application developer if she/he wants to combine and/or isolate such operations where-seen-fit? Would such separation lead to reduced-OCR-complexity/accuracy/better-control over the OCR? - How would one reliably obtain an exact string match for e.g. with click("String") operation?
2010-12-30 20:16:20 Nash tags and exact integers-as-text match matching recognition string text text+integer exact match recognition text+integer
2011-01-12 10:47:03 RaiMan summary Sikuli X 1.0rc1 - Inconsistency in text recognition and matching, especially with integers-as-text! X 1.0rc1 - Inconsistency in text recognition and matching, especially with integers-as-text!
2011-07-15 14:55:18 bevan dequeker bug added subscriber bevan dequeker
2012-03-12 20:59:20 Simon Pearson bug added subscriber Simon Pearson
2013-02-22 09:38:20 RaiMan sikuli: status New In Progress
2013-02-22 09:38:23 RaiMan sikuli: importance Undecided Low
2013-02-22 09:38:26 RaiMan sikuli: assignee RaiMan (raimund-hocke)
2013-02-22 09:38:41 RaiMan tags exact match recognition text+integer fkt-text