Bug #1804062 “test_hacking fails for python 3.6.7 and newer” : Series queens : Bugs : Ubuntu Cloud Archive

Revision history for this message

Chris Dent (cdent) wrote on 2018-11-19:

#1

> The fix ought to be adding a newline to the code sample. Maybe.

Nope. Still digging.

Matt Riedemann (mriedem) on 2018-11-19

Changed in nova:
status:	New → Confirmed

Matt Riedemann (mriedem) on 2018-11-19

Changed in nova:
importance:	Undecided → High

Revision history for this message

OpenStack Infra (hudson-openstack) wrote on 2018-11-19: Related fix proposed to nova (master)

#2

Related fix proposed to branch: master
Review: https://review.openstack.org/618843

Revision history for this message

OpenStack Infra (hudson-openstack) wrote on 2018-11-20: Related fix merged to nova (master)

#3

Reviewed: https://review.openstack.org/618843
Committed: https://git.openstack.org/cgit/openstack/nova/commit/?id=9875c37d9a09ee7160f13c9771cf14ce02cec027
Submitter: Zuul
Branch: master

commit 9875c37d9a09ee7160f13c9771cf14ce02cec027
Author: Chris Dent <email address hidden>
Date: Mon Nov 19 21:41:08 2018 +0000

Skip double word hacking test

    Because of a change [1] in the tokenize package in the stdlib of
    recent pythons, one of the tests for the hacking checks can fail.
    This change skips the test on newer pythons and leaves a TODO
    to fix it.

[1] see https://bugs.python.org/issue33899 and
https://bugs.python.org/issue35107

Change-Id: I64744a8144fcf630eea609eb2b2d14974f4fd4bb
Related-Bug: #1804062

Revision history for this message

Corey Bryant (corey.bryant) wrote on 2019-05-03:

#4

I'm also hitting this on Bionic (bionic-proposed) with the final 2.7.15 release. So this is currently seen on bionic and above with 2.7/3.6/3.7. Ubuntu package versions can be tracked at:

https://launchpad.net/ubuntu/+source/python2.7
https://launchpad.net/ubuntu/+source/python3.6
https://launchpad.net/ubuntu/+source/python3.7

Corey Bryant (corey.bryant) on 2019-05-03

Changed in cloud-archive:
status:	New → Triaged
Changed in nova (Ubuntu):
status:	New → Triaged
Changed in cloud-archive:
importance:	Undecided → High
Changed in nova (Ubuntu):
importance:	Undecided → High

Corey Bryant (corey.bryant) on 2019-05-03

Changed in nova (Ubuntu Bionic):
importance:	Undecided → High
status:	New → Triaged
Changed in nova (Ubuntu Cosmic):
importance:	Undecided → High
status:	New → Triaged
Changed in nova (Ubuntu Disco):
importance:	Undecided → High
status:	New → Triaged

Revision history for this message

Corey Bryant (corey.bryant) wrote on 2019-05-03:

#5

I'm tracking the Ubuntu SRU for the work-around in https://bugs.launchpad.net/bugs/1782786.

Revision history for this message

OpenStack Infra (hudson-openstack) wrote on 2019-06-11: Related fix proposed to nova (master)

#6

Related fix proposed to branch: master
Review: https://review.opendev.org/664622

Revision history for this message

OpenStack Infra (hudson-openstack) wrote on 2019-06-12: Related fix merged to nova (master)

#7

Reviewed: https://review.opendev.org/664622
Committed: https://git.openstack.org/cgit/openstack/nova/commit/?id=18ba6e6e6a7d1294bd1f1ece454f84eb65b219bb
Submitter: Zuul
Branch: master

commit 18ba6e6e6a7d1294bd1f1ece454f84eb65b219bb
Author: Eric Fried <email address hidden>
Date: Tue Jun 11 10:24:11 2019 -0500

Skip test_check_doubled_words hacking check UT

    The referenced bug has reared its ugly head in python 2.7.15 in (at
    least) releaselevel='final', which is being used in the gate, as well as
    3.6.7+. Disable the test entirely until we can fix or work around the
    bug in the tokenizer.

The hacking check itself is still in place. This is just disabling the
unit test that tries to test the hacking check.

NOTE: Noticed that the hacking check is operating on physical_line,
which means it'll catch things like:

"This is the the error"

but not:

("This is the "
"the error")

...which is probably way more likely.

Change-Id: I8826c3fb89690805baae6b9b7b48985abb8d62d3
Related-Bug: #1804062

Revision history for this message

OpenStack Infra (hudson-openstack) wrote on 2019-06-12: Related fix proposed to nova (stable/stein)

#8

Related fix proposed to branch: stable/stein
Review: https://review.opendev.org/664841

Revision history for this message

OpenStack Infra (hudson-openstack) wrote on 2019-06-12: Fix proposed to nova (master)

#9

Fix proposed to branch: master
Review: https://review.opendev.org/664940

Changed in nova:
assignee:	nobody → Stephen Finucane (stephenfinucane)
status:	Confirmed → In Progress

Revision history for this message

OpenStack Infra (hudson-openstack) wrote on 2019-06-12: Fix merged to nova (master)

#10

Download full text (3.7 KiB)

Reviewed: https://review.opendev.org/664940
Committed: https://git.openstack.org/cgit/openstack/nova/commit/?id=f545a25cc443c41dcd9bdd028064c28b53f56037
Submitter: Zuul
Branch: master

commit f545a25cc443c41dcd9bdd028064c28b53f56037
Author: Stephen Finucane <email address hidden>
Date: Wed Jun 12 15:10:59 2019 +0100

Fix double word hacking test

At present, 'pycodestyle' feeds the following string into the 'tokenizer'
library:

["'This is the the best comment'"]

(note the added quotes because this isn't valid Python otherwise)

On previous versions of Python, this tokenizer would parse the string like so:

(3, "'This is the the best comment'", (1, 0), (1, 30), "'This is the the best comment'")
(0, '', (2, 0), (2, 0), '')

where (3 = 'STRING', 0 = 'ENDMARKER')

However, with the fix [1] backported to recent versions of Python, this now
resolves to:

      (3, "'This is the the best comment'", (1, 0), (1, 30), "'This is the the best comment'")
      (4, '', (1, 30), (1, 31), '')
      (0, '', (2, 0), (2, 0), '')

where (3 = 'STRING', 4 = 'NEWLINE', 0 = 'ENDMARKER')

Typically, 'pycodestyle' will run physical line checks on each line as it
parses the token:

https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2036

    For the former case above, the line doesn't include a newline which
    means we never parse a 'NEWLINE' token with a logical line (the fifth
    element of the token tuple) corresponding to our full line. This means
    we don't here but that wasn't an issue previously since there's a
    fallthrough case that handled tokens remaining at the end of the parse:

https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2114-L2116

    Unfortunately, because we now have an additional newline character to
    parse, one that's on a separate line to our test string no less, we run
    logical checks on it:

https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2105-L2107

This is an issue since the logical check wipes stored tokens meaning
we've nothing to check when we get to the fallthrough case:

https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2012

    This fixes changes things so that a newline is included (and also adds
    quotes so it's valid Python, but that's mostly unrelated). This means we
    end up with the following instead:

["'This is the the best comment'\n"]

On both Python without the bugfix and with it, this parses as:

      (3, "'This is the the best comment'", (1, 0), (1, 30), "'This is the the best comment'\n")
      (4, '\n', (1, 30), (1, 31), "'This is the the best comment'\n")
      (0, '', (2, 0), (2, 0), '')

where (3 = 'STRING', 4 = 'NEWLINE', 0 = 'ENDMARKER')

Which triggers things in 'pycodestyle' correctly.

https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2044-L2046

    This isn't _really_ a fix since there's clearly still a bug in either
    'pycodestyle' or Python (I think the latter, since it's adding a newline
    ...

Reviewed:  https://review.opendev.org/664940
Committed: https://git.openstack.org/cgit/openstack/nova/commit/?id=f545a25cc443c41dcd9bdd028064c28b53f56037
Submitter: Zuul
Branch:    master

commit f545a25cc443c41dcd9bdd028064c28b53f56037
Author: Stephen Finucane <sfinucan@redhat.com>
Date:   Wed Jun 12 15:10:59 2019 +0100

Fix double word hacking test
    
    At present, 'pycodestyle' feeds the following string into the 'tokenizer'
    library:
    
        ["'This is the the best comment'"]
    
    (note the added quotes because this isn't valid Python otherwise)
    
    On previous versions of Python, this tokenizer would parse the string like so:
    
      (3, "'This is the the best comment'", (1, 0), (1, 30), "'This is the the best comment'")
      (0, '', (2, 0), (2, 0), '')
    
    where (3 = 'STRING', 0 = 'ENDMARKER')
    
    However, with the fix [1] backported to recent versions of Python, this now
    resolves to:
    
      (3, "'This is the the best comment'", (1, 0), (1, 30), "'This is the the best comment'")
      (4, '', (1, 30), (1, 31), '')
      (0, '', (2, 0), (2, 0), '')
    
    where (3 = 'STRING', 4 = 'NEWLINE', 0 = 'ENDMARKER')
    
    Typically, 'pycodestyle' will run physical line checks on each line as it
    parses the token:
    
      https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2036
    
    For the former case above, the line doesn't include a newline which
    means we never parse a 'NEWLINE' token with a logical line (the fifth
    element of the token tuple) corresponding to our full line. This means
    we don't here but that wasn't an issue previously since there's a
    fallthrough case that handled tokens remaining at the end of the parse:
    
      https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2114-L2116
    
    Unfortunately, because we now have an additional newline character to
    parse, one that's on a separate line to our test string no less, we run
    logical checks on it:
    
      https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2105-L2107
    
    This is an issue since the logical check wipes stored tokens meaning
    we've nothing to check when we get to the fallthrough case:
    
      https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2012
    
    This fixes changes things so that a newline is included (and also adds
    quotes so it's valid Python, but that's mostly unrelated). This means we
    end up with the following instead:
    
      ["'This is the the best comment'\n"]
    
    On both Python without the bugfix and with it, this parses as:
    
      (3, "'This is the the best comment'", (1, 0), (1, 30), "'This is the the best comment'\n")
      (4, '\n', (1, 30), (1, 31), "'This is the the best comment'\n")
      (0, '', (2, 0), (2, 0), '')
    
    where (3 = 'STRING', 4 = 'NEWLINE', 0 = 'ENDMARKER')
    
    Which triggers things in 'pycodestyle' correctly.
    
      https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2044-L2046
    
    This isn't _really_ a fix since there's clearly still a bug in either
    'pycodestyle' or Python (I think the latter, since it's adding a newline
    to a file that explicitly doesn't have one), but the chances of us
    hitting this bug in practice are rather low - you'd need to make a
    mistake on the very last line of a file without a newline at the end
    which is something Vim, for example, won't even let you do without
    setting special flags - and therefore it can be reasonably ignored.
    
    [1] https://bugs.python.org/issue33899
    
    Change-Id: Ia597594e0469c0e83d7ad22b0678390aaebaffe7
    Signed-off-by: Stephen Finucane <sfinucan@redhat.com>
    Closes-Bug: #1804062

Changed in nova:
status:	In Progress → Fix Released

Revision history for this message

OpenStack Infra (hudson-openstack) wrote on 2019-06-13: Change abandoned on nova (stable/stein)

#11

Change abandoned by Lee Yarwood (<email address hidden>) on branch: stable/stein
Review: https://review.opendev.org/664841
Reason: https://review.opendev.org/665147

Revision history for this message

OpenStack Infra (hudson-openstack) wrote on 2019-06-14: Fix merged to nova (stable/stein)

#12

Download full text (4.0 KiB)

Reviewed: https://review.opendev.org/665147
Committed: https://git.openstack.org/cgit/openstack/nova/commit/?id=0cb6106b83c33bded9e6cdec7737964c36be8de5
Submitter: Zuul
Branch: stable/stein

commit 0cb6106b83c33bded9e6cdec7737964c36be8de5
Author: Stephen Finucane <email address hidden>
Date: Wed Jun 12 15:10:59 2019 +0100

Fix double word hacking test

At present, 'pycodestyle' feeds the following string into the 'tokenizer'
library:

["'This is the the best comment'"]

(note the added quotes because this isn't valid Python otherwise)

On previous versions of Python, this tokenizer would parse the string like so:

(3, "'This is the the best comment'", (1, 0), (1, 30), "'This is the the best comment'")
(0, '', (2, 0), (2, 0), '')

where (3 = 'STRING', 0 = 'ENDMARKER')

However, with the fix [1] backported to recent versions of Python, this now
resolves to:

      (3, "'This is the the best comment'", (1, 0), (1, 30), "'This is the the best comment'")
      (4, '', (1, 30), (1, 31), '')
      (0, '', (2, 0), (2, 0), '')

where (3 = 'STRING', 4 = 'NEWLINE', 0 = 'ENDMARKER')

Typically, 'pycodestyle' will run physical line checks on each line as it
parses the token:

https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2036

    For the former case above, the line doesn't include a newline which
    means we never parse a 'NEWLINE' token with a logical line (the fifth
    element of the token tuple) corresponding to our full line. This means
    we don't here but that wasn't an issue previously since there's a
    fallthrough case that handled tokens remaining at the end of the parse:

https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2114-L2116

    Unfortunately, because we now have an additional newline character to
    parse, one that's on a separate line to our test string no less, we run
    logical checks on it:

https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2105-L2107

This is an issue since the logical check wipes stored tokens meaning
we've nothing to check when we get to the fallthrough case:

https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2012

    This fixes changes things so that a newline is included (and also adds
    quotes so it's valid Python, but that's mostly unrelated). This means we
    end up with the following instead:

["'This is the the best comment'\n"]

On both Python without the bugfix and with it, this parses as:

      (3, "'This is the the best comment'", (1, 0), (1, 30), "'This is the the best comment'\n")
      (4, '\n', (1, 30), (1, 31), "'This is the the best comment'\n")
      (0, '', (2, 0), (2, 0), '')

where (3 = 'STRING', 4 = 'NEWLINE', 0 = 'ENDMARKER')

Which triggers things in 'pycodestyle' correctly.

https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2044-L2046

This isn't _really_ a fix since there's clearly still a bug in either
'pycodestyle' or Python (I think the latter, since it's adding a newlin...

Reviewed:  https://review.opendev.org/665147
Committed: https://git.openstack.org/cgit/openstack/nova/commit/?id=0cb6106b83c33bded9e6cdec7737964c36be8de5
Submitter: Zuul
Branch:    stable/stein

commit 0cb6106b83c33bded9e6cdec7737964c36be8de5
Author: Stephen Finucane <sfinucan@redhat.com>
Date:   Wed Jun 12 15:10:59 2019 +0100

Fix double word hacking test
    
    At present, 'pycodestyle' feeds the following string into the 'tokenizer'
    library:
    
        ["'This is the the best comment'"]
    
    (note the added quotes because this isn't valid Python otherwise)
    
    On previous versions of Python, this tokenizer would parse the string like so:
    
      (3, "'This is the the best comment'", (1, 0), (1, 30), "'This is the the best comment'")
      (0, '', (2, 0), (2, 0), '')
    
    where (3 = 'STRING', 0 = 'ENDMARKER')
    
    However, with the fix [1] backported to recent versions of Python, this now
    resolves to:
    
      (3, "'This is the the best comment'", (1, 0), (1, 30), "'This is the the best comment'")
      (4, '', (1, 30), (1, 31), '')
      (0, '', (2, 0), (2, 0), '')
    
    where (3 = 'STRING', 4 = 'NEWLINE', 0 = 'ENDMARKER')
    
    Typically, 'pycodestyle' will run physical line checks on each line as it
    parses the token:
    
      https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2036
    
    For the former case above, the line doesn't include a newline which
    means we never parse a 'NEWLINE' token with a logical line (the fifth
    element of the token tuple) corresponding to our full line. This means
    we don't here but that wasn't an issue previously since there's a
    fallthrough case that handled tokens remaining at the end of the parse:
    
      https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2114-L2116
    
    Unfortunately, because we now have an additional newline character to
    parse, one that's on a separate line to our test string no less, we run
    logical checks on it:
    
      https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2105-L2107
    
    This is an issue since the logical check wipes stored tokens meaning
    we've nothing to check when we get to the fallthrough case:
    
      https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2012
    
    This fixes changes things so that a newline is included (and also adds
    quotes so it's valid Python, but that's mostly unrelated). This means we
    end up with the following instead:
    
      ["'This is the the best comment'\n"]
    
    On both Python without the bugfix and with it, this parses as:
    
      (3, "'This is the the best comment'", (1, 0), (1, 30), "'This is the the best comment'\n")
      (4, '\n', (1, 30), (1, 31), "'This is the the best comment'\n")
      (0, '', (2, 0), (2, 0), '')
    
    where (3 = 'STRING', 4 = 'NEWLINE', 0 = 'ENDMARKER')
    
    Which triggers things in 'pycodestyle' correctly.
    
      https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2044-L2046
    
    This isn't _really_ a fix since there's clearly still a bug in either
    'pycodestyle' or Python (I think the latter, since it's adding a newline
    to a file that explicitly doesn't have one), but the chances of us
    hitting this bug in practice are rather low - you'd need to make a
    mistake on the very last line of a file without a newline at the end
    which is something Vim, for example, won't even let you do without
    setting special flags - and therefore it can be reasonably ignored.
    
    Conflicts:
    	nova/tests/unit/test_hacking.py
    
    NOTE(stephenfin): Conflicts are because we don't have change
    I35c654bd39f343417e0a1124263ff31dcd0b05c9 ("Bump to hacking 1.1.0") or
    change I8826c3fb89690805baae6b9b7b48985abb8d62d3 ("Skip
    test_check_doubled_words hacking check UT") on this branch.
    
    [1] https://bugs.python.org/issue33899
    
    Change-Id: Ia597594e0469c0e83d7ad22b0678390aaebaffe7
    Signed-off-by: Stephen Finucane <sfinucan@redhat.com>
    Closes-Bug: #1804062
    (cherry picked from f545a25cc443c41dcd9bdd028064c28b53f56037)

Revision history for this message

OpenStack Infra (hudson-openstack) wrote on 2019-07-24: Fix merged to nova (stable/rocky)

#13

Download full text (3.8 KiB)

Reviewed: https://review.opendev.org/665788
Committed: https://git.openstack.org/cgit/openstack/nova/commit/?id=4858074c89838eadeb9eaf9f39917e9fb90acd93
Submitter: Zuul
Branch: stable/rocky

commit 4858074c89838eadeb9eaf9f39917e9fb90acd93
Author: Stephen Finucane <email address hidden>
Date: Wed Jun 12 15:10:59 2019 +0100

Fix double word hacking test

At present, 'pycodestyle' feeds the following string into the 'tokenizer'
library:

["'This is the the best comment'"]

(note the added quotes because this isn't valid Python otherwise)

On previous versions of Python, this tokenizer would parse the string like so:

(3, "'This is the the best comment'", (1, 0), (1, 30), "'This is the the best comment'")
(0, '', (2, 0), (2, 0), '')

where (3 = 'STRING', 0 = 'ENDMARKER')

However, with the fix [1] backported to recent versions of Python, this now
resolves to:

      (3, "'This is the the best comment'", (1, 0), (1, 30), "'This is the the best comment'")
      (4, '', (1, 30), (1, 31), '')
      (0, '', (2, 0), (2, 0), '')

where (3 = 'STRING', 4 = 'NEWLINE', 0 = 'ENDMARKER')

Typically, 'pycodestyle' will run physical line checks on each line as it
parses the token:

https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2036

    For the former case above, the line doesn't include a newline which
    means we never parse a 'NEWLINE' token with a logical line (the fifth
    element of the token tuple) corresponding to our full line. This means
    we don't here but that wasn't an issue previously since there's a
    fallthrough case that handled tokens remaining at the end of the parse:

https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2114-L2116

    Unfortunately, because we now have an additional newline character to
    parse, one that's on a separate line to our test string no less, we run
    logical checks on it:

https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2105-L2107

This is an issue since the logical check wipes stored tokens meaning
we've nothing to check when we get to the fallthrough case:

https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2012

    This fixes changes things so that a newline is included (and also adds
    quotes so it's valid Python, but that's mostly unrelated). This means we
    end up with the following instead:

["'This is the the best comment'\n"]

On both Python without the bugfix and with it, this parses as:

      (3, "'This is the the best comment'", (1, 0), (1, 30), "'This is the the best comment'\n")
      (4, '\n', (1, 30), (1, 31), "'This is the the best comment'\n")
      (0, '', (2, 0), (2, 0), '')

where (3 = 'STRING', 4 = 'NEWLINE', 0 = 'ENDMARKER')

Which triggers things in 'pycodestyle' correctly.

https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2044-L2046

This isn't _really_ a fix since there's clearly still a bug in either
'pycodestyle' or Python (I think the latter, since it's adding a newlin...

Reviewed:  https://review.opendev.org/665788
Committed: https://git.openstack.org/cgit/openstack/nova/commit/?id=4858074c89838eadeb9eaf9f39917e9fb90acd93
Submitter: Zuul
Branch:    stable/rocky

commit 4858074c89838eadeb9eaf9f39917e9fb90acd93
Author: Stephen Finucane <sfinucan@redhat.com>
Date:   Wed Jun 12 15:10:59 2019 +0100

Fix double word hacking test
    
    At present, 'pycodestyle' feeds the following string into the 'tokenizer'
    library:
    
        ["'This is the the best comment'"]
    
    (note the added quotes because this isn't valid Python otherwise)
    
    On previous versions of Python, this tokenizer would parse the string like so:
    
      (3, "'This is the the best comment'", (1, 0), (1, 30), "'This is the the best comment'")
      (0, '', (2, 0), (2, 0), '')
    
    where (3 = 'STRING', 0 = 'ENDMARKER')
    
    However, with the fix [1] backported to recent versions of Python, this now
    resolves to:
    
      (3, "'This is the the best comment'", (1, 0), (1, 30), "'This is the the best comment'")
      (4, '', (1, 30), (1, 31), '')
      (0, '', (2, 0), (2, 0), '')
    
    where (3 = 'STRING', 4 = 'NEWLINE', 0 = 'ENDMARKER')
    
    Typically, 'pycodestyle' will run physical line checks on each line as it
    parses the token:
    
      https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2036
    
    For the former case above, the line doesn't include a newline which
    means we never parse a 'NEWLINE' token with a logical line (the fifth
    element of the token tuple) corresponding to our full line. This means
    we don't here but that wasn't an issue previously since there's a
    fallthrough case that handled tokens remaining at the end of the parse:
    
      https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2114-L2116
    
    Unfortunately, because we now have an additional newline character to
    parse, one that's on a separate line to our test string no less, we run
    logical checks on it:
    
      https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2105-L2107
    
    This is an issue since the logical check wipes stored tokens meaning
    we've nothing to check when we get to the fallthrough case:
    
      https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2012
    
    This fixes changes things so that a newline is included (and also adds
    quotes so it's valid Python, but that's mostly unrelated). This means we
    end up with the following instead:
    
      ["'This is the the best comment'\n"]
    
    On both Python without the bugfix and with it, this parses as:
    
      (3, "'This is the the best comment'", (1, 0), (1, 30), "'This is the the best comment'\n")
      (4, '\n', (1, 30), (1, 31), "'This is the the best comment'\n")
      (0, '', (2, 0), (2, 0), '')
    
    where (3 = 'STRING', 4 = 'NEWLINE', 0 = 'ENDMARKER')
    
    Which triggers things in 'pycodestyle' correctly.
    
      https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2044-L2046
    
    This isn't _really_ a fix since there's clearly still a bug in either
    'pycodestyle' or Python (I think the latter, since it's adding a newline
    to a file that explicitly doesn't have one), but the chances of us
    hitting this bug in practice are rather low - you'd need to make a
    mistake on the very last line of a file without a newline at the end
    which is something Vim, for example, won't even let you do without
    setting special flags - and therefore it can be reasonably ignored.
    
    [1] https://bugs.python.org/issue33899
    
    Change-Id: Ia597594e0469c0e83d7ad22b0678390aaebaffe7
    Signed-off-by: Stephen Finucane <sfinucan@redhat.com>
    Closes-Bug: #1804062
    (cherry picked from f545a25cc443c41dcd9bdd028064c28b53f56037)
    (cherry picked from commit 0cb6106b83c33bded9e6cdec7737964c36be8de5)

Revision history for this message

OpenStack Infra (hudson-openstack) wrote on 2019-08-10: Fix merged to nova (stable/queens)

#14

Download full text (3.9 KiB)

Reviewed: https://review.opendev.org/665790
Committed: https://git.openstack.org/cgit/openstack/nova/commit/?id=dc2963d2a0002d4ddacf50b1f80c470d5de7ec61
Submitter: Zuul
Branch: stable/queens

commit dc2963d2a0002d4ddacf50b1f80c470d5de7ec61
Author: Stephen Finucane <email address hidden>
Date: Wed Jun 12 15:10:59 2019 +0100

Fix double word hacking test

At present, 'pycodestyle' feeds the following string into the 'tokenizer'
library:

["'This is the the best comment'"]

(note the added quotes because this isn't valid Python otherwise)

On previous versions of Python, this tokenizer would parse the string like so:

(3, "'This is the the best comment'", (1, 0), (1, 30), "'This is the the best comment'")
(0, '', (2, 0), (2, 0), '')

where (3 = 'STRING', 0 = 'ENDMARKER')

However, with the fix [1] backported to recent versions of Python, this now
resolves to:

      (3, "'This is the the best comment'", (1, 0), (1, 30), "'This is the the best comment'")
      (4, '', (1, 30), (1, 31), '')
      (0, '', (2, 0), (2, 0), '')

where (3 = 'STRING', 4 = 'NEWLINE', 0 = 'ENDMARKER')

Typically, 'pycodestyle' will run physical line checks on each line as it
parses the token:

https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2036

    For the former case above, the line doesn't include a newline which
    means we never parse a 'NEWLINE' token with a logical line (the fifth
    element of the token tuple) corresponding to our full line. This means
    we don't here but that wasn't an issue previously since there's a
    fallthrough case that handled tokens remaining at the end of the parse:

https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2114-L2116

    Unfortunately, because we now have an additional newline character to
    parse, one that's on a separate line to our test string no less, we run
    logical checks on it:

https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2105-L2107

This is an issue since the logical check wipes stored tokens meaning
we've nothing to check when we get to the fallthrough case:

https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2012

    This fixes changes things so that a newline is included (and also adds
    quotes so it's valid Python, but that's mostly unrelated). This means we
    end up with the following instead:

["'This is the the best comment'\n"]

On both Python without the bugfix and with it, this parses as:

      (3, "'This is the the best comment'", (1, 0), (1, 30), "'This is the the best comment'\n")
      (4, '\n', (1, 30), (1, 31), "'This is the the best comment'\n")
      (0, '', (2, 0), (2, 0), '')

where (3 = 'STRING', 4 = 'NEWLINE', 0 = 'ENDMARKER')

Which triggers things in 'pycodestyle' correctly.

https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2044-L2046

This isn't _really_ a fix since there's clearly still a bug in either
'pycodestyle' or Python (I think the latter, since it's adding a newli...

Reviewed:  https://review.opendev.org/665790
Committed: https://git.openstack.org/cgit/openstack/nova/commit/?id=dc2963d2a0002d4ddacf50b1f80c470d5de7ec61
Submitter: Zuul
Branch:    stable/queens

commit dc2963d2a0002d4ddacf50b1f80c470d5de7ec61
Author: Stephen Finucane <sfinucan@redhat.com>
Date:   Wed Jun 12 15:10:59 2019 +0100

Fix double word hacking test
    
    At present, 'pycodestyle' feeds the following string into the 'tokenizer'
    library:
    
        ["'This is the the best comment'"]
    
    (note the added quotes because this isn't valid Python otherwise)
    
    On previous versions of Python, this tokenizer would parse the string like so:
    
      (3, "'This is the the best comment'", (1, 0), (1, 30), "'This is the the best comment'")
      (0, '', (2, 0), (2, 0), '')
    
    where (3 = 'STRING', 0 = 'ENDMARKER')
    
    However, with the fix [1] backported to recent versions of Python, this now
    resolves to:
    
      (3, "'This is the the best comment'", (1, 0), (1, 30), "'This is the the best comment'")
      (4, '', (1, 30), (1, 31), '')
      (0, '', (2, 0), (2, 0), '')
    
    where (3 = 'STRING', 4 = 'NEWLINE', 0 = 'ENDMARKER')
    
    Typically, 'pycodestyle' will run physical line checks on each line as it
    parses the token:
    
      https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2036
    
    For the former case above, the line doesn't include a newline which
    means we never parse a 'NEWLINE' token with a logical line (the fifth
    element of the token tuple) corresponding to our full line. This means
    we don't here but that wasn't an issue previously since there's a
    fallthrough case that handled tokens remaining at the end of the parse:
    
      https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2114-L2116
    
    Unfortunately, because we now have an additional newline character to
    parse, one that's on a separate line to our test string no less, we run
    logical checks on it:
    
      https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2105-L2107
    
    This is an issue since the logical check wipes stored tokens meaning
    we've nothing to check when we get to the fallthrough case:
    
      https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2012
    
    This fixes changes things so that a newline is included (and also adds
    quotes so it's valid Python, but that's mostly unrelated). This means we
    end up with the following instead:
    
      ["'This is the the best comment'\n"]
    
    On both Python without the bugfix and with it, this parses as:
    
      (3, "'This is the the best comment'", (1, 0), (1, 30), "'This is the the best comment'\n")
      (4, '\n', (1, 30), (1, 31), "'This is the the best comment'\n")
      (0, '', (2, 0), (2, 0), '')
    
    where (3 = 'STRING', 4 = 'NEWLINE', 0 = 'ENDMARKER')
    
    Which triggers things in 'pycodestyle' correctly.
    
      https://github.com/PyCQA/pycodestyle/blob/2.5.0/pycodestyle.py#L2044-L2046
    
    This isn't _really_ a fix since there's clearly still a bug in either
    'pycodestyle' or Python (I think the latter, since it's adding a newline
    to a file that explicitly doesn't have one), but the chances of us
    hitting this bug in practice are rather low - you'd need to make a
    mistake on the very last line of a file without a newline at the end
    which is something Vim, for example, won't even let you do without
    setting special flags - and therefore it can be reasonably ignored.
    
    [1] https://bugs.python.org/issue33899
    
    Change-Id: Ia597594e0469c0e83d7ad22b0678390aaebaffe7
    Signed-off-by: Stephen Finucane <sfinucan@redhat.com>
    Closes-Bug: #1804062
    (cherry picked from f545a25cc443c41dcd9bdd028064c28b53f56037)
    (cherry picked from commit 0cb6106b83c33bded9e6cdec7737964c36be8de5)
    (cherry picked from commit 4858074c89838eadeb9eaf9f39917e9fb90acd93)

Revision history for this message

OpenStack Infra (hudson-openstack) wrote on 2019-08-12: Fix included in openstack/nova 19.0.2

#15

This issue was fixed in the openstack/nova 19.0.2 release.

Revision history for this message

OpenStack Infra (hudson-openstack) wrote on 2019-08-12: Fix included in openstack/nova 18.2.2

#16

This issue was fixed in the openstack/nova 18.2.2 release.

Revision history for this message

OpenStack Infra (hudson-openstack) wrote on 2019-08-19: Fix included in openstack/nova 17.0.12

#17

This issue was fixed in the openstack/nova 17.0.12 release.

Revision history for this message

OpenStack Infra (hudson-openstack) wrote on 2019-09-27: Fix included in openstack/nova 20.0.0.0rc1

#18

This issue was fixed in the openstack/nova 20.0.0.0rc1 release candidate.

Revision history for this message

Chris MacNaughton (chris.macnaughton) wrote on 2020-06-04:

#19

Looking at the versions of nova that are in the archives, this seems to be fix-released

Changed in cloud-archive:
status:	Fix Committed → Fix Released
Changed in nova (Ubuntu Bionic):
status:	Triaged → Fix Released
Changed in nova (Ubuntu Cosmic):
status:	Triaged → Fix Released
Changed in nova (Ubuntu Disco):
status:	Triaged → Fix Released
Changed in nova (Ubuntu Eoan):
status:	Triaged → Fix Released
Changed in nova (Ubuntu):
status:	Triaged → Fix Released

Radosław Piliszek (yoctozepto) on 2021-09-15

no longer affects:

masakari

Ubuntu Cloud Archive

test_hacking fails for python 3.6.7 and newer

Bug Description

Duplicates of this bug

Other bug subscribers

Remote bug watches

	Status	Importance	Assigned to
OpenStack Compute (nova)	Fix Released	High	Stephen Finucane
Ubuntu Cloud Archive	Fix Released	High	Unassigned
Queens	Fix Released	High	Unassigned
Rocky	Fix Released	High	Unassigned
Stein	Fix Released	High	Unassigned
nova (Ubuntu)	Fix Released	High	Unassigned
Bionic	Fix Released	High	Unassigned
Cosmic	Fix Released	High	Unassigned
Disco	Fix Released	High	Unassigned
Eoan	Fix Released	High	Unassigned