PTP no longer working on igb since 4.15.0-33

Bug #1790096 reported by Emmanuel Fusté
This bug affects 1 person
Affects          Status          Importance   Assigned to        Milestone
linux (Ubuntu)   Fix Committed   Medium       Joseph Salisbury
Bionic           Fix Committed   Medium       Joseph Salisbury

Bug Description

ptp4l has been unable to sync the PHC of an i350 (igb driver) since 4.15.0-33-generic.
4.15.0-32-generic works.

The port no longer reaches the SLAVE state and stays stuck in UNCALIBRATED:

Aug 31 10:03:29 ptp4l[1259]: [79.208] [0:enp7s0f1] port 1: link up
Aug 31 10:03:29 ptp4l[1259]: [79.260] [0:enp7s0f1] port 1: FAULTY to LISTENING on INIT_COMPLETE
Aug 31 10:03:30 ptp4l[1259]: [80.814] [0:enp7s0f1] port 1: new foreign master ec4670.fffe.009fc5-1
Aug 31 10:03:34 ptp4l[1259]: [84.815] [0:enp7s0f1] selected best master clock ec4670.fffe.009fc5
Aug 31 10:03:34 ptp4l[1259]: [84.815] [0:enp7s0f1] port 1: LISTENING to UNCALIBRATED on RS_SLAVE
OR
Aug 31 10:53:44 ptp4l[1259]: [3094.219] [0:enp7s0f1] timed out while polling for tx timestamp
Aug 31 10:53:44 ptp4l[1259]: [3094.219] [0:enp7s0f1] increasing tx_timestamp_timeout may correct this issue, but it is likely caused by a driver bug
Aug 31 10:53:44 ptp4l[1259]: [3094.219] [0:enp7s0f1] port 1: send delay request failed
Aug 31 10:53:44 ptp4l[1259]: [3094.219] [0:enp7s0f1] port 1: UNCALIBRATED to FAULTY on FAULT_DETECTED (FT_UNSPECIFIED)
Aug 31 10:54:00 ptp4l[1259]: [3110.292] [0:enp7s0f1] port 1: FAULTY to LISTENING on INIT_COMPLETE
Aug 31 10:54:02 ptp4l[1259]: [3112.052] [0:enp7s0f1] port 1: new foreign master ec4670.fffe.009fc5-1
Aug 31 10:54:06 ptp4l[1259]: [3116.053] [0:enp7s0f1] selected best master clock ec4670.fffe.009fc5
Aug 31 10:54:06 ptp4l[1259]: [3116.053] [0:enp7s0f1] port 1: LISTENING to UNCALIBRATED on RS_SLAVE

On 4.15.0-32-generic:
Aug 31 09:52:27 ptp4l[2392]: [654.860] [0:enp7s0f1] port 1: new foreign master ec4670.fffe.009fc5-1
Aug 31 09:52:31 ptp4l[2392]: [658.861] [0:enp7s0f1] selected best master clock ec4670.fffe.009fc5
Aug 31 09:52:31 ptp4l[2392]: [658.861] [0:enp7s0f1] port 1: LISTENING to UNCALIBRATED on RS_SLAVE
Aug 31 09:52:33 ptp4l[2392]: [660.737] [0:enp7s0f1] port 1: minimum delay request interval 2^1
Aug 31 09:52:34 ptp4l[2392]: [662.037] [0:enp7s0f1] port 1: UNCALIBRATED to SLAVE on MASTER_CLOCK_SELECTED

I don't know whether the root cause is a change in the driver, in the network or timekeeping code, or somewhere else.
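To compare the two kernels without reading the journal by eye, the port state transitions can be pulled out of the ptp4l output. The following is only a sketch: the function names `port_transitions` and `reached_slave` are made up for illustration, and the regular expression assumes the log layout shown in the excerpts above.

```python
import re

# Matches ptp4l port state transitions such as
# "port 1: LISTENING to UNCALIBRATED on RS_SLAVE".
# Assumption: states and events are upper-case words, as in the logs above.
TRANSITION = re.compile(
    r"port (?P<port>\d+): (?P<old>[A-Z_]+) to (?P<new>[A-Z_]+) on (?P<event>[A-Z_]+)"
)


def port_transitions(lines):
    """Yield (port, old_state, new_state, event) for each transition line."""
    for line in lines:
        m = TRANSITION.search(line)
        if m:
            yield int(m["port"]), m["old"], m["new"], m["event"]


def reached_slave(lines, port=1):
    """Return True if the given port ever entered the SLAVE state."""
    return any(
        p == port and new == "SLAVE" for p, _, new, _ in port_transitions(lines)
    )


# Lines copied from the report: 4.15.0-33 (failing) and 4.15.0-32 (working).
LOG_33 = [
    "Aug 31 10:03:34 ptp4l[1259]: [84.815] [0:enp7s0f1] selected best master clock ec4670.fffe.009fc5",
    "Aug 31 10:03:34 ptp4l[1259]: [84.815] [0:enp7s0f1] port 1: LISTENING to UNCALIBRATED on RS_SLAVE",
]
LOG_32 = [
    "Aug 31 09:52:31 ptp4l[2392]: [658.861] [0:enp7s0f1] port 1: LISTENING to UNCALIBRATED on RS_SLAVE",
    "Aug 31 09:52:34 ptp4l[2392]: [662.037] [0:enp7s0f1] port 1: UNCALIBRATED to SLAVE on MASTER_CLOCK_SELECTED",
]

if __name__ == "__main__":
    print("4.15.0-33 reached SLAVE:", reached_slave(LOG_33))
    print("4.15.0-32 reached SLAVE:", reached_slave(LOG_32))
```

Feeding it the full journal output for each boot should make it easy to see on which kernels the port gets past UNCALIBRATED.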

Ubuntu Kernel Bot (ubuntu-kernel-bot) wrote : Missing required logs.

This bug is missing log files that will aid in diagnosing the problem. While running an Ubuntu kernel (not a mainline or third-party kernel) please enter the following command in a terminal window:

apport-collect 1790096

and then change the status of the bug to 'Confirmed'.

If, due to the nature of the issue you have encountered, you are unable to run this command, please add a comment stating that fact and change the bug status to 'Confirmed'.

This change has been made by an automated script, maintained by the Ubuntu Kernel Team.

Changed in linux (Ubuntu):
status: New → Incomplete
tags: added: bionic
Emmanuel Fusté (emmanuel-fuste-thalesgroup) wrote :

apport report

Changed in linux (Ubuntu):
status: Incomplete → Confirmed
Joseph Salisbury (jsalisbury) wrote :

Would it be possible for you to test the proposed kernel and post back if it resolves this bug?
See https://wiki.ubuntu.com/Testing/EnableProposed for documentation on how to enable and use -proposed.

Changed in linux (Ubuntu):
importance: Undecided → Medium
status: Confirmed → Incomplete
tags: added: needs-bisect
Emmanuel Fusté (emmanuel-fuste-thalesgroup) wrote :

-34 tested a few minutes ago. Same problem.
I'm back with -32 for now.

Emmanuel Fusté (emmanuel-fuste-thalesgroup) wrote :

Looking at the code, the igb driver is untouched between -32 and -34.
The network, timekeeping and x86 timer code needs to be looked at instead.

Joseph Salisbury (jsalisbury) wrote :

I built a 4.15.0-35 test kernel.

The test kernel can be downloaded from:
http://kernel.ubuntu.com/~jsalisbury/lp1792100

Can you test this kernel and see if it resolves this bug?

Note about installing test kernels:
• If the test kernel is prior to 4.15(Bionic) you need to install the linux-image and linux-image-extra .deb packages.
• If the test kernel is 4.15(Bionic) or newer, you need to install the linux-modules, linux-modules-extra and linux-image-unsigned .deb packages.

Thanks in advance!
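The package rule in the note above can be written down as a small helper. This is a sketch only: `packages_for_test_kernel` is a hypothetical name, and it encodes nothing beyond the two bullet points (kernels before 4.15 versus 4.15 and newer).

```python
def packages_for_test_kernel(version):
    """Return the .deb package families to install for a test kernel.

    `version` is a kernel version string such as "4.15.0-35".
    Only the major and minor numbers are used for the comparison.
    """
    major, minor = (int(part) for part in version.split(".")[:2])
    if (major, minor) < (4, 15):
        # Kernels prior to 4.15 (Bionic)
        return ["linux-image", "linux-image-extra"]
    # 4.15 (Bionic) or newer
    return ["linux-modules", "linux-modules-extra", "linux-image-unsigned"]


if __name__ == "__main__":
    for v in ("4.4.0-134", "4.15.0-35"):
        print(v, "->", ", ".join(packages_for_test_kernel(v)))
```

For the 4.15.0-35 test kernel discussed here, this yields the three 4.15-style package families.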

Emmanuel Fusté (emmanuel-fuste-thalesgroup) wrote : Re: [Bug 1790096] Re: PTP no longer working on igb since 4.15.0-33

On 13/09/2018 at 22:44, Joseph Salisbury wrote:
> I built a 4.15.0-35 test kernel.
>
> The test kernel can be downloaded from:
> http://kernel.ubuntu.com/~jsalisbury/lp1792100
>
> Can you test this kernel and see if it resolves this bug?
>
> Note about installing test kernels:
> • If the test kernel is prior to 4.15(Bionic) you need to install the linux-image and linux-image-extra .deb packages.
> • If the test kernel is 4.15(Bionic) or newer, you need to install the linux-modules, linux-modules-extra and linux-image-unsigned .deb packages.
>
> Thanks in advance!
>
Yes, problem fixed!
Do you have a precise idea of the source of the bug?

Thank you!
Emmanuel.

Joseph Salisbury (jsalisbury) wrote :

Would it be possible for you to test the proposed kernel and post back if it resolves this bug?
See https://wiki.ubuntu.com/Testing/EnableProposed for documentation on how to enable and use -proposed.

Changed in linux (Ubuntu):
status: Incomplete → Fix Committed
Emmanuel Fusté (emmanuel-fuste-thalesgroup) wrote :

Hello,
Yes, bug fixed in the latest -proposed kernel.

Thank you
Emmanuel.

Changed in linux (Ubuntu):
assignee: nobody → Joseph Salisbury (jsalisbury)
Changed in linux (Ubuntu Bionic):
status: New → Fix Committed
importance: Undecided → Medium
assignee: nobody → Joseph Salisbury (jsalisbury)
Brad Figg (brad-figg)
tags: added: cscc