Activity log for bug #1961697

Date Who What changed Old value New value Message
2022-02-22 05:42:14 KJ Tsanaktsidis bug added bug
2022-02-22 05:42:14 KJ Tsanaktsidis attachment added upstream patch with conflicts resolved for 2.27 https://bugs.launchpad.net/bugs/1961697/+attachment/5562611/+files/resolv-txnid-collision.patch
2022-02-22 05:42:52 KJ Tsanaktsidis attachment added Packet capture showing DNS queries with same txnid https://bugs.launchpad.net/ubuntu/+source/glibc/+bug/1961697/+attachment/5562612/+files/dns_same_txid.pcap
2022-02-22 05:43:10 KJ Tsanaktsidis attachment added glibc backtrace from hang https://bugs.launchpad.net/ubuntu/+source/glibc/+bug/1961697/+attachment/5562613/+files/glibc_backtrace.txt
2022-02-22 06:24:47 Launchpad Janitor glibc (Ubuntu): status New Confirmed
2022-02-22 08:25:48 Ubuntu Foundations Team Bug Bot tags patch
2022-02-22 08:25:56 Ubuntu Foundations Team Bug Bot bug added subscriber Ubuntu Review Team
2022-02-22 18:16:25 Brian Murray tags patch patch rls-ff-incoming
2022-02-22 18:16:43 Brian Murray bug watch added https://sourceware.org/bugzilla/show_bug.cgi?id=26600
2022-02-22 18:16:43 Brian Murray bug task added glibc
2022-02-22 18:17:26 Brian Murray glibc (Ubuntu): importance Undecided Medium
2022-02-22 18:41:12 Bug Watch Updater glibc: status Unknown Fix Released
2022-02-22 18:41:12 Bug Watch Updater glibc: importance Unknown Medium
2022-02-24 19:18:52 Brian Murray bug added subscriber Michael Hudson-Doyle
2022-03-10 18:28:10 Brian Murray tags patch rls-ff-incoming fr-2102 patch rls-ff-incoming
2022-03-10 18:28:24 Brian Murray nominated for series Ubuntu Focal
2022-03-10 18:28:24 Brian Murray bug task added glibc (Ubuntu Focal)
2022-03-10 18:28:33 Brian Murray glibc (Ubuntu Focal): importance Undecided Medium
2022-03-10 18:28:43 Brian Murray tags fr-2102 patch rls-ff-incoming fr-2102 patch
2022-03-10 21:08:37 Michael Hudson-Doyle description When resolving DNS names with getaddrinfo(), I have seen this hang for 5 seconds and then retry and succeed. The issue is that glibc will issue a both an A and AAAA query on the same socket, and in some circumstances they can be sent with the same DNS transaction ID as well. I verified this with a packet capture; in the packet capture, I saw the A and AAAA queries for a name be made with the same DNS transaction ID, get responses, do nothing for five seconds, and then send the same DNS query again. On the glibc side, I confirmed that it's blocked waiting for the DNS response by interrupting it with gdb, even though the packet capture shows the response has well and truly arrived. I've attached a packet capture & a backtrace of the glibc hang. I believe this is the same issue reported in these places: * In RHEL: https://bugzilla.redhat.com/show_bug.cgi?id=1904153 * Also RHEL: https://bugzilla.redhat.com/show_bug.cgi?id=1903880 * Upstream: https://sourceware.org/bugzilla/show_bug.cgi?id=26600 The environment I noticed this bug in was: * Docker for Mac on an arm64 m1 Macbook * Docker for Mac Linux kernel version is 5.10.76-linuxkit * Linux is also arm64, not emulated * Container with the buggy DNS environment is Ubuntu bionic (also arm64, not emulated) * Glibc 2.27-3ubuntu1.4 However one of the redhat reporters noticed this issue in m6 series EC2 instances in AWS. A patch has been provided upstream for this issue: https://sourceware.org/pipermail/libc-alpha/2020-September/117547.html I applied the upstream patch to glibc 2.27-3ubuntu1.4 and rebuilt the package, and the problem went away. I've attached the exact patch I applied, since I had to work through some conflicts. So, I think that patch just needs to be backported to Bionic and (I think) Focal as well. Is that reasonable? Thanks! [impact] When resolving DNS names with getaddrinfo(), I have seen this hang for 5 seconds and then retry and succeed. The issue is that glibc will issue a both an A and AAAA query on the same socket, and in some circumstances they can be sent with the same DNS transaction ID as well. [test case] TBD [regression potential] TBD. [original description] I verified this with a packet capture; in the packet capture, I saw the A and AAAA queries for a name be made with the same DNS transaction ID, get responses, do nothing for five seconds, and then send the same DNS query again. On the glibc side, I confirmed that it's blocked waiting for the DNS response by interrupting it with gdb, even though the packet capture shows the response has well and truly arrived. I've attached a packet capture & a backtrace of the glibc hang. I believe this is the same issue reported in these places:     * In RHEL: https://bugzilla.redhat.com/show_bug.cgi?id=1904153     * Also RHEL: https://bugzilla.redhat.com/show_bug.cgi?id=1903880     * Upstream: https://sourceware.org/bugzilla/show_bug.cgi?id=26600 The environment I noticed this bug in was:     * Docker for Mac on an arm64 m1 Macbook     * Docker for Mac Linux kernel version is 5.10.76-linuxkit     * Linux is also arm64, not emulated     * Container with the buggy DNS environment is Ubuntu bionic (also arm64, not emulated)     * Glibc 2.27-3ubuntu1.4 However one of the redhat reporters noticed this issue in m6 series EC2 instances in AWS. A patch has been provided upstream for this issue: https://sourceware.org/pipermail/libc-alpha/2020-September/117547.html I applied the upstream patch to glibc 2.27-3ubuntu1.4 and rebuilt the package, and the problem went away. I've attached the exact patch I applied, since I had to work through some conflicts. So, I think that patch just needs to be backported to Bionic and (I think) Focal as well. Is that reasonable? Thanks!
2022-03-15 23:02:42 Matt Nordhoff bug added subscriber Matt Nordhoff