Intermittently can't connect to the server
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Mozilla Firefox |
Invalid
|
Medium
|
|||
firefox (Ubuntu) |
Invalid
|
Undecided
|
Unassigned |
Bug Description
Intermittently while browsing old and new sites, I run into a Server Not Found, "Hmm. We’re having trouble finding that site.", "We can’t connect to the server" page. To workaround I will often hold down the enter key to rapidly re-attempt to get to the page or close the tab, try again, and repeat until I get through. The latter is successful much more often than the former.
Running in a new Firefox profile and in safe mode seems to make no difference.
I had mistakenly made a submission to launchpad without running ubuntu-bug firefox. This is the submission using ubuntu-bug. The old submission is here: https:/
Linux Mint 18.3 Cinnamon
apt-cache policy firefox
firefox:
Installed: 63.0~b13+
ProblemType: Bug
DistroRelease: Linux 18.3
Package: firefox 63.0~b13+
ProcVersionSign
Uname: Linux 4.15.0-36-generic x86_64
AddonCompatChec
ApportVersion: 2.20.1-0ubuntu2.18
Architecture: amd64
AudioDevicesInUse:
USER PID ACCESS COMMAND
/dev/snd/
BuildID: 20181009094547
Channel: beta
CurrentDesktop: X-Cinnamon
Date: Tue Oct 9 15:54:39 2018
DefaultProfileE
DefaultProfileI
DefaultProfileL
DefaultProfileP
DefaultProfileP
/usr/lib/
/usr/lib/
prefs.js
DefaultProfileT
ForcedLayersAccel: False
IfupdownConfig:
# interfaces(5) file used by ifup(8) and ifdown(8)
auto lo
iface lo inet loopback
InstallationDate: Installed on 2017-01-12 (635 days ago)
InstallationMedia: Linux Mint 17.2 "Rafaela" - Release amd64 20150627
IpRoute:
default via 192.168.103.1 dev wlan0 proto static metric 600
169.254.0.0/16 dev wlan0 scope link metric 1000
192.168.103.0/24 dev wlan0 proto kernel scope link src 192.168.103.222 metric 600
MostRecentCrashID: bp-392336ee-
Profile0BrokenP
<email address hidden>
extension-
saved-
saved-
Profile0Extensions: extensions.sqlite corrupt or missing
Profile0Incompa
Profile0Locales: extensions.sqlite corrupt or missing
Profile0Plugins: Shockwave Flash - /usr/lib/
Profile0PrefSou
/usr/lib/
/usr/lib/
prefs.js
Profile0Themes: extensions.sqlite corrupt or missing
Profiles:
Profile0 - LastVersion=
Profile1 (Default) - LastVersion=
RelatedPackageV
RunningIncompat
SourcePackage: firefox
SubmittedCrashIDs: bp-392336ee-
UpgradeStatus: No upgrade log present (probably fresh install)
dmi.bios.date: 10/05/2011
dmi.bios.vendor: Hewlett-Packard
dmi.bios.version: F.1B
dmi.board.
dmi.board.name: 165A
dmi.board.vendor: Hewlett-Packard
dmi.board.version: 10.31
dmi.chassis.
dmi.chassis.type: 10
dmi.chassis.vendor: Hewlett-Packard
dmi.chassis.
dmi.modalias: dmi:bvnHewlett-
dmi.product.family: 103C_5335KV G=N L=CON B=HP S=PAV
dmi.product.name: HP Pavilion dv7 Notebook PC
dmi.product.
dmi.sys.vendor: Hewlett-Packard
Rex (rex.prestige) wrote : | #1 |
- AlsaInfo.txt Edit (29.1 KiB, text/plain; charset="utf-8")
- CRDA.txt Edit (434 bytes, text/plain; charset="utf-8")
- CurrentDmesg.txt Edit (252.0 KiB, text/plain; charset="utf-8")
- DefaultProfilePrefs.txt Edit (2.2 KiB, text/plain; charset="utf-8")
- Dependencies.txt Edit (12.8 KiB, text/plain; charset="utf-8")
- IpAddr.txt Edit (841 bytes, text/plain; charset="utf-8")
- IwConfig.txt Edit (507 bytes, text/plain; charset="utf-8")
- Lspci.txt Edit (13.3 KiB, text/plain; charset="utf-8")
- PciNetwork.txt Edit (1.4 KiB, text/plain; charset="utf-8")
- ProcCpuinfoMinimal.txt Edit (1019 bytes, text/plain; charset="utf-8")
- ProcEnviron.txt Edit (310 bytes, text/plain; charset="utf-8")
- Profile0Prefs.txt Edit (1.4 KiB, text/plain; charset="utf-8")
- PulseList.txt Edit (19.6 KiB, text/plain; charset="utf-8")
- RfKill.txt Edit (118 bytes, text/plain; charset="utf-8")
- WifiSyslog.txt Edit (448.9 KiB, text/plain; charset="utf-8")
Launchpad Janitor (janitor) wrote : | #2 |
Changed in firefox (Ubuntu): | |
status: | New → Confirmed |
Rex (rex.prestige) wrote : | #3 |
The issue persists in 63.0~b14+
I am continuing to run chromium without this issue.
Olivier Tilloy (osomon) wrote : | #4 |
@Rex: can you download the official firefox beta build at https:/
If so, would you mind filing a bug upstream at https:/
Thanks!
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #6 |
User Agent: Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Ubuntu Chromium/
Steps to reproduce:
1. Download the current beta from https:/
2. Unpack firefox.
3. Run firefox.
4. Navigate to several different sites in a row on the same tab.
Actual results:
Some of the sites display as "Server not found", "we can't connect to the server at" the site address. Refreshing the page sometimes displays the site instead of "Server not found". When not, opening a new tab and entering the same address sometimes displays the site instead of "Server not found".
Expected results:
Sites load without "Server not found" events.
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #7 |
ubuntu-bug provides the following information:
ProblemType: Bug
DistroRelease: Linux 18.3
Package: firefox 63.0~b13+
ProcVersionSign
Uname: Linux 4.15.0-36-generic x86_64
AddonCompatChec
ApportVersion: 2.20.1-0ubuntu2.18
Architecture: amd64
AudioDevicesInUse:
USER PID ACCESS COMMAND
/dev/snd/
BuildID: 20181009094547
Channel: beta
CurrentDesktop: X-Cinnamon
Date: Tue Oct 9 15:54:39 2018
DefaultProfileE
DefaultProfileI
DefaultProfileL
DefaultProfileP
DefaultProfileP
/usr/lib/
/usr/lib/
prefs.js
DefaultProfileT
ForcedLayersAccel: False
IfupdownConfig:
# interfaces(5) file used by ifup(8) and ifdown(8)
auto lo
iface lo inet loopback
InstallationDate: Installed on 2017-01-12 (635 days ago)
InstallationMedia: Linux Mint 17.2 "Rafaela" - Release amd64 20150627
IpRoute:
default via 192.168.103.1 dev wlan0 proto static metric 600
169.254.0.0/16 dev wlan0 scope link metric 1000
192.168.103.0/24 dev wlan0 proto kernel scope link src 192.168.103.222 metric 600
MostRecentCrashID: bp-392336ee-
Profile0BrokenP
<email address hidden>
extension-
saved-
saved-
Profile0Extensions: extensions.sqlite corrupt or missing
Profile0Incompa
Profile0Locales: extensions.sqlite corrupt or missing
Profile0Plugins: Shockwave Flash - /usr/lib/
Profile0PrefSou
/usr/lib/
/usr/lib/
prefs.js
Profile0Themes: extensions.sqlite corrupt or missing
Profiles:
Profile0 - LastVersion=
Profile1 (Default) - LastVersion=
RelatedPackageV
RunningIncompat
SourcePackage: firefox
SubmittedCrashIDs: bp-392336ee-
UpgradeStatus: No upgrade log present (probably fresh install)
dmi.bios.date: 10/05/2011
dmi.bios.vendor: Hewlett-Packard
dmi.bios.version: F.1B
dmi.board.
dmi.board.name: 165A
dmi.board.vendor: Hewlett-Packard
dmi.board.version: 10.31
dmi.chassis.
dmi.chassis.typ...
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #8 |
Ah! Sorry, the ubuntu-bug information was for a different version of firefox that I was testing before the steps provided in this bug. I apologize for muddling this issue.
Rex (rex.prestige) wrote : | #5 |
Thanks Olivier. I was able to reproduce the issue with that firefox (63.0b14), and the upstream bug can be found at https:/
In Mozilla Bugzilla #1499825, Peter-magyari (peter-magyari) wrote : | #9 |
Hi,
I wasn't able to reproduce this issue, could you specify some of the websites you've visited where this occurred?
Also, could you test if the issue is reproducible in safe mode, here is a link that can help you do that:
https:/
If the issue still occurs please try using a new profile, you can find the steps here:
https:/
Thank you!
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #10 |
Hi Peter!
Thank you for your help on this issue.
I was able to reproduce the issue with the following site "chain":
bing.com -> google.com -> yahoo.com -> reddit.com
I noticed that yahoo.com only downloaded partially when I attempted to move to the reddit.com.
I moved into safe mode, which automatically loaded reddit.com, and when I tried to go to bing.com, "Server not found" was displayed.
I started a new profile, which automatically loaded the Firefox privacy notice page, and from there I used the following site "chain" to reproduce the issue:
firefox privacy notice page -> google.com -> bing.com -> google.com -> yahoo.com -> reddit.com -> msnbc.com
Again, reddit.com had only loaded partially.
I look forward to anything further that I can provide.
In case it is helpful, I am linking the launchpad bug that I had reported previously. That reported issue is a duplicate of another report confirmed by one other user:
https:/
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #11 |
The issue persists in 64.0b3.
I was moving through a site similar to the above chain without issue, but when I clicked my email link to get to this page, I was presented with "Server not found". Refreshing the page and trying in a new tab gave the same page. Closing the new and original tab and trying again with a new tab got me through.
In Mozilla Bugzilla #1499825, Paul White (paulw2u) wrote : | #12 |
Just for the record, I was the other user that confirmed this problem on Launchpad. I used to see this issue very frequently using Ubuntu 18.04 and later with Ubuntu 18.10 on one laptop, and with Xubuntu 18.04 on another. The first laptop was completely wiped several days ago and Ubuntu was 18.04 installed.
After extensive testing I have not seen this issue again on either laptop so I am unable to confirm the bug here. If I do encounter this problem again then I will of course come back and confirm the bug report if no-one else has already done so.
In Mozilla Bugzilla #1499825, Peter-magyari (peter-magyari) wrote : | #13 |
Hi,
I'm still unable to reproduce the issue you described.
Could you please try and reproduce it on the latest Firefox Beta 64.0(2018120521
Thanks!
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #14 |
Hi Peter!
Can you clarify how to get the 20181205... and 20181207... versions? The issue persists for me on both versions downloaded at the time of posting from https:/
Thanks!
Changed in firefox: | |
importance: | Unknown → Medium |
status: | Unknown → New |
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #15 |
I'm clearing the needinfo request since I cannot provide info without assistance myself.
In Mozilla Bugzilla #1499825, Honzab-moz (honzab-moz) wrote : | #16 |
rex, could you please provide HTTP logs as described here:
https:/
it may shade some light on this issue. thanks!
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #17 |
Hi Honza! Thanks for the tip!
I'm currently unable to reproduce the issue at will with the current 65.0b8 beta. I will continue to use it until I notice an issue and then try to reproduce it while logging.
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #18 |
Created attachment 9034839
log.txt-main.8416
Started logging after the issue started while attempting to visit CNN.com. Was given "We can’t connect to the server at www.cnn.com" page. Started logging and hit refresh, and was shown the same page. Then stopped logging.
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #19 |
Created attachment 9034842
log.txt-main.9860
Aha! I started the log right before attempting to connect to CNN.com, received the "can't connect to the server" page, and stopped logging.
In Mozilla Bugzilla #1499825, Honzab-moz (honzab-moz) wrote : | #20 |
(In reply to rex from comment #13)
> Created attachment 9034842
> log.txt-main.9860
>
> Aha! I started the log right before attempting to connect to CNN.com, received the "can't connect to the server" page, and stopped logging.
Thank you.
Looks like your local resolver has an issue:
```
2019-01-07 22:07:23.681 ⁃ nsHttpChannel ⁃ 7f7450f0a800 ⁃ finished ⁃ status=804b001e ⁃ http-status=n/a ⁃ url=https:/
2019-01-07 22:07:23.691 ⁃ nsSocketTransport ⁃ 7f74521a4c00 ⁃ released ⁃ origin=
2019-01-07 22:07:23.691473 UTC - [Parent 9860: Socket Thread]: D/nsSocketTransport creating nsSocketTransport @0x7f74521a4c00
2019-01-07 22:07:23.691482 UTC - [Parent 9860: Socket Thread]: E/nsSocketTransport nsSocketTranspo
2019-01-07 22:07:23.691500 UTC - [Parent 9860: Socket Thread]: D/nsSocketTransport nsSocketTranspo
flags = 0
2019-01-07 22:07:23.691525 UTC - [Parent 9860: Socket Thread]: D/nsSocketTransport nsSocketTranspo
2019-01-07 22:07:23.691533 UTC - [Parent 9860: Socket Thread]: D/nsSocketTransport nsSocketTranspo
2019-01-07 22:07:23.691557 UTC - [Parent 9860: Socket Thread]: D/nsSocketTransport nsSocketTranspo
2019-01-07 22:07:23.691565 UTC - [Parent 9860: Socket Thread]: D/nsSocketTransport nsSocketTranspo
nsHalfOpenSocket @7f7454d67ae0 --> nsSocketTransport @7f74521a4c00
2019-01-07 22:07:23.692309 UTC - [Parent 9860: Socket Thread]: D/nsSocketTransport nsSocketTranspo
2019-01-07 22:07:23.692324 UTC - [Parent 9860: Socket Thread]: D/nsSocketTransport nsSocketTranspo
2019-01-07 22:07:23.692335 UTC - [Parent 9860: Socket Thread]: E/nsSocketTransport nsSocketTranspo
804b0003 = STATUS_RESOLVING
2019-01-07 22:07:23.692 ⁃ nsHostResolver ⁃ log.txt-
2019-01-07 22:07:23.692387 UTC - [Parent 9860: Socket Thread]: D/nsHostResolver Resolving host [www.cnn.com] type 0. [this=0x7f74867
2019-01-07 22:07:23.692414 UTC - [Parent 9860: Socket Thread]: D/nsHostResolver No usable record in cache for host [www.cnn.com] type 0.
2019-01-07 22:07:23.692460 UTC - [Parent 9860: Socket Thread]: D/nsHostResolver DNS thread counters: total=2 any-live=0 idle=2 pending=1
2019-01-07 22:07:23.692484 UTC - [Parent 9860: Socket Thread]: D/nsHostResolver DNS lookup for host [www.cnn.com] blocking pending 'getaddrinfo' or trr query: callback [0x7f7454e6b700]
2019-01-07 22:07:23.692506 UTC - [Parent 9860: DNS Resolver #2]: E/nsHostResolver DNS lookup thread - Calling getaddrinfo for host [www.cnn.com].
2019-01-07 22:07:23.692511 UTC - [Parent 9860: Socket Thread]: D/nsSocketTransport nsSocketTranspo
2019-01-07 22:07:23.69264...
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #21 |
Hey Honza!
rex@ofc-ate ~ $ nslookup www.cnn.com
Server: 127.0.1.1
Address: 127.0.1.1#53
Non-authoritative answer:
www.cnn.com canonical name = turner-
Name: turner-
Address: 151.101.1.67
Name: turner-
Address: 151.101.65.67
Name: turner-
Address: 151.101.129.67
Name: turner-
Address: 151.101.193.67
My DNS is set to automatic and uses what my isp provides: 217.23.2.11 and 8.8.8.8 .
In Mozilla Bugzilla #1499825, Honzab-moz (honzab-moz) wrote : | #22 |
Thanks. Dragana, Michal, Valentin, any ideas what more to check here?
In Mozilla Bugzilla #1499825, Dd-mozilla (dd-mozilla) wrote : | #23 |
(In reply to Honza Bambas (:mayhemer) from comment #16)
> Thanks. Dragana, Michal, Valentin, any ideas what more to check here?
The only suggestion I have is to run nslookup www.cnn.com when problem occurs.
You os does not give us a ip address for www.cnn.com, we cannot do much about it.
In Mozilla Bugzilla #1499825, Valentin-gosu (valentin-gosu) wrote : | #24 |
(In reply to Dragana Damjanovic [:dragana] from comment #17)
> (In reply to Honza Bambas (:mayhemer) from comment #16)
>
> > Thanks. Dragana, Michal, Valentin, any ideas what more to check here?
>
> The only suggestion I have is to run nslookup www.cnn.com when problem occurs.
> You os does not give us a ip address for www.cnn.com, we cannot do much about it.
As a workaround you can try DNSoverHTTPS (also called TRR).
This bypasses the system resolver, and uses an external one.
https:/
https:/
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #25 |
I ran into this issue again trying to get to google.com.
The nslookup appeared to be successful, but the issue persisted after the seemingly successful lookup:
Server: 127.0.1.1
Address: 127.0.1.1#53
Non-authoritative answer:
Name: www.google.com
Address: 172.217.9.164
I will try out DNSoverHTTPS! Neat suggestion!
In Mozilla Bugzilla #1499825, Michal-novotny (michal-novotny) wrote : | #26 |
(In reply to Honza Bambas (:mayhemer) from comment #16)
> Thanks. Dragana, Michal, Valentin, any ideas what more to check here?
In bug 1439780 there is a simple program (hook3.c) that hooks resolver functions and dumps info to standard error, see https:/
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #27 |
I had some issues with DNSoverHTTPS. I'm thinking firefox had intermittent difficulty resolving the host from which I was supposed to do dns services with.
For the hook3.c program, I've run it with firefox normally for the few business days since the time of Michal's post without the issue recurring. I've even redownloaded that 65.0b8 meta and turned off updating to ensure my variables are not changing. DNS servers are the same. Hopefully like last time the issue will spring up when I mention it. Anyway, please know that I am continuing to test and will post if I have news.
In Mozilla Bugzilla #1499825, Valentin-gosu (valentin-gosu) wrote : | #28 |
(In reply to rex from comment #21)
> I had some issues with DNSoverHTTPS. I'm thinking firefox had intermittent difficulty resolving the host from which I was supposed to do dns services with.
You can use the network.
You can use `dig mozilla.
https:/
> For the hook3.c program, I've run it with firefox normally for the few business days since the time of Michal's post without the issue recurring. I've even redownloaded that 65.0b8 meta and turned off updating to ensure my variables are not changing. DNS servers are the same. Hopefully like last time the issue will spring up when I mention it. Anyway, please know that I am continuing to test and will post if I have news.
Thank you for this! We are really grateful. I'm sure this kind of hard-to-reproduce bug can be annoying to a lot of users, and any little help in finding out the root cause is great!
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #29 |
Hey Michal, sorry if I completely misunderstand, but does your hook ask the system to resolve the address before firefox makes the attempt? I have a hunch that, where firefox would fail to resolve an address, if another program or service had already resolved the address, firefox would succeed.
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #30 |
Where I've spent a little under three business days running the firefox beta and hook with no issues, I was able to reproduce the issue without the hook in a minute.
Again I apologize for the following speculation about something I completely do not understand. I think this issue occurs when firefox asks to resolve a new name. If the hook checks whether the system can resolve the new name first, then firefox might "only" need to resolve an already resolved name, which is consistently successful. Thus days of success with the hook and immediate failure without.
Does this make sense for how firefox and the hook actually work?
In Mozilla Bugzilla #1499825, Kershaw-6 (kershaw-6) wrote : | #31 |
:michal, could you take a look at comment #24?
Do you have any idea why this issue is not reproducible with the hook? Thanks.
In Mozilla Bugzilla #1499825, Michal-novotny (michal-novotny) wrote : | #32 |
(In reply to rex from comment #23)
> Hey Michal, sorry if I completely misunderstand, but does your hook ask the system to resolve the address before firefox makes the attempt? I have a hunch that, where firefox would fail to resolve an address, if another program or service had already resolved the address, firefox would succeed.
The utility just intercepts calls to gethostbyname, gethostbyname_r and getaddrinfo. It calls the original function and prints the result to stderr. I have no idea why it's not reproducible with it. It might slightly change the timing but it should be negligible.
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #33 |
Thanks for the explanation, :michal. Is there any chance that the hook utility could be calling the functions differently than firefox (especially if it was doing so incorrectly somehow) or is using a different process to do so? I'm repeating similar results today. Firefox+hook resolves consistently, firefox alone does not.
I'm imagining this bug could be something about my system intermittently ignoring firefox's calls or improperly responding to firefox, where the utility and chromium work consistently.
In Mozilla Bugzilla #1499825, Michal-novotny (michal-novotny) wrote : | #34 |
Functions from glibc should be called in both cases (with the LD_PRELOAD as well as without it). Do you run the same firefox binary with the same environment? I.e. when running firefox without the hook, do you run it from console as well? Also, does LD_PRELOAD variable contain anything by default?
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #35 |
Good questions Michal. I had been running firefox without the hook outside the console, where I was running with the hook in the console. Turns out the issue persists without the hook in the console though.
I couldn't find how to determine what my LD_PRELOAD variable is by default, but I ran a script with 'echo "LD_PRELOAD IS ${LD_PRELOAD}"' and starting up firefox which did not return anything after "LD_PRELOAD IS ". Not sure if that is applicable though. Is there a different way I should check?
In Mozilla Bugzilla #1499825, Michal-novotny (michal-novotny) wrote : | #36 |
> I couldn't find how to determine what my LD_PRELOAD variable is by default, but I ran a script with 'echo "LD_PRELOAD IS ${LD_PRELOAD}"' and starting up firefox which did not return anything after "LD_PRELOAD IS ". Not sure if that is applicable though. Is there a different way I should check?
Running echo "${LD_PRELOAD}" would show it. That's really strange. If you comment out getaddrinfo from hook3.c (or just rename it to some random name) but keep gethostbyname and gethostbyname_r as they are, can you reproduce the problem?
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #37 |
I've made a newhook3.c, renaming getaddrinfo to dontgetaddrinfo (heh), used the gcc command to produce a newhook3.so, and using that new .so, I am able to reproduce the issue.
In Mozilla Bugzilla #1499825, Michal-novotny (michal-novotny) wrote : | #38 |
Created attachment 9038810
hook3_test1
Thanks for doing the test. Please, try this new file. It only intercepts the function call but does nothing with the result.
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #39 |
With this one it looks like I cannot resolve any address.
In Mozilla Bugzilla #1499825, Dd-mozilla (dd-mozilla) wrote : | #40 |
(In reply to rex from comment #21)
> I had some issues with DNSoverHTTPS. I'm thinking firefox had intermittent difficulty resolving the host from which I was supposed to do dns services with.
>
I would like to understand this: Can you explain what was the issue? How did firefox behave: firefox could not connect to web sites?
In Mozilla Bugzilla #1499825, Michal-novotny (michal-novotny) wrote : | #41 |
It should work. Did it compile without any warning?
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #42 |
:michal, it took about 5 seconds to compile and did not return anything to the terminal.
:dragana, yes, I did not seem to be able to resolve any addresses and only hit server not found pages. Though I had not tested with an IP address as my network.trr.uri, testing now shows the same thing. To be transparent, my only changes are network.trr.mode to 3, and network.trr.uri as default or to 104.16.249.249 or 1.1.1.1, and the rest of network.trr.* default. Maybe I am missing or misusing a setting?
In Mozilla Bugzilla #1499825, Michal-novotny (michal-novotny) wrote : | #43 |
(In reply to rex from comment #36)
> :michal, it took about 5 seconds to compile and did not return anything to the terminal.
So, how the output written to stderr by hook3.so (which resolves correctly) looks like?
In Mozilla Bugzilla #1499825, Dd-mozilla (dd-mozilla) wrote : | #44 |
(In reply to rex from comment #36)
> :michal, it took about 5 seconds to compile and did not return anything to the terminal.
>
> :dragana, yes, I did not seem to be able to resolve any addresses and only hit server not found pages. Though I had not tested with an IP address as my network.trr.uri, testing now shows the same thing. To be transparent, my only changes are network.trr.mode to 3, and network.trr.uri as default or to 104.16.249.249 or 1.1.1.1, and the rest of network.trr.* default. Maybe I am missing or misusing a setting?
mode 3 is only using doh. mode 2 will fall back to dns if doh is not working. I recommend to use mode 2.
I am also curious to know why doh mode 3 is not working. Can you make me a http log:
https:/
Thanks!
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #45 |
Hi :michal, using the gcc command for either one now is about instantaneous and produces no stderr output.
Hi :dragana, I agree mode 2 would make sense long term, but my intentions for testing was pure doh since currently dns isn't a solid fallback. I'll attach the log in just a moment.
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #46 |
Created attachment 9038886
Main HTTP log for doh only for google.com
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #47 |
Created attachment 9038888
Child HTTP log for doh only for google.com
In Mozilla Bugzilla #1499825, Michal-novotny (michal-novotny) wrote : | #48 |
(In reply to rex from comment #39)
> Hi :michal, using the gcc command for either one now is about instantaneous and produces no stderr output.
I meant the output when you run it. You should have output like this in the console:
getaddrinfo node=detectport
hints.ai_family=0
hints.
hints.
hints.ai_flags=32
IPv4 address: 23.219.91.81
IPv4 address: 23.219.91.27
IPv6 address: 2a02:26f0:
IPv6 address: 2a02:26f0:
getaddrinfo node=tiles.
hints.ai_family=0
hints.
hints.
hints.ai_flags=32
IPv4 address: 52.25.70.97
IPv4 address: 52.26.103.165
IPv4 address: 52.39.131.77
IPv4 address: 52.41.60.30
IPv4 address: 52.24.236.113
IPv4 address: 35.160.41.125
IPv4 address: 52.41.78.152
IPv4 address: 52.34.107.172
In Mozilla Bugzilla #1499825, Michal-novotny (michal-novotny) wrote : | #49 |
Could you please also provide you DNS setup? Content of /etc/resolv.conf and line beginning with "hosts" from /etc/nsswitch.conf. If your resolv.conf contains 127.0.0.53 then also an output from "systemd-resolve --status".
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #50 |
Ah, thanks for clarifying Michal.
The original hook3 running firefox provides output like yours:
gethostbyname_
address: 127.0.1.1
gethostbyname_
address: 127.0.1.1
getaddrinfo node=detectport
hints.ai_family=0
hints.
hints.
hints.ai_flags=32
IPv4 address: 23.54.163.42
IPv4 address: 23.54.163.75
IPv6 address: 2600:1404:
IPv6 address: 2600:1404:
etc.
My edited hook3 to rename gethostbyname running firefox only outputs:
gethostbyname_
address: 127.0.1.1
gethostbyname_
address: 127.0.1.1
The latest hook3_test1 outputs nothing.
/etc/resolv.conf
nameserver 127.0.1.1
"hosts" from /etc/nsswitch.conf
hosts: files wins dns
I attempted systemd-resolve --status just in case, but received:
systemd-resolve: unrecognized option '--status'
In Mozilla Bugzilla #1499825, Michal-novotny (michal-novotny) wrote : | #51 |
(In reply to rex from comment #44)
> The original hook3 running firefox provides output like yours:
> gethostbyname_
> address: 127.0.1.1
> gethostbyname_
> address: 127.0.1.1
> getaddrinfo node=detectport
> hints.ai_family=0
> hints.ai_socktype=1
> hints.ai_protocol=0
> hints.ai_flags=32
> IPv4 address: 23.54.163.42
> IPv4 address: 23.54.163.75
> IPv6 address: 2600:1404:
> IPv6 address: 2600:1404:
> etc.
This looks normal.
> /etc/resolv.conf
> nameserver 127.0.1.1
>
> "hosts" from /etc/nsswitch.conf
> hosts: files wins dns
>
> I attempted systemd-resolve --status just in case, but received:
> systemd-resolve: unrecognized option '--status'
You could try:
resolvectl status
127.0.1.1 is a local address so something must be listening and forwarding the requests. You can use following command to find out, what it is:
sudo netstat -u -l -p -n | grep -e ":53[[:space:]]"
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #52 |
$ resolvectl status
resolvectl: command not found
$ sudo netstat -u -l -p -n | grep -e ":53[[:space:]]"
udp 0 0 127.0.1.1:53 0.0.0.0:* 2325/dnsmasq
In Mozilla Bugzilla #1499825, Michal-novotny (michal-novotny) wrote : | #53 |
You could try to put ISP's DNS servers to /etc/resolv.conf to make sure dnsmasq doesn't mess anything up.
I still don't understand why nothing is resolved when hook3_test1.so is preloaded. I will try to come with something else later.
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #54 |
I think I've run into some voodoo.
I updated my /etc/resolv.conf to:
nameserver 217.23.2.11
nameserver 8.8.8.8
nameserver 127.0.1.1
and then ran:
sudo /etc/init.
as described here: https:/
In using firefox without any hook, I still experience this issue.
In Mozilla Bugzilla #1499825, Michal-novotny (michal-novotny) wrote : | #55 |
Could you please put only "nameserver 217.23.2.11" to /etc/resolv.conf and provide HTTP log (the same as in comment #13) and dump of DNS packets when the failure happens? Use tcpdump to dump the DNS packets:
sudo tcpdump -w /tmp/dump.pcap -s 0 udp and port 53 and host 217.23.2.11
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #56 |
Gladly!
With only 'nameserver 217.23.2.11' in my /etc/resolv.conf and running sudo /etc/init.
With only 'nameserver 8.8.8.8' in my /etc/resolv.conf and running sudo /etc/init.
I've attached the resulting dump.pcap below. In it it shows that several hostnames were successfully resolved. At the end, it failed to resolve cnn.com but is not shown in the dump.
I then did an extra tcpdump with 'sudo tcpdump -w /tmp/dump.pcap -s 0 udp and port 53' with google.com failing at the end, but again, it is not shown.
My naive interpretation is that firefox is simply not sending a dns request for some reason.
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #57 |
Created attachment 9039811
tcpdump for 8.8.8.8 dns with cnn.com failing at the end
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #58 |
Created attachment 9039812
tcpdump for port 53 with google.com failing at the end
In Mozilla Bugzilla #1499825, Michal-novotny (michal-novotny) wrote : | #59 |
(In reply to rex from comment #50)
> Gladly!
> With only 'nameserver 217.23.2.11' in my /etc/resolv.conf and running sudo /etc/init.
In comment #13 you wrote that 217.23.2.11 and 8.8.8.8 are DNS servers provided automatically by ISP. It's bad, if they really return non-working DNS server in DHCP response. Check using dig utility that it really doesn't work:
dig www.cnn.com @217.23.2.11
If it times out or returns an error, get rid of this DNS server in the system configuration. I.e. manually set DNS servers to 8.8.8.8 and 8.8.4.4.
> With only 'nameserver 8.8.8.8' in my /etc/resolv.conf and running sudo /etc/init.
>
> I've attached the resulting dump.pcap below. In it it shows that several hostnames were successfully resolved. At the end, it failed to resolve cnn.com but is not shown in the dump.
>
> I then did an extra tcpdump with 'sudo tcpdump -w /tmp/dump.pcap -s 0 udp and port 53' with google.com failing at the end, but again, it is not shown.
I meant to provide both logs, tcpdump and HTTP log from firefox, where we could see that we tried to resolve the address.
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #60 |
Thanks Michal. From using the dig command, it seems clear that what my ISP is providing is not a working DNS server. I have changed my top level router to distribute 8.8.8.8, 8.8.4.4, and 1.1.1.1 as my DNS servers, and it appears to have propagated correctly.
I've set my /etc/resolv.conf to just 'nameserver 8.8.8.8' and ran 'sudo /etc/init.
Right now I'm having difficulty getting "good" logs. At first, all I was getting was 'server not found' with nothing in my pcap, so I thought I had messed up something. Now I'm getting only successful resolutions. I'll try again in a few hours.
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #61 |
Created attachment 9040193
HTTP log for failed attempt for reddit.com
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #62 |
Created attachment 9040195
pcap for failed request for reddit.com
Was built via 'sudo tcpdump -w /tmp/dump.pcap -s 0 udp and port 53' and was active when requesting reddit.com. Appears to be empty.
In Mozilla Bugzilla #1499825, Michal-novotny (michal-novotny) wrote : | #63 |
Can you please try a special build where I've added a code to log error returned by GetAddrInfo? The try push is here:
https:/
Linux builds can be downloaded from:
32-bit - https:/
64-bit - https:/
The error is logged in GetAddrInfo module, so it has to be added to MOZ_LOG. Following MOZ_LOG is sufficient:
timestamp,
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #64 |
Thanks for the special build, Michal! I've downloaded the 64 bit Linux build you linked, and am running the following script to produce the desired log file:
#! /bin/bash
export MOZ_LOG=
export MOZ_LOG_
./firefox
However, I have not been able to reproduce the issue yet whether I use this script or the binary file directly. I will continue to use the binary today to see if it recurs, and if so, then the script to try to capture that issue. I'll also download the vanilla version of this firefox in case there's some difference there.
I notice this is a different version than the 65.0b8 version that I was testing with. Do we think it is better to continue testing in the same 65.0b8 environment or stay current in case the issue was resolved elsewhere?
In Mozilla Bugzilla #1499825, Michal-novotny (michal-novotny) wrote : | #65 |
Give a try to both versions. And if it's not reproducible in this 66.0a1 build but is reproducible in 65.0b8, I'll create a testing build from the same source tree.
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #66 |
I am able to reproduce the issue with the plain 66.0a1 alpha from a shell script or otherwise.
I am not able to reproduce the issue with the plain 66.0a1 alpha from a shell script that calls out:
export MOZ_LOG=
export MOZ_LOG_
I am not able to reproduce the issue with the special build of 66.0a1, with or without the MOZ definitions.
In Mozilla Bugzilla #1499825, Michal-novotny (michal-novotny) wrote : | #67 |
(In reply to rex from comment #60)
> I am able to reproduce the issue with the plain 66.0a1 alpha from a shell script or otherwise.
>
> I am not able to reproduce the issue with the plain 66.0a1 alpha from a shell script that calls out:
> export MOZ_LOG=
> export MOZ_LOG_
Since this is not a crash, sync isn't necessary. Timing won't be changes so much without sync, so please try timestamp,
> I am not able to reproduce the issue with the special build of 66.0a1, with or without the MOZ definitions.
Hmm, I would expect the same behavior as in the official 66.0a1. This strange behavior makes finding the problem really hard.
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #68 |
Created attachment 9040503
ZIP file of HTTP logs, success at reaching google.com, fail at reaching bing.com
These logs were produced using the plain 66.0a1 firefox with MOZ_LOG=
In Mozilla Bugzilla #1499825, Michal-novotny (michal-novotny) wrote : | #69 |
Thanks, unfortunately this log doesn't contain useful information. I suggested it mainly to check if removing sync helps with reproducing. Helpful would be the log from my version which logs the error code.
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #70 |
I see. I attempted the special build with MOZ_LOG=
In Mozilla Bugzilla #1499825, Michal-novotny (michal-novotny) wrote : | #71 |
Created attachment 9040578
log_getaddrinfo
You could give this version a try. It only prints error message in case of failure. Use it as the other versions, i.e. compile with:
gcc -shared -ldl -fPIC log_getaddrinfo
And run it with the official firefox:
LD_PRELOAD=
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #72 |
Thanks Michal. Are you interested in just the terminal output, or would you still like the HTTP log with MOZ_LOG set to something?
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #73 |
Created attachment 9040820
2019-02-01 Fail to Resolve Google
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #74 |
Created attachment 9040821
2019-02-01 HTTP Log
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #75 |
Created attachment 9040823
2019-02-01 HTTP Log-Child
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #76 |
Firefox 66.0a1 running with with that LD_Preload appears to not be able to resolve any address. :(
In Mozilla Bugzilla #1499825, Michal-novotny (michal-novotny) wrote : | #77 |
Created attachment 9041202
test.tgz
getaddrinfo fails with EAI_SYSTEM and errno should contain details, but it's 0.
Could you please extract the attached archive and run "make" in the extracted directory? It should generate few files in logs directory. I'm curious whether getaddrinfo called from python code fails as well and maybe I'll see something interesting in the strace log.
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #78 |
Can do, Michal. Below is what printed in the terminal, and the attached logs follow.
gcc -shared -ldl -fPIC log_getaddrinfo
rm -f logs/*.txt
./run_test.sh
Makefile:5: recipe for target 'test' failed
make: *** [test] Error 1
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #79 |
Created attachment 9041206
2019-02-04logs.zip
In Mozilla Bugzilla #1499825, Michal-novotny (michal-novotny) wrote : | #80 |
Is there any change if you remove wins resolving? I.e. in /etc/nsswitch.conf change line "hosts: files wins dns" to "hosts: files dns" and rerun the test.
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #81 |
Yes, there are changes! Below is the terminal output, and the attached logs follow.
gcc -shared -ldl -fPIC log_getaddrinfo
rm -f logs/*.txt
./run_test.sh
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #82 |
Created attachment 9041298
2019-02-04 afternoon logs.zip
In Mozilla Bugzilla #1499825, Michal-novotny (michal-novotny) wrote : | #83 |
Now, it seems to resolve normally. I guess firefox shouldn't fail anymore. Please, test it for a while without wins resolving. I'm not sure what's exactly wrong, but according to the rule in nsswitch.conf dns request should be sent when the host isn't found using wins and because no request is sent, there is probably some fatal failure in wins nss module.
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #84 |
Good find Michal! I'm not able to easily reproduce the issue in the old beta or 66.0a1 alpha. I will continue my normal browsing across the firefox versions and report if it recurs.
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #85 |
Firefox continues to resolve addresses correctly.
Do we know of a difference between Firefox and Chromium in how they interact with wins? This may provide further answers as to why Firefox has difficulty with wins in my system whereas Chromium does not.
In Mozilla Bugzilla #1499825, Michal-novotny (michal-novotny) wrote : | #86 |
Created attachment 9042103
test with pauses
(In reply to rex from comment #79)
> Firefox continues to resolve addresses correctly.
>
> Do we know of a difference between Firefox and Chromium in how they interact with wins? This may provide further answers as to why Firefox has difficulty with wins in my system whereas Chromium does not.
A quick test shows that chrome uses getaddrinfo just like firefox. Given that first getaddrinfo in the python test succeeded and also that firefox resolves correctly with hook3.so (which adds some delay by printing result to console), the problem might be when subsequent call to getaddrinfo is called too quickly after previous call and something in wins code doesn't have time to fail and clean up. Could you please do another test?
- put wins back to nsswitch.conf
- run the old test to make sure it still fails
- run this new test which sleeps between getaddrinfo calls
Also, would you mind sharing your /etc/samba/smb.conf ?
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #87 |
Good thinking Michal.
I've put wins back in nsswitch.conf.
Running "make" in the new test directory seems to give good terminal output:
gcc -shared -ldl -fPIC log_getaddrinfo
rm -f logs/*.txt
./run_test.sh
A zip file of the new resulting logs and my smb.conf file follow.
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #88 |
Created attachment 9042128
2019-02-
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #89 |
Created attachment 9042129
smb.conf
Current smb.conf file with active shares removed.
In Mozilla Bugzilla #1499825, Honzab-moz (honzab-moz) wrote : | #90 |
Michal, I'm assigning this to you just to get it off the triage list, leaving UNCONFIRMED. Feel free to revert or anything you seem more fit for this bug. Thanks.
In Mozilla Bugzilla #1499825, Michal-novotny (michal-novotny) wrote : | #91 |
I'm closing this as WFM because the bug isn't limited only to firefox. Any application that calls getaddrinfo quickly enough will trigger it.
Rex, I didn't find anything strange in any config file you've posted. I tried to reproduce it on Fedora and Valentin tried it on Ubuntu, but we were unable to reproduce it. If you would like to track the bug down to nsswitch level, I would recommend to run the python test with rr (https:/
In Mozilla Bugzilla #1499825, Rex (rex.prestige) wrote : | #92 |
No worries, Michal. Thank you for your help diagnosing to this level. I'm rolling along just fine with WINS removed from my nsswitch.conf and am happy to have helped reach a conclusion on this bug.
Changed in firefox: | |
status: | New → Invalid |
Paul White (paulw2u) wrote : | #93 |
Upstream report closed "RESOLVED WORKSFORME" on 2019-02-18
Issue resolved by making changes outside of Firefox
Closing as "Invalid" - status to match upstream status
Changed in firefox (Ubuntu): | |
status: | Confirmed → Invalid |
Status changed to 'Confirmed' because the bug affects multiple users.