Error message when starting deployment on large environment

Bug #1546703 reported by Mikhail Chernik
This bug affects 1 person
Affects             Status     Importance  Assigned to      Milestone
Fuel for OpenStack  Confirmed  Medium      Fuel Sustaining
8.0.x               Won't Fix  Medium      MOS Maintenance
Mitaka              Confirmed  Medium      Fuel Sustaining

Bug Description

When starting deployment of an environment with 200 nodes, the message "A server error has occurred" appears in the UI (screenshot attached).

However, this error does not affect the deployment process.

The reason is that the Keep-Alive connection times out and nailgun receives a 504 error from nginx:
[root@fuel ~]# zgrep error /var/log/docker-logs/nginx/error_nailgun.log.1.gz | grep -v SSL
2016/02/17 18:32:54 [error] 310#0: *23422 upstream timed out (110: Connection timed out) while reading response header from upstream, client: 172.18.194.211, server: localhost, request: "PUT /api/clusters/1/changes HTTP/1.1", upstream: "uwsgi://10.20.0.2:8001", host: "172.16.44.10:8443", referrer: "https://172.16.44.10:8443/"
2016/02/17 18:33:28 [error] 310#0: *29820 upstream timed out (110: Connection timed out) while reading response header from upstream, client: 172.18.194.211, server: localhost, request: "GET /api/nodes/allocation/stats?_=1455731484515 HTTP/1.1", upstream: "uwsgi://10.20.0.2:8001", host: "172.16.44.10:8443", referrer: "https://172.16.44.10:8443/"
2016/02/17 18:33:28 [error] 310#0: *29130 upstream timed out (110: Connection timed out) while reading response header from upstream, client: 172.18.194.211, server: localhost, request: "GET /api/notifications?_=1455731484516 HTTP/1.1", upstream: "uwsgi://10.20.0.2:8001", host: "172.16.44.10:8443", referrer: "https://172.16.44.10:8443/"
2016/02/17 18:33:36 [error] 310#0: *29821 upstream timed out (110: Connection timed out) while reading response header from upstream, client: 172.18.194.211, server: localhost, request: "GET /api/nodes?cluster_id=1&_=1455731484517 HTTP/1.1", upstream: "uwsgi://10.20.0.2:8001", host: "172.16.44.10:8443", referrer: "https://172.16.44.10:8443/"
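
To see which API requests are hitting the timeout, the log lines above can be grouped by request. The following is a minimal sketch, not part of Fuel: the function name count_upstream_timeouts and the regular expression are assumptions made for illustration, and the sample lines are shortened copies of the entries shown above.

```python
import re
from collections import Counter

# Matches nginx "upstream timed out" error lines and captures the HTTP
# method and the request path (the query string is dropped).
TIMEOUT_RE = re.compile(
    r'upstream timed out \(110: Connection timed out\)'
    r'.*?request: "(?P<method>[A-Z]+) (?P<path>[^ ?"]+)'
)


def count_upstream_timeouts(lines):
    """Count upstream timeouts per (method, path). Hypothetical helper."""
    counts = Counter()
    for line in lines:
        match = TIMEOUT_RE.search(line)
        if match:
            counts[(match.group("method"), match.group("path"))] += 1
    return counts


# Shortened copies of two entries from error_nailgun.log shown above.
sample = [
    '2016/02/17 18:32:54 [error] 310#0: *23422 upstream timed out '
    '(110: Connection timed out) while reading response header from upstream, '
    'client: 172.18.194.211, server: localhost, '
    'request: "PUT /api/clusters/1/changes HTTP/1.1", '
    'upstream: "uwsgi://10.20.0.2:8001"',
    '2016/02/17 18:33:28 [error] 310#0: *29820 upstream timed out '
    '(110: Connection timed out) while reading response header from upstream, '
    'client: 172.18.194.211, server: localhost, '
    'request: "GET /api/nodes/allocation/stats?_=1455731484515 HTTP/1.1", '
    'upstream: "uwsgi://10.20.0.2:8001"',
]

if __name__ == "__main__":
    for (method, path), count in count_upstream_timeouts(sample).most_common():
        print(count, method, path)
```

On a real master node the lines would come from the decompressed log file rather than from the sample list.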

Mikhail Chernik (mchernik) wrote :
Changed in fuel:
status: New → Confirmed
importance: Undecided → Critical
importance: Critical → Medium
importance: Medium → High
assignee: nobody → Fuel Python Team (fuel-python)
milestone: none → 8.0-updates
tags: added: area-python module-nailgun-agent
Leontii Istomin (listomin) wrote :

Has been reproduced with 9.0-188

Vitaly Sedelnik (vsedelnik) wrote :

Targeted to 9.0 and 10.0 per the latest comment. Also downgraded to Medium, as this is a cosmetic issue.

no longer affects: fuel/newton
Denis Meltsaykin (dmeltsaykin) wrote :

Closing as Won't Fix, as this is a medium-importance, non-customer-found bug.

Roman Rufanov (rrufanov) wrote :

This will become customer-found as soon as large-scale clusters are deployed.
Please consider fixing it before it becomes visible to customers.
