Fuel for OpenStack

Cobbler max length of kickstart/preseed 110kb ± 24hdd

Bug #1382364 reported by Andrey Kirilochkin on 2014-10-17

This bug report is a duplicate of: Bug #1340414: [provision] Large number of disks could fail ubuntu installation. Edit Remove

This bug affects 2 people

Affects		Status	Importance	Assigned to	Milestone
	Fuel for OpenStack	Invalid	Medium	Vladimir Kozhukalov	Fuel for OpenStack 6.1

Bug Description

Hi guys,
Found one interesting bug, about big kick-start/preseed file.

I want to deploy MOS with 10 Ceph nodes.

1. Success story: On 5 of them I have only 5 disks, each disk has additional partition for journal. Provisioning of these nodes works fine.
2. On node (by my fault) has 25 disks and journal in a file. I just forgot create additional partition. Provision of this node works fine too.
3. Unsuccessful: On other 4 nodes I have 25 disks with additional partition for ceph-journal.

When i started to search for solution i found that:

1. Preseed file in first case was: ±80kb.
2. In second case: 110kb.
3. In third case: 155kb.

It seems that we have linux limitation of length of the one string.

When I have removed 4 drives from those unsuccessful nodes, provision work fine.

So guys, it seems that we should create something like "helper.sh" that will be downloaded during the provision. This "helper" should have short command-line parameters like this:

/tmp/part_create.sh 1:10GB:xfs:/var/lib/ceph/osd-1/journal 2:100GB:xfs:/var/lib/ceph/osd-1

It is my vision, how to fix this.

See original description

Tags:

Mike Scherbakov (mihgen) on 2014-10-17

Changed in fuel:
milestone:	none → 6.0
tags:	added: provision

Andrey Kirilochkin (andreika-mail) on 2014-10-17

description:

updated

Nastya Urlapova (aurlapova) on 2014-10-17

Changed in fuel:
importance:	Undecided → Medium

Revision history for this message

Andrey Kirilochkin (andreika-mail) wrote on 2014-10-28:

10 drives kickstart Edit (137.3 KiB, application/rtf)

Revision history for this message

Andrey Kirilochkin (andreika-mail) wrote on 2014-10-28:

15 drives kickstrart Edit (185.3 KiB, application/rtf)

This one is bigger than possible.

Matthew Mosesohn (raytrac3r) on 2014-10-28

Changed in fuel:
assignee:	nobody → Fuel Library Team (fuel-library)
status:	New → Confirmed

Bogdan Dobrelya (bogdando) on 2014-10-28

Changed in fuel:
status:	Confirmed → Triaged

Matthew Mosesohn (raytrac3r) on 2014-11-25

Changed in fuel:
assignee:	Fuel Library Team (fuel-library) → Matthew Mosesohn (raytrac3r)

Revision history for this message

Matthew Mosesohn (raytrac3r) wrote on 2014-11-26:

Couldn't reproduce in CentOS (you wrote kickstart). Now I looked at log and it's actually Ubuntu. Trying to reproduce in VirtualBox with 15 disks.

Revision history for this message

Matthew Mosesohn (raytrac3r) wrote on 2014-11-26:

I'm having issues with ceph deployment after install, but the actual partitioning looks ok on the Cobbler side when deploying Ubuntu. What kind of errors are you seeing? What made you suspect Cobbler was to blame?

Changed in fuel:
status:	Triaged → New
status:	New → Incomplete

Matthew Mosesohn (raytrac3r) on 2014-12-09

Changed in fuel:
milestone:	6.0 → 6.1

Oleksiy Molchanov (omolchanov) on 2015-01-09

Changed in fuel:
status:	Incomplete → Invalid

Revision history for this message

Oleksiy Molchanov (omolchanov) wrote on 2015-01-12:

This bug was incomplete for more than 4 weeks. We cannot investigate it further so we are setting the status to Invalid. If you think it is not correct, please feel free to provide requested information and reopen the bug, and we will look into it further.

Revision history for this message

Matthew Mosesohn (raytrac3r) wrote on 2015-01-12:

I was able to reproduce it with a VirtualBox env with 25 1GB disks. Installation completed sometimes, but it took a very long time. I wasn't able to identify what specific delays caused the problem. It's definitely a problem, but image based provisioning is coming and is probably our most likely way to work around this.

Revision history for this message

Andrey Kirilochkin (andreika-mail) wrote on 2015-01-21:

Sorry for inactivity.
When we have on ceph-osd more than 23 partitions, installation of ubuntu fails.

How to reproduce:
1. Configure 30+ partitions for ceph-osd and 30+ for journal on the same disk(just to be sure).
2. Start installation and wait until installation finish.
3. Ubuntu installs boot-sector and goes to reboot the machine.

What expected:
1. Installation is finished.

What we really have:
1. Node goes to reboot, but after reboot installation starts again because preseed was corrupted in post-install section.

Revision history for this message

Dmitriy Novakovskiy (dnovakovskiy) wrote on 2015-01-21:

I think workaround is required for this, it's not enough to rely on image based provisioning (until, at least, it is claimed to be fully stable and substituting default Cobbler-based mechanism). Around 60% of installations that are happening or about to happen w/ 6.0 in near future that I'm aware of are using 20+ disks Ceph nodes.

Ugly workaround that Vladimir has described, or something like "run a shell script to partition OSD disks and add them to OSD at the very end of deployment"

Revision history for this message

Vladimir Kozhukalov (kozhukalov) wrote on 2015-01-21:

The correct way to fix this issue is to use image based provisioning (IBP). It is not quite stable at the moment but we know most of its bugs and some of them are fixed and merged. Any other potential workarounds like helper.sh or something like that are even less stable and need more resources for implementation.

By the way, our current IBP implementation is also cobbler-based and it is quite easy to enable it. So I think we need to cover all many disks deployments with IBP not wasting resources for inventing ugly buggy schemes.

Revision history for this message

Vladimir Kozhukalov (kozhukalov) wrote on 2015-01-21:

#10

This bug is a duplication of https://bugs.launchpad.net/fuel/+bug/1340414

Vladimir Kozhukalov (kozhukalov) on 2015-01-21