Discovered ip addresses mapped to an invalid name (ending with -)

Bug #2072155 reported by Joao Andre Simioni
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
MAAS
Triaged
High
Unassigned
3.4
Triaged
High
Unassigned
3.5
Triaged
High
Unassigned

Bug Description

[Problem Description]

We are seeing some addresses being discovered and their names mapped to invalid entries (in this case, ending with a dash).

This causes named to fail to load that specific Zone file, with a check-names error:

05-Jul-2024 00:14:57.295 /var/snap/maas/35359/bind/zone.redlab.clientnet.com:105: 10-25-62-32-red-clientnet-.redlab.clientnet.com: bad owner name (check-names)
05-Jul-2024 00:14:57.295 zone redlab.clientnet.com/IN: loading from master file /var/snap/maas/35359/bind/zone.redlab.clientnet.com failed: bad owner name (check-names)
05-Jul-2024 00:14:57.295 zone redlab.clientnet.com/IN: not loaded due to errors.

This generates another error in regiond.log:

2024-07-04 18:19:53 maasserver.region_controller: [critical] Failed configuring DNS; killing and restarting
        Traceback (most recent call last):
          File "/snap/maas/35359/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 661, in callback
            self._startRunCallbacks(result)
          File "/snap/maas/35359/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 763, in _startRunCallbacks
            self._runCallbacks()
          File "/snap/maas/35359/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 857, in _runCallbacks
            current.result = callback( # type: ignore[misc]
          File "/snap/maas/35359/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 1750, in gotResult
            current_context.run(_inlineCallbacks, r, gen, status)
        --- <exception caught here> ---
          File "/snap/maas/35359/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 1660, in _inlineCallbacks
            result = current_context.run(gen.send, result)
          File "/snap/maas/35359/lib/python3.10/site-packages/maasserver/region_controller.py", line 360, in _checkSerial
            raise DNSReloadError(
        maasserver.region_controller.DNSReloadError: Failed to reload DNS; serial mismatch on domains redlab.clientnet.com

Trying to delete or rename these entries won't work, with the following error:

Error:
DNSResource matching query does not exist.

[Environment]

Jammy with MAAS 3.4 using SNAPs. PostgreSQL 14 in a different host.

[Workaround]

We disabled Network Discovery and used the "Clear all discoveries" button. Some invalid entries were deleted, but some were still present.

We identified that the persistent discoveries were DPU devices on some hosts. We found the hosts who had those DPUs and deleted them (the hosts) from MAAS to restore the service.

[Considerations]

The auto-discovered names should pass through a normalization function to make sure they comply to RFC naming convention (suggestion).

Also, it would be important to understand how the DPUs are being discovered even with the discovery disabled.

Revision history for this message
Joao Andre Simioni (jasimioni) wrote :
Revision history for this message
Alexsander de Souza (alexsander-souza) wrote :

please, open another bug for the DPU discovery issue, it's hard to track the fix of two distinct issue in the same LP issue.

Changed in maas:
status: New → Triaged
importance: Undecided → Medium
milestone: none → 3.6.0
Revision history for this message
Joao Andre Simioni (jasimioni) wrote :
tags: added: bug-council
Changed in maas:
importance: Medium → High
Revision history for this message
Jerzy Husakowski (jhusakowski) wrote :

https://datatracker.ietf.org/doc/html/rfc1034#section-3.5 specifies that a DNS label cannot end with a hyphen. MAAS must not generate such labels.

Revision history for this message
Jerzy Husakowski (jhusakowski) wrote :

Joao, would it be possible to attach the ARP table from MAAS rack controller(s)?

tags: removed: bug-council
Revision history for this message
Joao Andre Simioni (jasimioni) wrote :

Jerzy,

this is a relayed VLAN so MAAS does not have ARP contents in that network.

I'll request that information from one of the machines.

Revision history for this message
Joao Andre Simioni (jasimioni) wrote :
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.