Error Booting Newly Provisioned Node - System Overlay Not Found #626
Error Booting Newly Provisioned Node - System Overlay Not Found #626
https://github.com/warewulf/warewulf/issues/626
bindatype opened on Jan 3, 2023 Hi,
I’m trying, unsuccessfully, to boot any node on a new cluster running warewulf built from the git version:
wwctl version: 4.4.0rc3-1.git_b63ad7b rpc version: apiPrefix:”rc1” apiVersion:”1” warewulfVersion:”4.4.0rc3-1.git_b63ad7b” and I am hoping someone can shed some light on what the issue is that is preventing nodes from booting.
I have a basic rocky-8 image and assigning a node via: wwctl node set –netdev eth3 –hwaddr 18:66:DA:9D:B4:CD –ipaddr 192.168.100.50 –netmask 255.255.255.0 node001 Booting gives the following error:
Screenshot 2023-01-02 at 2 39 57 PM
I assume that the System Overlay isn’t being created and I know that wwinit.img doesn’t exist.
When I try to build the overlay I see the following:
wwctl overlay build
Building overlay for glc2: host Building system overlays for node001: [wwinit] WARN : Some overlays failed to be generated: could not build system overlays [wwinit] for node node001: Failed to generate files for overlay node001/[wwinit]: failed to build overlay working directory: Failed to render template etc/NetworkManager/system-connections/ww4-managed.ww: could not parse template etc/NetworkManager/system-connections/ww4-managed.ww: template: ww4-managed.ww:4: bad character U+002D ‘-‘ The warewulfd.log file contains this:
[Mon Jan 2 14:48:00 EST 2023] RECV : hwaddr: 18:66:da:9d:b4:cd, ipaddr: 192.168.100.51:40141, stage: ipxe [Mon Jan 2 14:48:00 EST 2023] SERV : stage_file ‘/usr/local/etc/warewulf/ipxe/default.ipxe’ [Mon Jan 2 14:48:00 EST 2023] SEND : node001: /usr/local/etc/warewulf/ipxe/default.ipxe [Mon Jan 2 14:48:00 EST 2023] RECV : hwaddr: 18:66:da:9d:b4:cd, ipaddr: 192.168.100.51:40141, stage: kernel [Mon Jan 2 14:48:00 EST 2023] SERV : stage_file ‘/var/local/warewulf/chroots/rocky-8/rootfs/lib/modules/4.18.0-372.13.1.el8_6.x86_64/vmlinuz’ [Mon Jan 2 14:48:00 EST 2023] SEND : node001: /var/local/warewulf/chroots/rocky-8/rootfs/lib/modules/4.18.0-372.13.1.el8_6.x86_64/vmlinuz [Mon Jan 2 14:48:00 EST 2023] RECV : hwaddr: 18:66:da:9d:b4:cd, ipaddr: 192.168.100.51:40141, stage: container [Mon Jan 2 14:48:00 EST 2023] SERV : stage_file ‘/srv/warewulf/container/rocky-8.img’ [Mon Jan 2 14:48:05 EST 2023] SEND : node001: /srv/warewulf/container/rocky-8.img.gz [Mon Jan 2 14:48:05 EST 2023] RECV : hwaddr: 18:66:da:9d:b4:cd, ipaddr: 192.168.100.51:40141, stage: system [Mon Jan 2 14:48:05 EST 2023] SERV : BUILD node001, overlays [wwinit] [Mon Jan 2 14:48:05 EST 2023] SERV : stage_file ‘/srv/warewulf/overlays/node001/wwinit.img’ [Mon Jan 2 14:48:05 EST 2023] ERROR : Not found: /srv/warewulf/overlays/node001/wwinit.img
My warewulf.conf files looks like:
cat /usr/local/etc/warewulf/warewulf.conf
WW_INTERNAL: 43 ipaddr: 192.168.100.6 netmask: 255.255.255.0 network: 192.168.100.0 warewulf: port: 9873 secure: false update interval: 60 autobuild overlays: true host overlay: true syslog: false dhcp: enabled: true range start: 192.168.100.50 range end: 192.168.100.99 systemd name: dhcpd tftp: enabled: true systemd name: tftp nfs: enabled: true export paths:
- path: /home export options: rw,sync mount options: defaults mount: true
- path: /opt export options: ro,sync,no_root_squash mount options: defaults mount: false systemd name: nfs-server The node in question looks like:
wwctl node list -a node001
NODE FIELD PROFILE VALUE ===================================================================================== node001 Id – node001 node001 comment default This profile is automatically included for each node node001 cluster – – node001 container – rocky-8 node001 ipxe – (default) node001 runtime – (generic) node001 wwinit – (wwinit) node001 root – (initramfs) node001 discoverable – – node001 init – (/sbin/init) node001 asset – – node001 kerneloverride – – node001 kernelargs – (quiet crashkernel=no vga=791 net.naming-scheme=v238) node001 ipmiaddr – – node001 ipminetmask – – node001 ipmiport – – node001 ipmigateway – – node001 ipmiuser – – node001 ipmipass – – node001 ipmiinterface – – node001 ipmiwrite – – node001 profile – default node001 default:type – (ethernet) node001 default:onboot – – node001 default:netdev – eth3 node001 default:hwaddr – 18:66:DA:9D:B4:CD node001 default:ipaddr – 192.168.100.50 node001 default:ipaddr6 – – node001 default:netmask – 255.255.255.0 node001 default:gateway – – node001 default:mtu – – node001 default:primary – true Activity bindatype bindatype commented on Jan 3, 2023 bindatype on Jan 3, 2023 Author I perhaps should add that the host is running Rocky-9:
cat /etc/redhat-release
Rocky Linux release 9.1 (Blue Onyx)
uname -a
Linux glc2 5.14.0-162.6.1.el9_1.x86_64 #1 SMP PREEMPT_DYNAMIC Fri Nov 18 02:06:38 UTC 2022 x86_64 x86_64 x86_64 GNU/Linux
anderbubble added a commit that references this issue on Jan 3, 2023 Fix variable naming to use valid characters
9718711
anderbubble mentioned this on Jan 3, 2023 Fix variable naming to use valid characters #628 anderbubble anderbubble commented on Jan 3, 2023 anderbubble on Jan 3, 2023 Collaborator @bindatype I think I see the problem, and I’ve submitted a PR at #628. I’d be interested to know if this resolves your issue. (Should be easy enough to change the template manually by hand to try, if you’re willing.)
anderbubble added backport:4.2.x backport to 4.2.x
bug and removed backport:4.2.x backport to 4.2.x on Jan 3, 2023
anderbubble added this to the 4.4.0 milestone on Jan 3, 2023 bindatype bindatype commented on Jan 3, 2023 bindatype on Jan 3, 2023 Author YES! That solved the problem. Thanks so much!
bindatype closed this as completedon Jan 3, 2023 anderbubble anderbubble commented on Jan 4, 2023 anderbubble on Jan 4, 2023 Collaborator Excellent! Thanks for testing.
@mslacken can you merge #628?
anderbubble removed bug on Feb 6