Skip to content

Commit

Permalink
Merge pull request #485 from rhjanders/live-updates-servicing-doco
Browse files Browse the repository at this point in the history
Add live updates documentation
  • Loading branch information
metal3-io-bot authored Nov 28, 2024
2 parents e69693b + 7d4bbb1 commit fd692b2
Show file tree
Hide file tree
Showing 4 changed files with 121 additions and 0 deletions.
1 change: 1 addition & 0 deletions docs/user-guide/src/SUMMARY.md
Original file line number Diff line number Diff line change
Expand Up @@ -19,6 +19,7 @@
- [Firmware Updates](bmo/firmware_updates.md)
- [Inspect Annotation](bmo/inspect_annotation.md)
- [Instance Customization](bmo/instance_customization.md)
- [Live Updates (Servicing)](bmo/live_updates_servicing.md)
- [RAID Setup](bmo/raid.md)
- [Reboot Annotation](bmo/reboot_annotation.md)
- [Root Device Hints](bmo/root_device_hints.md)
Expand Down
6 changes: 6 additions & 0 deletions docs/user-guide/src/bmo/firmware_settings.md
Original file line number Diff line number Diff line change
Expand Up @@ -196,3 +196,9 @@ pending until it gets to this state (e.g. as a result of deprovisioning).
Alternatively, you can create a `HostFirmwareSettings` object together with
the `BareMetalHost` object. In this case, the settings will be applied after
inspection is finished.

## See also

The functionality described here can be used either on newly provisioned nodes
(Day 1 operation, as described here) or on already provisioned nodes (Day 2
operation, utilizing [Live Updates / Servicing feature](./live_updates_servicing.md)).
4 changes: 4 additions & 0 deletions docs/user-guide/src/bmo/firmware_updates.md
Original file line number Diff line number Diff line change
Expand Up @@ -149,6 +149,10 @@ a version change is detected.

## See also

The functionality described here can be used either on newly provisioned nodes
(Day 1 operation, as described here) or on already provisioned nodes (Day 2
operation, utilizing [Live Updates / Servicing feature](./live_updates_servicing.md)).

The corresponding functionality in Ironic is called
[Firmware Updates][1].

Expand Down
110 changes: 110 additions & 0 deletions docs/user-guide/src/bmo/live_updates_servicing.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,110 @@
# Live updates (servicing)

Live updates (servicing) enables baremetal-operator to conduct certain actions
on already provisioned BareMetalHosts. These actions currently include:

- [configuring firmware settings](./firmware_settings.md)
- [updating BIOS and/or BMC firmware](./firmware_updates.md)

Live updates (servicing) is an opt-in feature. Operators may enable this
feature by creating a `HostUpdatePolicy` custom resource.

## HostUpdatePolicy custom resource definition

HostUpdatePolicy is the custom resource which controls applying live updates.
Each part of the functionality can be controlled separately by setting the
respective entry in the HostUpdatePolicy spec:

- `firmwareSettings` - controls changes to firmware settings
- `firmwareUpdates` - controls BIOS and BMC firmware updates

### Allowed values for firmwareSettings and firmwareUpdates fields

Each of the fields can be set to one of the two values:

- `onReboot` - enables performing the requested change on next reboot, or
- `onPreparing` - (default setting) limits applying this type of change to
Preparing state (which only applies to nodes which are being provisioned)

### Example HostUpdatePolicy definition

Here is an example of a HostUpdatePolicy CRD:

```yaml
apiVersion: metal3.io/v1alpha1
kind: HostUpdatePolicy
metadata:
name: ostest-worker-0
namespace: openshift-machine-api
spec:
firmwareSettings: onReboot
firmwareUpdates: onReboot
```
## How to perform Live updates on a BareMetalHost
- create a HostUpdatePolicy resource with the name matching the BMH to be
updated
- use the format above, ensure `firmwareSettings` and/or `firmwareUpdates` is
set to `onReboot`
- make changes to [HostFirmwareSettings](./firmware_settings.md) and/or [HostFirmwareComponents](./firmware_updates.md) as required
- make sure the modified resources are considered valid (see `Conditions`)
- if you're updating a Kubernetes node, make sure to drain it and mark as
not schedulable
- issue a reboot request via the [reboot annotation](./reboot_annotation.md)
- wait for the `operationalStatus` to become `OK` again
- if you're updating a Kubernetes node, make it schedulable again

### Example commands

Below commands may be used to perform servicing operation on a bareMetalHost:

```yaml
cat << EOF > hup.yaml
apiVersion: metal3.io/v1alpha1
kind: HostUpdatePolicy
metadata:
name: ostest-worker-0
namespace: openshift-machine-api
spec:
firmwareSettings: onReboot
firmwareUpdates: onReboot
EOF
```

```console
kubectl apply -f hup.yaml
kubectl patch hostfirmwaresettings ostest-worker-0 --type merge -p \
'{"spec": {"settings": {"QuietBoot": "true"}}}'
kubectl patch hostfirmwarecomponents ostest-worker-0 --type merge -p \
'{"spec": {"updates": [{"component": "bios",
"url": "http://10.6.48.30:8080/firmimgFIT.d9"}]}}'
kubectl cordon worker-0
kubectl annotate bmh ostest-worker-0 reboot.metal3.io=""
```

Once the operation is complete, the node can be un-drained with the below command:

```console
kubectl uncordon worker-0
```

### Resulting workflow

Once changes similar to the above are made to the relevant CRDs, the following
will occur:

- BMO will generate [servicing steps](https://docs.openstack.org/ironic/latest/admin/servicing.html) (similar to manual cleaning steps)
required to perform the requested changes
- BMH will transition to `servicing` operationalStatus
- BMO will make calls to Ironic which will perform the servicing operation
- Ironic will reboot the BMH into the IPA image and perform requested changes
- depending on the hardware, more than one reboot may be required
- once servicing completes, BMO will update the operationalStatus to `OK`
- in case errors are encountered, BMO will set operationalStatus to `error`,
set errorMessage to the explanation of the error, and retry the operation after
a short delay

0 comments on commit fd692b2

Please sign in to comment.