1
0
mirror of https://github.com/openshift/installer.git synced 2026-02-06 00:48:45 +01:00
Commit Graph

2036 Commits

Author SHA1 Message Date
OpenShift Merge Robot
23ed14be88 Merge pull request #6665 from miyamotoh/vpc-region-zone-from-name-subnets
Better logic to derive vpcRegion/Zone from vpcName/Subnets
2023-01-04 15:58:18 -05:00
OpenShift Merge Robot
1d972a809c Merge pull request #6729 from derekhiggins/bump-carrier
OCPBUGS-5151: baremetal: Extra time for provisioning interface
2022-12-23 17:17:13 -05:00
OpenShift Merge Robot
d3b3b71fdc Merge pull request #6697 from zaneb/console-error-immediate
OCPBUGS-3032: Report status on the console immediately
2022-12-23 17:17:07 -05:00
Derek Higgins
abf1de99e0 baremetal: Extra time for provisioning interface
Allow some extra time for the provisioning interface to come up
https://issues.redhat.com/browse/OCPBUGS-5151
2022-12-22 16:48:40 +00:00
Hiro Miyamoto
42e453184b Better logic to derive vpcRegion/Zone from vpcName/Subnets
Signed-off-by: Hiro Miyamoto <miyamotoh@us.ibm.com>
2022-12-20 13:39:36 -05:00
Brent Barbachem
887532375a GCP: Tfvars will determine the user has create firewall permissions
** Create Firewall permissions option is removed from the install config. The
terraform vars checks the user permissions in the project to determine if the
user can create firewall rules. If the user does not have permissions to create firewall
rules then the firewall rules are skipped.
2022-12-19 08:33:29 -05:00
OpenShift Merge Robot
9ee3437a75 Merge pull request #6703 from mike-nguyen/413_bib
OCPBUGS-2996: bump RHCOS 4.13 bootimage metadata
2022-12-17 00:53:22 -05:00
OpenShift Merge Robot
9377cb3974 Merge pull request #6643 from r4f4/gci-import-order-1
Fix import order using the gci tool
2022-12-15 08:20:08 -05:00
Michael Nguyen
f274557afe bump RHCOS 4.13 bootimage metadata
These changes will update the RHCOS 4.13 boot image metadata in the
installer which includes the fixes for the following:

OCPBUGS-2926 - Unable to gather OpenStack console logs since kernel cmd line has no console args
OCPBUGS-3361 - RHCOS VM fails to boot on IBM Power (ppc64le) - 4.13

```
plume cosa2stream --target data/data/coreos/rhcos.json --distro rhcos --no-signatures --url https://rhcos.mirror.openshift.com/art/storage/releases x86_64=412.86.202209302317-0 aarch64=412.86.202210031918-0 s390x=412.86.202209302317-0 ppc64le=412.86.202209302326-0
```
2022-12-14 14:35:27 -05:00
Zane Bitter
646558ae26 Start install-status service earlier
Since the apply-host-config service is oneshot, not simple, systemd
doesn't consider it 'started' until it completes. We want the
install-status to show any errors from apply-host-config, so start it
at the same time (i.e. when create-cluster-and-infraenv has finished).
2022-12-14 14:35:07 -05:00
Zane Bitter
c12094a791 Reset install status if it gets back on track 2022-12-14 14:35:07 -05:00
Zane Bitter
4daff6117f Update console debug messages immediately
When updating the console debug messages, don't wait for the next
refresh. Issue a reload, which will cause the console to be redisplayed
unless the user has already started logging in.
2022-12-14 14:35:07 -05:00
Rafael Fonseca
80e02a974d chore: fix import order 2022-12-13 15:40:58 +01:00
Thomas Montague
48d988c568 Switch back to gp2 ebs volume type for bootstrap instance.
Not all AWS regions support gp3 and this hardcoded value breaks IPI installs
to those regions.
2022-12-12 12:43:04 -05:00
OpenShift Merge Robot
01adff5d62 Merge pull request #6544 from zaneb/issue-console
OCPBUGS-3032: Report agent installation problems on the console
2022-12-08 03:00:04 -05:00
OpenShift Merge Robot
cddd2d18fe Merge pull request #6498 from jcpowermac/OCPBUGS-1560
OCPBUGS-1560: vsphere zonal, terraform depends on ova import
2022-12-07 04:00:50 -05:00
Zane Bitter
7c81a78769 Adapt MOTD for rendezvous host vs other hosts
Make the main message specific to the agent installer, and put it inside
motd.d so that it does not get overwritten on the bootstrap node.

Put info about the primary service to watch (assisted-service on node 0;
agent on other hosts) directly in /etc/motd. On node 0, this will be
overwritten to say that bootkube is the primary service once
bootstrapping starts. Writing this data to /etc/motd also has the effect
of overwriting the stuff about using Machine Config Operator to
configure the host, which CoreOS automatically puts in there but which
doesn't make sense until the cluster is booted.
2022-12-06 15:00:14 -05:00
Zane Bitter
c08bd9dba0 Display error on console when services not started 2022-12-06 15:00:13 -05:00
Zane Bitter
098b81839f Add description to agent unit 2022-12-06 15:00:13 -05:00
Zane Bitter
7fcb2156f8 Display cluster name on ISO console 2022-12-06 15:00:13 -05:00
Zane Bitter
0962c0a4c3 Display automatic installation status on the console 2022-12-06 15:00:13 -05:00
Zane Bitter
728ef0689a Move debug message outside of loop 2022-12-06 13:13:10 -05:00
Zane Bitter
dda3e44310 Display error on console for host configuration errors 2022-12-06 13:13:10 -05:00
Zane Bitter
84b5f5d9af Display error on console when we cannot download release image 2022-12-06 13:13:10 -05:00
OpenShift Merge Robot
3bb6bd99b7 Merge pull request #6626 from derekhiggins/set-dual-master
OCPBUGS-3196: Set ip=dhcp,dhcp6 for master nodes on dualstack
2022-12-06 11:40:01 -05:00
OpenShift Merge Robot
78b86b09ec Merge pull request #6651 from miyamotoh/wait-longer-for-dhcp-ip
Wait longer for VM to obtain IP from DHCP in PowerVS
2022-12-05 21:53:37 -05:00
OpenShift Merge Robot
ea26eaffb9 Merge pull request #6535 from dongchen126/fix-ack-tags
Alibaba: add the tags of the machine nodes
2022-12-05 21:53:24 -05:00
OpenShift Merge Robot
1fb27747fa Merge pull request #6649 from omertuc/snobipcvooverride
OCPBUGS-4125: Enable CVO unmanage overrides in bootstrap-in-place installations
2022-12-05 15:13:19 -05:00
OpenShift Merge Robot
cc689a2104 Merge pull request #6488 from vrutkovs/fcos-hold-bootkube-until-pivot
OCPBUGS-3164: hold bootkube service until bootstrap has pivoted
2022-12-03 12:58:08 -05:00
Omer Tuchfeld
00718baa2c OCPBUGS-4125: Enable CVO unmanage overrides in bootstrap-in-place installations
# Background

#5258 introduced a mechanism to tell CVO to not manage some cluster resources,
as that management was creating races between bootkube and CVO.

In #5271, this mechanism was removed from bootstrap-in-place
installations.

The reason for the removal was because attempting to apply
`original_cvo_overrides.patch` after the cluster-bootstrap podman
container was done wasn't working, as in BIP by the time that that
container is down, there's no longer a running kube-apiserver apply the
patch on (unlike regular installations with a bootstrap node where the
API server keeps running on the actual nodes). The justification for
simply disabling the mechanism instead of finding a solution was based
on an incorrect assumption that "CVO doesn't even run in BIP pre-pivot
so there's no need to perform overrides anyway as there's no CVO to race
with".

# Issue

Naturally, since CVO *is* actually running in the pre-pivot phase of
bootstrap-in-place, not applying CVO overrides to disable the CVO
management of bootkube-created resources led to the same races #5258 was
aiming to solve in bootstrap-in-place installations.

One manifestation of that is OCPBUGS-4125.

# Solution

This commit re-introduces the mechanism that was originally introduced in #5258
(and removed in #5271).

To solve the issue that motivated #5271, this commit doesn't attempt to
apply the `original_cvo_overrides.patch` in bootstrap-in-place
pre-reboot, and instead will transfer `original_cvo_overrides.patch` through
`master-update.fcc` to the post-pivot phase and the bootstrap-in-place
post-reboot script will make sure to apply the patch after the API
server is running again.
2022-12-03 14:58:30 +01:00
OpenShift Merge Robot
e580ee6f2a Merge pull request #6610 from barbacbd/remove_private_dns_zone
GCP: remove private zone manipulation
2022-12-01 22:52:02 -05:00
OpenShift Merge Robot
ffd4e2ca6e Merge pull request #6602 from r4f4/io-util-deprecated
Replace deprecated io/ioutil package
2022-12-01 22:51:56 -05:00
Hiro Miyamoto
ac8dd4fb9a Wait longer for VM to obtain IP from DHCP in PowerVS
Signed-off-by: Hiro Miyamoto <miyamotoh@us.ibm.com>
2022-12-01 17:05:17 -05:00
OpenShift Merge Robot
4a4e34f1b5 Merge pull request #6583 from r4f4/azurerm-sa-access
OCPBUGS-3524: data: azurerm: restore RHCOS SA access configuration
2022-11-30 18:31:17 -05:00
OpenShift Merge Robot
ac9b99e0c4 Merge pull request #6574 from zaneb/redact-pull-secret
OCPBUGS-3405: Redact pull secret from agent-gather
2022-11-30 06:04:16 -05:00
OpenShift Merge Robot
02f915f63f Merge pull request #6579 from deads2k/pass-featureset-to-kas
OCPBUGS-3985: Pass featureset to kas
2022-11-29 10:04:38 -05:00
Joseph Callen
505fdd4504 OCPBUGS-1560: vspherezoning terraform folder
Since the folder resource might not be ran because there could be
user defined folder per failure domain if a folder is created
the import resource is not waiting. Adding
this depends_on so the import happens after creating
folder(s).
2022-11-29 09:14:52 -05:00
David Eads
ff9607e6ae pass the featureset into the kube-apiserver operator render command 2022-11-28 09:03:01 -05:00
Derek Higgins
91caa20408 Set ip=dhcp,dhcp6 for master nodes on dualstack
We were previously only seting this for workers but
its also needed on the master nodes to ensure both
networks are ready before ovnkube-node starts.
2022-11-24 12:55:22 +00:00
OpenShift Merge Robot
8258456dbf Merge pull request #6577 from bfournie/retry-register
OCPBUGS-3277: Restart create-cluster-and-infraenv.service if it fails
2022-11-23 20:05:47 -05:00
Brent Barbachem
91ae1285f4 GCP: remove private zone manipulation
** Cannot specify the privatezone for GCP installations.
2022-11-23 09:20:43 -05:00
Bob Fournier
b846a70889 OCPBUGS-3277: Restart create-cluster-and-infraenv.service if it fails
As described in https://issues.redhat.com/browse/OCPBUGS-3277, there have
been occasional failures in the create-cluster-and-infraenv service
due to the agentbasedinstaller being unable to retrieve the release image.
This changes to the service to allow a retry on failure.
2022-11-21 15:26:54 -05:00
Joseph Callen
9f164b1678 Update vSphere Owner files 2022-11-21 08:02:55 -05:00
OpenShift Merge Robot
ec3f42dea8 Merge pull request #5671 from dongchen126/fix-dns-record
OCPBUGS-2384: [Alibaba] fix the creation of public record
2022-11-20 15:38:01 -05:00
Rafael Fonseca
428688c9cd Replace deprecated io/ioutil package
`io/ioutil` has been deprecated since go-1.16 [1]. We should use `io`
and `os` instead.

[1] https://github.com/golang/go/issues/42026
2022-11-18 20:08:57 +01:00
OpenShift Merge Robot
5a63550d99 Merge pull request #6576 from sanchezl/add-featureset-data
OCPBUGS-3123: Pass FeatureSet to cluster config render
2022-11-14 23:07:40 +00:00
Rafael Fonseca
d996fdaa90 OCPBUGS-3524: data: azurerm: restore RHCOS SA access configuration
In the transition from v2 to v3 of the azurerm terraform provider, not
only was an option renamed from `allow_blob_public_access` to
`allow_nested_items_to_be_public` but its default value was also changed
from `false` to `true` [1].

So to restore the previous access setting for the RHCOS Storage Account,
we need to explicitly set the new option to `false`.

Thanks Jinyun Ma for pointing this out.

[1] https://github.com/hashicorp/terraform-provider-azurerm/blob/main/CHANGELOG.md#300-march-24-2022
2022-11-11 14:26:55 +01:00
OpenShift Merge Robot
0f99618f1c Merge pull request #6567 from bfournie/auto-retry-failures
OCPBUGS-3280: Automatically retry install
2022-11-11 02:41:35 +00:00
Zane Bitter
01be45abc8 Redact any pull secrets from the journal in agent-gather
Although we should never log the value of a pull-secret, to be on the
safe side redact any of the pull secret auth values that we know about
from the journal before saving it in the agent-gather archive.

The exported journal format is binary[1], so we must ensure that the
redacted output has the same length as the input.

[1] https://systemd.io/JOURNAL_EXPORT_FORMATS/#journal-export-format
2022-11-10 20:07:46 -05:00
Zane Bitter
b179f28003 Remove confusing reference to "PullSecretToken"
This field is always empty, and there are no foreseeable future
circumstances in which we will use RHSSO for agent authentication, so
just remove it.
2022-11-10 20:07:46 -05:00