1
0
mirror of https://github.com/openshift/installer.git synced 2026-02-05 15:47:14 +01:00
Commit Graph

6395 Commits

Author SHA1 Message Date
vr4manta
abbe10adff Modified ova template name to use failure domain name 2025-02-10 10:16:43 -05:00
openshift-merge-bot[bot]
6fec4c858a Merge pull request #9376 from yanhua121/OCPBUGS-48044
OCPBUGS-48044: [Nutanix] Install multi-NICs cluster failed with failureDomains
2025-02-05 01:35:11 +00:00
Thuan Vo
bdeee29c15 OCPBUGS-49784: handle runtime panic when validating ovnKubernetesConfig
When a clusternetwork entry has an invalid hostPrefix, which is <= CIDR
mask and the custom IPv4 join subnet is provided in the install-config,
the installer gives a runtime panic error: negative shift amount.

This introduces a check to return a more user-friendly and descriptive
error message instead of runtime panic.
2025-02-03 15:59:33 -08:00
openshift-merge-bot[bot]
18d99d3d3e Merge pull request #9413 from mtulio/OCPBUGS-48827
OCPBUGS-48827: aws/edge/byovpc: subnets tag kube cluster tag to shared
2025-02-03 21:46:57 +00:00
openshift-merge-bot[bot]
2640fe0888 Merge pull request #9420 from mtulio/OCPBUGS-49594
OCPBUGS-49594: aws/byo-ip added required permission ec2:ReleaseAddress
2025-02-01 02:09:57 +00:00
openshift-merge-bot[bot]
1c141f0764 Merge pull request #9398 from tthvo/OCPBUGS-48089
OCPBUGS-48089: validate hostPrefix to be the same when multiple  clusternetwork CIDRs are present
2025-01-31 04:21:35 +00:00
Yanhua Li
2ed5a347b3 OCPBUGS-48044: [Nutanix] upgrade vendoring for cluster-api-provider-nutanix 2025-01-30 15:45:18 -05:00
Marco Braga
a451fb1eed OCPBUGS-49594: aws/byo-ip added required permission ec2:ReleaseAddress
Previously clusters created with minimum permissions in existing VPC
(unmanaged VPC or BYO VPC) and BYO Public IPv4 Pool address (BYO IP)
on AWS failed to de-provision cluster without permissions to release
EIP address (ec2:ReleaseAddress).

This change ensures ec2:ReleaseAddress permission is exported to the
install-generated IAM policy when deploying a cluster on AWS with
BYO VPC and BYO Public IPv4 Pool.
2025-01-29 12:27:48 -03:00
vr4manta
84d548c626 Added vSphere multi disk support 2025-01-29 07:15:03 -05:00
openshift-merge-bot[bot]
44b681d345 Merge pull request #9326 from shiftstack/OSAINFRA-2571
OSASINFRA-2571: Add quota validation for server group
2025-01-29 07:58:09 +00:00
openshift-merge-bot[bot]
ecd4e424d3 Merge pull request #9361 from shiftstack/fix-local-controlplane
OCPBUGS-48228: Envtest: Configure IPv6 service network for API Service
2025-01-29 03:34:03 +00:00
Thuan Vo
4b12df938b docs: mention the restriction for hostPrefix in InstallConfig CRD 2025-01-28 14:37:27 -08:00
Thuan Vo
508b66cd32 OCPBUGS-48089: validate hostPrefix to be the same when multiple
clusternetwork CIDRs are present

When multiple clusternetwork CIDRs are present, hostPrefix fields must
be specified with the same value. If not, it can impact traffic between
pods in different subnets.

The patch applies validation for IPv4 CIDR. For IPv6, the only option
for hostPrefix is 64, thus naturally satisfying the requirement.

References:

[0] https://issues.redhat.com/browse/OCPBUGS-46514 (debug steps and explanation for root cause)
[1] https://access.redhat.com/solutions/7100460 (temporary solution for existing cluster)
2025-01-28 14:37:27 -08:00
Patrick Dillon
fb88402006 Merge pull request #9396 from tthvo/CORS-3724
CORS-3724: bump go v1.23 and k8s dependencies
2025-01-28 16:46:16 -05:00
Marco Braga
7dbbb666f6 OCPBUGS-48827: aws/edge/byovpc: subnets tag kube cluster tag to shared
Previously the subnets created by user (BYO VPC) on edge zones (Local or
Wavelength zones) were not tagged with
kubernetes.io/cluster/<InfraID>:shared.

This change ensures installer is also setting the same cluster tag as
regular zones.
2025-01-28 17:35:50 -03:00
Maysa Macedo
9346289726 Envtest: Configure IPv6 service network for API Service
When the host that runs the OpenShift install is configured with
IPv6 only, the kube-apiserver created with envtest would fail
as the service-cluster-ip-range would be configured with a default
IPv4 CIDR and the public address family, which is the host address,
would be configured with an IPv6. This commit fixes the issue by setting
a default IPv6 CIDR to service-cluster-ip-range, in case the host
has no IPv4 available.
2025-01-28 10:00:28 -03:00
Thuan Vo
3628a43a38 CORS-3637: OWNERS files for platform provider subdirectories
The files use aliases defined in OWNERS_ALIASES
2025-01-27 10:37:24 -08:00
Thuan Vo
e5738d5c43 lint: bump local lint version to latest and apply fixes
Since CI is using tag latest, we can also use the same tag here. There
might be a case where latest mirrored image on CI points to an older
version. In that case, we can adjust local version accordingly.

Current version: golangci-lint v1.63.4
2025-01-24 20:33:25 -08:00
openshift-merge-bot[bot]
d224d2cdb3 Merge pull request #9384 from patrickdillon/ocpbugs-47489-global-leak
OCPBUGS-47489: pkg/gcp/destroy: add waits to prevent leaks during heavy load
2025-01-25 00:27:26 +00:00
Thuan Vo
6297fa0c4d lint: apply lint fixes 2025-01-24 13:31:47 -08:00
openshift-merge-bot[bot]
ed9225756b Merge pull request #9391 from trewest/fix-unmarshalling-of-imagesources
MGMT-19771: Convert IDS to proper IDMS manifest
2025-01-24 16:23:30 +00:00
openshift-merge-bot[bot]
5e8dd285bb Merge pull request #9333 from mtulio/aws-zones-fixes
OCPBUGS-47477: aws: skip dead zones on us-east-1 when discovering AZs
2025-01-24 03:52:39 +00:00
Trey West
a5ada353e0 MGMT-19771: Convert IDS to proper IDMS manifest 2025-01-23 09:01:20 -05:00
Yanhua Li
08f9beab79 OCPBUGS-48570:[Nutanix] Installation failed with timeout when uploading images to PC 2025-01-22 18:05:20 -05:00
Patrick Dillon
c78c0a12ad pkg/destroy/gcp: add DONE const to appease linter
Linter was complaining about existing code that used the magic
string "DONE". This converts the string to a constant.
2025-01-22 15:45:05 -05:00
Patrick Dillon
2b2cea03a0 OCPBUGS-47489: wait for destroy operations
In OCPBUGS-47489, we see that some resources, particularly global
backend services are being leaked during the destroy process.
Analysis of the creation time stamps for the leaked resources shows
that the resources are clustered together, suggesting the leaks
may occur during periods of heavy load.

During periods of heavy load, the deletion may take longer to
process. This commit addresses the issue by adding waits for all
resource deletion. This ensures ample time to complete destroy calls.
2025-01-22 12:47:42 -05:00
Patrick Dillon
2fc65ca2c6 pkg/destroy/gcp: refactor operation handling
The GCP destroy code repeated a lot of boilerplate operation handling.
This refactors all of that into a single function for increased
maintainability.
2025-01-21 15:59:55 -05:00
openshift-merge-bot[bot]
ec249ae631 Merge pull request #9341 from mtulio/OCPBUGS-46596
OCPBUGS-46596: aws/permissions - defaults ec2:DescribeInstanceTypeOfferings
2025-01-18 19:01:41 +00:00
openshift-merge-bot[bot]
cdd054f6f1 Merge pull request #8742 from jlebon/pr/no-more-oc
COS-3013: overlay node image before bootstrapping if necessary
2025-01-18 02:28:53 +00:00
openshift-merge-bot[bot]
59814195cd Merge pull request #9356 from bfournie/iscsi-installation
OCPBUGS-48256: Run iscsiadm for agent-based iSCSI installations
2025-01-17 21:29:08 +00:00
Jonathan Lebon
1ba8f3e419 bootstrap/common: use switch to satisfy golint
golint was complaining about:

```
pkg/asset/ignition/bootstrap/common.go:406:2: ifElseChain: rewrite if-else to switch statement (gocritic)
	if parentDir == "bin" || parentDir == "dispatcher.d" || parentDir == "system-generators" {
	^
```
2025-01-17 16:01:23 -05:00
Jonathan Lebon
60c63bb8bc bootstrap: pivot into node image before bootstrapping
As per https://github.com/openshift/enhancements/pull/1637, we're trying
to get rid of all OpenShift-versioned components from the bootimages.

This means that there will no longer be `oc`, `kubelet`, or `crio`
binaries for example, which bootstrapping obviously relies on.

Instead, now we change things up so that early on when booting the
bootstrap node, we pull down the node image, unencapsulate it (this just
means convert it back to an OSTree commit), then mount over its `/usr`,
and import new `/etc` content.

This is done by isolating to a different systemd target to only bring
up the minimum number of services to do the pivot and then carry on
with bootstrapping.

This does not incur additional reboots and should be compatible
with AI/ABI/SNO. But it is of course, a huge conceptual shift in how
bootstrapping works. With this, we would now always be sure that we're
using the same binaries as the target version as part of bootstrapping,
which should alleviate some issues such as AI late-binding (see e.g.
https://issues.redhat.com/browse/MGMT-16705).

The big exception of course being the kernel. Relatedly, note we do
persist `/usr/lib/modules` from the booted system so that loading kernel
modules still works.

To be conservative, the new logic only kicks in when using bootimages
which do not have `oc`. This will allow us to ratchet this in more
easily.

Down the line, we should be able to replace some of this with
`bootc apply-live` once that's available (and also works in a live
environment). (See https://github.com/containers/bootc/issues/76.)

For full context, see the linked enhancement and discussions there.
2025-01-17 16:01:23 -05:00
Bob Fournier
4cebd5d227 OCPBUGS-48256: Run iscsiadm for agent-based iSCSI installations
The agent-based installer currently runs iscsistart to ensure it
can boot off of an iSCSI target. However, in order to access the
final rhcos image written to the the target by coreos-installer,
iscsiadm must be run which will start iscsid. See
https://docs.redhat.com/en/documentation/red_hat_enterprise_linux/9/html/managing_storage_devices/configuring-an-iscsi-initiator_managing-storage-devices#configuring-an-iscsi-initiator_managing-storage-devices
for more info.
2025-01-17 12:07:27 -05:00
openshift-merge-bot[bot]
4fd9f0f429 Merge pull request #8873 from jcpowermac/SPLAT-1742
SPLAT-1742: vSphere - enable host group based zonal
2025-01-17 06:43:09 +00:00
Thuan Vo
bbe71b5d9f CORS-3721: bump default release image to 4.19
Branching checklist: bumps default release image to 4.19
2025-01-16 12:22:53 -08:00
Mark Hamzy
279650cc01 PowerVS: limit by resourceGroupID
Limit searching for resources to destroy to a specific resource group.
2025-01-12 11:31:54 -06:00
Joseph Callen
a17d6efb58 vSphere - enable host group based zonal 2025-01-09 08:02:06 -05:00
openshift-merge-bot[bot]
c028d7c18a Merge pull request #9238 from sadasu/mcs-certkey-updates
OCPBUGS-29067: Update MCS Cert and Key files within bootstrap Ignition with UserProvisionedDNS
2025-01-08 21:08:38 +00:00
dkokkino
e7187d0c0d Add quota validation for server group
- Added a quota constraint for server groups with a default of 2, reducing to 1 when no worker nodes are provisioned.
- Added a quota constraint for server group members, equal to the number of instances provisioned.
2025-01-08 17:17:09 +01:00
Sandhya Dasu
f7a244717c Custom DNS: Update MCS Cert and Key files within bootstrap Ignition
When UserProvisineDNS is enabled, in addition to machine-config-server
cert file, also update the individual cert and key files within
the bootstrap Ignition.
2025-01-08 00:06:23 -05:00
openshift-merge-bot[bot]
aeab38f026 Merge pull request #9309 from djoshy/add-mcs-ca-annos
MCO-1457: Clean up MCS CA & TLS cert objects for management
2025-01-07 23:06:49 +00:00
openshift-merge-bot[bot]
334309f724 Merge pull request #9322 from barbacbd/OCPBUGS-45995
OCPBUGS-45995: Always set AllowCrossTenantReplication parameter to false
2025-01-07 17:09:08 +00:00
openshift-merge-bot[bot]
0082224969 Merge pull request #9332 from barbacbd/OCPBUGS-46571
OCPBUGS-46571: Disable IP Forwarding for CAPG Machines
2025-01-07 06:22:49 +00:00
openshift-merge-bot[bot]
3b81f48fd0 Merge pull request #9336 from barbacbd/OCPBUGS-46605
OCPBUGS-46605: Find instances by label GCP
2025-01-06 21:19:22 +00:00
barbacbd
59ea6205bc OCPBUGS-46605: Find instances by label GCP
** Regeression to fix finding instance by label. In order to do this the labels
must be requested when querying for information about instances.
2025-01-02 09:40:22 -05:00
Marco Braga
821b07b4f1 OCPBUGS-46596: aws/perm - default ec2:DescribeInstanceTypeOfferings
ec2:DescribeInstanceTypeOfferings is used by machine pools to discover
supported instance type in the region and zones, when it isn't set in
the pool (control plane, compute, or edge).

The discover falls back to the m6i which is supported in mostly regions,
althrough some regions (e.g. ap-southeast-4 and eu-south-2) will fail as
that tye isn't supported. To the discover mechanism works properly
globally the permission must be added by default.

This permission is missing since minimum permissions has been
introduced, currently only the edge pool is including this permission.
This change moves that requirement to default create group.
2025-01-01 23:25:12 -03:00
openshift-merge-bot[bot]
d2bb75079b Merge pull request #9335 from miyamotoh/private-dns-service-url-must-end-with-v1
OCPBUGS-47504: Power VS: Private DNS service endpoint URL must end with /v1
2024-12-21 09:17:16 +00:00
openshift-merge-bot[bot]
747b913b76 Merge pull request #9189 from MahnoorAsghar/enable-tls-default-ironic-api
METAL-1137: Enable TLS for ironic API in the bootstrap VM
2024-12-21 01:13:14 +00:00
openshift-merge-bot[bot]
7c28220158 Merge pull request #9310 from r4f4/azure-avoid-growfs-var
OCPBUGS-46144: azure: use separate /var to avoid growfs timeouts
2024-12-20 21:11:16 +00:00
Hiro Miyamoto
b38ebf680a dnssvcsv1 expects endpoint URL to end with /v1
Signed-off-by: Hiro Miyamoto <miyamotoh@us.ibm.com>
2024-12-20 14:38:04 -05:00