OpenShift is based on upstream Kubernetes. With every release of Kubernetes that is intended to be shipped as OCP, it is necessary to incorporate the upstream changes while ensuring that our downstream customizations are maintained.
The instructions in this document apply to OpenShift releases 4.6 and above. For previous releases, please see the rebase enhancement.
An openshift/kubernetes rebase is a complex process involving many manual and potentially error-prone steps. If, while performing a rebase, you find areas where the documented procedure is unclear or missing detail, please update this document and include the change in the rebase PR. This will ensure that the instructions are as comprehensive and accurate as possible for the person performing the next rebase.
The checklists provided below highlight the key responsibilities of someone performing an openshift/kubernetes rebase.
In preparation for submitting a PR to the openshift fork of kubernetes, the following should be true:
- The new rebase branch has been created from the upstream tag
- The new rebase branch includes relevant carries from target branch
- Dependencies have been updated
- Hyperkube dockerfile version has been updated
-
make update
has been invoked and the results committed -
make
executes without error -
make verify
executes without error -
make test
executes without error - The upstream tag is pushed to
openshift/kubernetes
to ensure that build artifacts are versioned correctly - Upstream tooling uses the value of the most recent tag (e.g.v1.20.0
) in the branch history as the version of the binaries it builds. - Pushing the tag is easy as
git push [email protected]:openshift/kubernetes.git refs/tags/v1.20.0
Details to include in the description of the PR:
- A link to the rebase spreadsheet for the benefit for reviewers
After the rebase PR has merged to openshift/kubernetes
, vendor the changes
into openshift/origin
to ensure that the openshift-tests binary reflects
the upstream test changes introduced by the rebase:
- Find the SHA of the merge commit after your PR lands in
openshift/kubernetes
- Run
hack/update-kube-vendor.sh <o/k SHA>
in a clone of theorigin
repo and commit the results - Run
make update
and commit the results - Submit as a PR to
origin
As a final step, send an email to the aos-devel mailing list announcing the rebase. Make sure to include:
- The new version of upstream Kubernetes that OpenShift is now based on
- Link(s) to upstream changelog(s) detailing what has changed since the last rebase landed
- A reminder to component maintainers to bump their dependencies
- Relevant details of the challenges involved in landing the rebase that could benefit from a wider audience.
Before incorporating upstream changes you may want to:
- Read this document
- Get familiar with tig (text-mode interface for git)
- Find the best tool for resolving merge conflicts
- Use diff3 conflict resolution strategy (https://blog.nilbus.com/take-the-pain-out-of-git-conflict-resolution-use-diff3/)
- Teach Git to remember how you’ve resolved a conflict so that the next time it can resolve it automatically (https://git-scm.com/book/en/v2/Git-Tools-Rerere)
To better spread the information send the following email:
Title: k8s <version> bump is starting...
I'm starting the process of updating our fork to bring in
the latest available version of kubernetes. This means that
every PR landing in openshift/kubernetes should go through
extra scrutiny and only 2 exceptions allow merging PRs in the
upcoming time:
1. High priority backports which require landing master first
to start the backport process.
2. Critical PRs unblocking the org.
In both cases make sure to reach out to me for final approval.
There is no ETA yet, but feel free to reach out to me with
any questions.
Clone from a personal fork of kubernetes via a pushable (ssh) url:
git clone [email protected]:<user id>/kubernetes
Add a remote for upstream and fetch its branches:
git remote add --fetch upstream https://github.com/kubernetes/kubernetes
Add a remote for the openshift fork and fetch its branches:
git remote add --fetch openshift https://github.com/openshift/kubernetes
- Branch the target
k8s.io/kubernetes
release tag (e.g.v1.20.0
) to a new local branch
git checkout -b rebase-1.20.0 v1.20.0
- Merge
openshift(master)
branch into therebase-1.20.0
branch with merge strategyours
. It discards all changes from the other branch (openshift/master
) and create a merge commit. This leaves the content of your branch unchanged, and when you next merge with the other branch, Git will only consider changes made from this point forward. (Do not confuse this withours
conflict resolution strategy forrecursive
merge strategy,-X
option.)
git merge -s ours openshift/master
Given the upstream tag (e.g. v1.19.2
) of the most recent rebase and the name
of the branch that is targeted for rebase (e.g. openshift/master
), generate a tsv file
containing the set of carry commits that need to be considered for picking:
echo 'Comment Sha\tAction\tClean\tSummary\tCommit link\tPR link' > ~/Documents/v1.19.2.tsv
git log $( git merge-base openshift/master v1.19.2 )..openshift/master --ancestry-path --reverse --no-merges --pretty='tformat:%x09%h%x09%x09%x09%s%x09https://github.com/openshift/kubernetes/commit/%h?w=1' | grep -E $'\t''UPSTREAM: .*'$'\t' | sed -E 's~UPSTREAM: ([0-9]+)(:.*)~UPSTREAM: \1\2\thttps://github.com/kubernetes/kubernetes/pull/\1~' >> ~/Documents/v1.19.2.tsv
This tsv file can be imported into a google sheets spreadsheet to track the progress of picking commits to the new rebase branch. The spreadsheet can also be a way of communicating with rebase reviewers. For an example of this communication, please see the the spreadsheet used for the 1.19 rebase.
Go through the spreadsheet and for every commit set one of the appropriate actions:
p
, to pick the commits
, to squash it (add a comment with the sha of the target)d
, to drop the commit (if it is not obvious, comment why)
Set up conditional formatting in the google sheet to color these lines appropriately.
Commits carried on rebase branches have commit messages prefixed as follows:
UPSTREAM: <carry>:
- A persistent carry that should probably be picked for the subsequent rebase branch.
- In general, these commits are used to modify behavior for consistency or compatibility with openshift.
UPSTREAM: <drop>:
- A carry that should probably not be picked for the subsequent rebase branch.
- In general, these commits are used to maintain the codebase in ways that are branch-specific, like the update of generated files or dependencies.
UPSTREAM: 77870:
- The number identifies a PR in upstream kubernetes
(i.e.
https://github.com/kubernetes/kubernetes/pull/<pr id>
) - A commit with this message should only be picked into the subsequent rebase branch if the commits of the referenced PR are not included in the upstream branch.
- To check if a given commit is included in the upstream branch, open the referenced
upstream PR and check any of its commits for the release tag (e.g.
v.1.20.0
) targeted by the new rebase branch. For example:
- The number identifies a PR in upstream kubernetes
(i.e.
With these guidelines in mind, pick the appropriate commits from the previous rebase
branch into the new rebase branch. Create a new filter view in the spreadsheet to allow
you get a view where Action==p || Action==s
and copy paste the shas to git cherry-pick
command. Use tr '\n' ' ' <<< "<line_separated_commits>"
to get a space separated list
from the copy&paste.
Where it makes sense to do so, squash carried changes that are tightly coupled to simplify future rebases. If the commit message of a carry does not conform to expectations, feel free to revise and note the change in the spreadsheet row for the commit.
If you first pick all the pick+squash commits first and push them for review it is easier for you and your reviewers to check the code changes and you squash it at the end.
When filling in Clean column in the spreadsheet make sure to use the following number to express the complexity of the pick:
- 0 - clean
- 1 - format fixups
- 2 - code fixups
- 3 - logic changes
Explicit commit rules:
- Anything touching
openshift-hack/
, openshift specific READMEs or similar files should be squashed to 1 commit named "UPSTREAM: : Add OpenShift specific files" - Updating generated files coming from kubernetes should be
<drop>
commit - Generated changes should never be mixed with non-generated changes. If a carry is ever seen to contain generated changes, those changes should be dropped.
The hyperkube dockerfile
hard-codes the Kubernetes version in an image label. It's necessary to manually
set this label to the new release tag. Prefix the commit summary with
UPSTREAM: <carry>: (squash)
and squash it before merging the rebase PR.
Once the commits are all picked from the previous rebase branch, and your PR is mostly ready, each of the following repositories need to be updated to depend on the upstream tag targeted by the rebase:
- https://github.com/openshift/api
- https://github.com/openshift/apiserver-library-go
- https://github.com/openshift/client-go
- https://github.com/openshift/library-go
Often these repositories are updated in parallel by other team members, so make sure to ask around before starting the work of bumping their dependencies.
Once the above repos have been updated to depend on the target release,
it will be necessary to update go.mod
to point to the appropriate revision
of these repos by running hack/pin-dependency.sh
for each of them and then running
hack/update-vendor.sh
(as per the upstream documentation).
Make sure to commit the result of a vendoring update with UPSTREAM: <drop>: bump(*)
.
If you have already bumped the dependencies to get the repo to compile,
don't forget to squash the commits before merging the PR.
The upstream hack/pin-dependency.sh
script only supports setting dependency
for the original repository. To pin to a fork branch that has not yet been
merged (i.e. to test a rebase ahead of shared library bumps having merged), the
following go mod
invocations are suggested:
go mod edit -replace github.com/openshift/<lib>=github.com/<username>/<lib>@SHA
go mod tidy && go mod vendor
Alternatively, you can edit go.mod
file manually with your favourite editor and use search&replace.
The names of upstream e2e tests are annotated according to the a set of declarative rules. These annotations are used to group tests into suites and to skip tests that are known not to be incompatible with some or all configurations of OpenShift.
When performing a rebase, it is important to review the rules to ensure they are still relevant:
- Ensure that
[Disabled:Alpha]
rules are appropriate for the current kube level. Alpha features that are not enabled by default should be targeted by this annotation to ensure that tests of those features are skipped. - Add new skips (along with a bz to track resolution) where e2e tests fail consistently.
Test failures representing major issues affecting cluster capability will generally need to be addressed before merge of the rebase PR, but minor issues (e.g. tests that fail to execute correctly but don't appear to reflect a regression in behavior) can often be skipped and addressed post-merge.
- Update generated files by running
make update
- This step depends on etcd being installed in the path, which can be
accomplished by running
hack/install-etcd.sh
. - Alternatively, run it in the same container as CI is using for build_root that already has the etcd at correct version
- This step depends on etcd being installed in the path, which can be
accomplished by running
podman run -it --rm -v $( pwd ):/go/k8s.io/kubernetes:Z --workdir=/go/k8s.io/kubernetes registry.ci.openshift.org/openshift/release:rhel-8-release-golang-1.16-openshift-4.8 make update OS_RUN_WITHOUT_DOCKER=yes
- Commit the resulting changes as
UPSTREAM: <drop>: make update
.
- Build the code with
make
- Test the code with
make test
- Where test failures are encountered and can't be trivially resolved, the spreadsheet can be used to track those failures to their resolution. The example spreadsheet should have a sheet that demonstrates this tracking.
- Where a test failure proves challenging to fix without specialized knowledge, make sure to coordinate with the team(s) responsible for area(s) of focus exhibiting test failure. If in doubt, ask for help!
- Verify the code with
make verify
Inevitably, a rebase will take long enough that new commits will end up being merged to the targeted openshift/kubernetes branch after the rebase is underway. The following strategy is suggested to minimize the cost of incorporating these new commits:
- rename existing rebase branch (e.g. 1.20.0-beta.2 -> 1.20.0-beta.2-old)
- create new rebase branch from HEAD of master
- merge the target upstream tag (e.g. 1.20.0-beta.2) with strategy ours
- pick all carries from renamed rebase branch (e.g. 1.20.0-beta.2-old)
- pick new carries from the openshift/kubernetes target branch
- add details of the new carries to the spreadsheet
- update generated files
With good tooling, the cost of this procedure should be ~10 minutes at most. Re-picking carries should not result in conflicts since the base of the rebase branch will be the same as before. The only potential sources of conflict will be the newly added commits.
Second email should be send close O(~3 days) to merging the bump:
Title: k8s <version> bump landing...
<URL to your o/k PR> is bumping k8s to version <version>.
The following repositories have been already bumped as well:
<URLs to api/client-go/library-go/apiserver-library-go/operators>
Followup work has been assigned to appropriate teams
through bugzillas linked in the code. Please treat
them as the highest priority after landing the bump.
Finally, this means we are blocking ALL PRs to our
kubernetes fork.
After sending the email block the merge queue, see below.
Close to merging a rebase it is good practice to block any merges to openshift/kubernetes fork. To do that follow these steps:
- Open new issues in openshift/kubernetes
- Use
Master Branch Frozen For Kubernetes Merging | branch:master
as issue title - Add
tide/merge-blocker
label to issues (you might need group lead for this) - All PR's (including the rebase) are now forbidden to merge to master branch
- Before landing the rebase PR, close this issue
Last email should be send after merging the bump as a reply to previous:
<URL to your o/k PR> just merged.
It'll take some time to get newer kublet, but in the meantime we'll
continue to monitor CI. I encourage everyone to hold off from
merging any major changes to our kubernetes fork to provide clear CI
signal for the next 2-3 days.
- Update cluster-kube-apiserver-operator
pre-release-lifecycle
alert'sremoved_release
version similarly to openshift/cluster-kube-apiserver-operator#1092.