v0.5.2 release
Changelog for v0.5.2
Highlights
The KubeRay 0.5.2 patch release includes the following improvements.
- Allow specifying the entire headService and serveService YAML spec. Previously, only certain special fields such as `labels` and `annotations` were exposed to the user.
  - Expose entire head pod Service to the user (#1040, @architkulkarni)
  - Exposing Serve Service (#1117, @kodwanis)
- RayService stability improvements
  - RayService object's Status is being updated due to frequent reconciliation (#1065, @kevin85421)
  - [RayService] Submit requests to the Dashboard after the head Pod is running and ready (#1074, @kevin85421)
  - Fix in HeadPod Service Generation logic which was causing frequent reconciliation (#1056, @msumitjain)
- Allow watching multiple namespaces
  - [Feature] Watch CR in multiple namespaces with namespaced RBAC resources (#1106, @kevin85421)
- Autoscaler stability improvements
  - [Bug] RayService restarts repeatedly with Autoscaler (#1037, @kevin85421)
  - [Bug] autoscaler not working properly in rayjob (#1064, @Yicheng-Lu-llll)
  - [Bug][Autoscaler] Operator does not remove workers (#1139, @kevin85421)
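As a rough illustration of the first highlight, the fragment below sketches what a user-supplied head Service might look like in a RayCluster manifest. It is a minimal sketch, not taken from the release itself: the field placement (`spec.headGroupSpec.headService`) is an assumption based on #1040, and the cluster name, label, annotation, and `rayStartParams` value are hypothetical placeholders.

```yaml
# Hypothetical RayCluster fragment; all names and values are illustrative.
apiVersion: ray.io/v1alpha1
kind: RayCluster
metadata:
  name: raycluster-example            # hypothetical name
spec:
  headGroupSpec:
    # Assumed placement: a full Kubernetes Service object for the head pod,
    # rather than only labels and annotations.
    headService:
      metadata:
        labels:
          team: ml-platform           # illustrative label
        annotations:
          example.com/owner: data     # illustrative annotation
      spec:
        type: ClusterIP               # any standard Service spec field could go here
    rayStartParams:
      dashboard-host: "0.0.0.0"
    template:
      spec:
        containers:
          - name: ray-head
            image: rayproject/ray:2.4.0
```

The Serve Service is assumed to be configurable in the same way through a `serveService` field on the RayService spec (#1117); check the CRD shipped with your KubeRay version before relying on either field.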
Contributors
We'd like to thank the following contributors to this release:
@ByronHsu, @Yicheng-Lu-llll, @anishasthana, @architkulkarni, @blublinsky, @chrisxstyles, @dirtyValera, @ecurtin, @jasoonn, @jjyao, @kevin85421, @kodwanis, @msumitjain, @oginskis, @psschwei, @scarlet25151, @sihanwang41, @tedhtchang, @varungup90, @xubo245
Features
- Add a flag to enable/disable worker init container injection (#1069, @ByronHsu)
- Add a warning to discourage users from launching a KubeRay-incompatible autoscaler. (#1102, @kevin85421)
- Add consistency check for deepcopy generated files (#1127, @varungup90)
- Add kubernetes dependency in python client library (#998, @jasoonn)
- Add support for pvcs to apiserver (#1118, @psschwei)
- Add support for tolerations, env, annotations and labels (#1070, @blublinsky)
- Align Init Container's ImagePullPolicy with Ray Container's ImagePullPolicy (#1080, @Yicheng-Lu-llll)
- Connect Ray client with TLS using Nginx Ingress on Kind cluster (#729) (#1051, @tedhtchang)
- Expose entire head pod Service to the user (#1040, @architkulkarni)
- Exposing Serve Service (#1117, @kodwanis)
- [Test] Add e2e test for sample RayJob yaml on kind (#935, @architkulkarni)
- Parametrize ray-operator makefile (#1121, @anishasthana)
- RayService object's Status is being updated due to frequent reconciliation (#1065, @kevin85421)
- [Feature] Support suspend in RayJob (#926, @oginskis)
- [Feature] Watch CR in multiple namespaces with namespaced RBAC resources (#1106, @kevin85421)
- [RayService] Submit requests to the Dashboard after the head Pod is running and ready (#1074, @kevin85421)
- feat: Rename instances of rayiov1alpha1 to rayv1alpha1 (#1112, @anishasthana)
- ray-operator: Reuse contexts across ray operator reconcilers (#1126, @anishasthana)
Fixes
- Fix CI (#1145, @kevin85421)
- Fix config frequent update (#1014, @sihanwang41)
- Fix for Sample YAML Config Test - 2.4.0 Failure due to 'suspend' Field (#1096, @Yicheng-Lu-llll)
- Fix in HeadPod Service Generation logic which was causing frequent reconciliation (#1056, @msumitjain)
- [Bug] Autoscaler doesn't support TLS (#1119, @chrisxstyles)
- [Bug] Enable ResourceQuota by adding Resources for the health-check init container (#1043, @kevin85421)
- [Bug] Fix null map handling in `BuildServiceForHeadPod` function (#1095, @architkulkarni)
- [Bug] RayService restarts repeatedly with Autoscaler (#1037, @kevin85421)
- [Bug] Service (Serve) changing port from 8000 to 9000 doesn't work (#1081, @kevin85421)
- [Bug] autoscaler not working properly in rayjob (#1064, @Yicheng-Lu-llll)
- [Bug] compatibility test for the nightly Ray image fails (#1055, @kevin85421)
- [Bug] rayStartParams is required at this moment. (#1031, @kevin85421)
- [Bug][Autoscaler] Operator does not remove workers (#1139, @kevin85421)
- [Bug][Doc] fix the link error of operator document (#1046, @xubo245)
- [Bug][GCS FT] Worker pods crash unexpectedly when gcs_server on head pod is killed (#1036, @kevin85421)
- [Bug][breaking change] Unauthorized 401 error on fetching Ray Custom Resources from K8s API server (#1128, @kevin85421)
- [Bug][k8s compatibility] k8s v1.20.7 ClusterIP svc do not updated under RayService (#1110, @kevin85421)
- [Helm][ray-cluster] Fix parsing envFrom field in additionalWorkerGroups (#1039, @dirtyValera)
Documentation
- [Doc] Copyedit dev guide (#1012, @architkulkarni)
- [Doc] Update nav to include missing files and reorganize nav (#1011, @architkulkarni)
- [Doc] Update version from 0.4.0 to 0.5.0 on remaining kuberay docs files (#1018, @architkulkarni)
- [Doc][Website] Update KubeRay introduction and fix layout issues (#1042, @kevin85421)
- [Docs][Website] One word typo fix in docs and README (#1068, @ecurtin)
- Add a document to outline the default settings for `rayStartParams` in Kuberay (#1057, @Yicheng-Lu-llll)
- Example Pod to connect Ray client to remote a Ray cluster with TLS enabled (#994, @tedhtchang)
- [Post release v0.5.0] Update CHANGELOG.md (#1026, @kevin85421)
- [Post release v0.5.0] Update release doc (#1028, @kevin85421)
- [Post Ray 2.4 Release] Update Ray versions to Ray 2.4.0 (#1049, @jjyao)
- [Post release v0.5.0] Remove block from rayStartParams (#1015, @kevin85421)
- [Post release v0.5.0] Remove block from rayStartParams for python client and KubeRay operator tests (#1050, @Yicheng-Lu-llll)
- [Post release v0.5.0] Remove serviceType (#1013, @kevin85421)
- [Post v0.5.0] Remove init containers from YAML files (#1010, @kevin85421)
- [Sample YAML] Bump ray version in pod security YAML to 2.4.0 (#1160) (#1161, @architkulkarni)
- Kuberay 0.5.0 docs validation update docs for GCS FT (#1004, @scarlet25151)
- Release v0.5.0 doc validation (#997, @kevin85421)
- Release v0.5.0 doc validation part 2 (#999, @architkulkarni)
- Release v0.5.0 python client library validation (#1006, @jasoonn)
- [release v0.5.2] Update tags and versions to 0.5.2 (#1159, @architkulkarni)