Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[WIP] Update to CustomResourceDefinition instead of ThirdPartyResource. #20

Merged
merged 9 commits into from
Aug 20, 2017

Conversation

jlewi
Copy link
Contributor

@jlewi jlewi commented Aug 17, 2017

No description provided.

Jeremy Lewi added 2 commits August 16, 2017 18:24
* Delete WaitForTPR; CRD has new methods for handling the waiting.
* Create the CRD resource.
Copy link

@enisoc enisoc left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

TPRGroup = "mlkube.io"
TPRVersion = "v1beta1"
TPRDescription = "TensorFlow training job"
CRDKind = "tf-job"
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do you still use this anywhere? If this means Kind in the k8s API sense of Group-Version-Kind, then it should be camel case.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks; you're right we don't use it anymore. The only thing I specify is the PluralKind which I specify as "tfjobs". This ends up producing

spec:
group: mlkube.io
names:
kind: TfJob
listKind: TfJobList
plural: tfjobs
singular: tfjob
scope: Namespaced
version: v1beta1
status:
acceptedNames:
kind: TfJob
listKind: TfJobList
plural: tfjobs
singular: tfjob

This looks correct; i.e. the plural name are all lower case no hypens which follows the CronTab example in the docs
https://kubernetes.io/docs/tasks/access-kubernetes-api/extend-api-custom-resource-definitions/

Jeremy Lewi added 6 commits August 17, 2017 18:20
* Fix bad import that was breaking the build.
* Looks like E2E test is having a problem creating the TFJob.
* Tag the image with the git commit to ensure unique names.
* Update the helm test to get the image from a value.
* We need to set Kind and ApiVersion when creating the request otherwise the ApiServer will reject the request with a 400 error.

* Update tpr_util to use the helper methods to construct the URI as opposed to
constructing the URI manually.

* Add some logging to the E2E test to make it easier to debug.
…ce to

  update.
* Update readme; replace references to TPR with CRD.

* E2E test is still failing because the PS and Master are crashing because
  we try to run the operator as the program.
… the

  operator.

* Attempt to fix travis by removing some of the vendor libraries.
@jlewi jlewi merged commit 880d79f into master Aug 20, 2017
@jlewi jlewi deleted the crd branch August 21, 2017 13:09
Syulin7 added a commit to Syulin7/training-operator that referenced this pull request Jul 17, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants