e80d7c9 | avelichk | 21 October 2020, 18:53:31 UTC | Add ttl seconds after finished | 21 October 2020, 18:53:31 UTC |
b7a46ed | avelichk | 20 October 2020, 12:12:17 UTC | Remove Katib client changes | 20 October 2020, 12:12:17 UTC |
729f6da | avelichk | 20 October 2020, 12:03:53 UTC | Remove v1alpha3 tests | 20 October 2020, 12:03:53 UTC |
fea3383 | avelichk | 20 October 2020, 11:00:24 UTC | Create Kubeflow namespace | 20 October 2020, 11:00:24 UTC |
e407aa3 | avelichk | 20 October 2020, 09:53:16 UTC | Deploy TF and PyTorch controllers | 20 October 2020, 09:53:16 UTC |
9898db2 | avelichk | 20 October 2020, 02:50:57 UTC | Return exp in case of error | 20 October 2020, 02:50:57 UTC |
61b3995 | avelichk | 20 October 2020, 01:50:06 UTC | Fix template name | 20 October 2020, 01:50:06 UTC |
6289e52 | avelichk | 20 October 2020, 01:14:15 UTC | Remove bin | 20 October 2020, 01:14:15 UTC |
13c178c | avelichk | 20 October 2020, 01:13:54 UTC | Add other e2e tests | 20 October 2020, 01:13:54 UTC |
7c89cbf | avelichk | 20 October 2020, 00:09:37 UTC | Set kube config | 20 October 2020, 00:09:37 UTC |
f128950 | avelichk | 19 October 2020, 23:33:43 UTC | Remove GCP auth | 19 October 2020, 23:33:43 UTC |
f084630 | avelichk | 19 October 2020, 23:27:17 UTC | Print ns list | 19 October 2020, 23:27:17 UTC |
bf9f744 | avelichk | 19 October 2020, 23:11:57 UTC | manually create experiment | 19 October 2020, 23:11:57 UTC |
fb68b6d | avelichk | 19 October 2020, 22:55:21 UTC | Build binary e2e | 19 October 2020, 22:55:21 UTC |
89be584 | avelichk | 19 October 2020, 22:47:58 UTC | Print known types | 19 October 2020, 22:47:58 UTC |
16ef99d | avelichk | 19 October 2020, 22:17:12 UTC | Set TypeMeta for experiment | 19 October 2020, 22:17:12 UTC |
ced7095 | avelichk | 19 October 2020, 22:07:42 UTC | Trigger CI | 19 October 2020, 22:07:42 UTC |
0b1ef6f | avelichk | 19 October 2020, 21:26:33 UTC | Show known types | 19 October 2020, 21:26:33 UTC |
346084c | avelichk | 19 October 2020, 20:51:49 UTC | Print CRDs in e2e Experiment | 19 October 2020, 20:51:49 UTC |
a848c4f | avelichk | 19 October 2020, 20:12:58 UTC | Fix Katib path | 19 October 2020, 20:12:58 UTC |
ebd7e0d | avelichk | 19 October 2020, 19:46:12 UTC | Add github to src folder | 19 October 2020, 19:46:12 UTC |
8c544c2 | avelichk | 19 October 2020, 19:18:19 UTC | Add github.com to folder | 19 October 2020, 19:18:19 UTC |
f2c78e6 | avelichk | 19 October 2020, 18:35:22 UTC | Add backoff | 19 October 2020, 18:35:22 UTC |
04a11b6 | avelichk | 19 October 2020, 17:21:55 UTC | Fix run e2e go path | 19 October 2020, 17:21:55 UTC |
137f49b | avelichk | 19 October 2020, 17:13:46 UTC | Fix region | 19 October 2020, 17:13:46 UTC |
2196cc0 | avelichk | 19 October 2020, 16:43:52 UTC | Change command | 19 October 2020, 16:43:52 UTC |
9928bce | avelichk | 19 October 2020, 16:06:56 UTC | Remove | 19 October 2020, 16:06:56 UTC |
01618b4 | avelichk | 19 October 2020, 15:53:08 UTC | Fix path to valid exp | 19 October 2020, 15:53:08 UTC |
ae827b5 | avelichk | 19 October 2020, 15:23:01 UTC | Move create cluster to build | 19 October 2020, 15:23:01 UTC |
a0bf0bf | avelichk | 19 October 2020, 13:31:10 UTC | Change deploy | 19 October 2020, 13:31:10 UTC |
ce7a413 | avelichk | 19 October 2020, 03:57:38 UTC | Fix path | 19 October 2020, 03:57:38 UTC |
83cbecd | avelichk | 19 October 2020, 03:23:41 UTC | Change make deploy | 19 October 2020, 03:23:41 UTC |
e9db92f | avelichk | 19 October 2020, 02:53:39 UTC | Fix path for NAS suggestions | 19 October 2020, 02:53:39 UTC |
b78c5b1 | avelichk | 19 October 2020, 02:51:53 UTC | Attach volume to create and delete cluster | 19 October 2020, 02:51:53 UTC |
6b13d51 | avelichk | 19 October 2020, 02:40:29 UTC | Get other build for all images | 19 October 2020, 02:40:29 UTC |
b595b43 | avelichk | 19 October 2020, 02:05:52 UTC | Add AWS cred | 19 October 2020, 02:05:52 UTC |
55701eb | avelichk | 19 October 2020, 01:54:28 UTC | Delete v1alpha3 workflow from prow Add ECR env | 19 October 2020, 01:54:28 UTC |
e8444c3 | avelichk | 19 October 2020, 00:45:15 UTC | Comment creds | 19 October 2020, 00:45:15 UTC |
28eaab8 | avelichk | 19 October 2020, 00:24:16 UTC | Add AWS creds to env | 19 October 2020, 00:24:16 UTC |
5257b90 | avelichk | 17 October 2020, 02:38:20 UTC | Test without folder for GOPATH | 17 October 2020, 02:38:20 UTC |
8c9cb49 | avelichk | 16 October 2020, 22:48:58 UTC | Fix delete cluster | 16 October 2020, 22:48:58 UTC |
aa7b47b | avelichk | 16 October 2020, 22:46:15 UTC | Fix cluster name | 16 October 2020, 22:46:15 UTC |
e7b2237 | avelichk | 16 October 2020, 22:43:21 UTC | Replace create and delete cluster with testing scripts | 16 October 2020, 22:43:21 UTC |
0e38b84 | avelichk | 16 October 2020, 21:33:43 UTC | Refactor e2e test script | 16 October 2020, 21:33:43 UTC |
1058b66 | avelichk | 16 October 2020, 02:17:24 UTC | Change worker image | 16 October 2020, 02:17:24 UTC |
faa7f4c | avelichk | 16 October 2020, 02:03:38 UTC | Remove comment from resume e2e | 16 October 2020, 02:03:38 UTC |
a1bbaa1 | avelichk | 16 October 2020, 01:59:54 UTC | Add changes for AWS test infra | 16 October 2020, 01:59:54 UTC |
85fc7d0 | Andrey Velichkevich | 13 October 2020, 13:02:27 UTC | Enhancement for Custom CRD (#1333) * Init commit * Modify Insert function Add retry on empty observation * Fix mutate volume test * Fix validate experiment test * Fix invalid experiment * Don't get deployed job status when trial is completed * Not send Trial with unavailable metrics to Suggestion * Refactor requeue If objective metric value is not reported metrics collector reports unavailable value to the DB Controller reconciles Trial until DB is empty * Add condition before change trial status * Remove prints * Fix tfevent parser | 13 October 2020, 13:02:27 UTC |
6a07daa | Andrey Velichkevich | 18 September 2020, 03:02:45 UTC | Add trial metadata substitution example (#1319) * Add trial metadata example * Change description * Add istio sidecar false to annotation | 18 September 2020, 03:02:45 UTC |
6aa4ec9 | Kyle Hersey | 17 September 2020, 22:44:45 UTC | fix(metrics-collector): allow user to nuke ephemeral-storage requests (#1312) * fix(metrics-collector): allow user to nuke ephemeral-storage requests * chore(gofmt): fan formatting * chore(gofmt): undo formatting on auto generated api.pb files | 17 September 2020, 22:44:45 UTC |
74e6e5b | Andrey Velichkevich | 17 September 2020, 02:38:46 UTC | feat: Ignore pb files in update gofmt (#1340) Update travis nvm version to 12.18.1 Ignore .pb files in gofmt | 17 September 2020, 02:38:46 UTC |
721a382 | Andrey Velichkevich | 15 September 2020, 09:42:07 UTC | Upload python SDK version (#1335) * Upload 0.0.4 SDK version * Fix doc links * Fix links in README * Modify tables * Fix link * Remove * Modify client and gen script * Update version to 0.0.5 * Run CI * Add Katib client to init | 15 September 2020, 09:42:07 UTC |
4b11f80 | Andrey Velichkevich | 15 September 2020, 02:24:07 UTC | Add SDK examples for v1beta1 (#1337) * Add SDK examples for v1beta1 * Modify import | 15 September 2020, 02:24:07 UTC |
e99c77d | Andrey Velichkevich | 14 September 2020, 15:34:58 UTC | Run post-submit image build in kubeflow-ci project (#1326) * Change registry for presubmit * Add prow_config to workflows * Add project to gcloud auth * Test manager * Add kubeflow-ci project for build in post-submit | 14 September 2020, 15:34:58 UTC |
6b7142f | Andrey Velichkevich | 09 September 2020, 06:57:52 UTC | Custom CRD: Wait for all processes before running metrics collector (#1313) * Enable to wait all in metrics collectors * Rename metricsFilePath * Fix tfevent * Fix pns py * Fix comment | 09 September 2020, 06:57:52 UTC |
7b797e1 | Andrey Velichkevich | 08 September 2020, 21:45:52 UTC | Custom CRD: Support dynamic Trial's jobs conditions (#1307) * Custom Job conditions implementation * Fix prints * Fix status * Fix test * Clean event msg * Run gofmt * Fix few comments * Generate clients * Fix comment * Add newline | 08 September 2020, 21:45:52 UTC |
2580186 | Andrey Velichkevich | 08 September 2020, 19:01:52 UTC | Custom CRD: Add primary container name (#1308) * Add primary container name * Resolve * Generate clients * Add newline | 08 September 2020, 19:01:52 UTC |
fc8d522 | Xu Xiao | 05 September 2020, 16:25:41 UTC | [Adopters] change adopter of Ant Group (#1327) | 05 September 2020, 16:25:41 UTC |
d5c5e95 | Andrey Velichkevich | 04 September 2020, 20:47:42 UTC | Update generate script with SDK (#1323) * Update generate script * Capitalise API * Remove verbose | 04 September 2020, 20:47:42 UTC |
a072156 | Andrey Velichkevich | 04 September 2020, 15:31:41 UTC | Switch test from kubeflow-ci to automl-ci project. (#1321) * Change project to automl-ci * Change registry to automl-ci for presubmit * Add cluster role to sa * Add print * Modify user * Remove user change * Add separate scripts to build metrics-collectors * Move govaralls to after success * Update doc * Trigger CI | 04 September 2020, 15:31:41 UTC |
cba4560 | Andrey Velichkevich | 03 September 2020, 14:09:41 UTC | Fix Pod's ownership to inject metrics collector (#1303) * Refactor get Katib job * Get trial after func * Remove trialName * return error * Remove error * Resolve | 03 September 2020, 14:09:41 UTC |
36aef5f | Andrey Velichkevich | 03 September 2020, 10:59:41 UTC | Fix problem with Hyperopt Out of Range error (#1315) | 03 September 2020, 10:59:41 UTC |
d58b6a1 | Andrey Velichkevich | 03 September 2020, 02:41:40 UTC | Custom CRD: Add primary pod labels (#1305) * Add primary pod labels * Generate swagger * Generate SDK * Trigger CI | 03 September 2020, 02:41:40 UTC |
ef6557a | Andrey Velichkevich | 02 September 2020, 15:13:06 UTC | Custom CRD: Set dynamic watch from controller flags (#1302) | 02 September 2020, 15:13:06 UTC |
ced8496 | Andrey Velichkevich | 02 September 2020, 13:49:07 UTC | Fix restart check in controller for completed experiments (#1306) * Add check for experiment restart in controller * Change comment | 02 September 2020, 13:49:07 UTC |
0b7a5f2 | Andrey Velichkevich | 01 September 2020, 13:03:50 UTC | Update CI test cluster version to 1.16 (#1316) * Update CI cluster version to 1.16 * Add retry strategy * Remove backoff | 01 September 2020, 13:03:50 UTC |
2ceed7d | Andrey Velichkevich | 19 August 2020, 11:05:11 UTC | Update docs for v1beta1 SDK (#1304) * Update docs for v1beta1 SDK * Fix samples in v1alpha3 | 19 August 2020, 11:05:11 UTC |
1d18594 | Xu Xiao | 18 August 2020, 10:11:32 UTC | [python sdk] add v1beta1 models (#1252) * [python sdk] add v1beta1 models * upgrade version of python SDK to 0.0.3 * remove v1Alpha3 python sdk * add some python models manually: v1Time and V1UnstructuredUnstructured * bring back v1alpha3 * create separate python sdk for v1alpha3 and v1beta1 * move on * release pkg on pypi.org * remove dist files * refine | 18 August 2020, 10:11:32 UTC |
051d1de | Andrey Velichkevich | 14 August 2020, 15:20:21 UTC | Proposal: Support custom CRD in Trial Job (#1273) * Add proposal for custom CRD in Trial Template * Fix * Modify doctoc * Doc fixes * Rename header * Fixes * Change doc * Remove comma * Fix Implementation | 14 August 2020, 15:20:21 UTC |
282b71e | Andrey Velichkevich | 12 August 2020, 12:03:45 UTC | Support volume settings in Katib config (#1291) * Support volume settings in config * Set default path | 12 August 2020, 12:03:45 UTC |
77dd34e | Andrey Velichkevich | 11 August 2020, 06:06:16 UTC | Refactor Trial controller unit test (#1299) * Refactor Trial controller unit test Move prometheus to util * Change import | 11 August 2020, 06:06:16 UTC |
cca0358 | Andrey Velichkevich | 10 August 2020, 19:43:59 UTC | Use Logger in suggestion controller util (#1298) * add-logger-suggestion-controller-util * Change log message | 10 August 2020, 19:43:59 UTC |
2c4ad15 | Andrey Velichkevich | 06 August 2020, 19:07:42 UTC | Log update object status error (#1297) * Info instead of Error when update status is failed * Log generate name instead of name * Test. Expect that suggestion is succeeded when experiment is updating | 06 August 2020, 19:07:42 UTC |
33832a7 | Andrey Velichkevich | 06 August 2020, 12:32:55 UTC | Verify that Trials were successfully deleted (#1288) * Verify that trials were deleted Update suggestion status * Update suggestion requests * Fix tests * Fix comment * Add recorder to test controllers * Travis test * Change resume exp trial condition * Modify e2e for from volume experiment * Fix IsRestarting check * Fix comment | 06 August 2020, 12:32:55 UTC |
88eb798 | Andrey Velichkevich | 06 August 2020, 01:20:54 UTC | Set number of epochs to decrease e2e tests time (#1290) * Add epoch for mnist e2e examples * Replace batch-size with epochs * Remove file * Remove changes from hyperband * Remove epoch from pytorch | 06 August 2020, 01:20:54 UTC |
9cf4544 | Andrey Velichkevich | 03 August 2020, 18:43:42 UTC | Unit test for resuming Experiment in controller reconcilers (#1281) * Add unit test for Experiment and Suggestion controller reconcile * Delete buildTrialMetaForRunSpec from controller * Modify condition check * Fix format * Run mock for new version * Refactor experiment reconcile test * Remove comment | 03 August 2020, 18:43:42 UTC |
329b22e | Andrey Velichkevich | 03 August 2020, 11:21:40 UTC | Validate restart Experiment parameters (#1287) * Validate resume experiment in webhook * Fix restart check * Fix test | 03 August 2020, 11:21:40 UTC |
6b9e914 | Andrey Velichkevich | 31 July 2020, 16:11:07 UTC | Get metrics collector config data refactor (#1285) * Refactor get metrics collector config Fix PV name in validation webhook * Add test in validation for Katib config | 31 July 2020, 16:11:07 UTC |
ce89cbf | Ce Gao | 31 July 2020, 10:35:07 UTC | feat(experiment): Add a check before deletion (#1223) * feat(experiment): Add a check before deletion Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Delete all trials Signed-off-by: Ce Gao <gaoce@caicloud.io> * feat: Implement in v1beta1 Signed-off-by: Ce Gao <gaoce@caicloud.io> | 31 July 2020, 10:35:07 UTC |
ac1dc24 | Andrey Velichkevich | 31 July 2020, 01:27:06 UTC | Add e2e test for FromVolume ResumePolicy (#1284) * Add e2e test for from volume resume * Resume experiment after completion * Print controller logs * Remove test prints * Remove controller logs | 31 July 2020, 01:27:06 UTC |
a42d8a9 | Andrey Velichkevich | 30 July 2020, 17:28:31 UTC | Refactor suggestion config and add Composer unit test (#1282) * Init commit * Add test for deployment * Refactor suggestion config * Switch mock to 1.4.3 version * Fix empty map * Fix comments * Fix gofmt * Move package | 30 July 2020, 17:28:31 UTC |
c33da9d | Xu Xiao | 27 July 2020, 17:24:17 UTC | support trial meta injection in trial template rendering (#1259) * support trial meta injection in trial template rendering * use trialSpec.metadata as prefix of trialMeta reference * solve conflicts * add some comments on consts * apply gofmt * refactor * fix mock test * refine | 27 July 2020, 17:24:17 UTC |
9a7d43c | Andrey Velichkevich | 27 July 2020, 15:32:18 UTC | Resume Experiment from Volume (#1275) * Resume experiment from the PV * Add comment * Remove old api comments * Change reason for Running suggestion * Fix few comments * Rename volume name like suggestion deployment * Add corev1 to const | 27 July 2020, 15:32:18 UTC |
27658a7 | Andrey Velichkevich | 25 July 2020, 01:36:16 UTC | GRPC: Rename Manager to DBManager service (#1279) * Rename Manager to DBManager in gRPC * Update git ignore | 25 July 2020, 01:36:16 UTC |
9320b4e | Andrey Velichkevich | 23 July 2020, 02:29:39 UTC | Add status to experiment CRD manifest (#1276) | 23 July 2020, 02:29:39 UTC |
ac091db | Andrey Velichkevich | 20 July 2020, 01:26:50 UTC | Fix few API comments typos (#1274) * Fix few typos in API comments * Generate open API | 20 July 2020, 01:26:50 UTC |
b5465bd | Elias Koromilas | 16 July 2020, 21:35:03 UTC | Add FPGA accelerated examples (#1269) * Add instructions for FPGA accelerated Experiments * XGBoost FPGA accelerated example * Omit unnecessary quotes * Add the new example in the list of training container images * Omit explicit declaration of the metrics collector Co-authored-by: Vaggelis Gkiastas <vaggelisgkia@hotmail.com> | 16 July 2020, 21:35:03 UTC |
f565047 | Andrey Velichkevich | 16 July 2020, 02:38:34 UTC | Modify documentation for v1beta1 (#1267) * Change doc for v1beta1 * Fix | 16 July 2020, 02:38:34 UTC |
c199867 | Andrey Velichkevich | 16 July 2020, 01:24:35 UTC | Add e2e test for DARTS (#1268) * Add e2e for darts * Remove todo | 16 July 2020, 01:24:35 UTC |
50fc911 | Andrey Velichkevich | 15 July 2020, 01:32:37 UTC | UI: Add new ConfigMap with Trial Templates (#1265) * Add new configMap with Trial Templates * Enable to view all namespaces * Fix log | 15 July 2020, 01:32:37 UTC |
cbe0f40 | Vaclav Pavlin | 13 July 2020, 13:50:32 UTC | Fix examples to run on OpenShift (#1241) | 13 July 2020, 13:50:32 UTC |
226c99c | Andrey Velichkevich | 10 July 2020, 01:56:36 UTC | UI: Add Trial table pages (#1262) | 10 July 2020, 01:56:36 UTC |
f1393b9 | Andrey Velichkevich | 10 July 2020, 01:16:35 UTC | UI: Delete ConfigMap with no Trial Templates (#1260) * Delete ConfigMap if there are no Trial Templates Add snack box for Templates * Not add empty ConfigMaps * Fix e2e test | 10 July 2020, 01:16:35 UTC |
4145c4f | Andrey Velichkevich | 08 July 2020, 01:29:08 UTC | Fix paths in prow config (#1257) | 08 July 2020, 01:29:08 UTC |
42dbb56 | Andrey Velichkevich | 07 July 2020, 01:53:57 UTC | UI: Update Material UI version to V4 (#1254) * Init commit for material UI v4 * Rebase * Add label to all selects * Remove changes from v1alpha3 | 07 July 2020, 01:53:57 UTC |
c3b38d8 | Anton Kirillov | 06 July 2020, 18:13:57 UTC | Adding retries for gRPC calls (#1248) | 06 July 2020, 18:13:57 UTC |
2e9d676 | Andrey Velichkevich | 06 July 2020, 17:24:55 UTC | UI: Remove update button from Experiments view page (#1253) * Update experiments without button Move monitor components to common * Move fetch experiments to common rename job to experiment * Modify vars | 06 July 2020, 17:24:55 UTC |
ea1bb06 | Andrey Velichkevich | 03 July 2020, 01:28:47 UTC | UI: Sorting for Trials information table (#1251) * Enable sort in Trials table * Remove console log * Modify cell width | 03 July 2020, 01:28:47 UTC |
29b797c | Andrey Velichkevich | 30 June 2020, 12:38:06 UTC | String type for metric values (#1245) * Change metric type from float to string * Check unavailable latest objective metric in isTrialObservationAvailable * Fix e2e test * Delete consts.UnavailableMetricValue from e2e | 30 June 2020, 12:38:06 UTC |
f7cea41 | Hong Xu | 30 June 2020, 02:05:58 UTC | Add hints to obtain Kubeflow and Minikube version (#1230) | 30 June 2020, 02:05:58 UTC |