https://github.com/kubeflow/katib

Revision Message Commit Date
e80d7c9 Add ttl seconds after finished 21 October 2020, 18:53:31 UTC
b7a46ed Remove Katib client changes 20 October 2020, 12:12:17 UTC
729f6da Remove v1alpha3 tests 20 October 2020, 12:03:53 UTC
fea3383 Create Kubeflow namespace 20 October 2020, 11:00:24 UTC
e407aa3 Deploy TF and PyTorch controllers 20 October 2020, 09:53:16 UTC
9898db2 Return exp in case of error 20 October 2020, 02:50:57 UTC
61b3995 Fix template name 20 October 2020, 01:50:06 UTC
6289e52 Remove bin 20 October 2020, 01:14:15 UTC
13c178c Add other e2e tests 20 October 2020, 01:13:54 UTC
7c89cbf Set kube config 20 October 2020, 00:09:37 UTC
f128950 Remove GCP auth 19 October 2020, 23:33:43 UTC
f084630 Print ns list 19 October 2020, 23:27:17 UTC
bf9f744 manually create experiment 19 October 2020, 23:11:57 UTC
fb68b6d Build binary e2e 19 October 2020, 22:55:21 UTC
89be584 Print known types 19 October 2020, 22:47:58 UTC
16ef99d Set TypeMeta for experiment 19 October 2020, 22:17:12 UTC
ced7095 Trigger CI 19 October 2020, 22:07:42 UTC
0b1ef6f Show known types 19 October 2020, 21:26:33 UTC
346084c Print CRDs in e2e Experiment 19 October 2020, 20:51:49 UTC
a848c4f Fix Katib path 19 October 2020, 20:12:58 UTC
ebd7e0d Add github to src folder 19 October 2020, 19:46:12 UTC
8c544c2 Add github.com to folder 19 October 2020, 19:18:19 UTC
f2c78e6 Add backoff 19 October 2020, 18:35:22 UTC
04a11b6 Fix run e2e go path 19 October 2020, 17:21:55 UTC
137f49b Fix region 19 October 2020, 17:13:46 UTC
2196cc0 Change command 19 October 2020, 16:43:52 UTC
9928bce Remove 19 October 2020, 16:06:56 UTC
01618b4 Fix path to valid exp 19 October 2020, 15:53:08 UTC
ae827b5 Move create cluster to build 19 October 2020, 15:23:01 UTC
a0bf0bf Change deploy 19 October 2020, 13:31:10 UTC
ce7a413 Fix path 19 October 2020, 03:57:38 UTC
83cbecd Change make deploy 19 October 2020, 03:23:41 UTC
e9db92f Fix path for NAS suggestions 19 October 2020, 02:53:39 UTC
b78c5b1 Attach volume to create and delete cluster 19 October 2020, 02:51:53 UTC
6b13d51 Get other build for all images 19 October 2020, 02:40:29 UTC
b595b43 Add AWS cred 19 October 2020, 02:05:52 UTC
55701eb Delete v1alpha3 workflow from prow Add ECR env 19 October 2020, 01:54:28 UTC
e8444c3 Comment creds 19 October 2020, 00:45:15 UTC
28eaab8 Add AWS creds to env 19 October 2020, 00:24:16 UTC
5257b90 Test without folder for GOPATH 17 October 2020, 02:38:20 UTC
8c9cb49 Fix delete cluster 16 October 2020, 22:48:58 UTC
aa7b47b Fix cluster name 16 October 2020, 22:46:15 UTC
e7b2237 Replace create and delete cluster with testing scripts 16 October 2020, 22:43:21 UTC
0e38b84 Refactor e2e test script 16 October 2020, 21:33:43 UTC
1058b66 Change worker image 16 October 2020, 02:17:24 UTC
faa7f4c Remove comment from resume e2e 16 October 2020, 02:03:38 UTC
a1bbaa1 Add changes for AWS test infra 16 October 2020, 01:59:54 UTC
85fc7d0 Enhancement for Custom CRD (#1333) * Init commit * Modify Insert function Add retry on empty observation * Fix mutate volume test * Fix validate experiment test * Fix invalid experiment * Don't get deployed job status when trial is completed * Not send Trial with unavailable metrics to Suggestion * Refactor requeue If objective metric value is not reported metrics collector reports unavailable value to the DB Controller reconciles Trial until DB is empty * Add condition before change trial status * Remove prints * Fix tfevent parser 13 October 2020, 13:02:27 UTC
6a07daa Add trial metadata substitution example (#1319) * Add trial metadata example * Change description * Add istio sidecar false to annotation 18 September 2020, 03:02:45 UTC
6aa4ec9 fix(metrics-collector): allow user to nuke ephemeral-storage requests (#1312) * fix(metrics-collector): allow user to nuke ephemeral-storage requests * chore(gofmt): fan formatting * chore(gofmt): undo formatting on auto generated api.pb files 17 September 2020, 22:44:45 UTC
74e6e5b feat: Ignore pb files in update gofmt (#1340) Update travis nvm version to 12.18.1 Ignore .pb files in gofmt 17 September 2020, 02:38:46 UTC
721a382 Upload python SDK version (#1335) * Upload 0.0.4 SDK version * Fix doc links * Fix links in README * Modify tables * Fix link * Remove * Modify client and gen script * Update version to 0.0.5 * Run CI * Add Katib client to init 15 September 2020, 09:42:07 UTC
4b11f80 Add SDK examples for v1beta1 (#1337) * Add SDK examples for v1beta1 * Modify import 15 September 2020, 02:24:07 UTC
e99c77d Run post-submit image build in kubeflow-ci project (#1326) * Change registry for presubmit * Add prow_config to workflows * Add project to gcloud auth * Test manager * Add kubeflow-ci project for build in post-submit 14 September 2020, 15:34:58 UTC
6b7142f Custom CRD: Wait for all processes before running metrics collector (#1313) * Enable to wait all in metrics collectors * Rename metricsFilePath * Fix tfevent * Fix pns py * Fix comment 09 September 2020, 06:57:52 UTC
7b797e1 Custom CRD: Support dynamic Trial's jobs conditions (#1307) * Custom Job conditions implementation * Fix prints * Fix status * Fix test * Clean event msg * Run gofmt * Fix few comments * Generate clients * Fix comment * Add newline 08 September 2020, 21:45:52 UTC
2580186 Custom CRD: Add primary container name (#1308) * Add primary container name * Resolve * Generate clients * Add newline 08 September 2020, 19:01:52 UTC
fc8d522 [Adopters] change adopter of Ant Group (#1327) 05 September 2020, 16:25:41 UTC
d5c5e95 Update generate script with SDK (#1323) * Update generate script * Capitalise API * Remove verbose 04 September 2020, 20:47:42 UTC
a072156 Switch test from kubeflow-ci to automl-ci project. (#1321) * Change project to automl-ci * Change registry to automl-ci for presubmit * Add cluster role to sa * Add print * Modify user * Remove user change * Add separate scripts to build metrics-collectors * Move govaralls to after success * Update doc * Trigger CI 04 September 2020, 15:31:41 UTC
cba4560 Fix Pod's ownership to inject metrics collector (#1303) * Refactor get Katib job * Get trial after func * Remove trialName * return error * Remove error * Resolve 03 September 2020, 14:09:41 UTC
36aef5f Fix problem with Hyperopt Out of Range error (#1315) 03 September 2020, 10:59:41 UTC
d58b6a1 Custom CRD: Add primary pod labels (#1305) * Add primary pod labels * Generate swagger * Generate SDK * Trigger CI 03 September 2020, 02:41:40 UTC
ef6557a Custom CRD: Set dynamic watch from controller flags (#1302) 02 September 2020, 15:13:06 UTC
ced8496 Fix restart check in controller for completed experiments (#1306) * Add check for experiment restart in controller * Change comment 02 September 2020, 13:49:07 UTC
0b7a5f2 Update CI test cluster version to 1.16 (#1316) * Update CI cluster version to 1.16 * Add retry strategy * Remove backoff 01 September 2020, 13:03:50 UTC
2ceed7d Update docs for v1beta1 SDK (#1304) * Update docs for v1beta1 SDK * Fix samples in v1alpha3 19 August 2020, 11:05:11 UTC
1d18594 [python sdk] add v1beta1 models (#1252) * [python sdk] add v1beta1 models * upgrade version of python SDK to 0.0.3 * remove v1Alpha3 python sdk * add some python models manually: v1Time and V1UnstructuredUnstructured * bring back v1alpha3 * create separate python sdk for v1alpha3 and v1beta1 * move on * release pkg on pypi.org * remove dist files * refine 18 August 2020, 10:11:32 UTC
051d1de Proposal: Support custom CRD in Trial Job (#1273) * Add proposal for custom CRD in Trial Template * Fix * Modify doctoc * Doc fixes * Rename header * Fixes * Change doc * Remove comma * Fix Implementation 14 August 2020, 15:20:21 UTC
282b71e Support volume settings in Katib config (#1291) * Support volume settings in config * Set default path 12 August 2020, 12:03:45 UTC
77dd34e Refactor Trial controller unit test (#1299) * Refactor Trial controller unit test Move prometheus to util * Change import 11 August 2020, 06:06:16 UTC
cca0358 Use Logger in suggestion controller util (#1298) * add-logger-suggestion-controller-util * Change log message 10 August 2020, 19:43:59 UTC
2c4ad15 Log update object status error (#1297) * Info instead of Error when update status is failed * Log generate name instead of name * Test. Expect that suggestion is succeeded when experiment is updating 06 August 2020, 19:07:42 UTC
33832a7 Verify that Trials were successfully deleted (#1288) * Verify that trials were deleted Update suggestion status * Update suggestion requests * Fix tests * Fix comment * Add recorder to test controllers * Travis test * Change resume exp trial condition * Modify e2e for from volume experiment * Fix IsRestarting check * Fix comment 06 August 2020, 12:32:55 UTC
88eb798 Set number of epochs to decrease e2e tests time (#1290) * Add epoch for mnist e2e examples * Replace batch-size with epochs * Remove file * Remove changes from hyperband * Remove epoch from pytorch 06 August 2020, 01:20:54 UTC
9cf4544 Unit test for resuming Experiment in controller reconcilers (#1281) * Add unit test for Experiment and Suggestion controller reconcile * Delete buildTrialMetaForRunSpec from controller * Modify condition check * Fix format * Run mock for new version * Refactor experiment reconcile test * Remove comment 03 August 2020, 18:43:42 UTC
329b22e Validate restart Experiment parameters (#1287) * Validate resume experiment in webhook * Fix restart check * Fix test 03 August 2020, 11:21:40 UTC
6b9e914 Get metrics collector config data refactor (#1285) * Refactor get metrics collector config Fix PV name in validation webhook * Add test in validation for Katib config 31 July 2020, 16:11:07 UTC
ce89cbf feat(experiment): Add a check before deletion (#1223) * feat(experiment): Add a check before deletion Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Delete all trials Signed-off-by: Ce Gao <gaoce@caicloud.io> * feat: Implement in v1beta1 Signed-off-by: Ce Gao <gaoce@caicloud.io> 31 July 2020, 10:35:07 UTC
ac1dc24 Add e2e test for FromVolume ResumePolicy (#1284) * Add e2e test for from volume resume * Resume experiment after completion * Print controller logs * Remove test prints * Remove controller logs 31 July 2020, 01:27:06 UTC
a42d8a9 Refactor suggestion config and add Composer unit test (#1282) * Init commit * Add test for deployment * Refactor suggestion config * Switch mock to 1.4.3 version * Fix empty map * Fix comments * Fix gofmt * Move package 30 July 2020, 17:28:31 UTC
c33da9d support trial meta injection in trial template rendering (#1259) * support trial meta injection in trial template rendering * use trialSpec.metadata as prefix of trialMeta reference * solve conflicts * add some comments on consts * apply gofmt * refactor * fix mock test * refine 27 July 2020, 17:24:17 UTC
9a7d43c Resume Experiment from Volume (#1275) * Resume experiment from the PV * Add comment * Remove old api comments * Change reason for Running suggestion * Fix few comments * Rename volume name like suggestion deployment * Add corev1 to const 27 July 2020, 15:32:18 UTC
27658a7 GRPC: Rename Manager to DBManager service (#1279) * Rename Manager to DBManager in gRPC * Update git ignore 25 July 2020, 01:36:16 UTC
9320b4e Add status to experiment CRD manifest (#1276) 23 July 2020, 02:29:39 UTC
ac091db Fix few API comments typos (#1274) * Fix few typos in API comments * Generate open API 20 July 2020, 01:26:50 UTC
b5465bd Add FPGA accelerated examples (#1269) * Add instructions for FPGA accelerated Experiments * XGBoost FPGA accelerated example * Ommit unnecessary quotes * Add the new example in the list of training container images * Ommit explicit declaration of the metrics collector Co-authored-by: Vaggelis Gkiastas <vaggelisgkia@hotmail.com> 16 July 2020, 21:35:03 UTC
f565047 Modify documentation for v1beta1 (#1267) * Change doc for v1beta1 * Fix 16 July 2020, 02:38:34 UTC
c199867 Add e2e test for DARTS (#1268) * Add e2e for darts * Remove todo 16 July 2020, 01:24:35 UTC
50fc911 UI: Add new ConfigMap with Trial Templates (#1265) * Add new configMap with Trial Templates * Enable to view all namespaces * Fix log 15 July 2020, 01:32:37 UTC
cbe0f40 Fix examples to run on OpenShift (#1241) 13 July 2020, 13:50:32 UTC
226c99c UI: Add Trial table pages (#1262) 10 July 2020, 01:56:36 UTC
f1393b9 UI: Delete ConfigMap with no Trial Templates (#1260) * Delete ConfigMap if there are no Trial Templates Add snack box for Templates * Not add empty ConfigMaps * Fix e2e test 10 July 2020, 01:16:35 UTC
4145c4f Fix paths in prow config (#1257) 08 July 2020, 01:29:08 UTC
42dbb56 UI: Update Material UI version to V4 (#1254) * Init commit for material UI v4 * Rebase * Add label to all selects * Remove changes from v1alpha3 07 July 2020, 01:53:57 UTC
c3b38d8 Adding retries for gRPC calls (#1248) 06 July 2020, 18:13:57 UTC
2e9d676 UI: Remove update button from Experiments view page (#1253) * Update experiments without button Move monitor components to common * Move fetch experiments to common rename job to experiment * Modify vars 06 July 2020, 17:24:55 UTC
ea1bb06 UI: Sorting for Trials information table (#1251) * Enable sort in Trials table * Remove console log * Modify cell width 03 July 2020, 01:28:47 UTC
29b797c String type for metric values (#1245) * Change metric type from float to string * Check unavailable latest objective metric in isTrialObservationAvailable * Fix e2e test * Delete consts.UnavailableMetricValue from e2e 30 June 2020, 12:38:06 UTC
f7cea41 Add hints to obtain Kubeflow and Minikube version (#1230) 30 June 2020, 02:05:58 UTC