682520d | YujiOshima | 25 December 2018, 08:55:50 UTC | fix typo Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 25 December 2018, 08:55:50 UTC |
c652502 | YujiOshima | 21 December 2018, 10:15:05 UTC | add instructions for update api files and docs Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 21 December 2018, 10:15:05 UTC |
835e7af | YujiOshima | 21 December 2018, 10:13:50 UTC | fix typo Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 21 December 2018, 10:13:50 UTC |
8cc162e | YujiOshima | 19 December 2018, 01:34:46 UTC | add api doc Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 19 December 2018, 15:14:48 UTC |
28c5b1c | Andrey Velichkevich | 16 December 2018, 15:43:49 UTC | Extend studyjob client API (#288) * Add namespace parameter to studyJob client API * Change if statement for namespace * Create func getNamespace | 16 December 2018, 15:43:49 UTC |
4be865e | ytetra | 16 December 2018, 15:43:43 UTC | fix deploy (#284) | 16 December 2018, 15:43:43 UTC |
eb4a35b | Hougang Liu | 16 December 2018, 15:34:39 UTC | update Readme (#295) A trial can now correspond to a k8s job, TFJob, or PyTorchJob, not only a k8s job. | 16 December 2018, 15:34:39 UTC |
5a7977d | Hougang Liu | 14 December 2018, 15:14:46 UTC | fix studyJob status suggestionCount mismatch error (#290) Fixes: #289 | 14 December 2018, 15:14:46 UTC |
41e8f7d | Hougang Liu | 14 December 2018, 01:18:22 UTC | fix invalid worker kind issue (#287) * fix invalid worker kind issue studyJob should go to 'Failed' status when worker kind is invalid * add PyTorchJob as valid worker job kind | 14 December 2018, 01:18:22 UTC |
33b2e58 | oshima | 13 December 2018, 20:00:04 UTC | get metricscollector by API (#292) Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 13 December 2018, 20:00:04 UTC |
f16aecc | Johnu George | 13 December 2018, 16:32:46 UTC | Support Pytorch job in Katib (#283) * Pytorch support in Katib * Adding pytorch worker kind to metrics collector * Updating Gopkg * Adding sleep * Changing the worker name * Adding gcr image | 13 December 2018, 16:32:46 UTC |
5527e34 | Johnu George | 12 December 2018, 17:01:34 UTC | Update k8s cluster version to 1.10 (#286) | 12 December 2018, 17:01:34 UTC |
67eca98 | oshima | 11 December 2018, 07:22:12 UTC | Enrich GUI (#264) * allow to create studyjob from UI Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * show success alert Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * add rbac for ui Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * fix bug Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * rebase master Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * add metrics collector manager to UI Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * fix typo Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 11 December 2018, 07:22:12 UTC |
86cddd3 | Hougang Liu | 11 December 2018, 06:46:34 UTC | update README (#281) | 11 December 2018, 06:46:34 UTC |
1c707dc | Hougang Liu | 11 December 2018, 00:30:40 UTC | fix typo error for MinikubeDemo (#282) | 11 December 2018, 00:30:40 UTC |
f8590e0 | Hougang Liu | 10 December 2018, 06:17:24 UTC | fix typo error (#280) | 10 December 2018, 06:17:24 UTC |
edf6cb5 | ytetra | 09 December 2018, 13:47:06 UTC | add e2eTest of each suggestion algorithm (#265) * random&grid * hyperband * add hyperband test * add grid case check | 09 December 2018, 13:47:06 UTC |
f4913b3 | Richard Liu | 09 December 2018, 12:52:52 UTC | Allow studyjobcontroller to delete pods (#278) | 09 December 2018, 12:52:52 UTC |
c8efb35 | Richard Liu | 07 December 2018, 16:42:11 UTC | Fix katib ui resource paths (#277) | 07 December 2018, 16:42:11 UTC |
36d8d25 | Koichiro Den | 05 December 2018, 09:12:00 UTC | Implement gRPC Health Checking Protocol + add readiness/liveness probes to vizier-core (#270) * Ensure vizier-core never gets stuck too long waiting for DB conn Signed-off-by: Koichiro Den <den@valinux.co.jp> * Add standard Health gRPC service Signed-off-by: Koichiro Den <den@valinux.co.jp> * Change db.New to return error instead of exit(1) with log.Fatal Signed-off-by: Koichiro Den <den@valinux.co.jp> * Add SelectOne() to VizierDBInterface Signed-off-by: Koichiro Den <den@valinux.co.jp> * Rename import for later convenience Signed-off-by: Koichiro Den <den@valinux.co.jp> * Implement and register Health Server for Katib manager Signed-off-by: Koichiro Den <den@valinux.co.jp> * Add readiness/liveness probes to vizier-core Signed-off-by: Koichiro Den <den@valinux.co.jp> * Update test codebase Fixes: 61ac5607353 ("Add SelectOne() to VizierDBInterface") Signed-off-by: Koichiro Den <den@valinux.co.jp> | 05 December 2018, 09:12:00 UTC |
3516dda | Richard Liu | 05 December 2018, 08:33:36 UTC | POC: Katib integration with tf-operator (#267) * TF operator part 1 * Add consts * Fix * Update worker; fix schemes * Change example * Add rbac rules * Add crd * Add sleep for debugging * Log cluster name * Remove unrelated change * use katibapi.State | 05 December 2018, 08:33:36 UTC |
55f125c | ytetra | 05 December 2018, 07:02:33 UTC | fix make timing (#271) | 05 December 2018, 07:02:33 UTC |
f863b87 | IWAMOTO Toshihiro | 05 December 2018, 05:13:30 UTC | Add Update{Study,Trial} (#269) Only tested with unit tests. Signed-off-by: IWAMOTO Toshihiro <iwamoto@valinux.co.jp> | 05 December 2018, 05:13:30 UTC |
0e3e890 | oshima | 04 December 2018, 02:57:06 UTC | add Richard Liu to OWNERS (#274) Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 04 December 2018, 02:57:06 UTC |
211c6ba | oshima | 04 December 2018, 01:58:23 UTC | fix uncompleted value in ui (#238) Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 04 December 2018, 01:58:23 UTC |
1104524 | oshima | 04 December 2018, 01:24:06 UTC | fix bayesian optimization suggestion (#251) * fix Bayesian optimization suggestion Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * add bayseopt-example Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * reset x_train in burn-in Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * validate parameters Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 04 December 2018, 01:24:06 UTC |
72a0fc0 | Koichiro Den | 30 November 2018, 12:41:56 UTC | Prevent pod restarts caused by slow db boot (#261) * Add readinessProbe for vizier-db Signed-off-by: Koichiro Den <den@valinux.co.jp> * Fix MYSQL_ROOT_PASSWORD Fixes: 67e94c7697bd ("Set MYSQL_ROOT_PASSWORD via Secret (#253)") Signed-off-by: Koichiro Den <den@valinux.co.jp> * Add simple loop to wait for DB connection successfully opened Signed-off-by: Koichiro Den <den@valinux.co.jp> | 30 November 2018, 12:41:56 UTC |
3f5462d | ytetra | 30 November 2018, 12:06:00 UTC | add UT of each suggestion algorithm (#237) * add random algorithm UT * add grid algorithm UT * add hyperband algorithm UT * fix typo * fix typo * add some tests * change various ParameterType pattern * add gengrid() test * fix significant figure | 30 November 2018, 12:06:00 UTC |
24160cb | Richard Liu | 28 November 2018, 06:52:51 UTC | Downgrade kubernetes dependency to 1.10.1 (#256) * downgrade to 1.10.1 * Delete pods * Fix job-name * Set successfulJobsHistoryLimit to 0 * Add comments | 28 November 2018, 06:52:51 UTC |
b7145b3 | Koichiro Den | 26 November 2018, 10:04:51 UTC | Fix incorrectly set namespace (#260) Commit b6f8e07d26a ("Update manifests (#246)") has just changed the namespace as a whole. This new manifest should be updated as well. Fixes: 67e94c7697b ("Set MYSQL_ROOT_PASSWORD via Secret (#253)") Signed-off-by: Koichiro Den <den@valinux.co.jp> | 26 November 2018, 10:04:51 UTC |
67e94c7 | Koichiro Den | 22 November 2018, 05:59:22 UTC | Set MYSQL_ROOT_PASSWORD via Secret (#253) * Set randomly generated MYSQL_ROOT_PASSWORD via Secret Signed-off-by: Koichiro Den <den@valinux.co.jp> * Separate manifest for MYSQL_ROOT_PASSWORD, "test" being set by default Signed-off-by: Koichiro Den <den@valinux.co.jp> * Update run-tests.sh Fixes: 5312459c28f7 ("Set randomly generated MYSQL_ROOT_PASSWORD via Secret") Signed-off-by: Koichiro Den <den@valinux.co.jp> | 22 November 2018, 05:59:22 UTC |
63dc070 | oshima | 20 November 2018, 23:57:25 UTC | update UI (#255) Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 20 November 2018, 23:57:25 UTC |
e5e2dcd | Richard Liu | 20 November 2018, 23:18:57 UTC | Refactor studyjobcontroller (#254) * Refactor studyjob controller * Refactor * Go format files * More refactor * Rename studyjobcontroller to studyjob | 20 November 2018, 23:18:57 UTC |
597064a | Andrey | 20 November 2018, 08:19:24 UTC | Change deploy.sh for Minikube example (#252) * Change deploy for Minikube Example * Change namespace to kubeflow in Minikube example * Delete lines about modeldb from deploy | 20 November 2018, 08:19:24 UTC |
206bcaa | IWAMOTO Toshihiro | 20 November 2018, 01:43:06 UTC | Add mysql based unit tests (#243) Signed-off-by: IWAMOTO Toshihiro <iwamoto@valinux.co.jp> | 20 November 2018, 01:43:06 UTC |
b6f8e07 | oshima | 19 November 2018, 04:58:32 UTC | Update manifests (#246) * change namespace katib -> kubeflow Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * change namespace of tfevent-mc | 19 November 2018, 04:58:32 UTC |
f7aff4a | Michelle Casbon | 16 November 2018, 03:05:02 UTC | Add texasmichelle as reviewer (#247) | 16 November 2018, 03:05:02 UTC |
94b138a | oshima | 16 November 2018, 01:26:56 UTC | Tf event mc (#235) * add tf-event mc Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * fix Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * add tfevent mc ci Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * add tfeventmc doc Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * add comment and use logger Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 16 November 2018, 01:26:56 UTC |
9d59a10 | IWAMOTO Toshihiro | 14 November 2018, 06:14:39 UTC | Fix typos for json and objective (#242) Signed-off-by: IWAMOTO Toshihiro <iwamoto@valinux.co.jp> | 14 November 2018, 06:14:39 UTC |
29e53b8 | Richard Liu | 13 November 2018, 02:11:21 UTC | Add richardsliu to OWNERS/reviewer (#239) * Add richardsliu to OWNERS * Add richardsliu as reviewer | 13 November 2018, 02:11:21 UTC |
a01f482 | wukong1992 | 08 November 2018, 08:55:46 UTC | add starttime and completiontime to worker (#236) | 08 November 2018, 08:55:46 UTC |
5e51974 | ytetra | 05 November 2018, 20:31:38 UTC | Fix typo (#233) * correct "purse" to "parse" * correct "Doubel" to "Double" * Update push-model.go fix lowercase * Update push-study.go use lowercase | 05 November 2018, 20:31:38 UTC |
04837a4 | IWAMOTO Toshihiro | 05 November 2018, 07:47:01 UTC | More DB unit tests (#234) * Fix EarlyStopParam and SuggestionParam DB methods GetEarlyStopParamList and GetSuggestionParamList mixed up the column order and they returned nothing. Also, SetEarlyStopParam didn't return an ID properly. Signed-off-by: IWAMOTO Toshihiro <iwamoto@valinux.co.jp> * Add more DB UTs Signed-off-by: IWAMOTO Toshihiro <iwamoto@valinux.co.jp> | 05 November 2018, 07:47:01 UTC |
8e90513 | IWAMOTO Toshihiro | 02 November 2018, 05:44:46 UTC | Fix the build script after #208 (#231) Signed-off-by: IWAMOTO Toshihiro <iwamoto@valinux.co.jp> | 02 November 2018, 05:44:46 UTC |
9f87fd8 | IWAMOTO Toshihiro | 01 November 2018, 06:00:42 UTC | Only retry an INSERT operation on unique constraint violation (#229) The retry logic is used to generate a unique ID, but if there is another error the DB code can fall into an infinite loop. Signed-off-by: IWAMOTO Toshihiro <iwamoto@valinux.co.jp> | 01 November 2018, 06:00:42 UTC |
0bc5182 | oshima | 29 October 2018, 04:20:23 UTC | New UI for Katib (#208) * add ui Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * add ui Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * update test and doc Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * fix test Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * fix test Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * fix test Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * remove modelDB Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * fix test Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * refactor Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * add loading img Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * fix test Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * Add loading image Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * refactor Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * add root redirection Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * add latestLog flag to GetWorkerFullInfo Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * fix test Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 29 October 2018, 04:20:23 UTC |
7eeea12 | ytetra | 28 October 2018, 09:46:16 UTC | fix slice range (#226) | 28 October 2018, 09:46:16 UTC |
13373d2 | IWAMOTO Toshihiro | 25 October 2018, 03:22:52 UTC | More db tests (#225) * Remove obsolete comments and an import Signed-off-by: IWAMOTO Toshihiro <iwamoto@valinux.co.jp> * Add Worker UTs Signed-off-by: IWAMOTO Toshihiro <iwamoto@valinux.co.jp> | 25 October 2018, 03:22:52 UTC |
106235b | IWAMOTO Toshihiro | 24 October 2018, 04:00:15 UTC | Fix storelogs (#222) * Fix StoreWorkerLogs The function has been storing into worker_metrics with duplicates and wrong timestamps for some time. The fix changes the worker_lastlogs DB table definition. DBs must be recreated. Signed-off-by: IWAMOTO Toshihiro <iwamoto@valinux.co.jp> * Add foreign key constraints to worker log DB tables and tidy up formatting This patch makes sure worker_* rows have a matching row in the worker table. It also changes multi-line string formatting for readability. Signed-off-by: IWAMOTO Toshihiro <iwamoto@valinux.co.jp> | 24 October 2018, 04:00:15 UTC |
4dc1aed | IWAMOTO Toshihiro | 19 October 2018, 07:34:49 UTC | Check errors in order to avoid SEGV (#219) Signed-off-by: IWAMOTO Toshihiro <iwamoto@valinux.co.jp> | 19 October 2018, 07:34:49 UTC |
1e14d3c | oshima | 17 October 2018, 06:30:02 UTC | Fix request count (#214) * fix manifest examples Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * fix MetricsCollector instance Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * eval req count after status check Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * fix cont check when ReqestCount is not set Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 17 October 2018, 06:30:02 UTC |
44bd27e | oshima | 17 October 2018, 06:04:00 UTC | enlarge max of check goal grpc (#200) Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 17 October 2018, 06:04:00 UTC |
eb12212 | oshima | 17 October 2018, 05:35:43 UTC | fix manifest examples (#213) * fix manifest examples Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * fix MetricsCollector instance Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 17 October 2018, 05:35:43 UTC |
7ad8cfc | Kazumasa Kohtaka | 15 October 2018, 17:02:23 UTC | Use camel-case instead of snake-case (#204) * Use camel-case instead of snake-case * Capitalize abbreviations in variables | 15 October 2018, 17:02:23 UTC |
1598256 | IWAMOTO Toshihiro | 15 October 2018, 17:02:13 UTC | Point to the example version of ConfigMap from MinikubeDemo.md (#202) Signed-off-by: IWAMOTO Toshihiro <iwamoto@valinux.co.jp> | 15 October 2018, 17:02:13 UTC |
1c95029 | IWAMOTO Toshihiro | 15 October 2018, 06:32:04 UTC | Fix CRD validation (#191) While CustomResourceDefinition.spec.scope defaults to Namespaced, omitting this generates a validation error. Just supply the default. Signed-off-by: IWAMOTO Toshihiro <iwamoto@valinux.co.jp> | 15 October 2018, 06:32:04 UTC |
3dce496 | Mayank Juneja | 15 October 2018, 03:42:34 UTC | Bayesian Suggestion Algorithm Fixes (#188) * update requirements.txt for bayesian * add bayesian suggestion algorithm to deploy script * separate out python proto compiler command * update PYTHONPATH * update autogenerated python protobuf and grpc code * Update run-tests.sh | 15 October 2018, 03:42:34 UTC |
cbe5fee | Hirofumi Nakagawa | 11 October 2018, 04:19:16 UTC | Fix deadlock condition in ReconcileStudyJobController#Reconcile (#201) | 11 October 2018, 04:19:16 UTC |
6609587 | wukong1992 | 10 October 2018, 05:10:25 UTC | support request count (#193) | 10 October 2018, 05:10:25 UTC |
6195461 | oshima | 10 October 2018, 03:34:21 UTC | add building metrics-collector to CI (#190) Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 10 October 2018, 03:34:21 UTC |
0bfa23b | oshima | 10 October 2018, 01:12:58 UTC | Fix CI (#194) * add -o xtrace Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * use client-cert instead password Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * delete get-credentials Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * delete unnecessary line Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 10 October 2018, 01:12:58 UTC |
b502fd5 | oshima | 04 October 2018, 08:12:54 UTC | Add Katib logo (#189) * add logo Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * fix logo size Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * fix logo size | 04 October 2018, 08:12:54 UTC |
d404ee5 | oshima | 01 October 2018, 02:47:02 UTC | fix random-example (#181) Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 01 October 2018, 02:47:02 UTC |
8332d6e | oshima | 01 October 2018, 02:46:57 UTC | fix-MinikubeDemodox (#171) Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 01 October 2018, 02:46:57 UTC |
c9028e1 | oshima | 01 October 2018, 02:41:19 UTC | Add Retain flag (#176) * update vendors Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * add retain flags to study job controller Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * fix vendor Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * fix unchange status Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * add handling for failed status Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 01 October 2018, 02:41:19 UTC |
9133042 | oshima | 01 October 2018, 00:29:24 UTC | Add pv example to katibDB (#178) * add pv to katibDB Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * fix test Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * add pv to MinikubeDemo/deploy.sh Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * update Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * fix test Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 01 October 2018, 00:29:24 UTC |
74b833a | Chu Xiangyang | 29 September 2018, 05:38:15 UTC | Update `checkStatus` return value order (#185) Move `error` to the last position. | 29 September 2018, 05:38:15 UTC |
b46378c | Jeremy Lewi | 21 September 2018, 17:35:54 UTC | Fix jsonnet so we override registry in image builds. (#177) * Fix jsonnet so we override registry in image builds. * The overrides for parameters isn't being passed through to subcommands like the image build template; as a result we don't actually override the parameters in the image step templates. * Make overrides in parts a required parameter so we don't accidentally forget to exclude it in the future. * Related to #79 releaser for Katib. * Fix prow_config.yaml | 21 September 2018, 17:35:54 UTC |
31d2e10 | Jeremy Lewi | 21 September 2018, 09:17:41 UTC | Postsubmit run should auto-push images to kubeflow-images-public (#174) * Related to #141 katib releaser * Related to kubeflow/kubeflow#1574 use prow to build our images * We are moving to using prow to run our release workflows and treating them just like regular workflows. * We are doing this because we need to get regular signal about whether the image builds are succeeding by running on postsubmit. * We also want to run them on presubmit so that we can verify any changes to the workflow don't break the workflow. * Rather than define a new workflow to build the images, we can just reuse the existing E2E workflow which already builds all the images. We just change postsubmit to push to kubeflow-images-public. * Delete the releaser app; we will just use the existing E2E test workflow and have that push to gcr.io/kubeflow-images-public on postsubmit. | 21 September 2018, 09:17:41 UTC |
81f2b74 | Mayank Juneja | 21 September 2018, 06:27:53 UTC | Add REST API using grpc gateway (#142) * dep ensure * add grpc-gateway via dep * update protobuf via dep ensure * update compiled go code, add reverse proxy * add REST entrypoint for manager * update API build script * use build script to generate code * remove binary file * update build, deploy scripts for REST API * change name * add manifests for core-rest * remove deploy * add comments * remove vendor * use Gopkg files from master * update Gopkg files * update Gopkg files * update proto files and protobufs * update build scripts and tests * copy vendor for tests * uncomment deploy * update image name * ignore vizier-core-rest for port forwarding * update build script * update manifests * Add docs for REST API * core review changes * remove service account | 21 September 2018, 06:27:53 UTC |
f4887a6 | oshima | 17 September 2018, 09:20:56 UTC | add mutex to studyjob controller (#170) * add mutex to studyjob controller Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * use sync.Map Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * update only when the instance was changed Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 17 September 2018, 09:20:56 UTC |
4085701 | oshima | 05 September 2018, 03:29:38 UTC | StudyJobController: Update worker status and fix status bug (#159) * mark complete after metrics reported Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * update worker status Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * fix save model bug Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * save models after completed Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 05 September 2018, 03:29:38 UTC |
8f85e81 | oshima | 28 August 2018, 02:54:00 UTC | refactor studyjob CRD controller (#152) * refactor studyjob CRD controller Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * fix type Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * fix test Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * update mocks Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * update deploy and build script Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * Avoid duplication of suggestion request Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * add RawTemplate for WorkerSpec Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 28 August 2018, 02:54:00 UTC |
3c0499d | oshima | 22 August 2018, 05:19:54 UTC | Delete vendor dir (#153) * delete vendor directory Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * update .gitignore Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * update tests Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * fix test Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 22 August 2018, 05:19:54 UTC |
089cd6a | Jeremy Lewi | 21 August 2018, 12:33:33 UTC | Merge pull request #141 from YujiOshima/studyctlCRD Add StudyController CRD: studycontroller.kubeflow.org Operator: StudyController Update examples. This implementation is polling workers status in go process of StudyController. Though I understand this is not an elegant implementation, this is the least impact to existing codes. Next step we should make worker CRD and its controller and support multi-type jobs (k8s, TF-Job..). | 21 August 2018, 12:33:33 UTC |
842ee42 | YujiOshima | 20 August 2018, 15:22:35 UTC | fix test Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 20 August 2018, 15:22:35 UTC |
e0bd5ee | YujiOshima | 20 August 2018, 14:41:16 UTC | allow same study name on multiple job Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 20 August 2018, 15:00:48 UTC |
bfb04ba | YujiOshima | 17 August 2018, 08:32:15 UTC | fix test Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 20 August 2018, 02:55:22 UTC |
5ea9c3b | YujiOshima | 17 August 2018, 08:15:41 UTC | WorkerSpec contain only path for template, add comment Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 20 August 2018, 02:55:22 UTC |
1eecaa5 | YujiOshima | 17 August 2018, 01:27:17 UTC | fix Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 20 August 2018, 02:55:22 UTC |
038839e | YujiOshima | 16 August 2018, 18:33:43 UTC | remove unnecessary codes Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 20 August 2018, 02:55:22 UTC |
15e6bfc | YujiOshima | 26 July 2018, 08:00:48 UTC | add StudyJobController CRD and Controller Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 20 August 2018, 02:55:18 UTC |
5429a4d | YujiOshima | 26 July 2018, 08:00:10 UTC | add autogen files for controller Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 20 August 2018, 02:53:51 UTC |
cd0d27d | YujiOshima | 26 July 2018, 07:58:42 UTC | update vendoring pkgs Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 20 August 2018, 02:53:51 UTC |
826657c | shibuiwilliam | 20 August 2018, 02:36:42 UTC | Corrected typos in hyperband example yml (#146) | 20 August 2018, 02:36:42 UTC |
791ac1a | Mayank Juneja | 24 July 2018, 08:50:25 UTC | pin mxnet/python image version (#139) | 24 July 2018, 08:50:25 UTC |
fcd1005 | Jeremy Lewi | 01 July 2018, 22:06:17 UTC | Move the GKEDemo into kubeflow/examples (#135) * The GKEDemo is using the GitHub summarization example; I think we should put all of the code for that demo kubeflow/examples (see kubeflow/examples#161) * The main code is the Katib HP controller git-issue-summarize-demo.go * We don't need the manifests for deploying katib because we can deploy Katib using the Kubeflow ksonnet package. * The code in docker-image duplicates the code in kubeflow/examples so we shouldn't need it. Related to: #116 | 01 July 2018, 22:06:17 UTC |
12a6c5f | oshima | 30 June 2018, 08:45:21 UTC | Update status of workers in GetWorkers (#127) * cmd/ Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * worker statuses are updated in GetWorkers Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * fix test Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 30 June 2018, 08:45:21 UTC |
b14b337 | Hitoshi Mitake | 29 June 2018, 06:08:18 UTC | update OWNERS (#129) | 29 June 2018, 06:08:18 UTC |
e9d2a97 | oshima | 20 June 2018, 10:28:05 UTC | Hyperband (#124) * add hyp Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * fix hyperband suggestion Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * add test and docs * fix test Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * fix test Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 20 June 2018, 10:28:05 UTC |
25d1f54 | oshima | 20 June 2018, 03:58:05 UTC | fix doc link and kubectl port-forward command (#120) Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 20 June 2018, 03:58:05 UTC |
249152f | Shintaro Murakami | 20 June 2018, 03:39:06 UTC | Fix typo (#123) | 20 June 2018, 03:39:06 UTC |
8f22850 | Vinay Kakade | 19 June 2018, 11:45:31 UTC | Fix indent to spaces (#121) | 19 June 2018, 11:45:31 UTC |
c0801d5 | oshima | 13 June 2018, 10:59:28 UTC | add releasing workflow (#113) Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 13 June 2018, 10:59:28 UTC |
14963b0 | oshima | 13 June 2018, 10:41:28 UTC | API: Add WorkerStatus to GetMetrics and remove unused items (#110) * update API Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * add status to GetMetrics and delete unused item in API Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 13 June 2018, 10:41:28 UTC |
aa1d592 | oshima | 13 June 2018, 10:22:28 UTC | Add e2e test (#114) * add e2e test Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * fix Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * fix Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * split cli from manager build Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * fix Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * fix Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * print pod status Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * print deploy svc Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * fix Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * use pod for port-forward Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * fix Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * fix Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * wait vizier-core Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * fix Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 13 June 2018, 10:22:28 UTC |
6a0d368 | oshima | 09 June 2018, 08:07:01 UTC | use kubectl port-forward in demos (#111) Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 09 June 2018, 08:07:01 UTC |
3417891 | Shintaro Murakami | 07 June 2018, 05:46:23 UTC | docs: Fix wrong command (#108) | 07 June 2018, 05:46:23 UTC |
0944c53 | Vinay Kakade | 06 June 2018, 14:39:25 UTC | Remove dlk (#107) | 06 June 2018, 14:39:25 UTC |
838943a | Ce Gao | 05 June 2018, 07:38:53 UTC | docs: Generate CLI documentation (#105) Signed-off-by: Ce Gao <gaoce@caicloud.io> | 05 June 2018, 07:38:53 UTC |