97720f6 | John | 15 April 2019, 13:02:24 UTC | added missing files | 15 April 2019, 13:02:24 UTC |
f08ac3f | John | 15 April 2019, 10:48:54 UTC | excluded v1alpha2 unit tests | 15 April 2019, 10:48:54 UTC |
ac68cb5 | John | 12 April 2019, 16:29:24 UTC | added more v1alpha2 | 12 April 2019, 16:29:24 UTC |
9ed1cfa | John | 12 April 2019, 14:52:05 UTC | fixed location of files in new v1alpha2 folders | 12 April 2019, 14:52:05 UTC |
d0d9eac | John | 12 April 2019, 13:53:41 UTC | Added back missing files | 12 April 2019, 13:53:41 UTC |
c573783 | John | 12 April 2019, 12:54:47 UTC | merged master into PR | 12 April 2019, 12:54:47 UTC |
da6a781 | John | 12 April 2019, 11:54:57 UTC | moved suggestion dockerfiles back to original location | 12 April 2019, 11:54:57 UTC |
6204306 | John | 12 April 2019, 11:47:14 UTC | split test workflows into two version | 12 April 2019, 11:47:14 UTC |
1fe2899 | John | 12 April 2019, 11:45:09 UTC | split cmd/suggestion into two versions | 12 April 2019, 11:45:09 UTC |
a955cee | John | 11 April 2019, 18:25:13 UTC | added e2e tests to v1alpha2 | 11 April 2019, 18:25:13 UTC |
ca7e6e6 | John | 11 April 2019, 18:23:02 UTC | added scripts to v1alpha2 | 11 April 2019, 18:23:02 UTC |
7f0b052 | John | 11 April 2019, 18:21:28 UTC | added manifests to v1alpha2 | 11 April 2019, 18:21:28 UTC |
b568b47 | John | 11 April 2019, 18:18:52 UTC | added test scripts to v1alpha2 | 11 April 2019, 18:18:52 UTC |
88265a8 | John | 11 April 2019, 18:13:22 UTC | Merge branch 'python_migration' of https://github.com/jdplatt/katib into python_migration | 11 April 2019, 18:13:22 UTC |
bee9510 | John | 11 April 2019, 18:13:03 UTC | split pkg/suggestion into two versions | 11 April 2019, 18:13:03 UTC |
33e8e30 | Alexandra Johnson | 10 April 2019, 20:00:09 UTC | Update REAME example links for v1alpha1 (#452) * Update REAME example links for v1alpha1 * pkg/api/api.proto -> pkg/api/v1alpha1/api.proto * pkg/api/gen-doc/api.md -> pkg/api/v1alpha1/gen-doc/api.md * Links in pkg/api/README.md need to be doubled up for alpha1 and alpha2 * manifests -> manifests/v1alpha1 * Fix another examples link * No more relative link * rename header * update scripts link | 10 April 2019, 20:00:09 UTC |
05569bc | Hougang Liu | 10 April 2019, 08:46:15 UTC | fix py client import error (#453) | 10 April 2019, 08:46:15 UTC |
04b3051 | Hougang Liu | 03 April 2019, 08:53:43 UTC | ClusterRoleBinding doesn't need namespace field (#451) | 03 April 2019, 08:53:43 UTC |
7ef5594 | Andrey Velichkevich | 03 April 2019, 07:35:42 UTC | Update API for NAS in v1alpha2 (#450) * Update API for NAS in v1alpha2 * Fix name * Fix name in input size | 03 April 2019, 07:35:42 UTC |
b25422a | Johnu George | 02 April 2019, 21:25:19 UTC | Restructuring test scripts for v1alpha1 and v1alpha2 (#449) * Restructing test scripts for v1alpha1 and v1alpha2 * Fix package location | 02 April 2019, 21:25:19 UTC |
3d4cd04 | Johnu George | 01 April 2019, 21:56:33 UTC | Code restructuring to support V1alpha1 and V1alpha2 API (#448) * Code restructuring to support V1alpha1 and V1alpha2 API * Adding comments * Test package changes * Moving requirements file * Fix the package location * Renaming studyjobcontroller to katib-controller | 01 April 2019, 21:56:33 UTC |
255bf37 | jdplatt | 01 April 2019, 11:17:06 UTC | Merge branch 'master' into python_migration | 01 April 2019, 11:17:06 UTC |
3c77161 | John | 01 April 2019, 08:46:09 UTC | fixed path in dockerfile | 01 April 2019, 08:46:09 UTC |
a7bd618 | John | 01 April 2019, 03:38:33 UTC | updated build/deploy scripts | 01 April 2019, 03:38:33 UTC |
33cab49 | John | 01 April 2019, 03:32:42 UTC | changed names in run-tests | 01 April 2019, 03:32:42 UTC |
81ec25f | John | 01 April 2019, 02:25:44 UTC | fixed python tests | 01 April 2019, 02:25:44 UTC |
6b9d8ff | John | 31 March 2019, 21:20:18 UTC | changed name in dockerfile | 31 March 2019, 21:20:18 UTC |
a1ad349 | John | 31 March 2019, 20:51:47 UTC | fixed hyphen typo | 31 March 2019, 20:51:47 UTC |
e023ab1 | John | 31 March 2019, 16:59:25 UTC | removed bayesianoptimization names | 31 March 2019, 16:59:25 UTC |
4ab3dbd | Johnu George | 29 March 2019, 18:26:14 UTC | Fix labels matching the job operator implementation (#447) | 29 March 2019, 18:26:14 UTC |
de7323c | Johnu George | 29 March 2019, 04:50:12 UTC | Updating the pytorch example image (#446) | 29 March 2019, 04:50:12 UTC |
21855a1 | Shintaro Murakami | 29 March 2019, 02:02:12 UTC | Remove redundant lock (#444) | 29 March 2019, 02:02:12 UTC |
1316bad | oshima | 26 March 2019, 17:07:21 UTC | add v1alpha2 grpc api (#427) * add v1alpha2 grpc api Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * update gRPC API Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * add v1alpha2 DB IF Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * fix typo, add doc and add todo for nasconfig Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * apply comments Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * fix typo Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * update proto Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * update Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> | 26 March 2019, 17:07:21 UTC |
e260258 | jdplatt | 20 March 2019, 21:32:58 UTC | Remove katibcli (#436) * removed cmd folder for CLI * removed docs folder for CLI * final cleanup of cli removal * removed cli from build script * added go packages into unit-test script | 20 March 2019, 21:32:58 UTC |
68aa24e | John | 20 March 2019, 14:05:50 UTC | Merge branch 'python_migration' of https://github.com/jdplatt/katib into python_migration | 20 March 2019, 14:05:50 UTC |
5970d5f | John | 20 March 2019, 14:04:21 UTC | updated grid and random images in e2e tests | 20 March 2019, 14:04:21 UTC |
2aad698 | jdplatt | 20 March 2019, 03:16:23 UTC | Update suggestion-config-hyb.yml | 20 March 2019, 03:16:23 UTC |
b479bd2 | John | 20 March 2019, 03:13:30 UTC | removed logging statements | 20 March 2019, 03:13:30 UTC |
5f48775 | John | 20 March 2019, 03:10:45 UTC | fixed tests | 20 March 2019, 03:10:45 UTC |
fbf0908 | John | 19 March 2019, 23:28:45 UTC | working on test client | 19 March 2019, 23:28:45 UTC |
a99fa58 | John Platt | 18 March 2019, 21:24:15 UTC | fixed setup files | 18 March 2019, 21:24:15 UTC |
7e03787 | John Platt | 18 March 2019, 18:01:35 UTC | renamed packages and split into different setup files | 18 March 2019, 18:01:35 UTC |
a96c4e2 | John Platt | 18 March 2019, 15:01:04 UTC | added random search go code back to fix hyperband | 18 March 2019, 15:01:04 UTC |
5195b19 | John Platt | 18 March 2019, 13:59:36 UTC | removed test code for old images | 18 March 2019, 13:59:36 UTC |
28eb81d | Shintaro Murakami | 18 March 2019, 00:57:10 UTC | Change datadir for avoid failure due to lost+found (#432) | 18 March 2019, 00:57:10 UTC |
39ad7e2 | John Platt | 16 March 2019, 19:58:40 UTC | removed old files | 16 March 2019, 19:58:40 UTC |
7e6936e | John Platt | 16 March 2019, 19:56:42 UTC | reverted changes for local development | 16 March 2019, 19:56:42 UTC |
73fddfc | John Platt | 16 March 2019, 19:50:09 UTC | Merge branch 'master' into python_migration | 16 March 2019, 19:50:09 UTC |
44e804f | John Platt | 16 March 2019, 19:49:29 UTC | Merge branch 'master' of https://github.com/kubeflow/katib | 16 March 2019, 19:49:29 UTC |
ddd0af3 | John Platt | 16 March 2019, 15:59:11 UTC | moved parsing inside algorithms | 16 March 2019, 15:59:11 UTC |
a417908 | John Platt | 16 March 2019, 14:34:23 UTC | updated manifests and dockerfiles | 16 March 2019, 14:34:23 UTC |
a95a6cb | John Platt | 16 March 2019, 14:34:10 UTC | added grid search code | 16 March 2019, 14:34:10 UTC |
acc3376 | John Platt | 16 March 2019, 12:59:43 UTC | fixed minor issues | 16 March 2019, 12:59:43 UTC |
887f356 | Julian Qian | 15 March 2019, 23:42:57 UTC | fix demo link (#434) * fix demo link change to correct link README.md * The link should say README.md as well. The link should say README.md as well. | 15 March 2019, 23:42:57 UTC |
06f955b | Jinan Zhou | 14 March 2019, 01:58:22 UTC | Add fault tolerance support for trial failure (#424) * add fault tolerance for trial failure * fix a small typo * fix a typo * improve fault processing strategy * add an important TODO * fix typo * add some more TODOs | 14 March 2019, 01:58:22 UTC |
c87d583 | jdplatt | 11 March 2019, 19:54:38 UTC | Test for Bayesian Optimization Algo (#406) * added tests for acquisition function and models * added tests for global_optimizer * added tests for boa * minor linting * tests for algorithm manager * added discrete parameter to study config * covered all parameter types * moved python script to testing folder * added python tests to unit tests * remembered to uncomment existing tests * fixed path to test script * moved python tests to separate job in workflow * added run command to test script | 11 March 2019, 19:54:38 UTC |
61451ef | Richard Liu | 08 March 2019, 02:23:33 UTC | Katib v1alpha2 API for CRDs (#381) * v1alpha2 API proposal * Fix comments round 1 * Refactor into Experiment and Trial * Incorporate feedback from meeting * Rename * Minor edits | 08 March 2019, 02:23:33 UTC |
57dd5c5 | John Platt | 07 March 2019, 14:09:53 UTC | added run command to test script | 07 March 2019, 14:09:53 UTC |
9f75f05 | John Platt | 07 March 2019, 13:24:57 UTC | moved python tests to separate job in workflow | 07 March 2019, 13:24:57 UTC |
0217ace | John Platt | 07 March 2019, 12:31:28 UTC | fixed path to test script | 07 March 2019, 12:31:28 UTC |
936708c | John Platt | 06 March 2019, 14:32:00 UTC | made fetching observations optional | 06 March 2019, 14:32:00 UTC |
86bd27a | Andrey Velichkevich | 06 March 2019, 05:27:59 UTC | Add NAS team as reviewers (#419) * Add NAS team in reviewers * Update reviewers | 06 March 2019, 05:27:59 UTC |
feee2f9 | Jinan Zhou | 06 March 2019, 01:48:01 UTC | Multiple Trials for Reinforcement Learning Suggestion (#416) * supoort multiple trials * adjust To Do * language improvement in README.md * fix several problems * fix a potential problem * handle the GetEvaluationResult() return None problem | 06 March 2019, 01:48:01 UTC |
3a705a1 | Jinan Zhou | 06 March 2019, 00:40:03 UTC | Fix the package version in training container (#418) * fix the version of tf and keras * fix a typo | 06 March 2019, 00:40:03 UTC |
8f89ad4 | Andrey Velichkevich | 05 March 2019, 23:47:59 UTC | Add validation for NAS job in Katib controller (#398) * Initial commit * Add validation for NAS config * Fix validation * Add algorithmType in NasConfig validation * Add Discrete ParameterType to validation * Move validation to webhook Change GetJobType function Make a list with NAS algorithms * Add ValidateSuggestionParameters function in Katib API * Fix api * Add ValidateSuggestionParameters to Suggestion service * Change isValid to int32 * Create Validation function in NAS RL Suggestion service * Fix small problems * Reduce code inside Validation function * Add empty ValidateSuggestionParameters function in each HP service written in GO * Fix logging * Add ValidateSuggestionParameters to mock * Handle Unvailable error | 05 March 2019, 23:47:59 UTC |
a393d4b | John Platt | 05 March 2019, 21:48:29 UTC | updated manifests | 05 March 2019, 21:48:29 UTC |
a338fb2 | John Platt | 05 March 2019, 20:32:34 UTC | expanded parameter object and streamlined tests | 05 March 2019, 20:32:34 UTC |
b62c941 | John Platt | 05 March 2019, 16:34:58 UTC | Merge branch 'master' into python_migration | 05 March 2019, 16:34:58 UTC |
f4ae52d | John Platt | 05 March 2019, 15:39:00 UTC | remembered to uncomment existing tests | 05 March 2019, 15:39:00 UTC |
41d19db | John Platt | 05 March 2019, 15:34:19 UTC | added python tests to unit tests | 05 March 2019, 15:34:19 UTC |
89fcf56 | John Platt | 03 March 2019, 23:52:44 UTC | added random search | 04 March 2019, 00:27:08 UTC |
1c3401b | John Platt | 03 March 2019, 23:27:48 UTC | started testing service | 03 March 2019, 23:27:48 UTC |
dbc8b30 | John Platt | 03 March 2019, 17:37:57 UTC | pushed parameter validation and defaults inside algorithm | 03 March 2019, 17:37:57 UTC |
429c0c4 | John Platt | 02 March 2019, 01:22:14 UTC | broke out parsing tests to cover individual functions | 02 March 2019, 01:39:31 UTC |
d825a46 | John Platt | 02 March 2019, 00:56:31 UTC | eliminated algorithm_manager.py | 02 March 2019, 00:56:31 UTC |
abe8e6e | John Platt | 01 March 2019, 21:43:37 UTC | converted methods called in init for algorithm manager into functions | 01 March 2019, 21:43:37 UTC |
8620509 | John Platt | 01 March 2019, 19:45:08 UTC | linted bayesian_service.py | 01 March 2019, 20:33:22 UTC |
76eb49f | jdplatt | 01 March 2019, 14:31:35 UTC | Merge remote-tracking branch 'upstream/master' | 01 March 2019, 14:31:35 UTC |
db6b83b | Andrey Velichkevich | 01 March 2019, 02:54:21 UTC | Fix path to api protobuf (#415) | 01 March 2019, 02:54:21 UTC |
4d8c599 | Jinan Zhou | 27 February 2019, 03:05:45 UTC | Add support for parallel studyjobs (#404) * Add support for parallel studyjobs * fix a typo * Reorganize the program a little bit * fix a typo * fix a typo | 27 February 2019, 03:05:45 UTC |
87a31f3 | Jinan Zhou | 27 February 2019, 01:33:48 UTC | Add separable/depthwise convolution, data augmentation and multiple GPU support (#393) * add separable/depthwise convolution in operation library * add ENAS example StudyJob yaml * remove ENAS example, add data augmentation, add multiple GPU support | 27 February 2019, 01:33:48 UTC |
4d031e7 | Andrey Velichkevich | 27 February 2019, 00:32:45 UTC | Add create time to Trial API (#410) * Add create time to Trial API * Add Trial create time information * Fix UT for db | 27 February 2019, 00:32:45 UTC |
f5a3860 | John Platt | 21 February 2019, 17:24:20 UTC | moved python script to testing folder | 26 February 2019, 20:13:20 UTC |
afe1874 | John Platt | 21 February 2019, 17:17:04 UTC | covered all parameter types | 26 February 2019, 20:13:20 UTC |
9dd5e6a | John Platt | 21 February 2019, 17:07:10 UTC | added discrete parameter to study config | 26 February 2019, 20:13:20 UTC |
26f9106 | John Platt | 21 February 2019, 16:50:14 UTC | tests for algorithm manager | 26 February 2019, 20:13:20 UTC |
3e745ab | John Platt | 20 February 2019, 19:05:47 UTC | minor linting | 26 February 2019, 20:13:20 UTC |
eeb76e5 | John Platt | 20 February 2019, 16:24:12 UTC | added tests for boa | 26 February 2019, 20:13:19 UTC |
dd3d563 | John Platt | 20 February 2019, 15:36:00 UTC | added tests for global_optimizer | 26 February 2019, 20:13:19 UTC |
75e9886 | John Platt | 19 February 2019, 21:26:20 UTC | added tests for acquisition function and models | 26 February 2019, 20:13:19 UTC |
26da3ea | Johnu George | 26 February 2019, 04:32:34 UTC | Metric collector must fail on error (#405) * Fail when unable to collect logs * Set backlimit to 0 for jobs | 26 February 2019, 04:32:34 UTC |
6b75138 | Hougang Liu | 25 February 2019, 17:37:16 UTC | add latest tag for katib images (#409) | 25 February 2019, 17:37:16 UTC |
46d2dc7 | Hougang Liu | 22 February 2019, 02:35:03 UTC | add build and test for suggestion nasrl (#401) | 22 February 2019, 02:35:03 UTC |
d6a67ea | Akado2009 | 21 February 2019, 01:37:09 UTC | Database APIs for NAS updated (#394) * FINAL PUSH * FIX TESTS * new lock * new lock * small fi * DELET SPACE * deleted ununsed function | 21 February 2019, 01:37:09 UTC |
3bb8b54 | Jinan Zhou | 21 February 2019, 00:53:59 UTC | Suggestion for Neural Architecture Search with Reinforcement Learning (#339) * Suggestion for Neural Architecture Search with Reinforcement Learning * Add NAS RL Suggestion * Fix new line * set json format for GetSuggestion() * finish trial return in GetSuggestion(), finish GetEvaluationHistory, and fix bugs * fix a bug in GetEvaluationResult() * fix bigs in GetEvaluationResult * fix an error in GetEvaluatinResult * Add python Katib api * Remove unnecessary requirements * add about for suggestion * rename to README * Add picture explanations; make the printouts more organized * fix typos * fix some small problems * Fix several problems * Fix a typo * fix some problems * small fixes * Suggestion do not need to handle uncompleted trials * fix a small problem | 21 February 2019, 00:53:59 UTC |
5a1a791 | Hougang Liu | 20 February 2019, 17:40:23 UTC | add validating webhook for studyJob (#383) * add validating webhook for studyJob If create/update a studyJob with bad CR manifest or invalid configuration, k8s api server will reject the request. Fixes: #314 * add test * allow check "kubectl" error code | 20 February 2019, 17:40:23 UTC |
8a89b9e | Johnu George | 20 February 2019, 06:19:50 UTC | Removing Operator specific handling during a StudyJob run (#387) * Removing Operator specific handling during a StudyJob run * Return empty in error | 20 February 2019, 06:19:50 UTC |
edecd39 | Andrey Velichkevich | 20 February 2019, 00:41:30 UTC | Delete modeldb from unit tests (#391) * Delete modeldb from unit tests * Add library to interface test | 20 February 2019, 00:41:30 UTC |
c0f2f07 | Hougang Liu | 19 February 2019, 03:21:42 UTC | show studyjob condition when run kubectl get (#389) | 19 February 2019, 03:21:42 UTC |
ee62c33 | Jinan Zhou | 15 February 2019, 02:23:48 UTC | Training Container with Model Constructor for cifar10 (#345) * Training Container with Model Constructor for cifar10 * fix a small bug * make num_epochs a parameter | 15 February 2019, 02:23:48 UTC |