https://github.com/kubeflow/katib

Revision Message Commit Date
715e027 fix: Remove info in script Signed-off-by: Ce Gao <gaoce@caicloud.io> 20 May 2019, 05:52:17 UTC
49fa1ac fix: Fix path bug Signed-off-by: Ce Gao <gaoce@caicloud.io> 20 May 2019, 05:36:31 UTC
7c3bcb9 fix: Show yaml Signed-off-by: Ce Gao <gaoce@caicloud.io> 20 May 2019, 04:13:29 UTC
00c78b2 fix: Show info in runtests Signed-off-by: Ce Gao <gaoce@caicloud.io> 20 May 2019, 02:36:20 UTC
88c2c71 fix: Fix dep problem Signed-off-by: Ce Gao <gaoce@caicloud.io> 20 May 2019, 02:25:58 UTC
ece5dc8 fix: Use v1alpha1 dockerfile Signed-off-by: Ce Gao <gaoce@caicloud.io> 20 May 2019, 02:25:58 UTC
d7e7d65 fix: Address comments Signed-off-by: Ce Gao <gaoce@caicloud.io> 20 May 2019, 02:25:58 UTC
5e89329 fix: Move requirements Signed-off-by: Ce Gao <gaoce@caicloud.io> 20 May 2019, 02:25:58 UTC
f6ece0e chore: Refactor NAS Signed-off-by: Ce Gao <gaoce@caicloud.io> 20 May 2019, 02:25:58 UTC
bc343ff feat(trial): Refactor trial controller and add basic test cases (#528) * feat(trial): Add test * vendor: Update Signed-off-by: Ce Gao <gaoce@caicloud.io> * feat(trial): Add test cases Signed-off-by: Ce Gao <gaoce@caicloud.io> * chore: Add mockgen and test crd Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Update test Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Update Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix export path Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Introduce the dependency Signed-off-by: Ce Gao <gaoce@caicloud.io> * feat: Add status handler Signed-off-by: Ce Gao <gaoce@caicloud.io> * Fix: Use 1.0.7 Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix v1alpha1 Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Update deploy scripts Signed-off-by: Ce Gao <gaoce@caicloud.io> * chore: Rebase master Signed-off-by: Ce Gao <gaoce@caicloud.io> 20 May 2019, 02:09:11 UTC
0712177 Add status for experiment and trial in additionalPrinterColumns (#555) 20 May 2019, 01:41:11 UTC
c8ae73e Fix default metricsController wrong args (#550) * Fix default metricsController wrong args * Add command for metricsConllector deployment 19 May 2019, 23:41:09 UTC
32c1bd2 Add NAS RL yaml example (#554) 18 May 2019, 03:32:07 UTC
f5babef Fix trial requestNumber error (#553) 17 May 2019, 21:40:13 UTC
bf3aeff Adding tests for random suggestion algorithm (#552) 17 May 2019, 15:14:12 UTC
215e3d7 Adding minor styles changes (#548) 17 May 2019, 12:10:04 UTC
ee37ba7 Fix nil pointer error when create trial (#547) 17 May 2019, 10:44:05 UTC
ad8fe2b Used klog for katib. (#526) 17 May 2019, 10:16:05 UTC
d22f8e6 Implement GetSuggestions for general suggestion (#546) 17 May 2019, 09:46:06 UTC
330c239 fix: Fix the conflicts (#545) Signed-off-by: Ce Gao <gaoce@caicloud.io> 17 May 2019, 09:12:04 UTC
c6f34ea Earlystopping changes trigger CI based on version (#544) * Ui changes trigger CI based on version * Make earlystop version in prow config 17 May 2019, 08:34:08 UTC
8a4b6e3 Adding manifests for manager rest (#542) * Adding manifests for manager rest * Fix to add new image * Exclude katib-manager-rest in grep 17 May 2019, 08:28:10 UTC
4ec45bb Katib UI for v1alpha2 (#486) * Init commit * Half of backend edit * Backend for v1alpha2 UI is done * Fix gopkg.toml * fix gopkg lock * Almost submit HP job by params done * Modify gitignore * Add by parameters for HP and NAS done * HP and NAS jobs functionality done * Template is done * Add new filters in job list Create a new build with frontend * Fix newlines * Review changes * Add tests for katib-ui * Delete pkg * Add pkg * Fix backend 17 May 2019, 07:12:04 UTC
bd39447 Enable suggestion-random image build and push in CI (#543) 17 May 2019, 06:50:06 UTC
41152dd fix(status): Define status handler (#518) Signed-off-by: Ce Gao <gaoce@caicloud.io> 17 May 2019, 06:00:07 UTC
b2d7bbf Include suggestion manager based on version in prow (#541) 17 May 2019, 05:30:07 UTC
a0aab3a Adding random suggestion algorithm implementation and manifests (#540) * Adding random algorithm implementation and manifests * Removing test file 17 May 2019, 05:22:06 UTC
04b0d91 fix: Add test cases for validator and manifest generator (#508) * dep: Update Signed-off-by: Ce Gao <gaoce@caicloud.io> * refactor: Simplify to test Signed-off-by: Ce Gao <gaoce@caicloud.io> * config: Add Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix test cases Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Update Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Add more test cases Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Update Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix test cases Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix the rebase bug Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Address comments Signed-off-by: Ce Gao <gaoce@caicloud.io> * experiment: Fix Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Install kubebuilder Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix unit test Signed-off-by: Ce Gao <gaoce@caicloud.io> 17 May 2019, 03:17:27 UTC
928c66a Update trial status DB operation (#537) 17 May 2019, 02:57:26 UTC
eb7235e Add labels for TFjob and PytorchJob (#539) 17 May 2019, 02:29:26 UTC
9b9f6dd v1alpha2 TFEvent metrics collector (#538) * TFEvent collector * TFjob example * TFjob example * Build metrics collector images on pre and postsubmits 17 May 2019, 00:56:27 UTC
a8086c0 Register Trial in DB (#530) * Register Trial in DB * Fix errors * Change Spec and Status for Trial and Experiment * Fix unit test * Fix condition in Register Experiment and Trial * Fix ut in manager * Fix Status in Experiment Config * Fix Experiment Spec in py test * Add trial status * Fix ut with trial status 16 May 2019, 23:14:28 UTC
287e503 Restructuring docker files to build images per version (#527) * Restructuring docker files to build image per version * binary renamed * Fixing prow include directory paths * binary rename * fixing names * Fix paths * Fixing python paths in v1alpha1 * Adding manifests change * test path fix 16 May 2019, 17:19:07 UTC
e4891e4 Dep ensure to sync up vendor. (#535) * Enable remove un-used package. * Run dep ensure. 16 May 2019, 11:05:46 UTC
1f90657 fix: Avoid side effect (#532) * fix: Avoid side effect Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Remove Signed-off-by: Ce Gao <gaoce@caicloud.io> 16 May 2019, 10:25:41 UTC
cb6113e Include vendor dir instead of Gopkg in prow config (#536) 16 May 2019, 06:21:41 UTC
ed44eb0 Update experiment status DB operation (#534) * Update experiment status DB operation * gofmt the changes 16 May 2019, 05:41:39 UTC
2d772d7 feat(api): Add total number of trials (#501) Signed-off-by: Ce Gao <gaoce@caicloud.io> 16 May 2019, 02:15:44 UTC
3cbb565 Fix wrong error-handling logic in db interface (#529) 16 May 2019, 00:57:42 UTC
f626d4b chore: Remove dep ensure in CI (#525) * chore: Move unit test before image building in v1alpha2 Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Move python suggestions build to dep Signed-off-by: Ce Gao <gaoce@caicloud.io> * chore: Move more layers to dep Signed-off-by: Ce Gao <gaoce@caicloud.io> * chore: Add dep and remove depensure in CI Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Run setup cluster later Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Address comments Signed-off-by: Ce Gao <gaoce@caicloud.io> 15 May 2019, 14:40:26 UTC
cbd84c6 Delete experiment in DB if in need (#519) 15 May 2019, 10:50:23 UTC
cb559ea Support for Custom Job resources (#512) * Support for custom resources * Adding logs * minor import renaming * Rebase changes 15 May 2019, 07:44:21 UTC
3d1a8ca Fix ut test and enable ut-test of v1alpha2 (#524) 15 May 2019, 06:36:18 UTC
ccdf2b9 godep: Remove useless dep (#521) * fix: Add gopkg Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Add gopkg to postsubmit Signed-off-by: Ce Gao <gaoce@caicloud.io> 15 May 2019, 03:18:17 UTC
e1d69aa Fix prow to trigger corresponding workflow (#520) * Fix prow to trigger corresponding workflow * fix studyjob subdir 15 May 2019, 02:20:19 UTC
e6ddcb8 create experiment in db (#509) * create experiment in db * rename katibClient to katibManagerClient 15 May 2019, 00:22:19 UTC
5478db4 refactor(suggestion): Use interface (#502) * refactor(suggestion): Use interface Signed-off-by: Ce Gao <gaoce@caicloud.io> * feat(cmd): Add flags to control the fake logic Signed-off-by: Ce Gao <gaoce@caicloud.io> * gopkg: Update Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Revert package names Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Use string Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Address comments Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Address comment Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix bug Signed-off-by: Ce Gao <gaoce@caicloud.io> 14 May 2019, 14:47:16 UTC
b5d3cc1 feat(CI): Run different flow according to version (#516) * feat(CI): Run different flow according to version Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Add dockerfile Signed-off-by: Ce Gao <gaoce@caicloud.io> 14 May 2019, 06:31:08 UTC
0ba24da Added PR and Issue template. (#505) 13 May 2019, 04:16:09 UTC
dee0e48 Enabled verbose logging for dev guide. (#504) This can help dev to check more detail for what is wrong with `dep ensure` if there are some errors. 13 May 2019, 03:38:09 UTC
9ebbc79 v1alpha2 metrics collector - controller (#496) 10 May 2019, 06:31:57 UTC
7cadd62 Update util for experiment in v1alpha2 (#485) * Update util for experiment * Change GetLastConditionType func * Add empty condition check 09 May 2019, 23:17:59 UTC
41143e8 add common package (#491) 09 May 2019, 06:54:50 UTC
a38806c Add metrics collector spec and objective spec to Trial (#489) * Add metrics collector spec to Trial spec * Fix e2e test * Move ObjectiveSpec definition to Trial CRD * Move common types * Move common types to its own package * Add metrics collector spec to DB 09 May 2019, 05:22:47 UTC
b085eff Prune katib OWNERS file (#490) 09 May 2019, 01:42:47 UTC
b76c9c1 Training container for NAS Envelopenet (#429) * Training container for NAS Envelopenet * Changes after review * README added 09 May 2019, 00:54:47 UTC
fd3b180 NAS Envelopenet Suggestion and Job Example (#425) * NAS Envelopenet Suggestion and Job Example * NAS Envelopenet Suggestion and Job Example * Typo correction in nasjob * Readme * Readme added * Link to paper 08 May 2019, 23:40:48 UTC
6853960 V1alpha2 Metrics collector (part 1) (#484) * Add metrics collector parser * Metrics collector implementation * Add metrics controller configmap * Metrics collector script and rbac * rename tmpValues * Fix for comments * Fix comments 08 May 2019, 04:10:33 UTC
a586bca enable test for katib-manager (#478) * enable test for katib-manager * add pv/pvc for v1alpha2 test * install dependency of test client * pip install by requirement file 08 May 2019, 00:18:36 UTC
6d95830 Remove outdated TODOs in README.md (#468) 07 May 2019, 14:49:52 UTC
823fa9f Get experiment config from the instance (#474) * Get experiment config from instance * Add parsing * Move getExperiment to util * Change objectmeta.name to name 07 May 2019, 00:33:40 UTC
df67741 Fix KatibClient name (#483) 03 May 2019, 01:36:18 UTC
709d97c Add Katib Client in v1alpha2 (#480) * Init commit * Add Katib Client * Add GetConfigMap func Move templates const * Change folder for Katib client * Delete old client * Change name for default templates 01 May 2019, 22:14:18 UTC
fd4c21c Add metrics collector spec to v1alpha2 API (#481) * Add metrics collector spec to v1alpha2 API * Delete metricsCollectorType * Fix * Fix unit test 01 May 2019, 03:26:55 UTC
70c3ccd vizier-core does not need any role (#482) 30 April 2019, 08:15:40 UTC
c93eb1f katib manager db error (#476) * katib manager db error condition is keyword of mysql, we need escape it in sql * fix test case error * use status to replace condition as column name 29 April 2019, 13:42:22 UTC
2b55c69 share one grpc-health-probe (#477) 29 April 2019, 04:32:19 UTC
6f5c5c7 validation and mutating webhook for experiment (#473) * validation and mutating webhook for experiment * add test for webhook * use controller-runtime client instead of client-go * use existing objectivetype const * fillback default TrialTemplate * validate if record for the new experiment exists in DB 27 April 2019, 23:22:20 UTC
78a4563 enable test for v1alpha2 (#465) * enable test for v1alpha2 * add KATIB-CORE-NAMESPACE env for controller * update example filed * share same image for two version controller * add status in subresources of crd 26 April 2019, 06:04:26 UTC
bc57a6d Add serviceAccountName in UI deployment (#469) 25 April 2019, 02:22:46 UTC
728b37b chore: Skip test when code is not changed (#467) Signed-off-by: Ce Gao <gaoce@caicloud.io> 24 April 2019, 23:38:45 UTC
5a7a144 Adding initial v1alpha2 API controller (#457) * Adding initial v1alpha2 controller * Adding logs * Adding comments * Adding template functions for experiment * Adding error checks 23 April 2019, 23:22:00 UTC
b886768 v1alpha2 api server implementation (#456) * add v1-alpha2 api server implementation Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * add test Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * add filter argument to GetTrialList Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * rename filter to filter_by_name Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * revert filter_by_name to filter Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> 19 April 2019, 03:25:55 UTC
f5c59f6 fix(readme): Merge image directory (#455) Signed-off-by: Ce Gao <gaoce@caicloud.io> 15 April 2019, 08:12:03 UTC
33e8e30 Update REAME example links for v1alpha1 (#452) * Update REAME example links for v1alpha1 * pkg/api/api.proto -> pkg/api/v1alpha1/api.proto * pkg/api/gen-doc/api.md -> pkg/api/v1alpha1/gen-doc/api.md * Links in pkg/api/README.md need to be doubled up for alpha1 and alpha2 * manifests -> manifests/v1alpha1 * Fix another examples link * No more relative link * rename header * update scripts link 10 April 2019, 20:00:09 UTC
05569bc fix py client import error (#453) 10 April 2019, 08:46:15 UTC
04b3051 ClusterRoleBinding doesn't need namespace field (#451) 03 April 2019, 08:53:43 UTC
7ef5594 Update API for NAS in v1alpha2 (#450) * Update API for NAS in v1alpha2 * Fix name * Fix name in input size 03 April 2019, 07:35:42 UTC
b25422a Restructuring test scripts for v1alpha1 and v1alpha2 (#449) * Restructing test scripts for v1alpha1 and v1alpha2 * Fix package location 02 April 2019, 21:25:19 UTC
3d4cd04 Code restructuring to support V1alpha1 and V1alpha2 API (#448) * Code restructuring to support V1alpha1 and V1alpha2 API * Adding comments * Test package changes * Moving requirements file * Fix the package location * Renaming studyjobcontroller to katib-controller 01 April 2019, 21:56:33 UTC
4ab3dbd Fix labels matching the job operator implementation (#447) 29 March 2019, 18:26:14 UTC
de7323c Updating the pytorch example image (#446) 29 March 2019, 04:50:12 UTC
21855a1 Remove redundant lock (#444) 29 March 2019, 02:02:12 UTC
1316bad add v1alpha2 grpc api (#427) * add v1alpha2 grpc api Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * update gRPC API Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * add v1alpha2 DB IF Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * fix typo, add doc and add todo for nasconfig Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * apply comments Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * fix typo Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * update proto Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> * update Signed-off-by: YujiOshima <yuji.oshima0x3fd@gmail.com> 26 March 2019, 17:07:21 UTC
e260258 Remove katibcli (#436) * removed cmd folder for CLI * removed docs folder for CLI * final cleanup of cli removal * removed cli from build script * added go packages into unit-test script 20 March 2019, 21:32:58 UTC
28eb81d Change datadir for avoid failure due to lost+found (#432) 18 March 2019, 00:57:10 UTC
887f356 fix demo link (#434) * fix demo link change to correct link README.md * The link should say README.md as well. The link should say README.md as well. 15 March 2019, 23:42:57 UTC
06f955b Add fault tolerance support for trial failure (#424) * add fault tolerance for trial failure * fix a small typo * fix a typo * improve fault processing strategy * add an important TODO * fix typo * add some more TODOs 14 March 2019, 01:58:22 UTC
c87d583 Test for Bayesian Optimization Algo (#406) * added tests for acquisition function and models * added tests for global_optimizer * added tests for boa * minor linting * tests for algorithm manager * added discrete parameter to study config * covered all parameter types * moved python script to testing folder * added python tests to unit tests * remembered to uncomment existing tests * fixed path to test script * moved python tests to separate job in workflow * added run command to test script 11 March 2019, 19:54:38 UTC
61451ef Katib v1alpha2 API for CRDs (#381) * v1alpha2 API proposal * Fix comments round 1 * Refactor into Experiment and Trial * Incorporate feedback from meeting * Rename * Minor edits 08 March 2019, 02:23:33 UTC
86bd27a Add NAS team as reviewers (#419) * Add NAS team in reviewers * Update reviewers 06 March 2019, 05:27:59 UTC
feee2f9 Multiple Trials for Reinforcement Learning Suggestion (#416) * supoort multiple trials * adjust To Do * language improvement in README.md * fix several problems * fix a potential problem * handle the GetEvaluationResult() return None problem 06 March 2019, 01:48:01 UTC
3a705a1 Fix the package version in training container (#418) * fix the version of tf and keras * fix a typo 06 March 2019, 00:40:03 UTC
8f89ad4 Add validation for NAS job in Katib controller (#398) * Initial commit * Add validation for NAS config * Fix validation * Add algorithmType in NasConfig validation * Add Discrete ParameterType to validation * Move validation to webhook Change GetJobType function Make a list with NAS algorithms * Add ValidateSuggestionParameters function in Katib API * Fix api * Add ValidateSuggestionParameters to Suggestion service * Change isValid to int32 * Create Validation function in NAS RL Suggestion service * Fix small problems * Reduce code inside Validation function * Add empty ValidateSuggestionParameters function in each HP service written in GO * Fix logging * Add ValidateSuggestionParameters to mock * Handle Unvailable error 05 March 2019, 23:47:59 UTC
db6b83b Fix path to api protobuf (#415) 01 March 2019, 02:54:21 UTC
4d8c599 Add support for parallel studyjobs (#404) * Add support for parallel studyjobs * fix a typo * Reorganize the program a little bit * fix a typo * fix a typo 27 February 2019, 03:05:45 UTC
87a31f3 Add separable/depthwise convolution, data augmentation and multiple GPU support (#393) * add separable/depthwise convolution in operation library * add ENAS example StudyJob yaml * remove ENAS example, add data augmentation, add multiple GPU support 27 February 2019, 01:33:48 UTC
4d031e7 Add create time to Trial API (#410) * Add create time to Trial API * Add Trial create time information * Fix UT for db 27 February 2019, 00:32:45 UTC
26da3ea Metric collector must fail on error (#405) * Fail when unable to collect logs * Set backlimit to 0 for jobs 26 February 2019, 04:32:34 UTC
6b75138 add latest tag for katib images (#409) 25 February 2019, 17:37:16 UTC