https://github.com/kubeflow/katib

sort by:
Revision Author Date Message Commit Date
71a59ff Add goptuna dependencies 09 April 2020, 03:54:50 UTC
93f46f2 Implement ValidateAlgorithmSettings for hyperopt (#1130) 09 April 2020, 03:41:44 UTC
ab3dab0 Add ENAS to ROADMAP (#1129) 07 April 2020, 15:47:50 UTC
190aa6e feat: Add 2020 roadmap (#1121) * feat: Add 2020 roadmap Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Address comments in #1121 Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Address comments Signed-off-by: Ce Gao <gaoce@caicloud.io> 07 April 2020, 07:45:43 UTC
241b9a3 Optimize chocolate suggestion (#1116) 07 April 2020, 03:21:43 UTC
cd27da5 Support step in int parameter for hyperopt and chocolate (#1123) 07 April 2020, 02:37:44 UTC
4da82cf UI: Remove antd package (#1117) * Remove antd package Change TabPanel for creating Experiment Move TabPanel to Common components * Remove antd from package.json * Modify Katib Link and Main Page * Change Trial and Jobs monitor * Fix links, margin in Trial view * Increase margin for Trial 03 April 2020, 10:59:46 UTC
688636e Fix e2e tests (#1118) 03 April 2020, 10:17:46 UTC
bccc281 UI: Close menu on click (#1114) * Close menu on click Fix close time for snackbar Add Material UI to the README Add link to developer guide * Change README 01 April 2020, 16:27:28 UTC
2e50219 Change yaml submit yaml style (#1113) 01 April 2020, 13:25:27 UTC
533ca35 Refactor python files in cmd/suggestion (#1105) 24 March 2020, 14:46:47 UTC
7711326 Update prow config with the latest folders (#1109) * Modify prow config * Add examples to prow 24 March 2020, 14:00:48 UTC
38ec768 Fix logger namespace (#1108) 24 March 2020, 10:28:47 UTC
edceaf9 chore(deps): Bump tensorflow in /cmd/suggestion/nasrl/v1alpha3 (#1035) Bumps [tensorflow](https://github.com/tensorflow/tensorflow) from 1.14.0 to 1.15.2. - [Release notes](https://github.com/tensorflow/tensorflow/releases) - [Changelog](https://github.com/tensorflow/tensorflow/blob/master/RELEASE.md) - [Commits](https://github.com/tensorflow/tensorflow/compare/v1.14.0...v1.15.2) Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> 24 March 2020, 09:32:47 UTC
c15fcec Refactor suggestion-internal-modules (#1106) 24 March 2020, 08:32:47 UTC
02b56c7 chore(deps): Bump psutil (#1085) Bumps [psutil](https://github.com/giampaolo/psutil) from 5.2.2 to 5.6.6. - [Release notes](https://github.com/giampaolo/psutil/releases) - [Changelog](https://github.com/giampaolo/psutil/blob/master/HISTORY.rst) - [Commits](https://github.com/giampaolo/psutil/compare/release-5.2.2...release-5.6.6) Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> 24 March 2020, 07:54:47 UTC
367841b Fix custom Katib DB Manager env variables (#1102) * Fix custom Katib DB Manager info * Change names for Katib db manager env 24 March 2020, 07:16:46 UTC
a90d396 Refactor python files of suggestion services (#1107) 24 March 2020, 06:30:46 UTC
25e5d73 Add myself to approvers (#1103) 23 March 2020, 22:04:47 UTC
25cdf1c Enable to add Service Account Name in Katib config (#1092) * Enable to add service account name to katib config * Fix test 23 March 2020, 17:33:26 UTC
11c832d chore(deps): Bump tensorflow-gpu (#1034) Bumps [tensorflow-gpu](https://github.com/tensorflow/tensorflow) from 1.15.0 to 1.15.2. - [Release notes](https://github.com/tensorflow/tensorflow/releases) - [Changelog](https://github.com/tensorflow/tensorflow/blob/master/RELEASE.md) - [Commits](https://github.com/tensorflow/tensorflow/compare/v1.15.0...v1.15.2) Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> 23 March 2020, 16:52:45 UTC
24f5148 chore(deps): Bump tensorflow (#1036) Bumps [tensorflow](https://github.com/tensorflow/tensorflow) from 1.15.0 to 1.15.2. - [Release notes](https://github.com/tensorflow/tensorflow/releases) - [Changelog](https://github.com/tensorflow/tensorflow/blob/master/RELEASE.md) - [Commits](https://github.com/tensorflow/tensorflow/compare/v1.15.0...v1.15.2) Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> 23 March 2020, 16:12:45 UTC
932648f Add ghalton package to chocolate (#1101) 23 March 2020, 15:32:45 UTC
d622f4a Enable to run Experiment without Goal (#1065) * Enable to run experiment without goal * Remove goal from tfjob example * Fix e2e test * fix test * Add goal to tfjob example 23 March 2020, 15:10:46 UTC
3717609 UI: Support Trial Templates in all namespaces and all configMaps (#1083) * UI Support Trial Templates in all namespaces and configMaps * Change templates yaml Add FilterPannel to Trial Template View * Fix problem with adding template to empty configMap * Support Trial Templates select in Submit Experiment * Fix comments * Fix nas module name * Add version for prettier in Travis 23 March 2020, 13:34:44 UTC
bad07f9 Change chocolate-mocmaes algorithm name (#1097) 23 March 2020, 11:14:45 UTC
c212d7c UI: Add Metrics Collector Spec to Submit Experiment (#1096) * Support Metrics Collector for HP Jobs * Support metrics collector in NAS * Change label for HP metrics regex 19 March 2020, 16:47:22 UTC
57796b0 UI: Add Experiment view for NAS Jobs (#1077) * Experiment view for NAS Jobs * Fix sagas * Fix hp actions 06 March 2020, 19:15:24 UTC
b0520d8 Enable Prettier code formatting for the Katib UI (#1078) * Enable Prettier in the Katib UI * Change travis * Change prettier-check in Makefile * Fix check format script * Fix README * Add missing points * Fix URL in README 06 March 2020, 03:52:50 UTC
c2dac12 Update adopters.md (#1074) 03 March 2020, 06:37:37 UTC
1388f11 fix annotations, which crashs custom job (#1072) 02 March 2020, 07:32:42 UTC
db5e2e2 Add more unit tests in Katib (#1071) * Add tests for mysql and validator * Add tests for inject webhook * Remove test main * Add test for UNKNOWN status in Check DB 29 February 2020, 08:00:39 UTC
8267929 dynamic jobProvider and suggestionComposer registration (#1069) * dynamic jobProvider and suggestionComposer registration * enforce gofmt * fix ut * fix ut * fix gofmt * fix * refine imports * contain providers into package(v1alpha3) * move job kubeflow into where providerBase in * add consts 28 February 2020, 13:44:38 UTC
3a19e1c UI: Update supported algorithms (#1070) * Update supported algorithms * Change default value for additionalMetricNames 27 February 2020, 16:05:27 UTC
09da67c Fix TPE Suggestion (#1063) * Init change * Change hyperopt suggestion * Generate new id in Trial loop * Run tpe suggest for each Trial * Move is first run 25 February 2020, 14:52:48 UTC
479b22f Update Katib docs (#1066) * Change Katib docs * Make Katib capitalised in proposals * Remove CUJ from ROADMAP 25 February 2020, 03:24:48 UTC
969935e UI: Show best metrics in the Trial metrics information plot (#1068) * Fix problem with same time for Trial metric * Fix comment 25 February 2020, 02:48:48 UTC
a57db03 Update adopters (#1064) 21 February 2020, 17:30:33 UTC
ed0778e Create Optimizer in BO Suggestion only for the first run (#1057) * Add first run condition in BO Suggestion * Tell to Optimizer only about new Trials * Logging Return new trials in each getSuggestion call * Small fix log * Remove n_points from ask * Fix log * Add newline to log * Change log * Change dict to list of recorded trials * Get search space only for the first run 21 February 2020, 17:24:32 UTC
4fbf77d Add missing GRPC health probe for arm64 to db-manager (#1059) Signed-off-by: Henry Wang <henry.wang@arm.com> 20 February 2020, 03:35:03 UTC
c8a6089 Change tell method for skopt (#1055) 15 February 2020, 07:31:27 UTC
994446b MXNet -> Apache MXNet (#1056) 14 February 2020, 16:48:17 UTC
2da2c86 Adding error propagation for K8s client creation in KatibClient (#1053) 13 February 2020, 23:24:17 UTC
d84f248 continue (#1054) 13 February 2020, 09:12:35 UTC
f98f7c9 Disable istio sidecar injection in Suggestion and Training Jobs (#1050) * Add istio sidecar inject annotation to Suggestion pod and all training jobs * Add annotation to Training Jobs from Controller * Fix comment 11 February 2020, 04:10:01 UTC
e3c615d Show best metric in trial table (#1048) 06 February 2020, 03:37:33 UTC
2f39fd6 Fix trial info time (#1047) 06 February 2020, 02:53:33 UTC
99e7da8 Adding Babylon Health as adopter (#1046) 05 February 2020, 01:35:55 UTC
21b9dc3 Update adopter (#1038) 03 February 2020, 06:05:20 UTC
7b91909 UI: Add Trial Status to HP Job Table (#1032) * Add Status to Trial table in HP Jobs Fix problem with HP Job Trial metric info * Comment about table sorting 28 January 2020, 03:09:39 UTC
e80b323 UI: Experiment view in the Dialog (#1031) * View experiment in Dialog * Fix DialogTrial name * Close dialog when open HP Job info 24 January 2020, 02:58:31 UTC
f7c3e88 record TrialNames group by stages in ExperimentStatus (#1023) * list names of trials group by stages instead of total counts * fix * restore count fields of ExperimentStatus * remove redundant blank line * hotfix 22 January 2020, 13:02:35 UTC
c2fd0b6 chore: Update docs (#1024) * chore: Update docs Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Add a new line Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix a typo Signed-off-by: Ce Gao <gaoce@caicloud.io> 21 January 2020, 04:49:36 UTC
480fad1 fix: Ignore trials without metrics (#1028) Signed-off-by: Ce Gao <gaoce@caicloud.io> 21 January 2020, 03:45:37 UTC
5f8255f UI: Fix Trial Metric in the Plot (#1027) * Fix Trial metric info dialog * Change hoverinfo 21 January 2020, 01:37:38 UTC
4d46158 feat: Support disable GRPC probe (#1020) Signed-off-by: Ce Gao <gaoce@caicloud.io> 20 January 2020, 18:41:36 UTC
db45574 Adding cisco in adopters (#1026) 20 January 2020, 08:33:48 UTC
26ef2f2 add antfin into list of adoptors (#1025) * add_alipay_into_adoptors * replace alipay with antfin 20 January 2020, 08:17:37 UTC
39fd75d Updated links and instructions for Katib API docs. (#1022) 20 January 2020, 05:09:37 UTC
c30e960 feat: Add adopters (#1019) Signed-off-by: Ce Gao <gaoce@caicloud.io> 17 January 2020, 09:22:55 UTC
cfbd851 [FileMetricsCollector]skip line which doesn't contain any metrics keywords (#1018) 17 January 2020, 01:36:57 UTC
92759fd Added version number and TODO descriptions to API proto. (#1017) 16 January 2020, 01:46:22 UTC
b323afc fix: First check failed condition (#1015) Signed-off-by: Ce Gao <gaoce@caicloud.io> 15 January 2020, 07:25:32 UTC
866b363 feat: Do not inject sh -c when it exists (#1010) * feat: Do not inject sh -c when it exists Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Add test cases Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Add test cases Signed-off-by: Ce Gao <gaoce@caicloud.io> 15 January 2020, 01:55:41 UTC
5f5dddb Nerual -> Neural (#1000) 15 January 2020, 01:55:33 UTC
96df045 Enable imagePullPolicy from katib-config (#1013) 15 January 2020, 01:20:35 UTC
489d52b fix: Avoid out-of-range exception (#1012) Signed-off-by: Ce Gao <gaoce@caicloud.io> 14 January 2020, 08:37:07 UTC
3cfd166 E2E Test for NAS RL Suggestion (#1011) * e2e test nasrl * Fix exp name 14 January 2020, 06:39:08 UTC
2a8e8b4 Example with collecting timestamp of the metrics (#970) * Increase Suggestion memLimit * Create getSuggestionConfigData function * Change memLimit for nasrl * Change resources format for katib-config * Create example with recording metrics timestamp * Add comment line * Change example from pytorch to mxnet * Delete find_mxnet file * Change mxnet-mnist-timestamp to mxnet-mnist * Reduce num epochs in grid * Enable autoscaling in CI cluster * Add max nodes * Add num nodes 6 * Increase num nodes * Change num nodes to 6 * Remove autoscaling 14 January 2020, 05:55:08 UTC
74255c4 Add NAS RL training container to kubeflowkatib repository (#1008) * Push nas training container to kubeflowkatib * Change README * Add line to readme * Fix link to mxnet-mnist example 13 January 2020, 14:25:40 UTC
0df05e9 Fix num trials problem (#1009) Change tf deprecated functions Add logger instead of print 10 January 2020, 16:26:35 UTC
a0f75d1 Rename katib DB manager (#1006) 09 January 2020, 11:49:45 UTC
7f7eaff chore(deps): Bump tensorflow (#1005) Bumps [tensorflow](https://github.com/tensorflow/tensorflow) from 1.12.0 to 1.15.0. - [Release notes](https://github.com/tensorflow/tensorflow/releases) - [Changelog](https://github.com/tensorflow/tensorflow/blob/master/RELEASE.md) - [Commits](https://github.com/tensorflow/tensorflow/compare/v1.12.0...v1.15.0) Signed-off-by: dependabot[bot] <support@github.com> 08 January 2020, 14:45:42 UTC
8d1a029 chore(deps): Bump tensorflow-gpu (#978) Bumps [tensorflow-gpu](https://github.com/tensorflow/tensorflow) from 1.12.0 to 1.15.0. - [Release notes](https://github.com/tensorflow/tensorflow/releases) - [Changelog](https://github.com/tensorflow/tensorflow/blob/master/RELEASE.md) - [Commits](https://github.com/tensorflow/tensorflow/compare/v1.12.0...v1.15.0) Signed-off-by: dependabot[bot] <support@github.com> 08 January 2020, 12:19:39 UTC
9c54601 CPU example for NAS RL cifar10 training container (#999) * Add CPU Dockerfile example for NASRL Send trial info from CRD to GRPC * Parse Trial Objective * Change variable in Metric.convert loop in grid Check if trial is SUCCEEDED in grid * Increase Search Space in Grid example 08 January 2020, 06:35:40 UTC
5a1c257 Updated links to docs/github on Katib dashboard. (#1003) 07 January 2020, 03:36:16 UTC
03cec2e Fixed a few typos. (#1001) 07 January 2020, 02:38:18 UTC
332e6b0 fix: Inherit labels and annotations from experiment (#998) Signed-off-by: Ce Gao <gaoce@caicloud.io> 07 January 2020, 02:26:17 UTC
1e77ccd Moved some content and added links to Kubeflow docs (#990) * WIP Added links to Kubeflow docs in Katib README. * Moved content from user-guide in repo to Kubeflow website. * Moved some content from workflow doc in repo to Kubeflow website. * Added links to Kubeflow docs in the quickstart. * Moved content from algorithms doc in repo to Kubeflow website. 06 January 2020, 08:59:35 UTC
c8cb2cb feat: Support resource in sidecar (#991) * feat: Support resource in sidecar Signed-off-by: Ce Gao <gaoce@caicloud.io> * feat: Support webhook service name Signed-off-by: Ce Gao <gaoce@caicloud.io> * feat: Fix Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Have a large mem Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix import Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Add comma Signed-off-by: Ce Gao <gaoce@caicloud.io> 06 January 2020, 04:17:41 UTC
f3e8405 fix: Ignore the failure (#996) Signed-off-by: Ce Gao <gaoce@caicloud.io> 06 January 2020, 02:51:41 UTC
59a4880 UI: Select namespace from Kubeflow dashboard (#982) * Init commit * Add namespace check in Experiments monitor Submit yaml Submit experiment by parameters * Add redirect to home page when changing namespace * Remove redirect * Fix namespace filter * Change key to index and let to const 03 January 2020, 04:05:40 UTC
685c0a3 feat: Add a flag to control the logic about sc (#994) * feat: Add a flag to control the logic about sc Signed-off-by: Ce Gao <gaoce@caicloud.io> * feat: Add a nil pointer check Signed-off-by: Ce Gao <gaoce@caicloud.io> 02 January 2020, 12:45:40 UTC
9728cc4 Initialize securityContext in injected metrics container (#964) 01 January 2020, 02:53:39 UTC
173edd8 add disk setting into suggestionConfiguration (#989) * add configuration of disk(EphemeralStorage) for suggestion controller * fix comments * bug fix 30 December 2019, 08:39:39 UTC
3e9c9db Get dbUser from Env or default('root') (#985) * Get dbUser from Env or default('root') * Get dbUser from Env or default('root') 27 December 2019, 02:51:38 UTC
133845a feat(experiment_status): Add trial name (#986) Signed-off-by: Ce Gao <gaoce@caicloud.io> 23 December 2019, 07:37:33 UTC
c426a76 feat(config): Add a new config for webhook (#980) Signed-off-by: Ce Gao <gaoce@caicloud.io> 17 December 2019, 05:43:59 UTC
27fbd98 add metrics for trial (#974) * add metrics for trial Signed-off-by: yeya24 <yb532204897@gmail.com> * fix test Signed-off-by: yeya24 <yb532204897@gmail.com> * support registerer parameter when registering metrics Signed-off-by: yeya24 <yb532204897@gmail.com> 16 December 2019, 02:11:37 UTC
61e5188 Use port higher than 1024 to be able to run as a non-root user (#960) 13 December 2019, 00:44:54 UTC
82e0cd6 Remove redundant serviceAccountName assignment (#969) 12 December 2019, 07:58:31 UTC
bf80d1a Increase Suggestion memory limit (#958) * Increase Suggestion memLimit * Create getSuggestionConfigData function * Change memLimit for nasrl * Change resources format for katib-config 12 December 2019, 04:34:31 UTC
51f6aa2 User root user explicitely for DB readinessProbe (#962) 12 December 2019, 02:14:32 UTC
aefa758 Fix typo in getKabitJob function name (#965) 12 December 2019, 01:28:32 UTC
4057a58 Use port 8080 for Katib UI (#967) 12 December 2019, 00:30:32 UTC
67a4d84 Validate experiment (#957) * Validate experiment * Stop editing each field 11 December 2019, 04:56:04 UTC
c16821c UI: Support namespace selection in experiment monitor (#950) * Init changes * Support namespace selection in experiment monitor page * Remove console log * Run mock for katib client 10 December 2019, 04:27:29 UTC
983583e Delete v1alpha2 files (#953) 10 December 2019, 01:59:28 UTC
4a97e21 Resume experiment with extra trials from last checkpoint (#952) * Resuming experiment with extra trials * Resuming experiment with extra trials * Adding test script * relative path * Verify if experiment is running again * Adding case when maxtrials is not set 09 December 2019, 05:13:09 UTC
cb7d3e7 Add a gauge metric for current experiments (#954) * add a gauge metric for current experiments Signed-off-by: yeya24 <yb532204897@gmail.com> * fmt & fix test Signed-off-by: yeya24 <yb532204897@gmail.com> 09 December 2019, 01:25:10 UTC
back to top