https://github.com/kubeflow/katib

Revision Message Commit Date
f33c32e Fix URL in README 05 March 2020, 18:42:22 UTC
8194bfd Add missing points 04 March 2020, 23:04:38 UTC
7a1649a Fix README 04 March 2020, 23:02:47 UTC
2afb118 Fix check format script 04 March 2020, 22:28:37 UTC
b326920 Change prettier-check in Makefile 04 March 2020, 22:20:07 UTC
3af4bfa Change travis 04 March 2020, 21:52:18 UTC
d47f854 Enable Prettier in the Katib UI 04 March 2020, 20:48:22 UTC
1388f11 fix annotations, which crashs custom job (#1072) 02 March 2020, 07:32:42 UTC
db5e2e2 Add more unit tests in Katib (#1071) * Add tests for mysql and validator * Add tests for inject webhook * Remove test main * Add test for UNKNOWN status in Check DB 29 February 2020, 08:00:39 UTC
8267929 dynamic jobProvider and suggestionComposer registration (#1069) * dynamic jobProvider and suggestionComposer registration * enforce gofmt * fix ut * fix ut * fix gofmt * fix * refine imports * contain providers into package(v1alpha3) * move job kubeflow into where providerBase in * add consts 28 February 2020, 13:44:38 UTC
3a19e1c UI: Update supported algorithms (#1070) * Update supported algorithms * Change default value for additionalMetricNames 27 February 2020, 16:05:27 UTC
09da67c Fix TPE Suggestion (#1063) * Init change * Change hyperopt suggestion * Generate new id in Trial loop * Run tpe suggest for each Trial * Move is first run 25 February 2020, 14:52:48 UTC
479b22f Update Katib docs (#1066) * Change Katib docs * Make Katib capitalised in proposals * Remove CUJ from ROADMAP 25 February 2020, 03:24:48 UTC
969935e UI: Show best metrics in the Trial metrics information plot (#1068) * Fix problem with same time for Trial metric * Fix comment 25 February 2020, 02:48:48 UTC
a57db03 Update adopters (#1064) 21 February 2020, 17:30:33 UTC
ed0778e Create Optimizer in BO Suggestion only for the first run (#1057) * Add first run condition in BO Suggestion * Tell to Optimizer only about new Trials * Logging Return new trials in each getSuggestion call * Small fix log * Remove n_points from ask * Fix log * Add newline to log * Change log * Change dict to list of recorded trials * Get search space only for the first run 21 February 2020, 17:24:32 UTC
4fbf77d Add missing GRPC health probe for arm64 to db-manager (#1059) Signed-off-by: Henry Wang <henry.wang@arm.com> 20 February 2020, 03:35:03 UTC
c8a6089 Change tell method for skopt (#1055) 15 February 2020, 07:31:27 UTC
994446b MXNet -> Apache MXNet (#1056) 14 February 2020, 16:48:17 UTC
2da2c86 Adding error propagation for K8s client creation in KatibClient (#1053) 13 February 2020, 23:24:17 UTC
d84f248 continue (#1054) 13 February 2020, 09:12:35 UTC
f98f7c9 Disable istio sidecar injection in Suggestion and Training Jobs (#1050) * Add istio sidecar inject annotation to Suggestion pod and all training jobs * Add annotation to Training Jobs from Controller * Fix comment 11 February 2020, 04:10:01 UTC
e3c615d Show best metric in trial table (#1048) 06 February 2020, 03:37:33 UTC
2f39fd6 Fix trial info time (#1047) 06 February 2020, 02:53:33 UTC
99e7da8 Adding Babylon Health as adopter (#1046) 05 February 2020, 01:35:55 UTC
21b9dc3 Update adopter (#1038) 03 February 2020, 06:05:20 UTC
7b91909 UI: Add Trial Status to HP Job Table (#1032) * Add Status to Trial table in HP Jobs Fix problem with HP Job Trial metric info * Comment about table sorting 28 January 2020, 03:09:39 UTC
e80b323 UI: Experiment view in the Dialog (#1031) * View experiment in Dialog * Fix DialogTrial name * Close dialog when open HP Job info 24 January 2020, 02:58:31 UTC
f7c3e88 record TrialNames group by stages in ExperimentStatus (#1023) * list names of trials group by stages instead of total counts * fix * restore count fields of ExperimentStatus * remove redundant blank line * hotfix 22 January 2020, 13:02:35 UTC
c2fd0b6 chore: Update docs (#1024) * chore: Update docs Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Add a new line Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix a typo Signed-off-by: Ce Gao <gaoce@caicloud.io> 21 January 2020, 04:49:36 UTC
480fad1 fix: Ignore trials without metrics (#1028) Signed-off-by: Ce Gao <gaoce@caicloud.io> 21 January 2020, 03:45:37 UTC
5f8255f UI: Fix Trial Metric in the Plot (#1027) * Fix Trial metric info dialog * Change hoverinfo 21 January 2020, 01:37:38 UTC
4d46158 feat: Support disable GRPC probe (#1020) Signed-off-by: Ce Gao <gaoce@caicloud.io> 20 January 2020, 18:41:36 UTC
db45574 Adding cisco in adopters (#1026) 20 January 2020, 08:33:48 UTC
26ef2f2 add antfin into list of adoptors (#1025) * add_alipay_into_adoptors * replace alipay with antfin 20 January 2020, 08:17:37 UTC
39fd75d Updated links and instructions for Katib API docs. (#1022) 20 January 2020, 05:09:37 UTC
c30e960 feat: Add adopters (#1019) Signed-off-by: Ce Gao <gaoce@caicloud.io> 17 January 2020, 09:22:55 UTC
cfbd851 [FileMetricsCollector]skip line which doesn't contain any metrics keywords (#1018) 17 January 2020, 01:36:57 UTC
92759fd Added version number and TODO descriptions to API proto. (#1017) 16 January 2020, 01:46:22 UTC
b323afc fix: First check failed condition (#1015) Signed-off-by: Ce Gao <gaoce@caicloud.io> 15 January 2020, 07:25:32 UTC
866b363 feat: Do not inject sh -c when it exists (#1010) * feat: Do not inject sh -c when it exists Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Add test cases Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Add test cases Signed-off-by: Ce Gao <gaoce@caicloud.io> 15 January 2020, 01:55:41 UTC
5f5dddb Nerual -> Neural (#1000) 15 January 2020, 01:55:33 UTC
96df045 Enable imagePullPolicy from katib-config (#1013) 15 January 2020, 01:20:35 UTC
489d52b fix: Avoid out-of-range exception (#1012) Signed-off-by: Ce Gao <gaoce@caicloud.io> 14 January 2020, 08:37:07 UTC
3cfd166 E2E Test for NAS RL Suggestion (#1011) * e2e test nasrl * Fix exp name 14 January 2020, 06:39:08 UTC
2a8e8b4 Example with collecting timestamp of the metrics (#970) * Increase Suggestion memLimit * Create getSuggestionConfigData function * Change memLimit for nasrl * Change resources format for katib-config * Create example with recording metrics timestamp * Add comment line * Change example from pytorch to mxnet * Delete find_mxnet file * Change mxnet-mnist-timestamp to mxnet-mnist * Reduce num epochs in grid * Enable autoscaling in CI cluster * Add max nodes * Add num nodes 6 * Increase num nodes * Change num nodes to 6 * Remove autoscaling 14 January 2020, 05:55:08 UTC
74255c4 Add NAS RL training container to kubeflowkatib repository (#1008) * Push nas training container to kubeflowkatib * Change README * Add line to readme * Fix link to mxnet-mnist example 13 January 2020, 14:25:40 UTC
0df05e9 Fix num trials problem (#1009) Change tf deprecated functions Add logger instead of print 10 January 2020, 16:26:35 UTC
a0f75d1 Rename katib DB manager (#1006) 09 January 2020, 11:49:45 UTC
7f7eaff chore(deps): Bump tensorflow (#1005) Bumps [tensorflow](https://github.com/tensorflow/tensorflow) from 1.12.0 to 1.15.0. - [Release notes](https://github.com/tensorflow/tensorflow/releases) - [Changelog](https://github.com/tensorflow/tensorflow/blob/master/RELEASE.md) - [Commits](https://github.com/tensorflow/tensorflow/compare/v1.12.0...v1.15.0) Signed-off-by: dependabot[bot] <support@github.com> 08 January 2020, 14:45:42 UTC
8d1a029 chore(deps): Bump tensorflow-gpu (#978) Bumps [tensorflow-gpu](https://github.com/tensorflow/tensorflow) from 1.12.0 to 1.15.0. - [Release notes](https://github.com/tensorflow/tensorflow/releases) - [Changelog](https://github.com/tensorflow/tensorflow/blob/master/RELEASE.md) - [Commits](https://github.com/tensorflow/tensorflow/compare/v1.12.0...v1.15.0) Signed-off-by: dependabot[bot] <support@github.com> 08 January 2020, 12:19:39 UTC
9c54601 CPU example for NAS RL cifar10 training container (#999) * Add CPU Dockerfile example for NASRL Send trial info from CRD to GRPC * Parse Trial Objective * Change variable in Metric.convert loop in grid Check if trial is SUCCEEDED in grid * Increase Search Space in Grid example 08 January 2020, 06:35:40 UTC
5a1c257 Updated links to docs/github on Katib dashboard. (#1003) 07 January 2020, 03:36:16 UTC
03cec2e Fixed a few typos. (#1001) 07 January 2020, 02:38:18 UTC
332e6b0 fix: Inherit labels and annotations from experiment (#998) Signed-off-by: Ce Gao <gaoce@caicloud.io> 07 January 2020, 02:26:17 UTC
1e77ccd Moved some content and added links to Kubeflow docs (#990) * WIP Added links to Kubeflow docs in Katib README. * Moved content from user-guide in repo to Kubeflow website. * Moved some content from workflow doc in repo to Kubeflow website. * Added links to Kubeflow docs in the quickstart. * Moved content from algorithms doc in repo to Kubeflow website. 06 January 2020, 08:59:35 UTC
c8cb2cb feat: Support resource in sidecar (#991) * feat: Support resource in sidecar Signed-off-by: Ce Gao <gaoce@caicloud.io> * feat: Support webhook service name Signed-off-by: Ce Gao <gaoce@caicloud.io> * feat: Fix Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Have a large mem Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix import Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Add comma Signed-off-by: Ce Gao <gaoce@caicloud.io> 06 January 2020, 04:17:41 UTC
f3e8405 fix: Ignore the failure (#996) Signed-off-by: Ce Gao <gaoce@caicloud.io> 06 January 2020, 02:51:41 UTC
59a4880 UI: Select namespace from Kubeflow dashboard (#982) * Init commit * Add namespace check in Experiments monitor Submit yaml Submit experiment by parameters * Add redirect to home page when changing namespace * Remove redirect * Fix namespace filter * Change key to index and let to const 03 January 2020, 04:05:40 UTC
685c0a3 feat: Add a flag to control the logic about sc (#994) * feat: Add a flag to control the logic about sc Signed-off-by: Ce Gao <gaoce@caicloud.io> * feat: Add a nil pointer check Signed-off-by: Ce Gao <gaoce@caicloud.io> 02 January 2020, 12:45:40 UTC
9728cc4 Initialize securityContext in injected metrics container (#964) 01 January 2020, 02:53:39 UTC
173edd8 add disk setting into suggestionConfiguration (#989) * add configuration of disk(EphemeralStorage) for suggestion controller * fix comments * bug fix 30 December 2019, 08:39:39 UTC
3e9c9db Get dbUser from Env or default('root') (#985) * Get dbUser from Env or default('root') * Get dbUser from Env or default('root') 27 December 2019, 02:51:38 UTC
133845a feat(experiment_status): Add trial name (#986) Signed-off-by: Ce Gao <gaoce@caicloud.io> 23 December 2019, 07:37:33 UTC
c426a76 feat(config): Add a new config for webhook (#980) Signed-off-by: Ce Gao <gaoce@caicloud.io> 17 December 2019, 05:43:59 UTC
27fbd98 add metrics for trial (#974) * add metrics for trial Signed-off-by: yeya24 <yb532204897@gmail.com> * fix test Signed-off-by: yeya24 <yb532204897@gmail.com> * support registerer parameter when registering metrics Signed-off-by: yeya24 <yb532204897@gmail.com> 16 December 2019, 02:11:37 UTC
61e5188 Use port higher than 1024 to be able to run as a non-root user (#960) 13 December 2019, 00:44:54 UTC
82e0cd6 Remove redundant serviceAccountName assignment (#969) 12 December 2019, 07:58:31 UTC
bf80d1a Increase Suggestion memory limit (#958) * Increase Suggestion memLimit * Create getSuggestionConfigData function * Change memLimit for nasrl * Change resources format for katib-config 12 December 2019, 04:34:31 UTC
51f6aa2 User root user explicitely for DB readinessProbe (#962) 12 December 2019, 02:14:32 UTC
aefa758 Fix typo in getKabitJob function name (#965) 12 December 2019, 01:28:32 UTC
4057a58 Use port 8080 for Katib UI (#967) 12 December 2019, 00:30:32 UTC
67a4d84 Validate experiment (#957) * Validate experiment * Stop editing each field 11 December 2019, 04:56:04 UTC
c16821c UI: Support namespace selection in experiment monitor (#950) * Init changes * Support namespace selection in experiment monitor page * Remove console log * Run mock for katib client 10 December 2019, 04:27:29 UTC
983583e Delete v1alpha2 files (#953) 10 December 2019, 01:59:28 UTC
4a97e21 Resume experiment with extra trials from last checkpoint (#952) * Resuming experiment with extra trials * Resuming experiment with extra trials * Adding test script * relative path * Verify if experiment is running again * Adding case when maxtrials is not set 09 December 2019, 05:13:09 UTC
cb7d3e7 Add a gauge metric for current experiments (#954) * add a gauge metric for current experiments Signed-off-by: yeya24 <yb532204897@gmail.com> * fmt & fix test Signed-off-by: yeya24 <yb532204897@gmail.com> 09 December 2019, 01:25:10 UTC
5d46799 feat: Support running (#894) * feat: Support running Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Do not mark trial running when the job is created Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Add nil pointer check Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Avoid nil pointer Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Update Signed-off-by: Ce Gao <gaoce@caicloud.io> 06 December 2019, 06:30:49 UTC
83eba02 Use kubeflowkatib repo as image repo of example (#949) 05 December 2019, 05:24:32 UTC
330c3bd Update API spec for early stopping (#951) 05 December 2019, 04:42:32 UTC
8a443c4 rename counter metrics (#942) Signed-off-by: yeya24 <yb532204897@gmail.com> 04 December 2019, 16:12:57 UTC
d31c7b6 update deployment api version (#937) Signed-off-by: yeya24 <yb532204897@gmail.com> 04 December 2019, 15:28:57 UTC
6e19a06 Fix fetch trial template (#938) 04 December 2019, 11:38:58 UTC
0cb72d1 Implement metrics custom filters (#947) 04 December 2019, 09:08:58 UTC
68de7a6 Remove katib webhook when undeploy (#935) 29 November 2019, 05:03:03 UTC
b52cd1a Change web failPolicy to fail instead of default ingore (#933) 25 November 2019, 16:29:25 UTC
29b3bc0 feat: Add limit for suggestion pod (#932) * feat: Add limit for suggestion pod Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Format Signed-off-by: Ce Gao <gaoce@caicloud.io> 25 November 2019, 15:41:09 UTC
40f55b4 Support multiple metric logs in one line (#925) * Support multiple metric logs in one line * Modify user image to cover change test 22 November 2019, 06:15:28 UTC
08234c3 Tfevent metriccollector fails when multiple files exist (#920) 15 November 2019, 01:31:37 UTC
6410b03 Handle metricscollector case worker container have no command (#914) * Handle metricscollector case worker container have no command * Change method name 08 November 2019, 01:38:19 UTC
eb6ff94 tfevent-metricscollector support ppc64le (#912) 06 November 2019, 04:38:59 UTC
cd8399d Fix grid suggestion ValidateAlgorithmSettings return (#913) 05 November 2019, 09:12:38 UTC
02ccbff Fix wrong suggestion service endpoint (#911) 05 November 2019, 08:22:38 UTC
c18bab6 Enable arm64 architecture support for katib images and fix grpc health probe multiarch error. (#897) Change-Id: I5ddee7e8fbe96b8e0a025e3f182b4a5192c45597 Signed-off-by: Henry Wang <henry.wang@arm.com> 05 November 2019, 01:52:39 UTC
d9bb39e feat: Support custom database (#910) Signed-off-by: Ce Gao <gaoce@caicloud.io> 04 November 2019, 12:27:40 UTC
a55cf2a Enhance validation for metrics collector (#909) 04 November 2019, 05:05:40 UTC
2df906e Support custom metrics collector kind (#908) * Support custom metrics collector kind * Fix python image version for v1alpha2 04 November 2019, 03:41:40 UTC
c95c144 support ppc64le (#893) * support ppc64le * support grpc_health_probe for available suggestions on ppc64le 04 November 2019, 02:35:40 UTC
975da72 fix: Add Suggestion into CI (#907) * fix: Add Suggestion into CI Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Use 3.6 Signed-off-by: Ce Gao <gaoce@caicloud.io> * fix: Fix Hyperband Signed-off-by: Ce Gao <gaoce@caicloud.io> 02 November 2019, 04:01:40 UTC
574c657 Validate algorithm (#904) 01 November 2019, 01:37:23 UTC